Lemuria Chen's Projects
An argumentation mining annotation tool mentioned in the paper "A Structure-Aware Argument Encoder for Literature Discourse Analysis"
大数据金融课程final
Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.
Code and created datasets for our ACL 2022 paper: "Contextual Fine-to-Coarse Distillation for Coarse-grained Response Selection in Open-Domain Conversations"
Colossal-AI: A Unified Deep Learning System for Big Model Era
Code and released pre-trained model for our ACL 2022 paper: "DialogVED: A Pre-trained Latent Variable Encoder-Decoder Model for Dialog Response Generation"
An Open-Source Tool for Automatic Disease Diagnosis..
Code for our Bioinformatics 2022 paper: "DxFormer: A Decoupled Automatic Diagnostic System Based on Decoder-Encoder Transformer with Dense Symptom Representations"
Official implementations for various pre-training models of ERNIE-family, covering topics of Language Understanding & Generation, Multimodal Understanding & Generation, and beyond.
Code and dataset for our Bioinformatics 2022 paper: "A Benchmark for Automatic Medical Consultation System: Frameworks, Tasks and Datasets"
This is the repo of the medical dialogue dataset 'imcs21' in CBLUE@Tianchi
Config files for my GitHub profile.
A medical dialogue annotation tool that supports multi-level annotation for doctor-patient conversations, including named entities, dialog intents, and medical reports.
Ongoing research training transformer language models at scale, including: BERT & GPT-2
OneFlow models for benchmarking.
Code and data for our COLING 2022 paper: "A Structure-Aware Argument Encoder for Literature Discourse Analysis"
Crosslingual Generalization through Multitask Finetuning