Ongoing projects

Harmonization of Different EHR Schemas for Alzheimer’s Disease Cohorts

Aug 2025 - Present

- Designing implicit harmonization methods that preserve dataset-specific clinical signals for downstream tasks.

Classifying Alzheimer’s Disease Using Patient Speech and Interview Transcripts

Jun 2025 - Present

- Studying early detection of Alzheimer’s disease using real-world Korean patient speech and interview transcripts.
- Continuing this work as part of an academic-industry collaboration (UPenn, Yonsei University College of Medicine, Wonju).

Evaluating Information Differences across CXR Modalities for Mortality Prediction

Oct 2024 - Present

- Analyzing how the choice of CXR modality (raw images vs. radiology reports) affects post-ICU mortality prediction when combined with discharge notes.
- Currently under review as “Analyzing Information Disparities across Modalities in Mortality Prediction.”

English-Guided Reasoning for Multilingual Medical LLMs

Jun 2024 - Present

- Developing a compact pivot-language framework that encodes English reasoning into latent representations for multilingual medical QA.
- Currently under review as “ComPLUG: Compact English-Guided Reasoning for Multilingual Medical LLMs.”

Finished projects

Aspect-Oriented Summarization for Psychiatric Readmission Prediction

Oct 2024 - Aug 2025

- Developed LLM-based aspect-oriented summarization pipelines for 30-day psychiatric readmission prediction using multi-hospital EHR data.
- Published at EMNLP 2025: “Aspect-Oriented Summarization for Psychiatric Short-Term Readmission Prediction.”

Enhancing Generative Entity Linking in the Biomedical Domain

Oct 2023 - May 2025

- Improved biomedical generative entity linking via preference optimization with negative samples to handle ambiguous mentions more robustly.
- Published in Findings of ACL 2025: “Learning from Negative Samples in Biomedical Generative Entity Linking.”
- Code: GitHub; checkpoints: Hugging Face.

Human-in-the-loop LLMs for Biomedical Hypothesis Generation

Jul 2023 - Sep 2024

- Explored retrieval-augmented generation workflows in which domain experts iteratively refine biomedical hypotheses with LLM assistance.

AI Platform for Precision Medicine in Diabetes Using Big Data

Jun 2023 - Sep 2024

- Fine-tuned LLMs to provide guideline-grounded explanations for diabetes care as part of a broader precision-medicine platform.

Open-source LLM for Explaining Korean Cultural Heritage

May 2023 - Sep 2024

- Developed an open-source Korean LLM that explains cultural heritage sites in faithful, accessible language for non-experts.