cv
Welcome to my CV !!!!
Basics
| Name | Doan Duc Khiem |
| Label | Bioinformatics Researcher & Data Scientist |
| duckhiem.airesearcher@gmail.com | |
| Phone | +33 75 10 93 603 |
| Url | https://khiemducdoan.github.io |
| Summary | Master's student in Data Analysis and Pattern Recognition at Institut Polytechnique de Paris with a strong background in Biomedical Engineering. Experienced in applying deep learning (GNNs, Transformers) to biomedical problems such as Drug Response Prediction and Traumatic Brain Injury prognosis. Passionate about theoretical statistical physics, biophysics, and personalized medicine. |
Work
-
2025.03 - 2025.09 Research Student
BKAI Laboratory (Bioinformatics Research Team)
Working with the Bioinformatics team on the Drug Response Prediction project. Developed novel multi-level information integrating fragment-level and atom-level GNN paradigms.
- Proposed a novel paradigm bridging atom- and fragment-level representations using compact chemical vocabulary.
- Achieved state-of-the-art performance on standard CCLE dataset for within-dataset drug response prediction.
-
2023.01 - Present Research Student
International Research Institute MICA
Conducting research on Traumatic Brain Injury (TBI) Classification using multimodal deep learning and NLP techniques.
- First author of 1 journal paper and 1 conference paper.
- Developed viTBI-BERT, a Vietnamese Language Model for TBI prediction.
Education
-
2025.09 - Present Paris, France
-
2021.09 - 2025.06 Hanoi, Vietnam
Bachelor
Hanoi University of Science and Technology
Biomedical Engineering
- Introduction to Biology (A+)
- Medical Image Processing (A)
- Bioinformatics (A+)
-
2018.09 - 2021.06 Hanoi, Vietnam
Awards
- 2025.06.01
Best Presentation Award
Electronics and Electrical Engineering Council, HUST
Recognized for delivering the top-ranked thesis presentation among all candidates in the Electronics and Electrical Engineering department.
- 2025.01.01
Second Prize - Health Systems Innovation Hackathon
Hanoi Medical University, Vietnam Hub
Building high-value health systems through Artificial Intelligence.
- 2025.01.01
Third Prize - HUST Annual Student Research Conference
Hanoi University of Science and Technology
Awarded for research excellence in the track of AI Applications, Blockchain, and Big Data.
- 2021.01.01
Scholarships for Outstanding Academic Performance
Nguyen Hue High School for the Gifted
Received scholarships in 2019, 2020, and 2021.
Certificates
| ISODS Summer Practicum Program: Data Science & AI in Computer Vision | ||
| ISODS | 2023-12-31 |
| Machine Learning Specialization | ||
| DeepLearning.AI (Coursera) | 2023-01-01 |
| Deep Learning Specialization | ||
| DeepLearning.AI (Coursera) | 2023-01-01 |
Publications
-
2025.01.01 TBI-TTM: Traumatic Brain Injury Prognosis with Textual Data under Missing Tabular Conditions
The 24th International Symposium on Communications and Information Technologies (ISCIT 2025)
First author. Proposed a method to enhance robustness with missing data without imputation, increasing sensitivity by approximately 10%.
-
2024.01.01 viTBI-BERT: A Vietnamese Language Model for Prediction of Traumatic Brain Injury
Journal on Information Technologies & Communications, Vol. 2025, No. 1, pp. 54-67
First author. Developed a domain-adapted BERT model for TBI prediction using Vietnamese physician notes.
Skills
| Programming & Tools | |
| Python | |
| PyTorch | |
| R Programming | |
| Linux Terminal | |
| Git |
| Machine Learning & AI | |
| Deep Learning | |
| Graph Neural Networks (GNN) | |
| Transformers (BERT) | |
| Multimodal Learning | |
| Natural Language Processing (NLP) | |
| Computer Vision |
| Bioinformatics | |
| Drug Response Prediction | |
| Genomic Variant Analysis (GATK, SAMtools) | |
| Protein Structure Prediction | |
| Medical Image Processing |
Languages
| Vietnamese | |
| Native speaker |
| English | |
| Fluent |
| French | |
| Professional Working Proficiency |
Interests
| Research Interests | |
| Statistical Physics | |
| Biophysics | |
| Personalized Medicine | |
| Generative AI in Biology | |
| Protein-Protein Interactions | |
| RNA Analysis |
Projects
- 2024.09 - 2025.05
Multimodal Traumatic Brain Injury Prognosis Assessment
Developed an end-to-end multimodal deep learning pipeline to predict TBI severity by fusing structured clinical data with unstructured physician notes from 503 patients. The model outperformed uni-modal baselines by 1–2% across four key metrics.
- Fine-tuned a LLM domain-adapted BERT model achieving 71% sensitivity.
- Used adjusted Transformer encoder to handle missing data without imputation.
- 2025.07 - 2025.10
Drug Response Prediction System
Proposed a novel multi-level information integrating fragment-level and atom-level GNN paradigm. Bridges atom- and fragment-level representations using a compact chemical vocabulary and principal subgraph mining.
- Achieved state-of-the-art performance on standard CCLE dataset.
- Focused on within-dataset drug response prediction.
- 2024.01 - 2024.04
Genomic Variant Analysis and Annotation
Analyzed germline and somatic variants using GATK. Practiced workflows for variant preprocessing, discovery, refinement, and evaluation.
- Utilized tools: GATK, SAMtools, Picard, IGV.
- Performed analysis using R and Python in Linux environment.