cv

Welcome to my CV!

Basics

Name Doan Duc Khiem
Label Bioinformatics Researcher & Data Scientist
Email duckhiem.airesearcher@gmail.com
Phone +33 75 10 93 603
Url https://khiemducdoan.github.io
Summary Master's student in Data Analysis and Pattern Recognition at Institut Polytechnique de Paris with a strong background in Biomedical Engineering. Experienced in applying deep learning (GNNs, Transformers) to biomedical problems such as Drug Response Prediction and Traumatic Brain Injury prognosis. Passionate about theoretical statistical physics, biophysics, and personalized medicine.

Work

  • 2025.03 - 2025.09
    Research Student
    BKAI Laboratory (Bioinformatics Research Team)
    Worked with the Bioinformatics team on the Drug Response Prediction project. Developed a novel multi-level GNN approach that integrates fragment-level and atom-level information.
    • Proposed a novel paradigm bridging atom- and fragment-level representations using a compact chemical vocabulary.
    • Achieved state-of-the-art performance on the standard CCLE dataset for within-dataset drug response prediction.
  • 2023.01 - Present
    Research Student
    International Research Institute MICA
    Conducting research on Traumatic Brain Injury (TBI) Classification using multimodal deep learning and NLP techniques.
    • First author of one journal paper and one conference paper.
    • Developed viTBI-BERT, a Vietnamese Language Model for TBI prediction.

Education

  • 2025.09 - Present

    Paris, France

    Master
    Institut Polytechnique de Paris
    Data Analysis and Pattern Recognition
  • 2021.09 - 2025.06

    Hanoi, Vietnam

    Bachelor
    Hanoi University of Science and Technology
    Biomedical Engineering
    • Introduction to Biology (A+)
    • Medical Image Processing (A)
    • Bioinformatics (A+)
  • 2018.09 - 2021.06

    Hanoi, Vietnam

    High School
    Nguyen Hue High School for the Gifted
    Physics

Awards

Certificates

Machine Learning Specialization
DeepLearning.AI (Coursera) 2023-01-01
Deep Learning Specialization
DeepLearning.AI (Coursera) 2023-01-01

Publications

Skills

Programming & Tools
  • Python
  • PyTorch
  • R Programming
  • Linux Terminal
  • Git
Machine Learning & AI
  • Deep Learning
  • Graph Neural Networks (GNN)
  • Transformers (BERT)
  • Multimodal Learning
  • Natural Language Processing (NLP)
  • Computer Vision
Bioinformatics
  • Drug Response Prediction
  • Genomic Variant Analysis (GATK, SAMtools)
  • Protein Structure Prediction
  • Medical Image Processing

Languages

Vietnamese
Native speaker
English
Fluent
French
Professional Working Proficiency

Interests

Research Interests
  • Statistical Physics
  • Biophysics
  • Personalized Medicine
  • Generative AI in Biology
  • Protein-Protein Interactions
  • RNA Analysis

Projects

  • 2024.09 - 2025.05
    Multimodal Traumatic Brain Injury Prognosis Assessment
    Developed an end-to-end multimodal deep learning pipeline to predict TBI severity by fusing structured clinical data with unstructured physician notes from 503 patients. The model outperformed unimodal baselines by 1–2% across four key metrics.
    • Fine-tuned a domain-adapted BERT language model, achieving 71% sensitivity.
    • Used a modified Transformer encoder to handle missing data without imputation.
  • 2025.07 - 2025.10
    Drug Response Prediction System
    Proposed a novel multi-level GNN paradigm that integrates fragment-level and atom-level information. It bridges atom- and fragment-level representations using a compact chemical vocabulary and principal subgraph mining.
    • Achieved state-of-the-art performance on the standard CCLE dataset.
    • Focused on within-dataset drug response prediction.
  • 2024.01 - 2024.04
    Genomic Variant Analysis and Annotation
    Analyzed germline and somatic variants using GATK. Practiced workflows for variant preprocessing, discovery, refinement, and evaluation.
    • Used GATK, SAMtools, Picard, and IGV.
    • Performed analysis using R and Python in a Linux environment.