MDD-thinker: A reasoning-enhanced large language model for diagnosis of major depressive disorder

  • Yuyang Sha
  • , Hongxin Pan
  • , Gang Luo
  • , Caijuan Shi
  • , Wei Chen
  • , Jing Wang
  • , Kefeng Li

Research output: Contribution to journalArticlepeer-review

Abstract

Background Major depressive disorder (MDD) is a leading cause of global disability and poses a substantial public health burden. However, current diagnostic approaches largely rely on subjective assessments and lack the ability to integrate heterogeneous clinical and sociodemographic information. Recent advances in large language models (LLMs) offer new opportunities to support MDD diagnosis through reasoning over complex data, yet their clinical applicability is constrained by challenges related to interpretability, hallucinations, and reliance on synthetic data. Methods We propose MDD-Thinker, an LLM-based diagnostic system that integrates supervised fine-tuning (SFT) and reinforcement learning (RL) to enhance reasoning and interpretability under the evaluated conditions. Using the UK Biobank dataset, we constructed 40,000 structured reasoning samples and incorporated an additional 10,000 records from publicly available mental health datasets. MDD-Thinker was trained on these heterogeneous textual data and evaluated against conventional machine learning models, deep learning methods, and representative LLM baselines in terms of diagnostic performance and interpretability. Results MDD-Thinker achieved high performance in MDD diagnosis, with an accuracy of 0.8268 and an F1-score of 0.8081, showing better performance than conventional machine learning models, deep learning approaches, and representative LLM baselines on the evaluated dataset. Beyond predictive accuracy, it consistently produced structured reasoning paths that were clinically coherent, enabling transparent interpretation of diagnostic decisions in the evaluated experiments. The integration of SFT and RL contributed to notable improvements in both diagnostic reliability and reasoning quality. Conclusion MDD-Thinker demonstrates the potential of reasoning-enhanced LLMs for large-scale MDD diagnosis under the evaluated settings. By jointly optimizing accuracy, interpretability, and efficiency, the proposed system provides a scalable and explainable approach for intelligent psychiatric assessment within the scope of the study, highlighting the potential of reasoning-oriented LLMs in mental health care.

Original languageEnglish
Article number121405
JournalJournal of Affective Disorders
Volume403
DOIs
Publication statusPublished - 15 Jun 2026

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Keywords

  • Large language models
  • Major depressive disorder
  • Medical data process
  • Reasoning ability
  • Reinforcement learning
  • Supervised fine-tuning

Fingerprint

Dive into the research topics of 'MDD-thinker: A reasoning-enhanced large language model for diagnosis of major depressive disorder'. Together they form a unique fingerprint.

Cite this