Skip to content

BIDS-Xu-Lab/psychiatry-frontier-llm-evaluation

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

69 Commits
 
 
 
 

Repository files navigation

Clinician-rated reasoning quality predicts diagnostic correctness in a psychiatric evaluation of large language models

NOTE: Code and data will made public upon publication, according to this data availability statement as reported in the manuscript: Clinician-authored fictitious vignettes will be publicly available. We will not publicly redistribute text derived from published case reports or verbatim model reasoning traces; citations to original sources will be provided, and access to restricted materials may be provided under controlled conditions (e.g., to qualified researchers under a data-use agreement and/or institutional approval).

Description: Code and data repository for the paper "Clinician-rated reasoning quality predicts diagnostic correctness in a psychiatric evaluation of large language models".

Preprint: https://www.medrxiv.org/content/10.64898/2026.02.03.26345402v2

Corresponding author: Kevin W. Jin

About

Code and data repository for "Clinician-rated reasoning quality predicts diagnostic correctness in a psychiatric evaluation of large language models".

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors