PHQ-9: Professional Clinical Reference
This document provides a technically detailed overview of the Patient Health Questionnaire-9 (PHQ-9) for clinicians, researchers, and mental health professionals. For a plain-language patient overview, see the PHQ-9 Overview.
Disclaimer: This content is for educational and informational purposes only. It does not constitute clinical guidelines or professional medical advice. Always refer to current DSM-5 criteria and your professional training when applying clinical tools.
Origins and Validation
The PHQ-9 was developed by Drs. Robert Spitzer, Janet Williams, Kurt Kroenke and colleagues at Columbia University, funded by an educational grant from Pfizer. It was first published in the Journal of General Internal Medicine in 2001.
The scale is derived directly from the DSM-IV (now DSM-5) criteria for major depressive disorder. Each of the nine items maps to one of the nine diagnostic criteria. This structural alignment makes PHQ-9 scores directly interpretable in the context of DSM categorical diagnosis.
Psychometric Properties
Sensitivity and Specificity
At the standard cutoff of ≥10 for major depression:
- Sensitivity: 88%
- Specificity: 88%
- Positive predictive value: ~75% (varies by population prevalence)
- Negative predictive value: ~95%
Source: Kroenke K, Spitzer RL, Williams JB. The PHQ-9: validity of a brief depression severity measure. J Gen Intern Med. 2001.
Internal Consistency
- Cronbach's alpha: 0.86–0.89 (excellent internal consistency)
- Test-retest reliability (over 48 hours): r = 0.84
Factor Structure
The PHQ-9 has been extensively assessed for its factor structure. A robust bifactor model has been identified, with one general depression factor and two specific factors (somatic and cognitive-affective), though the unidimensional model performs adequately for most clinical uses.
Scoring and Cutoff Points
| Total Score | Severity | DSM-5 Aligned Category | |-------------|----------|------------------------| | 0–4 | None–Minimal | Subthreshold | | 5–9 | Mild | Monitor; counseling may be warranted | | 10–14 | Moderate | Evaluate for MDD; treatment consideration | | 15–19 | Moderately Severe | Active treatment warranted | | 20–27 | Severe | Immediate intervention; refer for psychiatric evaluation |
Item 9 (SI/HI): Independent clinical attention is warranted for any item 9 score of ≥1, regardless of total PHQ-9 score. Suicidal ideation documented on item 9 should trigger immediate safety assessment.
PHQ-9 as Severity Measure vs. Screening Tool
The PHQ-9 functions both as a binary screener (positive/negative for MDD at ≥10) and as a continuous severity measure. For treatment monitoring, tracking total score change over time is more informative than categorical assignment.
Clinically meaningful change: A change of ≥5 points is generally considered clinically significant. Response is conventionally defined as ≥50% reduction from baseline. Remission is commonly defined as PHQ-9 ≤4.
Limitations and Clinical Considerations
Somatic Item Confounding
Items 3 (sleep), 4 (fatigue), and 8 (psychomotor changes) carry significant somatic loading. In patients with chronic medical illness, these items may inflate scores independent of depression. Several modified scoring approaches (e.g., PHQ-9 with somatic items excluded) have been proposed for medical settings.
Not Designed for Diagnosis Alone
The PHQ-9 is a severity measure and screening tool — not a diagnostic instrument. A clinically significant PHQ-9 score should prompt a full clinical interview, not trigger automatic treatment decisions.
Cultural and Linguistic Validity
The PHQ-9 has been validated in multiple languages and cultural contexts, but cultural expression of depression symptoms varies. The somatic presentation dominant in some non-Western populations may not be fully captured by the PHQ-9's cognitive-affective emphasis.
Population Applicability
The PHQ-9 was developed and initially validated in primary care and obstetrics populations. Validity data in specific populations (adolescents, elderly, perinatal, oncology) is growing but differs. The PHQ-A is the validated adaptation for adolescent use.
Comparison With Related Instruments
| Instrument | Items | Time | Best For | |-----------|-------|------|---------| | PHQ-9 | 9 | 2–3 min | Depression screening + severity monitoring | | BDI-II | 21 | 10 min | Detailed depression assessment | | MADRS | 10 | 15–20 min | Clinician-rated, treatment sensitivity | | QIDS-SR | 16 | 5–7 min | Research, multiple symptom domains | | PHQ-2 | 2 | 1 min | Ultra-brief initial screen only |
Integration With Digital Health Platforms
The PHQ-9 has been implemented in multiple digital health modalities — from standalone self-report apps to EHR-integrated workflows. Digital delivery of the PHQ-9 produces scores that are highly concordant with paper administration (r > 0.90).
Ecologically valid, continuous mood monitoring through journal-based platforms like Rohy AI captures behavioral and emotional data between PHQ-9 administrations — providing longitudinal signal that infrequent clinic-based screening cannot. The combination of standardized point-in-time assessment (PHQ-9 administered per visit) with continuous linguistic and behavioral tracking (Rohy AI daily entries) represents an emerging best practice in digital mental health integration.
License and Attribution
The PHQ-9 is freely available for use by clinicians and researchers. No special permission is required to reproduce or use the PHQ-9. The PHQ is not in the public domain — copyright is held by Pfizer, which grants free use in all settings.
For citation purposes: Kroenke K, Spitzer RL, Williams JB (2001). The PHQ-9: validity of a brief depression severity measure. Journal of General Internal Medicine, 16(9), 606-613.
See also: Understanding Clinical Scales · PHQ-9 Simple Overview · GAD-7 Overview · Clinical Hub · Provider Resources
PHQ-9 Correlates with Rohy AI Psychological Models
Rohy AI maps journal entries against 19 validated psychological models simultaneously. Depression severity measured by the PHQ-9 shows the strongest research-supported correlations with the following models in Rohy's analysis engine:
| Rohy AI Model | PHQ-9 Relationship | Evidence Basis | |---|---|---| | Trait EI | Inverse — low emotional intelligence correlates with higher PHQ-9 | Martins et al., 2010 meta-analysis | | Self-Esteem | Strong inverse — Rosenberg self-esteem is one of the most robust predictors of depression | Sowislo & Orth, 2013 | | PsyCap (Hope/Efficacy/Resilience/Optimism) | Inverse — higher PsyCap associated with lower depression | Avey et al., 2011 | | Grit | Moderate inverse — perseverance buffers against depressive episodes | Salles et al., 2014 | | Mindfulness | Strong inverse — mindfulness practice significantly reduces PHQ-9 scores | Hofmann et al., 2010 | | Locus of Control | Internal locus correlates with lower depression | Benassi et al., 1988 meta-analysis | | Hardiness | Inverse — psychological hardiness mediates depression under stress | Kobasa, 1979; Maddi, 2006 | | Procrastination | Positive — chronic procrastination elevates depression risk | Sirois & Pychyl, 2016 | | Big Five (Neuroticism) | Strongest personality predictor of depression — high neuroticism ↔ high PHQ-9 | DeNeve & Cooper, 1998 |
These correlations inform Rohy AI's cross-model analysis, allowing the platform to identify protective and risk factors in journal language beyond what any single scale captures. The platform monitors all 19 clinical models simultaneously to provide a comprehensive mental wellness snapshot.
Sources
- Kroenke K, Spitzer RL, Williams JB. The PHQ-9: validity of a brief depression severity measure. J Gen Intern Med. 2001;16(9):606-613. doi:10.1046/j.1525-1497.2001.016009606.x
- Spitzer RL, Kroenke K, Williams JBW, Patient Health Questionnaire Primary Care Study Group. Validation and utility of a self-report version of PRIME-MD. JAMA. 1999;282(18):1737-1744.
- Löwe B, Unützer J, Callahan CM, Perkins AJ, Kroenke K. Monitoring depression treatment outcomes with the Patient Health Questionnaire-9. Med Care. 2004;42(12):1194-1201.
- Martins A, Ramalho N, Morin E. A comprehensive meta-analysis of the relationship between emotional intelligence and health. Pers Individ Dif. 2010;49(6):554-564.
- Sowislo JF, Orth U. Does low self-esteem predict depression and anxiety? A meta-analysis of longitudinal studies. Psychol Bull. 2013;139(1):213-240.
- Avey JB, Reichard RJ, Luthans F, Mhatre KH. Meta-analysis of the impact of positive psychological capital on employee attitudes, behaviors, and performance. Hum Resour Dev Q. 2011;22(2):127-152.
- Hofmann SG, Sawyer AT, Witt AA, Oh D. The effect of mindfulness-based therapy on anxiety and depression: a meta-analytic review. J Consult Clin Psychol. 2010;78(2):169-183.
Put it into practice
Start your AI journaling practice — free
Rohy AI turns insights like these into a personalized mental wellness practice built from your own daily words.
Start Free — No Credit CardFree plan available · Cancel anytime
