Major depressive disorder (MDD) occurs commonly in the general population. Cross-sectional epidemiological surveys that systematically screen for psychiatric disorders in unselected, representative samples find that approximately 7% of U.S. adults had experienced an episode of MDD sometime in the 12 months before the interview (1). Using retrospective assessment, such surveys estimate that 16% of adults have experienced a major depressive episode during their lifetime; longitudinal studies suggest that cumulative lifetime prevalence rates may be higher still (2, 3). Recurrent MDD, i.e., a condition in which individuals experience multiple clinically significant depressive episodes over the course of their life, occurs in about one-third to two-thirds of these individuals (4); as discussed below, this subtype may have more relevance for genetics. Because of the high prevalence of MDD in countries around the world and the substantial impairment with which it is associated, the World Health Organization has predicted that it would rank second to ischemic heart disease in global burden of disease by 2020 (5). Therefore, understanding its etiological and pathophysiological basis is critical.
This article will review and summarize the extant research regarding the genetics of major depressive disorder. The term "disorder" emphasizes the focus on clinically defined depressive conditions such as MDD rather than a vaguely defined state of "depression." Some have reasonably argued that the nosological definitions for psychiatric diagnoses, although designed and useful for making clinical assignments and treatment decisions, do not necessarily provide an optimum basis for genetic or other biological lines of inquiry. For example, the precise thresholds requiring a 2-week duration or five of nine associated symptoms for DSM-IV diagnosis of MDD do not seem to represent a genetically distinct condition (6). Nor can we answer the deeper question, with its philosophical and pragmatic implications, as to whether depressive "illness" is categorically different from states of normal sadness (as is assumed in the "medical model") or simply a more severe manifestation of it on the same continuum of affect. This sort of question has arguably more relevance for depressive disorders than for, say, schizophrenia. If, as the medical model assumes, there are "disease genes" that have specific dysfunction in MDD, then the procedure is to detect these using the classic case-control design, testing for differences in the structure or function of specific genes between those who are affected with and those who are free from the disorder. On the other hand, if the mechanisms underlying normal variation in mood are the same as those that "cause" affective illness, then one could simply study the former in a general population sample to identify the genetic mechanisms underlying MDD. This approach has clear parallels to the study of other common medical conditions such as hypertension and diabetes and their relationship to physiological regulation of blood pressure and glucose homeostasis, respectively. Because of the absence of knowledge of these fundamental features of the entity of clinical depression, I will limit this review to studies that examine the syndrome of MDD, for lack of a better current model.
The usual sequence of establishing the genetic basis of a condition goes from (1) family studies to establish its familial aggregation, to (2) twin and adoption studies to distinguish genetic versus family environmental sources of aggregation, and then to (3) molecular genetic studies to identify specific susceptibility loci and their functional role in producing the condition. Family studies compare rates of illness in relatives of those who have the condition (case probands) with rates in relatives of healthy control subjects. Higher rates in the former group of relatives suggest familial aggregation; i.e., there is an association of illness in probands with illness in their relatives. A systematic meta-analysis that included five well-done family studies of MDD reported a summary odds ratio of 2.84, i.e., individuals biologically related to someone with MDD have almost three times the risk of MDD compared with those without a family history of the disorder (7). To put this statistic into perspective, relatives of patients with schizophrenia have about a 10-fold risk of developing that condition compared with the general population. Both of these are similar to familiar risk estimates for common medical conditions such as type II diabetes but far smaller than those for a Mendelian disorder such as Huntington's disease.
Twin studies allow one to separate genetic from environmental influences by comparing resemblance for the condition within two types of twin pairs: monozygotic twins, who share 100% of their DNA, and dizygotic twins, who share, on average, 50% of their DNA like other sibling pairs. Greater monozygotic than dizygotic resemblance for the condition suggests at least a partial genetic basis. This is quantified by estimating the heritability, the portion of individual differences due to genetic effects. [See Neale and Cardon (8) for more details on twin study methodology.] The meta-analysis cited above includes data from five well-done twin studies of MDD and reports a summary heritability of 37% (7). One large twin study that used longitudinal data to reduce measurement error estimated the reliability-adjusted heritability to be 66% (9). These results suggest that the effects of genes account for one-third to two-thirds of liability to MDD, with the remaining portion of risk due to environmental influences. For the latter, childhood adversity and recent stressful life events are firmly established risk factors in MDD [see, for example, Paykel (10) and Kessler (11)].
+
Factors affecting genetic risk of MDD
Several modifiers of MDD genetic risk have been identified [reviewed in detail by Levinson (12)]. Recurrent MDD has been shown in some studies to have greater familial risk; thus, having a single depressive episode sometime in one's lifetime may have less to do with genetics than does MDD that recurs. Similarly, for age of onset: more familial forms of the disorder may manifest earlier in life. For these reasons, some molecular genetic researchers seeking to identify depression susceptibility genes have focused their efforts on the phenotype of recurrent, early-onset MDD. Female gender, on the other hand, although an important independent risk factor for MDD (13), has not been reliably associated with an overall difference in familial risk (14), although some of the specific genetic factors may differ between men and women (15). For this reason, some studies stratify their genetic analyses by gender to be sensitive to these effects.
Although too complex to review in detail here, an important consideration for genetic studies of MDD is psychiatric comorbidity. More than half of all individuals with MDD develop an anxiety disorder during their lifetime (16, 17). In addition, many of those who develop MDD or an anxiety disorder have elevated premorbid personality traits such as neuroticism. This observation may be explained, in part, by the suggestion by some genetic epidemiological studies that there may be liability genes that increase risk for internalizing psychopathology in general rather than for specific psychiatric disorders. [For reviews of these studies, see Middeldorp et al. (18) or Hettema (19).] This hypothesis suggests a gene-finding strategy for MDD, adopted by some groups, that blurs diagnostic boundaries in choosing subjects for study, such as including individuals with either MDD or a putatively genetically related condition such as generalized anxiety disorder or simply studying the genetics of neuroticism as a continuous trait in the population.
Once a genetic basis for a condition has been demonstrated by genetic epidemiological studies, molecular genetic studies are undertaken to attempt to identify the particular genes underlying that basis. As explained in greater detail by other articles in this issue, the two main types of molecular genetic approaches are linkage and association studies, as applied to MDD below. Before these are reviewed in detail, they need to be placed in their historical context. In classic Mendelian genetic disorders that display specific patterns of familial transmission such as autosomal dominant or recessive, a relatively rare genetic disease strongly aggregates in certain families due to relatively large detrimental effects of a chromosomal abnormality or changes in a specific gene that changes its functional integrity. However, MDD, with complex patterns of genetic transmission similar to those of other common medical conditions such as hypertension or type 2 diabetes, is probably due to the accumulated effects of many genes of small, more subtle effects that interact with each other and environmental factors longitudinally across one's lifespan to "cause" an individual to develop the condition. Many of the early studies, not appreciating this distinction, did not recruit large enough numbers of subjects, making them insufficiently powered to detect genes of small effect. This limitation applies to both linkage and genetic association studies of psychiatric disorders such as MDD, providing one reason among several to explain why it has been difficult to unambiguously identify the liability genes for these conditions.
Linkage studies, necessarily performed in related individuals such as family pedigrees or sibling pairs, allow one to roughly locate genetic variation underlying a biological trait or condition to particular regions on a chromosome while reducing the likelihood that other regions contain susceptibility loci. In classic Mendelian disorders in which a relatively rare genetic disease strongly aggregated in certain families due to a specific gene, this historically proved to be a powerful technique for localizing disease genes. Using this strategy, several groups performed linkage analyses for MDD, by itself or in combination with related phenotypes, the main findings of which are displayed in Table 1. Few studies found convincing evidence for linkage (genome-wide level significance), and there was little replication across studies for regions with at least suggestive linkage. For MDD, only the short arm of chromosome 1 (1p) contained linkage signals in more than two studies; with this information alone, it is difficult to determine the potential importance of other regions reported.
Consistent with genetic epidemiological studies that support some anxiety disorders and personality traits such as neuroticism as genetically related to MDD, a number of linkage scans for these other phenotypes have identified genomic regions that overlap those from scans involving MDD, as reviewed in Fullerton (28) and Levinson (12). A meta-analysis of eight independent linkage scans for neuroticism (29) implicated regions on 11q and 12q that overlap with linkage to MDD from Zubenko et al. (21) and Abkevich et al. (22), respectively. As is generally the case with linkage studies, the exact location can be quite variable, and there are typically tens to hundreds of potential candidate genes in the linked regions.
Genetic association studies, which may take the form of case-control comparisons in unrelated individuals or family-based transmission tests, allow one to test specific genes or markers within genes for their contribution to normal traits or illness. Most association studies have thus far involved testing one or a few candidate genes implicated indirectly from other data, such as pharmacological agents (e.g., the gene for the serotonin transporter, which underlies the mechanism of action of selective serotonin reuptake inhibitor antidepressants), stress-related biology (e.g., hypothalamic-pituitary-adrenal axis genes), or animal models of depression (e.g., serotonin 1A receptor gene knockouts). The most popularly studied polymorphism in this respect is the 44-base pair insertion/deletion polymorphism (5-HTTLPR) occurring in the promoter region of the serotonin transporter gene (SLC6A4). Both positive and negative reports abound for relating this genetic variant to a myriad of depressive-related phenotypes, making it difficult to draw solid conclusions about its etiological role. The most recent meta-analyses in MDD found a modest main effect for the short (S) allele of this polymorphism (30, 31). Meta-analyses of related phenotypes suggest that there are small but statistically significant effects in association with neuroticism (32—34), suicide (35—37), and obsessive-compulsive disorder (38) but not panic disorder (39). In addition, there is mixed evidence for and subsequent debate over gene-environment interaction effects between 5-HTTLPR and stressful life events (40—42). Such an interaction implies that variation in this gene may increase liability to MDD primarily in individuals who experience significant lifetime stressors such as maltreatment or loss ["genetic control of sensitivity to the environment" (43)].
Similar to linkage studies, most genetic association studies of MDD have been insufficiently powered because of relatively small sample sizes, resulting in a poor record of replicable findings similar to other complex traits (44). Meta-analyses attempt to overcome this limitation by pooling data across studies, effectively increasing the total sample analyzed. However, they are often limited by the inability to include data from all available investigations owing to issues such as heterogeneity between studies or other criteria designed to maximize the quality of the results. Besides the serotonin transporter, separate meta-analyses have examined association of MDD with the gene encoding the dopamine D4 receptor (DRD4) (45), methylenetetrahydrofolate reductase (MTHFR) (46), the serotonin 2A receptor (HTR2A) (47), tyrosine hydroxylase (TH) (48), and angiotensin-converting enzyme (ACE) (49), with only the first two showing significant association across included studies. A recent study reviewed the extant literature for all candidate gene association studies of MDD published before June 2007 (30). This study identified 183 articles that analyzed 393 polymorphisms in 102 candidate genes specifically for association with MDD using acceptable genetic methodology. The authors performed new meta-analyses for 20 polymorphisms in 18 genes that had available data from three or more studies not included in prior meta-analyses. Based on their new analyses together with the prior ones, there is statistical evidence supporting some role for at least five genes in liability to MDD (listed in Table 2): MTHFR, SLC6A3, DRD4, GNB3, and SLC6A4. [Note: Since this analysis, a recent, large study failed to find association with MTHFR (50).] There are probably many more genes involved in MDD, but either (1) insufficient evidence exists for those already studied because of too few or inadequately powered studies or (2) they have yet not been identified as probable candidates for study because of our limited understanding of the pathophysiology of MDD.
To overcome the latter limitation, complex genetic conditions are now being studied with genome-wide association studies (GWASs) that use breakthroughs in high-throughput genotyping technology to interrogate upward of hundreds of thousands of polymorphisms across the genome simultaneously in one experiment rather than gene by gene as is done in candidate gene studies. Three GWASs of MDD have been published to date and one is in press. The first, by Muglia et al. (51), reports the findings of GWASs in two large, independent Caucasian samples: (1) 1,022 patients with recurrent MDD and 1,000 matched control subjects recruited from clinical settings in Munich, Germany, and (2) 492 subjects with recurrent MDD and 1,052 healthy control subjects recruited from a community survey in Lausanne, Switzerland. The researchers analyzed about 522,000 single nucleotide polymorphism (SNP) markers in the first sample and about 370,000 in the second. Although the analyses, separately or combined via meta-analysis, did not provide any findings meeting formal genome-wide statistical significance, the authors provided a table of 27 SNPs on 14 chromosomes that provided nominal but consistent evidence for association to MDD across the two samples. Of note, all of these are novel findings; i.e., they are in or near genes that have never been examined before in relation to MDD. The authors also examined association for candidate genes that have been tested in prior mood disorder studies; GRM7, the gene for the metabotrophic glutamate receptor 7 implicated in a recent GWAS of bipolar disorder, emerged as the most significant of these.
Sullivan et al. (52) conducted another large GWAS using the Genetic Association Information Network-MDD (GAIN-MDD) sample: 1,738 Dutch subjects with MDD recruited from clinical and community settings and 1,802 control subjects selected primarily from the Netherlands Twin Registry as having low genetic liability for depressive or anxiety disorders. Similar to the first study, this one also did not identify any associations with genome-wide significance after testing of approximately 435,000 SNPs, and their best signals were from novel candidates, although with no substantial overlap with the top candidates as reported in the first study. They followed up the most convincing candidate gene, PCLO on chromosome 7 coding for the presynaptic protein Piccolo, by testing its association to MDD in five independent samples totaling 6,079 individuals with MDD and 5,893 control subjects. In only one of the replication samples did several of the PCLO SNPs show modest association, although, overall, strict guidelines for replication were not met. However, in a reanalysis of those data Bochdanovits et al. (53) claimed that there is support for a role in liability to MDD for a nonsynonymous coding SNP (rs2522833) in PCLO. Other replication attempts are currently in progress. An independent study performed in a population-based cohort of elderly Dutch individuals recently reported association of rs2522833 with depressive disorders (54).
The third GWAS for MDD (55) was performed in 1,221 Caucasian patients selected from those who participated in the Sequenced Treatment Alternatives to Relieve Depression (STAR*D) (56), a multisite clinical trial; the 1,636 control subjects were drawn from subjects who screened negative for MDD, bipolar disorder, or schizophrenia from the Molecular Genetics of Schizophrenia study in GAIN (57). That analysis, like the others, did not identify any markers with genome-wide significance. Their most strongly associated signals came from a region containing no known genes on 19q12. In that article, the authors combined their data with those from two other MDD GWASs: the Sullivan et al. study in the GAIN-MDD sample described above and an analysis performed in the same 1,636 control subjects from the Molecular Genetics of Schizophrenia sample together with 1,020 subjects from the Genetics of Early-Onset Major Depression (GenRED) sample (listed in Table 1 under linkage analyses) and its follow-up, GenRED II (58). They performed a meta-analysis of these three studies using more than 2.4 million imputed (estimated) SNPs, the most promising of which implicated three genes: (1) ATP6V1B2 on 8p21.3, a gene also modestly implicated in a bipolar disorder GWAS that encodes for a vacuolar proton ATPase, (2) SP4 on 7p15.3, encoding a brain-specific zinc-finger transcription factor, and (3) GRM7 on 3p26.1, the same metabotrophic glutamate receptor gene mentioned in the first MDD GWAS above.
MDD is a common condition that aggregates in families to a modest degree, most likely due to a large number of genes of small effect. It probably shares some of its genetic risk factors with other internalizing phenotypes such as neuroticism and the anxiety disorders. Linkage studies have not strongly and consistently implicated any specific regions in the genome for harboring susceptibility loci for MDD, a problem that has plagued linkage analyses of other complex disorders. Although hundreds of candidate gene association studies have been published, meta-analyses report only modest evidence supporting five genes thus far: MTHFR (related to folate metabolism), SLC6A3 (dopamine transporter), DRD4 (dopamine receptor type 4), GNB3 (G-protein β subunit 3), and SLC6A4 (serotonin transporter). Thus far, available GWAS data suggest a number of novel susceptibility candidates, including replicated support for the glutamate receptor gene GRM7 gene and, possibly, PCLO; the prior five candidate genes were not among the top-ranked loci. More powerful GWASs, meta-analyses, and replication studies are forthcoming, so this is a rapidly moving target. Except for the serotonin transporter gene, models of gene-environment interaction in MDD, probably an important factor in understanding how genetic factors and life events produce illness in some individuals but not others, have been explored in few studies. Thus, there is still much work to be done before a clear understanding of depression susceptibility genes and their role in the development of MDD will be attained.