Advertisement
Archival Report| Volume 85, ISSUE 12, P1065-1073, June 15, 2019

Genome-wide Burden of Rare Short Deletions Is Enriched in Major Depressive Disorder in Four Cohorts

Open AccessPublished:March 12, 2019DOI:https://doi.org/10.1016/j.biopsych.2019.02.022

      Abstract

      Background

      Major depressive disorder (MDD) is moderately heritable, with a high prevalence and a presumed high heterogeneity. Copy number variants (CNVs) could contribute to the heritable component of risk, but the two previous genome-wide association studies of rare CNVs did not report significant findings.

      Methods

      In this meta-analysis of four cohorts (5780 patients and 6626 control subjects), we analyzed the association of MDD to 1) genome-wide burden of rare deletions and duplications, partitioned by length (<100 kb or >100 kb) and other characteristics, and 2) individual rare exonic CNVs and CNV regions.

      Results

      Patients with MDD carried significantly more short deletions than control subjects (p = .0059) but not long deletions or short or long duplications. The confidence interval for long deletions overlapped with that for short deletions, but long deletions were 70% less frequent genome-wide, reducing the power to detect increased burden. The increased burden of short deletions was primarily in intergenic regions. Short deletions in cases were also modestly enriched for high-confidence enhancer regions. No individual CNV achieved thresholds for suggestive or significant association after genome-wide correction. p values < .01 were observed for 15q11.2 duplications (TUBGCP5, CYFIP1, NIPA1, and NIPA2), deletions in or near PRKN or MSR1, and exonic duplications of ATG5.

      Conclusions

      The increased burden of short deletions in patients with MDD suggests that rare CNVs increase the risk of MDD by disrupting regulatory regions. Results for longer deletions were less clear, but no large effects were observed for long multigenic CNVs (as seen in schizophrenia and autism). Further studies with larger sample sizes are warranted.

      Keywords

      Major depressive disorder (MDD) is a common psychiatric disorder with a lifetime prevalence of 10% to 20% (
      • Kessler R.C.
      • Bromet E.J.
      The epidemiology of depression across cultures.
      ). It was the third leading cause of global disability in 2015 (
      GBD 2015 Disease and Injury Incidence and Prevalence Collaborators
      Global, regional, and national incidence, prevalence, and years lived with disability for 310 diseases and injuries, 1990-2015: A systematic analysis for the Global Burden of Disease Study 2015.
      ). Heritability is approximately 37%, lower than that of several other psychiatric disorders (
      • Sullivan P.F.
      • Daly M.J.
      • O’Donovan M.
      Genetic architectures of psychiatric disorders: The emerging picture and its implications.
      ). The genome-wide contribution of common single nucleotide polymorphisms (SNPs) to MDD risk is approximately 20% (
      • Lee S.H.
      • Ripke S.
      • Neale B.M.
      • Faraone S.V.
      • Purcell S.M.
      • Perlis R.H.
      • et al.
      Genetic relationship between five psychiatric disorders estimated from genome-wide SNPs.
      ). Consistent with the moderate heritability and high population prevalence, it has required more than 100,000 MDD cases to detect large numbers of genome-wide significant SNP associations, e.g., 15 loci in 121,380 cases plus 338,101 control subjects (
      • Hyde C.L.
      • Nagle M.W.
      • Tian C.
      • Chen X.
      • Paciga S.A.
      • Wendland J.R.
      • et al.
      Identification of 15 genetic loci associated with risk of major depression in individuals of European descent.
      ) and 44 loci in a partially overlapping sample (135,458 cases plus 344,901 control subjects) (
      • Wray N.R.
      • Ripke S.
      • Mattheisen M.
      • Trzaskowski M.
      • Byrne E.M.
      • Abdellaoui A.
      • et al.
      Genome-wide association analyses identify 44 risk variants and refine the genetic architecture of major depression.
      ).
      Rare copy number variants (CNVs) could be contributing to the unexplained portion of genetic risk and provide information about disease mechanisms. Two previous MDD studies of longer CNVs reported no significant genome-wide burden in patients with MDD (
      • Rucker J.J.
      • Tansey K.E.
      • Rivera M.
      • Pinto D.
      • Cohen-Woods S.
      • Uher R.
      • et al.
      Phenotypic association analyses with copy number variation in recurrent depressive disorder.
      ,
      • O’Dushlaine C.
      • Ripke S.
      • Ruderfer D.M.
      • Hamilton S.P.
      • Fava M.
      • Iosifescu D.V.
      • et al.
      Rare copy number variation in treatment-resistant major depressive disorder.
      ). Here, to achieve a larger sample size, we performed a meta-analysis of the association of MDD to rare CNVs in 5780 patients and 6626 control subjects from four cohorts. A significant increase of rare, shorter deletions (<100,000 base pairs) was observed in patients with MDD, and this was driven by CNVs in intergenic regions.

      Methods and Materials

      Samples

      We studied four European ancestry cohorts as shown in Table 1. All participants gave signed informed consent under protocols approved by the relevant institutional review boards.
      Table 1Cohorts and Sample Sizes Before and After Quality Control Filtering


      Cohort
      Pre–Quality Control Sample SizePost–Quality Control Sample Size
      Patients With MDD (Male/Female)Control Subjects (Male/Female)Patients With MDD (Male/Female)Control Subjects (Male/Female)
      RADIANT3087 (908/2179)3157 (1522/1635)2460 (724/1736)2587 (1240/1347)
      NESDA/NTR1637 (509/1128)2030 (765/1265)1568 (488/1080)1913 (719/1194)
      GenRED I1089 (319/770)1345 (784/561)941 (271/670)1264 (743/521)
      GenRED II831 (144/687)944 (418/526)811 (139/672)862 (384/478)
      Total6644 (1880/4764)7476 (3489/3987)5780 (1622/4158)6626 (3086/3540)
      GenRED, Genetics of Recurrent Early-Onset Depression; MDD, major depressive disorder; NESDA, Netherlands Study of Depression and Anxiety; NTR, Netherlands Twin Register.

      RADIANT Cohort

      The RADIANT cohort (
      • Rucker J.J.
      • Tansey K.E.
      • Rivera M.
      • Pinto D.
      • Cohen-Woods S.
      • Uher R.
      • et al.
      Phenotypic association analyses with copy number variation in recurrent depressive disorder.
      ) included patients from three studies of recurrent MDD and two control subject cohorts (458 control subjects who were screened for lifetime absence of psychiatric disorder and 2699 control subjects from phase 2 of the National Blood Service Wellcome Trust Case Control Consortium subcohort). Patients were interviewed with the Schedules for Clinical Assessment in Neuropsychiatry (
      • Wing J.K.
      • Babor T.
      • Brugha T.
      • Burke J.
      • Cooper J.E.
      • Giel R.
      • et al.
      SCAN. Schedules for Clinical Assessment in Neuropsychiatry.
      ) and diagnosed using ICD-10 or DSM-IV criteria. Exclusion criteria were any history or family history of schizophrenia or bipolar disorder or any history of mood disorder secondary to alcohol/substance misuse or of mood-incongruent psychosis (
      • Rucker J.J.
      • Tansey K.E.
      • Rivera M.
      • Pinto D.
      • Cohen-Woods S.
      • Uher R.
      • et al.
      Phenotypic association analyses with copy number variation in recurrent depressive disorder.
      ).

      Netherlands Study of Depression and Anxiety/Netherlands Twin Register

      Patients with MDD and control subjects were drawn from the Netherlands Twin Register (NTR) (
      • Willemsen G.
      • Vink J.M.
      • Abdellaoui A.
      • den Braber A.
      • van Beek J.H.
      • Draisma H.H.
      • et al.
      The Adult Netherlands Twin Register: Twenty-five years of survey and biological data collection.
      ) and the Netherlands Study of Depression and Anxiety (NESDA) (
      • Penninx B.W.
      • Beekman A.T.
      • Smit J.H.
      • Zitman F.G.
      • Nolen W.A.
      • Spinhoven P.
      • et al.
      The Netherlands Study of Depression and Anxiety (NESDA): Rationale, objectives and methods.
      ). Cases had DSM-IV MDD diagnoses as assessed with the Composite Interview Diagnostic Instrument (
      • Boomsma D.I.
      • Willemsen G.
      • Sullivan P.F.
      • Heutink P.
      • Meijer P.
      • Sondervan D.
      • et al.
      Genome-wide association of major depression: Description of samples for the GAIN Major Depressive Disorder Study: NTR and NESDA biobank projects.
      ).

      Genetics of Recurrent Early-Onset Depression

      Patients with MDD and control subjects were drawn from the Genetics of Recurrent Early-Onset Depression (GenRED) cohort (
      • Levinson D.F.
      • Zubenko G.S.
      • Crowe R.R.
      • DePaulo R.J.
      • Scheftner W.S.
      • Weissman M.M.
      • et al.
      Genetics of recurrent early-onset depression (GenRED): Design and preliminary clinical characteristics of a repository sample for genetic linkage studies.
      ,
      • Shi J.
      • Potash J.B.
      • Knowles J.A.
      • Weissman M.M.
      • Coryell W.
      • Scheftner W.A.
      • et al.
      Genome-wide association study of recurrent early-onset major depressive disorder.
      ). Patients had a consensus DSM-IV MDD diagnosis based on a Diagnostic Interview for Genetic Studies interview and other information, with recurrence or chronicity (an episode lasting ≥3 years), onset before 31 years of age, one or more siblings or parents with recurrent MDD and onset before 41 years of age, MDD independent of substance dependence, no bipolar, schizoaffective disorder or schizophrenia diagnosis, and no parent or sibling with suspected bipolar disorder I. The control subjects (n = 1345) from the Molecular Genetics of Schizophrenia cohort (
      • Sanders A.R.
      • Duan J.
      • Levinson D.F.
      • Shi J.
      • He D.
      • Hou C.
      • et al.
      No significant association of 14 candidate genes with schizophrenia in a large European ancestry sample: Implications for psychiatric genetics.
      ) denied (by online screen) ever meeting DSM-IV MDD gate criteria (no 2-week period of depressed mood or anhedonia most of the day, nearly every day), whereas the published GenRED genome-wide association study (GWAS) (
      • Levinson D.F.
      • Zubenko G.S.
      • Crowe R.R.
      • DePaulo R.J.
      • Scheftner W.S.
      • Weissman M.M.
      • et al.
      Genetics of recurrent early-onset depression (GenRED): Design and preliminary clinical characteristics of a repository sample for genetic linkage studies.
      ) included control subjects who never met full MDD criteria by online screen (
      • Sanders A.R.
      • Levinson D.F.
      • Duan J.
      • Dennis J.M.
      • Li R.
      • Kendler K.S.
      • et al.
      The Internet-based MGS2 control sample: Self report of mental illness.
      ).

      GenRED II

      Patients with MDD were from the second GenRED GWAS wave (same criteria as GenRED). Control subjects were drawn from the Genomic Psychiatry Cohort (
      • Pato M.T.
      • Sobell J.L.
      • Medeiros H.
      • Abbott C.
      • Sklar B.M.
      • Buckley P.F.
      • et al.
      The genomic psychiatry cohort: Partners in discovery.
      ), Depression Genes and Networks (
      • Mostafavi S.
      • Battle A.
      • Zhu X.
      • Potash J.B.
      • Weissman M.M.
      • Shi J.
      • et al.
      Type I interferon signaling genes in recurrent major depression: Increased expression detected by whole-blood RNA sequencing.
      ), and the Mayo Clinic (
      • Sobell J.L.
      • Heston L.L.
      • Sommer S.S.
      Novel association approach for determining the genetic predisposition to schizophrenia: Case-control resource and testing of a candidate gene.
      ). Control subjects were drawn from the Genomic Psychiatry Cohort (
      • Pato M.T.
      • Sobell J.L.
      • Medeiros H.
      • Abbott C.
      • Sklar B.M.
      • Buckley P.F.
      • et al.
      The genomic psychiatry cohort: Partners in discovery.
      ) (screened for lifetime depression with a self-report questionnaire), Depression Genes and Networks (
      • Mostafavi S.
      • Battle A.
      • Zhu X.
      • Potash J.B.
      • Weissman M.M.
      • Shi J.
      • et al.
      Type I interferon signaling genes in recurrent major depression: Increased expression detected by whole-blood RNA sequencing.
      ) (screened with a Structured Clinical Interview for DSM-IV), and the Mayo Clinic (
      • Sobell J.L.
      • Heston L.L.
      • Sommer S.S.
      Novel association approach for determining the genetic predisposition to schizophrenia: Case-control resource and testing of a candidate gene.
      ) (screened based on diagnoses in the electronic medical record over an extended period).

      Genotyping

      Patients with MDD in the RADIANT cohort and screened control subjects were genotyped with the HumanHap 610-Quad beadchip (Illumina, Inc., San Diego, CA), and the unscreened National Blood Service samples were genotyped with Illumina Infinium 1M beadchips (hg18 for both) (
      • Rucker J.J.
      • Tansey K.E.
      • Rivera M.
      • Pinto D.
      • Cohen-Woods S.
      • Uher R.
      • et al.
      Phenotypic association analyses with copy number variation in recurrent depressive disorder.
      ). The NTR/NESDA (
      • Abdellaoui A.
      • Ehli E.A.
      • Hottenga J.J.
      • Weber Z.
      • Mbarek H.
      • Willemsen G.
      • et al.
      CNV concordance in 1,097 MZ twin pairs.
      ) and GenRED cohorts were genotyped with the Affymetrix Human Genome-Wide SNP 6.0 Array (Affymetrix, Santa Clara, CA) (hg18) (
      • Shi J.
      • Potash J.B.
      • Knowles J.A.
      • Weissman M.M.
      • Coryell W.
      • Scheftner W.A.
      • et al.
      Genome-wide association study of recurrent early-onset major depressive disorder.
      ), and GenRED II patients and control subjects were genotyped with the Illumina Omni1-Quad beadchip (hg19) (
      • Ripke S.
      • Wray N.R.
      • Lewis C.M.
      • Hamilton S.P.
      • Weissman M.M.
      • Breen G.
      • et al.
      A mega-analysis of genome-wide association studies for major depressive disorder.
      ).

      Selection of CNV Calling Algorithms

      CNVs were called with PennCNV (
      • Wang K.
      • Li M.
      • Hadley D.
      • Liu R.
      • Glessner J.
      • Grant S.F.
      • et al.
      PennCNV: An integrated hidden Markov model designed for high-resolution copy number variation detection in whole-genome SNP genotyping data.
      ), QuantiSNP (
      • Colella S.
      • Yau C.
      • Taylor J.M.
      • Mirza G.
      • Butler H.
      • Clouston P.
      • et al.
      QuantiSNP: An objective Bayes hidden-Markov model to detect and accurately map copy number variation using SNP genotyping data.
      ) and iPattern (
      • Pinto D.
      • Pagnamenta A.T.
      • Klei L.
      • Anney R.
      • Merico D.
      • Regan R.
      • et al.
      Functional impact of global rare copy number variation in autism spectrum disorders.
      ) in the RADIANT dataset (using 562,329 probes common to the two platforms), with Birdsuite (
      • Korn J.M.
      • Kuruvilla F.G.
      • McCarroll S.A.
      • Wysoker A.
      • Nemesh J.
      • Cawley S.
      • et al.
      Integrated genotype calling and association analysis of SNPs, common copy number polymorphisms and rare CNVs.
      ) and PennCNV (
      • Wang K.
      • Li M.
      • Hadley D.
      • Liu R.
      • Glessner J.
      • Grant S.F.
      • et al.
      PennCNV: An integrated hidden Markov model designed for high-resolution copy number variation detection in whole-genome SNP genotyping data.
      ) in the NTR/NESDA dataset, with Birdsuite (
      • Korn J.M.
      • Kuruvilla F.G.
      • McCarroll S.A.
      • Wysoker A.
      • Nemesh J.
      • Cawley S.
      • et al.
      Integrated genotype calling and association analysis of SNPs, common copy number polymorphisms and rare CNVs.
      ) in the GenRED dataset, and with QuantiSNP (
      • Colella S.
      • Yau C.
      • Taylor J.M.
      • Mirza G.
      • Butler H.
      • Clouston P.
      • et al.
      QuantiSNP: An objective Bayes hidden-Markov model to detect and accurately map copy number variation using SNP genotyping data.
      ) and PennCNV (
      • Wang K.
      • Li M.
      • Hadley D.
      • Liu R.
      • Glessner J.
      • Grant S.F.
      • et al.
      PennCNV: An integrated hidden Markov model designed for high-resolution copy number variation detection in whole-genome SNP genotyping data.
      ) in the GenRED II dataset. There is no consensus “optimal” calling algorithm for each platform. Various authors use a single calling method (
      • O’Dushlaine C.
      • Ripke S.
      • Ruderfer D.M.
      • Hamilton S.P.
      • Fava M.
      • Iosifescu D.V.
      • et al.
      Rare copy number variation in treatment-resistant major depressive disorder.
      ,
      • Szatkiewicz J.P.
      • O’Dushlaine C.
      • Chen G.
      • Chambert K.
      • Moran J.L.
      • Neale B.M.
      • et al.
      Copy number variation in schizophrenia in Sweden.
      ), agreement between two methods (
      • Abdellaoui A.
      • Ehli E.A.
      • Hottenga J.J.
      • Weber Z.
      • Mbarek H.
      • Willemsen G.
      • et al.
      CNV concordance in 1,097 MZ twin pairs.
      ,
      • Buizer-Voskamp J.E.
      • Muntjewerff J.W.
      • Strengman E.
      • Sabatti C.
      • Stefansson H.
      • Vorstman J.A.
      • et al.
      Genome-wide analysis shows increased frequency of copy number variation deletions in Dutch schizophrenia patients.
      ), or more complex approaches (
      • Sanders S.J.
      • Ercan-Sencicek A.G.
      • Hus V.
      • Luo R.
      • Murtha M.T.
      • Moreno-De-Luca D.
      • et al.
      Multiple recurrent de novo CNVs, including duplications of the 7q11.23 Williams syndrome region, are strongly associated with autism.
      ,
      • Marshall C.R.
      • Howrigan D.P.
      • Merico D.
      • Thiruvahindrapuram B.
      • Wu W.
      • Greer D.S.
      • et al.
      Contribution of copy number variants to schizophrenia from a genome-wide study of 41,321 subjects.
      ).
      We conducted a preliminary analysis of CNV call concordance for duplicate genotypes for 115 Affymetrix 6.0 samples and 20 Illumina Human610-Quad samples. For Affymetrix, we compared CNVision (
      • Sanders S.J.
      • Ercan-Sencicek A.G.
      • Hus V.
      • Luo R.
      • Murtha M.T.
      • Moreno-De-Luca D.
      • et al.
      Multiple recurrent de novo CNVs, including duplications of the 7q11.23 Williams syndrome region, are strongly associated with autism.
      ), QuantiSNP (
      • Colella S.
      • Yau C.
      • Taylor J.M.
      • Mirza G.
      • Butler H.
      • Clouston P.
      • et al.
      QuantiSNP: An objective Bayes hidden-Markov model to detect and accurately map copy number variation using SNP genotyping data.
      ), PennCNV (
      • Wang K.
      • Li M.
      • Hadley D.
      • Liu R.
      • Glessner J.
      • Grant S.F.
      • et al.
      PennCNV: An integrated hidden Markov model designed for high-resolution copy number variation detection in whole-genome SNP genotyping data.
      ), and Birdsuite (
      • Korn J.M.
      • Kuruvilla F.G.
      • McCarroll S.A.
      • Wysoker A.
      • Nemesh J.
      • Cawley S.
      • et al.
      Integrated genotype calling and association analysis of SNPs, common copy number polymorphisms and rare CNVs.
      ) and each pair of algorithms, plus the addition of CNVision’s pCNV parameter (estimating the probability of a true CNV, based on per-SNP variability of log R ratio [LRR] and the number of SNPs consistent with a CNV based on B allele frequency [BAF]). For Illumina we compared all algorithms (except Birdsuite) and pairs plus the addition of pCNV. We also conducted the analyses for short (<100 kb) and long (>100 kb) CNVs separately.
      For Affymetrix, Birdsuite had the highest concordance rate (deletions and duplications), whereas combining it with any other method slightly increased concordance but excluded >40% of calls (Supplemental Table S1). We therefore used Birdsuite alone for Affymetrix data. For Illumina data, QuantiSNP alone had the best concordance for deletions (Supplemental Table S2). For duplications, concordance was highest for QuantiSNP alone; calls made by both PennCNV and QuantiSNP showed improved concordance but excluded >30% of calls. We used QuantiSNP for primary analyses, plus a secondary “narrow” QuantiSNP + PennCNV analysis. For both platforms, concordance was similar for shorter and longer CNVs (Supplemental Tables S3–S6).

      Quality Control of Samples and CNV Calls

      Exclusion criteria for samples were applied to each cohort separately. For NTR/NESDA (
      • Abdellaoui A.
      • Ehli E.A.
      • Hottenga J.J.
      • Weber Z.
      • Mbarek H.
      • Willemsen G.
      • et al.
      CNV concordance in 1,097 MZ twin pairs.
      ) and GenRED (
      • Shi J.
      • Potash J.B.
      • Knowles J.A.
      • Weissman M.M.
      • Coryell W.
      • Scheftner W.A.
      • et al.
      Genome-wide association study of recurrent early-onset major depressive disorder.
      ), exclusions were applied to samples retained by the original studies, using the previous Birdsuite calls: 1) probe intensity variances >4 SDs above the cohort mean; 2) total number or length of deletions or duplications >3 SDs above the mean; 3) any chromosome with number or length of deletions or duplications >7 SDs above the mean; and 4) only autosomal CNVs were called. For Illumina data (RADIANT and GenRED II cohorts), we re-called CNVs with QuantiSNP and PennCNV from raw LRR and BAF data. Exclusion criteria for unfiltered calls were 1) genotype call rate <99%; 2) >5% of SNPs with LRR < −0.5 or > 0.5; 3) >1% of SNPs with LRR < −1; 4) BAF drift > 0.01; 5) LRR SD > 0.28; 6) waviness factor < −0.05 or > 0.05; or 7) total CNV number or length >3 SDs above the cohort mean (deletions or duplications).
      For both platforms, we removed CNVs with fewer than 10 probes and of Birdsuite calls with a logarithm of odds score <10 (duplications) or <6 (deletions) and QuantiSNP calls with a maximum log(Bayes factor) <10. We then merged adjacent deletions (copy numbers 0 or 1) or adjacent duplications (copy numbers 3 or 4) if the number of probes separating them was <30% of probes in the merged region (iterating through each chromosome until all eligible segments were merged, using an in-house script). We removed CNVs with ≥50% overlap with centromeres, telomeres, segmental duplications, or immunoglobulin genes, or length <10 kb (too few probes to call reliably) or >4 Mb [in previous work (
      • Levinson D.F.
      • Duan J.
      • Oh S.
      • Wang K.
      • Sanders A.R.
      • Shi J.
      • et al.
      Copy number variants in schizophrenia: Confirmation of five previous findings and new evidence for 3q29 microdeletions and VIPR2 duplications.
      ), CNVs >4 Mb were disproportionately detected in DNA from lymphoblastic cell lines], or with a frequency of >1% in any of four large-sample cohorts included in the Database of Genomic Variants (
      • Coe B.P.
      • Witherspoon K.
      • Rosenfeld J.A.
      • van Bon B.W.
      • Vulto-van Silfhout A.T.
      • Bosco P.
      • et al.
      Refining analyses of copy number variation identifies specific genes associated with developmental delay.
      ,
      • Cooper G.M.
      • Coe B.P.
      • Girirajan S.
      • Rosenfeld J.A.
      • Vu T.H.
      • Baker C.
      • et al.
      A copy number variation morbidity map of developmental delay.
      ,
      • Abecasis G.R.
      • Auton A.
      • Brooks L.D.
      • DePristo M.A.
      • Durbin R.M.
      • Handsaker R.E.
      • et al.
      An integrated map of genetic variation from 1,092 human genomes.
      ,
      • Mills R.E.
      • Walter K.
      • Stewart C.
      • Handsaker R.E.
      • Chen K.
      • Alkan C.
      • et al.
      Mapping copy number variation by population-scale genome sequencing.
      ), or with a frequency of >1% (based on 50% overlap) in any of our control cohorts.

      Statistical Analysis Overview

      All analyses were conducted for post–quality control (QC) deletions and duplications using PLINK and R software. Genomic locations with hg18 coordinates were converted to hg19 (University of California Santa Cruz LiftOver tool). We first determined (as described below) that effects of cohort and sex had to be controlled appropriately. We chose primary analyses that directly compute an odds ratio (OR) and were equivalent to meta-analysis: logistic regression with sex and cohort covariates (for burden tests) or Cochrane-Mantel-Haenszel (CMH) tests stratified for sex and cohort (for single CNVs), plus meta-analysis and/or permutation tests to check results. We tested two main hypotheses, correcting for multiple tests within each hypothesis.
      Our first hypothesis was that the global burden of rare CNVs is greater in patients with MDD than in control subjects. The four primary analyses were for deletions and duplications, each subdivided by size (<100 kb, >100 kb); the threshold of significance was p < .0125 (.05/4).
      Our second hypothesis was that patients with MDD are more likely to carry specific CNVs. Primary analyses tested association by 1) gene (CNVs impacting exon[s] of the gene) and 2) CNV region defined by pools of overlapping CNVs (PLINK). We established thresholds for significant suggestive association as described below (
      • Levinson D.F.
      • Duan J.
      • Oh S.
      • Wang K.
      • Sanders A.R.
      • Shi J.
      • et al.
      Copy number variants in schizophrenia: Confirmation of five previous findings and new evidence for 3q29 microdeletions and VIPR2 duplications.
      ). Genic tests considered only exonic CNVs because of the stronger mechanistic hypothesis and because exonic and “genic” CNVs were largely overlapping (Supplemental Table S7)—93.2% (deletions) and 99.4% (duplications) of long genic CNVs and 62.6% and 83.6% of short genic CNVs were exonic.

      Effects of Cohort and Sex

      We evaluated two potential confounding variables: cohort and sex (the female proportion was higher in patients with MDD and was variable across cohorts). Multiple linear regressions were performed for total rare deletions or duplications per subject or summed length (Supplemental Table S8), with case-control status, cohort, and sex as independent variables. There were significant effects for cohort (deletions and duplications) and sex (deletions).
      Genome-wide burden analyses were thus performed for short and long deletions and duplications, using logistic regression with sex and cohort as covariates to test for case-control difference. Secondary analyses considered intergenic and genic CNVs, separate analyses of exonic and intronic-only CNVs, singletons, CNVs >500 kb and >1000 kb, and short deletions by 10-kb length bins (10–20, 20–30, etc.). Results were checked against logistic regression for each cohort (with sex as a covariate) followed by meta-analysis of the beta coefficients and standard errors [R function “metagen” (
      • Schwarzer G.
      • Carpenter J.R.
      • Rücker G.
      Meta-Analysis with R.
      )], and permutation tests stratified for cohort and sex (randomly swapping case-control status within the same sex and cohort 100,000 times using the PLINK “--within” option).

      Down-sampled Analysis

      As a check on the effects of uneven numbers of patients/control subjects and males/females per cohort, we repeated burden analyses using a down-sampled dataset: 1622 male and female patients and control subjects (6488 total) drawn from each cohort proportional to its size (Supplemental Table S9).

      Analyses of Single CNVs

      We performed one-sided CMH tests (stratified by sex and cohort) of a case excess of exonic CNVs impacting each RefSeq gene and of CNVs in each “CNV region,” and we checked results with a stratified permutation tests (results were almost identical). To define regions, we used the PLINK “--segment-group” command to identify 994 CNV “pools” of overlapping post-QC CNVs (from all cohorts) and termed the union a CNV region.
      For any CNV with nominally increased case frequency (CMH p < .01), we carried out additional filtering because calling artifacts often produce “significant” results for rare events. We visualized regional LRR and BAF plots for all carriers and a threefold number of noncarriers and superimposed on LRR a plot of estimated probe-by-probe copy number using a different algorithm (
      • Lai T.L.
      • Xing H.
      • Zhang N.
      Stochastic segmentation models for array-based comparative genomic hybridization data analysis.
      ). We also plotted all CNVs in the region. We excluded CNVs for which the probewise algorithm showed no copy number change. After excluding genes/regions where most calls were considered artifacts or were the edges of a common CNV region, we recomputed the CMH tests. We computed a proportion test across the four cohorts for each gene/region and excluded those with significant heterogeneity (p < 3.53 × 10−5 to correct for multiple tests, see below). Supplemental Table S10 lists the inspected regions and reasons for all exclusions.
      Additional exploratory analyses (permutation tests) considered each transcript (http://genome.ucsc.edu/), Encyclopedia of DNA Elements regulatory region, Roadmap Project putative enhancer, promoter and dyadic region, and in aggregate for lists of CNVs with reported associations to psychiatric disorders (
      • Marshall C.R.
      • Howrigan D.P.
      • Merico D.
      • Thiruvahindrapuram B.
      • Wu W.
      • Greer D.S.
      • et al.
      Contribution of copy number variants to schizophrenia from a genome-wide study of 41,321 subjects.
      ,
      • Rees E.
      • Walters J.T.
      • Georgieva L.
      • Isles A.R.
      • Chambert K.D.
      • Richards A.L.
      • et al.
      Analysis of copy number variations at 15 schizophrenia-associated loci.
      ) or developmental delay (
      • Cooper G.M.
      • Coe B.P.
      • Girirajan S.
      • Rosenfeld J.A.
      • Vu T.H.
      • Baker C.
      • et al.
      A copy number variation morbidity map of developmental delay.
      ).
      We used a previously described method (
      • Levinson D.F.
      • Duan J.
      • Oh S.
      • Wang K.
      • Sanders A.R.
      • Shi J.
      • et al.
      Copy number variants in schizophrenia: Confirmation of five previous findings and new evidence for 3q29 microdeletions and VIPR2 duplications.
      ) to estimate thresholds for significant association (expected by chance once in 20 genome-wide studies) and suggestive association (expected once per study). For all 994 CNV regions, the 329 deletion regions intersected with 487 genes, and 665 duplication regions intersected with 1475 genes (a total of 1962 genic tests). However, tests of genes within a region are correlated, and each region contained 4.64 genes on average. The 1962 genic tests represented 423 independent tests (∼1962/4.64). We corrected for 1417 tests (994 regions and 423 genes)—a conservative estimate, because some regions were partially overlapping, and many genes were in more than one region, resulting in a p value threshold for significant association of 3.53 × 10−5 (.05/1417) and for suggestive association of 7.06 × 10−4 (1.0/1417).

      Power Analysis

      Power analyses were conducted for detection of specific CNVs (Supplemental Figure S1). For the ranges of allele frequencies and genotypic relative risks that were observed in this study, power was good to excellent to detect associations at p = .01, but the detection of suggestive or significant association would have required larger relative risks than were observed here.

      Enrichment Analysis of Functional Pathways

      To detect gene sets associated with MDD, we downloaded pathways from the Kyoto Encyclopedia of Genes and Genomes (http://rest.kegg.jp/list/pathway) and Gene Ontology (http://geneontology.org/page/download-annotations). Geneset enrichment methods (
      • Raychaudhuri S.
      • Korn J.M.
      • McCarroll S.A.
      • Altshuler D.
      • Sklar P.
      • Purcell S.
      • et al.
      Accurately assessing the risk of schizophrenia conferred by rare copy-number variation affecting genes with brain function.
      ) were used to test for the enrichment of CNVs (separately for all or exonic CNVs) in all the genes of each pathway relative to all genic CNVs using “--cnv-enrichment-test” in PLINK. Permutation tests of enrichment in cases were also performed by adding “--mperm 10000” in PLINK, with batch and sex as covariates. A set of schizophrenia-associated genes (
      Schizophrenia Working Group of the Psychiatric Genomics Consortium
      Biological insights from 108 schizophrenia-associated genetic loci.
      ) was also tested.
      We also evaluated whether case CNVs were enriched in high-confidence DNaseI regions (−log10 p ≥ 10) from the Encyclopedia of DNA Elements (
      ENCODE Project Consortium
      An integrated encyclopedia of DNA elements in the human genome.
      ) or the Roadmap Epigenomics Project (
      • Kundaje A.
      • Meuleman W.
      • Ernst J.
      • Bilenky M.
      • Yen A.
      • Heravi-Moussavi A.
      • et al.
      Integrative analysis of 111 reference human epigenomes.
      ) (https://personal.broadinstitute.org/meuleman/reg2map/HoneyBadger2_release/). Separately for promoter, enhancer, and dyadic regions, we analyzed all tissues together (i.e., whether more case short deletions intersected with at least one high-confidence regulatory sequence from any tissue) and then each tissue separately (counting high-confidence sequences for that tissue). For intergenic short deletions, averaged across tissues, the proportion of CNVs that overlap high-confidence regulatory regions was 1.3% for promoter regions, 2.0% for dyadic regions, and 14.9% for enhancer regions.

      Results

      Of 14,429 samples, 12,406 passed QC (5780 patients with MDD and 6626 control subjects) (Table 1). The total numbers of rare deletion and duplication calls are shown in Supplemental Table S11.

      Genome-wide Burden

      Cases had more CNVs per subject for rare, short (<100 kb) deletions (p = .00592, OR = 1.0483), driven by intergenic deletions (p = .00714, OR = 1.0716) (Table 2 and by cohort in Supplemental Table S12). Similar results were observed by the primary logistic regression tests (Supplemental Table S13), meta-analysis of cohort-specific logistic regressions (Supplemental Table S14), stratified permutation tests (Supplemental Table S7), and the down-sampled dataset (Supplemental Table S9). Short deletions across the 10- to 100-kb range contributed to the case-control difference (Supplemental Table S15 and Supplemental Figures S2–S3). No significant differences were observed for duplications or long deletions, but the OR for long deletions was positive (1.03), the confidence interval overlapped with that for short deletions (Table 2), and a secondary analysis of all rare deletions was significant (OR = 1.044; 95% confidence interval = 1.013–1.075; p = .0046) (Supplemental Table S16). No significant effect was observed for singleton or very long (>500 kb, >1000 kb) deletions or duplications. There was no evidence of strong heterogeneity by cohort for short deletions (Cochran’s Q test; p = .31) or short intergenic deletions (p = .14) (Supplemental Table S14 and Supplemental Table S12 and Supplemental Figure S4 for results by cohort). The excess of short deletions in cases became more significant when CNVs with frequency >1% in each cohort separately were excluded (rather than >1% in any cohort) (Supplemental Table S17) or when QuantiSNP + PennCNV calls were required for Illumina data (Supplemental Table S18). Burden results did not change after excluding nominally significant CNV regions that failed manual checks (Supplemental Table S19).
      Table 2Genome-wide Burden Analyses of Long and Short Deletions and Duplications (CNVs/Subject)
      CNV TypeCNVs/SubjectOR (95% CI)p Value
      Patients With MDDControl Subjects
      Deletions
      >100 kb—All0.3240.3181.0296 (0.9658–1.0975)3.71 × 10−1
       Intergenic0.1340.1380.9881 (0.8956–1.0899)8.11 × 10−1
       Genic0.1910.1811.0606 (0.9754–1.1531)1.68 × 10−1
      Exonic0.1750.1681.0521 (0.9646–1.1475)2.51 × 10−1
      Intronic0.0150.0121.1672 (0.8591–1.5876)3.23 × 10−1
      <100 kb—All1.0150.9781.0483 (1.0139–1.0843)5.92 × 10−3
      Significant result.
       Intergenic0.5060.4831.0716 (1.0190–1.1270)7.14 × 10−3
      Significant result.
       Genic0.5090.4951.0343 (0.9877–1.0842)1.56 × 10−1
      Exonic0.3300.3101.0552 (0.9965–1.1192)6.95 × 10−2
      Intronic0.1790.1850.9952 (0.9149–1.0825)9.11 × 10−1
      Duplications
      >100 kb—All0.4960.4761.0268 (0.9837–1.0725)2.29 × 10−1
       Intergenic0.0870.0791.0912 (0.9654–1.2333)1.62 × 10−1
       Genic0.4090.3971.0187 (0.9723–1.0677)4.37 × 10−1
      Exonic0.4060.3951.0166 (0.9702–1.0657)4.89 × 10−1
      Intronic0.0040.0021.5254 (0.7880–3.0095)2.13 × 10−1
      <100 kb—All0.6700.7020.9850 (0.9512–1.0194)3.90 × 10−1
       Intergenic0.2520.2660.9788 (0.9166–1.0449)5.21 × 10−1
       Genic0.4180.4360.9845 (0.9410–1.0296)4.96 × 10−1
      Exonic0.3450.3650.9730 (0.9253–1.0225)2.83 × 10−1
      Intronic0.0730.0721.0586 (0.9285–1.2066)3.94 × 10−1
      For rare CNVs (carried by <1% of control subjects in each cohort), we defined four primary case-control tests of CNV subsets: deletions and duplications, and within each type, long (>100 kb) and short (<100 kb). For each subset, the case-control difference in CNVs per subject was tested by logistic regression, stratified for cohort and sex (Bonferroni-corrected threshold of significance p = .05/4, or .0125). Further exploration then considered genomic location: only intergenic, genic (exonic and/or intronic impact), exonic (subset of genic), and only intronic (subset of genic). See Supplemental Table S8 for complete results.
      CI, confidence interval; CNV, copy number variant; MDD, major depressive disorder; OR, odds ratio.
      a Significant result.
      We considered two possible within-cohort confounding factors: DNA source and genotyping platforms. In GenRED II, there were two DNA sources: blood (137 patients and all control subjects) or lymphoblastic cell lines (674 patients) (Supplemental Table S20). CNV burden did not significantly differ between blood and lymphoblastoid cell line case DNAs for any category, with a trend for more long deletions in lymphoblastoid cell line DNA (Supplemental Table S21). RADIANT CNV calls used probes common to Illumina 610-Quad (assayed in patients and screened control subjects) and Illumina 1M (unscreened control subjects). Burden results were similar for patients versus screened or unscreened control subjects, except that patients had more short deletions than screened control subjects (assayed with the same array) (Supplemental Table S22). Thus, neither factor accounted for the main finding.

      Exonic CNVs and CNV Regions

      After all QC, no gene or region met the criteria for significant or suggestive association (Supplemental Table S10). Results with p < .01 are shown in Table 3. These represent four independent loci. Duplications in 15q11.2 achieved p = .00076 (OR = 3.88). These duplications are reciprocal to a well-known deletion region (see Discussion), consistently impacting four genes. Less consistent results are observed in surrounding genes in segmental duplication regions (Supplemental Table S10). Exonic deletions in MSR1 achieve p = .0019 (OR = 1.96); the region test includes several intronic deletions, with a similar result (p = .00075; OR = 2.05). A CNV region containing exonic and intronic deletions in PRKN (formerly PARK2) produced p = .00097 (OR = 1.92); the exonic test for PRKN had p > .01. Finally, there were six duplications, all in patients with MDD, in 6q21 (p = .0059; OR = ∞), including five exonic duplications in ATG5 that overlapped with one upstream duplication. LRR/BAF plots of CNVs shown in Table 3 are provided in Supplemental Figure S5.
      Table 3Copy Number Variant Genes and Regions
      p < .01 case-control difference.
      Gene or RegionChromosomeStartEndAllCMH TestRADGR2GR1NethAnnotation
      CaCoORp ValueCaCoCaCoCaCoCaCo
      Genes (Exonic)
      Deletion
      MSR1815,965,38616,050,30055321.961.9 × 10−32310631510119
      Duplication
      TUBGCP51522,833,39422,873,8912473.887.6 × 10−473110214315q11.2 (reciprocal to well-known deletion region)
      CYFIP11522,892,64823,003,603
      NIPA21523,004,68323,034,427
      NIPA11523,043,27823,086,843
      Regions (Genic and/or Intergenic)
      Deletion
       6q266162,136,159163,489,66865401.929.7 × 10−43317101195137PRKN
       8p22815,817,19616,092,65659332.057.5 × 10−42410181163119MSR1
      Duplication
       6q216106,549,398107,026,32360Inf5.9 × 10−320100030ATG5
       15q11.21522,652,33023,309,2942473.887.6 × 10−4732111043TUBGCP5, CYFIP1, NIPA1, and NIPA2
      Shown are the numbers of cases (patients with major depressive disorder) (out of 5780) and of control subjects (out of 6626) carrying each CNV with post–quality control p < .01. Start and end are genomic positions in base pairs (build hg19) either for the gene for which one or more exons was impacted by each CNV or for the region within which CNVs were counted.
      Ca, cases; CMH, Cochrane-Mantel-Haenszel; CNV, copy number variant; Co, control subjects; GR1, Genetics of Recurrent Early-Onset Depression (genotyped with Affymetrix); GR2, Genetics of Recurrent Early-Onset Depression II (genotyped with Illumina); Neth, Netherlands Study of Depression and Anxiety and Netherlands Twin Register (genotyped with Affymetrix); OR, CMH odds ratio; Inf, infinite; RAD, RADIANT (genotyped with Illumina).
      a p < .01 case-control difference.

      Pathway Enrichment Analysis

      After correction for multiple testing, no Kyoto Encyclopedia of Genes and Genomes or Gene Ontology pathway was enriched with short deletions in patients with MDD.

      Regulatory Regions

      Enhancer regions were modestly enriched in patients with MDD for all tissues combined as defined above (p = .024), and in 5 of 127 specific tissues (p < .05) (Supplemental Table S23).

      Known Loci Associated With Psychiatric Disorders or Developmental Delay

      Permutation tests did not demonstrate case enrichment of CNVs in loci associated with psychiatric disorders (Supplemental Table S24) or developmental delay (Supplemental Table S25). There was no overlap between the CNVs reported in Table 3 and significant MDD GWAS loci (
      • Wray N.R.
      • Ripke S.
      • Mattheisen M.
      • Trzaskowski M.
      • Byrne E.M.
      • Abdellaoui A.
      • et al.
      Genome-wide association analyses identify 44 risk variants and refine the genetic architecture of major depression.
      ,
      • Howard D.M.
      • Adams M.J.
      • Clarke T.K.
      • Hafferty J.D.
      • Gibson J.
      • Shirali M.
      • et al.
      Genome-wide meta-analysis of depression identifies 102 independent variants and highlights the importance of the prefrontal brain regions.
      ).

      Discussion

      This is the largest genome-wide study to date of the association of MDD with rare CNVs. An excess of long CNVs (>100 kb) was initially reported in an analysis of the RADIANT cohort that included additional controls (
      • Rucker J.J.
      • Breen G.
      • Pinto D.
      • Pedroso I.
      • Lewis C.M.
      • Cohen-Woods S.
      • et al.
      Genome-wide association analysis of copy number variation in recurrent depressive disorder.
      ), but a subsequent reanalysis (without the extra controls and with stricter QC, producing a substantial reduction in number of CNVs per subject similar to that reported here) detected no significant excess (
      • Rucker J.J.
      • Tansey K.E.
      • Rivera M.
      • Pinto D.
      • Cohen-Woods S.
      • Uher R.
      • et al.
      Phenotypic association analyses with copy number variation in recurrent depressive disorder.
      ). Another study of longer CNVs in 452 patients with treatment-resistant depression and 811 control subjects also reported no significant differences (
      • O’Dushlaine C.
      • Ripke S.
      • Ruderfer D.M.
      • Hamilton S.P.
      • Fava M.
      • Iosifescu D.V.
      • et al.
      Rare copy number variation in treatment-resistant major depressive disorder.
      ). For schizophrenia, evidence for association of several long CNVs with large effects on risk could be detected with samples comparable in size to RADIANT (
      • Stefansson H.
      • Rujescu D.
      • Cichon S.
      • Pietiläinen O.P.
      • Ingason A.
      • Steinberg S.
      • et al.
      Large recurrent microdeletions associated with schizophrenia.
      ). There were no such findings for single CNVs in the present, larger study. It appears that long, multigenic CNVs are less likely to have large effects on the risk of MDD.

      Global Burden of Short Deletions

      We observed enrichment of short deletions (<100 kb) in patients with MDD, and particularly intergenic deletions. This suggests that the effect on MDD risk is due to the deletion of regulatory elements, consistent with the (modest) enrichment of high-confidence enhancer regions in short deletions in patients with MDD. This is consistent with the extensive analyses of the Psychiatric Genomics Consortium’s meta-analysis of depression GWAS data (
      • Wray N.R.
      • Ripke S.
      • Mattheisen M.
      • Trzaskowski M.
      • Byrne E.M.
      • Abdellaoui A.
      • et al.
      Genome-wide association analyses identify 44 risk variants and refine the genetic architecture of major depression.
      ) that detected 44 significant associations primarily in nonexonic SNPs, including several in genes that are involved with extensive regulatory networks (RBFOX1, RBFOX2, RBFOX3, and CELF4), as well as genome-wide enrichment of highly conserved regions, open chromatin in human brain and an epigenetic mark of active enhancers (H3K4me1).
      One might expect an increased burden of longer CNVs as well, because they contain more genes and regulatory elements. We analyzed short and long deletions separately because longer CNVs have been more frequently implicated in disease risk. Similar ORs were observed for burden of short and of long deletions in patients with MDD, and their confidence intervals overlapped, but we had less power to detect an excess of long deletions because they were 70% less frequent than short deletions. Thus, an increased burden of longer deletions might be observed in larger meta-analyses. We also suspect that the ascertainment methods of most MDD studies are biased against individuals with long multigenic CNVs, whose carriers are at higher risk of disorders such as schizophrenia, autism, and intellectual disability. Individuals with these phenotypes have an increased risk of depression (
      • Upthegrove R.
      • Marwaha S.
      • Birchwood M.
      Depression and schizophrenia: Cause, consequence, or trans-diagnostic issue?.
      ,
      • Baudewijns L.
      • Ronsse E.
      • Verstraete V.
      • Sabbe B.
      • Morrens M.
      • Bertelli M.O.
      Problem behaviours and major depressive disorder in adults with intellectual disability and autism.
      ), but they are often excluded from MDD cohorts and are often not specifically diagnosed with, or treated for, depression—resulting in exclusion even from registry-based cohorts. Thus, both short and long rare deletions could impact the risk of MDD, but the current results are significant only for shorter deletions (10–100 kb), and larger cohorts will be needed to resolve the issue.

      Individual Genes and Regions

      No significant or suggestive associations were detected for individual exonic CNVs or for CNV regions, after conservative correction for genome-wide testing. Larger datasets will be needed to identify true positive findings. Nominal association was observed in several regions (p < .01 but not achieving suggestive or significant thresholds). The first region was 15q11.2 duplications encompassing the small, nonimprinted BP1/BP2 segment of the Prader-Willi/Angelman region. Deletions of this segment are weakly associated with risk of schizophrenia (
      • Marshall C.R.
      • Howrigan D.P.
      • Merico D.
      • Thiruvahindrapuram B.
      • Wu W.
      • Greer D.S.
      • et al.
      Contribution of copy number variants to schizophrenia from a genome-wide study of 41,321 subjects.
      ,
      • Rees E.
      • Walters J.T.
      • Georgieva L.
      • Isles A.R.
      • Chambert K.D.
      • Richards A.L.
      • et al.
      Analysis of copy number variations at 15 schizophrenia-associated loci.
      ) and have been reported to be associated with dyslexia and dyscalculia (with deletions and duplications associated with reductions or increases, respectively, in the size and activity of the left fusiform gyrus) (
      • Ulfarsson M.O.
      • Walters G.B.
      • Gustafsson O.
      • Steinberg S.
      • Silva A.
      • Doyle O.M.
      • et al.
      15q11.2 CNV affects cognitive, structural and functional correlates of dyslexia and dyscalculia.
      ). Second, deletions in exons of MSR1 (or all deletions in that region) have been implicated in atherosclerosis, Alzheimer’s disease, and host defense. Third, deletions in 6q26 impact introns or exons of PRKN, where recessive mutations cause early-onset Parkinson’s disease (type 2), but heterozygous variants are not associated with Parkinson’s disease (
      • Hattori N.
      • Mizuno Y.
      Twenty years since the discovery of the parkin gene.
      ), although Parkinson’s disease is associated with increased depressive symptoms (
      • Larsen J.P.
      • Dalen I.
      • Pedersen K.F.
      • Tysnes O.B.
      The natural history of depressive symptoms in patients with incident Parkinson’s disease: A prospective cohort study.
      ). The final region included duplications in exons of, or upstream sequence near, ATG5, which has multiple immune functions, including negative regulation of the type I interferon production pathway—this is of note because reduced white blood cell expression of interferon I response genes was reported (
      • Mostafavi S.
      • Battle A.
      • Zhu X.
      • Potash J.B.
      • Weissman M.M.
      • Shi J.
      • et al.
      Type I interferon signaling genes in recurrent major depression: Increased expression detected by whole-blood RNA sequencing.
      ) but not replicated (
      • Jansen R.
      • Penninx B.W.
      • Madar V.
      • Xia K.
      • Milaneschi Y.
      • Hottenga J.J.
      • et al.
      Gene expression in major depressive disorder.
      ) in studies of MDD.

      Limitations

      The sample size is larger than previous CNV studies of MDD but remains underpowered. Combining CNV cohorts presents challenges including differences in clinical methods (inclusion criteria, ascertainment, and assessments) and genotyping (platforms that differ in genome coverage and signal to noise ratio). Also, the present cohorts are not ideal for testing whether the long, multigenic “neuropsychiatric” CNVs are also predisposing for depression: the psychiatric and neurological features of these CNVs may be considered exclusion criteria from MDD studies; and the associated cognitive impairments reduce the probability of being recruited into MDD cohorts because individual carriers are less likely to volunteer or to be treated in the targeted clinical settings. On the other hand, the cohorts are broadly representative of the current concept of clinically significant MDD.
      In conclusion, we found significant evidence for an increased global burden of shorter rare deletions that was mainly driven by intergenic deletions in patients with MDD from four cohorts. The evidence regarding longer deletions was inconclusive: They were not significantly increased in patients with MDD, but the confidence intervals overlapped with the case-control ORs for shorter and longer deletions, and there was less power to detect a difference because longer deletions are less frequent. Overall, the results suggest that the effects of CNVs on regulatory elements, primarily in intergenic regions, play a role in predisposition to MDD.

      Acknowledgments and Disclosures

      GenRED and GenRED II: These projects were supported by National Institute of Mental Health (NIMH) R01 Grants MH061686 (to DFL), MH059542 (to WH Coryell), MH075131 (to WB Lawson), MH059552 (to JBP), MH059541 (to WA Scheftner) and MH060912 (to MMW). The NIMH Cell Repository at Rutgers University and the NIMH Center for Collaborative Genetic Studies on Mental Disorders made essential contributions to this project. Genotyping was carried out by the Broad Institute Center for Genotyping and Analysis with support from Grant U54 RR020278 (which partially subsidized the genotyping of the GenRED cases). Collection and quality control analyses of the control data set were supported by grants from NIMH and the National Alliance for Research on Schizophrenia and Depression.
      For the Molecular Genetics of Schizophrenia (MGS) control cohort from which GenRED-I controls were drawn: This work was supported primarily by the National Institutes of Health (Grant Nos. R01MH067257 [to NG Buccola], R01MH 059588 [to BJ Mowry], R01MH059571 [to PVG], R01MH059565 to [R Freedman], R01MH059587 [to F Amin], R01MH060870 [to WF Byerley], R01M H059566 [to DW Black], R01MH059586 [to JM Silverman], R01MH061675 [to DFL], R01MH060879 to [CR Cloninger], R01MH081800 [to PVG], U01MH046276 to [CR Cloninger], U01MH046289 [to C Kaufmann], U01MH046318 [to MT Tsuang], U01MH079469 to [PVG], and U01MH079470 to [DFL]), the Genetic Association Information Network (GAIN, for genotyping of half of the EA sample), and The Paul Michael Donovan Charitable Foundation. Genotyping was carried out by the Center for Genotyping and Analysis at the Broad Institute of Harvard and MIT (to S Gabriel and DB Mirel), supported by NIH Grant No. U54RR020278. We are grateful to Knowledge Networks (Menlo Park, CA) for assistance in collecting the control data set.
      NESDA/NTR: The Netherlands Study of Depression and Anxiety (NESDA) and the Netherlands Twin Register (NTR) contributed to GAIN-MDD and to MDD2000. Funding was from: the Netherlands Organization for Scientific Research (MagW/ZonMW Grants 904-61-090, 985-10002, 904-61-193, 480-04-004, 400-05-717, 912-100-20; Spinozapremie 56-464-14192; Geestkracht program Grant 10-000-1002); the Center for Medical Systems Biology (NWO Genomics), Biobanking and Biomolecular Resources Research Infrastructure, VU University’s Institutes for Health and Care Research and Neuroscience Campus Amsterdam, NBIC/BioAssist/ RK (2008.024); the European Science Foundation (EU/QLRT-2001-01254); the European Community’s Seventh Framework Program (FP7/2007-2013); ENGAGE (HEALTH-F4-2007-201413); and the European Science Council (ERC, 230374). Genotyping was funded in part by the Genetic Association Information Network (GAIN) of the Foundation for the US National Institutes of Health, and analysis was supported by grants from GAIN and the NIMH (MH081802). CM Middeldorp was supported by the Netherlands Organization for Scientific Research (NOW-VENI grant 916-76-125).
      RADIANT: This work was supported by a joint grant from the United Kingdom Medical Research Council and GlaxoSmithKline (Grant No. G0701420) and the National Institute for Health Research (NIHR) Biomedical Research Centre for Mental Health at South London and Maudsley National Health Service (NHS) Foundation Trust and Institute of Psychiatry, King’s College London. This work presents independent research in part funded by the NIHR. The views expressed are those of the authors and not necessarily those of the NHS, the NIHR, or the Department of Health. This work was also supported by the Wellcome Trust Grant No. 086635 (JJHR); NIHR Specialist Biomedical Research Centre for Mental Health at the South London and Maudsley NHS Foundation Trust and the Institute of Psychiatry, King’s College London (SC-W); a Marie Curie Intra-European Fellowship within the 7th European Community Framework Programme; European Commission Grant Agreement No. 115008); and Canada Research Chairs program (http://www.chairs-chaires.gc.ca/). The Genome Based Therapeutic Drugs for Depression study was funded by a European Commission Framework 6 grant, European Commission Contract Reference LSHB-CT-2 003-503428, and GlaxoSmithKline. Genotyping was performed at the Centre Nationale De Genotypage, Evry, Paris. We acknowledge the contribution of phase 2 of the Wellcome Trust Case Control Consortium in providing access to control datasets from the 1958 British birth cohort and the National Blood Service cohort.
      We thank Stephan Sanders from the University of California at San Francisco for his assistance using CNVision. We also thank the individuals who participated in these projects and to the many clinicians who facilitated or contributed to them.
      Availability of Data and Biomaterials: Biomaterials and clinical data are available from the NIMH repository (https://nimhgenetics.org) for the GenRED cases (the GenRED1 cohort includes the family-based linkage cohort and part of the subsequent case collection; the GenRED2 cohort includes the remainder of the case collection); for the MGS controls; and for Genomic Psychiatry Cohort controls, including the Mayo Clinic controls.
      The authors report no biomedical financial interests or potential conflicts of interest.

      Supplementary Material

      References

        • Kessler R.C.
        • Bromet E.J.
        The epidemiology of depression across cultures.
        Annu Rev Public Health. 2013; 34: 119-138
        • GBD 2015 Disease and Injury Incidence and Prevalence Collaborators
        Global, regional, and national incidence, prevalence, and years lived with disability for 310 diseases and injuries, 1990-2015: A systematic analysis for the Global Burden of Disease Study 2015.
        Lancet. 2016; 388: 1545-1602
        • Sullivan P.F.
        • Daly M.J.
        • O’Donovan M.
        Genetic architectures of psychiatric disorders: The emerging picture and its implications.
        Nat Rev Genet. 2012; 13: 537-551
        • Lee S.H.
        • Ripke S.
        • Neale B.M.
        • Faraone S.V.
        • Purcell S.M.
        • Perlis R.H.
        • et al.
        Genetic relationship between five psychiatric disorders estimated from genome-wide SNPs.
        Nat Genet. 2013; 45: 984-994
        • Hyde C.L.
        • Nagle M.W.
        • Tian C.
        • Chen X.
        • Paciga S.A.
        • Wendland J.R.
        • et al.
        Identification of 15 genetic loci associated with risk of major depression in individuals of European descent.
        Nat Genet. 2016; 48: 1031-1036
        • Wray N.R.
        • Ripke S.
        • Mattheisen M.
        • Trzaskowski M.
        • Byrne E.M.
        • Abdellaoui A.
        • et al.
        Genome-wide association analyses identify 44 risk variants and refine the genetic architecture of major depression.
        Nat Genet. 2018; 50: 668-681
        • Rucker J.J.
        • Tansey K.E.
        • Rivera M.
        • Pinto D.
        • Cohen-Woods S.
        • Uher R.
        • et al.
        Phenotypic association analyses with copy number variation in recurrent depressive disorder.
        Biol Psychiatry. 2016; 79: 329-336
        • O’Dushlaine C.
        • Ripke S.
        • Ruderfer D.M.
        • Hamilton S.P.
        • Fava M.
        • Iosifescu D.V.
        • et al.
        Rare copy number variation in treatment-resistant major depressive disorder.
        Biol Psychiatry. 2014; 76: 536-541
        • Wing J.K.
        • Babor T.
        • Brugha T.
        • Burke J.
        • Cooper J.E.
        • Giel R.
        • et al.
        SCAN. Schedules for Clinical Assessment in Neuropsychiatry.
        Arch Gen Psychiatry. 1990; 47: 589-593
        • Willemsen G.
        • Vink J.M.
        • Abdellaoui A.
        • den Braber A.
        • van Beek J.H.
        • Draisma H.H.
        • et al.
        The Adult Netherlands Twin Register: Twenty-five years of survey and biological data collection.
        Twin Res Hum Genet. 2013; 16: 271-281
        • Penninx B.W.
        • Beekman A.T.
        • Smit J.H.
        • Zitman F.G.
        • Nolen W.A.
        • Spinhoven P.
        • et al.
        The Netherlands Study of Depression and Anxiety (NESDA): Rationale, objectives and methods.
        Int J Methods Psychiatr Res. 2008; 17: 121-140
        • Boomsma D.I.
        • Willemsen G.
        • Sullivan P.F.
        • Heutink P.
        • Meijer P.
        • Sondervan D.
        • et al.
        Genome-wide association of major depression: Description of samples for the GAIN Major Depressive Disorder Study: NTR and NESDA biobank projects.
        Eur J Hum Genet. 2008; 16: 335-342
        • Levinson D.F.
        • Zubenko G.S.
        • Crowe R.R.
        • DePaulo R.J.
        • Scheftner W.S.
        • Weissman M.M.
        • et al.
        Genetics of recurrent early-onset depression (GenRED): Design and preliminary clinical characteristics of a repository sample for genetic linkage studies.
        Am J Med Genet B Neuropsychiatr Genet. 2003; 119B: 118-130
        • Shi J.
        • Potash J.B.
        • Knowles J.A.
        • Weissman M.M.
        • Coryell W.
        • Scheftner W.A.
        • et al.
        Genome-wide association study of recurrent early-onset major depressive disorder.
        Mol Psychiatry. 2011; 16: 193-201
        • Sanders A.R.
        • Duan J.
        • Levinson D.F.
        • Shi J.
        • He D.
        • Hou C.
        • et al.
        No significant association of 14 candidate genes with schizophrenia in a large European ancestry sample: Implications for psychiatric genetics.
        Am J Psychiatry. 2008; 165: 497-506
        • Sanders A.R.
        • Levinson D.F.
        • Duan J.
        • Dennis J.M.
        • Li R.
        • Kendler K.S.
        • et al.
        The Internet-based MGS2 control sample: Self report of mental illness.
        Am J Psychiatry. 2010; 167: 854-865
        • Pato M.T.
        • Sobell J.L.
        • Medeiros H.
        • Abbott C.
        • Sklar B.M.
        • Buckley P.F.
        • et al.
        The genomic psychiatry cohort: Partners in discovery.
        Am J Med Genet B Neuropsychiatr Genet. 2013; 162B: 306-312
        • Mostafavi S.
        • Battle A.
        • Zhu X.
        • Potash J.B.
        • Weissman M.M.
        • Shi J.
        • et al.
        Type I interferon signaling genes in recurrent major depression: Increased expression detected by whole-blood RNA sequencing.
        Mol Psychiatry. 2014; 19: 1267-1274
        • Sobell J.L.
        • Heston L.L.
        • Sommer S.S.
        Novel association approach for determining the genetic predisposition to schizophrenia: Case-control resource and testing of a candidate gene.
        Am J Med Genet. 1993; 48: 28-35
        • Abdellaoui A.
        • Ehli E.A.
        • Hottenga J.J.
        • Weber Z.
        • Mbarek H.
        • Willemsen G.
        • et al.
        CNV concordance in 1,097 MZ twin pairs.
        Twin Res Hum Genet. 2015; 18: 1-12
        • Ripke S.
        • Wray N.R.
        • Lewis C.M.
        • Hamilton S.P.
        • Weissman M.M.
        • Breen G.
        • et al.
        A mega-analysis of genome-wide association studies for major depressive disorder.
        Mol Psychiatry. 2013; 18: 497-511
        • Wang K.
        • Li M.
        • Hadley D.
        • Liu R.
        • Glessner J.
        • Grant S.F.
        • et al.
        PennCNV: An integrated hidden Markov model designed for high-resolution copy number variation detection in whole-genome SNP genotyping data.
        Genome Res. 2007; 17: 1665-1674
        • Colella S.
        • Yau C.
        • Taylor J.M.
        • Mirza G.
        • Butler H.
        • Clouston P.
        • et al.
        QuantiSNP: An objective Bayes hidden-Markov model to detect and accurately map copy number variation using SNP genotyping data.
        Nucleic Acids Res. 2007; 35: 2013-2025
        • Pinto D.
        • Pagnamenta A.T.
        • Klei L.
        • Anney R.
        • Merico D.
        • Regan R.
        • et al.
        Functional impact of global rare copy number variation in autism spectrum disorders.
        Nature. 2010; 466: 368-372
        • Korn J.M.
        • Kuruvilla F.G.
        • McCarroll S.A.
        • Wysoker A.
        • Nemesh J.
        • Cawley S.
        • et al.
        Integrated genotype calling and association analysis of SNPs, common copy number polymorphisms and rare CNVs.
        Nat Genet. 2008; 40: 1253-1260
        • Szatkiewicz J.P.
        • O’Dushlaine C.
        • Chen G.
        • Chambert K.
        • Moran J.L.
        • Neale B.M.
        • et al.
        Copy number variation in schizophrenia in Sweden.
        Mol Psychiatry. 2014; 19: 762-773
        • Buizer-Voskamp J.E.
        • Muntjewerff J.W.
        • Strengman E.
        • Sabatti C.
        • Stefansson H.
        • Vorstman J.A.
        • et al.
        Genome-wide analysis shows increased frequency of copy number variation deletions in Dutch schizophrenia patients.
        Biol Psychiatry. 2011; 70: 655-662
        • Sanders S.J.
        • Ercan-Sencicek A.G.
        • Hus V.
        • Luo R.
        • Murtha M.T.
        • Moreno-De-Luca D.
        • et al.
        Multiple recurrent de novo CNVs, including duplications of the 7q11.23 Williams syndrome region, are strongly associated with autism.
        Neuron. 2011; 70: 863-885
        • Marshall C.R.
        • Howrigan D.P.
        • Merico D.
        • Thiruvahindrapuram B.
        • Wu W.
        • Greer D.S.
        • et al.
        Contribution of copy number variants to schizophrenia from a genome-wide study of 41,321 subjects.
        Nat Genet. 2017; 49: 27-35
        • Levinson D.F.
        • Duan J.
        • Oh S.
        • Wang K.
        • Sanders A.R.
        • Shi J.
        • et al.
        Copy number variants in schizophrenia: Confirmation of five previous findings and new evidence for 3q29 microdeletions and VIPR2 duplications.
        Am J Psychiatry. 2011; 168: 302-316
        • Coe B.P.
        • Witherspoon K.
        • Rosenfeld J.A.
        • van Bon B.W.
        • Vulto-van Silfhout A.T.
        • Bosco P.
        • et al.
        Refining analyses of copy number variation identifies specific genes associated with developmental delay.
        Nat Genet. 2014; 46: 1063-1071
        • Cooper G.M.
        • Coe B.P.
        • Girirajan S.
        • Rosenfeld J.A.
        • Vu T.H.
        • Baker C.
        • et al.
        A copy number variation morbidity map of developmental delay.
        Nat Genet. 2011; 43: 838-846
        • Abecasis G.R.
        • Auton A.
        • Brooks L.D.
        • DePristo M.A.
        • Durbin R.M.
        • Handsaker R.E.
        • et al.
        An integrated map of genetic variation from 1,092 human genomes.
        Nature. 2012; 491: 56-65
        • Mills R.E.
        • Walter K.
        • Stewart C.
        • Handsaker R.E.
        • Chen K.
        • Alkan C.
        • et al.
        Mapping copy number variation by population-scale genome sequencing.
        Nature. 2011; 470: 59-65
        • Schwarzer G.
        • Carpenter J.R.
        • Rücker G.
        Meta-Analysis with R.
        Springer International Publishing, Basel, Switzerland2015
        • Lai T.L.
        • Xing H.
        • Zhang N.
        Stochastic segmentation models for array-based comparative genomic hybridization data analysis.
        Biostatistics. 2008; 9: 290-307
        • Rees E.
        • Walters J.T.
        • Georgieva L.
        • Isles A.R.
        • Chambert K.D.
        • Richards A.L.
        • et al.
        Analysis of copy number variations at 15 schizophrenia-associated loci.
        Br J Psychiatry. 2014; 204: 108-114
        • Raychaudhuri S.
        • Korn J.M.
        • McCarroll S.A.
        • Altshuler D.
        • Sklar P.
        • Purcell S.
        • et al.
        Accurately assessing the risk of schizophrenia conferred by rare copy-number variation affecting genes with brain function.
        PLoS Genet. 2010; 6: e1001097
        • Schizophrenia Working Group of the Psychiatric Genomics Consortium
        Biological insights from 108 schizophrenia-associated genetic loci.
        Nature. 2014; 511: 421-427
        • ENCODE Project Consortium
        An integrated encyclopedia of DNA elements in the human genome.
        Nature. 2012; 489: 57-74
        • Kundaje A.
        • Meuleman W.
        • Ernst J.
        • Bilenky M.
        • Yen A.
        • Heravi-Moussavi A.
        • et al.
        Integrative analysis of 111 reference human epigenomes.
        Nature. 2015; 518: 317-330
        • Howard D.M.
        • Adams M.J.
        • Clarke T.K.
        • Hafferty J.D.
        • Gibson J.
        • Shirali M.
        • et al.
        Genome-wide meta-analysis of depression identifies 102 independent variants and highlights the importance of the prefrontal brain regions.
        Nat Neurosci. 2019; 22: 343-352
        • Rucker J.J.
        • Breen G.
        • Pinto D.
        • Pedroso I.
        • Lewis C.M.
        • Cohen-Woods S.
        • et al.
        Genome-wide association analysis of copy number variation in recurrent depressive disorder.
        Mol Psychiatry. 2013; 18: 183-189
        • Stefansson H.
        • Rujescu D.
        • Cichon S.
        • Pietiläinen O.P.
        • Ingason A.
        • Steinberg S.
        • et al.
        Large recurrent microdeletions associated with schizophrenia.
        Nature. 2008; 455: 232-236
        • Upthegrove R.
        • Marwaha S.
        • Birchwood M.
        Depression and schizophrenia: Cause, consequence, or trans-diagnostic issue?.
        Schizophr Bull. 2017; 43: 240-244
        • Baudewijns L.
        • Ronsse E.
        • Verstraete V.
        • Sabbe B.
        • Morrens M.
        • Bertelli M.O.
        Problem behaviours and major depressive disorder in adults with intellectual disability and autism.
        Psychiatry Res. 2018; 270: 769-774
        • Ulfarsson M.O.
        • Walters G.B.
        • Gustafsson O.
        • Steinberg S.
        • Silva A.
        • Doyle O.M.
        • et al.
        15q11.2 CNV affects cognitive, structural and functional correlates of dyslexia and dyscalculia.
        Transl Psychiatry. 2017; 7: e1109
        • Hattori N.
        • Mizuno Y.
        Twenty years since the discovery of the parkin gene.
        J Neural Transm (Vienna). 2017; 124: 1037-1054
        • Larsen J.P.
        • Dalen I.
        • Pedersen K.F.
        • Tysnes O.B.
        The natural history of depressive symptoms in patients with incident Parkinson’s disease: A prospective cohort study.
        J Neurol. 2017; 264: 2401-2408
        • Jansen R.
        • Penninx B.W.
        • Madar V.
        • Xia K.
        • Milaneschi Y.
        • Hottenga J.J.
        • et al.
        Gene expression in major depressive disorder.
        Mol Psychiatry. 2016; 21: 339-347