boundaries nih public access neurodevelopmental loci that confer risk across diagnostic ·...

24
Sequencing chromosomal abnormalities reveals neurodevelopmental loci that confer risk across diagnostic boundaries Michael E. Talkowski 1,2,3 , Jill A. Rosenfeld 4 , Ian Blumenthal 1 , Vamsee Pillalamarri 1 , Colby Chiang 1 , Adrian Heilbut 1 , Carl Ernst 1 , Carrie Hanscom 1 , Elizabeth Rossin 1,3,5 , Amelia Lindgren 6 , Shahrin Pereira 6 , Douglas Ruderfer 1,3 , Andrew Kirby 1,3,5 , Stephan Ripke 1,3,5 , David Harris 7 , Ji-Hyun Lee 1 , Kyungsoo Ha 8 , Hyung-Goo Kim 9 , Benjamin D. Solomon 10 , Andrea L. Gropman 11,12 , Diane Lucente 1 , Katherine Sims 1 , Toshiro K. Ohsumi 13 , Mark L. Borowsky 13 , Stephanie Loranger 14 , Bradley Quade 15 , Kasper Lage 3,5,16,17,18 , Judith Miles 19 , Bai-Lin Wu 15,20,21 , Yiping Shen 1,15,20,22 , Benjamin Neale 1,3,5 , Lisa G. Shaffer 4 , Mark J. Daly 1,3,5,14 , Cynthia C. Morton 3,6,15 , and James F. Gusella 1,3,14,23 1 Center for Human Genetic Research, Massachusetts General Hospital, Boston, MA 2 Department of Neurology, Harvard Medical School, Boston, MA 3 Program in Medical and Population Genetics, Broad Institute, Cambridge, MA 4 Signature Genomic Laboratories, PerkinElmer, Inc., Spokane, WA 5 Analytical and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 6 Departments of Obstetrics, Gynecology and Reproductive Biology, Brigham and Women’s Hospital, Boston, MA 7 Division of Clinical Genetics, Children’s Hospital of Boston, Boston, MA 8 Cancer Research Center, Georgia Health Sciences University, Augusta, GA 9 Department of OB/GYN, IMMAG, Georgia Health Sciences University, Augusta, GA 10 Medical Genetics Branch, National Human Genome Research Institute, Bethesda, MD, USA 11 Department of Neurology, Children’s National Medical Center, Washington, DC, USA 12 Department of Neurology, George Washington University of Health Sciences, Washington, DC, USA 13 Department of Molecular Biology, Massachusetts General Hospital, Boston, MA 14 Autism Consortium of Boston, Boston, MA 15 Department of Pathology, Massachusetts General Hospital, Boston, MA 16 Pediatric Surgical Research Laboratories, MassGeneral Hospital for Children, Massachusetts General Hospital, Boston, MA, USA © 2012 Elsevier Inc. All rights reserved. Correspondence James F. Gusella, Ph.D., Director, Center for Human Genetic Research, Massachusetts General Hospital, Bullard Professor of Neurogenetics, Department of Genetics, Harvard Medical School, Richard B. Simches Research Center, 185 Cambridge Street, Boston, MA 02114, [email protected], Tel: 617.726.5724, Fax: 617.726.5735. Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain. NIH Public Access Author Manuscript Cell. Author manuscript; available in PMC 2013 April 27. Published in final edited form as: Cell. 2012 April 27; 149(3): 525–537. doi:10.1016/j.cell.2012.03.028. NIH-PA Author Manuscript NIH-PA Author Manuscript NIH-PA Author Manuscript

Upload: others

Post on 08-May-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: boundaries NIH Public Access neurodevelopmental loci that confer risk across diagnostic · 2018-02-26 · inability to survey submicroscopic balanced alterations. Thus, BCAs clearly

Sequencing chromosomal abnormalities revealsneurodevelopmental loci that confer risk across diagnosticboundaries

Michael E. Talkowski1,2,3, Jill A. Rosenfeld4, Ian Blumenthal1, Vamsee Pillalamarri1, ColbyChiang1, Adrian Heilbut1, Carl Ernst1, Carrie Hanscom1, Elizabeth Rossin1,3,5, AmeliaLindgren6, Shahrin Pereira6, Douglas Ruderfer1,3, Andrew Kirby1,3,5, Stephan Ripke1,3,5,David Harris7, Ji-Hyun Lee1, Kyungsoo Ha8, Hyung-Goo Kim9, Benjamin D. Solomon10,Andrea L. Gropman11,12, Diane Lucente1, Katherine Sims1, Toshiro K. Ohsumi13, Mark L.Borowsky13, Stephanie Loranger14, Bradley Quade15, Kasper Lage3,5,16,17,18, JudithMiles19, Bai-Lin Wu15,20,21, Yiping Shen1,15,20,22, Benjamin Neale1,3,5, Lisa G. Shaffer4, MarkJ. Daly1,3,5,14, Cynthia C. Morton3,6,15, and James F. Gusella1,3,14,23

1Center for Human Genetic Research, Massachusetts General Hospital, Boston, MA2Department of Neurology, Harvard Medical School, Boston, MA3Program in Medical and Population Genetics, Broad Institute, Cambridge, MA4Signature Genomic Laboratories, PerkinElmer, Inc., Spokane, WA5Analytical and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA6Departments of Obstetrics, Gynecology and Reproductive Biology, Brigham and Women’sHospital, Boston, MA7Division of Clinical Genetics, Children’s Hospital of Boston, Boston, MA8Cancer Research Center, Georgia Health Sciences University, Augusta, GA9Department of OB/GYN, IMMAG, Georgia Health Sciences University, Augusta, GA10Medical Genetics Branch, National Human Genome Research Institute, Bethesda, MD, USA11Department of Neurology, Children’s National Medical Center, Washington, DC, USA12Department of Neurology, George Washington University of Health Sciences, Washington, DC,USA13Department of Molecular Biology, Massachusetts General Hospital, Boston, MA14Autism Consortium of Boston, Boston, MA15Department of Pathology, Massachusetts General Hospital, Boston, MA16Pediatric Surgical Research Laboratories, MassGeneral Hospital for Children, MassachusettsGeneral Hospital, Boston, MA, USA

© 2012 Elsevier Inc. All rights reserved.

Correspondence James F. Gusella, Ph.D., Director, Center for Human Genetic Research, Massachusetts General Hospital, BullardProfessor of Neurogenetics, Department of Genetics, Harvard Medical School, Richard B. Simches Research Center, 185 CambridgeStreet, Boston, MA 02114, [email protected], Tel: 617.726.5724, Fax: 617.726.5735.

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to ourcustomers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review ofthe resulting proof before it is published in its final citable form. Please note that during the production process errors may bediscovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

NIH Public AccessAuthor ManuscriptCell. Author manuscript; available in PMC 2013 April 27.

Published in final edited form as:Cell. 2012 April 27; 149(3): 525–537. doi:10.1016/j.cell.2012.03.028.

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

Page 2: boundaries NIH Public Access neurodevelopmental loci that confer risk across diagnostic · 2018-02-26 · inability to survey submicroscopic balanced alterations. Thus, BCAs clearly

17Center for Biological Sequence Analysis, Department of Systems Biology, Technical Universityof Denmark, Lyngby, Denmark18Center for Protein Research, University of Copenhagen, Copenhagen, Denmark19Departments of Pediatrics, Medical Genetics & Pathology, The Thompson Center for Autism &Neurodevelopmental Disorders, University of Missouri Hospitals and Clinics, Columbia, MO20Department of Laboratory Medicine, Children’s Hospital Boston, Boston, MA21Children’s Hospital and Institutes of Biomedical Science, Fudan University, Shanghai, China22Shanghai Children’s Medical Center, Shanghai Jiaotong University School of Medicine,Shanghai, China23Department of Genetics, Harvard Medical School, Boston, MA

SUMMARYBalanced chromosomal abnormalities (BCAs) represent a reservoir of single gene disruptions inneurodevelopmental disorders (NDD). We sequenced BCAs in autism and related NDDs,revealing disruption of 33 loci in four general categories: 1) genes associated with abnormalneurodevelopment (e.g., AUTS2, FOXP1, CDKL5), 2) single gene contributors to microdeletionsyndromes (MBD5, SATB2, EHMT1, SNURF-SNRPN), 3) novel risk loci (e.g., CHD8,KIRREL3, ZNF507), and 4) genes associated with later onset psychiatric disorders (e.g., TCF4,ZNF804A, PDE10A, GRIN2B, ANK3). We also discovered profoundly increased burden of copynumber variants among 19,556 neurodevelopmental cases compared to 13,991 controls (p =2.07×10−47) and enrichment of polygenic risk alleles from autism and schizophrenia genome-wideassociation studies (p = 0.0018 and 0.0009, respectively). Our findings suggest a polygenic riskmodel of autism incorporating loci of strong effect and indicate that some neurodevelopmentalgenes are sensitive to perturbation by multiple mutational mechanisms, leading to variablephenotypic outcomes that manifest at different life stages.

INTRODUCTIONMounting evidence indicates that genomic structural variants (SV) collectively play asubstantial role in susceptibility to autism spectrum disorders (ASD) and otherneurodevelopmental disorders (NDD). However, the near ubiquitous use of chromosomalmicroarrays in research and clinical diagnostics has limited the assessment of those classesof SV that do not involve large gains and losses of genetic material, such as inversions,excision/insertions, and translocations, which together constitute balanced chromosomalabnormalities (BCAs). These events are typically defined clinically at karyotypic resolution,implicating only broad chromosomal regions, without the gene-level sequence specificitythat would permit informative interpretation. Even in the research setting, there is often afailure to consider or to assess BCAs, consequently bypassing a meaningful proportion ofsubjects with genetic events that may mark a single locus of potentially large effect size.These balanced events offer a unique route, complementary to conventional approaches, foridentifying individual genes or functional sequences that contribute to otherwise geneticallycomplex human disorders. At cytogenetic resolution, the estimated frequency of BCAs inASD is 1.3% (Marshall et al., 2008), an approximately six-fold increase over that observedin more than 10,000 reproductively normal controls (Ravel et al., 2006). This ratio is almostcertainly a lower bound for relative risk, given the resolution of available techniques and theinability to survey submicroscopic balanced alterations. Thus, BCAs clearly have ameaningful impact in ASD and represent a fertile area for high resolution study to identifyfunctional sequences that contribute to human neurodevelopment.

Talkowski et al. Page 2

Cell. Author manuscript; available in PMC 2013 April 27.

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

Page 3: boundaries NIH Public Access neurodevelopmental loci that confer risk across diagnostic · 2018-02-26 · inability to survey submicroscopic balanced alterations. Thus, BCAs clearly

We previously described innovations in the molecular approach to massively parallelsequencing and tailored bioinformatics applicable to the rapid, high resolution discovery ofchromosomal rearrangement breakpoints (Talkowski et al., 2011a). These and conceptuallysimilar methods were recently used to sequence B lymphocytes to derive potentialmechanisms of rearrangement (Chiarle et al., 2011; Klein et al., 2011), as they have beenpreviously to delineate complex chromosomal rearrangements and chromothripsis in cancercells (Stephens et al., 2011), as well as balanced chromothripsis and predominant non-homologous repair in the human germline and transgenic animals (Chiang et al., 2012).Here, we precisely characterize to nucleotide resolution karyotypically-defined humanconstitutional chromosomal rearrangements: 36 de novo BCAs and two inheritedrearrangements that were transmitted from an affected parent. The results of our sequencinganalyses, coupled with extensive secondary orthogonal genomics support, indicate thatdisruption of genes from a wide range of biological pathways can contribute to ASD, butthat in many instances the same genes also confer risk, sometimes via different mutationalmechanisms, to other NDDs or to a range of psychiatric disorders in both children andadults.

RESULTSIdentification of genes contributing to neurodevelopmental abnormalities

Using a series of previously developed next-generation sequencing techniques ranging fromhigh-depth whole-genome sequencing to a targeted capture of breakpoints approach(Talkowski et al., 2011a), we delineated BCAs in 38 subjects with neurodevelopmentalabnormalities, including two monozygotic twin pairs (36 independent probands). Allharbored a BCA that appeared balanced at karyotypic resolution and was interpreted aspathogenic by the clinical geneticist; 36 aberrations arose de novo while two alterationswere transmitted from an affected parent and thus segregated with the phenotype. Extensiveclinical data were collected for all subjects and affected parents, as described in theSupplemental Information (SI). If a structured diagnostic interview was performed, or apatient was formally diagnosed with autism or an ASD by the referring clinician usingDSM-IV criteria, they are referred to herein as ASD (50% of subjects); otherwise thedisorder was classified as NDD, although many such subjects also displayed clinical featuresconsistent with ASD. Complete clinical information for each subject and each BCAbreakpoint, confirmed by PCR and capillary sequencing, are provided in SI. Previouslyperformed genetic testing was also obtained and all results were unremarkable with 244,000or one million feature aCGH unless otherwise described in the SI.

This BCA sequencing approach uncovered genes that conformed to four generalclassifications, with meaningful overlap between categories: (1) genes implicated previouslyin ASD or NDD and confirmed here by their heterozygous inactivation, (2) genesdiscovered to be single locus contributors to previously recognized microdeletionsyndromes, (3) completely novel genes not previously implicated individually in ASD orNDD, and (4) genes that had been associated with adolescent and adult onset psychiatricdisorders by common variant genome-wide association studies (GWAS) or otherapproaches, all but three of which represent novel ASD or NDD loci. Alterations in geneexpression were also assessed for all subjects where a lymphoblastoid cell line could beobtained (33 of 38 subjects; Fig. S1). If a gene was not directly disrupted, positional effectson expression were evaluated for genes in proximity to the breakpoint. All genes andsequences disrupted by BCAs are presented in Table S1, along with results of mRNAexpression studies, while the subset of genes supported by secondary analyses is presentedbelow and in Table 1.

Talkowski et al. Page 3

Cell. Author manuscript; available in PMC 2013 April 27.

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

Page 4: boundaries NIH Public Access neurodevelopmental loci that confer risk across diagnostic · 2018-02-26 · inability to survey submicroscopic balanced alterations. Thus, BCAs clearly

Convergent data from molecular diagnosticsAlthough the presence of a BCA is conservatively associated with an ~6 fold increased riskof ASD, any individual BCA is rare and generally non-recurrent, precluding assessment offalse discovery by replication; consequently, we sought secondary support, by analysis ofcopy number variants (CNVs), for the hypothesis that specific locus hemizygositycontributes to genetic risk of neurodevelopmental abnormalities. We curated and analyzed alarge collection of 33,573 cases from molecular diagnostic facilities and identified thatsubset of cases referred for a variety of neurodevelopmental abnormalities (n = 19,556), aswell as that comprising cases without a referral indication of NDD (n = 14,017). We alsosurveyed CNV data from 13,991 controls screened for absence of a reported developmentalor psychiatric phenotype. Although almost 25% of the 19,556 cases withneurodevelopmental abnormalities were referred for testing with an indication of autism orASD, we did not have direct access to clinical records to confirm this diagnosis so weconservatively describe these cases using the broader term NDD (see SI). Our findingsindicate that genes disrupted by BCAs in ASD individuals frequently show increased CNVburden across the diagnostic cases referred for ASD as well as those referred for otherneurodevelopmental abnormalities, suggesting that the genes discovered can contribute tophenotypic outcomes that are not constrained by ASD diagnostic criteria.

An example of the convergent genomic information collected in this study is depicted inFigure 1, where sequencing of monozygotic twins with extensive clinical informationrevealed a translocation disrupting TCF4, a gene previously implicated in both Pitt-Hopkinssyndrome and schizophrenia, with concomitantly decreased mRNA expression in bothsubjects and a significant impact of CNV burden across the clinical diagnostic samples (14case CNVs, 0 control CNVs; p = 0.0006). Additional follow-up analyses revealed asignificant GWAS signal from common SNPs at TCF4 in both autism and schizophrenia(see below).

Across all loci disrupted by BCAs, a dramatic increase in overall CNV burden was observedin cases compared to controls (CNV burden across 33 loci: p = 2.07×10−47, OR = 5.12, 95%CI = 3.92–6.79) and this result remained robust to subset analyses and one million randomsimulations to assess empirical significance (see SI for all CNV analyses and results).Comparison of the NDD cases to the 14,017 diagnostic cases referred for a primaryindication other than NDD and analyzed on identical platforms also showed an increasedCNV burden across these genes (p = 1.8×10−5), a result that again exceeded the significanceof 1 million random simulations despite the previously established enrichment of largeCNVs in the latter cases, which we replicate here. Notably, restricting CNV analyses to onlythose genes disrupted by a BCA in an individual with a confirmed diagnosis of ASD wasalso highly significant (p = 2.76×10−28; Table 1). Individually, increased CNV burden wasnominally significant for 14 of the genes in this study, with three non-significant trends (p <0.10), while five additional genes were disrupted by CNVs in three or more independentcases but never altered in controls. For the latter, the rarity of dosage alteration in cases andabsence of alterations in controls limit statistical power but are consistent with a strongdeleterious effect. In each category discussed further below, those genes supported bysecondary CNVs are briefly mentioned and presented in Table 1, while the full list of genesdisrupted by BCAs is delineated in Table S1 and discussed in the SI.

Category 1. Balanced alterations confirming loci previously implicated inneurodevelopment

The power of sequencing BCAs as a discovery tool for loci contributing to disease etiologyis illustrated in our independent identification of several genes previously suggested ascandidates in ASD or NDD. We found direct disruption of AUTS2 and CDKL5, two

Talkowski et al. Page 4

Cell. Author manuscript; available in PMC 2013 April 27.

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

Page 5: boundaries NIH Public Access neurodevelopmental loci that confer risk across diagnostic · 2018-02-26 · inability to survey submicroscopic balanced alterations. Thus, BCAs clearly

established neurodevelopmental loci (Bakkaloglu et al., 2008; Sultana et al., 2002). We alsofound disruption of genes implicated during completion of this study by de novo mutationsfrom an exome-sequencing study of 20 ASD families (the fork-head transcription factorFOXP1 and the glutamate receptor GRIN2B) (O’Roak et al., 2011), or by mutations in Pitt-Hopkins syndrome (TCF4). Other genes have been implicated by CNV analysis from anautism cohort (the transcription factor SOX5 and the dystrophin regulator SNTG2)(Rosenfeld et al., 2010). Consistent with their status as previously recognized contributors toneurodevelopment, their collective CNV burden was significant (p = 7.74×10−20; OR = 3.6)(Table 2).

Category 2. Individual genes contributing to microdeletion syndromesMicrodeletion syndromes, which are recognized contributors to neurodevelopmental andpsychiatric disorders, typically involve hemizygosity of large genomic regions where thedifficulty of defining specific genes responsible for core phenotypes has been an obstacle forclinical genetics, predictive diagnostics, and the study of disease pathogenesis. Here, BCAsequencing in three ASD subjects pinpointed three individual gene contributors tomicrodeletion syndromes, each of which is involved in transcriptional and/or epigeneticregulation, in the 2q23.1, 2q33.1, and 9q34.3 microdeletion syndrome regions, respectively.In 2q23.1, a translocation disrupted MBD5, a member of the methyl-CpG binding domainprotein family defined by a highly conserved methyl binding domain and including MeCP2,a causal locus in Rett syndrome. We recently completed an international consortium follow-up study that found 65 structural variations spanning the 2q23.1 microdeletion region incases with syndromic features including ASD, seizures, and intellectual disability (includinganother translocation that disrupted MBD5), establishing MBD5 as a necessary, sufficient,and predictive locus contributing to a majority of the phenotypic features of the 2q23.1microdeletion syndrome (Talkowski et al., 2011b). In the 2q33.1 and 9q34.3 regions, werespectively identified disruption of SATB2, a gene involved in transcriptional regulationand chromatin remodeling (Rosenfeld et al., 2009a), and of EHMT1, encoding a histonemethyltransferase (Kleefstra et al., 2006). In a fourth region (15q11-13), the nested genesSNURF-SNRPN were disrupted at 15q11.2 in a subject with ASD and multipledevelopmental abnormalities, including sensory integration disorder, but without Angelmanor Prader-Willi syndromes, both of which result from imprinting within the region. Our dataargue for increased resolution and interpretation from molecular diagnostic testing ofregional disorders, particularly for loci in which individual gene disruptions can yieldphenotypes that are similar to or indistinguishable from that defining the syndrome. Giventheir localization to syndromic regions, it was not surprising that CNV analysis of genes inthis category reflected a strong impact in neurodevelopment (p = 1.64×10−26; OR = 10.2)(Table 2, Fig. 3).

Category 3. Novel genes conferring risk to autism and neurodevelopmentBCA sequencing yielded 22 novel ASD/NDD candidate genes that, like the genes inCategories 1 and 2, are also likely to contribute to the neurodevelopmental phenotype in thecorresponding subjects. The collective increase in CNV burden for these novel candidateswas highly significant (p = 2.21×10−15, OR = 4.1) (Table 2).

The most significant individual genes were two loci localized to previously identifiedgenomic disorder regions (KIRREL3, SMG6) and one encoding a novel DNA helicase(CHD8). Similar to the Category 2 genes, we observed dysregulation by BCA and secondaryCNV support for novel genes in regions associated with classic terminal deletion disorders:KIRREL3 in 11q24.1 (Jacobsen syndrome) and SMG6 in 17p13.3 (Miller-Diekersyndrome). KIRREL3 encodes a cell adhesion molecule of the immunoglobulin familyexpressed in developing and adult brain of mouse and developing sensory pathways

Talkowski et al. Page 5

Cell. Author manuscript; available in PMC 2013 April 27.

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

Page 6: boundaries NIH Public Access neurodevelopmental loci that confer risk across diagnostic · 2018-02-26 · inability to survey submicroscopic balanced alterations. Thus, BCAs clearly

(Morikawa et al., 2007; Serizawa et al., 2006; Tamura et al., 2005). The locus was disruptedby a BCA 39.6 kb upstream of the mRNA coding region that altered both mRNA andprotein levels (Table 1 and Fig. S2). Disruption of SMG6 nominates nonsense mediateddecay as another novel pathway in ASD and NDD, but not necessarily in the lissencephalyphenotype commonly seen in Miller-Dieker syndrome (see SI and Fig. S3). Like many ofthe genes described above that are involved in transcriptional regulation, epigeneticmodification, and methylation patterning, CHD8 encodes a DNA helicase that remodelschromatin structure (Thompson et al., 2008). It has never been individually linked to ahuman disorder but was disrupted by a BCA in a subject diagnosed with ASD, wassupported by our CNV analyses, was among the loci within the minimal region of overlapfrom previous analyses of de novo microdeletions, is an important transcriptional repressor,and interacts with genes involved in NDD such as CHD7, a causal locus in CHARGEsyndrome, thus representing a strong autism and NDD candidate locus (Batsukh et al., 2010;Nishiyama et al., 2009; Rodriguez-Paredes et al., 2009).

Additionally, in seven subjects no annotated gene was disrupted, but several expressedsequence tags (ESTs), conserved sequences, and regions with predicted regulatory effectswere impacted by breakpoints (Table S1), suggesting that BCAs may also provide a novelentrée into such regions. In these subjects, we considered both loci in proximity to thebreakpoint for positional effects (e.g., KIRREL3), as well as disrupted but functionallyunannotated sequences themselves, such as the highly conserved 6q16.3 sequence ofunknown function disrupted by a BCA in an ASD subject (denoted as “High Cons” in TableS1) and the noncoding RNA LOC401324.

Category 4. Neurodevelopmental loci associated with risk that crosses traditionaldiagnostic boundaries

A remarkable number of genes implicated in ASD or NDD by single gene disruption fromBCAs in our study have also been recently associated with a spectrum of developmental,psychiatric, and behavioral phenotypes by other strategies, such as GWAS and mutationscreening, including TCF4 (Pitt-Hopkins syndrome, intellectual disability, schizophrenia)(Amiel et al., 2007; Blake et al., 2010; Rosenfeld et al., 2009b), GRIN2B (schizophrenia,bipolar disorder, neurodevelopment) (Endele et al., 2010), EHMT1 (schizophrenia) (Kirovet al., 2012) and four novel neurodevelopmental genes that overlap with Category 3:ZNF804A (schizophrenia, psychosis, cognitive function) (O’Donovan et al., 2008; Walterset al., 2010), ANK3 (bipolar disorder, schizophrenia) (Ferreira et al., 2008; Williams et al.,2011), C18orf1 (schizophrenia) (Meerabux et al., 2009) and PDE10A (schizophrenia)(Kehler, 2011). All loci with the exception of ANK3 are supported by secondary CNVanalyses (Table 1). Of these novel ASD or NDD candidates, only PDE10A has anestablished function, encoding a phosphodiesterase suggested as a biological candidate inschizophrenia due to its high tissue specific expression in the caudate nucleus. SpecificPDE10A inhibitors provide a potential therapeutic approach to schizophrenia due to theirregulation of cAMP and cGMP, thereby altering dopamine D1 and D2 receptor activity(Kehler, 2011; Lakics et al., 2010).

Genes in this category are apparently capable of contributing to pleiotropic effects rangingfrom early onset autism and intellectual disability to adult onset psychosis, often throughdifferent mutational mechanisms. Several were previously associated with psychiatricdisorders by unbiased GWAS and/or by candidate gene studies of common variants, whichare thought to reflect a more subtle effect on gene regulation than the outright inactivationcaused by the BCA disruption. Perhaps the most compelling example of different mutationalmechanisms is TCF4, where rare mutations are recognized as causing NDD, sometimes witha diagnosis of Pitt-Hopkins syndrome, but common variation has recently emerged as asignificant risk factor for schizophrenia. Taken together, our findings support the long-

Talkowski et al. Page 6

Cell. Author manuscript; available in PMC 2013 April 27.

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

Page 7: boundaries NIH Public Access neurodevelopmental loci that confer risk across diagnostic · 2018-02-26 · inability to survey submicroscopic balanced alterations. Thus, BCAs clearly

hypothesized notion of a neurodevelopmental component to adult onset neuropsychiatricdisorders like schizophrenia (Murray and Lewis, 1987; Owen et al., 2011; Weinberger,1986). However, our data extend this hypothesis to suggest that differing mutational impacton the same sets of genes constitutes a significant overlap in the genetic etiology of autism,schizophrenia, psychosis, bipolar disorder, and intellectual disability, comprising at least asubset of the total genetic variance for each of these disorders. The collective CNV burdenfor these genes decisively supports the fundamental hypothesis that some psychiatricdisease-associated genes are important in neurodevelopment (p = 5.1×10−15; OR = 6.7)(Table 2).

Polygenic risk from genome-wide association studiesOur identification of genes disrupted by BCA in ASD or NDD with an increased CNVburden among diagnostic cases with neurodevelopmental abnormalities suggests that theseare relatively penetrant alterations in human development, consistent with oligogenic riskfactors of modest to large effect. However, the discovery of the genes in Category 4 suggeststhat for some loci, an accumulation of subtle genetic effects associated with commonpolymorphisms could have a pleiotropic impact across a spectrum of early childhood andadult onset psychiatric disorders. To test this hypothesis, we performed gene-set enrichmentanalyses in autism and schizophrenia GWAS datasets (Ripke et al., 2011; Wang et al., 2009;Weiss et al., 2009) using an established method in which each linkage disequilibrium blockacross the genome is scored with the maximum Z-score achieved in the block (Rossin et al.,2011). Analysis of an initial autism study revealed a highly significant enrichment of riskalleles across the BCA disrupted gene-set (empirical p = 0.0018), a result that persisted inthe second autism GWAS study (p = 0.068). Moreover, we discovered a significantenrichment of associated alleles from the largest schizophrenia GWAS meta-analysis to date(empirical p = 0.0009). Struck by these results, we evaluated the potential for anyunforeseen confounding variables by performing identical enrichment analyses inphenotype-permuted datasets from the schizophrenia and autism studies meta-analysis (p =0.444 and p = 0.518, respectively), in a well-powered GWAS study of Crohn’s disease (p =0.819) (Franke et al., 2010), and from GWAS data for seven other unrelated traits (p-valuesranged from p = 0.06 to p = 0.917, fitting nicely to the expected null distribution). Thesedata indicate an unusually strong enrichment of subtle effects from common polygenic riskloci in autism and schizophrenia among the genes identified by our BCA sequencing, andfurther support the hypothesis that diverse mutational mechanisms at these loci can conferpleiotropic effects across conventional diagnostic classifications (Fig. 2).

Genes sensitive to dosage dysregulationThe finding that some genes associated with ASD or NDD due to inactivation of one allelemay also contribute to abnormal phenotypes when more subtly disrupted suggests that someASD or NDD genes require tight control of their expression for appropriateneurodevelopmental function. In such circumstances, disruption by increased dosage mightalso produce an NDD phenotype. This is highlighted in more detailed examination of theCNV analyses presented in Figures 1, 3, and 4 which indicate that for some genes, the CNVdata predict that both deletion and duplication are risk factors for abnormalneurodevelopment, whereas for other loci the mechanism of disruption appears to be dosagespecific. For example, previously established NDD risk loci (TCF4, Fig. 1; SATB2, MBD5,Fig. 3) almost exclusively display deletion among the CNV cases, clearly supporting asimilar mechanism of dysregulation to that seen for the BCA disruption. This was also truefor novel NDD candidates such as PDE10A and KIRREL3. However, in Figure 4a are threeinstances, including two well-established loci (AUTS2, CDKL5), for which CNV analysissupported genetic risk from both deletion and duplication, approaching a near 50:50 balance.Interestingly, Figure 4b shows a group of genes where the CNV support is primarily from

Talkowski et al. Page 7

Cell. Author manuscript; available in PMC 2013 April 27.

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

Page 8: boundaries NIH Public Access neurodevelopmental loci that confer risk across diagnostic · 2018-02-26 · inability to survey submicroscopic balanced alterations. Thus, BCAs clearly

duplication at the locus (CHD8, GRIN2B, FOXP1), suggesting similar phenotypic outcomesfrom both dosage increase and heterozygous inactivation by BCA disruption, as in thisstudy, or de novo mutations in a previous study (for GRIN2B and FOXP1) (O’Roak et al.,2011). These data build upon previous findings in recurrent rearrangement regions, such thecommon recurrent 16p11.2 microdeletion/microduplication (Weiss et al., 2008), where bothdeletion and duplication increase risk for autism to different degrees and have disparateimpact (including reciprocal phenotypes) for other disorders and physiological traits. Theclear distinction between neurodevelopmental loci associated primarily with deletion orduplication, and those displaying similarly abnormal neurodevelopment from either eventemphasizes the need for detailed experimental annotation of the genome with respect todosage sensitive loci and phenotypic prediction.

Disrupted genes do not converge on a single pathway in neurodevelopmentThe genomics approach in this study enabled direct interpretation of locus specificity forfurther downstream analyses. We evaluated the networks in which these genes mayparticipate and whether any biological pathways emerged as significantly enriched for genesdisrupted by these BCAs. A qualitative network analysis based on interactions from PubMedabstracts using a natural language processing algorithm identified a network of 429interacting genes (Fig. 5 and Fig. S4). Fourteen of the original genes, many of which areinvolved in transcriptional regulation, were found to interact indirectly in a large,interconnected network, and TCF4, SNRPN, CHD8, and GTF2F1 were confirmed asinteracting partners of the RNA Polymerase II complex. Analysis of GO terms foundenrichment of transcription factors, phosphoproteins, and protein heterodimerization activity(p < 0.005). A quantitative network assessment did not find statistically significant first orsecond-order interactions compared to chance expectations (given the size and compositionof the given networks). The nominally significant networks (statistical enrichment ofpathways at p < 0.05) included shared interactions between SNTG2, UTRN, GNA14, andCDKRAP2 (p = 0.01) as well as KCND2 and GRIN2B (p = 0.03) (Fig. S4). The SNURF-SNRPN complex was nominally significant in both analyses and in an assessment ofphysical interactions (SI). No results were significant after correction for multiple testing.There was also no convergence on one or a few pathways involved in neurodevelopment,likely a reflection of the modest number of genes studied and the unknown function of manyof the novel genes. These results could also signify that interaction of a diverse range offunctional networks at many levels is critical to normal human neurodevelopment and thatthere is ample opportunity for genetic lesions to disrupt different functional pathways whilestill leading to similar neurodevelopmental outcomes.

DISCUSSIONDirect sequencing of BCA breakpoints, followed by targeted assessment of moleculardiagnostic CNV findings in independent subjects, proved to be an efficient strategy forindividual gene discovery in abnormal neurodevelopment. As this approach begins withsequence resolution of individual BCAs, it is not subject to notable limitations of other denovo mutation studies that rely purely on CNVs or on exome sequencing (i.e., primarilyidentification of large multi-gene regions in the former and failure to assess most non-codingsequence in the latter) and therefore provides an effective complement to these approaches.The yield of this study is considerable, with 22 novel loci being disrupted by BCAs andsupported by secondary analyses either by a statistical enrichment of CNV burden ordisruption in multiple cases and absence of dosage alterations in controls, with severaladditional genes and sequences whose potential for contribution merits further examination.These findings not only extend specific knowledge of neurodevelopmental genes implicatedby microdeletion syndromes, GWA, CNV, and exome sequencing studies, but also reveal

Talkowski et al. Page 8

Cell. Author manuscript; available in PMC 2013 April 27.

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

Page 9: boundaries NIH Public Access neurodevelopmental loci that confer risk across diagnostic · 2018-02-26 · inability to survey submicroscopic balanced alterations. Thus, BCAs clearly

entirely new, unsuspected genes in ASD and NDD. This work also presents direct evidencefor a complex genetic architecture that connects neurodevelopmental and adult onsetpsychiatric disorders by implicating robustly associated schizophrenia loci as contributors toneurodevelopmental abnormalities.

Our analyses were based upon the premise that alone, a de novo BCA disruption in a singlecase, CNVs spanning a given region, or the presence of a de novo mutation within thecoding region of an interesting biological candidate does not by itself represent compellingevidence that a gene contributes to neurodevelopment. Instead, we sought convergentgenomic evidence combining disruption by a de novo BCA (or in two subjects a BCA thatco-segregated with the phenotype) with association by CNV or mutation in independent andsimilarly affected cases, followed by comparison with risk loci from GWAS studies. Thestrength of evidence varied between loci, as shown in Table 1 and Table S2. In someinstances, genes were statistically supported by secondary CNV analyses, in others, thestatistical support was inconclusive, and in several the CNV data do not nominate the locusas an NDD candidate (e.g. ZBTB20; see Table 1, Table S2, and Table S3). Additionally, likeany genetic study, we cannot discount a potential contribution from altered expression ofother genes at a distance from the site of the disruption. While such an effect could resultfrom the transposition of functional elements, the comprehensive testing of this possibility isproblematic, since it could involve dysregulation that is tissue-specific, that occurs due tothe potentially altered nuclear organization of the rearranged chromosomes, or that is asecondary physiological consequence of the primary gene disruption. Nonetheless,secondary CNV analyses from independent NDD cases indicate a profound collectivecontribution of the disrupted genes on neurodevelopment. Simulations and subset analysesto evaluate empirical significance established that the increased CNV burden of the locidisrupted by BCAs was robust, unusually specific to neurodevelopmental disorderscompared to other phenotypic presentations referred for molecular diagnostic testing, andwas not driven by any single gene, individual disruption category, cluster of symptoms, ordiscrete diagnostic category. Rather, it was an accumulation of risk factors from all fourcategories that collectively contributed to the significant burden observed (see SI). For somegenes, developmental abnormalities were predominantly associated with dosage alteration inonly one direction while for others the increased CNV burden involved both deletions andduplications. These data illustrate the immature state of annotation of the human genomewith respect to dosage sensitivity and to prediction of phenotypic outcomes from geneticlesions, a limitation that may be alleviated at least in part by the growing capacity tosequence BCAs in a relatively high-throughput manner.

A surprising number of genes previously associated with adolescent or adult onsetpsychiatric disorders were disrupted in children with autism and NDD in this study. Theconcept of schizophrenia as a neurodevelopmental disorder has long been proposed (Murrayand Lewis, 1987; Weinberger, 1986), and a growing consensus in the recent literaturesuggests that there are shared risk factors across what are viewed clinically as distinctphenotypic classifications, although few of these have been described at the individual genelevel (Owen et al., 2011). Our study supports a shared genetic etiology for at least a portionof the phenotypic spectrum of schizophrenia, autism, and the neurodevelopmentalabnormalities studied here. We find unambiguous gene disruption by BCAs in ASD andNDD subjects, coupled with an increased CNV burden and a substantial over-representationof polygenic risk compared to null expectations from schizophrenia GWAS. Individualgenes may thus show a differential risk depending on the nature of the genetic lesion, withheterozygous inactivation from BCA, CNV, or deactivating point mutation being a relativelypenetrant contributor to ASD or NDD while subtle effects from common variants contributeto later onset disorders. However, contrary to this simple hypothesis, we were surprised tofind persistent enrichment of these genes also among common variant associations from

Talkowski et al. Page 9

Cell. Author manuscript; available in PMC 2013 April 27.

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

Page 10: boundaries NIH Public Access neurodevelopmental loci that confer risk across diagnostic · 2018-02-26 · inability to survey submicroscopic balanced alterations. Thus, BCAs clearly

autism GWAS studies, suggesting that even subtle perturbation of genes important in normalhuman development can contribute to abnormal outcomes across the lifespan, presumably ininteraction with other genetic and environmental influences.

This initial sequence-based delineation of a large collection of subjects harboringchromosomal aberrations in autism and related neurodevelopmental disorders establishes anapproach that can be exploited for efficient discovery of individual genetic factorscontributing to otherwise complex disorders. Each individual gene revealed provides a new,specific hypothesis concerning the disease to be tested with further genetic and biologicalstudy. If supported, each then represents a foundation for investigations into the role that thebiochemical activity and regulation of its product play in pathogenesis and into the potentialfor treatment through their manipulation. Ultimately, such data will also provide invaluableannotation of the human genome and profoundly impact the clinical interpretation ofgenomic events in subjects referred to diagnostic laboratories for autism and otherdevelopmental abnormalities.

EXPERIMENTAL PROCEDURESPatients

Subjects were obtained from the Developmental Genome Anatomy Project (DGAP)(Higgins et al., 2008), the Autism Consortium of Boston, the Center for Human GeneticResearch (CHGR) Neurodevelopmental Repository, and the Autism Genome ResourceExchange (AGRE). These studies were approved by the Institutional Review Board ofPartners HealthCare System. Clinical information was obtained by direct questionnaires,medical records, or structured clinical interviews (see SI).

SequencingSequencing was performed on the Illumina platform (Illumina Inc). Libraries were createdby four different methods optimized for delineating BCAs, including 1) Illumina standardinsert paired-end sequencing, 2) Illumina mate-pair sequencing (long 2000–4000 bp inserts),3) our customized jumping libraries (long 3000–4500 bp inserts), and 4) our capture ofbreakpoints method (CapBP) for rearrangements previously localized (see SI with completeprotocols in (Talkowski et al., 2011a)). Interestingly, some rearrangements proved to be farmore complex than suspected by karyotyping, and in at least two subjects this complexityrepresented balanced germline chromothripsis similar to but distinct from previouslydescribed events in cancer cells (DGAP127 and DGAP203; see Table S4 and (Chiang et al.,2012)). In complex rearrangements where more than two genes were disrupted by BCAbreakpoints, we conservatively excluded these cases from interpretation in our secondaryCNV/GWAS analyses.

BioinformaticsSequencing reads were aligned using publicly available alignment programs and customscripts, followed by processing of BAM files using a C++ program, Bamstat, to tabulatemapping statistics and output lists of anomalous read-pairs (defined as having ends that mapto two different chromosomes, having an abnormal insert size, and/or unexpected strandorientations) (Talkowski et al., 2011a). Anomalous pairs were clustered by their mappedmates with a program that performs a single-linkage clustering of paired reads ifcorresponding ends map within a specified distance. All junction fragments predicted frompaired-end sequencing were PCR amplified from genomic DNA, independent of thelibraries and sequencing reads, and all breakpoints presented in this study were confirmedby capillary sequencing (Table S1 and also Table S5).

Talkowski et al. Page 10

Cell. Author manuscript; available in PMC 2013 April 27.

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

Page 11: boundaries NIH Public Access neurodevelopmental loci that confer risk across diagnostic · 2018-02-26 · inability to survey submicroscopic balanced alterations. Thus, BCAs clearly

Evaluation of CNVs in independent subjectsComplete details on the CNV data, statistical analyses, and simulations to determineempirical significance are fully described in SI. We compiled CNV data from 33,573 casesfrom molecular diagnostic laboratories of Signature Genomic Laboratories (SG) andChildren’s Hospital Boston (CHB) analyzed by oligonucleotide aCGH. A subset of the SGdata was included in a recent CNV study (Cooper et al., 2011). We fully annotated bothdatasets and collapsed all subjects with an indication for study (but without additionalclinical information to verify a specific diagnosis) of autism, ASD, or a relatedneurodevelopmental abnormality into a combined NDD cohort (19,556 cases) and those forwhich the clinical indication did not indicate an NDD (14,017 cases). Control data wereobtained across multiple published resources as described in the SI (n = 13,991 independentcontrols). Only 6,239 controls were analyzed for CNVs on the X chromosome, and thisreduced comparison group was used for analyses of CDKL5. The resolution between theclinical aCGH and higher resolution control SNP microarrays varied extensively. Toovercome this disparity but retain the native resolution of each individual array platform, weanalyzed all CNVs that disrupted any documented coding or non-coding exon from alltranscripts available from multiple database sources (Table S1). Notably, post-hoc analysesreveal that a size filter of 100 kb, or the resolution of the most sparse control arrays, wouldhave resulted in an almost identical CNV burden test to our exon disruption model (p =1.18×10−47), but would have omitted high confidence CNV calls in controls that could pointtowards reduced penetrance or non-significant loci in this study such as ZBTB20. Empiricalsignificance was established by simulations performed in MATLAB (Mathworks Inc.) usingcustom scripts with all methods and findings described in the SI. In brief, one million gene-sets of 33 loci were randomly selected from the genome, and case-control CNV burden testswere performed for each random gene-set. These same analyses were performed for NDDcases compared to all other cases in the molecular diagnostic cohort. Neither simulationdetected a random gene-set exceeding the significance of the experimental gene-set. Anotherset of experiments performed subset analyses, randomly selecting up to 1,000 gene-sets ofeach possible number of k genes from the experimental gene-set of 33 loci, ranging from k =5 to k = 33, finding a minimum of 90.5% of all subsets at a level as low as k = 5 exceededmultiple testing correction and 100% of the 1,000 gene-sets to be significant at k > 13 (seeSI).

Genome-wide association studiesWe performed gene-set enrichment analysis from GWAS data using published methods(Rossin et al., 2011) on a recent schizophrenia meta-analysis (Ripke et al., 2011) and twoautism genome-wide association studies (Weiss et al., 2009) (Wang et al., 2009). Briefly,linkage disequilibrium blocks across the genome are scored with the maximum Z-scoreachieved in the block. That score is corrected for the number of tests across the block usinglinear regression and the residuals are then used as the new, corrected score for each block.Genes nominate scores based upon the unique set of blocks they overlap, and the nominatedscores are compared to background scores from all genes in the genome using a one-tailedrank-sum test.

Network analysisA qualitative assessment of interacting genes and complexes for each of the genes in whichfunctional information was available (little data existed for some of the novel genes) wasinitially performed using Natural Language Processing (NLP) from published abstracts (seeSI). Secondary analyses of GO and KEGG enrichments were then performed followed byquantitative network building of first- and second-order interactions of the set of proteinscoded by the genes disrupted by BCAs. The significance of these networks was determinedby permutation testing (SI).

Talkowski et al. Page 11

Cell. Author manuscript; available in PMC 2013 April 27.

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

Page 12: boundaries NIH Public Access neurodevelopmental loci that confer risk across diagnostic · 2018-02-26 · inability to survey submicroscopic balanced alterations. Thus, BCAs clearly

Supplementary MaterialRefer to Web version on PubMed Central for supplementary material.

AcknowledgmentsWe are grateful to all participating subjects and families, and to the many healthcare professionals who havecontributed to this study, including Mary-Alice Abbott, Darius J. Adams, Kwame Anyane-Yeboa, Stephen G.Bamforth, Tina Bartell, David P. Bick, Joann N. Bodurtha, Carol Clericuzio, Stephanie Cohen, Kristin Dalton,Maria Descartes, Joanne Milisa Drautz, Dawn L. Earl, Luis F. Escobar, Shannon Gerner, Edwin Guzman, KennethHandelman, Tim Heshka, Robert J. Hopkin, Micheil Innes, Debby Lambert, Emmanuelle Lemyre, Cynthia Lim,Livija Medne, Graciela Moya, Katie Rutledge, Wendy Smith, Mark Stephan, Darci Sternen, Katie Stoll, Paulienvan Galen, Nancy J. Van Vranken, Erica Wahl, Susan E. Wiley, Amy L White, Anne Woods, and Elaine H. Zackai.The invaluable control data for this study were provided by Pamela Sklar, Shaun Purcell, the InternationalSchizophrenia Consortium, the Wellcome Trust Case Control Consortium, Evan Eichler, Bradley Coe, and GregCooper. We thank Tammy Gillis, Mary Anne Anderson, Jayla Ruliera, and Thon de Boer for technical assistance.We also thank Dennis Gurgul, Nilay Roy, and Brent Richter of Partners Research Computing at MassachusettsGeneral Hospital, and contributing staff from Signature Genomic Laboratories and Children’s Hospital Boston.This work was funded by grants GM061354 and HD065286 from the NIH, the Simons Foundation AutismResearch Initiative, and Autism Speaks. This research was also supported by the Division of Intramural Research,National Human Genome Research Institute (NHGRI), National Institutes of Health and Human Services, UnitedStates of America. M. Talkowski was supported by an NIMH National Research Service Award (MH087123) andan MGH ECOR Fund for Medical Discovery Award.

ReferencesAmiel J, Rio M, de Pontual L, Redon R, Malan V, Boddaert N, Plouin P, Carter NP, Lyonnet S,

Munnich A, et al. Mutations in TCF4, encoding a class I basic helix-loop-helix transcription factor,are responsible for Pitt-Hopkins syndrome, a severe epileptic encephalopathy associated withautonomic dysfunction. Am J Hum Genet. 2007; 80:988–993. Epub 2007 Mar 2023. [PubMed:17436254]

Bakkaloglu B, O’Roak BJ, Louvi A, Gupta AR, Abelson JF, Morgan TM, Chawarska K, Klin A,Ercan-Sencicek AG, Stillman AA, et al. Molecular cytogenetic analysis and resequencing ofcontactin associated protein-like 2 in autism spectrum disorders. Am J Hum Genet. 2008; 82:165–173. [PubMed: 18179895]

Batsukh T, Pieper L, Koszucka AM, von Velsen N, Hoyer-Fender S, Elbracht M, Bergman JE,Hoefsloot LH, Pauli S. CHD8 interacts with CHD7, a protein which is mutated in CHARGEsyndrome. Human molecular genetics. 2010; 19:2858–2866. [PubMed: 20453063]

Blake DJ, Forrest M, Chapman RM, Tinsley CL, O’Donovan MC, Owen MJ. TCF4, schizophrenia,and Pitt-Hopkins Syndrome. Schizophr. 2010; 36:443–447.

Chiang C, Jacobsen JC, Ernst C, Hanscom C, Heilbut A, Blumenthal I, Mills RE, Kirby A, LindgrenAM, Rudiger SR, et al. Complex reorganization and predominant non-homologous repair followingchromosomal breakage in karyotypically balanced germline rearrangements and transgenicintegration. Nature Genetics. 2012

Chiarle R, Zhang Y, Frock RL, Lewis SM, Molinie B, Ho YJ, Myers DR, Choi VW, Compagno M,Malkin DJ, et al. Genome-wide Translocation Sequencing Reveals Mechanisms of ChromosomeBreaks and Rearrangements in B Cells. Cell. 2011; 147:107–119. [PubMed: 21962511]

Cooper GM, Coe BP, Girirajan S, Rosenfeld JA, Vu TH, Baker C, Williams C, Stalker H, Hamid R,Hannig V, et al. A copy number variation morbidity map of developmental delay. Nature genetics.2011; 43:838–846. [PubMed: 21841781]

Endele S, Rosenberger G, Geider K, Popp B, Tamer C, Stefanova I, Milh M, Kortum F, Fritsch A,Pientka FK, et al. Mutations in GRIN2A and GRIN2B encoding regulatory subunits of NMDAreceptors cause variable neurodevelopmental phenotypes. Nat Genet. 2010; 42:1021–1026.[PubMed: 20890276]

Ferreira MA, O’Donovan MC, Meng YA, Jones IR, Ruderfer DM, Jones L, Fan J, Kirov G, Perlis RH,Green EK, et al. Collaborative genome-wide association analysis supports a role for ANK3 andCACNA1C in bipolar disorder. Nature genetics. 2008; 40:1056–1058. [PubMed: 18711365]

Talkowski et al. Page 12

Cell. Author manuscript; available in PMC 2013 April 27.

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

Page 13: boundaries NIH Public Access neurodevelopmental loci that confer risk across diagnostic · 2018-02-26 · inability to survey submicroscopic balanced alterations. Thus, BCAs clearly

Franke A, McGovern DP, Barrett JC, Wang K, Radford-Smith GL, Ahmad T, Lees CW, Balschun T,Lee J, Roberts R, et al. Genome-wide meta-analysis increases to 71 the number of confirmedCrohn’s disease susceptibility loci. Nature Genetics. 2010; 42:1118–1125. [PubMed: 21102463]

Higgins AW, Alkuraya FS, Bosco AF, Brown KK, Bruns GA, Donovan DJ, Eisenman R, Fan Y, FarraCG, Ferguson HL, et al. Characterization of apparently balanced chromosomal rearrangementsfrom the developmental genome anatomy project. Am J Hum Genet. 2008; 82:712–722. [PubMed:18319076]

Kehler J. PDE10A Inhibitors: Novel Therapeutic Drugs for Schizophrenia. Curr Pharm Des. 2011;17:137–150. [PubMed: 21355834]

Kirov G, Pocklington AJ, Holmans P, Ivanov D, Ikeda M, Ruderfer D, Moran J, Chambert K,Toncheva D, Georgieva L, et al. De novo CNV analysis implicates specific abnormalities ofpostsynaptic signalling complexes in the pathogenesis of schizophrenia. Molecular psychiatry.2012; 17:142–153. [PubMed: 22083728]

Kleefstra T, Brunner HG, Amiel J, Oudakker AR, Nillesen WM, Magee A, Genevieve D, Cormier-Daire V, van Esch H, Fryns JP, et al. Loss-of-function mutations in euchromatin histone methyltransferase 1 (EHMT1) cause the 9q34 subtelomeric deletion syndrome. Am J Hum Genet. 2006;79:370–377. Epub 2006 Jun 2013. [PubMed: 16826528]

Klein IA, Resch W, Jankovic M, Oliveira T, Yamane A, Nakahashi H, Di Virgilio M, Bothmer A,Nussenzweig A, Robbiani DF, et al. Translocation-capture sequencing reveals the extent andnature of chromosomal rearrangements in B lymphocytes. Cell. 2011; 147:95–106. [PubMed:21962510]

Lakics V, Karran EH, Boess FG. Quantitative comparison of phosphodiesterase mRNA distribution inhuman brain and peripheral tissues. Neuropharmacology. 2010; 59:367–374. [PubMed: 20493887]

Marshall CR, Noor A, Vincent JB, Lionel AC, Feuk L, Skaug J, Shago M, Moessner R, Pinto D, RenY, et al. Structural variation of chromosomes in autism spectrum disorder. Am J Hum Genet.2008; 82:477–488. [PubMed: 18252227]

Meerabux JM, Ohba H, Iwayama Y, Maekawa M, Detera-Wadleigh SD, DeLisi LE, Yoshikawa T.Analysis of a t(18;21)(p11.1;p11.1) translocation in a family with schizophrenia. J Hum Genet.2009; 54:386–391. [PubMed: 19461657]

Morikawa Y, Komori T, Hisaoka T, Ueno H, Kitamura T, Senba E. Expression of mKirre in thedeveloping sensory pathways: its close apposition to nephrin-expressing cells. Neuroscience.2007; 150:880–886. [PubMed: 18022324]

Murray RM, Lewis SW. Is schizophrenia a neurodevelopmental disorder? Br Med J (Clin Res Ed).1987; 295:681–682.

Nishiyama M, Oshikawa K, Tsukada Y, Nakagawa T, Iemura S, Natsume T, Fan Y, Kikuchi A,Skoultchi AI, Nakayama KI. CHD8 suppresses p53-mediated apoptosis through histone H1recruitment during early embryogenesis. Nature cell biology. 2009; 11:172–182.

O’Donovan MC, Craddock N, Norton N, Williams H, Peirce T, Moskvina V, Nikolov I, Hamshere M,Carroll L, Georgieva L, et al. Identification of loci associated with schizophrenia by genome-wideassociation and follow-up. Nature genetics. 2008; 40:1053–1055. [PubMed: 18677311]

O’Roak BJ, Deriziotis P, Lee C, Vives L, Schwartz JJ, Girirajan S, Karakoc E, Mackenzie AP, Ng SB,Baker C, et al. Exome sequencing in sporadic autism spectrum disorders identifies severe de novomutations. Nature genetics. 2011; 43:585–589. [PubMed: 21572417]

Owen MJ, O’Donovan MC, Thapar A, Craddock N. Neurodevelopmental hypothesis of schizophrenia.Br J Psychiatry. 2011; 198:173–175. [PubMed: 21357874]

Ravel C, Berthaut I, Bresson JL, Siffroi JP. Prevalence of chromosomal abnormalities inphenotypically normal and fertile adult males: large-scale survey of over 10,000 sperm donorkaryotypes. Hum Reprod. 2006; 21:1484–1489. [PubMed: 16484311]

Ripke S, Sanders AR, Kendler KS, Levinson DF, Sklar P, Holmans PA, Lin DY, Duan J, Ophoff RA,Andreassen OA, et al. Genome-wide association study identifies five new schizophrenia loci.Nature genetics. 2011; 43:969–976. [PubMed: 21926974]

Rodriguez-Paredes M, Ceballos-Chavez M, Esteller M, Garcia-Dominguez M, Reyes JC. Thechromatin remodeling factor CHD8 interacts with elongating RNA polymerase II and controlsexpression of the cyclin E2 gene. Nucleic Acids Res. 2009; 37:2449–2460. [PubMed: 19255092]

Talkowski et al. Page 13

Cell. Author manuscript; available in PMC 2013 April 27.

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

Page 14: boundaries NIH Public Access neurodevelopmental loci that confer risk across diagnostic · 2018-02-26 · inability to survey submicroscopic balanced alterations. Thus, BCAs clearly

Rosenfeld JA, Ballif BC, Lucas A, Spence EJ, Powell C, Aylsworth AS, Torchia BA, Shaffer LG.Small deletions of SATB2 cause some of the clinical features of the 2q33.1 microdeletionsyndrome. PLoS One. 2009a; 4:e6568. [PubMed: 19668335]

Rosenfeld JA, Ballif BC, Torchia BS, Sahoo T, Ravnan JB, Schultz R, Lamb A, Bejjani BA, ShafferLG. Copy number variations associated with autism spectrum disorders contribute to a spectrum ofneurodevelopmental disorders. Genetics in medicine : official journal of the American College ofMedical Genetics. 2010; 12:694–702. [PubMed: 20808228]

Rosenfeld JA, Leppig K, Ballif BC, Thiese H, Erdie-Lalena C, Bawle E, Sastry S, Spence JE,Bandholz A, Surti U, et al. Genotype-phenotype analysis of TCF4 mutations causing Pitt-Hopkinssyndrome shows increased seizure activity with missense mutations. Genet Med. 2009b; 11:797–805. [PubMed: 19938247]

Rossin EJ, Lage K, Raychaudhuri S, Xavier RJ, Tatar D, Benita Y, Cotsapas C, Daly MJ. Proteinsencoded in genomic regions associated with immune-mediated disease physically interact andsuggest underlying biology. PLoS Genet. 2011; 7:e1001273. [PubMed: 21249183]

Serizawa S, Miyamichi K, Takeuchi H, Yamagishi Y, Suzuki M, Sakano H. A neuronal identity codefor the odorant receptor-specific and activity-dependent axon sorting. Cell. 2006; 127:1057–1069.[PubMed: 17129788]

Stephens PJ, Greenman CD, Fu B, Yang F, Bignell GR, Mudie LJ, Pleasance ED, Lau KW, Beare D,Stebbings LA, et al. Massive genomic rearrangement acquired in a single catastrophic eventduring cancer development. Cell. 2011; 144:27–40. [PubMed: 21215367]

Sultana R, Yu CE, Yu J, Munson J, Chen D, Hua W, Estes A, Cortes F, de la Barra F, Yu D, et al.Identification of a novel gene on chromosome 7q11.2 interrupted by a translocation breakpoint ina pair of autistic twins. Genomics. 2002; 80:129–134. [PubMed: 12160723]

Talkowski ME, Ernst C, Heilbut A, Chiang C, Hanscom C, Lindgren A, Kirby A, Liu S,Muddukrishna B, Ohsumi TK, et al. Next-generation sequencing strategies enable routinedetection of balanced chromosome rearrangements for clinical diagnostics and genetic research.Am J Hum Genet. 2011a; 88:469–481. [PubMed: 21473983]

Talkowski ME, Mullegama SV, Rosenfeld JA, van Bon BW, Shen Y, Repnikova EA, Gastier-Foster J,Thrush DL, Kathiresan S, Ruderfer DM, et al. Assessment of 2q23.1 microdeletion syndromeimplicates MBD5 as a single causal locus of intellectual disability, epilepsy, and autism spectrumdisorder. American Journal of Human Genetics. 2011b; 89:551–563. [PubMed: 21981781]

Tamura S, Morikawa Y, Hisaoka T, Ueno H, Kitamura T, Senba E. Expression of mKirre, amammalian homolog of Drosophila kirre, in the developing and adult mouse brain. Neuroscience.2005; 133:615–624. [PubMed: 15908127]

Thompson BA, Tremblay V, Lin G, Bochar DA. CHD8 is an ATP-dependent chromatin remodelingfactor that regulates beta-catenin target genes. Molecular and cellular biology. 2008; 28:3894–3904. [PubMed: 18378692]

Walters JT, Corvin A, Owen MJ, Williams H, Dragovic M, Quinn EM, Judge R, Smith DJ, Norton N,Giegling I, et al. Psychosis susceptibility gene ZNF804A and cognitive performance inschizophrenia. Archives of general psychiatry. 2010; 67:692–700. [PubMed: 20603450]

Wang K, Zhang H, Ma D, Bucan M, Glessner JT, Abrahams BS, Salyakina D, Imielinski M, BradfieldJP, Sleiman PM, et al. Common genetic variants on 5p14.1 associate with autism spectrumdisorders. Nature. 2009; 459:528–533. [PubMed: 19404256]

Weinberger, D. The pathogenesis of schizophrenia: a neurodevelopmental theory. In: Weinberger,RND., editor. The Neurology of Schizophrenia. Elsevier; 1986. p. 387-405.

Weiss LA, Arking DE, Daly MJ, Chakravarti A. A genome-wide linkage and association scan revealsnovel loci for autism. Nature. 2009; 461:802–808. [PubMed: 19812673]

Weiss LA, Shen Y, Korn JM, Arking DE, Miller DT, Fossdal R, Saemundsen E, Stefansson H,Ferreira MA, Green T, et al. Association between microdeletion and microduplication at 16p11.2and autism. The New England journal of medicine. 2008; 358:667–675. [PubMed: 18184952]

Williams HJ, Craddock N, Russo G, Hamshere ML, Moskvina V, Dwyer S, Smith RL, Green E,Grozeva D, Holmans P, et al. Most genome-wide significant susceptibility loci for schizophreniaand bipolar disorder reported to date cross-traditional diagnostic boundaries. Hum Mol Genet.2011; 20:387–391. [PubMed: 21037240]

Talkowski et al. Page 14

Cell. Author manuscript; available in PMC 2013 April 27.

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

Page 15: boundaries NIH Public Access neurodevelopmental loci that confer risk across diagnostic · 2018-02-26 · inability to survey submicroscopic balanced alterations. Thus, BCAs clearly

Highlights

• Balanced chromosomal abnormality breakpoints harbor individual strong effectgenes

• Disrupted genes implicate epigenetic and transcriptional regulation in autism

• Dosage sensitive loci confer risk to autism from a spectrum of mutationalmechanisms

• Genes disrupted are associated with diverse childhood and adult clinicaloutcomes

Talkowski et al. Page 15

Cell. Author manuscript; available in PMC 2013 April 27.

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

Page 16: boundaries NIH Public Access neurodevelopmental loci that confer risk across diagnostic · 2018-02-26 · inability to survey submicroscopic balanced alterations. Thus, BCAs clearly

Figure 1. Convergent genomic evidence implicates TCF4 in neurodevelopment(a) A t(3;18)(q13.32;q21.2) translocation was sequenced using our custom jumping libraryprotocol in monozygotic twin boys with multiple developmental abnormalities, directlydisrupting TCF4 in intron 8, a gene previously associated with Pitt-Hopkins syndrome(PHS) by mutation analysis and with schizophrenia by GWAS. (b) Analysis of CNVsrevealed disruption of coding and noncoding exons in 14 independent NDD among 19,556NDD cases compared to no CNVs observed in 13,991 controls (p = 0.0006), implicatinghemizygosity of TCF4 as a highly penetrant locus in NDD, PHS, and ASD (see also TableS2 and Rosenfeld et al., 2009b). (c) Reduced mRNA expression of TCF4 was detected inboth cases from EBV-transformed lymphoblastoid cell lines compared to two gender-matched controls using two independent primers spanning exons 16–17 and exons 18–19.

Talkowski et al. Page 16

Cell. Author manuscript; available in PMC 2013 April 27.

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

Page 17: boundaries NIH Public Access neurodevelopmental loci that confer risk across diagnostic · 2018-02-26 · inability to survey submicroscopic balanced alterations. Thus, BCAs clearly

Figure 2. Genes disrupted by chromosomal abnormalities confer risk across diagnostic groupsAll genes disrupted by a BCA and analyzed in the CNV analyses are shown. While all genesare implicated in autism spectrum disorder (ASD) or other neurodevelopmental disorder(NDD) by BCA disruption in this study, some loci also represented single gene contributorsto previously recognized genomic disorder (GD) regions (three microdeletion syndromes,two terminal deletion syndromes, and one duplication syndrome), and genes discovered inASD or NDD in this study that had been previously linked to adolescent or adult onsetneuropsychiatric disorders (NPD) by common variation association studies. Genes in italicsare novel loci implicated for the first time in ASD or NDD by this study (Category 3), bolditalics represents novel loci not previously implicated in any ASD, NDD, or NPD.

Talkowski et al. Page 17

Cell. Author manuscript; available in PMC 2013 April 27.

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

Page 18: boundaries NIH Public Access neurodevelopmental loci that confer risk across diagnostic · 2018-02-26 · inability to survey submicroscopic balanced alterations. Thus, BCAs clearly

Figure 3. Disruption of individual genes in microdeletion syndromes regionsGene-specific view of sequenced BCAs and CNVs implicating individual genes involved intranscriptional or epigenetic regulation and localized to previously described microdeletionsyndromes, each of which implicates a single gene by the combined BCA and CNVanalyses. Haploinsufficiency of mRNA was confirmed for each gene (Figure S1). Abreakpoint disrupting a two gene locus (SNURF and SNRPN) was detected in a fourthmicrodeletion syndrome, 15q11-13 (not shown). Red bars indicate copy loss, blue bars arecopy gains, and striped bars are gene-specific alterations. Green bars represent “Other”alterations (balanced or unbalanced translocation, inversion, or complex alteration involvingboth deletion and duplication). The bar at the top provides the count of gains and losses thatextend beyond the window and is color coded to reflect the relative proportion of eachvariant type. Confidence intervals provide the distance to the next probe without a detecteddosage imbalance. A single RefSeq transcript is provided. Breakpoint sequence from each

Talkowski et al. Page 18

Cell. Author manuscript; available in PMC 2013 April 27.

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

Page 19: boundaries NIH Public Access neurodevelopmental loci that confer risk across diagnostic · 2018-02-26 · inability to survey submicroscopic balanced alterations. Thus, BCAs clearly

derivative is provided with text color coded by originating chromosome or non-templatedinserted sequence in gray. A subset of the SATB2 patients included here were previouslyreported by Rosenfeld et al. (Rosenfeld et al., 2009), and all cases reported here for MBD5,as well as 44 additional structural variants, are reported in a follow-up consortium study(Talkowski et al., 2011b).

Talkowski et al. Page 19

Cell. Author manuscript; available in PMC 2013 April 27.

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

Page 20: boundaries NIH Public Access neurodevelopmental loci that confer risk across diagnostic · 2018-02-26 · inability to survey submicroscopic balanced alterations. Thus, BCAs clearly

Figure 4. Genes sensitive to dosage dysregulationProvided is an example of genes illustrated by this study that appear to be sensitive to aspectrum of mutational mechanisms reciprocally altering their gene dosage. In Figures 1 and2 are examples of genes such as TCF4 and SATB2 which are disrupted exclusively bydeletions. (a) Genes sensitive to dosage dysregulation from which both deletion andduplication appear to confer risk for similar phenotypes (AUTS2, CDKL5, C18orf1). (b)Three loci, FOXP1, GRIN2B, and CHD8, are recognized for duplication of the locus as thepredominant mutational mechanism, yet all three loci are implicated by heterozygousinactivation from BCA disruption in this study. Two of these genes, FOXP1 and GRIN2Bare also supported by de novo mutation in ASD families (O’Roak et al., 2011). CHD8 is acompletely novel candidate not previously implicated in ASD or any human abnormality.These data suggest annotation of the human genome is insufficient to reliably predict theimpact of various mutational mechanisms of specific loci involved in neurodevelopment.

Talkowski et al. Page 20

Cell. Author manuscript; available in PMC 2013 April 27.

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

Page 21: boundaries NIH Public Access neurodevelopmental loci that confer risk across diagnostic · 2018-02-26 · inability to survey submicroscopic balanced alterations. Thus, BCAs clearly

Figure 5. Network analysis of genes implicated in autism or neurodevelopment in this studyA large network of genes disrupted by BCAs in this study are connected by first, second, orhigher order interactions. No networks were significantly enriched for genes disrupted byBCAs after correction for multiple comparisons, though a number of loci have limitedfunctional annotation available or remain of unknown function.

Talkowski et al. Page 21

Cell. Author manuscript; available in PMC 2013 April 27.

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

Page 22: boundaries NIH Public Access neurodevelopmental loci that confer risk across diagnostic · 2018-02-26 · inability to survey submicroscopic balanced alterations. Thus, BCAs clearly

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

Talkowski et al. Page 22

Tabl

e 1

Gen

es d

isru

pted

by

chro

mos

omal

rea

rran

gem

ents

*

Cat

IDD

xC

hrA

Chr

BD

isru

pted

Fis

her’

s E

xact

P‡

Fun

ctio

n

1D

GA

P201

ASD

7q11

.22

7q36

.3A

UT

S25.

6E-0

4U

nkno

wn

1&4

ND

R27

031

ND

D3q

13.3

218

q21.

2T

CF4

6.2E

-04

Tra

nscr

iptio

n fa

ctor

1D

GA

P093

ND

DX

p22.

1319

p13.

3C

DK

L5

7.2E

-02

Prot

ein

kina

se

1D

GA

P157

ND

D3p

1310

q21.

2FO

XP1

4.5E

-02

Tra

nscr

iptio

n fa

ctor

1&4

ND

R25

941

ASD

12p1

3.1

12q2

1.31

GR

IN2B

7.9E

-02

Glu

tam

ate

rece

ptor

1D

GA

P189

ND

D11

p13

12p1

2.1

SOX

58.

4E-0

2T

rans

crip

tion

fact

or in

em

bryo

nic

deve

lopm

ent

2D

GA

P232

ASD

9p11

.215

q11.

2SN

UR

F-SN

RPN

1.1E

-13

Gen

omic

Im

prin

ting

in A

ngel

man

– P

WS

regi

on

2&4

DG

AP1

55A

SD9q

34.3

11p1

1.2

EH

MT

13.

3E-0

7H

isto

ne M

ethy

ltran

sfer

ase

2D

GA

P142

ASD

2q23

.122

q13

MB

D5

3.1E

-05

Met

hyla

tion

bind

ing

2D

GA

P211

ASD

2q33

.16q

16.3

SAT

B2

1.1E

-03

Tra

nscr

iptio

nal r

egul

atio

n an

d C

hrom

atin

rem

odel

ing

3D

GA

P148

ND

DX

p11.

411

q24.

2K

IRR

EL

31.

6E-0

4C

ell a

dhes

ion

3D

GA

P154

ND

DX

q22

17p1

3.3

SMG

65.

9E-0

4N

onse

nse

med

iate

d de

cay

3N

DR

2686

7A

SD3q

25.3

114

q11.

2C

HD

82.

4E-0

2C

hrom

atin

rem

odel

ing

3D

GA

P125

ND

D7q

32.1

19q1

3.11

ZN

F507

8.0E

-02

Zin

c Fi

nger

3D

GA

P132

**N

DD

5q12

.27q

21.3

PON

31.

5E-0

1L

acto

nase

3A

C02

-005

3A

SD6q

16.1

9q21

.13

GN

A14

2.7E

-01

G-p

rote

in s

igna

ling

3D

GA

P131

ND

D1p

22.3

5q33

ZN

HIT

62.

7E-0

1Z

inc

fing

er p

rote

in

3D

GA

P193

ASD

2p22

.32q

31.3

SPA

ST2.

7E-0

1M

embr

ane

traf

fick

ing

3&4

DG

AP1

43N

DD

6q22

.16q

27PD

E10

A5.

2E-0

3Ph

osph

odie

ster

ase

3&4

DG

AP1

71N

DD

17p1

3.2

18p1

1.21

C18

orf1

3.2E

-02

Unk

now

n

3&4

DG

AP1

80**

ND

D2q

3211

q14

ZN

F804

A4.

7E-0

2Z

inc

fing

er p

rote

in

* BC

A-d

isru

pted

gen

es in

divi

dual

ly im

plic

ated

by

case

-con

trol

CN

V b

urde

n at

p <

0.1

0 or

by

a m

inim

um o

f 3

CN

Vs

in c

ases

with

non

e in

con

trol

s ar

e pr

ovid

ed. S

ee T

able

S1

and

SI f

or a

ll su

bjec

ts a

ndph

enot

ypes

and

Tab

le S

2 fo

r C

NV

cou

nts

on a

ll su

bjec

ts.

‡ Fish

er’s

exa

ct te

st p

-val

ue f

rom

com

pari

son

of C

NV

bur

den

betw

een

ND

D c

ases

and

con

trol

s.

Cat

= d

isru

ptio

n ca

tego

ry, D

x =

dia

gnos

is

ASD

= a

utis

m s

pect

rum

dis

orde

r

ND

D =

oth

er n

euro

deve

lopm

enta

l dis

orde

rs

Cell. Author manuscript; available in PMC 2013 April 27.

Page 23: boundaries NIH Public Access neurodevelopmental loci that confer risk across diagnostic · 2018-02-26 · inability to survey submicroscopic balanced alterations. Thus, BCAs clearly

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

Talkowski et al. Page 23C

hrA

and

Chr

B =

seq

uenc

ed c

hrom

osom

al s

ub-b

and

cont

aini

ng th

e B

CA

**B

CA

inhe

rite

d fr

om s

imila

rly

affe

cted

par

ent.

Cell. Author manuscript; available in PMC 2013 April 27.

Page 24: boundaries NIH Public Access neurodevelopmental loci that confer risk across diagnostic · 2018-02-26 · inability to survey submicroscopic balanced alterations. Thus, BCAs clearly

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

NIH

-PA Author Manuscript

Talkowski et al. Page 24

Tabl

e 2

Ana

lysi

s of

cop

y nu

mbe

r va

rian

ts in

inde

pend

ent s

ampl

es

Gen

e C

ateg

orie

sN

DD

Cou

nt*

Con

trol

Cou

nt*

P-V

alue

OR

95%

CI

All

Gen

es44

363

2.1E

-47

5.1

3.9–

6.8

Cat

egor

y 1

114

237.

7E-2

03.

62.

3–5.

9

Cat

egor

y 2

169

121.

6E-2

610

.25.

7–20

.1

Cat

egor

y 3

160

282.

2E-1

54.

12.

7–6.

4

Cat

egor

y 4

111

125.

1E-1

56.

73.

7–13

.3

Cat

egor

ies

1, 3

, 427

451

5.2E

-24

3.9

2.9–

5.3

* Cou

nts

of C

NV

s fr

om 1

9,55

6 N

DD

cas

es a

nd 1

3,99

1 co

ntro

ls.

All

Gen

es =

all

33 lo

ci d

isru

pted

by

BC

As

and

incl

uded

in th

e C

NV

ana

lyse

s

Cat

egor

ies

1, 3

, 4 =

ana

lyse

s of

all

gene

s no

t loc

aliz

ed to

mic

rode

letio

n sy

ndro

mes

kno

wn

to b

e as

soci

ated

with

incr

ease

d C

NV

bur

den.

See

Tab

le S

2 fo

r al

l gen

es in

eac

h ca

tego

ry.

Cell. Author manuscript; available in PMC 2013 April 27.