16+
DOI: 10.18413/2658-6533-2020-6-4-0-3

Bioinformatic analysis of microduplications at 5p15.33: identification of TPPP as a candidate gene for autism and intellectual disability

Abstract

Background: Autism is a common psychiatric disorder in children. Since autism is a multifactorial disease, the genetic predisposition plays a significant role in the pathogenesis. However, numerous studies focused on genomic abnormalities in autism are unable to provide reproducible information about pathogenic processes causing this devastating disorder. The aim of the study: The identification of candidate genes by bioinformatic analysis of recurrent copy number variations (CNV) (5p15.33 duplications) revealed by molecular karyotyping in a clinical cohort. Materials and methods: Molecular karyotyping of 296 children with idiopathic autism, intellectual disability was performed by SNP-array. Bioinformatic analysis was made using an original algorithm. Results: Molecular karyotyping genome-wide analysis revealed 3 cases of 5p15.33 duplications. Bioinformatic analysis identified a candidate gene TPPP for brain dysfunction. TPPP is highly expressed in the brain; the gene encodes a protein catalyzing tubulin polymerization, which is important for oligodendrocytes myelination. Interactome analysis was performed to identify pathogenic processes associated with CNV involving TPPP. Expanded TPPP interactome network encompasses 37 proteins, 19 of which are associated with the synaptic plasticity and axonal guidance involved in the normal development and functioning of the brain. Changes in these processes may lead to autism and intellectual disability. Interestingly, clinical genetic databases have not previously associated this gene with a disease condition. Conclusion: Bioinformatic analysis of 5p15.33 CNV allowed us to show that TPPP is a candidate gene for alterations to the development and functioning of the brain. Accordingly, possible disease mechanisms leading to the development of autism with intellectual disability have been proposed. Since data on candidate processes is useful for personalized treatment, we conclude that molecular karyotyping complemented by our original in silico analysis of epigenome, proteome and metabolome is to become an important component for basic and applied research in psychiatric genetics.


Introduction. Autism spectrum disorders and intellectual disability are common in children and adolescents. Average incidence of autism varies from 1 to 10 cases per 1000 individuals around the world [1, 2]. The main symptoms of autism include speech and communication abnormalities, impaired social adaptation, motor function alterations, and stereotypies. Autism research shows that genetic predisposition plays a significant role in the etiology [3]. Autism may be associated with large chromosomal aberrations (2-10%), copy number variations (CNV) (5-15%) [3-5], and single-gene mutations (5-10%) [4]. In total, genomic pathology affects no less than 25-35% of patients with autistic disorders. However, focusing on autism predisposition genes is limited to expanding the list of candidates. Available data indicate high heterogeneity and low penetrance of mutations associated with this disease. Moreover, there are speculations about reducing autism risks by variations in a number of genes. Apparently, autism is not associated with single genes, but with a variety of molecular and cellular processes [3, 6, 7]. Determining the mechanism underlying the pathogenesis of autism represents an important area of biological psychiatry and medical genetics. Consequently, bioinformatic analysis of genomic variations in individuals with autism and/or intellectual disability to uncover altered molecular and cellular processes is a significant step forward for unraveling mechanisms of the disease and providing the evidence-based therapeutic opportunities.

Aim of the study. Here, we have attempted to characterize microduplications (large CNV) affecting chromosome 5p15.33 by a bioinformatics analysis of molecular karyotyping data.

Materials and methods. Among 296 children with autism, intellectual disability and congenital malformations/developmental delays (age: 2-13 years (average age: 5.5 years); sex ratio: 128/168 (females/males) or 1/1.3), three individuals have been found to demonstrate CNV manifesting as duplications at 5p15.33. Duplications encompassed same genomic loci in three children with autism and intellectual disability. All three children had been diagnosed to have autism, intellectual disability and developmental delays. Molecular karyotyping using Affymetrix CytoScan HD Arrays platform has been performed with an average resolution of ~1000 bp. In silico analysis of the phenotype outcome has been performed using original bioinformatic techniques allowing modulating consequences of genome imbalances at transcriptome, proteome and metabolome levels. The technique has been previously described in details [8-11].

Results and discussion. Clinically, all three children demonstrated an idiopathic autism assessed by the Child Autism Rating Scale (CARS) and intellectual disability. In addition to autistic traits, two children had congenital anomalies. The first patient had a hydrocephalic shape of the skull, lowered outer corners of palpebral fissures. The second had severe microcephaly, a triangular face, wide distal finger phalanges, protruding auricles, a flat bridge of the nose, skin rashes and corpus callosum hypoplasia. The third child demonstrated exclusively wavy hair, which was not observed in other members of his family.

The first patient exhibited 5p15.33 duplication (genomic location: 448542-676847, size: 228305 bp) affecting 7 genes: EXOC3, PP7080, SLC9A3, MIR4456, LOC100996325, CEP72, TPPP. The second patient demonstrated 5p15.33 duplication (genomic location: 448542-819920), size 371378 bp) affecting 8 genes: EXOC3, PP7080, SLC9A3, MIR4456, LOC100996325, CEP72, TPPP, ZDHHC11. The third patient showed 5p15.33 duplication (genomic location: 448542-1175479, size: 726937 bp) affecting 14 genes: EXOC3, PP7080, SLC9A3, MIR4456, LOC100996325, CEP72, TPPP, ZDHHC11, BRD9, TRIP13, MIR4635, NKD2, SLC12A7, LOC100506688. Further characteristics of the duplications are given in Figure 1 and Table 1. No other detectable CNV have been associated with the phenotypic features in these children.

Regardless of differences in length of duplications and the number of affected genes, the overlapping region exists. It is important to note that distal breakpoint is the same in all three cases. Since CNV/duplications result from alterations to recombination, replication, and DNA repair in specific genomic loci [12, 13], one can suggest a recombination hotspot localized at 5p15.33.

Previously, essential genetic processes/changes mediating brain diseases were defined as those occurring directly in cells of the diseased brain [14, 15]. Therefore, in silico gene expression analysis may provide information for prioritizing candidate genes/processes for brain dysfunction [9]. Our analysis has demonstrated that TPPP has the highest expression in the brain as to remaining duplicated genes. Moreover, the relative expression is increased in almost all analyzed areas of the brain (Fig. 2).

The first duplication affected exons 3 and 4 of TPPP, which encode phosphorylated protein domains, which are involved in the regulation of the protein activity. In the second and third cases, TPPP is completely duplicated. The gene has 4 exons and encodes a protein catalyzing the polymerization of tubulin, a component of microtubules. Microtubules are essential components of cellular processes, such as intracellular transport and cell division (chromosomal disjunction in mitosis and meiosis). In addition, microtubules play a role in myelination of oligodendrocytes [16, 17]. Previously, possible roles of TPPP in the development of neuropsychiatric disorders in children have been suggested [9]. Taking these data into account, TPPP has been associated with autism and intellectual disability.

To gain further insights into mechanisms of phenotypic outcomes of the duplications, an interactome analysis was carried out. As a result, TPPP was found to interact with 7 proteins: SCNA, GAPDH, CDK5, CDK5R1, TMED3, LIMK1, ROCK1 (Fig. 3). These are involved in cell cycle regulation, metabolism, and development of the nervous system. Figure 3 shows TPPP interactome.

To describe ontologies associated with alterations to the interactome due to TPPP duplications, a brief overview of proteins involved in this network is given.

  • SNCA (alpha-synuclein) protein belongs to the synuclein family. SNCA expression is high in brain cells. Alpha-synuclein is associated with the membrane of vesicles in neurons. The protein is involved in the control of the transport to presynaptic membrane. SNCA is associated with Parkinson's disease [18].
  • GAPDH (glyceraldehyde-3-phosphate dehydrogenase) is an enzyme required for oxidative phosphorylation. It is involved in the nitrosylation of nuclear proteins and regulation of mRNA stability, as well [19].
  • Cyclin-dependent kinase 5 (CDK5) — unlike other members of the cyclin-dependent kinase family the protein does not directly regulates of cell cycle. Instead, CDK5 is associated with synaptic plasticity and neuronal migration. This enzyme phosphorylates proteins. The phosphorylation is involved in regulation of cytoskeleton, endocytosis, exocytosis, and apoptosis. CDK5 expression is increased in postmitotic cells of the central nervous system [20].
  • CDK5R1 (p35) is a neuron-specific activator of the cyclin-dependent kinase CDK5 by calpain-based proteolytic cleavage of p35 to form the p25 form. CDK5-p25 complexes cause changes in kinase structure and activity [21].
  • TMED3 (transmembrane protein p24 containing domain 3) is encoded by TMED3, a gene, which is not indexed in the OMIM database [22].
  • LIMK1 regulates actin remodeling by phosphorylating cophilin and converting it into an inactive form. This rearranges dendrite spines and axon modifications to form synaptic plasticity [23].
  • ROCK1 (serine/threonine kinase) is a Rho-associated kinase activated by binding between Rho and guanosine triphosphate (GTP). This protein is involved in bioprocesses mediated by modification of the actin cytoskeleton and formation of actomyosin complexes [24].

Apparently, LIMK1 and ROCK1 are involved in pathways of axonal guidance and cytoskeleton regulation. These pathways play a key role in the formation and functioning of the nervous system [25, 26]. The improper regulation causes a decrease in the viability and functional activity of cells, and leads to genomic (chromosomal) instability, which is an element of pathogenic cascades in a wide spectrum of brain diseases [27-29]. Interactome analysis shows that TPPP is involved in brain development and functioning. Therefore, TPPP copy number changes leading to altered gene dosage may have negative effects on neurodevelopmental diseases (i.e. autism and intellectual disability).

Expanded interactome analysis was carried out to highlight ontologies or candidate processes for neurodevelopmental abnormalities associated with 5p15.33 duplications. As a result, 37 proteins were found to interact with TPPP: CDK5R1, GAPDH, SNCA, LIMK1, TMED3, ROCK1, CDK5, RHOA, CCND2, CABLES1, UBC, FYN, ENO1, MAPT, SNCAIP, PARK2, SLC6A3, CDK5R2, NDEL1, PPP1R1B, RAC1, APP, CDC42, MSN, CSNK2A1, RND3, PGK2, HSPA4, RPL13A, PARK7, CCNB1, LRRK2, CDKN1B, DPYSL2, AMPH, MYH14, RHOC (Fig. 4).

According to the largest ontologies, proteins were clustered as follows: proteins associated with the glycolysis enzymes, structural and functional proteins of synaptic connections, proteins involved in actin polymerization and axonal guidance; proteins regulating cell cycle proliferation and cell differentiation (Fig. 4). It should be taken into account that proteins may be simultaneously involved in several processes. Consequently, we have clustered proteins according to ontologies relevant to brain development and functioning. The proteins of synaptic plasticity (I) and axonal guidance (II) are of the interest in the context of brain dysfunction. The first group includes APP, SLC6A3, PPP1R1B, TMED3, SNCA, AMPH, and the second group includes CDKN1B, CDK5R1, CDK5R2, LIMK1, DPYSL2, CDK5, NDEL1, MYH14, RND3, MAPT, CABLES1, MSN, ROCK1. According to the literature, a large number of autism-associated genes encode proteins involved in synaptic plasticity [30]. Axonal guidance is a key mechanism for the development of brain structures during brain development. Genes ontologically associated with axonal guidance are occasionally mutated in autism and intellectual disability [31-34]. It should be noted that 12 out of 37 proteins of the expanded TPPP interactome (GAPDH, HSPA4, PARK7, PARK2, LRRK2, CDK5R2, MAPT, CDK5, APP, SNCA, SLC6A3, PPP1R1B) are elements of the neurodegeneration pathway (Fig. 4, marked with asterisks).

Conclusion. We report on an in silico analysis of functional consequences of 5p15.33 microduplications in 3 children with autism and intellectual disability. Using an original in silico technology modulating phenotypic outcomes of CNV (CNV prioritization) at transcriptome, proteome and metabolome levels, we have found that TPPP is a candidate gene for autism with intellectual disability. Additionally, we have been able to propose a number of candidate processes for neurodevelopmental abnormalities in individuals with 5p15.33 duplications encompassing TPPP. These data are intrinsically useful for forthcoming efforts in developing personalized therapeutic strategies for neurodevelopmental diseases mediated by CNV. Knowledge about consequences of genomic variations generated by identifying candidate processes based on in silico analysis of transcriptome, proteome and metabolome is an important tool for basic and diagnostic genome research.

 

Financial support

Our study is partially supported by RFBR and CITMA within the research project №18-515-34005. Prof. S.G. Vorsanova’s laboratory is partially supported by the Government Assignment of the Russian Ministry of Health, Assignment no. AAAA-A18–118051590122-7. Prof. I.Y. Iourov’s laboratory is partially supported by the Government Assignment of the Russian Ministry of Science and Higher Education, Assignment no. AAAA-A19–119040490101-6.

Reference lists

  1. Baxter AJ, Brugha TS, Erskine HE, et al. The epidemiology and global burden of autism spectrum disorders. Psychological Medicine. 2015;45(3):601-613. DOI: https://doi.org/10.1017/S003329171400172X
  2. Hodges H, Fealko C, Soares N. Autism spectrum disorder: definition, epidemiology, causes, and clinical evaluation. Translational Pediatrics. 2020;9(Suppl 1):S55-S65. DOI: https://doi.org/10.21037/tp.2019.09.09
  3. Vorsanova SG, Iourov IY, Demidova IA, et al. Cytogenetics and molecular cytogenetics of autism. Moscow: The publishing house of The Russian Academy of Natural History; 2016.
  4. Ziats MN, Rennert OM. The evolving diagnostic and genetic landscapes of autism spectrum disorder. Frontiers in Genetics. 2016;7(65):1-6. DOI: https://doi.org/10.3389/fgene.2016.00065
  5. Vorsanova SG, Voinova VI, Iurov II, et al. Cytogenetic, molecular cytogenetic, clinical and genealogical study of mothers of children with autism: a search for family genetic markers of autistic disorders. Zhurnal nevrologii i psikhiatrii im. SS Korsakova. 2009;109(6):54-64.
  6. Robinson EB, Lichtenstein P, Anckarsäter H, et al. Examining and interpreting the female protective effect against autistic behavior. Proceedings of the National Academy of Sciences of the United States of America. 2013;110(13):5258-5262. DOI: https://doi.org/10.1073/pnas.1211070110
  7. Iourov IY, Vorsanova SG, Yurov YB. Genomic and chromosomal disorders of the central nervous system. In: Molecular and cytogenetic aspects. Moscow: Medpraktika Moscow; 2014.
  8. Iourov IY, Vorsanova SG, Saprina EA, Yurov YB. Identification of candidate genes of autism on the basis of molecular cytogenetic and in silico studies of the genome organization of chromosomal regions involved in unbalanced rearrangements. Russian Journal of Genetics. 2010;46:1190-1193. DOI: https://doi.org/10.1134/S102279541010011X
  9. Iourov IY, Vorsanova SG, Yurov YB. In silico molecular cytogenetics: a bioinformatic approach to prioritization of candidate genes and copy number variations for basic and clinical genome research. Molecular Cytogenetics. 2014;7(1:98):1-10. DOI: https://doi.org/10.1186/s13039-014-0098-z
  10. Iourov IY, Vorsanova SG, Zelenova MA, et al. Bioinformatic technology accessing functional concequences of genomic variations. Fundamental research. 2015;19(2):4209-4214.
  11. Vorsanova SG, Yurov YB, Iourov IY. Neurogenomic pathway of autism spectrum disorders: Linking germline and somatic mutations to genetic-environmental interactions. Current Bioinformatics. 2017;12:19-26.
  12. Weischenfeldt J, Symmons O, Spitz F, et al. Phenotypic impact of genomic structural variation: insights from and for human disease. Nature Reviews Genetics. 2013;14(2):125-138. DOI: https://doi.org/10.1038/nrg3373
  13. Khil PP, Camerini-Otero RD. Genetic crossovers are predicted accurately by the computed human recombination map. PLoS Genetics. 2010;6(1):e1000831. DOI: https://doi.org/10.1371/journal.pgen.1000831
  14. Iourov IY, Vorsanova SG, Yurov YB. Chromosomal variation in mammalian neuronal cells: known facts and attractive hypotheses. International Review of Cytology. 2006;249:143-191. DOI: https://doi.org/10.1016/S0074-7696(06)49003-3
  15. Iourov IY, Vorsanova SG, Yurov YB. Molecular cytogenetics and cytogenomics of brain diseases. Current Genomics. 2008;7(9):452-465. DOI: https://doi.org/10.2174/138920208786241216
  16. Lee J, Gravel M, Zhang R, et al. Process outgrowth in oligodendrocytes is mediated by CNP, a novel microtubule assembly myelin protein. Journal of Cell Biology. 2005;170(4):661-673. DOI: https://doi.org/10.1083/jcb.200411047
  17. Lee BY, Hur EM. A role of microtubules in oligodendrocyte differentiation. International Journal of Molecular Sciences. 2020;21(3):1062. DOI: https://doi.org/10.3390/ijms21031062
  18. Siddiqui IJ, Pervaiz N, Abbasi AA. The Parkinson disease gene SNCA: evolutionary and structural insights with pathological implication. Scientific Reports. 2016;6:24475. DOI: https://doi.org/ 10.1038/srep24475
  19. Gong Y, Zou B, Peng S, et al. Nuclear GAPDH is vital for hypoxia-induced hepatic stellate cell apoptosis and is indicative of aggressive hepatocellular carcinoma behavior. Cancer Management and Research. 2019;11:4947-4956. DOI: https://doi.org/10.2147/CMAR.S202268
  20. Hawasli AH, Benavides DR, Nguyen C, et al. Cyclin-dependent kinase 5 governs learning and synaptic plasticity via control of NMDAR degradation. Nature Neuroscience. 2007;10(7):880-886. DOI: https://doi.org/10.1038/nn1914
  21. Moncini S, Castronovo P, Murgia A, et al. Functional characterization of CDK5 and CDK5R1 mutations identified in patients with non-syndromic intellectual disability. Journal of Human Genetics. 2016;61(4):283-293. DOI: https://doi.org/10.1038/jhg.2015.144
  22. Emery G, Rojo M, Gruenberg J. Coupled transport of p24 family members. Journal of Cell Science. 2000;113(13):2507-2516.
  23. Wang J, Cui WY, Wei J, et al. Genome-wide expression analysis reveals diverse effects of acute nicotine exposure on neuronal function-related genes and pathways. Frontiers in Psychiatry. 2011;2:5. DOI: https://doi.org/10.3389/fpsyt.2011.00005
  24. Newell-Litwa KA, Badoual M, Asmussen H, et al. ROCK1 and 2 differentially regulate actomyosin organization to drive cell and synaptic polarity. Journal of Cell Biology. 2015;210(2):225-242. DOI: https://doi.org/ 10.1083/jcb.201504046
  25. Anitha A, Nakamura K, Yamada K, et al. Genetic analyses of roundabout (ROBO) axon guidance receptors in autism. American Journal of Medical Genetics, Part B: Neuropsychiatric Genetics. 2008;147B(7):1019-1027. DOI: https://doi.org/10.1002/ajmg.b.30697
  26. Van Battum EY, Brignani S, Pasterkamp RJ. Axon guidance proteins in neurological disorders. The Lancet Neurology. 2015;14(5):532-546. DOI: https://doi.org/10.1016/S1474-4422(14)70257-1
  27. Yurov YB, Vorsanova SG, Demidova IA, et al. Mosaic brain aneuploidy in mental illnesses: an association of low-level post-zygotic aneuploidy with schizophrenia and comorbid psychiatric disorders. Current Genomics. 2018;19(3):163-172. DOI: https://doi.org/10.2174/1389202918666170717154340
  28. Yurov YB, Vorsanova SG, Iourov IY. Human molecular neurocytogenetics. Current Genetic Medicine Reports. 2018;6:155-164. DOI: https://doi.org/10.1007/s40142-018-0152-y
  29. Iourov IY, Vorsanova SG, Yurov YB, Kutsev SI. Ontogenetic and pathogenetic views on somatic chromosomal mosaicism. Genes. 2019;10(5):379. DOI: https://doi.org/10.3390/genes10050379
  30. Bourgeron T. From the genetic architecture to synaptic plasticity in autism spectrum disorder. Nature Reviews Neuroscience. 2015;16(9):551-563. DOI: https://doi.org/ 10.1038/nrn3992
  31. Anitha A, Nakamura K, Yamada K, et al. Genetic analyses of roundabout (ROBO) axon guidance receptors in autism. American Journal of Medical Genetics, Part B: Neuropsychiatric Genetics. 2008;147B(7):1019-1027. DOI: https://doi.org/10.1002/ajmg.b.30697
  32. Abou-Donia MB, Suliman HB, Siniscalco D, et al. De novo blood biomarkers in autism: autoantibodies against neuronal and glial proteins. Behavioral Sciences. 2019;9(5):47. DOI: https://doi.org/10.3390/bs9050047
  33. Yan J, Pan Y, Zheng X, et al. Comparative study of ROCK1 and ROCK2 in hippocampal spine formation and synaptic function. Neuroscience Bulletin. 2019;35(4):649-660. DOI: https://doi.org/10.1007/s12264-019-00351-2
  34. Geschwind DH, Levitt P. Autism spectrum disorders: developmental disconnection syndromes. Current Opinion in Neurobiology. 2007;17(1):103-111. DOI: https://doi.org/10.1016/j.conb.2007.01.009
  35. Wu C, Jin X, Tsueng G, et al. BioGPS: building your own mash-up of gene annotations and expression profiles. Nucleic Acids Research. 2016;44(D1):D313-D316. DOI: https://doi.org/10.1093/nar/gkv1104
  36. Szklarczyk D, Gable AL, Lyon D et al. STRING v11: protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucleic Acids Research. 2019;47(D1):D607-613. DOI: https://doi.org/10.1093/nar/gky1131