PPI network
The PPI network comprised 302 nodes and 644 edges, with an average node degree of 2.96 (online supplemental figure 5A). The significant PPI enrichment p value of 2.89×10-9 emphasised a notable association among the proteins within this network. Enrichment analysis indicated that these proteins were involved in various biological processes (online supplemental figure 5B). Notably, 30 proteins in the network exhibited a degree >10, with 76% of these proteins falling within the same subnetwork. This subnetwork was constructed with FN1 protein that had a higher degree of 37 (online supplemental figure 5C). GO predicted that these proteins in the subnetwork were associated with cell-cell communication (online supplemental figure 2). Additionally, SMARCA4 with 16 edges and SCN5A with 10 edge counts formed subnetworks that were enriched for positive regulation of transcription by RNA polymerase II (GO:0045944) and high voltage-gated calcium channel activity (GO:0008331), respectively (online supplemental figure 5E,F).
Furthermore, focusing on DEGs in PPI subnetworks, we found 14 novel genes as candidate genes in this study with functional significance by the adapted integrated analysis (online supplemental table 4). Among these, 10 DEGs were present in the major network: 6 postnatally upregulated genes (TLN1, THBS2, ITGA7, CD63, ALCAM, TNS1) that were enriched for Kyoto Encyclopedia of Genes and Genomes ECM receptor interaction pathway (M7098) with a false discovery rate-adjusted p value <0.001, and 4 prenatally upregulated genes (HDAC2, SOS1, HMGA2 and FBN3) that were enriched for ECM organisation and epithelial development, interconnected with known SFARI genes (online supplemental figure 5C). Additionally, we observed five DEGs in a small subnetwork associated with positive regulation of transcription, all of which were prenatally upregulated (FOXM1, HDAC2, MED17, VTA1, SS18L1). The study also highlighted DEGs such as FOXM1, TNS1, FBN3 in these subnetworks, which were recurrently mutated in our cohort.
Robust prevalence studies, which are currently lacking, are crucial to understanding the true burden of ASD in the Indian population. Moreover, there are a limited number of genetic studies on Indian families affected by ASD. To identify potential disease-causing genes in the Indian population and to understand the intricate molecular mechanisms underlying the pathology of ASD, we conducted whole exome sequencing on 23 Indian trio familial samples. Given the higher prevalence of ASD in males, as well as factors such as the female protective effect and sex hormones that make females less susceptible to ASD symptoms,9 this study focused exclusively on males.
Our investigation identified inherited and de novo variants, revealing a higher burden of inherited variants despite 72% of families reporting no history of psychiatric or neurological disorders. These findings are noteworthy given the prevailing belief in the higher heritability rate in ASD, as supported by the literature.10 In contrast, a recent study on the Indian population observed a higher prevalence of de novo mutations. However, a limitation of this study is that exome sequencing was conducted only on probands. Additionally, the recurrent mutations in the MECP2 gene observed may be attributed to sample bias.11
The study identified 96% of subjects with ASD in our cohort as harbouring deleterious variants, either in high confidence or strong candidate genes for ASD, suggesting their potential relevance to disease manifestation. Interestingly, the study highlights eight ‘high-risk’ genes carrying rare protein-damaging variants, potentially contributing to the ASD phenotype in 30% of the subjects with low to moderately low adaptive function. Among the eight high-risk genes, CACNA1D, RELN, NRXN2, SHANK2, ZNF462 harboured inherited rare missense variants and were identified as candidate genes in four subjects with ASD (030C, 105C, 081C, 092C) (table 1). These high-risk genes are associated with a diverse array of symptoms, including intellectual disability, developmental disorder, neurological disorder, decreased social interaction and restrictive repetitive behaviour.7 12
Additionally, in three probands (090C, 034C and 010C) with no reported family history of ASD, we infer that the ASD phenotype is driven by high-risk genes (WDFY3, BRSK2, DEAF1) carrying de novo variants. This is consistent with the literature suggesting the implication of de novo variants in sporadic ASD.13 Literature underscores that autism-associated genes are more intolerant to LoF variants; even carriers of these variants exhibit defective cognitive function, highlighting their significant impact.14 Hence, we consider BRSK2 and WDFY3 with high-impact variants as candidate genes in subjects 010C and 034C with ASD, respectively. Another de novo missense variant in DEAF1 could be a candidate gene in subject 010C. This inference is supported by studies reporting multiple missense variations in the DEAF1 gene, predominantly residing in the Sp100, AIRE-1, NucP41/75, DEAF-1 (SAND) domain, similar to our current findings.7
Considering genetic heterogeneity, the study investigated novel genes that could contribute to the ASD phenotype, potentially arising from ethnic differences. The integrated functional analysis identified 14 novel DEGs that are functionally important during the critical stage of brain development. The prenatally upregulated genes, including FOXM1, HDAC2, MED17, VTA1, SS18L1 and HMGA2, are recognised as transcriptional modulators. Given that the prenatal period is characterised by intense synaptogenesis, it can be inferred that the mutation burden in this phase affecting the dynamic regulation of transcription and translation processes may impact epigenetic regulation, contributing to dysregulated neuronal development.
Furthermore, our functional analysis prioritised TLN1, THBS2, ITGA7, CD36, ALCAM, TNS1, SOS1 and FBN3, which were enriched for various biological processes such as ECM and cell junction organisation. These processes converge into similar molecular mechanisms, contributing to fundamental aspects of cellular communication within the brain. Remarkably, the ‘high-risk’ genes identified in this study, such as SHANK2, NRXN2 and RELN, were also enriched for cell-cell communication pathways. This finding aligns with a broader body of research that supports the pathology of ASD mediated by cell adhesion molecules (the Neurexin gene family) and neural ECM molecules.15 Interestingly, one of the large exome studies revealed that out of 102 identified genes, 29 were associated with neuronal communications, including NRXN2 and SHANK2, leading to defective synaptogenesis and synaptic plasticity.7 Collectively, we posit that there is a dysregulation of intercellular communication that might have caused defective neurogenesis and synaptogenesis in ASD.
It is crucial to acknowledge the limitations of our study, particularly the limited sample size. Nevertheless, the study successfully identified candidate genes associated with 30% of subjects with ASD in our cohort. The identified novel genes carrying protein-damaging variants may add to the existing literature. Further research and evidence are warranted to substantiate these findings.