Back to Journals » International Journal of General Medicine » Volume 19

A Multi-Center Study: Developing a Nomogram for Predicting Genetic Results of Trio-Based Whole-Exome Sequencing (Trio-WES) in Diagnosing Children with Syndromic Neurodevelopmental Disorders (s-NDDs)

Authors Wu R ORCID logo, Qiu R, Tang D, Meng Z, Li X, Li D, Tang W

Received 27 March 2026

Accepted for publication 6 June 2026

Published 12 June 2026 Volume 2026:19 612553

DOI https://doi.org/10.2147/IJGM.S612553

Checked for plagiarism Yes

Review by Single anonymous peer review

Peer reviewer comments 3

Editor who approved publication: Professor Kenneth Adler



Ruohao Wu,1– 3,* Ronglin Qiu,2,4,* Danxia Tang,1,3,* Zhe Meng,1 Xiaojuan Li,2,5 Dongfang Li,1– 3 Wenting Tang6

1Department of Children’s Neuro-Endocrinology, Sun Yat-Sen Memorial Hospital of Sun Yat-Sen University, Guangzhou, Guangdong, People’s Republic of China; 2Department of Pediatrics, Shenshan Medical Center of Sun Yat-Sen Memorial Hospital, Shanwei, Guangdong, People’s Republic of China; 3Weierkang Children’s Rehabilitation Center, Sun Yat-Sen Memorial Hospital of Sun Yat-Sen University, Guangzhou, Guangdong, People’s Republic of China; 4Department of Pediatric Surgery, Sun Yat-sen Memorial Hospital of Sun Yat-sen University, Guangzhou, Guangdong, People’s Republic of China; 5Department of Cellular and Molecular Diagnostics, Sun Yat-sen Memorial Hospital of Sun Yat-sen University, Guangzhou, Guangdong, People’s Republic of China; 6Department of Research and Molecular Diagnostics, Sun Yat-Sen University Cancer Center, Guangzhou, Guangdong, People’s Republic of China

*These authors contributed equally to this work

Correspondence: Dongfang Li, Email [email protected] Wenting Tang, Email [email protected]

Objective: The diagnostic efficacy of empirical trio-based whole-exome sequencing (trio-WES) for unexplained syndromic neurodevelopmental disorders (NDDs) remains unsatisfactory. This study aimed to explore the diagnostic value of phenotypic indicators and establish a nomogram for predicting the genetic results of trio-WES for diagnosing patients with unexplained s-NDDs.
Methods: We retrospectively collected phenotypic and genotypic data from 265 children with s-NDDs who received trio-WES at Sun Yat-sen Memorial Hospital (training cohort), 38 and 97s-NDDs patients with trio-WES test at Shenshan Medical Center (validation cohort-1) and Weierkang Children’s Rehabilitation Center (validation cohort-2), respectively. Logistic analysis was employed to identify the independent predictors of a positive trio-WES diagnosis in the training cohort for model establishment. The predictive performance and robustness of the model were assessed using receiver operating characteristic (ROC) and confusion matrix analyses, respectively.
Results: The analysis revealed that the severity of neurodevelopmental delays, head circumference abnormality, and complexity of neurodevelopmental comorbidities were independent predictive indicators for distinguishing s-NDDs patients with positive trio-WES results. The nomogram combining the three predictors showed good predictive performance with an area under the ROC (AUC) in the training cohort of 0.827 (95% CI: 0.775– 0.879), yielding a confusion matrix with sensitivity, specificity, accuracy, precision, and F1 score of 78.23%, 78.01%, 78.11%, 75.78%, and 0.77, respectively. The model also had an excellent prediction in the external validation cohorts (AUC: 0.953; 95% CI: 0.881– 0.998, sensitivity: 88.89%; specificity: 80.00%; accuracy: 84.21%; precision: 80.00%; and F1 score: 0.84 in validation cohort-1 and AUC: 0.910; 95% CI: 0.843– 0.978, sensitivity: 83.33%; specificity: 85.45%; accuracy: 84.54%; precision: 81.40%; and F1 score: 0.82 in validation cohort-2).
Conclusion: The model can serve as a useful tool for assisting decision-making in applying trio-WES in the diagnostic strategy for s-NDDs, helping to implement personalized pre-diagnosis assessments for affected families.

Keywords: syndromic neurodevelopmental disorders, trio-based whole-exome sequencing, diagnostic yield, phenotype-driven nomogram

Introduction

Syndromic neurodevelopmental disorders (s-NDDs) are genetically and clinically heterogeneous syndromes characterized by a complex group of neurological diseases with observable neurodevelopmental deficits beginning at a very early age, commonly <6 months-old.1 As our understanding of s-NDDs has increased, the prevalence has dramatically increased, accounting for almost 1–3% of children worldwide.2,3 The rapidly increasing prevalence of s-NDDs has caused a substantial burden to society and affected families. The main clinical manifestations of s-NDDs include global developmental delay/intellectual disability (GDD/ID), autism spectrum disorder (ASD), attention deficit hyperactivity disorder (ADHD), and epilepsy (EP), of which GDD/ID is the core of s-NDDs.4,5 Other common neurological manifestations, such as ADHD, ASD, and EP, often act as comorbidities, solely or jointly present in affected individuals, which dramatically adds layers of complexity to the diagnosis of s-NDDs.6

Table 1 Comparison of Baseline Clinical Features Between Training and the Two External Validation Cohorts

Table 2 Independent Variables Selection for Constructing a Nomogram of Predicting Genetic Results of Trio-WES in Diagnosing Cases with s-NDDs in Training Cohort Using Logistic Regression Analysis

Table 3 The Collinearity Diagnostic Analysis of Incorporated Variables for Predicting Genetic Results of Trio-WES in Diagnosing Cases with s-NDDs in Training Cohort

Table 4 Coefficients of Established Model for Prediction of Obtaining Genetic Diagnosis via Trio-WES in Individuals with s-NDDs in Training Cohort

Table 5 Predictive Parameters of the Established Phenotype-Driven Model in Training and Validation Cohorts

Currently, the etiology of s-NDDs remains unclear; however, genetic disturbances such as single-nucleotide variants (SNVs) and copy-number variants (CNVs) are considered to play a key role in the pathogenesis of s-NDDs.7 Owing to significant advancements in identifying genetic alterations in human diseases, particularly the popularization of trio-based (parental-offspring pattern) whole-exome sequencing (trio-WES), the most common sequencing technology enabling the identification of exon-level variants, including SNVs and CNVs, and genetic disturbances are being found more frequently than before in many individuals with idiopathic s-NDDs.8 Nonetheless, nearly half of s-NDDs patients cannot obtain a genetic diagnosis after undergoing the trio-WES test because of the complex and heterogeneous genetic components underlying s-NDDs,9 and the diagnostic yield of trio-WES in s-NDDs remains unsatisfactory. Therefore, developing a simple-to-use approach for the early identification of patients with s-NDDs who can be diagnosed by trio-WES is important for clinicians, which may provide affected individuals with further assessment of related medical conditions earlier in the disease course. Meanwhile, a user-friendly tool is crucial to assist decision-making for trio-WES diagnostic strategy at admission, promoting individualized medical planning and reducing unnecessary financial and time burdens for affected families.

The nomogram is a common graphical calculator used to visualize the scoring system of the linear regression model10 and is widely used to predict various clinical outcomes across many pediatric neurodevelopmental diseases such as ASD and ADHD.11,12 However, to the best of our knowledge, few studies have developed nomograms to forecast clinical outcomes in individuals with s-NDDs. Here, we conducted a multicenter study with independent external validation based on the phenotype-driven concept, developing and validating a nomogram model for predicting the genetic results of trio-WES in diagnosing patients with s-NDDs.

Materials and Methods

Participants Enrollment and Screening

The phenotypic definition of s-NDDs in this study is as follows: (1) exhibiting various severities of GDD/ID. Clinical diagnosis and severity assessment of GDD/ID were performed by neuropsychiatric pediatricians according to the guidelines of the Diagnostic and Statistical Manual of Mental Disorders, 5th Edition (DSM-V)1 and the criteria of the International Classification of Disease Version 11 (ICD-11),13 respectively. (2) Comorbid with at least one type of common neurodevelopmental manifestation including EP, ADHD, and ASD. Diagnoses of these comorbidities were made by neuropsychiatric pediatricians according to the DSM-V guidelines for ADHD and ASD and the International League Against Epilepsy guidelines for EP.14 (3) Children with a clear non-genetic etiology, such as bilirubin encephalopathy or hypoxic-ischemic encephalopathy, were excluded from this study.

As depicted in Figure 1, information on 871 individuals with s-NDDs from Children’s Medical Center of Sun Yat-sen Memorial Hospital (SYSMH) from September 1, 2016, to December 31, 2022, were retrospectively analyzed and used as the training cohort enrollment. After carrying out a series of phenotypic data, informed consents of genetic tests, and routine genetic (G-band karyotyping and fragile-X analysis) results screenings to exclude ineligible cases, including s-NDDs patients from out-patient department with unclear/incomplete clinical information (excluding 346 subjects), patients’ parents/guardians who refused to undergo genetic tests or declined to use their genetic reports for publication due to the financial or other personal issues (excluding 239 subjects), and patients who harbored apparent chromosomal disorders, such as fragile-X syndrome and Down syndrome, for which it was not appropriate to use trio-WES as their diagnostic strategy (excluding 21 cases). Finally, 265 patients with unexplained s-NDDs and trio-WES test results were included in this retrospective study and served as the training set. Meanwhile, 92 patients with s-NDDs from the Shenshan Medical Center (SSMC) on July 1, 2024, and July 31, 2025, were retrospectively analyzed and used as an external validation cohort enrollment. After performing a series of screenings (same as the screenings in the training set), 38 trio-WES-tested children with unexplained s-NDDs were included in this retrospective study and served as the validation set-1 in the current study. Moreover, 115 subjects with s-NDDs from the Weierkang Children’s Rehabilitation Center (WCRC), a specialized children’s neurorehabilitation clinic focusing on diagnosis and rehabilitation of pediatric neurodevelopmental disorders, from January 1, 2023, to December 31, 2023. After carrying out the same process of screenings, 97s-NDDs children with trio-WES test results were enrolled and served as the validation set-2 in the current study.

A flowchart of a multi-center study on s-NDDs cases across three centers, detailing screening and validation steps.

Figure 1 Flowchart of this retrospective multi-center study. “±” means enrolled subjects obtained a positive genetic results or a negative genetic results.

Abbreviations: s-NDDs, syndromic neurodevelopmental disorders; trio-WES, trio-based whole-exome sequencing; SYSMH, Sun Yat-Sen Memorial Hospital; SSMC, Shenshan Medical Center; GDD/ID, WCRC, Weierkang Children’s Rehabilitation Center; global developmental delay/intellectual disability.

Ethical Compliance

This retrospective study was approved by the Ethical Committee of Sun Yat-sen Memorial Hospital (Approval Number: SYSKY-2025-244-01) and Shenshan Medical Center (Approval Number: 2025-SSKY-111-01). Written informed consent for genetic testing and the publication of related anonymous genetic reports were obtained from the parents/guardians of all 400 participants.

Definition of the Outcome Indicator

The binary outcome indicator in this study was the genetic results obtained via trio-WES in participants with s-NDDs, and the definition of trio-WES detected exon-level variants (including SNVs and CNVs) was based on the American College of Medical Genetics (ACMG) guidelines. Specifically, the pathogenicity of detected SNVs was assessed according to the 2015 ACMG guideline,15 and these SNVs were accordingly divided into “pathogenic/likely pathogenic SNVs” and “benign/uncertain significance SNVs”. The pathogenicity of the identified CNVs was evaluated according to the 2019 ACMG guideline for CNVs,16 and these CNVs were also divided into “pathogenic/likely pathogenic CNVs” and “benign/uncertain significance CNVs” accordingly. Based on their pathogenicity evaluations, the participants in the current study were divided into s-NDDs patients with a positive trio-WES result (+, having pathogenic/likely pathogenic SNVs or CNVs from their genetic reports) and patients with negative trio-WES results (−, having benign/uncertain significance SNVs or CNVs from their genetic reports).

Interpretation of the Candidate Predictors

Candidate predictors in this study were retrospectively collected from patients’ medical records in the hospital information system. This included (a) Baseline demographic characteristics such as sex, admission date, and onset age. (b) Phenotypic predictors: 1) GDD/ID severity evaluated via ICD-11: a participant under 5-years-old who had the developmental quotient (DQ) scores <35 points can be regarded to have severe-profound GDD, otherwise, this subject with DQ scores ranging from 35 to 75 points was regarded to have mild-moderate GDD. A child older than (≥) 5-years-old who presented the intelligence quotient (IQ) scores <40 points can be considered to have severe-profound ID, otherwise, this child with IQ scores ranging from 40 to 70 points was considered to have mild-moderate ID. 2) Comorbidity condition defined as follows: single, an s-NDDs participant coexisting only with one type of neurodevelopmental comorbidities, including ASD, ADHD, and EP; multiple, an s-NDDs participant coexisting with two or more types of neurodevelopmental comorbidities (ASD, ADHD, and EP). 3) ASD: yes or no. 4) ADHD: yes or no. 5) EP: yes or no. 6) HCA: yes (a participant exhibiting microcephaly or macrocephaly) or no (a participant has normal head phenotype). 7) BM detected by cranial magnetic resonance imaging (c-MRI): yes (a participant with pathological brain structure/parenchyma malformation, mainly including cerebral white matter changes, basal ganglia lesions, and dysplasia of the corpus callosum, revealed by c-MRI) or no (a participant with normal brain structure detected via c-MRI). 8) Hearing impairment and ear anomaly diagnosed by auditory brainstem responses (ABR) and Human Phenotype Ontology (HPO):17 yes (a participant with hearing impairment diagnosed via the ABR and/or ear malformation according to the HPO) or no (a participant with normal hearing and normal ear phenotype). 9) Visual impairment and ocular malformation diagnosed by visual evoked potentials (VEP) and HPO: yes (a participant with visual impairment detected via the VEP and/or ocular malformation on the basis of the HPO) or no (a participant showing normal visual function with normal ocular phenotype).

Nomogram Establishment and Validation of Model Performance

Univariate and multivariate logistic regression with collinearity diagnostic analysis (variance inflation factor [VIF] < 5 and tolerance > 0.2) was used to screen independent predictive factors (P < 0.05) in the training cohort. A nomogram prediction model was constructed based on the regression coefficients of the included independent predictive factors and was visualized using the R package.

For the internal assessment of model performance in the training cohort, the stability of the nomogram was evaluated using a calibration curve with the Hosmer–Lemeshow (H-L) test. The H-L test (P > 0.05) indicated a good fit between the model-predicted calculation and the observed data. The receiver operating characteristic (ROC) curve with the area under the curve (AUC) was used to evaluate the discriminative ability of the model, with a value exceeding 0.7 indicating good discriminative ability. The clinical application value of the model was evaluated using a clinical impact curve (CIC) and a decision curve analysis (DCA). Moreover, we also used the 10-fold cross and the 1000-times bootstrap resampling validation methods to evaluate the reliability of the model, with a concordance index (C-index) exceeding 0.7 from both methods, revealing good reliability of the nomogram.

For external evaluation of model performance, the optimal cutoff value of the nomogram-total point was first calculated based on the maximal Youden index value of the training cohort. Then, according to this optimal cut-off value, all subjects in the external validation cohort were transformed into cases with “nomogram-predicted positive genetic results” or “nomogram-predicted negative genetic results”. Finally, ROC, calibration curve, and DCA/CIC were used to evaluate the performance of the nomogram model in the transformed external cohort. We also calculated the model’s sensitivity, specificity, accuracy, precision, and F1 score in the training and external validation sets to assess the predictive power of this nomogram comprehensively.

Statistical Analysis

In the current study, count data were expressed as the number of cases (%), and the χ2 test was used for comparisons between groups. For quantitative data, a normal distribution test was performed first. The data conformed to a normal distribution and were expressed as mean ± standard deviation (SD), and the t-test was used for comparison between two groups. All data analyses and result visualizations were performed in the R environment (version 4.3.1). Following previous works used R packages for statistical analyses and visualization,18–20 the R packages used in this study are mainly from Bioconductor (https://bioconductor.org). P <0.05 was considered statistically significant.

Results

Comparisons of Clinical Features Between Training and Validation Sets

In this multicenter retrospective study, 265s-NDDs cases from SYSMH were enrolled in the training cohort to establish the nomogram model. 38 and 97s-NDDs subjects from SSMC and WCRC were enrolled as the two independent validation cohorts to verify the generalization performance of the established model. In the training set, the global diagnostic yield of trio-WES was 46.79%, while the global diagnostic rate of trio-WES in the validation set was 50.00%. Comparisons of the baseline demographics and phenotypic information between the three cohorts are presented in Table 1. The raw data of training set, external validation set-1 and external validation set-2 can be found in the Supplementary Table 13, respectively.

Independent Predictive Indicators Selection

A univariate Logistic regression analysis was performed to screen for potential diagnosis-related indicators. Of the 265 subjects in the training cohort, 124 were genetically diagnosed using the trio-WES test. As shown in Table 2, compared with patients with negative trio-WES outcomes, patients with positive trio-WES outcomes appeared to have severe-profound GDD/ID, multiple neurodevelopmental comorbidities, ASD, HCA, and BM more often (all P < 0.05), which indicated that the five variables may have potential positive associations with a high possibility of receiving a genetic diagnosis via trio-WES.

Following the univariate logistic-based selection, those five phenotypic indicators were incorporated into the multivariable logistic regression model. As shown in Table 2, GDD/ID severity (mild to moderate vs. severe to profound), comorbidity condition (single vs. multiple), and HCA (yes vs. no) were strong independent predictors of a positive trio-WES diagnosis (all P < 0.05). In contrast, ASD and BM were not independently associated with positive trio-WES diagnosis (both P > 0.05). Finally, we excluded ASD and BM from our multivariable logistic regression model for model adjustment, and the results showed that the three final-included variables (GDD/ID severity, OR 7.683, 95% CI 4.087–14.439, P < 0.001; comorbidity condition, OR 2.897, 95% CI 1.383–6.071, P = 0.005; HCA, OR 3.893, 95% CI 1.905–7.958, P < 0.001) were all independently associated with positive trio-WES outcomes in children with s-NDDs (Table 2).

Moreover, the three final-incorporated predictors were further analyzed using a collinearity diagnosis to exclude potential interdependencies among them. As shown in Table 3, the tolerances of the three predictors (ranging from 0.792 to 0.802) were all >0.2, while their VIF values (ranging from 1.015 to 1.263) were all <5, indicating that there were no significant interdependencies among the three phenotypic predictors.

Finally, a regression model was established to predict the genetic outcomes of trio-WES for diagnosing children with s-NDDs. The constructed model formula was as follows: logit (P) = 2.039 × GDD/ID severity + 1.064 × comorbidity condition + 1.359 × HCA − 1.556. The full regression coefficients (such as β, SE, and Wald) of the model are listed in Table 4.

Nomogram Construction and Evaluation of Model Discriminatory Performance

A prediction nomogram model (Figure 2A) and its related points system (“Nomogram points” column in Table 4) was visualized using R packages (“rms” and “rmda”) based on three independent phenotypic indicators (GDD/ID severity, comorbidity condition, and HCA) identified by multivariate logistic regression analysis. For each predictor, a point is calculated by drawing a vertical line from the corresponding answer to the point bar at the top of the diagram. The points for all predictors were aggregated into total points. The total point was subsequently converted into the probability (p) of obtaining a trio-WES diagnosis by drawing a vertical line upward from the total point bar to the probability bar. The nomogram plot provided an estimate of the probability of obtaining positive trio-WES outcomes via the point contributions of the three independent predictors and can assist decision-making in using trio-WES in the initial diagnostic strategy of an s-NDDs individual.

A nomogram and a stacked bar chart showing trio WES diagnosis probability and outcomes by score.

Figure 2 Logistic regression model visualized by the nomogram and the evaluation of the nomogram scoring system across 265 individuals in the training set. (A) There are two parts in the nomogram diagram: the upper part (from “Points” to the last variable, “HCA”) showing the calculation of points for each incorporated variables, and the lower part (From “Total points” to “Probabilities of obtaining a genetic diagnosis by trio-WES”) showing the probability of getting a genetic result via trio-WES testing. (B) Bar charts showing the percentage distributions of individuals having positive genetic diagnosis at each score in training cohort.

Abbreviations: GDD/ID, global developmental delay/intellectual disability; HCA, head circumference abnormality; trio-WES, trio-based whole-exome sequencing.

According to the aggregate points across all subjects in the training set, these cases could be classified into different subgroups. As shown in Figure 2B, the bar chart revealing the percentage of subjects who had positive trio-WES outcomes in subgroups with different aggregate points across the 265 cases in the training cohort showed that a higher aggregate point was consistent with a higher percentage of subjects with a positive trio-WES result, indicating the robust discriminatory power of the nomogram scoring system we constructed.

Internal Evaluation and Validation of the Nomogram Performance

As indicated in Figure 3A, the calibration curve with the H-L test demonstrated that both the nomogram predictions from one-time calculation (apparent line) and 1000-times bootstrap-resampling (bias-corrected line) were closely aligned with the diagonal line in the training set with χ2 = 1.175, df = 4, and P value = 0.882 from the H-L test, indicating excellent consistency between the predicted probabilities and the actual observed values. As shown in Figure 3B, ROC plots showed an AUC of 0.827 (95% CI: 0.775–0.879), indicating that the model had good predictive ability in the training cohort. Meanwhile, according to the optimal cutoff point that corresponded to the highest values of sensitivity and specificity in the ROC plot in the training cohort, the maximal Youden index was calculated to be 0.562, and its corresponding cutoff total point in the nomogram scoring system was 52, which generated a confusion matrix with the optimal values of sensitivity, specificity, accuracy, precision, and F1 score being 78.23%, 78.01%, 78.11%, 75.78%, and 0.77, respectively, in the training set (Table 5). These results further revealed that the nomogram model had a robust discriminatory ability to predict the genetic results of trio-WES in diagnosing patients with s-NDDs. Moreover, within a relevant threshold range of trio-WES diagnostic rate (approximately 0.18 ~ 0.50), DCA and CIC analyses demonstrated that using this nomogram for cases from the training cohort could obtain a greater net benefit compared with treat-none and treat-all diagnostic strategies. When the diagnostic rate threshold <0.18, using this model could also obtain a greater net benefit compared with treat-none diagnostic strategy and equal to the treat-all diagnostic strategy (Figure 3C and D), indicating that the predictive model had promising clinical application.

Four plots showing calibration, receiver operating characteristic, decision curve and clinical impact curves.

Figure 3 Discriminatory performance of the nomogram in the training cohort. (A) Calibration plot. (B) ROC curve. (C) DCA and (D) CIC for assessment of the predicted clinical utility and impact of the nomogram in s-NDDs individuals of the training set.

Abbreviations: AUC, area under the ROC curves; 95% CI, 95% confidence interval; ROC, receiver operating characteristic; DCA, decision curve analysis; CIC, clinical impact curve; s-NDDs, syndromic neurodevelopmental disorders.

Finally, 10-fold-cross and 1000-times bootstrap-resampling analysis was used to internally verify the performance stability of the nomogram in the training set. As shown in Figure 4, the C-index values were 0.824 (95% CI: 0.757–0.879) and 0.821 (95% CI: 0.757–0.886), based on 10-fold-cross validation (Figure 4A) and 1000-times bootstrap validation (Figure 4B), suggesting that the predictive model had excellent stability and did not overfit.

Two plots showing C-index results from resampling and a bootstrap distribution histogram.

Figure 4 Internal verification of the robustness performance of the model across the training set. (A) 10-fold-cross validation revealing the model had excellent stable consistence with C-index (95% CI) being 0.824 (0.757–0.886). (B) 1000-time resampling bootstrap validation further verifying the model had good stability with C-index (95% CI) being 0.821 (0.757–0.886).

Abbreviations: C-index, concordance index; 95% CI, 95% confidence interval.

External Evaluation of the Model’s Generalization Performance

According to the optimal cutoff total point (52) calculated in the training set, the 39s-NDDs cases in the external validation cohort-1 from SSMC could be transformed into two subgroups (model-predicted positive trio-WES subgroup: 20 subjects and model-predicted negative trio-WE subgroup: 18 subjects). As illustrated in Figure 5A, the calibration curve with the H-L test showing both the model predictions from one-time calculation (apparent line) and 1000-times bootstrap-resampling (Bias-corrected line) were in good agreement with the diagonal line in the transformed external cohort with χ2 = 1.047, df = 4, and P value = 0.903. Additionally, the ROC plots (Figure 5B) revealed an AUC value of 0.953 (95% CI: 0.881–0.998). DCA and CIC analysis (Figure 5C and D) showed significant net benefits could be obtained in the transformed external cohort-1 using the nomogram within the threshold range of trio-WES diagnostic rate (approximately 0.10~0.50). Finally, based on the transformed data in the external cohort-1, the confusion matrix analysis showed that the sensitivity, specificity, accuracy, precision, and F1 score of the nomogram in the external validation cohort-1 were 88.89%, 80.00%, 84.21%, 80.00%, and 0.84, respectively (Table 5). Similarly, the calibration curve with the H-L test (χ2: 1.399, df: 4, and P value: 0.844), ROC curve (AUC = 0.910 with 95% CI = 0.843–0.978) and DCA/CIC (obtaining a greater net benefit compared with treat-none and treat-all diagnostic strategies within the diagnostic rate threshold range of 0.10~0.50, approximately) were also performed in the external validation set-2 (Figure 6), and a confusion matrix analysis revealed the sensitivity, specificity, accuracy, precision, and F1 score of the model in the external validation cohort-2 were 83.33%, 85.45%, 84.54%, 81.40% and 0.82, respectively. All these results from two independent external cohorts confirmed that the nomogram model exhibited good generalization ability and stable repeatability in other external sets.

Four plots evaluating nomogram performance: calibration, ROC, decision curve and clinical impact.

Figure 5 Discriminatory performance of the nomogram in the external validation cohort-1. (A) Calibration plot. (B) ROC curve. (C) DCA and (D) CIC for assessment of the predicted clinical utility and impact of the nomogram in s-NDDs individuals of an independent validation set.

Abbreviations: AUC, area under the ROC curves; 95% CI, 95% confidence interval; ROC, receiver operating characteristic; DCA, decision curve analysis; CIC, clinical impact curve; s-NDDs, syndromic neurodevelopmental disorders.

Graphs: model calibration, discrimination, decision curve, clinical impact for s-NDDs patients.

Figure 6 Discriminatory ability of the model in the external validation cohort-2. (A) Calibration plot. (B) ROC curve. (C) DCA and (D) CIC for evaluation of the predicted clinical utility and impact of the model in s-NDDs patients of another external validation cohort.

Abbreviations: AUC, area under the ROC curves; 95% CI, 95% confidence interval; ROC, receiver operating characteristic; DCA, decision curve analysis; CIC, clinical impact curve; s-NDDs, syndromic neurodevelopmental disorders.

Discussion

With the rapid popularization of next-generation sequencing technology in clinical practice, genetic components are being found more often than before in many individuals with idiopathic syndrome involving developmental abnormalities, including intellectual disability.21 The diagnostic patterns of clinical genetics has been obviously changed since the popularization of trio-WES; trio-WES can bring us an efficient way to obtain a genetic diagnosis for many idiopathic disorders, preventing “diagnostic odysseys” experienced for many affected individuals and their parents.22,23 However, we should be aware of the technical limitations of trio-WES (exon-level sequencing only), along with the complexity of the human genomic abnormalities (like deep intronic or non-coding variants) might inevitably hinder the diagnostic power of this technology. Whole genome sequencing (WGS) might definitely provide advantages over trio-WES via detecting both variants at exon-level and non-exon-level. However, due to its high-priced screening and more intense work requirement, the popularization of WGS in clinical practice is currently restricted in many developing countries, including China. Nowadays, trio-WES technology still be considered as the first-tier genetic diagnosis approach in the world.24 Thus, it is insightful to develop effective tool to assist the decision-making of using trio-WES in diagnostic strategy of idiopathic syndrome, like s-NDDs.

In this study, by carefully and objectively collecting the phenotypic data that can potentially reflect neurodevelopmental abnormalities in children with s-NDDs, the global diagnostic yield of trio-WES in s-NDDs patients from the training and external validation cohort was 46.79% and 50.00%, respectively, which were both higher than those recorded in previous studies of individuals with GDD/ID alone with trio-WES detection (27% ~39%).25,26 Our findings emphasize the importance of key phenotypic factors enrichment in enhancing diagnostic rate of using trio-WES, and reinforce the previous concept of phenotype-driven strategy27 in genetic diagnosis, which is the core principle of the “DeepRare”, an AI system powered by large language models accepted by Nature in 2025.28 We therefore speculate on the possibility of developing a model for predicting the probability of positive trio-WES results by enriching core phenotypic signifiers that can well-reflect neurodevelopmental conditions in s-NDDs patients.

Among the three phenotypic indicators (GDD/ID severity, Comorbidity condition, and HCA) identified in current work, the strongest association with positive genetic results of trio-WES in s-NDDs patients was severe-profound GDD/ID (OR: 7.683, 95% CI: 4.087–14.439), followed by the existence of HCA (OR: 3.893, 95% CI: 1.905–7.958) and multiple comorbidities (OR: 2.897, 95% CI: 1.383–6.071). These findings indicated that severe neurodevelopmental delays, craniofacial anomalies and complicated neurodevelopmental comorbidities coexistence can share overlapping genetic backgrounds directly pointing to monogenic neurodevelopmental disorders, thus bringing higher diagnostic yield of trio-WES into patients with these three strong signifiers. As previous studies revealed, alterations in genes related to the functions of gene-expression regulation (mainly including chromatin remodeling, histone modification and DNA methylation) and neuronal communication can cause severe neurodevelopmental delays with a broad spectrum of neurodevelopmental comorbidities, like ASD and ADHD.29 Meanwhile, in the craniofacial development, the process of crosstalk between neural crest cells and craniofacial ectoderm matrix plays an essential role in craniofacial patterning and morphogenesis, and the formation of neural crest requires complicated epigenetic modifications.30 Disruptions in genes associated with chromatin remodeling, histone modification, DNA methylation and neuronal communication can severely impair the normal development of neural crest, resulting in a series of craniofacial anomalies, among which HCA is the prominent one.31 Given these shared gene-functional abnormalities (dysfunctions of gene-expression regulation and neuronal communication) among phenotypes of severe neurodevelopmental delays, craniofacial anomalies and complicated neurodevelopmental comorbidity conditions, it is reasonable to speculate that an s-NDDs patient presenting with more of these phenotypic signifiers can have more probabilities of having genetic background, and thus can obtain a positive genetic result more easily by receiving trio-WES test.

Importantly, by combining the three easily assessed phenotypic signifiers that independently links to the genetic causes in s-NDDs subjects, we successfully constructed and validated a nomogram showing good precision (training set: 75.78%, validation set-1: 80.00%, and validation set-2: 81.40%) and accuracy (training set: 78.11%, validation set-1: 84.21% and validation set-2: 84.54%) in the prediction of genetic results of using trio-WES in s-NDDs patients, which indicates promising clinical applications. However, this work has some limitations. First, due to the lack of established and Chinese version scales for emotional disorders, such as childhood anxiety disorder, and sleep disturbances in young children, we cannot include those conditions or manifestations into candidate phenotypic indicators, which may inevitably bring potential confounding bias into the model, more candidate variables, such as childhood anxiety disorder and sleep disorder, are required in the future researches. Second, with the continue updates of ACMG interpretation frameworks and reference databases or the probands’ phenotypic information expanding with age, a very small part of variants that assessed as uncertain significance previously can be potentially re-assessed as pathogenic/likely pathogenic, which may bring potential measurement bias into the model. However, due to the nature limitations of the retrospective work, we cannot retrospectively change the original trio-WES genetic reports that had already been made at a previous fixed time point. Continuous updates on the variants interpretation frameworks and reference databases and phenotypic information of grow-up patients should be considered in the future studies. Third, due to the sample size for the external validation cohort-1 (n = 38) and external validation cohort-2 (n = 97) are all relatively small, which might affect statistical power of the model and limit its generalizability. Further validation with larger external cohorts is required. Finally, this pilot study was based on a retrospective analysis, leaving the model’s performance in a prospective research undetermined.

Conclusion

In summary, GDD/ID severity, comorbidity condition, and HCA are independent phenotypic signifiers linked to positive trio-WES diagnoses in children with s-NDDs. The nomogram model based on them showed good calibration, discrimination, and clinical applicability, providing an accurate and effective tool for clinicians to identify children with s-NDDs who have a high probability of obtaining a genetic diagnosis via trio-WES in a timely manner, and for affected families to estimate their personalized diagnostic probability of receiving trio-WES test at the pre-diagnosis stage. However, it should be noted that due to the established model based on three medical centers with a small number of enrolled cases (training cases: 265; validation cases: 38+97), the statistical power and the robustness of the model is limited. The overall conclusions still require confirmation through studies with larger sample sizes and more complete data. Nonetheless, the current findings can provide a reference for the identification of cases with higher probability of obtaining genetic diagnosis via trio-WES in clinical practice.

Data Sharing Statement

The data supporting this study’s findings are presented in this article. Further inquiries can be directed to the corresponding author (Dr. Wenting Tang) via Email with reasonable requests.

Ethics Approval and Consent to Participate

All procedures of the study were performed in accordance with the Declaration of Helsinki and approved by the Ethical Committee of the Sun Yat-sen Memorial Hospital (Approval Number: SYSKY-2025-100-01) and the Shenshan Medical Center (Approval Number: 2025-SSKY-111-01), respectively. Written informed consent was obtained from all enrolled subjects’ parents or guardians.

Acknowledgments

We thank all the children and parents for their participation in this study. Additionally, we acknowledge the use of Digital (AI) Language Editing Service for assistance in identifying and correcting grammatical and spelling errors in the first submission of this manuscript. All subsequent revisions, including content editing and interpretation, were performed manually by the authors.

Author Contributions

All authors made a significant contribution to the work reported, whether that is in the conception, study design, execution, acquisition of data, analysis and interpretation, or in all these areas; took part in drafting, revising or critically reviewing the article; gave final approval of the version to be published; have agreed on the journal to which the article has been submitted; and agree to be accountable for all aspects of the work.

Funding

This work was supported by the grants from the Science and Technology Program (Social Development Field) Project of Shanwei, Guangdong Province, China (Grant No. 2025C019) and the Administration of Traditional Chinese Medicine of Guangdong Province, China (Grant No. 20251414).

Disclosure

The authors declare no conflicts of interest in this work.

References

1. First MB. Diagnostic and statistical manual of mental disorders, 5th edition, and clinical utility. J Nerv Ment Dis. 2013;201(9):727–17. doi:10.1097/NMD.0b013e3182a2168a

2. Wu R, Li X, Meng Z, Li P, He Z, Liang L. Phenotypic and genetic analysis of children with unexplained neurodevelopmental delay and neurodevelopmental comorbidities in a Chinese cohort using trio-based whole-exome sequencing. Orphanet J Rare Dis. 2024;19(1):205. doi:10.1186/s13023-024-03214-w

3. Hiraide T, Yamoto K, Masunaga Y, et al. Genetic and phenotypic analysis of 101 patients with developmental delay or intellectual disability using whole-exome sequencing. Clin Genet. 2021;100(1):40–50. doi:10.1111/cge.13951

4. Bertelli MO, Munir K, Harris J, Salvador-Carulla L. “Intellectual developmental disorders”: reflections on the international consensus document for redefining “mental retardation-intellectual disability” in ICD-11. Adv Ment Health Intellect Disabil. 2016;10(1):36–58. doi:10.1108/AMHID-10-2015-0050

5. Hanly C, Shah H, Au PY, Murias K. Description of neurodevelopmental phenotypes associated with 10 genetic neurodevelopmental disorders: a scoping review. Clin Genet. 2021;99(3):335–346. doi:10.1111/cge.13882

6. Chen S, Xiong J, Chen B, et al. Autism spectrum disorder and comorbid neurodevelopmental disorders (ASD-NDDs): clinical and genetic profile of a pediatric cohort. Clin Chim Acta. 2022;524:179–186. doi:10.1016/j.cca.2021.11.014

7. Srour M, Shevell M. Genetics and the investigation of developmental delay/intellectual disability. Arch Dis Child. 2014;99(4):386–389. doi:10.1136/archdischild-2013-304063

8. Srivastava S, Love-Nichols JA, Dies KA, et al. Meta-analysis and multidisciplinary consensus statement: exome sequencing is a first-tier clinical diagnostic test for individuals with neurodevelopmental disorders. Genet Med. 2019;21(11):2413–2421. doi:10.1038/s41436-019-0554-6

9. Bowling KM, Thompson ML, Amaral MD, et al. Genomic diagnosis for children with intellectual disability and/or developmental delay. Genome Med. 2017;9(1):43. doi:10.1186/s13073-017-0433-1

10. Wu R, Li X, Chen Z, et al. Development and validation of a nomogram based on common biochemical indicators for survival prediction of children with high-risk neuroblastoma: a valuable tool for resource-limited hospitals. BMC Pediatr. 2023;23(1):426. doi:10.1186/s12887-023-04228-2

11. Wu R, Luo X, He Z, Meng Z, Tang W, Liang L. Predicting the diagnostic efficacy of trio-based whole exome sequencing in children with low-function autism spectrum disorders: a multicenter study. Front Neurol. 2025;16:1597588. doi:10.3389/fneur.2025.1597588

12. Gao T, Yang L, Zhou J, et al. Development and validation of a nomogram prediction model for ADHD in children based on individual, family, and social factors. J Affect Disord. 2024;356:483–491. doi:10.1016/j.jad.2024.04.069

13. Harrison JE, Weber S, Jakob R, Chute CG. ICD-11: an international classification of diseases for the twenty-first century. BMC Med Inform Decis Mak. 2021;21(Suppl 6):206. doi:10.1186/s12911-021-01534-6

14. Scheffer IE, Berkovic S, Capovilla G, et al. ILAE classification of the epilepsies: position paper of the ILAE commission for classification and terminology. Epilepsia. 2017;58(4):512–521. doi:10.1111/epi.13709

15. Richards S, Aziz N, Bale S, et al. Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genet Med. 2015;17(5):405–424. doi:10.1038/gim.2015.30

16. Riggs ER, Andersen EF, Cherry AM, et al. Technical standards for the interpretation and reporting of constitutional copy-number variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics (ACMG) and the Clinical Genome Resource (ClinGen). Genet Med. 2020;22(2):245–257. doi:10.1038/s41436-019-0686-8

17. Jangir D, Joshi A, Thrigulla SR, Kumar A, Mukerji M. A systems perspective on rare diseases: integrating human phenotype ontology with the Anukta framework of Ayurveda. J Ayurveda Integr Med. 2026;17(2):101322. doi:10.1016/j.jaim.2026.101322

18. Tang W, Shao Q, He Z, Zhang X, Li X, Wu R. Clinical significance of nonerythrocytic spectrin Beta 1 (SPTBN1) in human kidney renal clear cell carcinoma and uveal melanoma: a study based on Pan-Cancer Analysis. BMC Cancer. 2023;23(1):303. doi:10.1186/s12885-023-10789-3

19. Wu R, Luo X, Meng Z, Tang W, Liang L. Establishment and validation of a phenotype-driven predictive model for the diagnostic efficacy of trio-based whole exome sequencing (trio-WES) in children with genetic neurodevelopmental disorders (g-NDDs). Front Neurol. 2025;16:1574021. doi:10.3389/fneur.2025.1574021

20. Wu R, Tang W, Qiu K, et al. An integrative pan-cancer analysis of the prognostic and immunological role of casein kinase 2 alpha protein 1 (CSNK2A1) in human cancers: a study based on bioinformatics and immunohistochemical analysis. Int J Gen Med. 2021;14:6215–6232. doi:10.2147/IJGM.S330500

21. Pan X, Qian G, Liu L, et al. Genetic analysis and reporting from whole-exome sequencing data in 1052 patients with intellectual disability. J Mol Med. 2026;104(1). doi:10.1007/s00109-026-02652-2.

22. Bertier G, Hétu M, Joly Y. Unsolved challenges of clinical whole-exome sequencing: a systematic literature review of end-users’ views. BMC Med Genomics. 2016;9(1):52. doi:10.1186/s12920-016-0213-6

23. Niguidula N, Alamillo C, Shahmirzadi Mowlavi L, Powis Z, Cohen JS, Farwell Hagman KD. Clinical whole-exome sequencing results impact medical management. Mol Genet Genomic Med. 2018;6(6):1068–1078. doi:10.1002/mgg3.484

24. Taylor J, Craft J, Blair E, et al. Implementation of a genomic medicine multi-disciplinary team approach for rare disease in the clinical setting: a prospective exome sequencing case series. Genome Med. 2019;11(1):46. doi:10.1186/s13073-019-0651-9

25. Anazi S, Maddirevula S, Faqeih E, et al. Clinical genomics expands the morbid genome of intellectual disability and offers a high diagnostic yield. Mol Psychiatry. 2017;22(4):615–624. doi:10.1038/mp.2016.113

26. Monroe GR, Frederix GW, Savelberg SM, et al. Effectiveness of whole-exome sequencing and costs of the traditional diagnostic trajectory in children with intellectual disability. Genet Med. 2016;18(9):949–956. doi:10.1038/gim.2015.200

27. Marinakis NM, Svingou M, Veltra D, et al. Phenotype-driven variant filtration strategy in exome sequencing toward a high diagnostic yield and identification of 85 novel variants in 400 patients with rare Mendelian disorders. Am J Med Genet A. 2021;185(8):2561–2571. doi:10.1002/ajmg.a.62338

28. Zhao W, Wu C, Fan Y, et al. An agentic system for rare disease diagnosis with traceable reasoning. Nature. 2026;651(8106):775–784. doi:10.1038/s41586-025-10097-9

29. Iwase S, Bérubé NG, Zhou Z, et al. Epigenetic Etiology of Intellectual Disability. J Neurosci. 2017;37(45):10773–10782. doi:10.1523/JNEUROSCI.1840-17.2017

30. Hu N, Strobl-Mazzulla PH, Bronner ME. Epigenetic regulation in neural crest development. Dev Biol. 2014;396(2):159–168. doi:10.1016/j.ydbio.2014.09.034

31. Wang C, Zhou W, Zhang L, et al. Diagnostic yield and novel candidate genes for neurodevelopmental disorders by exome sequencing in an unselected cohort with microcephaly. BMC Genomics. 2023;24(1):422. doi:10.1186/s12864-023-09505-z

Creative Commons License © 2026 The Author(s). This work is published and licensed by Dove Medical Press Limited. The full terms of this license are available at https://www.dovepress.com/terms and incorporate the Creative Commons Attribution - Non Commercial (unported, 4.0) License. By accessing the work you hereby accept the Terms. Non-commercial uses of the work are permitted without any further permission from Dove Medical Press Limited, provided the work is properly attributed. For permission for commercial use of this work, please see paragraphs 4.2 and 5 of our Terms.