Introduction

Schizophrenia, a severe psychiatric illness involving hallucinations, delusions and disorganized thinking, is one of the greatest causes of disability worldwide, affecting about 1% of the world’s population [1]. The neurobiology of this complex illness is not well understood. Neuroimaging studies have revealed distributed brain abnormalities in structure and function [2]. However, these findings have been inconsistent, potentially due to the effects of antipsychotic medication and long illness duration in many study samples [2]. Previous studies have primarily focused on volumetric and functional measures, but studying antipsychotic-naïve first-episode schizophrenia (FES) using alternative MRI approaches may be an important strategy for characterizing the neurobiological substrate and identifying useful biological markers.

Widespread structural brain abnormalities have been observed in individuals with schizophrenia. Among many reported brain abnormalities in schizophrenia, the most consistent are in the thalamo-cortical and default mode networks [2]. The macroscopic neuroimaging findings of structural brain abnormalities have been confirmed and to some extent explained by postmortem studies [3]. However, it is difficult to identify in vivo the histopathological alterations that underlie these macroscopic findings. Postmortem alterations point to changes that could be identified with MRI, but that which might be difficult to characterize with standard morphometric imaging. Techniques are needed with sufficient sensitivity to detect subtle neuropathology such as changes in the nature and concentration of macromolecules.

Magnetization transfer imaging (MTI) can provide neuropathologically relevant information in vivo unavailable by conventional T1- and T2-weighted MRI techniques [4]; this is conveniently and reproducibly characterized using the semi-quantitative magnetization transfer ratio (MTR). MTR signals are influenced by macromolecules in brain tissue, which mainly form neuronal cell membranes in gray matter and myelin in white matter [5] and is considered to be a more sensitive measure for distinguishing schizophrenia than conventional MRI measures [6,7,8]. Previous schizophrenia studies have found widespread MTR abnormality, e.g., in the frontal, temporal, and occipital regions [7,8,9,10,11,12,13,14,15,16]. However, the sample sizes of these prior studies were relatively small, and the majority of participants at the time of scanning were on antipsychotic medication, so it remains unclear whether the pattern of deficits reported is related to illness itself or to antipsychotic medications that are believed to impact structural alterations. Moreover, most have used region-of-interest methods which risk systematic placement bias [17,18,19,20].

Our study of a relatively large sample of individuals with antipsychotic-naïve FES used MTI to identify subtle biophysical alterations and their relation to symptom severity, and explored their potential diagnostic value. Given previous evidence of structural disruptions in schizophrenia [21, 22], we hypothesized (i) that FES would exhibit voxelwise brain MTR abnormalities and (ii) that these regional MTR alterations would be associated with symptom severity of schizophrenia. Additionally, given existing evidence that machine learning applied to structural images can identify individuals with schizophrenia [23, 24], we hypothesized that (iii) whole-brain MTR maps would distinguish individuals with FES from healthy controls (HCs) with significant classification accuracy.

Materials and methods

Participants

Individuals with antipsychotic-naïve FES were consecutively recruited from inpatient and outpatient programs of West China Hospital of Sichuan University from August 2005 to July 2014. The diagnosis of schizophrenia was established using the Structured Interview for the DSM-IV Axis I Disorder. Individuals with FES were recruited within days of their first hospitalization and scanned before beginning antipsychotic medication. The severity of schizophrenia symptoms was evaluated by an experienced clinical psychiatrist using the Positive and Negative Syndrome Scale (PANSS) [25]. The duration of untreated illness prior to scans was evaluated by the Nottingham Onset Schedule [26] based on information provided by the FES and their guardians and family members. We have previously reported structural MRI data from this cohort [27]: cortical thickness and surface area data from high-resolution T1-weighted MRI in 128 individuals with antipsychotic-naïve FES and 128 HCs.

HCs were recruited from the local area through poster advertisements during the same period and screened using the nonpatient edition of the Structured Interview for the DSM-IV Axis I Disorders to rule out lifetime psychiatric illness. They were from the same geographical region as the FES and had similar socioeconomic and educational backgrounds. There was no history of major psychiatric illness in their first-degree relatives. Exclusion criteria for both groups were as follows: any neurological disorder, another axis I psychiatric disorder including substance abuse or dependency, pregnancy, serious systemic illness, or MRI contraindication.

This retrospective study was approved by the Medical Research Ethics Committee of West China Hospital, Sichuan University, and written informed consent was obtained from all FES and HCs. All procedures in the present study adhered to the ethical standards of the Declaration of Helsinki and the ethical principles in the Belmont Report.

Image acquisition

MRI of each participant was performed with the same 3.0 tesla MRI system (GE Excite) with an 8-channel phased array head coil. Whole-brain MTI scans were acquired with a three-dimensional fast low-angle shot sequence with the following parameters: repetition time, 37 ms; echo time, 5 ms; field of view, 24×24 cm2; flip angle, 15°; matrix size, 512×512; voxel size, 0.47×0.47×3 mm3; 50 contiguous axial slices; section thickness, 3 mm. One acquisition was obtained with a magnetization saturation pulse at 1.5 kHz off-resonance, then another without this pulse to produce separate images with (MS) and without (M0) saturation.

Image preprocessing

MRIs were first reviewed by an experienced neuroradiologist to ensure that there were no gross structural abnormalities or quality flaws. Statistical Parametric Mapping software (SPM12) was used for data processing and analysis. For each patient and HC, MS and M0 images were first co-registered using a mutual information registration algorithm. The MTR was then calculated on a voxel-by-voxel basis as MTR = 100 × (M0–MS)/M0, where M0 and MS represent the signal intensities without and with the saturation pulse. As non-MTI images are partially T1-weighted, we directly normalized them to the Montreal Neurologic Institute T1-weighted template and then used the transformation parameters to normalize the coregistered MTR map. The normalized non-MTI images were skull-stripped using the brain extraction tool (http://www.fmrib.ox.ac.uk/fsl/bet/) and then applied as masks to remove nonbrain tissue in the normalized MTR maps. Finally, MTR maps were smoothed with a 6 mm full-width half-maximum 3D Gaussian kernel.

Statistical analysis

Demographic and clinical data analyses

Statistical group comparisons were performed using SPSS software (IBM Corp., IBM SPSS Statistics for Windows, version 23.0). Group differences in age and education level were assessed using two-sample t tests, while sex differences were compared using a chi-square test. Sample size was determined using neuro power tools (http://neuropowertools.org/), which indicated that 140 participants per group would provide statistical power of 0.8.

Voxel-based MTR analysis

MTR maps of individuals with FES were compared with those of HCs using two-sample t tests in SPM12. The significance of group differences was estimated by distributional approximations from the theory of random Gaussian fields. Familywise error correction was used to correct for multiple comparisons, and clusters with at least 40 voxels with nominal P < 0.05 were considered significant.

Clinical measures association analysis

Individual mean MTR values were extracted from regions with significant intergroup differences using the volume of interest method implemented in SPM12. Partial correlation analyses were conducted between regional mean MTR values and clinical indices: PANSS Positive, Negative and Total Scores, with age, sex, and education years as covariates. Multiple comparison correction was performed using the Bonferroni correction.

Whole-brain MTR maps classification analysis

To assess classification effectiveness, we used whole-brain MTR maps to distinguish individuals with FES and HCs based on a support vector machine, using a C-Support Vector Classification from the python module scikit-learn (version 0.24.2; https://scikit-learn.org/) with the implementation based on libsvm. Support vector machines is the most commonly used machine learning algorithm in neuroimaging [28, 29]. By finding the hyperplane maximizing the margin between binary classes in feature space, support vector machine can learn the classification strategy from the training set, be optimized and fine-tuned with the validation set, and make individual classification decisions in a test set. Our code is available at https://github.com/QKmeans0902/MTR_classification (version 1.0).

We first applied a brain mask on the whole-brain MTR maps to exclude MTR values outside the brain, and the effects of age and sex on model performance were controlled using a Gaussian process regression model based on HCs only [30]. Then MTR values of each patient were vectorized as features for model training. Ten-fold stratified cross-validation was performed to split the training and testing sets. In this method, the participants were divided into 10 non-overlapping partitions, each with the same proportion of FES and HCs. In each one of the ten iterations of the cross-validation, 9 partitions were used as the training set, then the trained classification model was used to obtain predictions in the remaining partition. A linear kernel was used to avoid the risk of overfitting. The sole hyperparameter C was determined via grid search on a set of values (i.e., [10−3, 10−2, 10−1, 1, 101, 102, 103]), and the grid search was performed using another nested 10-fold stratified cross-validation within the training set. Classification performance was examined based on accuracy, sensitivity, specificity and area under receiver operating characteristic curve (AUC) across 10 folds. Accuracy was determined as the percentage of correctly classified individuals among all study participants. For estimation of AUC, we plotted ROC curves showing classification performance at all classification thresholds according to true positive rate (i.e., sensitivity) and false positive rate (i.e., 1 – specificity). The AUC value was calculated as the area under the ROC curve to provide an aggregate measure of performance irrespective classification threshold selection. To estimate the significance for each ML model and data features, we performed a nonparametric permutation test to calculate a P value for the balanced accuracy [31]. After repeating the classification procedure 1000 times with different random permutations of the group labels, P is calculated as the number of times the balanced accuracy was higher for the permuted labels than the real labels, divided by 1000.

Results

Demographic and clinical characteristics

A total of 322 people were recruited for this retrospective study, with 18 declining to complete the study and 14 excluded based on their MTI results (Supplementary Fig. S1). Ultimately, 143 individuals with FES (mean age, 24 years ±8; 73 women; mean education years, 11.4 years ±3.5) and 147 HCs matched for age, sex, and education years (mean age, 24 years ±7; 74 women; mean education years, 11.7 years ±3.1) were included. Demographic and clinical characteristics of study participants are reported in Table 1. In individuals with FES mean illness duration was 11.1 months and PANSS Total Score was 97.6.

Table 1 Demographic and clinical characteristics of antipsychotic-naïve first-episode schizophrenia and healthy controls.

Voxel-wise MTI analysis

Compared with the HC group, the FES group exhibited higher MTR values in left hemisphere regions including the middle temporal gyrus (P = 0.001), inferior parietal lobule (P = 0.001), precuneus (P < 0.001), the cuneus (P = 0.001), paracentral lobule (P = 0.003), thalamus (P = 0.01), and cerebellum (P = 0.005) (familywise error corrected, cluster size > 40) (Fig. 1 and Table 2). There were no lower MTR values in FES compared to HC. Supplementary Fig. S2 shows the distribution of individual MTR values in the brain regions which differed between the FES and HC groups.

Fig. 1: MTR in voxel-based differences between individuals with first-episode schizophrenia and healthy controls.
figure 1

A Schematic illustration of MTR calculated on a voxel-by-voxel basis as MTR = 100 × (M0 – MS)/M0, where M0 and MS represent the signal intensities without and with the saturation pulse; B Significant clusters were identified in individuals with first-episode schizophrenia compared to healthy controls based on a voxel-level statistical threshold of P < 0.05 (family-wise error corrected, cluster size > 40). The color bar indicates T values. MTR magnetization transfer ratio, L left, CUN cuneus, PCUN precuneus, MTG middle temporal gyrus, IPL inferior parietal lobule, THA thalamus, CBL cerebellum, PCL paracentral lobule.

Table 2 Differences in magnetization transfer ratios between antipsychotic-naïve first-episode schizophrenia and healthy controls.

Correlations between the regional mean MTR and symptom severity

Correlations were observed between PANSS Total Scores and the mean MTR within the left precuneus (r = 0.34, P = 0.0004), left cuneus (r = 0.33, P = 0.0006), and left paracentral lobule (r = 0.37, P = 0.0001), as well as PANSS Paranoid Scores and the mean MTR within the left inferior parietal lobule (r = 0.34, P = 0.0005) (Fig. 2), which all survived Bonferroni correction (P = 0.05/[7 MTR findings × 11 clinical characteristics] = 0.0006). There were no significant correlations between other regional MTR values and schizophrenia symptom severity (all P > 0.05) (Supplementary Table S1).

Fig. 2: Scatter plots showing the correlations between PANSS total scores and regional mean MTR in first-episode schizophrenia.
figure 2

Partial correlations (presented as r) were conducted between regional mean MTR and PANSS score, with age, sex, and education years as covariates. Findings were significant (presented as P) after correction for multiple comparisons using the Bonferroni correction. MTR magnetization transfer ratio, PCUN precuneus, CUN cuneus, IPL inferior parietal lobule, PCL paracentral lobule, PANSS Positive and Negative Syndrome Scale.

Classification accuracy and performance

Overall, the support vector machine model distinguished individuals with antipsychotic-naïve FES from HCs based on whole-brain MTR maps with an accuracy of 75.5% (219 of 290 individuals, 95% CI: 73.0%–78.0%), sensitivity 65.7% (94 of 143 FES, 95% CI: 63.0%–68.4%) and specificity 85.0% (125 of 147 HC, 95% CI: 82.9%–87.1%) (Fig. 3A, all P ˂ 0.05). The area under the receiver operating characteristic curve was 0.82 (95% CI: 0.80–0.85) (Fig. 3B).

Fig. 3: Classification performance based on whole-brain MTR maps.
figure 3

A The confusion matrix of classification between FES and HC. B Receiver operating characteristic curve for a prediction model based on whole-brain MTR maps using support vector machine. MTR magnetization transfer ratio, FES first-episode schizophrenia, HC healthy controls, AUC area under receiver operating characteristic curve.

Discussion

Because medication effects and long illness duration may confound case-control MTI comparisons, we studied MTR in individuals with antipsychotic-naïve FES. We also evaluated the diagnostic value of MTR in schizophrenia. This cross-sectional study demonstrated regional microstructural MTR alterations in individuals with antipsychotic-naïve FES in left hemisphere regions including the middle temporal gyrus, inferior parietal lobule, precuneus, cuneus, paracentral lobe, thalamus, and the cerebellum. Furthermore, the regional MTR values in left precuneus and cuneus and paracentral lobule were correlated with overall clinical symptom severity. Classification models trained with MTR maps showed fair to good performance (area under the receiver operating characteristic curve 0.82).

MTR signals are influenced by brain macromolecules. A higher MTR can reflect a higher concentration of macromolecules, including myelin [32], which is important in axonal function [33]. Thus, the high MTR in these regions may suggest enhanced signal transmission in the left cerebral hemisphere that could reflect neocortical hyperexcitability related to disease or a compensation mechanism for the well-established reductions in intra- and interhemispheric connectivity in schizophrenia [34]. Of course, higher communication speed does not necessarily mean greater communication efficiency [35]. Furthermore, MTR depends not only on macromolecules but also T1 relaxation time of free water [32], so high MTR may reflect increased extracellular free water, which has been linked to inflammatory activity [36] - an important possibility, given the recent attention to neuroinflammation in schizophrenia [37]. Therefore, both myelination changes and neuroinflammation can affect the MTR signal, and future research is needed to test their contributions. Interestingly, our findings were all in the left cerebral hemisphere. This left-sided asymmetry has also been observed in diffusion tensor imaging [38]. Crow et al. [39], positing a “lateralization hypothesis of schizophrenia”, proposed that the left hemisphere may be more vulnerable to insult. This has not always been observed in MT studies but is consistent with the present findings.

Prior reports of higher MTRs confined to the uncinate and arcuate fasciculi used a tract-based region-of-interest method [34, 40, 41]. Our study investigated whole-brain differences in voxelwise analyses in FES, and found significantly elevated MTRs in multiple neocortical regions, thalamus and cerebellum, which suggests a much more widely distributed pattern of abnormalities. Some of these regions are key nodes of the default mode network [42], which is involved in self-referential processing and theory-of-mind functions [43, 44], and whose disruption has been linked to symptoms such as hallucination and paranoid ideation in schizophrenia [45]. Our findings are consistent with reports of increased neural activity of parietal [45] and occipital regions [46] as well as sensory hyperresponsiveness during acute illness [47]. The correlations between the parietal-occipital MTR alterations and PANSS scores support a critical role for microstructural disruptions of these regions in the development of symptoms. As noted above, a higher MTR could indicate greater neuronal remyelination, which might contribute to a volume increase [48]. Previous studies found a greater density of interstitial neurons in the parietal [49] and occipital areas [50] in individuals with schizophrenia, which is suggestive of neurodevelopmental abnormalities that might also contribute to the present findings.

Our FES displayed a higher MTR in the left middle temporal gyrus. fMRI studies in schizophrenia show hyperactivation in the middle temporal gyrus during auditory hallucinations [51]. However, a lower MTR in the middle temporal gyrus has been reported in medicated individuals with chronic schizophrenia [15]; this may reflect the difference between acute psychosis and persistent illness, and/or drug treatment effects. Diffusion tensor imaging studies in individuals with FES compared with HCs report lower FA in the left temporal areas [38]; this is not necessarily discrepant from our finding of higher MTR, because the two techniques measure different things: diffusion tensor imaging, responding to the orientation of water diffusion along anatomical compartments, reflects the spatial organization of fiber tracts, while MTR estimates tissue composition [14]. Future multimodal research combining both might help clarify specific white matter abnormalities in FES.

Higher MTR in the left thalamus, cerebellum and frontal regions in our FES support the findings of robust structural deficits across cerebellar-thalamo-cortical networks [2]. Previous studies showed that altered functional connectivity within prefrontal-thalamic-cerebellar circuitry is related to structural deficits [52, 53]. Our investigation extends this with novel evidence regarding the structural basis for this altered brain function.

The accuracy of case-control classification using the whole-brain MTR map was 75.5%. This is not alone sufficient for clinical application. However, MTI may have promise as part of multimodal MRI-based protocols for the identifying schizophrenia at the individual level. Although machine learning approaches are not yet ready for clinical use in psychiatry, in view of the need for objective diagnostic tests in the early stages of disease, they do offer promise for reducing misdiagnosis [28]. Improvement of classification may come from the biologically-based delineation of subgroups within the schizophrenia syndrome, and inclusion of multimodal imaging features in classification protocols [54].

Our study has limitations. First, no other imaging methods were used to verify and extend interpretation. Second, there are several ways of quantifying MTI [55]. We employed the classic MTR, used in many neuropsychiatric studies. MTR is sensitive to MRI pulse sequences and relaxation properties, making it difficult to define the specific neurobiology [48]. Emerging techniques such as the free-water imaging metric (not possible with the current imaging protocol) could provide information complementary to MTR [56]. Interpretation of our findings will require cell biology and pathology studies, and confirmation with other approaches for evaluating MT. Third, our machine learning analysis evaluated a single type of imaging modality data and one classification method. However, the models described in this paper will require replication in an independent dataset before any application in clinical decision making [57]. Finally, the cross-sectional study design limited inferences about individual changes over time and with antipsychotic treatment, which need to be examined in future longitudinal studies.

In conclusion, the present study demonstrates that MTR detects clinically relevant changes, probably reflecting macromolecular abnormalities, in the left hemisphere; these may represent an important component of the neuropathology close to the onset of schizophrenia, before antipsychotic treatment. These findings contribute to understanding the neurobiological basis of the complex clinical syndrome of schizophrenia. While not alone useful for clinical application, MTR mapping may be a useful addition to multimodal imaging profiling for diagnostic classification; this is a key aim of psychoradiology [58,59,60,61], the emerging radiological subspecialty guiding diagnosis and treatment decisions in neuropsychiatric disorders. Future research might usefully combine imaging markers as their characteristics and utility for classification are better understood, and consider different classification methods, such as Gaussian process classification and minimum spanning trees.