Designing weighted correlation kernels in convolutional neural networks for functional connectivity based brain disease diagnosis

https://doi.org/10.1016/j.media.2020.101709Get rights and content

Highlights

  • A new wc-kernel to measure the correlation of brain regions, with weights learned in a data-driven manner to characterize specific contributions of different time points, is proposed.

  • A unified wc-kernel-based CNN framework to define functional connectivity networks and extract hierarchical connectivities for disease diagnosis is developed.

  • Achieving an accuracy of 84.6%, 88.0% and 57.0 for eMCI (early MCI) vs. HC (healthy control), AD vs. HC, and AD vs. lMCI (later MCI) vs. eMCI vs. HC classifications, respectively.

Abstract

Functional connectivity networks (FCNs) based on functional magnetic resonance imaging (fMRI) have been widely applied to analyzing and diagnosing brain diseases, such as Alzheimer’s disease (AD) and its prodrome stage, i.e., mild cognitive impairment (MCI). Existing studies usually use Pearson correlation coefficient (PCC) method to construct FCNs, and then extract network measures (e.g., clustering coefficients) as features to learn a diagnostic model. However, the valuable observation information in network construction (e.g., specific contributions of different time points), as well as high-level and high-order network features are neglected in these studies. In this paper, we first define a novel weighted correlation kernel (called wc-kernel) to measure the correlation of brain regions, by which weighting factors are learned in a data-driven manner to characterize the contributions of different time points, thus conveying the richer interaction information among brain regions compared with the PCC method. Furthermore, we build a wc-kernel based convolutional neural network (CNN) (called wck-CNN) framework for learning the hierarchical (i.e., from local to global and also from low-level to high-level) features for disease diagnosis, by using fMRI data. Specifically, we first define a layer to build dynamic FCNs using our proposed wc-kernels. Then, we define another three layers to sequentially extract local (brain region specific), global (brain network specific) and temporal features from the constructed dynamic FCNs for classification. Experimental results on 174 subjects (a total of 563 scans) with rest-state fMRI (rs-fMRI) data from ADNI database demonstrate the efficacy of our proposed method.

Introduction

Accurate diagnosis of brain diseases, such as Alzheimer’s disease (AD) and its prodromal stage, i.e., mild cognitive impairment (MCI), is very important for early treatment and possible delay of disease progression. As a complex network, the human brain contains a large number of functional regions. The functional interaction among these regions provides an important insight into the basic mechanism and cognitive processes of the brain (Greicius et al., 2003). In the context of neuroimaging, functional magnetic resonance imaging (fMRI) provides a non-invasive way to map the functional interaction of the brain. Based on fMRI data, functional connectivity networks (FCNs), which characterize functional interactions among brain regions, have been applied to brain disease analysis, and thus helping us better understand the pathology of neurological disorders. Recently, FCNs constructed based on resting-state fMRI (rs-fMRI) have been widely applied in the task of automated brain diseases diagnosis (Guye, Bettus, Bartolomei, Cozzone, 2010, Jie, Zhang, Gao, Wang, Wee, Shen, 2014, Wee, Yap, Zhang, Denny, Browndyke, Potter, Welsh-Bohmer, Wang, Shen, 2012).

Up to now, studies on FCNs typically focus on two aspects: 1) traditional FCN, which usually implicitly assumes that functional connectivity is temporally stationary throughout recording period in fMRI (Sporns, 2011). However, these studies ignore the changing properties of FCNs over time. 2) dynamic FCN, which focuses on the temporal changes of functional connectivity between specific brain regions. Existing studies have suggested that the changing properties of functional connectivity over time may be associated with cognitive and vigilance state (Thompson, Magnuson, Merritt, Schwarb, Pan, McKinley, Tripp, Schumacher, Keilholz, 2013, Chang, Liu, Chen, Liu, Duyn, 2013), which are critical for better understanding the pathology of brain diseases (Hutchison, Womelsdorf, Allen, Bandettini, Calhoun, Corbetta, Della Penna, Duyn, Glover, Gonzalez-Castillo, et al., 2013, Zhang, Small, 2006, Kiviniemi, Vire, Remes, Elseoud, Starck, Tervonen, Nikkinen, 2011). Also, some studies have found that brain diseases (e.g., AD/MCI) are related with temporal changes of functional connectivities (Jones, Vemuri, Murphy, Gunter, Senjem, Machulda, Przybelski, Gregg, Kantarci, Knopman, et al., 2012, Wee, Yang, Yap, Shen, Initiative, et al., 2016).

In these previous studies, the FCNs are usually constructed by simply calculating the Pearson correlation coefficients (PCCs) between time series from brain regions, and then the low-level measures (e.g., clustering coefficients) are extracted from the constructed FCNs as features to train a diagnostic model (e.g., support vector machine, SVM). However, these studies have several critical limitations. First, during the construction of FCNs, they neglect the valuable observation information (e.g., specific contributions of different time points). Intuitively, different time points could have different contributions for characterizing interactions among brain regions. In fact, studies have demonstrated that the functional connectivity calculated at specific time points can extend the usual pairwise interaction measures both in information and interpretability (Tagliazucchi, Balenzuela, Fraiman, Chialvo, 2012, Tagliazucchi, Balenzuela, Fraiman, Montoya, Chialvo, 2011). Second, the high-level and high-order network features that could further improve the performance are also neglected by these previous studies in feature extraction step. In addition, since network construction, feature extraction and classification are separately performed, it could yield sub-optimal learning model, thus degrading the diagnosis performance.

To address these problems and motivated by recent successful applications of convolutional neural networks (CNNs) in the natural image analysis field, in this paper we first define a weighted correlation kernel (called wc-kernel) for calculating the correlation between brain regions by using learned weights to characterize the contributions of different time points. Compared with previous methods (e.g., PCC), our proposed wc-kernel can capture the specific contributions of different time points, thus conveying richer interaction information among brain regions. Furthermore, we build a wc-kernel based CNN (called wck-CNN) for constructing FCNs and extracting the hierarchical (i.e., from local and global, and also from low-level to high-level) high-order features for brain disease classification, using fMRI data. Fig. 1 shows the architecture of our proposed wck-CNN method. Specifically, we first define a layer to build dynamic FCNs using our proposed wc-kernels. Here, we build multiple dynamic FCNs using multiple wc-kernels, with each one reflecting temporal changes of FCNs, thus conveying richer dynamic information of brain network. Then, we build another three layers to sequentially extract local (brain region specific), global (brain network specific), and temporal high-level and high-order features from the constructed low-level and low-order functional connectivities for classification. We evaluate the proposed wck-CNN method on 174 subjects (a total of 563 scans) with rs-fMRI data from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database, 2 which includes 48 healthy controls (HC), 50 early MCI (eMCI), 45 late MCI (lMCI) and 31 AD. The experimental results suggest that our proposed method can not only improve the diagnostic performance compared with state-of-the-art methods, but also provide novel insights into the interaction patterns of brain activities and their changes in diseases.

The main contributions of this paper are four-fold. First, we define a wc-kernel to measure the correlation of brain regions, with weights learned in a data-driven manner to characterize specific contributions of different time points. The proposed wc-kernel is different from previous methods (e.g., PCC method) that equally treat all time points. Second, we build a unified wck-CNN framework for defining FCNs and extracting hierarchical connectivities for disease diagnosis based on rs-fMRI data. To the best of our knowledge, this is among the first attempt to define the correlation kernel in CNN for characterizing the interactions among brain regions, and to build a unified CNN framework for dynamic FCN construction and analysis using fMRI data. Third, we provide an implementation for performing inference on brain network data. Finally, we investigate the changing connectivity patterns in AD/MCI patients.

The rest of the paper is organized as follows. In Section 2, we briefly review the related studies. Then, we describe the data used in the experiments, and present our proposed method and learning framework in Section 3. In Section 4, we describe conducted experiments and present corresponding results. In Section 5, we present discussions regarding the experiments and results. Finally, we conclude this paper in Section 6.

Section snippets

Functional connectivity network

FCNs that characterize the temporal correlation among blood oxygen level dependent (BOLD) signals of brain regions have shown great potential for understanding the functional activities of both healthy and abnormal human brains. Numerous studies have applied FCNs to analyzing brain diseases, including AD (Wang et al., 2013) and MCI (Bai et al., 2011), and reported a series of abnormal connectivity linked to specific brain regions, including hippocampus (Bai, Zhang, Watson, Yu, Shi, Yuan, Zang,

Subjects and fMRI data preprocessing

In this study, we use a total of 174 subjects with rs-fMRI data from ADNI database, including 48 HCs, 50 early MCI (eMCI), 45 late MCI (lMCI) and 31 AD. There, respectively, include 154, 165, 145, 99 scans for HC, eMCI, lMCI and AD subject groups, covering nine possible stages (i.e., baseline, 6, 12, 24, 36, 48, 60, 72 and 84 months). There are 147 subjects with baseline scans, and other 27 subjects without baseline scan. Data acquisition is performed as follows: the image resolution is 2.29

Experimental settings

We perform two groups of experiments, including 1) two binary classification tasks, i.e., eMCI vs. HC and AD vs. HC classifications, 2) a multi-class classification task, i.e., AD vs. lMCI vs. eMCI vs. HC classification, by using a 5-fold cross-validation. Specifically, for each classification task, the set of subjects with baseline scan is (roughly) equivalently partitioned into five subsets. One subset is used as the testing set. The remaining four subsets and subjects without baseline scan

Significance of results

FCNs have been widely applied to diagnosing physiological and psychiatric disease, such as AD and MCI. To construct FCNs, the PCC method is usually used to measure the correlation among different brain regions (Aerts, Fias, Caeyenberghs, Marinazzo, 2016, Zhang, Cheng, Liu, Zhang, Lei, Yao, Becker, Liu, Kendrick, Lu, et al., 2016, Córdova-Palomera, Kaufmann, Persson, Alnæs, Doan, Moberget, Lund, Barca, Engvig, Brækhus, et al., 2017). The main disadvantage of the PCC method is treating different

Conclusion

In this paper, we define a novel wc-kernel for characterizing rich interaction information between brain regions. Different previous methods (e.g., PCC method) that equally treat all time points, our proposed wc-kernel measures the correlation of brain regions, with learned weights to characterize specific contributions of different time points. We further build a wc-kernel based CNN framework for dynamic FCN construction and analysis using fMRI data. Experimental results on the ADNI dataset

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgment

This study was supported by National Natural Science Foundation of China (nos. 61976006, 61573023, 61703301, 61902003), Foundation for Outstanding Young in Higher Education of Anhui, China (gxyqZD2017010), NGII Fund, China (no. NGII20190612), and AHNU Fundamental Research Funds (nos. 1708085MF145, 1808085MF171).

References (84)

  • M. Kudela et al.

    Assessing uncertainty in dynamic functional connectivity

    Neuroimage

    (2017)
  • C. Lian et al.

    Multi-channel multi-scale fully convolutional network for 3D perivascular spaces segmentation in 7t MR images

    Med. Image Anal.

    (2018)
  • Z. Liu et al.

    Altered topological patterns of brain networks in mild cognitive impairment and Alzheimer’s disease: a resting-state fMRI study

    Psychiatry Res.

    (2012)
  • H. Ni et al.

    Network analysis in detection of early-stage mild cognitive impairment

    Physica A

    (2017)
  • M.G. Preti et al.

    The dynamic functional connectome: state-of-the-art and perspectives

    Neuroimage

    (2017)
  • L. Qiao et al.

    Data-driven graph construction and graph learning: areview

    Neurocomputing

    (2018)
  • S. Sadaghiani et al.

    Functional interactions between intrinsic brain activity and behavior

    Neuroimage

    (2013)
  • W.W. Seeley et al.

    Neurodegenerative diseases target large-scale human brain networks

    Neuron

    (2009)
  • S.M. Smith et al.

    Network modelling methods for fMRI

    Neuroimage

    (2011)
  • E. Tagliazucchi et al.

    Spontaneous bold event triggered averages for estimating functional connectivity at resting state

    Neurosci. Lett.

    (2011)
  • N. Tzourio-Mazoyer et al.

    Automated anatomical labeling of activations in SPM using a macroscopic anatomical parcellation of the MNI MRI single-subject brain

    Neuroimage

    (2002)
  • K.R. Van Dijk et al.

    The influence of head motion on intrinsic functional connectivity MRI

    Neuroimage

    (2012)
  • J. Wang et al.

    Disrupted functional brain connectome in individuals at risk for Alzheimer’s disease

    Biol. Psychiatry

    (2013)
  • C.-Y. Wee et al.

    Enriched white matter connectivity networks for accurate identification of MCI patients

    Neuroimage

    (2011)
  • C.-Y. Wee et al.

    Identification of MCI individuals using structural and functional connectivity networks

    Neuroimage

    (2012)
  • C.-Y. Wee et al.

    Identification of MCI individuals using structural and functional connectivity networks

    Neuroimage

    (2012)
  • L. Xiang et al.

    Deep embedding convolutional neural network for synthesizing CT image from T1-weighted MR image

    Med. Image Anal.

    (2018)
  • H. Yao et al.

    Decreased functional connectivity of the amygdala in Alzheimer’s disease revealed by resting-state fmri

    Eur. J. Radiol.

    (2013)
  • A. Zalesky et al.

    Whole-brain anatomical networks: does the choice of nodes matter?

    Neuroimage

    (2010)
  • Y. Zhou et al.

    Abnormal connectivity in the posterior cingulate and hippocampus in early Alzheimer’s disease and mild cognitive impairment

    Alzheimer’s Dementia

    (2008)
  • H. Aerts et al.

    Brain networks under attack: robustness properties and the impact of lesions

    Brain

    (2016)
  • M.R. Arbabshirani et al.

    Classification of schizophrenia patients based on resting-state functional network connectivity

    Front. Neurosci.

    (2013)
  • L. Baldaçara et al.

    Cerebellar volume in patients with dementia

    Revista Brasileira de Psiquiatria

    (2011)
  • R.L. Buckner et al.

    The brain’s default network

    Ann. N. Y. Acad. Sci.

    (2008)
  • R.L. Buckner et al.

    Cortical hubs revealed by intrinsic functional connectivity: mapping, assessment of stability, and relation to Alzheimer’s disease

    J. Neurosci.

    (2009)
  • G.H.G. Catie Chang

    Time-frequency dynamics of resting-state brain connectivity measured with fMRI

    Neuroimage

    (2010)
  • G. Chen et al.

    Classification of Alzheimer disease, mild cognitive impairment, and normal cognitive status with large-scale network analysis based on resting-state functional MR imaging

    Radiology

    (2011)
  • A. Córdova-Palomera et al.

    Disrupted global metastability and static and dynamic brain connectivity across individuals in the Alzheimer’s disease continuum

    Sci. Rep.

    (2017)
  • P. Dvorak et al.

    Structured prediction with convolutional neural networks for multimodal brain tumor segmentation

    Proceeding of the Multimodal Brain Tumor Image Segmentation Challenge, the 18st International Conference on Medical Image Computing and Computer Assisted Intervention

    (2015)
  • M.D. Fox et al.

    The human brain is intrinsically organized into dynamic, anticorrelated functional networks

    Proc. Natl. Acad. Sci. U.S.A.

    (2005)
  • M.D. Greicius et al.

    Functional connectivity in the resting brain: a network analysis of the default mode hypothesis

    Proc. Natl. Acad. Sci.

    (2003)
  • M.D. Greicius et al.

    Default-mode network activity distinguishes Alzheimer’s disease from healthy aging: evidence from functional MRI

    Proc. Natl. Acad. Sci. U.S.A.

    (2004)
  • Cited by (48)

    • Unsupervised cross-domain functional MRI adaptation for automated major depressive disorder identification

      2023, Medical Image Analysis
      Citation Excerpt :

      We will further study the influence of feature alignment at different layers in Section 5.3. We compare our proposed UFA-Net with five conventional methods, i.e., (1) BC-SVM-N, (2) BC-SVM-G, (3) BC-SVM-N-G, (4) XGBoost (Shi et al., 2021b), and (5) MaLRR (Wang et al., 2019b), and also four deep learning methods, i.e., (6) Wck-CNN (Jie et al., 2020), (7) LSTM (Graves, 2012), (8) STNet (Wang et al., 2019a), and (9) DANN (Ganin and Lempitsky, 2015). (5) MaLRR (Wang et al., 2019b): This method develops a multi-site domain adaptation framework via low-rank representation (LRR) decomposition based on rs-fMRI signals.

    • Longitudinal study of early mild cognitive impairment via similarity-constrained group learning and self-attention based SBi-LSTM

      2022, Knowledge-Based Systems
      Citation Excerpt :

      However, they may miss the effect of BFCN. Another novel method uses weight correlation kernel model for effectively constructing a dynamic BFCN from rs-fMRI, which obtained quite impressive results [41]. However, this method does not consider longitudinal study.

    View all citing articles on Scopus
    1

    These authors contributed equally to this paper.

    View full text