A machine learning-based classification approach on Parkinson’s disease diffusion tensor imaging datasets
Neurological Research and Practice volume 2, Article number: 46 (2020)
The presence of motor signs and symptoms in Parkinson’s disease (PD) is the result of a long-lasting prodromal phase with an advancing neurodegenerative process. The identification of PD patients in an early phase is, however, crucial for developing disease-modifying drugs. The objective of our study is to investigate whether Diffusion Tensor Imaging (DTI) of the Substantia nigra (SN) analyzed by machine learning algorithms (ML) can be used to identify PD patients.
Our study proposes the use of computer-aided algorithms and a highly reproducible approach (in contrast to manually SN segmentation) to increase the reliability and accuracy of DTI metrics used for classification.
The results of our study do not confirm the feasibility of the DTI approach, neither on a whole-brain level, ROI-labelled analyses, nor when focusing on the SN only.
Our study did not provide any evidence to support the hypothesis that DTI-based analysis, in particular of the SN, could be used to identify PD patients correctly.
Diffusion tensor imaging (DTI) has been proposed for analyzing microstructural integrity not only of white but also grey matter. However, the use of DTI to observe, e. g., subcortical grey matter changes is currently under debate . Whether microstructural alterations of the whole brain, regions of interests (ROI)-labeled grey matter, or the substantia nigra (SN) can be detected applying diffusion metrics in Parkinson’s disease patients (PD) is still unclear. The significance of several previous DTI studies in PD is limited due to small sample sizes and by the fact that specific regions of interests were delineated manually for the extraction of diffusion metrics. Besides, studies that were able to demonstrate significant group differences have also shown a relevant overlap of diffusion metrics between PD patients and healthy controls, which undermines the potential diagnostic use. Machine learning-based (ML) models might help to detect subtle alterations of diffusion metrics, by their multivariate nature and by the integration of different imaging modalities, and to improve their diagnostic use subsequently. The aforementioned practice also hindered the translation into clinical practice . Our study hypothesizes that ML algorithms and the application of a suitable sub-cortical atlas for the elderly population can be used to distinguish between PD patients and age- and gender-matched healthy controls in a standardized and therefore potentially more sensitive manner . Computing algorithms like binary support vector machines (bSVM) or multiple-kernel learning (MKL) provide suitable and promising tools to address classification problems based on neuroimaging data . Advancements in the multivariate interpretation of neuroimaging data have already been proven useful in a plethora of neuropsychiatric  and neurodegenerative diseases [11, 12]. Besides, the employment of machine-learning algorithms to Parkinson’s disease datasets has offered unique advancements in interpreting distinct neuroimaging modalities [3, 4, 20, 23]. MKL also yields the opportunity to concatenate different imaging modalities. This is of particular interest as distinct diffusion metrics are meant to resemble different histopathological hallmarks of neurodegeneration .
DTI datasets of 162 PD patients (age: 63.9 ± 9.3 years; gender: 34.2% female; disease duration: 6.5 ± 4.1 months; mean MDS-UPDRS-III: 13.9 ± 2.1; mean Hoehn and Yahr stadium: 1.2 ± 0.3) and 70 age and gender-matched healthy controls (HC) (age: 62.1 ± 10.1 years; gender: 34.9% female) were analyzed. This study used human subject recordings chosen from the Parkinson’s Progression Marker Initiative (PPMI) database. The PPMI dataset was published open-access with a positive ethics statement of the responsible authorities. Therefore, additional ethics committee approvals do not apply to this study. DTI-MR sequences were acquired on a Siemens 3 T TIM Trio scanner using a 12-channel matrix head coil and a two-dimensional echo-planar DTI sequence (TR/TE = 900/88 ms, flip angle = 90°, voxel size = 2 × 2 × 2 mm3, 72 slices, 64 gradient directions with a b-value of 1000 s/mm2). In addition, a non-gradient volume (b = 0 s/mm2) was acquired as well. Further details of the PPMI image acquisition protocol can be seen online (http://www.ppmi-info.org/wp-content/uploads/2017/06/PPMI-MRI-Operations-Manual-V7.pdf). We performed pre-processing by using the PANDA-toolbox (v1.3.1) in Matlab 2018b, including normalization to standard space (via FMRIB58_FA template, 2 mm × 2 mm × 2 mm voxel size) . In addition to conventional diffusion metrics (FA, MD, AD, and RD), we calculated local diffusion homogeneity (LDH) as another measure of microstructural white matter integrity. For the interpretation of DTI images, we calculated the following standard diffusion metrics based on the three-dimensional diffusion of water as a function of spatial location: Fractional Anisotropy (FA) is a summary measure for interpreting microstructural integrity. Mean Diffusivity (MD) is a measure of the cell membrane density. It is, therefore, sensitive for cellularity, edema, and necrosis of investigated tissue. Axial Diffusivity (AD) decreases in axonal injury. Radial Diffusivity (RD) increases in de- or dysmyelination of axons. A concise review article on the interpretability of diffusion metrics to investigate microstructural grey and white matter changes are described in a review article by Alexander et al. . Local diffusion homogeneity (LDH) is another diffusion metric that is specifically relevant to assess tissue homogeneity based on neighboring voxels . We computed LDH for 6, 18, and 26 neighboring voxels using Spearman’s Rank Correlation coefficient (06LDHs, 18LDHs, and 26LDHs) and Kendall’s coefficient concordance (06LDHk, 18LDHk, and 26LDHk) . Voxel-wise whole-brain analysis was performed using the FM-RIB58_FA template. We performed ROI-labeled analyses based on the well-established AAL atlas . To further increase the signal-to-noise ratio, we additionally performed classification after masking of the SN using the ATAG atlas for the elderly population . The datasets were classified through bSVMs (for single modalities) as well as MKL (for concatenated modalities). Ten-fold cross-validation (CV) and nested (leave one subject out) hyperparameter optimization as implemented in the PRoNTo-Toolbox (v2.1) . The determination of relevant bSVM and MKL parameters (such as the applied L1 regularization method or the nested hyperparameter optimization) is following standard practice and is extensively described in the publications of Schrouff et al. [17, 18]. Age, gender, and total intracranial volume were used as covariates. Balanced Accuracy (BA) and area under the curve of the receiver-operating characteristic curve (ROC-AUC) were calculated to assess classification performance and were compared to random permutation testing (against 10.000 permutations).
The application of the bSVM on the various types of diffusion metrics revealed that there are no significant differences concerning the BA or the ROC-AUC for voxel-wise whole-brain or AAL-based ROI-labeled analyses (data not shown here). As most studies suggest, diffusion metrics are most likely altered in the SN of PD patients, making the SN the region of highest interest to increase the signal-to-noise ratio for classification . Therefore, further analyses focused on the diffusion metrics of the masked SN and will be reported in the following (see Fig. 1). Again, there were no significant differences regarding BA or ROC-AUC: FA (BA: 47.83% ROC-AUC: 0.42); MD: (BA: 50.00%, ROC-AUC: 0.54); AD: (BA: 50.00%, ROC-AUC: 0.44); RD: (BA: 50.00%, ROC-AUC: 0.56); 06LDHs: (BA: 49.47%, ROC-AUC: 0.54); 18LDHs (BA: 56.64%, ROC-AUC: 0.57); 26LDHs (BA: 53.14%, ROC-AUC: 0.53); (BA: 55.03%, ROC-AUC: 0.58); 06LDHk (BA: 55.25%, ROC-AUC: 0.52); 18LDHk (BA: 53.14%, ROC-AUC: 0.53); 26LDHk (BA: 51.80%, ROC-AUC: 0.52). The results also indicate that the concatenation of diffusion metrics via MKL did not add any relevant information to improve the overall classification performance: FA + MD + AD+RD (BA: 49.44%, ROC-AUC: 0.41); 06LDHs + 18LDHs + 26LDHs (BA: 56.15%, ROC-AUC: 0.60); 06LDHk + 18LDHk + 26LDHk (BA: 58.12%, ROC-AUC: 0.52). An overview on provided diagnostic performances in displayed in the Table 1. The comparison to random permutation testing showed that the classifications, as mentioned above, did not outperform pure chance. Additionally, calculated weight maps are indicating a random weighting distribution of voxels within the SN used for the respective classifications (see Fig. 2), which is in contrast to previously reported changes of the dorsolateral portion of the SN (i. e., the nigrosome-1) .
In this study, we demonstrated a standardized and systematic approach to potentially attain the individual discrimination of PD patients from healthy controls using DTI datasets. This approach comprised the pre-processing of the data, the automatized selection of appropriate features, and the subsequent classification. Atkinson-Clement, Pinto, Eusebio, and Coulon  already stated that “[…] they did not observe a PD induced reduction of nigral FA” but also that “this observation is in contrast with some recent publications claiming very high diagnostic accuracy, but [are] well in line with other reports showing small or no PD induced nigral FA decrease”. A meta-analysis also did “not support nigral DTI metrics as a useful diagnostic marker of PD” . Our results are supporting the aforementioned lack of evidence and should put discussions about the diagnostic use of diffusion metrics in PD patients to rest. The negative results of our study most likely reflect the lacking suitability of diffusion metrics to investigate SN-related microstructural alterations in PD. The interpretation of our findings within the scope of differing DTI acquisition schemes and MRI scanner hardware is challenging. However, a multicenter validation study by the authors of Fox et al.  stated high intersite-concordance for applied DTI metrics on different scanner hardware (3 T magnetic field strength). ML-algorithms are a more standardizable and sensitive method to increase diagnostic accuracy and to disentangle the overlap of diffusion metrics other groups reported, which were only using voxel-wise mass-univariate or manually extracted diffusion metrics for subsequent analysis. The multivariate, compared to mass-univariate, approach and the additional concatenation of modalities should enhance the discriminatory, and therefore, diagnostic accuracy substantially. The lack of significant findings despite a larger sample size and a more sensitive and sophisticated approach in this study are further supporting the view that traditional diffusion metrics are indeed missing any diagnostic use. Whether DTI can be used to map individual disease progression remains, to this point, elusive. Further methodological improvements of diffusion-based imaging might improve diagnostic accuracy and might, therefore, cause a reconsideration of our current conclusion. However, the current MRI acquisition and analysis paradigms of DTI measures are not of any use for investigating grey matter alterations in PD. Further studies without substantial methodological improvements will most likely not result in potentially translatable advancements in improving diagnostic accuracy or patient care. Recent research studies which revealed that the use of free-water corrected diffusion maps for the analysis of tissue alterations might provide the opportunity for fostering the diagnostic accuracy based on this dataset . However, ML analyses of neuroimaging data is a fruitful approach in supporting clinical decision making and will be more frequently applied in the future . The objective of our study was to investigate the role of ML-based algorithms on diffusion metrics to identify PD patients correctly. Our study did not provide any evidence to support the hypothesis that DTI-based analysis, in particular of the SN, could be used to resolve the issue of correctly classifying study participants independent of the phenotype. An advantage of our methodology is that by calculating weighting maps, we can additionally validate our findings: Previous literature stated that the dorsolateral parts of the SN are the ones that are particularly affected at the beginning of the disease . Weighting maps should indicate the higher relevance of these specific areas for classification performance (which is in contrast to our findings, see Fig. 2). Here, this advantage is of even higher importance as further partitioning of the SN appears, within the scope of the already small region and the present image resolution, not to be feasible.
Our findings are well in line with previous publications using conventional analyses. Further studies without substantial methodological improvements (e. g., utilizing more complex diffusion models) will most likely not result in potentially translatable advancements in improving diagnostic accuracy or patient care.
Availability of data and materials
The proposed study has been performed on publicly-available data obtained from the Parkinson’s Progression Marker Initiative (PPMI). Study analyses were performed following the PPMI Data Use Agreement (http://www.ppmi-info.org/documents/ppmi-data-use-agreement.pdf).
Automated Anatomic Labeling
Atlas of the basal ganglia
Binary support vector machine
Diffusion tensor imaging
Local diffusion homogeneity
Magnetic Resonance Imaging
Parkinson’s Progression Marker Initiative
Receiver operator characteristics area under the curve
Region of interest
Alexander, A. L., Lee, J. E., Lazar, M., & Field, A. S. (2007). Diffusion tensor imaging of the brain. Neurotherapeutics, 4(3), 316–329.
Atkinson-Clement, C., Pinto, S., Eusebio, A., & Coulon, O. (2017). Diffusion tensor imaging in Parkinson's disease: review and meta-analysis. Neuroimage: Clinical, 16, 98–110.
Ballarini, T., Mueller, K., Albrecht, F., Růžička, F., Bezdicek, O., Růžička, E., … Schroeter, M. L. (2019). Regional gray matter changes and age predict individual treatment response in Parkinson’s disease. NeuroImage: Clinical, 21, 101636.
Cherubini, A., Nisticó, R., Novellino, F., Salsone, M., Nigro, S., Donzuso, G., & Quattrone, A. (2014). Magnetic resonance support vector machine discriminates essential tremor with rest tremor from tremor-dominant Parkinson disease. Movement Disorders, 29(9), 1216–1219.
Coutanche, M. N., Thompson-Schill, S. L., & Schultz, R. T. (2011). Multi-voxel pattern analysis of fMRI data predicts clinical symptom severity. Neuroimage, 57(1), 113–123.
Cui, Z., Zhong, S., Xu, P., He, Y., & Gong, G. (2013). PANDA: A pipeline toolbox for analyzing brain diffusion images. Frontiers in Human Neuroscience, 7, 42.
Fox, R. J., Sakaie, K., Lee, J. C., Debbins, J. P., Liu, Y., Arnold, D. L., … Fisher, E. (2012). A validation study of multicenter diffusion tensor imaging: Reliability of fractional anisotropy and diffusivity values. American Journal of Neuroradiology, 33(4), 695–700.
Fu, C. H. Y., & Costafreda, S. G. (2013). Neuroimaging-based biomarkers in psychiatry: Clinical opportunities of a paradigm shift. The Canadian Journal of Psychiatry, 58, 499–508.
Gong, G. (2013). Local diffusion homogeneity (LDH): An inter-voxel diffusion MRI metric for assessing inter-subject white matter variability. PLoS One, 8(6), e66366.
Keuken, M. C., Bazin, P. L., Schafer, A., Neumann, J., Turner, R., & Forstmann, B. U. (2013). Ultra-high 7T MRI of structural age-related changes of the subthalamic nucleus. Journal of Neuroscience, 33, 4896–4900.
Khedher, L., Ramírez, J., Górriz, J. M., Brahim, A., Segovia, F., & Alzheimer’s Disease Neuroimaging Initiative (2015). Early diagnosis of Alzheimer’s disease based on partial least squares, principal component analysis and support vector machine using segmented MRI images. Neurocomputing, 151, 139–150.
Magnin, B., Mesrob, L., Kinkingnéhun, S., Pélégrini-Issac, M., Colliot, O., Sarazin, M., … Benali, H. (2009). Support vector machine-based classification of Alzheimer’s disease from whole-brain anatomical MRI. Neuroradiology, 51(2), 73–83.
Menke, R. A., Scholz, J., Miller, K. L., Deoni, S., Jbabdi, S., Matthews, P. M., & Zarei, M. (2009). MRI characteristics of the substantia nigra in Parkinson’s disease: A combined quantitative T1 and DTI study. Neuroimage, 47(2), 435–441.
Pasternak, O., Sochen, N., Gur, Y., Intrator, N., & Assaf, Y. (2009). Free water elimination and mapping from diffusion MRI. Magnetic Resonance in Medicine, 62, 717–730.
Rulseh, A. M., Keller, J., Tintěra, J., Kožíšek, M., & Vymazal, J. (2013). Chasing shadows: What determines DTI metrics in gray matter regions? An in vitro and in vivo study. Journal of Magnetic Resonance Imaging, 38, 1103–1110.
Sacchet, M. D., Prasad, G., Foland-Ross, L. C., Thompson, P. M., & Gotlib, I. H. (2015). Support vector machine classification of major depressive disorder using diffusion-weighted neuroimaging and graph theory. Frontiers in Psychiatry, 6, 21.
Schrouff, J., Rosa, M. J., Rondina, J. M., Marquand, A. F., Chu, C., Ashburner, J., … Mourão-Miranda, J. (2013). PRoNTo: Pattern recognition for neuroimaging toolbox. Neuroinformatics, 11(3), 319–337.
Schrouff, J., Monteiro, J. M., Portugal, L., Rosa, M. J., Phillips, C., & Mourão-Miranda, J. (2018). Embedding anatomical or functional knowledge in whole-brain multiple kernel learning models. Neuroinformatics, 16(1), 117–143.
Schwarz, S. T., Abaei, M., Gontu, V., Morgan, P. S., Bajaj, N., & Auer, D. P. (2013). Diffusion tensor imaging of nigral degeneration in Parkinson’s disease: A region-of-interest and voxel-based study at 3T and systematic review with meta-analysis. Neuroimage Clinical, 3, 481–488.
Segovia, F., Illán, I. A., Górriz, J. M., Ramírez, J., Rominger, A., & Levin, J. (2015). Distinguishing Parkinson’s disease from atypical parkinsonian syndromes using PET data and a computer system based on support vector machines and Bayesian networks. Frontiers in Computational Neuroscience, 9, 137.
Tzourio-Mazoyer, N., Landeau, B., Papathanassiou, D., Crivello, F., Etard, O., Delcroix, N., … Joliot, M. (2002). Automated anatomical labeling of activations in SPM using a macroscopic anatomical parcellation of the MNI MRI single-subject brain. Neuroimage, 15(1), 273–289.
Wei, P., Leong, D., Calabrese, E., White, L., Pierce, T., Platt, S., & Provenzale, J. (2013). Diffusion tensor imaging of neural tissue organization: Correlations between radiologic and histologic parameters. The Neuroradiology Journal, 26, 501–510.
Wu, Y., Jiang, J. H., Chen, L., Lu, J. Y., Ge, J. J., Liu, F. T., … Wang, J. (2019). Use of radiomic features and support vector machine to distinguish Parkinson’s disease cases from normal controls. Annals of Translational Medicine, 7(23), 773.
The authors would like to gratefully thank the initiators of the PPMI study, the executing staff, and the enrolled participants.
JP received funding from the Parkinson’s Foundation (Miami, Florida, USA), the Deutsche Parkinsongesellschaft (Berlin, Germany), and the Deutsche Forschungsgemeinschaft via the Clinician Scientist School Lübeck (DFG-GEPRIS 413535489). PPMI, a public-private partnership, is funded by the Michael J. Fox Foundation for Parkinson’s Research and funding partners, as are listed at http://www.ppmi-info.org/about-ppmi/who-we-are/study-sponsors.
Ethics approval and consent to participate
Data used in the preparation of this article were obtained from the Parkinson’s Progression Markers Initiative (PPMI) database (www.ppmi-info.org/data). Ethics approval and informed written consent of all study participants has been provided by PPMI. For up-to-date information on the study, visit www.ppmi-info.org.
Consent for publication
The authors have no competing or conflicting interest to report.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Prasuhn, J., Heldmann, M., Münte, T.F. et al. A machine learning-based classification approach on Parkinson’s disease diffusion tensor imaging datasets. Neurol. Res. Pract. 2, 46 (2020). https://doi.org/10.1186/s42466-020-00092-y
- Parkinson’s disease
- Machine learning
- Substantia nigra