Introduction
Brain research is essential for understanding, diagnosis and treatment of mental disorders. Neuroimaging precisely reflects the structure and function of the human brain and has become a commonly used technique in brain research.1–8 Various neuroimaging modalities and analytic methodologies have been applied to characterise the brain of those with mental disorders.9–11 The deployment of various brain research projects has significantly accelerated neuroimaging research on normal populations and people with mental illness. Accordingly, there are new challenges to data collection, data management and sharing, computational resources and data mining.
Neuroimaging data collected by different research groups often have inconsistent scanning parameters, which may affect the quality consistency of the data and harm the statistical power of the analyses.12 13 Before data collection, the efforts of the neuroimaging centre to standardise and accurately manage scanning parameters can help unify data quality and promote more efficient accumulation and sharing of neuroimaging data. At the same time, standardised subject registration and scanning procedures are required to maximise the consistency of the data collection process and provide an important foundation for neuroimaging data management, standardised processing and connection with clinical information.
The management of large-scale neuroimaging data, especially for mental disorders, is another challenge for imaging centres. Specifically, the diverse diagnosis of mental illness and the multiple imaging modalities used in research bring complexity to the centralised management and cleaning of data. Data collection and analysis are usually carried out in the form of a team (multiperson) or teamwork (multiteam), which brings challenges to data authority management and security assurance. There are some large-scale neuroimaging data management and sharing systems, such as LONI\IDA (The Image and Data Archive at the Laboratory of Neuro Imaging),14 COINS (The Collaborative Informatics and Neuroimaging Suite),15 NITRC-IR (The Neuroimaging Informatics Tools and Resources Clearinghouse Image Repository)16 and XNAT (The Extensible Neuroimaging Archive Toolkit)17 systems (table 1). These systems are versatile and emphasise the authority management of data, so the unit of data management is the dataset. It should be emphasised that the neuroimaging data management system is not equivalent to the data warehouse. In addition to the management of access rights and sharing, it is also necessary to consider how to connect each individual’s specific scan or imaging features to other data, such as scans using other imaging modalities and clinical phenotypes. Therefore, we need a more sophisticated data management system.
The processing of neuroimages usually relies on complex calculations that combine multiple general or specialised software programs, such as Matlab and FreeSurfer. Installing this software on personal computers and maintaining their operation is not a simple task. The neuroimaging research of many non-technically-oriented researchers has been hindered by this starting point. There is research evidence that the software version and even the operating system version have a systematic influence on the calculation results. For example, FreeSurfer running on different versions of Linux systems will give different skin thickness estimation results. In addition, with the rapid increase in the amount of data and the increase in computational complexity, the performance requirements of computing devices for data analysis far exceed the capacity of personal computers. Therefore, providing researchers with a cloud computing platform can create a seamless connection from the scanner to the data inspection and data analysis platform, reduce technical obstacles for neuroimaging researchers, and improve the efficiency of data analysis and the use of computing resources. There are several cloud-based computing platforms available, such as Neuroscience gateway18 and Cbrain19 (see table 1), providing different degrees of flexibility for image analyses. Nevertheless, in terms of ease of use, user familiarity and flexible use of tools, there is still much room for improvement in the computing system for clinical researchers.
To speed up the utilisation of large-scale neuroimaging data, it is necessary to provide a standardised preprocessing pipeline to make the data of multiple research groups comparable and make cross-disease comparison possible. At present, some data-sharing platforms have conducted standardised data mining and analysis attempts. Among them, ENIGMA20 provides ComBat21 scripts to deal with the differences between different sites, and proposes agreements for quality control and analysis of genomics and MRI; for ABIDE dataset,22 preprocessed data using five different image processing pipelines, including CCS (Connectome Computation System),23 C-PAC (Configurable Pipeline for the Analysis of Connectomes),24 DPARSF (Data Processing Assistant for Resting-State fMRI (functional magnetic resonance imaging)),25 CIVET26 and NIAK (Neuroimaging Analysis Kit),27 are shared. In addition, some independent data processing systems, such as volBrain,28 provide automatic brain image processing (skull stripping and brain morphological measures) service and an easy-to-use webpage interface. Furthermore, clinical researchers need a simple and automated multimodal image analysis system to help standardise and simplify neuroimaging preprocessing. To adapt to the cloud storage and management system of neuroimaging data, the pipeline of neuroimaging processing should also be able to be deployed in the cloud and work in connection with the cloud image database. Currently, no system integrates an automated, multi-modal neuroimaging preprocessing and feature extraction pipeline with a cloud database.
In terms of research and clinical transformation of mental disorders, the current neuroimaging research needs to answer questions including the sensitivity and specificity of neuroimaging markers and the dimension of brain abnormalities across mental disorders.29 30 Aggregating multiple types of mental disorders data with similar data quality and similar social and cultural conditions on the same platform is conducive to exploring and testing neuroimaging markers for mental disorders. In addition, based on large-scale and highly homogenous neuroimaging data, artificial intelligence methods will promote the application of neuroimaging technology to the diagnosis and treatment of mental disorders and even the choice of treatment options, thereby bringing technological innovations to the diagnosis and treatment of mental illnesses.
To meet the above challenges, we have developed an integrated and highly automated cloud neuroimaging system, Integrated Neuroimaging Cloud (INCloud). INCloud provides a highly automated cloud solution that connects the acquisition, management, analysis and mining, and clinical applications of neuroimaging data. The aims of INCloud are to (1) enhance the value of neuroimaging information to the scientific research and clinical application of neuropsychiatric diseases; (2) relieve researchers from the complex processes of computational environment deployment, image manipulations and preprocessing; (3) provide efficient facilities for cross-disease and cross-modal analyses and (4) translate research data into resources supporting clinical practice.