Multivariate pattern analysis of medical imaging-based Alzheimer's disease

1 Introduction

The human brain is a highly complex organ regulating the human neurological system. The human neocortex has up to 100 billion neurons connecting throughout the brain (1). They constitute a vast, interconnected network linked to human activities and emotions. Various neuroimaging techniques can acquire a wide range of brain signals. The term "neuroimage" is based on the representation of brain functionality or architecture (2). AD is among the most common types of memory loss in the twenty-first century and is a significant healthcare problem. As per statistics, there are ~5.5 Americans aged 65 years and older affected by AD (3). AD is a progressive brain disease. It is marked by a loss of executive function that treatment cannot resolve. Thus, studies have been conducted to develop ways to predict the disease, especially before symptoms appear, to slow or prevent them from worsening (4).

Traditionally, AD was detected through an invasive technique. Recently, multiple neuroimaging modalities have been developed to identify AD: positron emission tomography (PET) uses specific radiotracers to visualize and quantify amyloid plaques in the brain; electroencephalography (EEG) is utilized to obtain the electrical activity; and functional magnetic resonance imaging (fMRI) is utilized to measure the functionality of the brain with the help of oxygen level change detection in various parts of the brain, such as voxels (5). Moreover, the anatomical brain features are studied using magnetic resonance imaging (MRI), having high spatial determination, and can compare soft tissues (6).

Because neuroimaging techniques are rapidly changing, combining large amounts of high-dimensional, multimodal neuroimaging data is challenging. Thus, computer-aided machine learning methods for consolidative study have rapidly become extremely popular, and multiple neuroimaging modalities have recently been developed to identify AD. A popular neuroimaging process for examining brain activity in neurodegenerative illnesses is resting-state fMRI (7).

Based on recent research, brain changes associated with AD begin up to two decades before symptoms appear. Due to the high cost and side effects of current medicines, it is essential to focus on enhancing the quality of life or reducing the impact of the disease. To this end, a computer learning model showing significant performance in predicting the disease earlier can help minimize losses (8).

The structure of the study is outlined as follows: Section II provides background on the phases of AD and BOLD data. Section III reviews previous study on fMRI data. Section IV introduces the framework, while Section V shows the results. Section VI discusses the evaluation metrics used, and Section VII compares the findings with previous studies. Finally, Section VIII concludes the study and outlines future research directions.

1.1 Motivation and contribution

In recent years, computer-aided design systems have become increasingly important in diagnosing and grading AD, a severe disease affecting many people, particularly the older population. AD causes memory loss and an inability to function in one's environment. The biology of the disease is not yet fully understood, and no cure or medication is currently available to prevent its progress. Early detection is essential for minimizing the impact of the illness and enhancing patients' quality of life. However, classifying AD is challenging due to various constraints involved in fMRI scans, such as low spatial resolution, image artifacts, and motion aftereffects. Despite the low spatial resolution, the abstract and high-level shapes can still provide valuable information for our analysis. With a large amount of data, we have the potential to capture a wide range of variations, which can help improve the robustness and generalization of the model, based on inter subjects. This diversity can also help identify and characterize AD patterns and various sub-types or stages. Addressing these problems at different stages is necessary to develop a robust detection and classification framework for AD.

The primary contributions of this study include:

• To apply techniques using MVPA to consider patterns across multiple variables simultaneously.

• To identify relevant features in order to mitigate the impact of irrelevant or redundant ones by using the LASSO method.

• To propose a framework for detecting AD based on brain signals using hybrid machine learning classifiers.

• To evaluate the results using performance metrics on the public datasets of OASIS and ADNI for improved accuracy rates.

1.2 Early diagnosis benefits

Early detection of AD is crucial for several reasons (9):

• Early intervention: It is referred to as the strategies implemented for the early detection of AD. As there is no treatment for AD, the progression of the disease can help to manage the symptoms (10).

• Treatment planning: Early detection of AD allows for the timely implementation of comprehensive treatment plans, including medications, lifestyle changes, and cognitive interventions.

• Clinical trials: Early detection enables individuals to participate in clinical trials for new treatments, which are crucial for advancing our understanding of AD and developing new therapies.

• Learn about the management of AD symptoms.

• Develop a community for assistance.

• Conduct clinical studies to test any recent possible medication (9).

2 Background 2.1 Phases of AD

AD has been classified into four stages (11), as shown in Figure 1:

Figure 1. Phases of AD.

2.1.1 Normal control

Normal control is also known as cognitive normal, which is the natural process of cognitive aging. Individuals of 66 years of age healthily, retaining their ability to think, respond and communicate. This is related to the natural aging process (12). They show no symptoms of AD.

2.1.2 Mild Cognitive Impairment or prodromal stage

The intermediate stage between healthy control and AD is referred to as MCI. During this stage, an individual experiences short-term memory loss and difficulty remembering the names of familiar people or objects as a symptom. According to studies, 80% of MCI patients advance to AD after a certain time period of ~5–6 years (12).

Individuals may experience minor abnormalities in cognitive function, but they are insufficiently severe to meet the criteria for the diagnosis of Alzheimer's disease in Early Mild Cognitive Impairment (EMCI stage) (13). Therefore, this stage is generally considered harmless. Not everyone with MCI will develop AD, and some people may even show improvement in their mental abilities. This stage damages the medial temporal lobe of the hippocampus and causes symptoms of short-term memory loss.

More progression is toward another alarming stage, which is Late Mild Cognitive Impairment (LMCI) (13), affecting the lateral and parietal lobes of the brain. Reading difficulties, poor object recognition, difficulty knowing the names of people, and a lack of sense of direction are all symptoms of this stage.

2.1.3 Alzheimer's disease

AD is the final stage of the disease, characterized by severe memory loss, including the names of people and things. This stage is incurable (14). The stage of AD begins in the hippocampus and entorhinal cortex and gradually spreads to other brain sections, affecting the frontal, temporal, and occipital lobes of the brain. Poor judgment, impulsivity, a short attention span, and vision issues are all symptoms of this period. Advancing age, hereditary variables, brain traumas, vascular illnesses, pathogens, and external conditions are among the risk factors contributing to AD development, as shown in Figure 2. What leads to the pathological changes observed in AD remains unclear. While several theories exist, two of the most prominent ones suggest that cholinergic dysfunction and amyloid protein abnormalities may be significant risk factors. However, no widely accepted explanation exists for the underlying mechanisms of AD (15).

Figure 2. Risk factors of AD.

2.2 Blood Oxygenation Level-Dependent signal

Several important factors influence the BOLD signal, as shown in Figure 3.

Figure 3. The fMRI BOLD signals and hemodynamic response (16).

The complex interaction between neural action and causing a hemodynamic reply, and how an MRI scanner can detect this response. The magnetic field intensity, echo duration, and type of imaging technology used are only a few of the experimental factors in the scanning of fMRI that influence the number of BOLD signals detected by each scanner. For instance, although the hemodynamic response is the same, a 1% BOLD signal throughout an echo of 30 ms is comparable to 2% over an echo period of 60 ms, and the reaction is continuous. Additionally, BOLD imaging is prone to several aberrations, including field inhomogeneities, ghosting, and head motion (17). Determining how accurately the BOLD reply imitates a specific hemodynamic response is challenging due to the number of interacting variables.

The balloon method by Buxton et al. (18) has been developed through extensive research on the type of hemodynamic reply, particularly by Friston et al. (19). As previously mentioned by Buxton et al. (18), the BOLD signal vascular basis is primarily thought to be a relative inequality between rises in blood flow of local cerebral and concomitant (albeit smaller) rises in oxygen digestion, resulting in a brief drop in the deoxyhemoglobin to oxyhemoglobin ratio.

The blood volume, hematocrit, vascular geometry, and oxygenation levels of basal are other physiological variables affecting changes in the deoxyhemoglobin concentration (20, 21). Despite these crucial starting conditions, the hemodynamic response can differ significantly between species and cortical areas. Different facets of the hemodynamic response may alter on various timescales and have various neuronal underpinnings and effects on the signal of BOLD. It is now widely acknowledged that the signal of BOLD also occurs at prominent draining veins, possibly a few centimeters below the neuronally active part, in addition to capillaries. Inferentially, such changes in the signal would be located spatially apart from the stimulated brain tissue.

Consequently, regarding the "brain vs. vein" debate (22), suggest that the density is based on microvascular, which will consistently be less than that of neurons (23), is impeded by massive contributions of vessels, and is more likely to be the aspect restraining the BOLD-based fMRI spatial resolution. The spin-echo fMRI method reduces these vein contributions, making them potentially useful for more precisely tracing the neuronal sources of fMRI BOLD (24, 25). Capillaries have a more significant impact on image intensity (26), with stronger field strengths. Therefore, these two might become more beneficial when used together.

3 Literature review

Several studies have developed ideas for systems that could be used to classify AD. This section examines current studies using deep learning (DL) and machine learning (ML) models in systems for diagnosing and detecting AD. Some previous studies on detecting this disease have used standard ML methods (27). Additionally, many neuroimaging studies feature extraction strategies for fMRI signals; for example, Lama and Kwon (28) implemented graph theory to help predict AD at three stages: AD, MC, and NC, with classifications based on the linear support vector machine (SVM) and the regularized extreme learning machine. The Node2vec graph embedding approach converts graph features into feature vectors.

Parmar et al. (29) developed a 3D-CNN that uses rs-fMRI data to predict AD development. By employing unconventional techniques, they extracted patterns from neuroimaging data and found that a simple deep-learning model works well in categorizing AD. The findings of the study suggest a promising future, where fMRI-based biomarkers could assist in the early diagnosis and classification of AD. The study achieved 96.67% accuracy.

Guo and Zhang (30) introduced a distinct network using an autoencoder(AE) to detect natural aging and progression disorders. The network is based on biased neural networks and can easily diagnose AD. The researchers evaluated the system using the fMRI AD dataset and observed that it provides 25% better accuracy than other methods. The study achieved a remarkable 94.6% accuracy. Another study by Alarjani et al. (31) compared machine learning (ML) and deep learning (DL) models for early detection of AD using fMRI data. A 3D convolutional neural network (3D-CNN) extracted features from support vector machine (SVM) for classification. The 3D-CNN achieved 98.3% accuracy, while the SVM achieved 97.5%.

Shahparian et al. (32) developed an ML-based system that detected AD using fMRI images. The system is used to calculate time series for specific anatomical regions using the individual's fMRI data, and the latent low-rank representation method is utilized to extract pertinent features. Based on the acquired characteristics, the SVM classifier determines whether the person is healthy at the onset of the disease or has AD. The proposed method has an accuracy exceeding 97.5%. The problem with vascular dementia (VD) and AD is that both are more frequent. These may cause controversial diagnoses while using classical MRI and clinical methods. Castellazzi et al. (33) different ML algorithms alongside combinations of MRI data are analyzed. AD and VD are two of the most common. Concerning AD and VD, they may demonstrate multiple neurological symptoms that may lead to ambiguous diagnoses when using MRI criteria and conventional clinical. To overcome this problem, a method to classify AD and VD is presented. The system is assessed by three algorithms, such as ANN, SVM, and neuro-fuzzy inference.

Wang and Lim (34) conducted a new assessment approach introduced for individuals with AD and MCI compared with NC individuals, which utilized the zoom-in neural network DL algorithm. By extracting features from the resting-state fMRI dataset obtained from the ADNI, the algorithm could detect the implicated regions during AD by utilizing the automated anatomical labeling (AAL) Atlas. The study found that the ZNN obtained good results of 97.7, 84.8, and 72.7% accuracy for distinguishing AD from NC and MCI, NC from MCI and AD, and MCI from NC and AD, respectively. This was achieved using seven discriminative ROIs in the AAL-90.

Data optimization is indeed a complex task in the field of neuroimaging. However, Zamani et al. (35) proposed an interesting approach integrating artificial neural network (ANN) with evolutionary algorithms to optimize the neuroimaging data with multiple parameters. Using the rs-fMRI data based on the resting state, they measured the FC and computed 1,155 parameters. They tested the system using the ADNI dataset and achieved 94% accuracy.

To achieve AD discrimination at various stages, Nguyen et al. (36) suggested a voxel-wise discriminative system for multi-measuring rs-fMRI and combining hybrid MVPA and extreme learning machine (ELMs) and applied it to two different datasets. Jiao et al. (37) proposed a method focusing on the multi-scale combination of features. This approach utilizes global static features, moment features, and more refined features extracted from networks that are static, dynamic, and high-order functional. Subsequently, SVM was used to classify EMCI versus NC. Lu et al. (38) developed a system categorizing AD, MCI, and CN of fMRI data using FC throughout the brain rather than feature selection. They then used an ELM to classify binary stages. Unfortunately, this framework is only appropriate for a small dataset.

Yang et al. (39) extensively applied the brain function network to classify AD biomarkers 240 in the MCI stage. They used multiple time points of rs-fMRI data by combining the fused sparse network model based on centralized learning that is parameter-free. The essential features selected by the similarity network fusion method were then used to classify them using SVM. In addition, Chan et al. (40) proposed approach for AD uses a graph neural network (GNN) on MRI and fMRI scans. It encodes scans into brain graphs, clusters representations learned by the GNN to identify disease subtypes, and constructs population graphs for final decision-making. This approach outperforms existing methods, identifying three AD subtypes and revealing unique biomarkers, such as left cuneus and left isthmus cingulate cortex degeneration.

Lama et al. (41) constructed the brain network using Pearson's correlation-based FC of fMRI data. The brain network's graph features were transformed into feature vectors using the Node2vec graph embedding technique. Furthermore, they selected features using various approaches, which they then applied to classifiers: single-layered extreme learning and multi-layered ELM. Koluragi et al. (42) combined SVM and EfficientNetB0 to improve the performance. The integrated approach outperformed individuals, leveraging EfficientNetB0's efficient resource utilization and balance.

In earlier research, rs-fMRI used a mono-band frequency range and focused on low-order neurodynamics. Thus, high-order neurodynamics were deliberately excluded. To address these issues, Sethuraman et al. (43) proposed an automated system to detect AD using rs-fMRI. The system constructs a high-order neurodynamic functional network using different levels of rs-fMRI time-series data, such as slow4 and slow5, and the full-band ranges from 0.027 to 0.08 Hz, 0.01 to 0.027 Hz, and 0.01 to 0.08 Hz. SVM and k-nearest neighbor (KNN) were used for ML, and AlexNet and Inception were implemented to classify various stages of AD. The system achieved 96.61% accuracy in differentiating between AD and NC. Begum and Selvaraj (44) used deep CNN (DCNN) and 3D densely connected convolutional neural network algorithms to diagnose AD and perform feature analysis on fMRI data.

To enhance early detection (45), the effectiveness of Extreme Learning Machines (ELMs) was assessed alongside fMRI-based FC metrics. The non-linear methods such as MIC and eMIC were applied as classification features leads to robust outcomes. The study achieved a 95% accuracy rate in distinguishing between AD and NC using these methods. The study conducted by Penalba-Sánchez et al. (46) investigated the dynamic and static FC of resting-state fMRI using various methods across 116 ROIs for four participant groups. Additionally, they utilized graph theory metrics to investigate network segregation and integration. The results showed that the EMCI group had a longer typical path length and lower degree compared with the healthy control (HC) group.

3.1 Important of gap

MVPA techniques can enhance the ability to detect significant changes in the activity of the brain that may not be noticeable with traditional univariate methods. This is particularly important in AD, where early detection of subtle changes can be crucial for timely intervention. Additionally, MVPA allows a more detailed understanding of how different brain regions interact and contribute to cognitive processes. This can provide valuable insights into the underlying mechanisms of AD and other neurological disorders.

4 Proposed framework

AD is a serious health condition affecting many people, particularly the older population worldwide. It is a debilitating illness causing memory loss and impairing one's ability to interact with their surroundings. Early detection is crucial in mitigating the effects of Alzheimer's disease and improving the quality of life. Recognizing the disease at its onset enables the reduction of its impact on patients. We constructed a predictive framework to detect AD at an early stage based on human brain imaging techniques: fMRI. Figure 4 presents a summary of the proposed framework. It includes the following steps: (1) data collection (i.e., fMRI), (2) preprocessing of fMRI data to avoid articles (i.e., noisy), (3) computing FC through MVPA, (4) extracting time series of fMRI data, (5) computing correlation matrices for each stage, (6) feature selection to select relevant features (i.e., voxel), (7) supervised learning, and (8) evaluation and analysis.

Figure 4. Proposed framework.

4.1 fMRI signal preprocessing

Since medical images are complex and difficult to extract features, various techniques must be used to process images in the dataset.

A flexible preprocessing pipeline is used to prepare functional and structured data, including realignment, slice timing correction (STC), normalization to MNI space, and smoothing (47). For realignment, we utilized the SPM realignment unwarping procedure suggested by Andersson et al. (48). Then, scans are co-registered based on a reference image, such as the first scan of the first session. For this, a least square technique and a transformation of a 6-parameter (rigid body) are utilized, as presented in the study by Friston et al. (49). After that, the interpolation of the B-spline was resampled to reduce the effects of motive and magnetic artifacts.

Temporal misalignment and methods were applied to identify scans. A reference BOLD image was developed by applying the mean to the scans, and the outliers were excluded. The SPM unified normalization algorithm is used to perform the normalization and obtain the standard MNI space (50, 51), with the probability map template based on default IXI-549 tissue, as resampled to 2 mm isotropic voxels. Finally, the spatial convolution of the data was performed with the help of a Gaussian kernel of 6 mm full-width at half-maximum (FWHM) for smoothing (see Figure 5 for an illustration).

Figure 5. Pipeline for preprocessing of BOLD data.

4.2 Functional connectivity

An essential application of fMRI studies is brain network mapping in AD patients and between the brain network mapping routes. At rest, the default mode network is among the most exciting networks (52). DMN relates to knowing previous events, imagining future events, self-relevant mental processing, and checking external information (53). Alterations in DMN functional activity have been linked to neurological disorders (54–56). Most studies show decreased FC in the DMN. In a study by Koch et al. (57), the power of the DMN in rs-fMRI was examined to differentiate between three groups: CN individuals, MCI, and patients with AD. Moreover, this can be constructed using numerous imaging technologies [for example, EEG/magnetoencephalography [MEG] and structural, diffusion, and functional MRI]. Ways to analyze FC include UNIVAR and MVPA.

4.2.1 Univariate analysis

UNIVAR is a method used to analyze fMRI data. UNIVAR assesses the individual voxel neural activation or the average voxel activation of the brain. Thus, it is used for the localization of brain regions participating in processing specific stimuli such as face versus object. The conclusion about the brain regions participating in cognitive processes is also drawn from the study by Haynes and Rees (58). A general linear model is employed on each voxel, which is why it is called univariate (59). FCA characterizes communication between various brain regions during a task or rest. It also measures the relationship strength between the BOLD signal of the time series (60), as shown in Figure 6.

∀x,y rn(x,y)=gn*b(x,y)+ϵn(x,y).σ(x,y) (1)

Figure 6. Schematic diagram of univariate analysis.

Null hypothesis C.b(x, y) = 0

Here in Equation 1, n refers to the number of subjects in a study, and x and y are the voxel pair. The characterizing FC of these two voxels can be considered as rn(x, y), where gn is referred to as the vector of a predictor of each subject n. The unknown regression coefficient of an unknown vector is b(x,y), while ϵn (x,y) and σ (x,y) are error term and inter-subject variance, respectively. A null hypothesis can be formed using C.b(x, y) = 0.

Many studies used UNIVAR, such as in the study by Moeller et al. (61), to identify the region's dynamic activity close to the expected waveform. In another method Bu et al. (62), the authors examined the UNIVAR and MVPA overlap.

4.2.2 MultiVoxel (or Multivariate) Pattern Analysis

Multivariate Pattern Analysis MVPA is the most used technique for analyzing functional data. In this study, the spatial pattern of neural activation across various voxels is considered (e.g., voxels in fMRI or channels in MEG/EEG). It also assesses whether it has information related to the task (63). It is called multivariate because it is based on analyzing a set of voxels rather than single voxel modeling (64). The similarities of such patterns can also be investigated by the activation of these patterns, such as by viewing a scene vs. a face, Norman et al. (65), as shown in Figure 7. The MVPA can be mathematically defined as follows (Equation 2).

∀x rn(x)=gn*B(x)+ϵn(x)*∑(x) (2)

Figure 7. Schematic diagram of Multi-Voxel Pattern Analysis.

Null hypothesis C.B(x).P(x) = 0

While rn (x) refers to connectivity value whole map, unknown predictor of regression coefficients is denoted by B(x). ϵn (x) refers to residual error. ∑(x) is denoted as voxel-by-voxel matrix of positive definite. While C denotes between subject, P(x) represents contrast matrix of between-voxels. There are many studies that used MVPA such as in the study by Yoon et al. (66), it used validate impairment hypothesis in schizophrenia-distributed representations. In another method, Lee et al. (67) conducted hypothesiss by using MVPA to check that based on the brain prediction, the efficiency of models has variations across the stimuli types.

4.3 Region of interest

After preprocessing BOLD fMRI data, we can extract features from the fMRI data depending on the atlas. Automated Anatomical Labeling (AAL) atlas is a tool used in neuroimaging that provides a pre-defined anatomical division of the human brain. This tool is widely used in neuroscience research, particularly in functional and structural brain imaging studies, such as fMRI and PET. The AAL atlas helps researchers to identify and label specific brain regions in their neuroimaging data. The human brain is divided into anatomical regions, each with a specific label in the AAL atlas. AAL atlas provides standardized three-dimensional coordinates for each region, which researchers can use to locate and precisely label brain imaging data areas. The AAL atlas performs various analyzes, including region-of-interest (ROI) studies in functional brain imaging, to map brain activity during specific tasks or resting-state conditions (68). Few types of the AAL atlas are as follows: AAL1 (69), AAL2 (70), Chinese AAL (71), AAL3 (68). Dealing with high-dimensional and small sample datasets such as fMRI data is challenging when it comes to classification and modeling. To address this issue, the AAL template is utilized in this study to calculate the functional link matrix after processing the original image. In Figure 8, the AAL3 used to perform feature extraction to identify relevant brain regions or patterns for the fMRI. AAL3 includes 170 regions, masking objects with an atlas to extract time series within each ROI (see Figure 9).

Figure 8. View for the AAL3 template.

Figure 9. (A) Performed mask and functional of AD. (B) Performed mask and functional of MCI. (C) Performed mask and functional of NC.

4.4 Compute connectivity

Multiple techniques are available to calculate the FC of fMRI. These techniques include connectivity maps of seed-to-voxel, ROI-to-ROI connectivity matrices, independent component analysis, and multivariate pattern analysis (MVPA). This study proposes FC using MVPA to analyze individual voxel resolution in the brain-wide connectome. This approach uses the MVPA methods to overcome the challenges of brain-wide connectome analysis. MVPA was applied to a 4D BOLD dataset to compute the correlation matrix between voxel time series within each ROI and remove relevant voxels based on their correlation with other voxels. These analyzes calculate a series of associated connectivity patterns and spatial maps that illustrate the voxel connectivity to the rest of the brain. Based on the provided fMRI time-series data, the calculated correlation matrix will then contain correlation values between ROI pairs. The FC matrix is displayed using the AAL3 template, which includes 166 brain regions, resulting in a connectivity matrix of 166 X 166. The correlation matrix ranges from 0 to 1, with 0 indicating no correlation and 1 indicating a high degree of correlation. The matrix is shown in Figures 10, 11.

Figure 10. Functional connection matrix and brain network visualization for each stage (AD, MCI, and NC). (A) Functional connectivity NC. (B) Functional connectivity AD. (C) Functional connectivity MCI.

Figure 11. Functional connection matrix and brain network visualization for each stage (AD, MCI, and NC) in ADNI dataset. (A) Functional connectivity NC. (B) Functional connectivity AD. (C) Functional connectivity MCI.

4.5 Feature selection 4.5.1 LASSO

Suppose we have a data (xi,Yi), i = 1, 2, … , N, where xi = (Xil,Yip)T refers to the variables used for prediction, yi refers to the response. In the usual setup of regression, we suppose that either all observation is independent or yis are independently conditionally of the given yijs we can suppose that Xij referred to as standardized ∑i xij/N = 0, ∑i x2ij/N Suppose β^=(β^1,…,β^p)T, and the lasso estimate (α^,β^)

Here, t≥0 is referred to as a parameter for tuning. For all t, the solution for a is α^=ŷ. We can consider without losing the generality that α^=0 which omit α. The solution of the above equation is a problem of quadratic programming having linear constraints of inequality.

The amount of shrinkage is controlled by the parameter t≥0. It is applied for estimation. Suppose β^j refers to the estimates of full least squares. Let ∑|β^j|, then the shrinkage will occur due to t < 0. This shrinkage will occur in the solutions toward 0. There are some coefficients and value of these coefficients will be 0. If t = t0/2, then the affect will be same as searching the best subset having a size of p/2. It is not necessary that the matrix of design will be of full rank.

The motivation behind the Lasso is from a proposal by Breiman, and it can be defined as Equations 3 and 4.

(α^,β^)=argminsubject to∑j|βj|≤t. (3) ∑i=1N(yi-α-∑jcjβ^j°xij)2subject to cj≥0, ∑cj≤t (4)

As previously mentioned, fMRI data are high-dimensional, with many voxels (3D pixels) representing regions of the brain. In this context, LASSO helps select a subset of these most relevant voxels for a particular analysis. Lasso is used as a regularization technique in linear regression methods. It adds a penalty term to the sum of squared errors, encouraging sparsity in the resulting model. It promotes the selection of a subset of features (voxels or ROI) while setting others to zero. Identifying relevant voxels or ROIs: Lasso regression is employed to identify relevant features (relevant brain regions or voxels). The Lasso coefficients provide information about the importance of each feature by setting a suitable penalty parameter (alpha = 0.01). Features with non-zero coefficients are considered relevant, and those with coefficients set to zero are effectively excluded from the model (72). We selected the λ value that minimized the cross-validated mean squared error (MSE), as shown in Figures 12, 13.

Figure 12. MSE of the LASSO fit, cross-validated with a paramete

View original article

FRONTIERS IN MEDICINE

分享书签

0 0 0 0 0 0 0

More from this channel

Multivariate pattern analysis of medical imaging-based Alzheimer's disease

留言 (0)