Next Article in Journal
Determination of pH in Powdered Concrete Samples or in Suspension
Previous Article in Journal
Modeling of Temperature Time-Lag Effect for Concrete Box-Girder Bridges
Previous Article in Special Issue
Molecular Specific and Sensitive Detection of Pyrazinamide and Its Metabolite Pyrazinoic Acid by Means of Surface Enhanced Raman Spectroscopy Employing In Situ Prepared Colloids
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Multivariate Statistical Analysis of Surface Enhanced Raman Spectra of Human Serum for Alzheimer’s Disease Diagnosis

1
Department of Chemistry, University at Albany, SUNY, 1400 Washington Avenue, Albany, NY 12222, USA
2
Alzheimer’s Center and Movement Disorders Program, Department of Neurology of Albany Medical Center, Albany, NY 12208, USA
3
Parkinson’s Disease and Movement Disorders Center of Albany Medical Center, Albany, NY 12208, USA
4
Layton Aging and Alzheimer’s Center, Oregon Health & Science University, Portland, OR 97239, USA
*
Author to whom correspondence should be addressed.
Appl. Sci. 2019, 9(16), 3256; https://doi.org/10.3390/app9163256
Submission received: 1 July 2019 / Revised: 30 July 2019 / Accepted: 7 August 2019 / Published: 9 August 2019
(This article belongs to the Special Issue Surfaced Enhanced Raman Scattering (SERS) in Disease Diagnosis)

Abstract

:

Featured Application

In this research we propose a novel method for detecting Alzheimer’s disease. This method involves the use of surface enhanced Raman spectroscopy in combination with multivariate statistical analysis. Based on the results of the proof-of-concept study, we have successfully demonstrated the potential of the method to identify Alzheimer’s disease through analysis of blood serum. With further work, this method could be developed into a novel clinical assay for the effective and accurate diagnosis of Alzheimer’s disease.

Abstract

Alzheimer’s disease (AD) is the most common form of dementia worldwide and is characterized by progressive cognitive decline. Along with being incurable and lethal, AD is difficult to diagnose with high levels of accuracy. Blood serum from Alzheimer’s disease (AD) patients was analyzed by surface-enhanced Raman spectroscopy (SERS) coupled with multivariate statistical analysis. The obtained spectra were compared with spectra from healthy controls (HC) to develop a simple test for AD detection. Serum spectra from AD patients were further compared to spectra from patients with other neurodegenerative dementias (OD). Colloidal silver nanoparticles (AgNPs) were used as the SERS-active substrates. Classification experiments involving serum SERS spectra using artificial neural networks (ANNs) achieved a diagnostic sensitivity around 96% for differentiating AD samples from HC samples in a binary model and 98% for differentiating AD, HC, and OD samples in a tertiary model. The results from this proof-of-concept study demonstrate the great potential of SERS blood serum analysis to be developed further into a novel clinical assay for the effective and accurate diagnosis of AD.

1. Introduction

Alzheimer’s disease (AD) is a progressive neurodegenerative disease which results from neuronal cellular pathology associated with the dysregulation of protein and lipid metabolism, oxidative stress, and inflammation [1]. The discovery of relatively robust AD-specific biomarker signatures in blood has recently been reported by several research groups [2,3,4]. It was indicated that the diagnostic strategy could significantly benefit from combining several serum biomarkers because of the multiplicity of pathophysiological processes comprised in AD [5,6,7]. Efforts to find reliable biomarkers for AD in peripheral blood resulted in the documentation of several promising novel blood biomarker panels including protein signatures, [2,4,8], lipid and metabolite profiles [9], oxidative stress species [10], and proinflammatory cytokines [11].
Intense multidisciplinary research has shown that Raman spectroscopy combined with advanced statistics can determine potential differences in molecular structure and biochemical composition of body fluids and tissues [12,13,14,15,16,17,18,19]. Furthermore, it has emerged as a powerful technique for the detection of cancer [20,21], diabetes [22,23], preeclampsia [24], and atherosclerosis [25]. Most importantly, Raman spectroscopy has proven to be a useful tool for detecting characteristic biochemical changes in blood which occur due to neurodegenerative disorders such as AD [26,27,28] and Parkinson’s disease (PD) [29]. This indicates that as a disease progresses within humans, changes within the composition of blood serum occurs, causing disease-specific alterations in the spectral signature obtained by Raman spectroscopy. Consequently, these changes of spectral biofluid features can be used for diagnostic purposes [30,31].
However, conventional Raman spectroscopy has an inherently small cross-section of Raman scattering and the Raman signals from biological samples are usually weak [12]. Even further, it is only natural to assume that at the initial stages of AD, the biochemical changes reflected in blood will be extremely subtle [32]. To overcome these potential hurdles for introducing the technology to clinical settings, the approach requires high spectral quality, high levels of sensitivity and specificity, and advanced methods for data analysis.
The phenomenon of surface-enhanced Raman scattering (SERS) provides all of the aforementioned qualities. Adsorption of a sample onto rough metal nanostructures leads to a significant enhancement of Raman scattering by several orders of magnitude (between 102–106) [33]. Recent advances in theory and computational techniques have shown that a strong SERS effect can be achieved using silver or gold colloids [34]. SERS has proven itself to be a highly attractive bioassay platform for bioanalytical sensing which requires low detection levels and a high degree of specificity [34,35]. SERS has been used in combination with well-validated multivariate statistical methods to address a variety of biomedical problems including detection of cancer, infectious diseases, and metabolic diseases [23,36,37,38,39].
The development of a reliable assay for rapid detection and differentiation of AD will dramatically improve the management of this deadly disease in terms of providing early diagnostic efficiency and accuracy, as well as aid in the search for a cure [32,40]. As such, this study explores a SERS-based analysis of blood serum samples to develop a rapid and reliable laboratory test for AD diagnostics. The approach is based on probing the total biochemical composition of blood serum with Raman spectroscopy using silver nanoparticles (AgNPs) as a signal-enhancing substrate. The obtained SERS spectral signatures were analyzed by artificial neural networks (ANNs) for AD detection and differentiation. ANNs are a form of sophisticated statistical techniques which can be used for multivariate data analysis. This advanced computerized approach can be applied for sample classification as the result of pattern learning and recognition of measured disease-related spectral features.
The new method was validated by analyzing blood serum samples from three different subject groups: AD patients, patients with other forms of neurodegenerative dementia (OD), and healthy controls (HC). The proposed approach suggests a novel avenue for development of a rapid, reliable, and inexpensive blood serum test which can be used for the detection and differentiation of AD and related neurodegenerative diseases.

2. Materials and Methods

Institutional Review Boards at the University at Albany and at Albany Medical College (AMC) reviewed and approved the research protocol for the human studies reported herein. All study subjects submitted a written informed consent to the authorized study personnel before participating.

2.1. Chemicals

Solutions of sodium hydroxide (Sigma-Aldrich, St. Louis, MO, USA), hydroxylamine hydrochloride (Honeywell Fluka, Morris Plains, NJ, USA), silver nitrate (Merck, Kenilworth, NJ, USA), and ethylenediaminetetraacetic acid (Sigma-Aldrich, St. Louis, MO, USA) were prepared with distilled water.

2.2. Human Blood Serum Samples

Serum samples were collected from 48 individuals recruited from the subspecialty neurological clinics at the Alzheimer’s Center and the Parkinson’s disease and Movement Disorders Center, both at AMC. Three groups of subjects were recruited: AD, OD, and HC. The AD group included patients diagnosed with Alzheimer’s disease at either mild (n = 10) or moderate (n = 10) stages of the disease. The OD group consisted of patients diagnosed with other neurodegenerative dementias including Lewy body dementia (n = 5), Parkinson’s disease dementia (n = 10), and frontotemporal dementia (FTD, n = 3, where two of these donors have a specific variant of FTD called progressive supranuclear palsy). Medical history of all recruited patients was carefully reviewed by a trained neurologist who established the clinical diagnoses. Clinical assessments to determine the level of dementia were made using the following guidelines. Dementia was defined by a Clinical Dementia Rating Scale (CDR) of 0.5 or more for all dementia subjects [41]. AD was diagnosed by the NINDS-ADRDA criteria [42] and CDR was used to determine the stage of AD (CDR of 0.5–1 indicates mild AD, and CDR of 2.0 indicates moderate AD). Criteria for the diagnosis of Parkinson’s disease utilized the Unified Parkinson’s Disease Rating Scale [43,44] and diagnosis of Lewy body dementia was made following the guidelines outlined by the DLB Consortium [45]. The criteria for the diagnosis of FTD were applied as reported by Neary et al. [46].
The third group of blood serum donors included ten age- and gender-matched healthy control volunteers. Selected control subjects were typically the patients’ spouses in general good health with no active major disease and who were free of any neurological or psychiatric disorders. The healthy controls thus had similar ethnic and socioeconomic backgrounds as well as environmental factors including age, education, race, religion, diet, and everyday lifestyle. A general summary of the demographic information for the study subjects is presented in Table 1.

2.3. Colloidal Silver Nanoparticles (AgNPs)

SERS measurements of blood serum were acquired using aqueous colloidal suspensions of hydroxylamine-reduced AgNPs prepared as described by Leopold and Lendl [47]. Briefly, colloidal AgNPs were obtained through the addition of 9 mL of 0.1 M sodium hydroxide solution mixed with 10 mL of 6 × 10−2 M hydroxylamine hydrochloride solution to 180 mL of 1.11 × 10−3 M silver nitrate aqueous solution. The mixture was shaken until it appeared homogeneous; the solution was a milky gray color. The concentration of silver nitrate and hydroxylamine hydrochloride/sodium hydroxide was calculated to be 10−3 and 3 × 10−3/4.5 × 10−3 M in the final obtained reaction mixture. The silver colloidal solution was then centrifuged at 9300 rcf for 10 min, and the supernatant was discarded. After making a 1:100 dilution, the AgNPs were characterized by an UV-Vis absorption maximum at 427 nm, as seen in Figure 1, which is in good agreement with the literature [47].

2.4. Sample Preparation

Peripheral blood samples of 5 mL were collected from AMC study subjects and immediately processed into aliquots of anticoagulated (1 mL) whole blood, plasma, and serum which were stored at −80 °C until analysis. EDTA was used as an anticoagulant at a concentration of 1.5 mg/mL. All blood samples were drawn and handled identically. For SERS measurements, the blood serum sample was defrosted on ice. The sample was then mixed with colloidal AgNPs in a 1:1 ratio by volume, bringing the total volume for each measurement to 40 μL. The solution was gently mixed with a pipette tip to ensure a homogeneous mixture was made. The solution was placed on a microscopic glass slide covered with aluminum foil and allowed to completely dry under gentle air flow for 5 min. The aluminum foil was used as a substrate due to the low fluorescence signal produced.

2.5. Spectroscopic Measurements

The SERS spectra were collected using a Renishaw inVia confocal Raman spectrometer equipped with a research-grade Leica microscope, and 20× long-range objective (numerical aperture of 0.35). The spectra were recorded using WiRE 3.2 software in the range of 400–1800 cm−1 under 785 nm excitation. To avoid photodegradation of the samples, the laser power was reduced to 11 mW. Spectra were recorded from ten sequential spots on the sample, with one 10 s accumulation collected at each spot. The data was collected using automatic mapping, in a grid-like manner, through the use of a Renishaw PRIOR automatic stage. No coffee ring effect observed; however, a cast film was observed. The area of the sample that was mapped was chosen at random.

2.6. Data Treatment

The SERS spectra were imported to MATLAB R2012a (7.14) software for preprocessing methods and analysis. The adaptive iteratively reweighted penalized least squares (airPLS) algorithm was used for fluorescent background removal [48]. The SERS spectra were normalized by total area and mean centered [49]. GA data analysis was performed using PLS_Toolbox 6.2 (Eigenvector Research, Inc.) available within the MATLAB environment [49] and the ANN analysis was performed using R-project software, ver. 3.4.4.

2.7. Genetic Algorithm

Genetic Algorithm (GA) [50] was employed using PLS_Toolbox within the MATLAB environment to analyze the spectroscopic data used in the modeling events described below. Using GA, all variables within the Raman spectral dataset collected from serum are considered in order to determine their significance for sample classification. When the spectral data consists of many features in the dataset, as is seen herein, the data can become too noisy; this problem can be addressed through reducing the dimensionality of the dataset by selecting only a subset of informative spectral features and removing the uninformative ones. The process involves selecting variables with the lowest prediction error (RMSE-CE) through simulated natural selection, which involves genetic mutation and recombination of chromosomes, or subsets of variables.
The Raman bands which are determined by GA as the most useful for differentiating between classes of blood serum data can be further studied to determine their possible clinical relevance, including indicating biochemical changes which occur within serum during progression of AD.

2.8. Artificial Neural Networks

Machine learnings methods are known as data driven approaches; several different techniques were applied to the dataset to determine which algorithm could best solve the problem at hand. As a result, Artificial Neural Networks (ANNs) was selected as the optimal choice for multivariate data analysis [51]. ANNs are simplified mathematical models which resemble the human brain and can be described as several layers of interconnected “neurons.” Each neuron can perform mathematical operations on the input values and transfer the result to the next layer of neurons. The parameters of the mathematical operations are tuned by a learning algorithm. The power of ANNs is based on the collective behavior of many interconnected neurons and their ability to exchange and retain meaningful information. Advantageously, ANNs do not rely on data distribution and thus can derive patterns from large, complex, and noisy datasets which are typically difficult to analyze by conventional linear data analysis methods. ANNs are fault-tolerant and capable of generalization, which enables them to restore missing data and represent “real-world” solutions for the generalization of a problem.
However, depending on the complexity of the model, ANNs can be time consuming to perform and the number of hidden layers has to be determined to completely represent all features of the dataset. Furthermore, it is not always clear how ANNs approach the solution, and as such, they are sometimes called “black boxes” [52,53,54]. ANNs tend to converge on local minima, yielding reduced generalization abilities [55,56]. However, ANNs consistently outperform other classification methods, indicating the ability of non-linear data generalization to achieve high predictive capabilities [27,54,57,58,59]. The process of searching for a global minima can be complicated; to alleviate this problem herein, each training event during the testing phase was repeated twenty times, each time with new random starting values for the network weights; the network with the lowest error was then determined. Overtraining was assessed by performance of the model on the validation data. One established statistical technique to evaluate if a model is over-trained is to use a separate dataset for validation of the trained model. The cross-validation technique is an extension of this principle. The measurements of model accuracy on the validation dataset are accepted as an estimate of model accuracy on novel, unseen data. A well-generalized network should give good prediction results on the validation data, and these should be comparable to the prediction results on the test set [60].
To design, test, and validate ANN models, the “neuralnet” package in the R environment was employed [61]. The specific form of ANN used for AD diagnostics was a multilayer perceptron (MLP). The MLP architecture was designed and optimized by varying the number of hidden layers and the number of neurons in each layer; each architecture was tested using transfer and resilient back-propagation training methods. For each classification event, a random split was used on the original dataset to divide it into training (90%) and test (10%) sets during the testing phase and then during cross-validation, 10% of data was separated from the training set and used.
The training dataset was used to determine the relationship between dependent and independent variables. The test set, in turn, assesses the performance of the model. The assignment of data to either the training set or the test set is done by random sampling, where multiple spectra from donors were used as individual inputs. The random split was applied to the original dataset of 480 spectra to divide it into training (90%) and test (10%) datasets; this corresponds to 48 randomly selected individual spectra used for testing and the remaining spectra (432) to be used for training. This approach has some limitations, including overlooking the high probability that spectra from the same donor will appear in both the training and the test sets, therefore potentially creating a class imbalance problem in some of the training and test datasets. Therefore, after the optimal model network structure was selected and evaluated using the test dataset, an estimation of how the ANN is expected to perform when applied for making general prediction was tested using the Bootstrap Latin Partition (BLP) cross-validation scheme. Considering the small dataset, the BLP cross-validation scheme was chosen to evaluate the statistical fit of the selected ANN model. It has been shown that the BLP method reduces performance dependence by isolation and separation of the training and validation datasets and reduces prediction accuracy variance. Further, the BLP scheme preserves class distribution in both the training and validation sets and is performed in a donor-wise manner. This means that all spectra collected for an individual donor were excluded from the training dataset and then used to test the built model. Sensitivity (true-positive rate), selectivity (true-negative rate), and accuracy parameters and area under the receiver operating characteristic curve (AUC) of classification for validating data subsets were used to assess the performance of the calculated models. Results of BLP are reported as averages across all folds and bootstraps.

3. Results and Discussion

Both the Raman and SERS spectra of serum samples were measured. Figure 2 shows the comparison of the SERS spectrum of serum mixed with AgNPs to the regular Raman spectrum of blood serum. The observed dramatic increase in intensity of many vibrational bands within the SERS spectrum indicates that significant enhancement effects from the silver colloids was achieved.
Serum samples were collected from three groups of subjects: AD (mild AD, n = 10 and moderate AD, n = 10), other neurodegenerative dementias (OD, n = 18), and healthy controls (HC, n = 10). Supplementary Figure S1a shows the mean SERS serum spectrum for each group of subjects. Supplementary Figure S1b–f presents the difference spectrum and ±2 standard deviations (STD) for each of the compared groups. Although there are visible differences in the serum spectral profiles of different groups, the differentiation of the groups solely based on visual inspection can be unreliable, as confirmed by the difference spectra lying within 2 STDs for each compared group.
Because the differences between average spectra fall within 2 STDs of the mean, it is necessary to use multivariate statistical methods to fully understand the differentiation capabilities and significance of the observed spectral differences to be used for diagnostic purposes. Multivariate statistical methods allow for the capture and analysis of spectral variability by transforming hidden characteristic spectral features, which may be lost upon averaging of spectral datasets, into a discriminative algorithm. Further analysis demonstrated that high discrimination accuracy could be achieved by modeling nonlinear relationships between variables using ANNs.
It is important to note that a large number of features can lead to ANNs becoming overfit. Additionally, Raman spectra often contain spectral regions which may be non-informative. Therefore, GA was applied to optimize the generalization performance of the ANN models and to reduce the risk of overfitting. GA was used to select the subset of spectral variables which provide the most optimal information for discriminating between classes of donors in the training dataset, which consisted of 48 donors and 480 spectra. In general, more data leads to better accuracy due to a typically better quality dataset which has a closer domain and less noise. For GA, a single run was selected to have 71 individuals per population and 100 generations. A total of 300 runs were performed. After GA selected which spectral regions are the most informative, the Raman bands were identified and assigned to the biomolecules contributing to that vibrational mode (proteins, sugars, lipids, etc.). GA identified informative regions and bands of SERS serum spectra through comparison of the AD group to the HC group (Figure 3a) and the AD group to the OD group (Figure 3b). Spectral regions identified by GA as best influencing the differentiation capabilities are marked in red.
Although GA revealed a greater number of significant regions between AD and OD SERS serum spectra than it did between AD and HC SERS serum spectra, the number of identified bands were very close. GA selected 9 regions (14 bands) and 5 regions (11 bands) between the AD to OD and AD to HC comparisons, respectively. As can be seen in Figure 3, the regions selected by GA in both comparisons are complimentary, and as such they selectively target complimentary spectral biomarkers expressed for the differentiation of these three classes, which can be very advantageous for statistical modeling. The region borders, vibrational band positions, and tentative band assignments for presumptive molecular contributions are given in Table 2 and Table 3 for each comparison. Interestingly, the regions selected by GA for SERS comparison almost completely overlap with regions which were selected in comparisons of conventional Raman spectroscopic datasets identified for the same purpose [27]. This shows the consistency of the applied method for the classification of blood serum spectral datasets obtained using different Raman spectroscopic techniques. Further, the tentatively identified set of molecules listed in Table 2 and Table 3 can be used for future evaluation of the spectroscopic signature for AD. For this study, the selected regions that are the most significant differentiation were used for the rest of analysis, including during the building of the ANN models.
After GA, ANNs were used to build three different models. The first model was a binary model to distinguish between all AD and HC samples. The second and third models were tertiary models built to distinguish between mild AD, moderate AD, and HC samples and between HC, all AD, and OD samples. The spectral features selected by GA for the best discrimination between AD and HC samples was used as the input dataset for all ANN networks. Since ANN is a supervised method, it is important to characterize the modeled system sufficiently with input and output data. The common approach is to partition the dataset into three parts: the training, test, and validation sets. For the algorithm, the inputs were provided as data points of the Raman spectra (240 values of wavenumbers, selected by GA) and the assigned classes (AD, OD, and HC) were the identity labels. The output layer contained two and three neurons for the binary and the tertiary models, respectively. The network weights within the network are trained for multiple different configurations of ANNs based on the resilient backpropagation algorithm with weight backtracking of the adjustable parameters in order to minimize differences between the network output and the known labels of the samples. The threshold for the partial derivatives of the error function was set to a stopping criteria of 0.01. All hyper-parameters, i.e., parameters whose values are set before the learning process begins, influence how the weights (parameters) between the neuron connections will be trained by the algorithm. The hyper-parameters are identified by a trial-and-error process, and performance of the algorithm on the test dataset is used to determine the optimal ANN architecture. The network that provided the lowest prediction errors on the test set as compared to the others was selected as the final optimal ANN network. The confusion table seen in Table 4 summarizes the results that were achieved when using the optimal ANN network structures during the testing phase. The confusion table depicts to which class every spectrum from the test set was predicted as belonging and compares the predictions to the actual class. After the training process was finished, all network weights were fixed to their single optimized value.
The neural network architectures that yielded the best results during the testing phase consisted of one hidden layer with 10 neurons for the binary model, of two hidden layers with 60 and 10 neurons each for the AD tertiary model, and of two hidden layers with 60 and 20 neurons each in the OD tertiary model. The performance of the optimal ANN model is assessed using the BLP cross-validation procedure in order to assess the trained ANNs ability to differentiate between classes of Raman spectra. Ten Latin partitions bootstrapped five times were used to measure the generalized prediction accuracy of the models [64]. BLP will preserve the class distribution in both the training and validation sets and ensure all samples are used for prediction only once. Here, 10 training-prediction subsets of complete samples (donors), ensuring that each sample is only used once for prediction, were furnished. For each bootstrap, the data was split into training and prediction sets so that nine Latin partitions were combined into a training set for model building, and the tenth partition was used for prediction. All results from the ten prediction sets and across the five bootstraps were pooled and averaged, and the standard deviation was calculated. This approach was used to measure the generalized prediction ability of the three optimal ANN models built.
The final network results for each model are shown in Table 5. The performance parameters for all classes are listed as they were determined using BLP. The sensitivity (true-positive rate, i.e., percent of spectra belonging to a class that were correctly predicted as belonging to that class), specificity (true-negative rate, i.e., percent of spectra belonging to a class that were correctly predicted as not belonging to a different class), and accuracy (overall closeness of the predicted results to the known results) for each class prediction is reported. Table 5 represents the results of the ANN networks for HC vs AD differentiation in a binary model, for HC vs mild AD vs moderate AD in an AD tertiary model, and for HC vs AD vs OD in an OD tertiary model. The AD tertiary model demonstrates slightly worse performance than the AD binary and OD tertiary classification models, resulting in an accuracy of 94.84% as compared to the average accuracy of 96.47% and 98.31%, respectively. Such results can be expected, as only relatively minor changes in biochemical composition of blood serum are likely to be associated with AD progression from the mild stage to the moderate stage of the disease. With smaller sample sizes, one can expect larger variance between bootstrap results, however the observed variance of the performance parameters in this study, as determined by the calculated standard deviation, was very small thus indicating stability of the models.
The results shown in Table 5 correspond to the performance parameters for samples at the individual spectrum level. Each spectrum receives a probability estimate for belonging to each class; the spectrum is then assigned to the class corresponding to the highest probability estimate. To determine the classification of each donor, overall, the donor is assigned to the class the majority of its spectra were assigned as. For the binary model and tertiary OD model, all donors were correctly classified. In the tertiary AD model, however, four donors were misclassified, each time occurring in a different bootstrap.
The BLP cross-validation results shown in Table 5 exhibit good prediction results and prove that the model is not majorly overfit. Rigorous validation using the BLP scheme allows for overfitting to be avoided and ensures high quality of the resulting dataset. Further, accuracy is calculated in classification problems and is computed as the percentage of correctly classified inputs. The accuracy of the validation set is a good indicator of future classification generalization abilities. However, in order to monitor overfitting, the test and validation accuracies were compared, since an overfit model would be expected to perform much worse on the validation data. It is further expected that the test accuracy would ideally only be slightly higher or lower than the validation accuracy. This was confirmed by accuracy values achieved for binary AD, tertiary AD and tertiary OD models from the test set, which were 98% each for all models. These values are only slightly higher than the average accuracies of the BLP cross-validation results for the binary AD, tertiary AD and tertiary OD models, which were 96%, 95% and 98%, respectively (Table 5). Additionally, there have been a number of studies reported which use a very similar approach of Raman spectroscopy in combination with machine learning methods (specifically with ANN) for the analysis of the same size or smaller datasets than what is used in the current study. For example, the same approach was used to differentiate eleven patients with type 2 diabetes and nine controls (healthy subjects) from different anatomical locations, achieving 88.9–90.9% accuracy using a 10-fold cross-validation approach in donor-wise manner [65]. It should be noted that the study reported herein is designed as a proof-of-concept and more testing is necessary to obtain a strong understanding of the classification accuracy and generalizability of this framework.
To further understand the classification ability of the SERS spectra, receiver operating characteristic (ROC) curves were built using the “pROC” package [66] to generate ROC curves using the probability estimates for classifying spectra to a particular class. A ROC curve plots the sensitivity (true positive rate) versus the specificity (true negative rate). The points on the curve represent different thresholds for determining the class assignment of a spectrum and allows the user to understand the tradeoff between sensitivity and specificity for all possible thresholds, rather than just the one that was used in the ANN models. Using this method, it was possible to assess the performance of the model using a variety of thresholds. The results of the ROC analysis can be seen in Figure 4. The AUC value indicates the probability of correct classification. These results further support the strong classification ability of the SERS spectra to correctly determine to which class a donor belongs for diagnostic purposes.

4. Conclusions

A combination of multivariate analysis and surface-enhanced Raman spectroscopy using silver colloidal nanoparticles was applied for the analysis of blood serum from AD patients, healthy controls, and individuals with other forms of dementia. Through the use of ANNs as a multivariate analysis, it was possible to differentiate AD serum samples from normal and other dementia serum samples with high diagnostic sensitivity and specificity. The ANNs achieved a diagnostic sensitivity around 96% for differentiating between AD and HC samples, 95% for differentiating mild AD, moderate AD, and HC samples and 98% for differentiating HC, AD, and OD samples. Tentative assignments of the Raman bands selected by GA as the most useful for correct classification of the measured SERS spectra demonstrated spectroscopic markers that may reflect AD specific changes of particular nucleic acids, saccharides, and protein content in the blood serum of AD patients. These vibrational mode assignments will require further evaluation and experimentation in order to more specifically and reliably determine the biomolecules which are responsible for the changes in biochemical composition of the blood serum samples, which will be conducted in future studies. The results reported herein demonstrate the great potential for development of a successful method which combines silver nanoparticle-based SERS serum analysis with ANN-based diagnostic algorithms which can be used within clinical laboratory settings for the non-invasive detection and screening of Alzheimer’s disease.

Supplementary Materials

The following are available online at https://www.mdpi.com/2076-3417/9/16/3256/s1, Figure S1: The difference spectrum calculated between the different classes of average spectra, including ±2 standard deviations.

Author Contributions

Investigation and Writing – Original Draft Preparation, E.R.; Writing – Review & Editing, N.M.R.; Software, Validation, and Formal Analysis, L.H.; Conceptualization and Resources, D.Z, P.M, E.M., J.Q., E.A.Z, I.K.L.; Supervision, I.K.L.

Funding

This work was supported by the State University of New York Technology Accelerator Fund. E.M. was supported by the Riley Family Chair in Parkinson’s Disease.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Farlow, M.R. Etiology and pathogenesis of Alzheimer’s disease. Am. J. Health Pharm. 1998, 55, S5–S10. [Google Scholar] [CrossRef] [PubMed]
  2. Zellner, M.; Veitinger, M.; Umlauf, E. The role of proteomics in dementia and Alzheimer’s disease. Acta Neuropathol. 2009, 118, 181–195. [Google Scholar] [CrossRef] [PubMed]
  3. Irizarry, M.C. Biomarkers of Alzheimer disease in plasma. NeuroRX 2004, 1, 226–234. [Google Scholar] [CrossRef] [PubMed]
  4. Doecke, J.D.; Laws, S.M.; Faux, N.G.; Wilson, W.; Burnham, S.C.; Lam, C.P.; Mondal, A.; Bedo, J.; Bush, A.I.; Brown, B.; et al. Blood-based protein biomarkers for diagnosis of Alzheimer disease. Arch. Neurol. 2012, 69, 1318–1325. [Google Scholar] [CrossRef] [PubMed]
  5. Teunissen, C.E.; Lutjohann, D.; von Bergmann, K.; Verhey, F.; Vreeling, F.; Wauters, A.; Bosmans, E.; Bosma, H.; van Boxtel, M.P.; Maes, M.; et al. Combination of serum markers related to several mechanisms in Alzheimer’s disease. Neurobiol. Aging 2003, 24, 893–902. [Google Scholar] [CrossRef]
  6. Blennow, K.; Hampel, H.; Weiner, M.; Zetterberg, H. Cerebrospinal fluid and plasma biomarkers in Alzheimer disease. Nat. Rev. Neurol. 2010, 6, 131–144. [Google Scholar] [CrossRef] [PubMed]
  7. Rosén, C.; Hansson, O.; Blennow, K.; Zetterberg, H. Fluid biomarkers in Alzheimer’s disease—Current concepts. Mol. Neurodegener. 2013, 8, 20. [Google Scholar] [CrossRef]
  8. Britschgi, M.; Wyss-Coray, T. Blood Protein Signature for the Early Diagnosis of Alzheimer Disease. Arch. Neurol. 2009, 66, 161. [Google Scholar] [CrossRef]
  9. Mapstone, M.; Cheema, A.K.; Fiandaca, M.S.; Zhong, X.; Mhyre, T.R.; MacArthur, L.H.; Hall, W.J.; Fisher, S.G.; Peterson, D.R.; Haley, J.M.; et al. Plasma phospholipids identify antecedent memory impairment in older adults. Nat. Med. 2014, 20, 415–418. [Google Scholar] [CrossRef]
  10. Burns, D.H.; Rosendahl, S.; Bandilla, D.; Maes, O.C.; Chertkow, H.M.; Schipper, H.M. Near-infrared spectroscopy of blood plasma for diagnosis of sporadic Alzheimer’s disease. J. Alzheimer’s Dis. 2009, 17, 391–397. [Google Scholar] [CrossRef]
  11. Lee, K.S.; Chung, J.H.; Lee, K.H.; Shin, M.J.; Oh, B.H.; Hong, C.H. Bioplex analysis of plasma cytokines in Alzheimer’s disease and mild cognitive impairment. Immunol. Lett. 2008, 121, 105–109. [Google Scholar] [CrossRef] [PubMed]
  12. Tu, Q.; Chang, C. Diagnostic applications of Raman spectroscopy. Nanomedicine 2012, 8, 545–558. [Google Scholar] [CrossRef] [PubMed]
  13. Krafft, C. Bioanalytical applications of Raman spectroscopy. Anal. Bioanal. Chem. 2004, 378, 60–62. [Google Scholar] [CrossRef] [PubMed]
  14. Baena, J.R.; Lendl, B. Raman spectroscopy in chemical bioanalysis. Curr. Opin. Chem. Biol. 2004, 8, 534–539. [Google Scholar] [CrossRef] [PubMed]
  15. Manoharan, R.; Wang, Y.; Feld, M.S. Histochemical analysis of biological tissues using Raman spectroscopy. Spectrochim. Acta Part A Mol. Biomol. Spectrosc. 1996, 52, 215–249. [Google Scholar] [CrossRef]
  16. Sikirzhytski, V.; Sikirzhytskaya, A.; Lednev, I.K. Multidimensional Raman Spectroscopic Signatures as a Tool for Forensic Identification of Body Fluid Traces: A Review. Appl. Spectrosc. 2011, 65, 1223–1232. [Google Scholar] [CrossRef] [PubMed]
  17. Koo, T.W.; Berger, A.J.; Itzkan, I.; Horowitz, G.; Feld, M.S. Reagentless Blood Analysis by Near-Infrared Raman Spectroscopy. Diabetes Technol. Ther. 1999, 1, 153–157. [Google Scholar] [CrossRef]
  18. McLaughlin, G.; Doty, K.C.; Lednev, I.K. Raman spectroscopy of blood for species identification. Anal. Chem. 2014, 86, 11628–11633. [Google Scholar] [CrossRef]
  19. Sikirzhytski, V.; Sikirzhytskaya, A.; Lednev, I.K. Multidimensional Raman spectroscopic signature of sweat and its potential application to forensic body fluid identification. Anal. Chim. Acta 2012, 718, 78–83. [Google Scholar] [CrossRef]
  20. Pichardo-Molina, J.L.; Frausto-Reyes, C.; Barbosa-Garcí, O.; Huerta-Franco, R.; González-Trujillo, J.L.; Ramírez-Alvarado, C.A.; Gutiérrez-Juárez, G.; Medina-Gutiérrez, C. Raman spectroscopy and multivariate analysis of serum samples from breast cancer patients. Lasers Med. Sci. 2007, 22, 229–236. [Google Scholar] [CrossRef]
  21. Ralbovsky, N.M.; Lednev, I.K. Raman spectroscopy and chemometrics: A potential universal method for diagnosing cancer. Spectrochim. Acta Part A Mol. Biomol. Spectrosc. 2019, 219, 463–487. [Google Scholar] [CrossRef] [PubMed]
  22. Bispo, J.A.M.; Vieira, E.E.D.S.; Silveira, J.L.; Fernandes, A.B. Correlating the amount of urea, creatinine, and glucose in urine from patients with diabetes mellitus and hypertension with the risk of developing renal lesions by means of Raman spectroscopy and principal component analysis. J. Biomed. Opt. 2013, 18, 087004. [Google Scholar] [CrossRef] [PubMed]
  23. Han, H.W.; Yan, X.L.; Dong, R.X.; Ban, G.; Li, K. Analysis of serum from type II diabetes mellitus and diabetic complication using surface-enhanced Raman spectra (SERS). Appl. Phys. B 2009, 94, 667–672. [Google Scholar] [CrossRef]
  24. Basar, G.; Parlatan, U.; Seninak, S.; Gunel, T.; Benian, A.; Kalelioglu, I. Investigation of Preeclampsia Using Raman Spectroscopy. Spectrosc. Int. J. 2012, 27, 239–252. [Google Scholar] [CrossRef] [Green Version]
  25. Lattermann, A.; Matthäus, C.; Bergner, N.; Beleites, C.; Romeike, B.F.; Krafft, C.; Brehm, B.R.; Popp, J. Characterization of atherosclerotic plaque depositions by Raman and FTIR imaging. J. Biophotonics 2013, 6, 110–121. [Google Scholar] [CrossRef]
  26. Carmona, P.; Molina, M.; Calero, M.; Bermejo-Pareja, F.; Martínez-Martín, P.; Toledano, A. Discrimination Analysis of Blood Plasma Associated with Alzheimer’s Disease Using Vibrational Spectroscopy. J. Alzheimer’s Dis. 2013, 34, 911–920. [Google Scholar] [CrossRef] [PubMed]
  27. Ryzhikova, E.; Kazakov, O.; Halamkova, L.; Celmins, D.; Malone, P.; Molho, E.; Zimmerman, E.A.; Lednev, I.K. Raman spectroscopy of blood serum for Alzheimer’s disease diagnostics: Specificity relative to other types of dementia. J. Biophotonics 2015, 8, 584–596. [Google Scholar] [CrossRef]
  28. Ralbovsky, N.M.; Lednev, I.K. Raman hyperspectroscopy shows promise for diagnosis of Alzheimer’s. Biophotonics 2018, 4, 33–37. [Google Scholar]
  29. Schipper, H.M.; Kwok, C.S.; Rosendahl, S.M.; Bandilla, D.; Maes, O.; Melmed, C.; Rabinovitch, D.; Burns, D.H. Spectroscopy of human plasma for diagnosis of idiopathic Parkinson’s disease. Biomark. Med. 2008, 2, 229–238. [Google Scholar] [CrossRef]
  30. Henry, M.S.; Passmore, A.P.; Todd, S.; McGuinness, B.; Craig, D.; Johnston, J.A. The development of effective biomarkers for Alzheimer’s disease: A review. Int. J. Geriatr. Psychiatry 2013, 28, 331–340. [Google Scholar] [CrossRef]
  31. Krafft, C.; Steiner, G.; Beleites, C.; Salzer, R. Disease recognition by infrared and Raman spectroscopy. J. Biophotonics 2009, 2, 13–28. [Google Scholar] [CrossRef] [PubMed]
  32. Monien, B.H.; Apostolova, L.G.; Bitan, G. Early diagnostics and therapeutics for Alzheimer’s disease—How early can we get there? Expert Rev. Neurother. 2006, 6, 1293–1306. [Google Scholar] [CrossRef] [PubMed]
  33. Das, R.S.; Agrawal, Y.K. Raman spectroscopy: Recent advancements, techniques and applications. Vib. Spectrosc. 2011, 57, 163–176. [Google Scholar] [CrossRef]
  34. Larmour, I.A.; Graham, D. Surface enhanced optical spectroscopies for bioanalysis. Analyst 2011, 136, 3831–3853. [Google Scholar] [CrossRef] [PubMed]
  35. Negri, P.; Dluhy, R.A. Ag nanorod based surface-enhanced Raman spectroscopy applied to bioanalytical sensing. J. Biophotonics 2013, 6, 20–35. [Google Scholar] [CrossRef]
  36. Wabuyele, M.B.; Vo-Dinh, T. Detection of Human Immunodeficiency Virus Type 1 DNA Sequence Using Plasmonics Nanoprobes. Anal. Chem. 2005, 77, 7810–7815. [Google Scholar] [CrossRef] [PubMed]
  37. Bonifacio, A.; Dalla Marta, S.; Spizzo, R.; Cervo, S.; Steffan, A.; Colombatti, A.; Sergo, V. Surface-enhanced Raman spectroscopy of blood plasma and serum using Ag and Au nanoparticles: A systematic study. Anal. Bioanal. Chem. 2014, 406, 2355–2365. [Google Scholar] [CrossRef]
  38. Feng, S.; Chen, R.; Lin, J.; Pan, J.; Chen, G.; Li, Y.; Cheng, M.; Huang, Z.; Chen, J.; Zeng, H. Nasopharyngeal cancer detection based on blood plasma surface-enhanced Raman spectroscopy and multivariate analysis. Biosens. Bioelectron. 2010, 25, 2414–2419. [Google Scholar] [CrossRef]
  39. Feng, S.; Pan, J.; Wu, Y.; Lin, D.; Chen, Y.; Xi, G.; Lin, J.; Chen, R. Study on gastric cancer blood plasma based on surface-enhanced Raman spectroscopy combined with multivariate analysis. Sci. China Life Sci. 2011, 54, 828–834. [Google Scholar] [CrossRef] [Green Version]
  40. Solomon, P.R.; Murphy, C.A. Early diagnosis and treatment of Alzheimer’s disease. Expert Rev. Neurother. 2008, 8, 769–780. [Google Scholar] [CrossRef]
  41. Morris, J.C. The Clinical Dementia Rating (CDR): Current version and scoring rules. Neurology 1993, 43, 2412–2414. [Google Scholar] [CrossRef] [PubMed]
  42. McKhann, G.; Drachman, D.; Folstein, M.; Katzman, R.; Price, D.; Stadlan, E.M. Clinical diagnosis of Alzheimer’s disease: Report of the NINCDS-ADRDA Work Group under the auspices of Department of Health and Human Services Task Force on Alzheimer’s Disease. Neurology 1984, 34, 939–944. [Google Scholar] [CrossRef] [PubMed]
  43. Movement Disorder Society Task Force on Rating Scales for Parkinson’s Disease. The Unified Parkinson’s Disease Rating Scale (UPDRS): Status and recommendations. Mov. Disord. 2003, 18, 738–750. [Google Scholar] [CrossRef] [PubMed]
  44. Goetz, C.G.; Tilley, B.C.; Shaftman, S.R.; Stebbins, G.T.; Fahn, S.; Martinez-Martin, P.; Poewe, W.; Sampaio, C.; Stern, M.B.; Dodel, R.; et al. Movement Disorder Society-sponsored revision of the Unified Parkinson’s Disease Rating Scale (MDS-UPDRS): Scale presentation and clinimetric testing results. Mov. Disord. 2008, 23, 2129–2170. [Google Scholar] [CrossRef] [PubMed]
  45. McKeith, I.G.; Dickson, D.W.; Lowe, J.; Emre, M.; O’Brien, J.T.; Feldman, H.; Cummings, J.; Duda, J.E.; Lippa, C.; Perry, E.K.; et al. Diagnosis and management of dementia with Lewy bodies: Third report of the DLB Consortium. Neurology 2005, 65, 1863–1872. [Google Scholar] [CrossRef] [PubMed]
  46. Neary, D.; Snowden, J.S.; Gustafson, L.; Passant, U.; Stuss, D.; Black, S.; Freedman, M.; Kertesz, A.; Robert, P.H.; Albert, M.; et al. Frontotemporal lobar degeneration: A consensus on clinical diagnostic criteria. Neurology 1998, 51, 1546–1554. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  47. Leopold, N.; Lendl, B. A New Method for Fast Preparation of Highly Surface-Enhanced Raman Scattering (SERS) Active Silver Colloids at Room Temperature by Reduction of Silver Nitrate with Hydroxylamine Hydrochloride. J. Phys. Chem. B 2003, 107, 5723–5727. [Google Scholar] [CrossRef]
  48. Zhang, Z.M.; Chen, S.; Liang, Y.Z. Baseline correction using adaptive iteratively reweighted penalized least squares. Analyst 2010, 135, 1138–1146. [Google Scholar] [CrossRef]
  49. Wise, B.M.; Gallagher, N.B.; Bro, R.; Shaver, J.M.; Windig, W.; Koch, R.S.; OÕSullivan, D. PLS_Toolbox 6 for Use with Matlab; Software, Eigenvector Research, Inc.: Wenatchee, WA, USA, 2011. [Google Scholar]
  50. Niazi, A.; Leardi, R. Genetic algorithms in chemometrics. J. Chemom. 2012, 26, 345–351. [Google Scholar] [CrossRef]
  51. Bishop, C.M. Neural Networks for Pattern Recognition; Oxford University Press: Oxford, UK, 1995. [Google Scholar]
  52. Smith, A.; Nugent, C.; McClean, S.; Nugent, C.; McClean, S. Evaluation of inherent performance of intelligent medical decision support systems: Utilising neural networks as an example. Artif. Intell. Med. 2003, 27, 1–27. [Google Scholar] [CrossRef]
  53. Tung, W.L.; Quek, C.; Cheng, P. GenSo-EWS: A novel neural-fuzzy based early warning system for predicting bank failures. Neural Netw. 2004, 17, 567–587. [Google Scholar] [CrossRef] [PubMed]
  54. Wall, R.; Cunningham, P.; Walsh, P.; Byrne, S. Explaining the output of ensembles in medical decision support on a case by case basis. Artif. Intell. Med. 2003, 28, 191–206. [Google Scholar] [CrossRef] [Green Version]
  55. Everitt, B.S.; Dunn, G. Applied Multuvariate Data Analysis; John Wiley & Son, Ltd.: Hoboken, NJ, USA, 2001; pp. 248–268. [Google Scholar]
  56. Cheng, B.; Titterington, D.M. Neural Networks: A Review from a Statistical Perspective. Stat. Sci. 1994, 9, 2–30. [Google Scholar] [CrossRef]
  57. Chernushevich, I.V.; Loboda, A.V.; Thomson, B.A. An introduction to quadrupole–time-of-flight mass spectrometry. J. Mass Spectrom. 2001, 36, 849–865. [Google Scholar] [CrossRef] [PubMed]
  58. Eftekhar, B.; Mohammad, K.; Ardebili, H.E.; Ghodsi, M.; Ketabchi, E. Comparison of artificial neural network and logistic regression models for prediction of mortality in head trauma based on initial clinical data. BMC Med. Inform. Decis. Mak. 2005, 5, 3. [Google Scholar] [CrossRef]
  59. Shen, L.; Tan, E.C. Reducing multiclass cancer classification to binary by output coding and SVM. Comput. Biol. Chem. 2006, 30, 63–71. [Google Scholar] [CrossRef]
  60. Ablameyko, S. Neural Networks for Instrumentation, Measurement and Related Industrial Applications; IOS Press: Amsterdam, The Netherlands, 2003. [Google Scholar]
  61. Guenther, F.; Fritsch, S. Neuralnet: Training of Neural Networks. R J. 2010, 2, 30–38. [Google Scholar] [CrossRef]
  62. De Gelder, J.; De Gussem, K.; Vandenabeele, P.; Moens, L. Reference database of Raman spectra of biological molecules. J. Raman Spectrosc. 2007, 38, 1133–1147. [Google Scholar] [CrossRef]
  63. Movasaghi, Z.; Rehman, S.; Rehman, I.U. Raman spectroscopy of biological tissues. Appl. Spectrosc. Rev. 2007, 42, 493–541. [Google Scholar] [CrossRef]
  64. Dietterich, T.G. Approximate statistical tests for comparing supervised classification learning algorithms. Neural Comput. 1998, 10, 1895–1923. [Google Scholar] [CrossRef]
  65. Guevara, E.; Torres-Galván, J.C.; Ramírez-Elías, M.G.; Luevano-Contreras, C.; González, F.J. Use of Raman spectroscopy to screen diabetes mellitus with machine learning tools. Biomed. Opt. Express 2018, 9, 4998–5010. [Google Scholar] [CrossRef] [PubMed]
  66. Robin, X.; Turck, N.; Hainard, A.; Tiberti, N.; Lisacek, F.; Sanchez, J.C.; Müller, M. pROC: An open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinform. 2011, 12, 77. [Google Scholar] [CrossRef] [PubMed]
Figure 1. The UV-visible absorption spectrum of the AgNPs colloid. The absorption maximum is located at 427 nm.
Figure 1. The UV-visible absorption spectrum of the AgNPs colloid. The absorption maximum is located at 427 nm.
Applsci 09 03256 g001
Figure 2. SERS spectrum (red) of the blood serum sample from a normal subject (HC) obtained by mixing the serum sample with AgNPs colloids in a 1:1 ratio, and the regular Raman spectrum (blue) of the same blood serum sample without the AgNP colloids. Spectra were recorded under the same conditions for comparison.
Figure 2. SERS spectrum (red) of the blood serum sample from a normal subject (HC) obtained by mixing the serum sample with AgNPs colloids in a 1:1 ratio, and the regular Raman spectrum (blue) of the same blood serum sample without the AgNP colloids. Spectra were recorded under the same conditions for comparison.
Applsci 09 03256 g002
Figure 3. The most informative SERS spectral bands and regions as selected by GA for differentiation between: (a) AD (all stages) and HC serum samples and (b) AD (all stages) and OD serum samples. Selected regions are depicted in red on the average SERS pattern for serum of all classes in order to make comparisons.
Figure 3. The most informative SERS spectral bands and regions as selected by GA for differentiation between: (a) AD (all stages) and HC serum samples and (b) AD (all stages) and OD serum samples. Selected regions are depicted in red on the average SERS pattern for serum of all classes in order to make comparisons.
Applsci 09 03256 g003
Figure 4. The calculated ROC curves using BLP prediction values of the ANN models for differentiating AD vs HC (a) in the binary model, and predictions from both tertiary models, which differentiates one class against all others. The AD tertiary model differentiates mild AD (b1) vs moderate AD (b2) versus HC (b3). The OD tertiary model differentiates AD (c1) vs HC (c2) vs OD (c3). AUC refers to area under ROC curve value calculated from the model predictions against the outcome that shows the efficacies of the ANN models.
Figure 4. The calculated ROC curves using BLP prediction values of the ANN models for differentiating AD vs HC (a) in the binary model, and predictions from both tertiary models, which differentiates one class against all others. The AD tertiary model differentiates mild AD (b1) vs moderate AD (b2) versus HC (b3). The OD tertiary model differentiates AD (c1) vs HC (c2) vs OD (c3). AUC refers to area under ROC curve value calculated from the model predictions against the outcome that shows the efficacies of the ANN models.
Applsci 09 03256 g004
Table 1. Summary of demographic information on the blood serum study subjects.
Table 1. Summary of demographic information on the blood serum study subjects.
AMC Subjects (Serum Samples)
ADHC
(n = 10)
OD
(n = 18)
Moderate Stage (n = 10)Mild Stage (n = 10)Total AD (n = 20)
Age in years ± STD76 ± 1072.4 ± 8.474 ± 9.368 ± 1173 ± 7
Male (%)4060505078
Female (%)6040505022
Table 2. Tentative assignments of the most important regions in the SERS spectrum of blood serum for the discrimination between AD and HC, as determined by GA.
Table 2. Tentative assignments of the most important regions in the SERS spectrum of blood serum for the discrimination between AD and HC, as determined by GA.
GA RegionPeak PositionVibrational ModeContributions [15,20,24,62,63]
1674–710683
725
υ(C-S), υ(C-C),
(C-N), (CH2),
CN-(CH3)3,
N+(CH3)3
adenine, glycine, tryptophan, histidine, proline, oleic acid, triolein, mannose, citric acid, coenzyme A, acetyl coenzyme A, glutathione, phospholipids
2899–973886
933
958
δ(C-O-H),
υ(C-C), υ(CH3),
δRing vibration,
(CH2), (C-O-C)
guanine, glycine, valine, histidine, proline, hydroxyproline, gluthatione, α helix, fatty acids, cholesterol, mannose, galactosamine, N-acetyl-D-glucosamine
31087–11231071
1092
1134
υ(C-N), υ(C-C),
υ(C-O)
adenine, glycine, serine, glutamate, tryptophan, histidine, fatty acids, mannose, glucose, lactose, galactosamine, N-acetyl-D-glucosamine, acetyl coenzyme A, D-fructose-6-phosphate, glutathione, phospholipids
41162–11991203υ(C-N), (N-H),
(CH2),
tyrosine, phenylalanine, tryptophan, glycine, proline, amide III, N-acetyl-D-glucosamine, coenzyme A, acetyl coenzyme A,
51613–16491612
1654
υ(C=O), (NH2),
υ(C=C)
adenine, cytosine, thymine, tryptophan, tyrosine, fatty acids, galactosamine, pyruvate, coenzyme A, acetoacetate, ascorbic acid, amide I, α helix, phospholipid
ᶹ: stretching mode; ᵟ: bending mode.
Table 3. Tentative assignments of the most important regions in the SERS spectrum of blood serum for the discrimination between AD and OD, as determined by GA.
Table 3. Tentative assignments of the most important regions in the SERS spectrum of blood serum for the discrimination between AD and OD, as determined by GA.
GA RegionPeak PositionVibrational ModeContributions [15,20,24,62,63]
1448–485495Ring torsion vibration,
υ(N-C-S)
guanine, arginine, valine, glutamate, tyrosine, tryptophan, mannose, glycogen, acetyl coenzyme A, acetoacetate
2523–560530υ(S-S) disulfide stretching in proteinsuracil, alanine, phenylalanine, glutamate, tyrosine, tryptophan, cysteine, mannose, fructose, galactosamine, N-acetyl-D-glucosamine, phosphoenolpyruvate, acetyl coenzyme A, D-fructose-6-phosphate, riboflavin, phosphatidylserine
3598–635590
636
υ(C-S), υ4(PO43-)ascorbic acid, glutamate, tryptophan, methionine, tyrosine, lactose, N-acetyl-D-glucosamine, pyruvate, acetyl coenzyme A, phosphatidylinositol
4749–785742
766
792
δRing vibrationcytosine, thymine, uracil, glutamate, tyrosine, tryptophan, proline, N-acetyl-D-glucosamine, phosphoenolpyruvate, acetoacetate, phospholipids, riboflavin, pyrimidine, phosphatidylserine
5861–936886
933
δ(C-O-H),
υ(C-C),
δRing vibration,
(CH2), (C-O-C)
guanine, glycine, valine, histidine, proline, hydroxyproline, gluthatione, α helix, fatty acids, mannose, galactosamine, N-acetyl-D-glucosamine
61124–11611134υ(C-N), υ(C-C)adenine, glycine, serine, fatty acids, mannose, phospholipids
71312–13861331
1370
υ(C-H), (C3-C3), (C5-O5), (CHα),
(CH3CH2),
δ(CH3), δ(CH2),
υ(CH3),
adenine, thymine, guanine, valine, serine, arginine, phenylalanine, tyrosine, tryptophan, histidine, proline, fatty acids, lactose, mannose, galactosamine, N-acetyl-D-glucosamine, acetyl coenzyme A, gluthatione, phospholipids, amid III
81613–16491612
1654
υ(C=O), (NH2),
υ(C=C)
adenine, cytosine, thymine, tryptophan, tyrosine, fatty acids, galactosamine, pyruvate, coenzyme A, acetoacetate, ascorbic acid, amide I, α helix, phospholipids
91688–1724
ᶹ: stretching mode; ᵟ: bending mode.
Table 4. Confusion table of the prediction results achieved for selected networks during the testing phase on the test datasets for all three models with spectra used as individual inputs.
Table 4. Confusion table of the prediction results achieved for selected networks during the testing phase on the test datasets for all three models with spectra used as individual inputs.
Binary Model
Predicted/ActualADHC
AD271
HC012
Tertiary Model–AD
Predicted/ActualMild ADModerate ADHC
Mild AD1100
Moderate AD 1140
HC0014
Tertiary Model–OD
Predicted/ActualADHCOD
AD2800
HC160
OD0013
Table 5. Final network results of binary and tertiary ANN models for differentiating between classes of blood serum samples.
Table 5. Final network results of binary and tertiary ANN models for differentiating between classes of blood serum samples.
Binary Model
Sensitivity (±STD)Specificity (±STD)Accuracy (±STD)
AD vs. HC98.00 (0.447)93.40 (1.497)96.47 (0.688)
Tertiary Model–AD
Sensitivity (±STD)Specificity (±STD)Accuracy (±STD)
Mild AD88.80 (0.047)94.50 (0.025)93.13 (0.010)
Moderate AD90.80 (0.025)96.80 (0.007)94.79 (0.010)
HC97.20 (0.010)96.30 (0.009)96.60 (0.006)
Tertiary Model–OD
Sensitivity (±STD)Specificity (±STD)Accuracy (±STD)
AD96.40 (0.015)98.21 (0.003)97.46 (0.006)
HC95.00 (0.007)98.79 (0.007)98.00 (0.006)
OD100.00 (0.000)99.13 (0.002)99.46 (0.001)

Share and Cite

MDPI and ACS Style

Ryzhikova, E.; Ralbovsky, N.M.; Halámková, L.; Celmins, D.; Malone, P.; Molho, E.; Quinn, J.; Zimmerman, E.A.; Lednev, I.K. Multivariate Statistical Analysis of Surface Enhanced Raman Spectra of Human Serum for Alzheimer’s Disease Diagnosis. Appl. Sci. 2019, 9, 3256. https://doi.org/10.3390/app9163256

AMA Style

Ryzhikova E, Ralbovsky NM, Halámková L, Celmins D, Malone P, Molho E, Quinn J, Zimmerman EA, Lednev IK. Multivariate Statistical Analysis of Surface Enhanced Raman Spectra of Human Serum for Alzheimer’s Disease Diagnosis. Applied Sciences. 2019; 9(16):3256. https://doi.org/10.3390/app9163256

Chicago/Turabian Style

Ryzhikova, Elena, Nicole M. Ralbovsky, Lenka Halámková, Dzintra Celmins, Paula Malone, Eric Molho, Joseph Quinn, Earl A. Zimmerman, and Igor K. Lednev. 2019. "Multivariate Statistical Analysis of Surface Enhanced Raman Spectra of Human Serum for Alzheimer’s Disease Diagnosis" Applied Sciences 9, no. 16: 3256. https://doi.org/10.3390/app9163256

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop