Saint Petersburg, St. Petersburg, Russian Federation
Saint Petersburg, St. Petersburg, Russian Federation
Saint Petersburg, St. Petersburg, Russian Federation
Saint Petersburg, Russian Federation
Saint Petersburg, St. Petersburg, Russian Federation
Saint Petersburg, St. Petersburg, Russian Federation
Saint Petersburg, St. Petersburg, Russian Federation
Saint Petersburg, St. Petersburg, Russian Federation
Multiple myeloma and chronic lymphocytic leukemia are oncological diseases of the blood, which remain incurable today. The paper proposes a method for classifying blood serum samples from patients with multiple myeloma, chronic lymphocytic leukemia and healthy donors based on the analysis of their spectra in the mid-infrared (IR) range. IR spectra of blood serum were recorded using a Tensor 27 IR Fourier spectrometer in D2O solution. To analyze the obtained spectra in this work, a machine learning algorithm was implemented – the principal component analysis. The use of the principal component analysis made it possible to significantly simplify the representation of the array of spectral data. 45 samples of blood serum were analyzed in the work. As a result of applying this approach, the studied set of samples is divided into three disjoint sets corresponding to blood serum samples of patients with multiple myeloma, chronic lymphocytic leukemia and healthy donors. Thus, the principal component method can be successfully applied to classify blood serum samples of patients with diagnoses of multiple myeloma and chronic lymphocytic leukemia. The universality of the proposed algorithm allows us to expect that in the future it is possible to apply a similar approach for other oncohematological diseases.
Principal Component Analysis, oncohematological diseases, IR spectroscopy, multiple myeloma, chronic lymphocytic leukemia
1. Barlogie B., Gale R.P. Multiple Myeloma and Chronic Lymphocytic Leukemia: Commonalities and Differences in Biology and Therapy. Leukemia & Lymphoma, 1991, vol. 5, no. 1, pp. 27-32.
2. Raab M.S., Podar K., Breitkreutz I., Richardson P.G., Anderson K.C. Multiple myeloma. 2009, vol. 374, 16 p.
3. Mikhailets E.S., Chernyshev D.A., Telnaya E.A., Plotnikova L.V., Garifullin A.D., Kuvshinov A.Yu., Voloshin S.V., Polyanichko A.M. Protein secondary structure analysis of serum from patients with oncohematological diseases. Journal of Physics: Conference Series, 2021, vol. 2103, p. 012053, doi: 10.1088/1742-6596/2103/1/012053.
4. Telnaya E.A., Plotnikova L.V., Garifullin A.D., Kuvshinov A.Yu., Voloshin S.V., Polyanichko A.M. Infrared spectroscopy of blood serum of patients with oncohematological diseases. St. Petersburg State University. Biophysica, 2020, vol. 65, no. 6, pp. 1154-1160. (In Russ.)
5. Dunn K. Process Improvement Using Data, 2010.
6. Powell W.B. Approximate Dynamic Programming: Solving the curses of dimensionality. John Wiley & Sons, 2007, vol. 703.
7. Polyanichko A.M., Romanov N.M., Starkova T.Y. et al.Analysis of the secondary structure of the linker histone H1 by infrared absorption spectra. Tsitologiya, 2014, vol. 56, no. 4, pp. 316-322. (In Russ.)
8. Python 3.10.0.
9. Jupyter Notebook.
10. Rodionova O.E., Pomerantsev A.L. Chemometrics in analytical chemistry. 2006. (In Russ.)