Bibliographic Details
| Title: |
PyFasma: an open-source, modular Python package for preprocessing and multivariate analysis of Raman spectroscopy data. |
| Authors: |
Pavlou, Eleftherios, Kourkoumelis, Nikolaos |
| Source: |
Analyst; 7/21/2025, Vol. 150 Issue 4, p3112-3122, 11p |
| Subject Terms: |
RAMAN spectroscopy, MULTIVARIATE analysis, PYTHON programming language, FEATURE extraction, INTEGRATED software |
| Abstract: |
Raman spectroscopy is a versatile, label-free technique for probing molecular composition in biological samples. However, the detection of subtle biochemical traits in high-throughput spectral datasets requires careful preprocessing, dimensionality reduction, and statistically sound analytical strategies. We present PyFasma, an open-source Python package for Raman spectroscopy, integrating essential preprocessing tools (e.g., spike removal, smoothing, baseline correction, normalization), multivariate techniques (PCA, PLS-DA), and spectral deconvolution within a modular, Jupyter Notebook-friendly framework. In addition to describing the software, we demonstrate PyFasma's capabilities through a practical biomedical case study comparing Raman spectra from healthy and osteoporotic cortical bone samples. The results revealed statistically significant differences in mineral-to-matrix ratio and crystallinity between assigned groups, with PCA and PLS-DA successfully distinguishing pathological from normal bone spectra. PyFasma encourages best practices in model validation, including the powerful but often overlooked, repeated stratified cross-validation, enhancing the generalizability of multivariate analyses. It also offers an easy-to-use, extensible solution for Raman data analysis, enabling the reproducible and robust interpretation of complex spectra of biological samples. [ABSTRACT FROM AUTHOR] |
|
Copyright of Analyst is the property of Royal Society of Chemistry and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.) |
| Database: |
Complementary Index |