LLM-SPSS: An Efficient LLM-Based Secure Partitioned Storage Scheme in Distributed Hybrid Clouds.

Saved in:
Bibliographic Details
Title: LLM-SPSS: An Efficient LLM-Based Secure Partitioned Storage Scheme in Distributed Hybrid Clouds.
Authors: Zhou, Ran, Che, Bichen, Yang, Liangbin
Source: Electronics (2079-9292); Jan2026, Vol. 15 Issue 1, p30, 18p
Subject Terms: HYBRID cloud computing, CLOUD storage, LANGUAGE models, PARALLEL programming, CLASSIFICATION, PARALLEL algorithms, COMPUTER performance
Abstract: With the growing adoption of hybrid cloud storage, the identification and protection of sensitive information within large-scale unstructured data has become increasingly challenging. Traditional rule-based and machine learning approaches have limitations in context-aware sensitive data classification and large-scale processing. In this work, a novel framework named LLM-SPSS, implementing a secure and confidential storage layout for distributed hybrid clouds through a fine-tuned XLM-R Base model and multi-dimensional data partitioning, is proposed. First, a fine-tuned XLM-R Base model with adaptive prompt tuning is employed to enable context-aware sensitive data classification and improve detection accuracy. In addition, MapReduce-based distributed processing allows the framework to scale efficiently to large datasets, thus enhancing computational efficiency. Furthermore, a multi-dimensional cloud partitioning scheme provides secure and fine-grained storage isolation within hybrid cloud environments. Experimental results demonstrate that LLM-SPSS achieves an F1-score of 99.66% and yields a 6.3× speed-up over the non-distributed baseline, outperforming traditional rule-based (F1 68.27%), conventional machine learning (SVM F1 98.32%, Random Forest F1 95.79%), and other LLM-based approaches (DePrompt F1 95.95%) and effectively balancing high accuracy with computational efficiency. [ABSTRACT FROM AUTHOR]
Copyright of Electronics (2079-9292) is the property of MDPI and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Database: Complementary Index
Description
Abstract:With the growing adoption of hybrid cloud storage, the identification and protection of sensitive information within large-scale unstructured data has become increasingly challenging. Traditional rule-based and machine learning approaches have limitations in context-aware sensitive data classification and large-scale processing. In this work, a novel framework named LLM-SPSS, implementing a secure and confidential storage layout for distributed hybrid clouds through a fine-tuned XLM-R Base model and multi-dimensional data partitioning, is proposed. First, a fine-tuned XLM-R Base model with adaptive prompt tuning is employed to enable context-aware sensitive data classification and improve detection accuracy. In addition, MapReduce-based distributed processing allows the framework to scale efficiently to large datasets, thus enhancing computational efficiency. Furthermore, a multi-dimensional cloud partitioning scheme provides secure and fine-grained storage isolation within hybrid cloud environments. Experimental results demonstrate that LLM-SPSS achieves an F1-score of 99.66% and yields a 6.3× speed-up over the non-distributed baseline, outperforming traditional rule-based (F1 68.27%), conventional machine learning (SVM F1 98.32%, Random Forest F1 95.79%), and other LLM-based approaches (DePrompt F1 95.95%) and effectively balancing high accuracy with computational efficiency. [ABSTRACT FROM AUTHOR]
ISSN:20799292
DOI:10.3390/electronics15010030