Concurrent Processing of Retail Data in Python to Optimize Runtime

Saved in:
Bibliographic Details
Title: Concurrent Processing of Retail Data in Python to Optimize Runtime
Authors: Slavin, Bobby
Source: Data Science Undergraduate Honors Theses
Publisher Information: ScholarWorks@UARK
Publication Year: 2024
Collection: University of Arkansas: ScholarWorks@UARK
Subject Terms: Concurrent Processing, Multiprocessing, Multithreading, Data Science
Description: This thesis explores the application of multiprocessing and multithreading techniques in Python to optimize runtime efficiency on the analysis of retail data. As the retail data processed by a program increases, so does the runtime of the program. If you are performing this processing using only a single core, even a gigabyte of data can potentially take upwards to half an hour to finish processing, while larger datasets of 100 GB or more could take days, heavily limiting the amount of retail data that can be processed in a reasonable amount of time. By employing multithreading and multiprocessing architectures in Python, a programming language commonly used in data analysis, this study attempts to evaluate their efficacy and feasibility in reducing the runtime of retail data processing to more manageable levels. The results of this study show the importance of utilizing concurrent computing paradigms to address the computational challenges posed by the ever-expanding volumes of retail data.
Document Type: text
File Description: application/pdf
Language: unknown
Relation: https://scholarworks.uark.edu/dtscuht/12; https://scholarworks.uark.edu/context/dtscuht/article/1012/viewcontent/Bobby_Slavin_Undergraduate_Thesis___Practicum_Thesis_on_Concurrent_Processing_of_Retail_Data_to_Optimize_Runtime.pdf
Availability: https://scholarworks.uark.edu/dtscuht/12
https://scholarworks.uark.edu/context/dtscuht/article/1012/viewcontent/Bobby_Slavin_Undergraduate_Thesis___Practicum_Thesis_on_Concurrent_Processing_of_Retail_Data_to_Optimize_Runtime.pdf
Accession Number: edsbas.A4E3DA7
Database: BASE
FullText Text:
  Availability: 0
Header DbId: edsbas
DbLabel: BASE
An: edsbas.A4E3DA7
RelevancyScore: 892
AccessLevel: 3
PubType: Academic Journal
PubTypeId: academicJournal
PreciseRelevancyScore: 892.306396484375
IllustrationInfo
Items – Name: Title
  Label: Title
  Group: Ti
  Data: Concurrent Processing of Retail Data in Python to Optimize Runtime
– Name: Author
  Label: Authors
  Group: Au
  Data: <searchLink fieldCode="AR" term="%22Slavin%2C+Bobby%22">Slavin, Bobby</searchLink>
– Name: TitleSource
  Label: Source
  Group: Src
  Data: Data Science Undergraduate Honors Theses
– Name: Publisher
  Label: Publisher Information
  Group: PubInfo
  Data: ScholarWorks@UARK
– Name: DatePubCY
  Label: Publication Year
  Group: Date
  Data: 2024
– Name: Subset
  Label: Collection
  Group: HoldingsInfo
  Data: University of Arkansas: ScholarWorks@UARK
– Name: Subject
  Label: Subject Terms
  Group: Su
  Data: <searchLink fieldCode="DE" term="%22Concurrent+Processing%22">Concurrent Processing</searchLink><br /><searchLink fieldCode="DE" term="%22Multiprocessing%22">Multiprocessing</searchLink><br /><searchLink fieldCode="DE" term="%22Multithreading%22">Multithreading</searchLink><br /><searchLink fieldCode="DE" term="%22Data+Science%22">Data Science</searchLink>
– Name: Abstract
  Label: Description
  Group: Ab
  Data: This thesis explores the application of multiprocessing and multithreading techniques in Python to optimize runtime efficiency on the analysis of retail data. As the retail data processed by a program increases, so does the runtime of the program. If you are performing this processing using only a single core, even a gigabyte of data can potentially take upwards to half an hour to finish processing, while larger datasets of 100 GB or more could take days, heavily limiting the amount of retail data that can be processed in a reasonable amount of time. By employing multithreading and multiprocessing architectures in Python, a programming language commonly used in data analysis, this study attempts to evaluate their efficacy and feasibility in reducing the runtime of retail data processing to more manageable levels. The results of this study show the importance of utilizing concurrent computing paradigms to address the computational challenges posed by the ever-expanding volumes of retail data.
– Name: TypeDocument
  Label: Document Type
  Group: TypDoc
  Data: text
– Name: Format
  Label: File Description
  Group: SrcInfo
  Data: application/pdf
– Name: Language
  Label: Language
  Group: Lang
  Data: unknown
– Name: NoteTitleSource
  Label: Relation
  Group: SrcInfo
  Data: https://scholarworks.uark.edu/dtscuht/12; https://scholarworks.uark.edu/context/dtscuht/article/1012/viewcontent/Bobby_Slavin_Undergraduate_Thesis___Practicum_Thesis_on_Concurrent_Processing_of_Retail_Data_to_Optimize_Runtime.pdf
– Name: URL
  Label: Availability
  Group: URL
  Data: https://scholarworks.uark.edu/dtscuht/12<br />https://scholarworks.uark.edu/context/dtscuht/article/1012/viewcontent/Bobby_Slavin_Undergraduate_Thesis___Practicum_Thesis_on_Concurrent_Processing_of_Retail_Data_to_Optimize_Runtime.pdf
– Name: AN
  Label: Accession Number
  Group: ID
  Data: edsbas.A4E3DA7
PLink https://erproxy.cvtisr.sk/sfx/access?url=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=edsbas&AN=edsbas.A4E3DA7
RecordInfo BibRecord:
  BibEntity:
    Languages:
      – Text: unknown
    Subjects:
      – SubjectFull: Concurrent Processing
        Type: general
      – SubjectFull: Multiprocessing
        Type: general
      – SubjectFull: Multithreading
        Type: general
      – SubjectFull: Data Science
        Type: general
    Titles:
      – TitleFull: Concurrent Processing of Retail Data in Python to Optimize Runtime
        Type: main
  BibRelationships:
    HasContributorRelationships:
      – PersonEntity:
          Name:
            NameFull: Slavin, Bobby
    IsPartOfRelationships:
      – BibEntity:
          Dates:
            – D: 01
              M: 01
              Type: published
              Y: 2024
          Identifiers:
            – Type: issn-locals
              Value: edsbas
          Titles:
            – TitleFull: Data Science Undergraduate Honors Theses
              Type: main
ResultId 1