Evaluating Grasp-based cloud dimensioning for comparative genomics: A practical approach

Bibliographic details
Published in: Proceedings / IEEE International Conference on Cluster Computing, pp. 371-379
Main authors: Coutinho, Rafaelli; Drummond, Lucia; Frota, Yuri; de Oliveira, Daniel; Ocana, Kary
Format: Conference paper
Language: English
Publication details: IEEE, 1 September 2014
ISSN: 1552-5244
Abstract Cloud computing establishes a new computing model in which a wide range of computing resources is provided to several types of users. For bioinformatics experiments modeled as scientific workflows in particular, clouds provide resources such as virtual machines (VMs), storage, databases and computing power that can be combined to support scientific workflow execution. These workflows usually require high-performance environments and parallelism techniques, since their activities are data- and compute-intensive and can execute for a long time. Some Scientific Workflow Management Systems (SWfMS) already manage the parallel execution of scientific workflows in clouds. Most of them instantiate a virtual cluster for the execution. However, they rely on the user to estimate the number of VMs to instantiate to create this virtual cluster. Estimating this number is a crucial task to avoid the negative impact of under- or overestimation on workflow performance. This dimensioning is also not a trivial task in clouds, due to the large number of VM types a cloud provider offers. A previously proposed approach named GraspCC already provides a near-optimal estimate of the number of VMs for general applications, but not for scientific workflows. In this paper, we coupled GraspCC to the SciCumulus engine (a cloud-based parallel engine for scientific workflows) to estimate the necessary number of VMs for bioinformatics workflows. We evaluated GraspCC by comparing its estimates with real executions of a set of large-scale comparative genomics workflows. The results showed the suitability of GraspCC for estimating the number of VMs in real bioinformatics cloud workflows.
Author Ocana, Kary
de Oliveira, Daniel
Drummond, Lucia
Coutinho, Rafaelli
Frota, Yuri
Author_xml – sequence: 1
  givenname: Rafaelli
  surname: Coutinho
  fullname: Coutinho, Rafaelli
  email: rcoutinho@ic.uff.br
  organization: IC/Fluminense Fed. Univ., Niteroi, Brazil
– sequence: 2
  givenname: Lucia
  surname: Drummond
  fullname: Drummond, Lucia
  email: lucia@ic.uff.br
  organization: IC/Fluminense Fed. Univ., Niteroi, Brazil
– sequence: 3
  givenname: Yuri
  surname: Frota
  fullname: Frota, Yuri
  email: yuri@ic.uff.br
  organization: IC/Fluminense Fed. Univ., Niteroi, Brazil
– sequence: 4
  givenname: Daniel
  surname: de Oliveira
  fullname: de Oliveira, Daniel
  email: danielcmo@ic.uff.br
  organization: IC/Fluminense Fed. Univ., Niteroi, Brazil
– sequence: 5
  givenname: Kary
  surname: Ocana
  fullname: Ocana, Kary
  email: kary@cos.ufrj.br
  organization: COPPE, UFRJ, Rio de Janeiro, Brazil
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/CLUSTER.2014.6968789
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
EISBN 1479955485
9781479955480
EndPage 379
ExternalDocumentID 6968789
Genre orig-research
GroupedDBID 29O
6IE
6IF
6IH
6IK
6IL
6IN
AAJGR
AAWTH
ABLEC
ADZIZ
ALMA_UNASSIGNED_HOLDINGS
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
CHZPO
IEGSK
IPLJI
M43
OCL
RIE
RIL
RNS
ID FETCH-LOGICAL-i221t-a82e4109804e40a1b68f5a4cee32a73699a583ba8688e5359ec8b879784936d23
IEDL.DBID RIE
ISICitedReferencesCount 4
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000411853200054&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN 1552-5244
IngestDate Wed Aug 27 04:46:18 EDT 2025
IsDoiOpenAccess false
IsOpenAccess true
IsPeerReviewed false
IsScholarly true
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i221t-a82e4109804e40a1b68f5a4cee32a73699a583ba8688e5359ec8b879784936d23
OpenAccessLink https://doi.org/10.1109/cluster.2014.6968789
PageCount 9
ParticipantIDs ieee_primary_6968789
PublicationCentury 2000
PublicationDate 2014-Sept.
PublicationDateYYYYMMDD 2014-09-01
PublicationDate_xml – month: 09
  year: 2014
  text: 2014-Sept.
PublicationDecade 2010
PublicationTitle Proceedings / IEEE International Conference on Cluster Computing
PublicationTitleAbbrev CLUSTER
PublicationYear 2014
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssib026764550
ssj0037306
Score 1.9428407
SourceID ieee
SourceType Publisher
StartPage 371
SubjectTerms Bioinformatics
Bioinformatics Workflows
Cloud Computing
Computational modeling
Drugs
Estimation
Genomics
Hidden Markov models
Phylogeny
Virtual Machine Allocation
Title Evaluating Grasp-based cloud dimensioning for comparative genomics: A practical approach
URI https://ieeexplore.ieee.org/document/6968789
WOSCitedRecordID wos000411853200054&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint