‘Big data’, Hadoop and cloud computing in genomics

Bibliographic details
Published in: Journal of biomedical informatics Vol. 46; No. 5; pp. 774 - 781
Main authors: O’Driscoll, Aisling; Daugelaite, Jurate; Sleator, Roy D.
Format: Journal Article
Language: English
Published: United States, Elsevier Inc, 1 October 2013
ISSN: 1532-0464, 1532-0480
Online access: Full text
Abstract
• Ever-improving next-generation sequencing technologies have led to an unprecedented proliferation of sequence data.
• Biology is now one of the fastest growing fields of big data science.
• Cloud computing and big data technologies can be used to deal with biology’s big data sets.
• The Apache Hadoop project, which provides distributed and parallelised data processing, is presented.
• Challenges associated with cloud computing and big data technologies in biology are discussed.
Since the completion of the Human Genome Project at the turn of the century, there has been an unprecedented proliferation of genomic sequence data. A consequence of this is that the medical discoveries of the future will largely depend on our ability to process and analyse large genomic data sets, which continue to expand as the cost of sequencing decreases. Herein, we provide an overview of cloud computing and big data technologies, and discuss how such expertise can be used to deal with biology’s big data sets. In particular, big data technologies such as the Apache Hadoop project, which provides distributed and parallelised data processing and analysis of petabyte (PB) scale data sets, will be discussed, together with an overview of the current usage of Hadoop within the bioinformatics community.
Author Sleator, Roy D.
Daugelaite, Jurate
O’Driscoll, Aisling
Author_xml – sequence: 1
  givenname: Aisling
  surname: O’Driscoll
  fullname: O’Driscoll, Aisling
  organization: Department of Computing, Cork Institute of Technology, Rossa Avenue, Bishopstown, Cork, Ireland
– sequence: 2
  givenname: Jurate
  surname: Daugelaite
  fullname: Daugelaite, Jurate
  organization: Department of Biological Sciences, Cork Institute of Technology, Rossa Avenue, Bishopstown, Cork, Ireland
– sequence: 3
  givenname: Roy D.
  surname: Sleator
  fullname: Sleator, Roy D.
  email: roy.sleator@cit.ie
  organization: Department of Biological Sciences, Cork Institute of Technology, Rossa Avenue, Bishopstown, Cork, Ireland
BackLink https://www.ncbi.nlm.nih.gov/pubmed/23872175$$D View this record in MEDLINE/PubMed
CitedBy_id crossref_primary_10_1109_ACCESS_2020_3015016
crossref_primary_10_1177_1687814018814955
crossref_primary_10_1109_ACCESS_2022_3177278
crossref_primary_10_1002_wcms_1701
crossref_primary_10_1016_j_biotechadv_2024_108400
crossref_primary_10_1002_spy2_121
crossref_primary_10_1111_hir_12286
crossref_primary_10_1016_j_cie_2016_07_013
crossref_primary_10_1111_age_12655
crossref_primary_10_1007_s10916_016_0565_7
crossref_primary_10_1093_jamia_ocab032
crossref_primary_10_1371_journal_pone_0183413
crossref_primary_10_4018_IJITWE_2019070103
crossref_primary_10_1155_2018_3984061
crossref_primary_10_1007_s10723_017_9408_0
crossref_primary_10_1016_j_comnet_2018_08_005
crossref_primary_10_1016_j_future_2024_01_011
crossref_primary_10_1007_s10916_017_0832_2
crossref_primary_10_1016_j_jksuci_2017_07_001
crossref_primary_10_1016_j_procs_2016_05_544
crossref_primary_10_24190_ISSN2564_615X_2017_04_02
crossref_primary_10_1002_gch2_202300163
crossref_primary_10_1093_jamia_ocy111
crossref_primary_10_1080_0952813X_2021_1955980
crossref_primary_10_1155_2022_1265340
crossref_primary_10_1186_s40537_015_0016_1
crossref_primary_10_1155_2018_3598284
crossref_primary_10_1016_j_coisb_2017_07_004
crossref_primary_10_1186_s12859_015_0497_0
crossref_primary_10_1109_TCC_2014_2315797
crossref_primary_10_3389_fpls_2017_01461
crossref_primary_10_1016_j_future_2019_10_038
crossref_primary_10_3389_fnins_2021_591122
crossref_primary_10_1109_TCBB_2018_2816022
crossref_primary_10_1080_1206212X_2023_2301183
crossref_primary_10_1155_2015_639021
crossref_primary_10_1186_1752_0509_8_S2_I1
crossref_primary_10_1002_cpe_4499
crossref_primary_10_1007_s10916_018_0993_7
crossref_primary_10_1007_s11390_020_9801_1
crossref_primary_10_1016_j_procs_2022_03_101
crossref_primary_10_1016_j_cels_2019_11_002
crossref_primary_10_1145_3358211
crossref_primary_10_1097_AOG_0000000000001865
crossref_primary_10_1007_s13204_021_01984_4
crossref_primary_10_1016_j_giq_2018_11_004
crossref_primary_10_1371_journal_pcbi_1008645
crossref_primary_10_1007_s10916_018_1007_5
crossref_primary_10_1155_2014_712826
crossref_primary_10_1016_j_cmpb_2016_04_016
crossref_primary_10_1155_2016_3617572
crossref_primary_10_2217_pme_2018_0085
crossref_primary_10_1089_cmb_2017_0016
crossref_primary_10_1109_TCSS_2015_2514088
crossref_primary_10_1109_ACCESS_2017_2730843
crossref_primary_10_2478_dim_2018_0014
crossref_primary_10_1007_s11227_015_1501_1
crossref_primary_10_1111_pbi_12645
crossref_primary_10_1186_s12859_018_2019_3
crossref_primary_10_1007_s13204_021_02164_0
crossref_primary_10_1371_journal_pone_0236471
crossref_primary_10_1093_milmed_usx114
crossref_primary_10_1016_j_scs_2018_02_019
crossref_primary_10_1109_TBME_2016_2573285
crossref_primary_10_1016_j_tplants_2019_01_006
crossref_primary_10_1007_s10015_018_0437_y
crossref_primary_10_3390_ijerph192214641
crossref_primary_10_1016_j_ajhg_2019_09_027
crossref_primary_10_3390_cancers15143690
crossref_primary_10_1016_j_neucom_2017_01_126
crossref_primary_10_1089_big_2020_0383
crossref_primary_10_3390_ijms18020412
crossref_primary_10_1016_j_prevetmed_2015_05_012
crossref_primary_10_1080_01605682_2019_1630328
crossref_primary_10_1186_s12859_020_03757_2
crossref_primary_10_1016_j_jnca_2018_02_008
crossref_primary_10_1007_s10586_018_2860_1
crossref_primary_10_3389_fvets_2017_00194
crossref_primary_10_1007_s00521_020_04873_z
crossref_primary_10_1063_1_4946894
crossref_primary_10_2196_medinform_2913
crossref_primary_10_1186_s13326_017_0146_9
crossref_primary_10_1016_j_cose_2017_06_003
crossref_primary_10_1111_jocn_14164
crossref_primary_10_2478_dim_2018_00014
crossref_primary_10_1016_j_is_2014_07_006
crossref_primary_10_1016_j_tele_2015_12_005
crossref_primary_10_1186_s12864_018_4611_3
crossref_primary_10_1186_s12859_017_1723_8
crossref_primary_10_1080_17445760_2014_929685
crossref_primary_10_3389_fmed_2021_784455
crossref_primary_10_1016_j_parco_2016_10_003
crossref_primary_10_1371_journal_pone_0201483
crossref_primary_10_1088_1757_899X_563_3_032012
crossref_primary_10_1093_comjnl_bxaa192
crossref_primary_10_1038_nature15816
crossref_primary_10_1002_cpe_3628
crossref_primary_10_1109_RBME_2018_2829704
crossref_primary_10_2135_cropsci2014_03_0195
crossref_primary_10_1016_j_jep_2016_07_063
crossref_primary_10_1007_s11277_018_5334_0
crossref_primary_10_1186_s12859_022_04648_4
crossref_primary_10_1016_j_jbi_2017_05_012
crossref_primary_10_1016_j_ajo_2017_03_026
crossref_primary_10_1007_s11277_022_09535_y
crossref_primary_10_3389_fgene_2022_876869
crossref_primary_10_1093_gigascience_giac040
crossref_primary_10_1109_TCBB_2020_3000661
crossref_primary_10_1002_cpe_3974
crossref_primary_10_1016_j_ijmedinf_2016_11_006
crossref_primary_10_3390_technologies13070285
crossref_primary_10_3390_metabo12010014
crossref_primary_10_1007_s10723_018_9458_y
crossref_primary_10_1109_TBDATA_2016_2643683
crossref_primary_10_1016_j_future_2015_10_003
crossref_primary_10_1016_j_jbi_2014_01_005
crossref_primary_10_1080_1475939X_2017_1408490
crossref_primary_10_5582_bst_2014_01048
crossref_primary_10_3390_pr8080951
crossref_primary_10_1007_s10916_015_0344_x
crossref_primary_10_1007_s00530_020_00736_8
crossref_primary_10_1016_j_cels_2017_05_013
crossref_primary_10_1093_bioadv_vbaf168
crossref_primary_10_1016_j_jbi_2015_01_008
crossref_primary_10_14400_JDC_2014_12_9_201
crossref_primary_10_3233_JIFS_189264
crossref_primary_10_1007_s11227_025_07563_6
crossref_primary_10_1016_j_ijmedinf_2016_09_008
crossref_primary_10_1186_s13742_015_0045_x
crossref_primary_10_1002_cpe_4854
crossref_primary_10_1007_s11227_016_1677_z
crossref_primary_10_1016_j_future_2015_04_012
crossref_primary_10_2196_22214
crossref_primary_10_1016_j_jii_2019_04_005
crossref_primary_10_2217_pgs_2016_0152
crossref_primary_10_1007_s10916_017_0777_5
crossref_primary_10_1016_j_iac_2014_09_014
crossref_primary_10_1016_j_neucom_2016_11_077
crossref_primary_10_1109_ACCESS_2020_2965955
crossref_primary_10_1007_s00500_023_08797_3
crossref_primary_10_3233_JIFS_223295
crossref_primary_10_1016_j_matpr_2017_12_340
crossref_primary_10_1016_j_procs_2018_05_004
crossref_primary_10_1517_17460441_2014_872623
crossref_primary_10_1109_TCBB_2020_2967385
crossref_primary_10_1016_j_future_2017_11_010
crossref_primary_10_1016_j_jii_2020_100129
crossref_primary_10_4028_www_scientific_net_AMM_530_531_827
crossref_primary_10_1088_1742_6596_1544_1_012119
crossref_primary_10_1109_MCSE_2018_05329812
crossref_primary_10_1016_j_gpb_2016_01_005
crossref_primary_10_1007_s13369_023_08172_2
crossref_primary_10_1186_1471_2105_15_30
crossref_primary_10_1002_cpe_5814
crossref_primary_10_3390_ijgi6060166
Cites_doi 10.1038/scientificamerican0805-32
10.1186/1471-2105-13-200
10.1186/1471-2105-13-324
10.1126/science.331.6018.666
10.1109/ICPPW.2009.37
10.1002/0471250953.bi1503s39
10.1093/bioinformatics/btr325
10.1016/j.cell.2012.05.044
10.1186/1745-6150-7-43
10.1093/bioinformatics/btq644
10.1073/pnas.0506388102
10.1101/gr.4086505
10.1186/2047-2501-1-6
10.1038/msb.2012.47
10.1186/1751-0473-6-13
10.1186/1471-2105-12-139
10.1093/bioinformatics/btp236
10.1093/bioinformatics/bth361
10.1093/bioinformatics/bts165
10.1093/bioinformatics/bts061
10.1109/eScience.2008.62
10.1038/nrmicro2850
10.1186/1471-2105-11-S12-S1
10.1093/bioinformatics/bts054
10.1093/nar/gkf543
10.1101/gr.175701
10.1111/j.1472-765X.2008.02444.x
10.1186/1471-2105-13-42
10.1093/bioinformatics/bts023
10.1186/gb-2012-13-3-314
10.1162/152651603322874762
10.1101/gr.107524.110
10.1186/1471-2105-11-S1-S15
10.1126/science.1224311
10.4161/bioe.22367
10.1007/978-1-61779-424-7_2
10.1038/nature09304
10.3184/003685009X12605492662844
10.1093/bioinformatics/bts647
10.1038/nbt0710-691
10.1016/j.mehy.2009.08.047
10.1186/1471-2105-11-S12-S2
10.3163/1536-5050.95.4.454
10.1038/nrg2857
10.1186/1471-2164-13-341
10.1186/1471-2164-13-S7-S28
10.1186/1756-0500-4-171
10.1186/gb-2010-11-11-r116
10.1186/gb-2010-11-8-r83
10.1164/rccm.201203-0523ED
10.1186/gb-2009-10-11-r134
10.1093/bioinformatics/btr630
ContentType Journal Article
Copyright 2013 Elsevier Inc.
Copyright © 2013 Elsevier Inc. All rights reserved.
Copyright_xml – notice: 2013 Elsevier Inc.
– notice: Copyright © 2013 Elsevier Inc. All rights reserved.
DBID 6I.
AAFTH
AAYXX
CITATION
CGR
CUY
CVF
ECM
EIF
NPM
7X8
DOI 10.1016/j.jbi.2013.07.001
DatabaseName ScienceDirect Open Access Titles
Elsevier:ScienceDirect:Open Access
CrossRef
Medline
MEDLINE
MEDLINE (Ovid)
MEDLINE
MEDLINE
PubMed
MEDLINE - Academic
DatabaseTitle CrossRef
MEDLINE
Medline Complete
MEDLINE with Full Text
PubMed
MEDLINE (Ovid)
MEDLINE - Academic
DatabaseTitleList
MEDLINE
MEDLINE - Academic
Database_xml – sequence: 1
  dbid: NPM
  name: PubMed
  url: http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 2
  dbid: 7X8
  name: MEDLINE - Academic
  url: https://search.proquest.com/medline
  sourceTypes: Aggregation Database
DeliveryMethod fulltext_linktorsrc
Discipline Medicine
Engineering
Public Health
EISSN 1532-0480
EndPage 781
ExternalDocumentID 23872175
10_1016_j_jbi_2013_07_001
S1532046413001007
Genre Research Support, Non-U.S. Gov't
Journal Article
ID FETCH-LOGICAL-c353t-8bf3011d591a0827568b8bebaf7cae13fb0ce46aa54810dece54dba74f30778b3
ISICitedReferencesCount 257
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000324848600002&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN 1532-0464
1532-0480
IngestDate Thu Oct 02 05:58:16 EDT 2025
Mon Jul 21 06:05:40 EDT 2025
Tue Nov 18 21:53:07 EST 2025
Sat Nov 29 06:23:10 EST 2025
Fri Feb 23 02:33:45 EST 2024
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 5
Keywords Cloud computing
Big data
Bioinformatics
Genomics
Hadoop
Language English
License http://www.elsevier.com/open-access/userlicense/1.0
Copyright © 2013 Elsevier Inc. All rights reserved.
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-c353t-8bf3011d591a0827568b8bebaf7cae13fb0ce46aa54810dece54dba74f30778b3
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
OpenAccessLink https://dx.doi.org/10.1016/j.jbi.2013.07.001
PMID 23872175
PQID 1433272339
PQPubID 23479
PageCount 8
ParticipantIDs proquest_miscellaneous_1433272339
pubmed_primary_23872175
crossref_citationtrail_10_1016_j_jbi_2013_07_001
crossref_primary_10_1016_j_jbi_2013_07_001
elsevier_sciencedirect_doi_10_1016_j_jbi_2013_07_001
PublicationCentury 2000
PublicationDate October 2013
2013-10-00
2013-Oct
20131001
PublicationDateYYYYMMDD 2013-10-01
PublicationDate_xml – month: 10
  year: 2013
  text: October 2013
PublicationDecade 2010
PublicationPlace United States
PublicationPlace_xml – name: United States
PublicationTitle Journal of biomedical informatics
PublicationTitleAlternate J Biomed Inform
PublicationYear 2013
Publisher Elsevier Inc
Publisher_xml – name: Elsevier Inc
References Data Deluge and the Human Microbiome Project.
Feng, Grossman, Stein (b0465) 2011; 12
Moore (b0015) 1965; 38
Zou, Li, Jiang, Lin, Li, Chen (b0505) 2013
Colosimo, Peterson, Mardis, Hirschman (b0420) 2011; 6
Cloudera and Mount Sinai: The structure of a Big Data Revolution?
Bridging the gap between HPC and IaaS clouds.
Obama Administration Unveils “Big Data” Initiative: Announces $200 Million In New R&D Investments.
Schadt, Linderman, Sorenson, Lee, Nolan (b0260) 2010; 11
Liu, Wong, Wu, Luo, Yiu, Li (b0430) 2012; 28
Pireddu, Leo, Zanetti (b0385) 2011; 27
O’Connor, Merriman, Nelson (b0495) 2010; 11
Cooper, Khatib, Treuille, Barbero, Lee, Beenen (b0345) 2010; 466
Matthews, Williams (b0415) 2010; 11
NextBio, Intel to collaborate on improving Hadoop Stack for Genomic Data Analysis.
O’Driscoll, Sleator (b0530) 2013
Available at http://asperasoft.com/.
Creating HIPAA-Compliant Medical Data Applications With AWS.
Loman, Constantinidou, Chan, Halachev, Sergeant, Penn (b0030) 2012; 10
Jourdren, Bernard, Dillies, Le Crom (b0405) 2012; 28
Nguyen, Shi, Ruden (b0235) 2011; 4
As We May Communicate.
Oinn, Addis, Ferris, Marvin, Senger, Greenwood (b0165) 2004; 20
Hong, Rhie, Park, Lee, Ju, Kim (b0170) 2012; 28
Langmead, Hansen, Leek (b0515) 2010; 11
Kelley, Schatz, Salzberg (b0455) 2010; 11
Schatz (b0380) 2009; 25
Zhang, Gu, Liu, Wang, Azuaje (b0460) 2012; 28
Mathe, Sagot, Schiex, Rouze (b0040) 2002; 30
Davenport (b0090) 2012; 90
How “Cloud” Services Democratize DNA Sequencing.
Manyika, Chui, Brown, Bughin, Dobbs, Roxburgh (b0475) 2011
Taylor (b0190) 2010; 11
Langmead, Schatz, Lin, Pop, Salzberg (b0230) 2009; 10
Furusawa, Kaneko (b0355) 2012; 338
Vouzis, Sahinidis (b0425) 2011; 27
How Hadoop Makes Short Work of Big Data.
Managing and Analysing 1,000,000 Genomes.
Chae, Jung, Lee, Marru, Lee, Kim (b0085) 2013; 1
Mason, Elemento (b0050) 2012; 13
Karr, Sanghvi, Macklin, Gutschow, Jacobs, Bolival (b0370) 2012; 150
Stein (b0045) 2010; 11
Healthcare Cloud Computing (Clinical, EMR, SaaS, Private, Public, Hybrid) Market – Global Trends, Challenges, Opportunities & Forecasts (2012–2017).
Niemenmaa, Kallio, Schumacher, Klemela, Korpelainen, Heljanko (b0410) 2012; 28
Sleator (b0330) 2012; 815
Sleator (b0335) 2012; 3
Giardine, Riemer, Hardison, Burhans, Elnitski, Shah (b0155) 2005; 15
Sleator (b0325) 2010; 93
Social Media And The Big Data Explosion.
The Benefits Of Data Center Virtualization For Businesses.
What will happen to Amazon’s massive cloud business?
Blastreduce: high performance short read mapping with mapreduce.
Gantz J, Reinsel D. The Digital Universe in 2020: big data, bigger digital shadows, and biggest growth in the Far East. In: IDC iView: IDC Analyze the Future; 2012.
Robertson (b0525) 2003; 3
Shachak, Shuval, Fine (b0145) 2007; 95
Available at https://dnanexus.com/.
Hadoop Sorts a Petabyte in 16.25 Hours and a Terabyte in 62 Seconds.
Schadt (b0295) 2012; 8
Murray (b0350) 2012; 185
Schatz, Sommer, Kelley, Pop (b0535) 2010; vol. 10
Chang, Chen, Chen, Ho (b0400) 2012; 13
Walter (b0020) 2005; 293
Marianayagam, Fawzi, Head-Gordon (b0340) 2005; 102
Quail, Smith, Coupland, Otto, Harris, Connor (b0005) 2012; 13
Fusaro, Patil, Gafni, Wall, Tonellato (b0485) 2011
Klein (b0315) 2011; 39
Angiuoli, Matalka, Gussman, Galens, Vangala, Riley (b0490) 2011
Schatz, Langmead, Salzberg (b0225) 2010; 28
Krampis K, Booth T, Chapman B, Tiwari B, Bicak M, Field D, et al. Cloud BioLinux: pre-configured and on-demand bioinformatics computing for the genomics community. BMC Bioinform 2012;13:42.
Helping accelerate treatment for pediatric cancer with Dell cloud technology.
Schoenherr, Forer, Weissensteiner, Specht, Kronenberg, Kloss-Brandstaetter (b0520) 2012; 13
Lewis, Csordas, Killcoyne, Hermjakob, Hoopmann, Moritz (b0540) 2012; 13
Big Data Offers Big Opportunities for Retail, Financial, Web Companies.
Pennisi (b0275) 2011; 331
Yeh, Lim, Burge (b0100) 2001; 11
Sleator, Shortall, Hill (b0320) 2008; 47
Genomics Takes Flight….To the Cloud.
Sleator (b0375) 2012; 3
Leo S, Santoni F, Zanetti G. Biodoop: bioinformatics on hadoop. In: Parallel processing workshops, 2009. ICPPW ‘09. International Conference on; 2009. p. 415–22.
Cloudera Chief Scientist Jeff Hammerbacher Teams with Mount Sinai School of Medicine to Solve Medical Challenges Using Big Data.
Huang, Tata, Prill (b0450) 2013; 29
Managing data in the Cloud Age.
EMC Sitting In Sweet Spot Of $70 Billion Big Data Industry.
1,000 Genomes in the Cloud and NCBI Experiences.
Matsunaga A, Tsugawa M, and Fortes J. CloudBLAST: Combining MapReduce and Virtualization on Distributed Resources for Bioinformatics Applications. IEEE Fourth International Conference on eScience, Indiana, USA, 2008 222-229.
Gurtowski, Schatz, Langmead (b0510) 2012
Dai, Gao, Guo, Xiao, Zhang (b0480) 2012; 7
Dai, Gao, Guo, Xiao, Zhang (b0500) 2012; 7
McKenna (b0215) 2010; 20
Pollack (b0010) 2011
Big Data, Meet the Huge Data That Will Shape Your Future.
Davies (b0470) 2010
Sleator (b0360) 2010; 74
References_xml – volume: 25
  start-page: 1363
  year: 2009
  end-page: 1369
  ident: b0380
  article-title: CloudBurst: highly sensitive read mapping with MapReduce
  publication-title: Bioinformatics
– volume: 13
  start-page: S28
  year: 2012
  ident: b0400
  article-title: A de novo next generation genomic sequence assembler based on string graph and MapReduce cloud computing framework
  publication-title: BMC Genomics
– volume: 28
  start-page: 878
  year: 2012
  end-page: 879
  ident: b0430
  article-title: SOAP3: ultra-fast GPU-based parallel alignment tool for short reads
  publication-title: Bioinformatics
– volume: 28
  start-page: 691
  year: 2010
  end-page: 693
  ident: b0225
  article-title: Cloud computing and the DNA data race
  publication-title: Nat Biotechnol
– volume: 39
  start-page: 571
  year: 2011
  end-page: 578
  ident: b0315
  article-title: Cloudy confidentiality: clinical and legal implications of cloud computing in health care
  publication-title: J Am Acad Psychiatry Law
– start-page: 4
  year: 2013
  ident: b0530
  article-title: Synthetic DNA: the next generation of big data storage
  publication-title: Bioengineered
– reference: Managing data in the Cloud Age.
– volume: 3
  year: 2003
  ident: b0525
  article-title: The $1000 genome: ethical and legal issues in whole genome sequencing of individuals
  publication-title: Am J Bioeth
– reference: Managing and Analysing 1,000,000 Genomes.
– reference: Big Data Offers Big Opportunities for Retail, Financial, Web Companies.
– reference: Hadoop Sorts a Petabyte in 16.25 Hours and a Terabyte in 62 Seconds.
– volume: 13
  start-page: 200
  year: 2012
  ident: b0520
  article-title: Cloudgene: a graphical execution platform for MapReduce programs on private and public clouds
  publication-title: BMC Bioinform
– volume: 13
  year: 2012
  ident: b0540
  article-title: Hydra: a scalable proteomic search engine which utilizes the Hadoop distributed computing framework
  publication-title: BMC Bioinform
– year: 2012
  ident: b0510
  article-title: Genotyping in the cloud with Crossbow
  publication-title: Current Protocol Bioinform
– volume: 29
  start-page: 135
  year: 2013
  end-page: 136
  ident: b0450
  article-title: BlueSNP: R package for highly scalable genome-wide association studies using Hadoop clusters
  publication-title: Bioinformatics
– reference: 1,000 Genomes in the Cloud and NCBI Experiences.
– volume: vol. 10
  year: 2010
  ident: b0535
  article-title: De Novo assembly of large genomes with cloud computing
  publication-title: Biology of genomes
– reference: What will happen to Amazon’s massive cloud business?
– volume: 93
  start-page: 1
  year: 2010
  end-page: 6
  ident: b0325
  article-title: An overview of the processes shaping protein evolution
  publication-title: Sci Prog
– volume: 28
  start-page: 721
  year: 2012
  end-page: 723
  ident: b0170
  article-title: FX: an RNA-Seq analysis tool on the cloud
  publication-title: Bioinformatics
– volume: 38
  start-page: 4
  year: 1965
  end-page: 7
  ident: b0015
  article-title: Cramming more components into integrated circuits
  publication-title: Electronics
– volume: 47
  start-page: 361
  year: 2008
  end-page: 366
  ident: b0320
  article-title: Metagenomics
  publication-title: Lett Appl Microbiol
– reference: Matsunaga A, Tsugawa M, and Fortes J. CloudBLAST: Combining MapReduce and Virtualization on Distributed Resources for Bioinformatics Applications. IEEE Fourth International Conference on eScience, Indiana, USA, 2008 222-229.
– reference: Available at https://dnanexus.com/.
– volume: 95
  start-page: 454
  year: 2007
  end-page: 458
  ident: b0145
  article-title: Barriers and enablers to the acceptance of bioinformatics tools: a qualitative study
  publication-title: J Med Libr Assoc
– reference: Big Data, Meet the Huge Data That Will Shape Your Future.
– reference: Krampis K, Booth T, Chapman B, Tiwari B, Bicak M, Field D, et al. Cloud BioLinux: pre-configured and on-demand bioinformatics computing for the genomics community. BMC Bioinform 2012;13:42.
– volume: 28
  start-page: 1542
  year: 2012
  end-page: 1543
  ident: b0405
  article-title: Eoulsan: a cloud computing-based framework facilitating high throughput sequencing analyses
  publication-title: Bioinformatics
– volume: 11
  start-page: S1
  year: 2010
  ident: b0190
  article-title: An overview of the Hadoop/MapReduce/HBase framework and its current applications in bioinformatics
  publication-title: BMC Bioinform
– volume: 102
  start-page: 16684
  year: 2005
  end-page: 16689
  ident: b0340
  article-title: Protein folding by distributed computing and the denatured state ensemble
  publication-title: Proc Natl Acad Sci USA
– reference: Gantz J, Reinsel D. The Digital Universe in 2020: big data, bigger digital shadows, and biggest growth in the Far East. In: IDC iView: IDC Analyze the Future; 2012.
– reference: EMC Sitting In Sweet Spot Of $70 Billion Big Data Industry.
– volume: 10
  start-page: 599
  year: 2012
  end-page: 606
  ident: b0030
  article-title: High-throughput bacterial genome sequencing: an embarrassment of choice, a world of opportunity
  publication-title: Nat Rev Microbiol
– year: 2010
  ident: b0470
  article-title: The $1,000 genome: the revolution in DNA sequencing and the new era of personalized medicine
– volume: 11
  year: 2010
  ident: b0495
  article-title: SeqWare Query Engine: storing and searching sequence data in the cloud
  publication-title: BMC Bioinform
– volume: 11
  start-page: 647
  year: 2010
  end-page: 657
  ident: b0260
  article-title: Computational solutions to large-scale data management and analysis
  publication-title: Nat Rev Genet
– start-page: 12
  year: 2011
  ident: b0490
  article-title: CloVR: a virtual machine for automated and portable sequence analysis from the desktop using cloud computing
  publication-title: BMC Bioinform
– volume: 466
  start-page: 756
  year: 2010
  end-page: 760
  ident: b0345
  article-title: Predicting protein structures with a multiplayer online game
  publication-title: Nature
– volume: 28
  start-page: 876
  year: 2012
  end-page: 877
  ident: b0410
  article-title: Hadoop-BAM: directly manipulating next generation sequencing data in the cloud
  publication-title: Bioinformatics
– volume: 7
  start-page: 43
  year: 2012
  ident: b0480
  article-title: Bioinformatics clouds for big data manipulation
  publication-title: Biology Direct
– volume: 74
  start-page: 214
  year: 2010
  end-page: 215
  ident: b0360
  article-title: The human superorganism – of microbes and men
  publication-title: Med Hypotheses
– reference: Cloudera Chief Scientist Jeff Hammerbacher Teams with Mount Sinai School of Medicine to Solve Medical Challenges Using Big Data.
– volume: 27
  start-page: 2159
  year: 2011
  end-page: 2160
  ident: b0385
  article-title: SEAL: a distributed short read mapping and duplicate removal tool
  publication-title: Bioinformatics
– reference: Obama Administration Unveils “Big Data” Initiative: Announces $200 Million In New R&D Investments.
– volume: 150
  start-page: 389
  year: 2012
  end-page: 401
  ident: b0370
  article-title: A whole-cell computational model predicts phenotype from genotype
  publication-title: Cell
– volume: 11
  start-page: R116
  year: 2010
  ident: b0455
  article-title: Quake: quality-aware detection and correction of sequencing errors
  publication-title: Genome Biol
– year: 2011
  ident: b0485
  article-title: Biomedical cloud computing with Amazon Web Services
  publication-title: PLoS Comput Biol
– volume: 185
  start-page: 1251
  year: 2012
  end-page: 1252
  ident: b0350
  article-title: Personalized medicine: been there, done that, always needs work!
  publication-title: Am J Respir Crit Care Med
– volume: 338
  start-page: 215
  year: 2012
  end-page: 217
  ident: b0355
  article-title: A dynamical-systems view of stem cell biology
  publication-title: Science
– volume: 11
  start-page: 207
  year: 2010
  ident: b0045
  article-title: The case for cloud computing in genome informatics
  publication-title: Genome Biol
– reference: NextBio, Intel to collaborate on improving Hadoop Stack for Genomic Data Analysis.
– volume: 8
  start-page: 612
  year: 2012
  ident: b0295
  article-title: The changing privacy landscape in the era of big data
  publication-title: Mol Syst Biol
– reference: As We May Communicate.
– volume: 11
  start-page: 803
  year: 2001
  end-page: 816
  ident: b0100
  article-title: Computational inference of homologous gene structures in the human genome
  publication-title: Genome Res
– volume: 293
  start-page: 32
  year: 2005
  end-page: 33
  ident: b0020
  article-title: Kryder’s Law
  publication-title: Sci Am
– reference: Healthcare Cloud Computing (Clinical, EMR, SaaS, Private, Public, Hybrid) Market – Global Trends, Challenges, Opportunities & Forecasts (2012–2017).
– reference: Leo S, Santoni F, Zanetti G. Biodoop: bioinformatics on Hadoop. In: Parallel processing workshops, 2009. ICPPW ’09. International Conference on; 2009. p. 415–22.
– volume: 10
  start-page: R134
  year: 2009
  ident: b0230
  article-title: Searching for SNPs with cloud computing
  publication-title: Genome Biol
– reference: Data Deluge and the Human Microbiome Project.
– volume: 20
  start-page: 1297
  year: 2010
  end-page: 1303
  ident: b0215
  article-title: The genome analysis toolkit: a MapReduce framework for analysing next-generation DNA sequencing data
  publication-title: Genome Res
– volume: 4
  start-page: 171
  year: 2011
  ident: b0235
  article-title: CloudAligner: a fast and full-featured MapReduce based tool for sequence mapping
  publication-title: BMC Res Notes
– volume: 1
  start-page: 6
  year: 2013
  ident: b0085
  article-title: Bio and health informatics meets cloud: BioVLab as an example
  publication-title: Health Inform Sci Syst
– reference: Creating HIPAA-Compliant Medical Data Applications With AWS.
– volume: 815
  start-page: 15
  year: 2012
  end-page: 24
  ident: b0330
  article-title: Prediction of protein functions
  publication-title: Methods Mol Biol
– reference: Bridging the gap between HPC and IaaS clouds.
– volume: 331
  start-page: 666
  year: 2011
  end-page: 668
  ident: b0275
  article-title: Will computers crash genomics?
  publication-title: Science
– reference: How “Cloud” Services Democratize DNA Sequencing.
– volume: 13
  start-page: 314
  year: 2012
  ident: b0050
  article-title: Faster sequencers, larger datasets, new challenges
  publication-title: Genome Biol
– volume: 7
  year: 2012
  ident: b0500
  article-title: Bioinformatics clouds for big data manipulation
  publication-title: Biol Direct
– year: 2011
  ident: b0010
  article-title: DNA sequencing: caught in the deluge of data
– volume: 30
  start-page: 4103
  year: 2002
  end-page: 4117
  ident: b0040
  article-title: Current methods of gene prediction, their strengths and weaknesses
  publication-title: Nucl Acids Res
– volume: 28
  start-page: 294
  year: 2012
  end-page: 295
  ident: b0460
  article-title: Gene set analysis in the cloud
  publication-title: Bioinformatics
– volume: 3
  start-page: 80
  year: 2012
  end-page: 85
  ident: b0335
  article-title: Proteins: form and function
  publication-title: Bioeng Bugs
– volume: 20
  start-page: 3045
  year: 2004
  end-page: 3054
  ident: b0165
  article-title: Taverna: a tool for the composition and enactment of bioinformatics workflows
  publication-title: Bioinformatics
– reference: Social Media And The Big Data Explosion.
– year: 2013
  ident: b0505
  article-title: Survey of MapReduce frame operation in bioinformatics
  publication-title: Brief Bioinform
– reference: Genomics Takes Flight… To the Cloud.
– reference: How Hadoop Makes Short Work of Big Data.
– volume: 11
  year: 2010
  ident: b0515
  article-title: Cloud-scale RNA-sequencing differential expression analysis with Myrna
  publication-title: Genome Biol
– volume: 3
  start-page: 311
  year: 2012
  end-page: 312
  ident: b0375
  article-title: Digital biology: a new era has begun
  publication-title: Bioengineered
– volume: 90
  start-page: 128
  year: 2012
  ident: b0090
  article-title: Data scientist: the sexiest job of the 21st century
  publication-title: Harv Bus Rev
– reference: The Benefits Of Data Center Virtualization For Businesses.
– reference: Available at http://asperasoft.com/.
– volume: 15
  start-page: 1451
  year: 2005
  end-page: 1455
  ident: b0155
  article-title: Galaxy: a platform for interactive large-scale genome analysis
  publication-title: Genome Res
– reference: Helping accelerate treatment for pediatric cancer with Dell cloud technology.
– volume: 12
  start-page: 139
  year: 2011
  ident: b0465
  article-title: PeakRanger: a cloud-enabled peak caller for ChIP-seq data
  publication-title: BMC Bioinform
– year: 2011
  ident: b0475
  article-title: Big data: the next frontier for innovation, competition, and productivity
– reference: Cloudera and Mount Sinai: The structure of a Big Data Revolution?
– volume: 11
  start-page: S15
  year: 2010
  ident: b0415
  article-title: MrsRF: an efficient MapReduce algorithm for analyzing large collections of evolutionary trees
  publication-title: BMC Bioinform
– volume: 27
  start-page: 182
  year: 2011
  end-page: 188
  ident: b0425
  article-title: GPU-BLAST: using graphics processors to accelerate protein sequence alignment
  publication-title: Bioinformatics
– volume: 13
  start-page: 341
  year: 2012
  ident: b0005
  article-title: A tale of three next generation sequencing platforms: comparison of Ion Torrent, Pacific Biosciences and Illumina MiSeq sequencers
  publication-title: BMC Genomics
– reference: BlastReduce: high performance short read mapping with MapReduce.
– volume: 6
  start-page: 13
  year: 2011
  ident: b0420
  article-title: Nephele: genotyping via complete composition vectors and MapReduce
  publication-title: Source Code Biol Med
– ident: 10.1016/j.jbi.2013.07.001_b0120
– year: 2013
  ident: 10.1016/j.jbi.2013.07.001_b0505
  article-title: Survey of MapReduce frame operation in bioinformatics
  publication-title: Brief Bioinform
– volume: 293
  start-page: 32
  issue: August
  year: 2005
  ident: 10.1016/j.jbi.2013.07.001_b0020
  article-title: Kryder’s Law
  publication-title: Sci Am
  doi: 10.1038/scientificamerican0805-32
– volume: 13
  start-page: 200
  issue: 1
  year: 2012
  ident: 10.1016/j.jbi.2013.07.001_b0520
  article-title: Cloudgene: a graphical execution platform for MapReduce programs on private and public clouds
  publication-title: BMC Bioinform
  doi: 10.1186/1471-2105-13-200
– volume: 13
  year: 2012
  ident: 10.1016/j.jbi.2013.07.001_b0540
  article-title: Hydra: a scalable proteomic search engine which utilizes the Hadoop distributed computing framework
  publication-title: BMC Bioinform
  doi: 10.1186/1471-2105-13-324
– volume: 331
  start-page: 666
  year: 2011
  ident: 10.1016/j.jbi.2013.07.001_b0275
  article-title: Will computers crash genomics?
  publication-title: Science
  doi: 10.1126/science.331.6018.666
– ident: 10.1016/j.jbi.2013.07.001_b0195
– ident: 10.1016/j.jbi.2013.07.001_b0445
  doi: 10.1109/ICPPW.2009.37
– year: 2012
  ident: 10.1016/j.jbi.2013.07.001_b0510
  article-title: Genotyping in the cloud with Crossbow
  publication-title: Curr Protoc Bioinform
  doi: 10.1002/0471250953.bi1503s39
– volume: 27
  start-page: 2159
  year: 2011
  ident: 10.1016/j.jbi.2013.07.001_b0385
  article-title: SEAL: a distributed short read mapping and duplicate removal tool
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btr325
– volume: 150
  start-page: 389
  year: 2012
  ident: 10.1016/j.jbi.2013.07.001_b0370
  article-title: A whole-cell computational model predicts phenotype from genotype
  publication-title: Cell
  doi: 10.1016/j.cell.2012.05.044
– volume: 7
  year: 2012
  ident: 10.1016/j.jbi.2013.07.001_b0500
  article-title: Bioinformatics clouds for big data manipulation
  publication-title: Biol Direct
  doi: 10.1186/1745-6150-7-43
– ident: 10.1016/j.jbi.2013.07.001_b0105
– ident: 10.1016/j.jbi.2013.07.001_b0290
– volume: 27
  start-page: 182
  year: 2011
  ident: 10.1016/j.jbi.2013.07.001_b0425
  article-title: GPU-BLAST: using graphics processors to accelerate protein sequence alignment
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btq644
– ident: 10.1016/j.jbi.2013.07.001_b0095
– volume: 102
  start-page: 16684
  year: 2005
  ident: 10.1016/j.jbi.2013.07.001_b0340
  article-title: Protein folding by distributed computing and the denatured state ensemble
  publication-title: Proc Natl Acad Sci USA
  doi: 10.1073/pnas.0506388102
– volume: 15
  start-page: 1451
  year: 2005
  ident: 10.1016/j.jbi.2013.07.001_b0155
  article-title: Galaxy: a platform for interactive large-scale genome analysis
  publication-title: Genome Res
  doi: 10.1101/gr.4086505
– ident: 10.1016/j.jbi.2013.07.001_b0185
– volume: 1
  start-page: 6
  year: 2013
  ident: 10.1016/j.jbi.2013.07.001_b0085
  article-title: Bio and health informatics meets cloud: BioVLab as an example
  publication-title: Health Inform Sci Syst
  doi: 10.1186/2047-2501-1-6
– ident: 10.1016/j.jbi.2013.07.001_b0265
– volume: 10
  year: 2010
  ident: 10.1016/j.jbi.2013.07.001_b0535
  article-title: De Novo assembly of large genomes with cloud computing
– ident: 10.1016/j.jbi.2013.07.001_b0280
– volume: 8
  start-page: 612
  year: 2012
  ident: 10.1016/j.jbi.2013.07.001_b0295
  article-title: The changing privacy landscape in the era of big data
  publication-title: Mol Syst Biol
  doi: 10.1038/msb.2012.47
– volume: 6
  start-page: 13
  year: 2011
  ident: 10.1016/j.jbi.2013.07.001_b0420
  article-title: Nephele: genotyping via complete composition vectors and MapReduce
  publication-title: Source Code Biol Med
  doi: 10.1186/1751-0473-6-13
– volume: 12
  start-page: 139
  year: 2011
  ident: 10.1016/j.jbi.2013.07.001_b0465
  article-title: PeakRanger: a cloud-enabled peak caller for ChIP-seq data
  publication-title: BMC Bioinform
  doi: 10.1186/1471-2105-12-139
– ident: 10.1016/j.jbi.2013.07.001_b0060
– volume: 25
  start-page: 1363
  year: 2009
  ident: 10.1016/j.jbi.2013.07.001_b0380
  article-title: CloudBurst: highly sensitive read mapping with MapReduce
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btp236
– ident: 10.1016/j.jbi.2013.07.001_b0125
– volume: 20
  start-page: 3045
  year: 2004
  ident: 10.1016/j.jbi.2013.07.001_b0165
  article-title: Taverna: a tool for the composition and enactment of bioinformatics workflows
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/bth361
– ident: 10.1016/j.jbi.2013.07.001_b0255
– volume: 28
  start-page: 1542
  year: 2012
  ident: 10.1016/j.jbi.2013.07.001_b0405
  article-title: Eoulsan: a cloud computing-based framework facilitating high throughput sequencing analyses
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/bts165
– volume: 28
  start-page: 878
  year: 2012
  ident: 10.1016/j.jbi.2013.07.001_b0430
  article-title: SOAP3: ultra-fast GPU-based parallel alignment tool for short reads
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/bts061
– ident: 10.1016/j.jbi.2013.07.001_b0440
  doi: 10.1109/eScience.2008.62
– volume: 39
  start-page: 571
  year: 2011
  ident: 10.1016/j.jbi.2013.07.001_b0315
  article-title: Cloudy confidentiality: clinical and legal implications of cloud computing in health care
  publication-title: J Am Acad Psychiatry Law
– volume: 10
  start-page: 599
  year: 2012
  ident: 10.1016/j.jbi.2013.07.001_b0030
  article-title: High-throughput bacterial genome sequencing: an embarrassment of choice, a world of opportunity
  publication-title: Nat Rev Microbiol
  doi: 10.1038/nrmicro2850
– volume: 11
  start-page: S1
  issue: Suppl. 12
  year: 2010
  ident: 10.1016/j.jbi.2013.07.001_b0190
  article-title: An overview of the Hadoop/MapReduce/HBase framework and its current applications in bioinformatics
  publication-title: BMC Bioinform
  doi: 10.1186/1471-2105-11-S12-S1
– ident: 10.1016/j.jbi.2013.07.001_b0180
– volume: 28
  start-page: 876
  year: 2012
  ident: 10.1016/j.jbi.2013.07.001_b0410
  article-title: Hadoop-BAM: directly manipulating next generation sequencing data in the cloud
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/bts054
– volume: 30
  start-page: 4103
  year: 2002
  ident: 10.1016/j.jbi.2013.07.001_b0040
  article-title: Current methods of gene prediction, their strengths and weaknesses
  publication-title: Nucl Acids Res
  doi: 10.1093/nar/gkf543
– ident: 10.1016/j.jbi.2013.07.001_b0075
– volume: 11
  start-page: 803
  year: 2001
  ident: 10.1016/j.jbi.2013.07.001_b0100
  article-title: Computational inference of homologous gene structures in the human genome
  publication-title: Genome Res
  doi: 10.1101/gr.175701
– volume: 47
  start-page: 361
  year: 2008
  ident: 10.1016/j.jbi.2013.07.001_b0320
  article-title: Metagenomics
  publication-title: Lett Appl Microbiol
  doi: 10.1111/j.1472-765X.2008.02444.x
– ident: 10.1016/j.jbi.2013.07.001_b0150
  doi: 10.1186/1471-2105-13-42
– ident: 10.1016/j.jbi.2013.07.001_b0245
– volume: 28
  start-page: 721
  year: 2012
  ident: 10.1016/j.jbi.2013.07.001_b0170
  article-title: FX: an RNA-Seq analysis tool on the cloud
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/bts023
– volume: 13
  start-page: 314
  year: 2012
  ident: 10.1016/j.jbi.2013.07.001_b0050
  article-title: Faster sequencers, larger datasets, new challenges
  publication-title: Genome Biol
  doi: 10.1186/gb-2012-13-3-314
– volume: 90
  start-page: 128
  issue: 70–6
  year: 2012
  ident: 10.1016/j.jbi.2013.07.001_b0090
  article-title: Data scientist: the sexiest job of the 21st century
  publication-title: Harv Bus Rev
– volume: 3
  start-page: 80
  year: 2012
  ident: 10.1016/j.jbi.2013.07.001_b0335
  article-title: Proteins: form and function
  publication-title: Bioeng Bugs
– ident: 10.1016/j.jbi.2013.07.001_b0115
– ident: 10.1016/j.jbi.2013.07.001_b0250
– volume: 3
  year: 2003
  ident: 10.1016/j.jbi.2013.07.001_b0525
  article-title: The $1000 genome: ethical and legal issues in whole genome sequencing of individuals
  publication-title: Am J Bioeth
  doi: 10.1162/152651603322874762
– volume: 20
  start-page: 1297
  issue: July
  year: 2010
  ident: 10.1016/j.jbi.2013.07.001_b0215
  article-title: The genome analysis toolkit: a MapReduce framework for analysing next-generation DNA sequencing data
  publication-title: Genome Res
  doi: 10.1101/gr.107524.110
– year: 2010
  ident: 10.1016/j.jbi.2013.07.001_b0470
– ident: 10.1016/j.jbi.2013.07.001_b0065
– ident: 10.1016/j.jbi.2013.07.001_b0055
– ident: 10.1016/j.jbi.2013.07.001_b0080
– volume: 7
  start-page: 43
  year: 2012
  ident: 10.1016/j.jbi.2013.07.001_b0480
  article-title: Bioinformatics clouds for big data manipulation
  publication-title: Biology Direct
  doi: 10.1186/1745-6150-7-43
– ident: 10.1016/j.jbi.2013.07.001_b0305
– volume: 11
  start-page: S15
  issue: Suppl. 1
  year: 2010
  ident: 10.1016/j.jbi.2013.07.001_b0415
  article-title: MrsRF: an efficient MapReduce algorithm for analyzing large collections of evolutionary trees
  publication-title: BMC Bioinform
  doi: 10.1186/1471-2105-11-S1-S15
– start-page: 4
  year: 2013
  ident: 10.1016/j.jbi.2013.07.001_b0530
  article-title: Synthetic DNA: the next generation of big data storage
  publication-title: Bioengineered
– volume: 338
  start-page: 215
  year: 2012
  ident: 10.1016/j.jbi.2013.07.001_b0355
  article-title: A dynamical-systems view of stem cell biology
  publication-title: Science
  doi: 10.1126/science.1224311
– year: 2011
  ident: 10.1016/j.jbi.2013.07.001_b0475
– volume: 3
  start-page: 311
  year: 2012
  ident: 10.1016/j.jbi.2013.07.001_b0375
  article-title: Digital biology: a new era has begun
  publication-title: Bioengineered
  doi: 10.4161/bioe.22367
– volume: 815
  start-page: 15
  year: 2012
  ident: 10.1016/j.jbi.2013.07.001_b0330
  article-title: Prediction of protein functions
  publication-title: Methods Mol Biol
  doi: 10.1007/978-1-61779-424-7_2
– volume: 466
  start-page: 756
  year: 2010
  ident: 10.1016/j.jbi.2013.07.001_b0345
  article-title: Predicting protein structures with a multiplayer online game
  publication-title: Nature
  doi: 10.1038/nature09304
– year: 2011
  ident: 10.1016/j.jbi.2013.07.001_b0010
– ident: 10.1016/j.jbi.2013.07.001_b0070
– volume: 93
  start-page: 1
  year: 2010
  ident: 10.1016/j.jbi.2013.07.001_b0325
  article-title: An overview of the processes shaping protein evolution
  publication-title: Sci Prog
  doi: 10.3184/003685009X12605492662844
– volume: 29
  start-page: 135
  year: 2013
  ident: 10.1016/j.jbi.2013.07.001_b0450
  article-title: BlueSNP: R package for highly scalable genome-wide association studies using Hadoop clusters
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/bts647
– volume: 28
  start-page: 691
  year: 2010
  ident: 10.1016/j.jbi.2013.07.001_b0225
  article-title: Cloud computing and the DNA data race
  publication-title: Nat Biotechnol
  doi: 10.1038/nbt0710-691
– volume: 74
  start-page: 214
  year: 2010
  ident: 10.1016/j.jbi.2013.07.001_b0360
  article-title: The human superorganism – of microbes and men
  publication-title: Med Hypotheses
  doi: 10.1016/j.mehy.2009.08.047
– volume: 11
  year: 2010
  ident: 10.1016/j.jbi.2013.07.001_b0495
  article-title: SeqWare Query Engine: storing and searching sequence data in the cloud
  publication-title: BMC Bioinform
  doi: 10.1186/1471-2105-11-S12-S2
– ident: 10.1016/j.jbi.2013.07.001_b0135
– volume: 95
  start-page: 454
  year: 2007
  ident: 10.1016/j.jbi.2013.07.001_b0145
  article-title: Barriers and enablers to the acceptance of bioinformatics tools: a qualitative study
  publication-title: J Med Libr Assoc
  doi: 10.3163/1536-5050.95.4.454
– ident: 10.1016/j.jbi.2013.07.001_b0205
– volume: 11
  start-page: 647
  year: 2010
  ident: 10.1016/j.jbi.2013.07.001_b0260
  article-title: Computational solutions to large-scale data management and analysis
  publication-title: Nat Rev Genet
  doi: 10.1038/nrg2857
– volume: 38
  start-page: 4
  year: 1965
  ident: 10.1016/j.jbi.2013.07.001_b0015
  article-title: Cramming more components into integrated circuits
  publication-title: Electronics
– volume: 11
  start-page: 207
  issue: May
  year: 2010
  ident: 10.1016/j.jbi.2013.07.001_b0045
  article-title: The case for cloud computing in genome informatics
  publication-title: Genome Biol
– volume: 13
  start-page: 341
  year: 2012
  ident: 10.1016/j.jbi.2013.07.001_b0005
  article-title: A tale of three next generation sequencing platforms: comparison of Ion Torrent, Pacific Biosciences and Illumina MiSeq sequencers
  publication-title: BMC Genomics
  doi: 10.1186/1471-2164-13-341
– year: 2011
  ident: 10.1016/j.jbi.2013.07.001_b0485
  article-title: Biomedical cloud computing with Amazon Web Services
  publication-title: PLoS Comput Biol
– volume: 13
  start-page: S28
  issue: Suppl. 7
  year: 2012
  ident: 10.1016/j.jbi.2013.07.001_b0400
  article-title: A de novo next generation genomic sequence assembler based on string graph and MapReduce cloud computing framework
  publication-title: BMC Genomics
  doi: 10.1186/1471-2164-13-S7-S28
– volume: 4
  start-page: 171
  year: 2011
  ident: 10.1016/j.jbi.2013.07.001_b0235
  article-title: CloudAligner: a fast and full-featured MapReduce based tool for sequence mapping
  publication-title: BMC Res Notes
  doi: 10.1186/1756-0500-4-171
– volume: 11
  start-page: R116
  year: 2010
  ident: 10.1016/j.jbi.2013.07.001_b0455
  article-title: Quake: quality-aware detection and correction of sequencing errors
  publication-title: Genome Biol
  doi: 10.1186/gb-2010-11-11-r116
– volume: 11
  year: 2010
  ident: 10.1016/j.jbi.2013.07.001_b0515
  article-title: Cloud-scale RNA-sequencing differential expression analysis with Myrna
  publication-title: Genome Biol
  doi: 10.1186/gb-2010-11-8-r83
– ident: 10.1016/j.jbi.2013.07.001_b0025
– ident: 10.1016/j.jbi.2013.07.001_b0300
– volume: 185
  start-page: 1251
  year: 2012
  ident: 10.1016/j.jbi.2013.07.001_b0350
  article-title: Personalized medicine: been there, done that, always needs work!
  publication-title: Am J Respir Crit Care Med
  doi: 10.1164/rccm.201203-0523ED
– start-page: 12
  year: 2011
  ident: 10.1016/j.jbi.2013.07.001_b0490
  article-title: CloVR: a virtual machine for automated and portable sequence analysis from the desktop using cloud computing
  publication-title: BMC Bioinform
– ident: 10.1016/j.jbi.2013.07.001_b0390
– ident: 10.1016/j.jbi.2013.07.001_b0285
– volume: 10
  start-page: R134
  year: 2009
  ident: 10.1016/j.jbi.2013.07.001_b0230
  article-title: Searching for SNPs with cloud computing
  publication-title: Genome Biol
  doi: 10.1186/gb-2009-10-11-r134
– volume: 28
  start-page: 294
  year: 2012
  ident: 10.1016/j.jbi.2013.07.001_b0460
  article-title: Gene set analysis in the cloud
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btr630
SSID ssj0011556
Score 2.5419166
SecondaryResourceType review_article
SourceID proquest
pubmed
crossref
elsevier
SourceType Aggregation Database
Index Database
Enrichment Source
Publisher
StartPage 774
SubjectTerms Big data
Bioinformatics
Cloud computing
Genomics
Hadoop
Human Genome Project
Humans
Internet
Software
Title ‘Big data’, Hadoop and cloud computing in genomics
URI https://dx.doi.org/10.1016/j.jbi.2013.07.001
https://www.ncbi.nlm.nih.gov/pubmed/23872175
https://www.proquest.com/docview/1433272339
Volume 46
journalDatabaseRights – providerCode: PRVESC
  databaseName: Elsevier SD Freedom Collection Journals 2021
  customDbUrl:
  eissn: 1532-0480
  dateEnd: 20210131
  omitProxy: false
  ssIdentifier: ssj0011556
  issn: 1532-0464
  databaseCode: AIEXJ
  dateStart: 20010201
  isFulltext: true
  titleUrlDefault: https://www.sciencedirect.com
  providerName: Elsevier