‘Big data’, Hadoop and cloud computing in genomics
| Published in: | Journal of biomedical informatics, Vol. 46, No. 5, pp. 774-781 |
|---|---|
| Main authors: | O’Driscoll, Aisling; Daugelaite, Jurate; Sleator, Roy D. |
| Format: | Journal Article |
| Language: | English |
| Published: | United States: Elsevier Inc, 1 October 2013 |
| Keywords: | Cloud computing; Big data; Bioinformatics; Genomics; Hadoop |
| ISSN: | 1532-0464, 1532-0480 |
| Online access: | Full text |
| Abstract | • Ever-improving next-generation sequencing technologies have led to an unprecedented proliferation of sequence data.
• Biology is now one of the fastest growing fields of big data science.
• Cloud computing and big data technologies can be used to deal with biology’s big data sets.
• The Apache Hadoop project, which provides distributed and parallelised data processing, is presented.
• Challenges associated with cloud computing and big data technologies in biology are discussed.
Since the completion of the Human Genome Project at the turn of the century, there has been an unprecedented proliferation of genomic sequence data. As a consequence, the medical discoveries of the future will largely depend on our ability to process and analyse large genomic data sets, which continue to expand as the cost of sequencing decreases. Herein, we provide an overview of cloud computing and big data technologies, and discuss how such expertise can be used to deal with biology’s big data sets. In particular, big data technologies such as the Apache Hadoop project, which provides distributed and parallelised data processing and analysis of petabyte (PB) scale data sets, will be discussed, together with an overview of the current usage of Hadoop within the bioinformatics community. |
|---|---|
| Author | Sleator, Roy D.; Daugelaite, Jurate; O’Driscoll, Aisling |
| Author_xml | – sequence: 1 givenname: Aisling surname: O’Driscoll fullname: O’Driscoll, Aisling organization: Department of Computing, Cork Institute of Technology, Rossa Avenue, Bishopstown, Cork, Ireland – sequence: 2 givenname: Jurate surname: Daugelaite fullname: Daugelaite, Jurate organization: Department of Biological Sciences, Cork Institute of Technology, Rossa Avenue, Bishopstown, Cork, Ireland – sequence: 3 givenname: Roy D. surname: Sleator fullname: Sleator, Roy D. email: roy.sleator@cit.ie organization: Department of Biological Sciences, Cork Institute of Technology, Rossa Avenue, Bishopstown, Cork, Ireland |
| BackLink | https://www.ncbi.nlm.nih.gov/pubmed/23872175 (View this record in MEDLINE/PubMed) |
| ContentType | Journal Article |
| Copyright | Copyright © 2013 Elsevier Inc. All rights reserved. |
| DBID | 6I. AAFTH AAYXX CITATION CGR CUY CVF ECM EIF NPM 7X8 |
| DOI | 10.1016/j.jbi.2013.07.001 |
| DatabaseName | ScienceDirect Open Access Titles Elsevier:ScienceDirect:Open Access CrossRef Medline MEDLINE MEDLINE (Ovid) MEDLINE MEDLINE PubMed MEDLINE - Academic |
| DatabaseTitle | CrossRef MEDLINE Medline Complete MEDLINE with Full Text PubMed MEDLINE (Ovid) MEDLINE - Academic |
| DatabaseTitleList | MEDLINE MEDLINE - Academic |
| Database_xml | – sequence: 1 dbid: NPM name: PubMed url: http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed sourceTypes: Index Database – sequence: 2 dbid: 7X8 name: MEDLINE - Academic url: https://search.proquest.com/medline sourceTypes: Aggregation Database |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Medicine; Engineering; Public Health |
| EISSN | 1532-0480 |
| EndPage | 781 |
| ExternalDocumentID | 23872175 10_1016_j_jbi_2013_07_001 S1532046413001007 |
| Genre | Research Support, Non-U.S. Gov't; Journal Article |
| ID | FETCH-LOGICAL-c353t-8bf3011d591a0827568b8bebaf7cae13fb0ce46aa54810dece54dba74f30778b3 |
| ISICitedReferencesCount | 257 |
| ISICitedReferencesURI | http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000324848600002&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| ISSN | 1532-0464 1532-0480 |
| IngestDate | Thu Oct 02 05:58:16 EDT 2025 Mon Jul 21 06:05:40 EDT 2025 Tue Nov 18 21:53:07 EST 2025 Sat Nov 29 06:23:10 EST 2025 Fri Feb 23 02:33:45 EST 2024 |
| IsDoiOpenAccess | true |
| IsOpenAccess | true |
| IsPeerReviewed | true |
| IsScholarly | true |
| Issue | 5 |
| Keywords | Cloud computing; Big data; Bioinformatics; Genomics; Hadoop |
| Language | English |
| License | http://www.elsevier.com/open-access/userlicense/1.0 Copyright © 2013 Elsevier Inc. All rights reserved. |
| LinkModel | OpenURL |
| MergedId | FETCHMERGED-LOGICAL-c353t-8bf3011d591a0827568b8bebaf7cae13fb0ce46aa54810dece54dba74f30778b3 |
| OpenAccessLink | https://dx.doi.org/10.1016/j.jbi.2013.07.001 |
| PMID | 23872175 |
| PQID | 1433272339 |
| PQPubID | 23479 |
| PageCount | 8 |
| ParticipantIDs | proquest_miscellaneous_1433272339 pubmed_primary_23872175 crossref_citationtrail_10_1016_j_jbi_2013_07_001 crossref_primary_10_1016_j_jbi_2013_07_001 elsevier_sciencedirect_doi_10_1016_j_jbi_2013_07_001 |
| PublicationCentury | 2000 |
| PublicationDate | October 2013 2013-10-00 2013-Oct 20131001 |
| PublicationDateYYYYMMDD | 2013-10-01 |
| PublicationDate_xml | – month: 10 year: 2013 text: October 2013 |
| PublicationDecade | 2010 |
| PublicationPlace | United States |
| PublicationPlace_xml | – name: United States |
| PublicationTitle | Journal of biomedical informatics |
| PublicationTitleAlternate | J Biomed Inform |
| PublicationYear | 2013 |
| Publisher | Elsevier Inc |
| Publisher_xml | – name: Elsevier Inc |
| References_xml | – volume: 25 start-page: 1363 year: 2009 end-page: 1369 ident: b0380 article-title: CloudBurst: highly sensitive read mapping with MapReduce publication-title: Bioinformatics – volume: 13 start-page: S28 year: 2012 ident: b0400 article-title: A de novo next generation genomic sequence assembler based on string graph and MapReduce cloud computing framework publication-title: BMC Genomics – volume: 28 start-page: 878 year: 2012 end-page: 879 ident: b0430 article-title: SOAP3: ultra-fast GPU-based parallel alignment tool for short reads publication-title: Bioinformatics – volume: 28 start-page: 691 year: 2010 end-page: 693 ident: b0225 article-title: Cloud computing and the DNA data race publication-title: Nat Biotechnol – volume: 39 start-page: 571 year: 2011 end-page: 578 ident: b0315 article-title: Cloudy confidentiality: clinical and legal implications of cloud computing in health care publication-title: J Am Acad Psychiatry Law – start-page: 4 year: 2013 ident: b0530 article-title: Synthetic DNA: the next generation of big data storage publication-title: Bioengineered – reference: Managing data in the Cloud Age. < – volume: 3 year: 2003 ident: b0525 article-title: The $1000 genome: ethical and legal issues in whole genome sequencing of individuals publication-title: Am J Bioeth – reference: Managing and Analysing 1,000,000 Genomes. < – reference: Big Data Offers Big Opportunities for Retail, Financial, Web Companies. < – reference: Hadoop Sorts a Petabyte in 16.25 Hours and a Terabyte in 62 Seconds. 
|
| SecondaryResourceType | review_article |
| StartPage | 774 |
| SubjectTerms | Big data; Bioinformatics; Cloud computing; Genomics; Hadoop; Human Genome Project; Humans; Internet; Software |
| Title | ‘Big data’, Hadoop and cloud computing in genomics |
| URI | https://dx.doi.org/10.1016/j.jbi.2013.07.001 https://www.ncbi.nlm.nih.gov/pubmed/23872175 https://www.proquest.com/docview/1433272339 |
| Volume | 46 |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=%27Big+data%27%2C+Hadoop+and+cloud+computing+in+genomics&rft.jtitle=Journal+of+biomedical+informatics&rft.au=O%27Driscoll%2C+Aisling&rft.au=Daugelaite%2C+Jurate&rft.au=Sleator%2C+Roy+D&rft.date=2013-10-01&rft.eissn=1532-0480&rft.volume=46&rft.issue=5&rft.spage=774&rft_id=info:doi/10.1016%2Fj.jbi.2013.07.001&rft_id=info%3Apmid%2F23872175&rft.externalDocID=23872175 |