Demonstration of Hadoop-GIS: A Spatial Data Warehousing System Over MapReduce

The proliferation of GPS-enabled devices and the rapid improvement of scientific instruments have resulted in massive amounts of spatial data in the last decade. Support for high-performance spatial queries on large volumes of data has become increasingly important in numerous fields, which requires a...

Bibliographic details
Published in: Proceedings of the ... ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems : ACM GIS. ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, Vol. 2013; p. 528
Main authors: Aji, Ablimit; Sun, Xiling; Vo, Hoang; Liu, Qioaling; Lee, Rubao; Zhang, Xiaodong; Saltz, Joel; Wang, Fusheng
Format: Journal Article
Language: English
Published: United States, 2013-11-01
Abstract The proliferation of GPS-enabled devices and the rapid improvement of scientific instruments have resulted in massive amounts of spatial data in the last decade. Support for high-performance spatial queries on large volumes of data has become increasingly important in numerous fields. This requires a scalable and efficient spatial data warehousing solution, as existing approaches exhibit scalability limitations and efficiency bottlenecks for large-scale spatial applications. In this demonstration, we present Hadoop-GIS, a scalable and high-performance spatial query system over MapReduce. Hadoop-GIS provides an efficient spatial query engine to process spatial queries, data- and space-based partitioning, and query pipelines that parallelize queries implicitly on MapReduce. Hadoop-GIS also provides an expressive, SQL-like spatial query language for workload specification. We will demonstrate how spatial queries are expressed in spatially extended SQL and submitted through a command-line or web interface for execution. Alongside the system demonstration, we explain the system architecture and how queries are translated to MapReduce operators, optimized, and executed on Hadoop. In addition, we will showcase how the system supports two representative real-world use cases: large-scale pathology analytical imaging and geospatial data warehousing.
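Illustration (not part of the catalogued record): the abstract mentions space-based partitioning and queries that are parallelized over MapReduce. The sketch below shows that general idea for a rectangle-intersection join in plain Python, without Hadoop. It is a minimal sketch under stated assumptions, not the Hadoop-GIS implementation: the fixed-grid tiling, the tuple layout (id, xmin, ymin, xmax, ymax), and the function names tiles_for, intersects, and spatial_join are all hypothetical choices made for this example.

```python
from collections import defaultdict
from itertools import product

# Hypothetical record layout for this sketch: (id, xmin, ymin, xmax, ymax)


def tiles_for(rect, tile_size):
    """Return every grid tile (col, row) that the rectangle overlaps."""
    _, xmin, ymin, xmax, ymax = rect
    cols = range(int(xmin // tile_size), int(xmax // tile_size) + 1)
    rows = range(int(ymin // tile_size), int(ymax // tile_size) + 1)
    return product(cols, rows)


def intersects(a, b):
    """True if the two axis-aligned rectangles share at least one point."""
    return a[1] <= b[3] and b[1] <= a[3] and a[2] <= b[4] and b[2] <= a[4]


def spatial_join(left, right, tile_size=10.0):
    """Join two rectangle sets by intersection, one grid tile at a time."""
    # "Map" step: emit each rectangle once per tile it overlaps.
    buckets = defaultdict(lambda: ([], []))
    for rect in left:
        for tile in tiles_for(rect, tile_size):
            buckets[tile][0].append(rect)
    for rect in right:
        for tile in tiles_for(rect, tile_size):
            buckets[tile][1].append(rect)

    # "Reduce" step: each tile is joined independently, so tiles could be
    # processed in parallel. The set drops duplicate pairs produced by
    # rectangles that span more than one tile.
    pairs = set()
    for lefts, rights in buckets.values():
        for a in lefts:
            for b in rights:
                if intersects(a, b):
                    pairs.add((a[0], b[0]))
    return sorted(pairs)


if __name__ == "__main__":
    left = [("a", 0, 0, 5, 5), ("b", 8, 8, 12, 12)]
    right = [("x", 4, 4, 9, 9), ("y", 20, 20, 21, 21)]
    print(spatial_join(left, right))
```

Rectangles that cross a tile boundary are replicated into every tile they touch, which is why the sketch deduplicates the result pairs at the end.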
Author_xml – sequence: 1
  givenname: Ablimit
  surname: Aji
  fullname: Aji, Ablimit
  organization: MathCS and BMI, Emory University
– sequence: 2
  givenname: Xiling
  surname: Sun
  fullname: Sun, Xiling
  organization: EECS, Northwestern University
– sequence: 3
  givenname: Hoang
  surname: Vo
  fullname: Vo, Hoang
  organization: MathCS and BMI, Emory University
– sequence: 4
  givenname: Qioaling
  surname: Liu
  fullname: Liu, Qioaling
  organization: MathCS and BMI, Emory University
– sequence: 5
  givenname: Rubao
  surname: Lee
  fullname: Lee, Rubao
  organization: CSE, Ohio State University
– sequence: 6
  givenname: Xiaodong
  surname: Zhang
  fullname: Zhang, Xiaodong
  organization: CSE, Ohio State University
– sequence: 7
  givenname: Joel
  surname: Saltz
  fullname: Saltz, Joel
  organization: MathCS and BMI, Emory University
– sequence: 8
  givenname: Fusheng
  surname: Wang
  fullname: Wang, Fusheng
  organization: MathCS and BMI, Emory University
BackLink https://www.ncbi.nlm.nih.gov/pubmed/27617325 (View this record in MEDLINE/PubMed)
ContentType Journal Article
DOI 10.1145/2525314.2525320
Database_xml – sequence: 1
  dbid: NPM
  name: PubMed
  url: http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 2
  dbid: 7X8
  name: MEDLINE - Academic
  url: https://search.proquest.com/medline
  sourceTypes: Aggregation Database
GrantInformation_xml – fundername: NIBIB NIH HHS
  grantid: P20 EB000591
– fundername: NLM NIH HHS
  grantid: R01 LM011119
– fundername: NLM NIH HHS
  grantid: R01 LM009239
– fundername: NCI NIH HHS
  grantid: U54 CA113001
– fundername: CCR NIH HHS
  grantid: HHSN261200800001C
– fundername: NCI NIH HHS
  grantid: HHSN261200800001E
IsPeerReviewed false
IsScholarly false
Keywords Scientific Data Management
Hive
Analytical Imaging
Data Warehouse
Database
MapReduce
Spatial Query Processing
Language English
PMID 27617325
PublicationDate 20131101
PublicationDateYYYYMMDD 2013-11-01
PublicationDate_xml – month: 11
  year: 2013
  text: 20131101
  day: 1
PublicationDecade 2010
PublicationPlace United States
PublicationPlace_xml – name: United States
PublicationTitle Proceedings of the ... ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems : ACM GIS. ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems
PublicationTitleAlternate Proc ACM SIGSPATIAL Int Conf Adv Inf
PublicationYear 2013
StartPage 528
Title Demonstration of Hadoop-GIS: A Spatial Data Warehousing System Over MapReduce
URI https://www.ncbi.nlm.nih.gov/pubmed/27617325
https://www.proquest.com/docview/1835504584
Volume 2013