Enhanced Graph Transforming V2 Algorithm for Non-Simple Graph in Big Data Pre-Processing


Detailed description

Bibliographic details
Published in: IEEE Transactions on Knowledge and Data Engineering, Vol. 32, No. 1, pp. 67-77
Main authors: Sutedi, Sutedi; Setiawan, Noor Akhmad; Adji, Teguh Bharata
Format: Journal Article
Language: English
Published: New York, IEEE, 01.01.2020
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
ISSN: 1041-4347, 1558-2191
Online access: Full text
Abstract The inability of relational databases to handle large-scale data triggered the development of NoSQL databases, which have become part of the big data ecosystem. NoSQL databases have different characteristics from relational databases. However, NoSQL databases still require data from relational databases as one of their structured data sources. Data pre-processing is therefore required to ensure proper data migration from a relational database to a NoSQL database. This pre-processing is normally called data transformation. One simple and understandable transformation algorithm is the graph transforming algorithm. However, that algorithm has difficulty handling a non-simple graph (multigraph). This research proposes an algorithm that overcomes several multigraph problems. The experimental work confirms that the proposed algorithm is able to transform data from a relational database into a NoSQL schema that has a minimum number of redundant attributes while data completeness is maintained.
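To make the term "non-simple graph" concrete, the sketch below treats tables as vertices and foreign keys as edges, then reports loops (a table referencing itself) and multiple edges (two or more foreign keys between the same pair of tables). It is only an illustration under stated assumptions: the table names, the `FOREIGN_KEYS` list, and the functions `classify_edges` and `is_simple` are hypothetical and are not taken from the article, and the code does not reproduce the authors' transformation algorithm.

```python
from collections import Counter

# Hypothetical foreign keys: (referencing_table, referenced_table, column).
# These names are illustrative assumptions, not data from the article.
FOREIGN_KEYS = [
    ("orders", "customers", "customer_id"),
    ("shipments", "addresses", "origin_id"),
    ("shipments", "addresses", "destination_id"),
    ("employees", "employees", "manager_id"),
]


def classify_edges(foreign_keys):
    """Return (loops, multi_edges) found in the foreign-key graph."""
    # A loop is an edge whose two endpoints are the same table.
    loops = sorted({src for src, dst, _ in foreign_keys if src == dst})
    # Count the edges between each unordered pair of distinct tables.
    pair_counts = Counter(
        frozenset((src, dst)) for src, dst, _ in foreign_keys if src != dst
    )
    # A pair joined by more than one edge makes the graph a multigraph.
    multi_edges = sorted(
        tuple(sorted(pair)) for pair, count in pair_counts.items() if count > 1
    )
    return loops, multi_edges


def is_simple(foreign_keys):
    """A graph is simple when it has neither loops nor multiple edges."""
    loops, multi_edges = classify_edges(foreign_keys)
    return not loops and not multi_edges


loops, multi_edges = classify_edges(FOREIGN_KEYS)
print(loops, multi_edges, is_simple(FOREIGN_KEYS))
```

With the sample data, `employees` forms a loop and the pair `addresses`/`shipments` carries two edges, so the schema graph is not simple. A schema like this is the kind of input the abstract describes as problematic for the basic graph transforming algorithm.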
Author Adji, Teguh Bharata
Sutedi, Sutedi
Setiawan, Noor Akhmad
Author_xml – sequence: 1
  givenname: Sutedi
  orcidid: 0000-0002-6341-1689
  surname: Sutedi
  fullname: Sutedi, Sutedi
  email: sutedi.s3te15@mail.ugm.ac.id
  organization: Department of Electrical and Information Engineering, Universitas Gadjah Mada, Yogyakarta, Indonesia
– sequence: 2
  givenname: Noor Akhmad
  orcidid: 0000-0002-5631-1073
  surname: Setiawan
  fullname: Setiawan, Noor Akhmad
  email: noorwewe@ugm.ac.id
  organization: Department of Electrical and Information Engineering, Universitas Gadjah Mada, Yogyakarta, Indonesia
– sequence: 3
  givenname: Teguh Bharata
  orcidid: 0000-0001-7856-1498
  surname: Adji
  fullname: Adji, Teguh Bharata
  email: adji@ugm.ac.id
  organization: Department of Electrical and Information Engineering, Universitas Gadjah Mada, Yogyakarta, Indonesia
CODEN ITKEEH
ContentType Journal Article
Copyright Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2020
Copyright_xml – notice: Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2020
DOI 10.1109/TKDE.2018.2880971
Discipline Engineering
Computer Science
EISSN 1558-2191
EndPage 77
ExternalDocumentID 10_1109_TKDE_2018_2880971
8532303
Genre orig-research
ISSN 1041-4347
IsPeerReviewed true
IsScholarly true
Issue 1
Language English
License https://ieeexplore.ieee.org/Xplorehelp/downloads/license-information/IEEE.html
https://doi.org/10.15223/policy-029
https://doi.org/10.15223/policy-037
ORCID 0000-0001-7856-1498
0000-0002-5631-1073
0000-0002-6341-1689
PageCount 11
PublicationCentury 2000
PublicationDate 2020-Jan.-1
2020-1-1
20200101
PublicationDateYYYYMMDD 2020-01-01
PublicationDate_xml – month: 01
  year: 2020
  text: 2020-Jan.-1
  day: 01
PublicationDecade 2020
PublicationPlace New York
PublicationPlace_xml – name: New York
PublicationTitle IEEE transactions on knowledge and data engineering
PublicationTitleAbbrev TKDE
PublicationYear 2020
Publisher IEEE
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Publisher_xml – name: IEEE
– name: The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
StartPage 67
SubjectTerms Algorithms
Big Data
Data management
Ecosystems
graph
loop
multigraph
multiple edges
NoSQL databases
Object oriented programming
Redundancy
Relational data bases
Relational databases
Software upgrading
SQL to NoSQL
Transformations
Transforms
Title Enhanced Graph Transforming V2 Algorithm for Non-Simple Graph in Big Data Pre-Processing
URI https://ieeexplore.ieee.org/document/8532303
https://www.proquest.com/docview/2325184360
Volume 32