The System Architecture for the Basic Information of Science and Technology Experts Based on Distributed Storage and Web Mining

In order to build an efficient basic information system of science and technology experts based on Web mining, a novel system architecture for application is proposed in this paper. The proposed system architecture integrates spider module, local distributed storage and Mongo-DB. The basic experts i...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:2012 International Conference on Computer Science and Service System s. 527 - 530
Hlavní autoři: Quanyin Zhu, Pei Zhou
Médium: Konferenční příspěvek
Jazyk:angličtina
Vydáno: IEEE 01.08.2012
Témata:
ISBN:9781467307215, 1467307211
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Abstract In order to build an efficient basic information system of science and technology experts based on Web mining, a novel system architecture for application is proposed in this paper. The proposed system architecture integrates spider module, local distributed storage and Mongo-DB. The basic experts information of science and technology appeared in the Websites are synthesized as two format and using two strategies to deal with it respectively. The normalized texts which extracted from Web page by URLs are suggested. The extracted results include the name, sex, birth, hometown and professional title of science and technology experts respectively. The data stream flow, the information management model for the users and science and technology experts, the target website URLs and URLs management model, and data processing module are introduced in detailed. The synchronization of multiple databases and replica sets architecture for sharing cluster architecture is proposed in application system. Experiments show that the application system obtains a very high efficiency. The results show as by proposed system architecture can satisfy the application requirements for the customer.
AbstractList In order to build an efficient basic information system of science and technology experts based on Web mining, a novel system architecture for application is proposed in this paper. The proposed system architecture integrates spider module, local distributed storage and Mongo-DB. The basic experts information of science and technology appeared in the Websites are synthesized as two format and using two strategies to deal with it respectively. The normalized texts which extracted from Web page by URLs are suggested. The extracted results include the name, sex, birth, hometown and professional title of science and technology experts respectively. The data stream flow, the information management model for the users and science and technology experts, the target website URLs and URLs management model, and data processing module are introduced in detailed. The synchronization of multiple databases and replica sets architecture for sharing cluster architecture is proposed in application system. Experiments show that the application system obtains a very high efficiency. The results show as by proposed system architecture can satisfy the application requirements for the customer.
Author Quanyin Zhu
Pei Zhou
Author_xml – sequence: 1
  surname: Quanyin Zhu
  fullname: Quanyin Zhu
  email: hyitzqy@126.com
  organization: Fac. of Comput. Eng., Huaiyin Inst. of Technol., Huaiyin, China
– sequence: 2
  surname: Pei Zhou
  fullname: Pei Zhou
  email: 14752305928@126.com
  organization: Fac. of Comput. Eng., Huaiyin Inst. of Technol., Huaiyin, China
BookMark eNotjEtPAjEYRWvUREWWrtz0D4BtZ-hjiYhKgnExk7gkfXyFGuiQtiSy8q87iqubk3vPvUEXsYuA0B0lY0qJepg1TTNmhLIxreQZGiohieBqUguqyPkf05qLighGJ1domPMnIaRXuZDsGn23G8DNMRfY4Wmym1DAlkMC7LuES9896hwsXsSed7qELuLO48YGiBawjg63YDex23brI55_7SGV_OuAw_30KeSSgjmUHpvSJb0-OR9g8FuIIa5v0aXX2wzD_xyg9nnezl5Hy_eXxWy6HAVFygiMsqZWxlrlXE0sZZ5K4rhQ1lIugRlhhALpiTbUEe9cXyoJwhvGQPJqgO5PtwEAVvsUdjodV7xSdSV49QO782N4
CODEN IEEPAD
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/CSSS.2012.138
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 9780769547190
0769547192
EndPage 530
ExternalDocumentID 6394376
Genre orig-research
GroupedDBID 6IE
6IF
6IK
6IL
6IN
AAJGR
AAWTH
ADFMO
ALMA_UNASSIGNED_HOLDINGS
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
IEGSK
IERZE
OCL
RIE
RIL
ID FETCH-LOGICAL-i90t-eb9cb49bcc9dd40c12f180d679cc168e2b7b79e8f0ab1d0fdd0d698e7fb22e863
IEDL.DBID RIE
ISBN 9781467307215
1467307211
IngestDate Wed Aug 27 03:43:44 EDT 2025
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i90t-eb9cb49bcc9dd40c12f180d679cc168e2b7b79e8f0ab1d0fdd0d698e7fb22e863
PageCount 4
ParticipantIDs ieee_primary_6394376
PublicationCentury 2000
PublicationDate 2012-Aug.
PublicationDateYYYYMMDD 2012-08-01
PublicationDate_xml – month: 08
  year: 2012
  text: 2012-Aug.
PublicationDecade 2010
PublicationTitle 2012 International Conference on Computer Science and Service System
PublicationTitleAbbrev csss
PublicationYear 2012
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssj0001106782
Score 1.5039984
Snippet In order to build an efficient basic information system of science and technology experts based on Web mining, a novel system architecture for application is...
SourceID ieee
SourceType Publisher
StartPage 527
SubjectTerms Computer architecture
Data processing
distributed storage
Educational institutions
Information management
Information retrieval
Mongo-DB
science and technology experts
sharing cluster architecture
spider module
systen architecture
Web mining
Title The System Architecture for the Basic Information of Science and Technology Experts Based on Distributed Storage and Web Mining
URI https://ieeexplore.ieee.org/document/6394376
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV09T8MwELXaioEJUIv41g2MpE2MG8cjFCoWqkqpRLcq9jlSlwS1KSt_nbMTmg4sbHFylqK7OHf23XvH2D3K2Aipo4AbbQKRYRxoChSCcUb-L1dIPlP4ZhNyNkuWSzXvsIc9FsZa64vP7NBd-lw-lmbnjspG5E0FLYgu60oZ11it9jzFcaEl3GO3Yvps3c7ml9KpGY9bjs3RJE1TV9jFh5GDphx0VvGOZXryv1c6ZYMWoQfzve85Yx1b9Nk32RxqCnJ4OkgQAAWmQIEePGdkE2gQSM4iUObQrG7ICoT2oB08B3K1dXMsAom-OIpd1x2Lhint1OlH5Od8WA3vvs3EgC2mr4vJW9A0WAjWKqwCq5XRQmljFKIITcTzKAkxlsqYKE4s11JLZZM8zHSEYY5ID1ViZa45t0n8eM56RVnYCwaYi1ALVIaEBElnmUsLc9I8Khwrc8n6Tnerz5pCY9Wo7erv29fs2JmmrrO7Yb1qs7O37Mh8Vevt5s7b_QeizK4j
linkProvider IEEE
linkToHtml http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3NT8IwFH9BNNGTGjB-24NHB1st23pUlGAEQjISuZG1r0u4DAPDq_-6r2UyDl68retrsrzX7r2-j98DuMco1CJSgce10p5IMfQUGQpeJyX9l0kknSlcs4loNIqnUzmuwcO2FsYY45LPTMs-ulg-LvTausrapE0FHYg92Leds8pqrcqjYtHQYu6qt0LauPZu8wvqVI47Fcpmu5skiU3t4q3AFqfs9FZxqqV3_L-POoFmVaPHxlvtcwo1kzfgm6TONiDk7GknRMDINGVk6rHnlKTCyhokKxO2yFh5vlmaI6tc7cyhIBcru8YgI9IXC7Jr-2PRMKG7Ov2K3JoPo9jQNZpowqT3Oun2vbLFgjeXfuEZJbUSUmktEYWvA54FsY9hJLUOwthwFalImjjzUxWgnyHSpIxNlCnOTRw-nkE9X-TmHBhmwlcCpSYiQdRpagPDnDiPEjtSX0DD8m72uQHRmJVsu_z79R0c9ifDwWzwNnq_giMrpk3W3TXUi-Xa3MCB_irmq-Wt2wM_4lyxbA
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2012+International+Conference+on+Computer+Science+and+Service+System&rft.atitle=The+System+Architecture+for+the+Basic+Information+of+Science+and+Technology+Experts+Based+on+Distributed+Storage+and+Web+Mining&rft.au=Quanyin+Zhu&rft.au=Pei+Zhou&rft.date=2012-08-01&rft.pub=IEEE&rft.isbn=9781467307215&rft.spage=527&rft.epage=530&rft_id=info:doi/10.1109%2FCSSS.2012.138&rft.externalDocID=6394376
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781467307215/lc.gif&client=summon&freeimage=true
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781467307215/mc.gif&client=summon&freeimage=true
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781467307215/sc.gif&client=summon&freeimage=true