Search results - Web Data Extraction and Crawling Techniques
-
1
A New Framework for Domain-Specific Hidden Web Crawling Based on Data Extraction Techniques
ISBN: 0780397703, 9780780397705
ISSN: 2329-6364
Published: IEEE, 01.12.2006
Published in: 2006 ITI 4th International Conference on Information & Communications Technology (01.12.2006)
“… % of the content on the Web, this portion of Web called Hidden Web (HW), they are "Hidden" in databases behind search interfaces …”
Full text
Conference proceeding -
2
Scrimmo: A Real-Time Web Scraper Monitoring the Belgian Real Estate Market
Published: IEEE, 26.10.2023
Published in: 2023 IEEE International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT) (26.10.2023)
“… Web scraping (or Web crawling), a technique for automated data extraction from websites, has emerged as a valuable tool for scientific research and data analysis …”
Full text
Conference proceeding -
3
Enabling maps/location searches on mobile devices: constructing a POI database via focused crawling and information extraction
ISSN: 1365-8816, 1362-3087, 1365-8824
Published: Abingdon: Taylor & Francis, 02.07.2016
Published in: International journal of geographical information science : IJGIS (02.07.2016)
“… However, manual annotation is costly and limited in current POI search services. With the abundance of information on the Web, many store POIs can be extracted from the Web …”
Full text
Journal Article -
4
Swarm-intelligence-based extraction and manifold crawling along the Large-Scale Structure
ISSN: 0035-8711, 1365-2966
Published: London: Oxford University Press, 01.04.2023
Published in: Monthly notices of the Royal Astronomical Society (01.04.2023)
“… ) on N-body cosmological simulation data of the Cosmic Web. The 1-DREAM toolbox consists of five Machine Learning methods, whose aim is the extraction and modelling …”
Full text
Journal Article -
5
Web Scraping or Web Crawling: State of Art, Techniques, Approaches and Application
ISSN: 2710-1274, 2074-8523
Published: 30.12.2021
Published in: International journal of advances in soft computing and its applications (30.12.2021)
“… Web scraping or web crawling refers to the procedure of automatic extraction of data from websites using software …”
Full text
Journal Article -
6
Developing an automated framework for eco-label information categorization using web crawling and Natural Language Processing techniques
ISSN: 0957-4174
Published: Elsevier Ltd, 05.07.2025
Published in: Expert systems with applications (05.07.2025)
“… This study explores the application of web crawling techniques, Natural Language Processing (NLP …”
Full text
Journal Article -
7
Keyword weight optimization using gradient strategies in event focused web crawling
ISSN: 0167-8655, 1872-7344
Published: Amsterdam: Elsevier B.V, 01.02.2021
Published in: Pattern recognition letters (01.02.2021)
“… • A web crawling system for obtaining the set of web data regarding key events is essential …”
Full text
Journal Article -
8
An ontology-driven multimedia focused crawler based on linked open data and deep learning techniques
ISSN: 1380-7501, 1573-7721
Published: New York: Springer US, 01.03.2020
Published in: Multimedia tools and applications (01.03.2020)
“… In this article we propose a novel approach to focused crawling based on the use of both textual and multimedia web page content …”
Full text
Journal Article -
9
Web crawling based context aware recommender system using optimized deep recurrent neural network
ISSN: 2196-1115
Published: Cham: Springer International Publishing, 20.11.2021
Published in: Journal of big data (20.11.2021)
“… Majorly, content and collaborative filtering techniques are employed in typical recommendation systems to find user preferences and provide final recommendations …”
Full text
Journal Article -
10
Deep Web crawling: a survey
ISSN: 1386-145X, 1573-1413
Published: New York: Springer US, 01.07.2019
Published in: World wide web (Bussum) (01.07.2019)
“… Deep Web crawling refers to the problem of traversing the collection of pages in a deep Web site, which are dynamically generated in response to a particular query that is submitted using a search form …”
Full text
Journal Article -
11
Collecting data on textiles from the internet using web crawling and web scraping tools
ISSN: 0379-0738, 1872-6283
Published: Ireland: Elsevier B.V, 01.05.2021
Published in: Forensic science international (01.05.2021)
“… It has become more affordable for researchers who can now devote most of their time to extracting meaningful information from the structured data …”
Full text
Journal Article -
12
Bot crawler to retrieve data from Facebook based on the selection of posts and the extraction of user profiles
ISSN: 0122-6517, 2382-4700
Published: 20.09.2022
Published in: Inge Cuc (20.09.2022)
“… — The main objective of this work is to development of a Bot Crawler, which allows extracting information from Facebook without access restrictions, or request for credentials, based on web crawling and scraping techniques …”
Full text
Journal Article -
13
Piecing together the puzzle: Improving event content coverage for real-time sub-event detection using adaptive microblog crawling
ISSN: 1932-6203
Published: United States: Public Library of Science, 06.11.2017
Published in: PloS one (06.11.2017)
“… Existing Twitter event monitoring systems for sub-event detection and summarization currently typically analyse events based on partial data as conventional data collection methodologies are unable …”
Full text
Journal Article -
14
Aplikasi Deteksi Motif dan Crawling Produk Batik Banyuwangi Berbasis Web
ISSN: 2301-7988, 2581-0588
Published: LPPM ISB Atma Luhur, 29.12.2022
Published in: Jurnal Sisfokom (29.12.2022)
“… Therefore, in this study, an application was developed that can detect via web devices and also extract information related to Batik Banyuwangi products in the marketplace …”
Full text
Journal Article -
15
xCrawl: a high-recall crawling method for Web mining
ISSN: 0219-1377, 0219-3116
Published: London: Springer-Verlag, 01.11.2010
Published in: Knowledge and information systems (01.11.2010)
“… Web mining systems exploit the redundancy of data published on the Web to automatically extract information from existing Web documents …”
Full text
Journal Article -
16
Towards extracting event-centric collections from Web archives
ISSN: 1432-5012, 1432-1300
Published: Berlin/Heidelberg: Springer Berlin Heidelberg, 01.03.2020
Published in: International journal on digital libraries (01.03.2020)
“… Web archives constitute an increasingly important source of information for computer scientists, humanities researchers and journalists interested in studying past events …”
Full text
Journal Article -
17
A Textual Content Analysis Model for Aligning Job Market Demands and University Curricula through Data Mining Techniques
ISSN: 1865-7923
Published: 02.08.2024
Published in: International journal of interactive mobile technologies (02.08.2024)
“… Specifically, the integration of data mining techniques is employed for the automated extraction of relevant information from both labor market demands and university curricula …”
Full text
Journal Article -
18
SiSOB data extraction and codification: A tool to analyze scientific careers
ISSN: 0048-7333, 1873-7625
Published: Amsterdam: Elsevier B.V, 01.11.2015
Published in: Research policy (01.11.2015)
“… • The software provides data crawling and data mining techniques used to transform webpage-based information and CV information into a relational database …”
Full text
Journal Article -
19
NLP-based techniques for Cyber Threat Intelligence
ISSN: 1574-0137
Published: Elsevier Inc, 01.11.2025
Published in: Computer science review (01.11.2025)
“… In the digital era, threat actors employ sophisticated techniques for which, often, digital traces in the form of textual data are available …”
Full text
Journal Article -
20
Predicting customer profitability during acquisition: Finding the optimal combination of data source and data mining technique
ISSN: 0957-4174, 1873-6793
Published: Amsterdam: Elsevier Ltd, 01.05.2013
Published in: Expert systems with applications (01.05.2013)
“… ► Commercially-available data is augmented by web data. ► Combining both web data and commercial data leads to the best predictive results for lead qualification …”
Full text
Journal Article

