Výsledky vyhledávání - "Web Data Extraction and Crawling Techniques"
-
1
Autoři: a další
Zdroj: Proceedings of the 2024 International Conference on Multimedia Retrieval. :1275-1281
Témata: FOS: Computer and information sciences, Technology, Science & Technology, 4. Education, out-of-context media, re-contextualized media, Handwriting Recognition and Text Detection, 16. Peace & justice, Computer science, 7. Clean energy, Computer Science, Artificial Intelligence, 3. Good health, Web Data Extraction and Crawling Techniques, Computer Science, Theory & Methods, Computer Science, Physical Sciences, Image Forgery Detection, news, cheapfakes detection, Computer Vision and Pattern Recognition, Digital Image Forgery Detection and Identification, misinformation, 10. No inequality, Information Systems
Popis souboru: application/pdf
-
2
Autoři: a další
Zdroj: IEEE Access, Vol 12, Pp 69456-69474 (2024)
Témata: object storage system (OSS), FOS: Computer and information sciences, Artificial intelligence, Computer Networks and Communications, Mathematical analysis, Web Data Extraction and Crawling Techniques, Content Distribution, Storage Systems, FOS: Mathematics, Information retrieval, Content (measure theory), Distributed Storage, content-based image retrieval (CBIR), Content-Centric Networking for Information Delivery, distributed systems, Computer data storage, Web Crawling, 9. Industry and infrastructure, Content-based searching (CoBS), deep learning, OpenStack Swift, 15. Life on land, Computer science, TK1-9971, Object storage, World Wide Web, Operating system, Distributed Storage Systems and Network Coding, Computer Science, Physical Sciences, Information Retrieval, Object (grammar), Electrical engineering. Electronics. Nuclear engineering, Mathematics, Information Systems
Přístupová URL adresa: https://doaj.org/article/bb11274ed4234753af83667c3a4f73b7
-
3
Autoři: a další
Zdroj: e_Buah Biblioteca Digital Universidad de Alcalá
Consejo Superior de Investigaciones Científicas (CSIC)
IEEE Access, Vol 12, Pp 77181-77213 (2024)Témata: MDD, FOS: Computer and information sciences, WCAG, Web Engineering for Applications Development, Web Data Extraction, MDA, WebML, Web Data Extraction and Crawling Techniques, Adaptive Web Applications, model-driven engineering, UML-Based Methodology, Information retrieval, profile UML-Web UML, Unified Modeling Language, 10. No inequality, Informática, Profile UML-Web UML, Context-Aware Web Applications, Accessibility web, OCL, Computer science, TK1-9971, Programming language, 3. Good health, World Wide Web, Computer Science, Physical Sciences, Electrical engineering. Electronics. Nuclear engineering, Model-driven engineering, Web page, Software, Information Systems
Popis souboru: application/pdf
Přístupová URL adresa: https://doaj.org/article/3b1132df020442679e2c2bd55830942b
-
4
Autoři:
Zdroj: International Journal of Software Engineering and Computer Systems. 9:93-104
Témata: FOS: Computer and information sciences, Artificial intelligence, Twitter Sentiment, 7. Clean energy, 12. Responsible consumption, Emoji, Social media, Web Data Extraction and Crawling Techniques, Sentiment analysis, Context (archaeology), Artificial Intelligence, Aspect-based Sentiment Analysis, Machine learning, Sentiment Analysis, Information retrieval, Biology, 9. Industry and infrastructure, Natural language processing, Paleontology, Statistical Machine Translation and Natural Language Processing, 16. Peace & justice, Computer science, Programming language, World Wide Web, Sentiment Analysis and Opinion Mining, Emotion Recognition, Computer Science, Physical Sciences, 8. Economic growth, Negation, Social Media, Information Systems
-
5
Autoři:
Zdroj: Journal of Information Technology and Computing. 4:1-14
Témata: FOS: Computer and information sciences, Artificial intelligence, Support vector machine, Class (philosophy), Feature (linguistics), 02 engineering and technology, Epistemology, Weighting, Pattern recognition (psychology), Feature vector, Web Data Extraction and Crawling Techniques, Machine Learning Algorithms, Selection (genetic algorithm), Artificial Intelligence, Multi-label Text Classification in Machine Learning, Machine learning, 0202 electrical engineering, electronic engineering, information engineering, Syntax, Feature Selection, Multi-label Learning, Content Adaptation, Text Classification, Preprocessor, Ontology, Natural language processing, 4. Education, Linguistics, Computer science, FOS: Philosophy, ethics and religion, Automatic Keyword Extraction from Textual Data, Philosophy, Computer Science, Physical Sciences, Feature selection, FOS: Languages and literature, Medicine, Radiology, Information Systems
-
6
Autoři: Shangzheng Song
Zdroj: Applied and Computational Engineering. 2:981-992
Témata: FOS: Computer and information sciences, Artificial intelligence, Information extraction, Web Data Extraction, Set (abstract data type), Data science, 12. Responsible consumption, Web Data Extraction and Crawling Techniques, Sentiment analysis, Artificial Intelligence, Aspect-based Sentiment Analysis, Multi-label Text Classification in Machine Learning, Sentiment Analysis, Genetics, Information retrieval, Feature Selection, Data mining, Biology, Computer science, Process (computing), Programming language, Paragraph, World Wide Web, Operating system, Sentiment Analysis and Opinion Mining, Emotion Recognition, FOS: Biological sciences, Computer Science, Physical Sciences, Polarity (international relations), Cell, Information Systems
-
7
A Levenshtein distance-based method for word segmentation in corpus augmentation of geoscience texts
Autoři: a další
Zdroj: Annals of GIS, Vol 29, Iss 2, Pp 293-306 (2023)
Témata: FOS: Computer and information sciences, Text segmentation, Artificial intelligence, Web Data Extraction, Annotation, Word (group theory), Mathematical geography. Cartography, 02 engineering and technology, GA1-1776, Mathematical analysis, 01 natural sciences, Web Data Extraction and Crawling Techniques, Segmentation, Geoscience, Artificial Intelligence, word segmentation, FOS: Mathematics, 0202 electrical engineering, electronic engineering, information engineering, Information retrieval, Natural Language Processing, 0105 earth and related environmental sciences, Chinese, Domain (mathematical analysis), Natural language processing, 4. Education, Levenshtein distance, Linguistics, Statistical Machine Translation and Natural Language Processing, Computer science, Crawling, FOS: Philosophy, ethics and religion, Philosophy, Computer Science, Physical Sciences, FOS: Languages and literature, Medicine, Conditional random field, Anatomy, corpus augmentation, Mathematics, Information Systems
Přístupová URL adresa: https://doaj.org/article/889ceef5a9cc4fe3873ea47cd8d78d79
-
8
Autoři: a další
Zdroj: Findings of the Association for Computational Linguistics: ACL 2023. :338-352
Témata: FOS: Computer and information sciences, SQL, Artificial intelligence, Natural language processing, Statistical Machine Translation and Natural Language Processing, Computer science, Structured Data, Programming language, Web Data Extraction and Crawling Techniques, Theoretical computer science, Artificial Intelligence, Computer Science, Physical Sciences, Information retrieval, Natural Language Processing, Information Systems
-
9
Autoři: a další
Zdroj: Computer Systems Science and Engineering. 46:2195-2214
Témata: FOS: Computer and information sciences, Web Data Extraction, Search engine indexing, 02 engineering and technology, Digital library, Data science, Web Data Extraction and Crawling Techniques, Artificial Intelligence, Subject (documents), Multi-label Text Classification in Machine Learning, 0202 electrical engineering, electronic engineering, information engineering, Information retrieval, Multi-label Learning, Metadata, Web Crawling, Geography, Computer science, Process (computing), Automatic Keyword Extraction from Textual Data, World Wide Web, Operating system, Literature, Computer Science, Physical Sciences, Information Retrieval, Poetry, Document Categorization, Benchmark (surveying), Art, Geodesy, Information Systems
-
10
Autoři: a další
Zdroj: Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). :6102-6114
Témata: FOS: Computer and information sciences, Artificial intelligence, Identifier, Sequence-to-Sequence Learning, FOS: Political science, FOS: Law, Relevance feedback, Information Retrieval Techniques and Evaluation, Computer Science - Information Retrieval, Web Data Extraction and Crawling Techniques, Inference, Artificial Intelligence, Image (mathematics), Genetics, Information retrieval, Data mining, Biology, Political science, Natural Language Processing, Topic Modeling, Learning to Rank, Named Entity Recognition, Computer science, Programming language, World Wide Web, FOS: Biological sciences, Computer Science, Physical Sciences, Information Retrieval, Relevance (law), Pipeline (software), Image retrieval, Law, Information Retrieval (cs.IR), Information Systems, Index (typography), Sequence (biology)
Přístupová URL adresa: http://arxiv.org/abs/2305.11161
-
11
Autoři: a další
Zdroj: IEEE Access, Vol 11, Pp 9627-9655 (2023)
Témata: web APIs, FOS: Computer and information sciences, Application programming interface, QoS-Aware Web Services Composition and Semantic Matching, 02 engineering and technology, Semantic Web Services, Social psychology, 7. Clean energy, Web Data Extraction and Crawling Techniques, 0202 electrical engineering, electronic engineering, information engineering, Psychology, 10. No inequality, Web development, Web Service Composition, Web service, 9. Industry and infrastructure, 4. Education, client application problems, API Usage Patterns, Computer science, TK1-9971, Programming language, 3. Good health, Client applications, World Wide Web, FOS: Psychology, Popularity, web services, Computer Science, Physical Sciences, Service-Oriented Computing, Web application, Electrical engineering. Electronics. Nuclear engineering, Information Systems, Empirical Studies in Software Engineering
Přístupová URL adresa: https://doaj.org/article/d103eb6339bd4a4bb30bbb6825aff36e
-
12
Autoři:
Zdroj: Intelligent Automation & Soft Computing. 36:2379-2391
Témata: FOS: Computer and information sciences, Artificial intelligence, Support vector machine, Text Mining, Economics, Feature (linguistics), Word (group theory), Automatic summarization, Web Data Extraction and Crawling Techniques, Document classification, Sentiment analysis, Task (project management), Context (archaeology), Artificial Intelligence, Multi-label Text Classification in Machine Learning, Machine learning, Information retrieval, Multi-label Learning, Text Classification, Biology, Natural language processing, 4. Education, Paleontology, Linguistics, Bag-of-words model, Computer science, FOS: Philosophy, ethics and religion, Management, Philosophy, Sentiment Analysis and Opinion Mining, Emotion Recognition, Computer Science, Physical Sciences, FOS: Languages and literature, Document Categorization, Polysemy, Information Systems, SemEval
-
13
Autoři: a další
Zdroj: IEEE Access, Vol 11, Pp 28680-28687 (2023)
Témata: FOS: Computer and information sciences, Artificial neural network, Computer Science - Machine Learning, Artificial intelligence, Web Data Extraction, 02 engineering and technology, Pattern recognition (psychology), Quantum mechanics, Computer Science - Information Retrieval, Machine Learning (cs.LG), Filter (signal processing), Web Data Extraction and Crawling Techniques, Click-Through Rate Prediction, Shape Matching and Object Recognition, Machine learning, 0202 electrical engineering, electronic engineering, information engineering, Information retrieval, User Modeling, Data mining, Transformer, Web Crawling, Physics, Voltage, Computer science, web search, TK1-9971, click prediction, Recommender System Technologies, Click model, Collaborative Filtering, Computer Science, Physical Sciences, transformer, Feature extraction, Computer vision, Electrical engineering. Electronics. Nuclear engineering, Computer Vision and Pattern Recognition, Information Retrieval (cs.IR), Information Systems
Přístupová URL adresa: http://arxiv.org/abs/2301.07854
https://doaj.org/article/667d0299e95549a1ac436ea973c26366 -
14
Autoři: a další
Zdroj: Iran Journal of Computer Science. 6:221-232
Témata: FOS: Computer and information sciences, Artificial neural network, Artificial intelligence, Computer Networks and Communications, Android malware, Mobile device, 02 engineering and technology, Malware, Web Data Extraction and Crawling Techniques, Characterization and Detection of Android Malware, Artificial Intelligence, Android (operating system), Computer security, Deep belief network, Machine learning, Deep neural networks, 0202 electrical engineering, electronic engineering, information engineering, Deep learning, Android Malware, Computer science, Operating system, Machine Learning for Internet Traffic Classification, Categorization, Signal Processing, Computer Science, Physical Sciences, Network Intrusion Detection and Defense Mechanisms, Botnet Detection, Classifier (UML), Information Systems
-
15
Autoři: a další
Témata: FOS: Computer and information sciences, Signature (topology), Artificial intelligence, Feature (linguistics), Geometry, Linguistics, Statistical Machine Translation and Natural Language Processing, Handwriting Recognition and Text Detection, Pattern recognition (psychology), Computer science, FOS: Philosophy, ethics and religion, Web Data Extraction and Crawling Techniques, Philosophy, Artificial Intelligence, Computer Science, Physical Sciences, FOS: Mathematics, FOS: Languages and literature, Feature extraction, Computer Vision and Pattern Recognition, Data mining, Mathematics, Information Systems
-
16
Autoři: a další
Přispěvatelé: a další
Zdroj: Li, T, Li, S & Steedman, M 2021, Semi-Automatic Construction of Text-to-SQL Dataset for Domain Transfer . in S Oepen, K Sagae, R Tsarfaty, G Bouma, D Seddah & D Zeman (eds), Proceedings of the 17th International Conference on Parsing Technologies and the IWPT 2021 Shared Task on Parsing into Enhanced Universal Dependencies (IWPT 2021) . Stroudsburg, PA, United States, pp. 38-49, The 17th International Conference on Parsing Technologies, Bangkok, Thailand, 6/08/21 . https://doi.org/10.18653/v1/2021.iwpt-1.4
Proceedings of the 17th International Conference on Parsing Technology
Proceedings of the 17th International Conference on Parsing Technologies and the IWPT 2021 Shared Task on Parsing into Enhanced Universal Dependencies (IWPT 2021)Témata: FOS: Computer and information sciences, Syntax-based Translation Models, Parallel computing, 02 engineering and technology, Search engine, Mathematical analysis, Database, Web Data Extraction and Crawling Techniques, Machine Translation, Artificial Intelligence, FOS: Mathematics, 0202 electrical engineering, electronic engineering, information engineering, Information retrieval, Natural Language Processing, SQL, Domain (mathematical analysis), Natural language processing, Query by Example, Statistical Machine Translation and Natural Language Processing, Statistical Machine Translation, Computer science, Transfer (computing), Programming language, Dependency Parsing, Computer Science, Physical Sciences, Web search query, Mathematics, Information Systems
Popis souboru: application/pdf
-
17
Autoři: a další
Zdroj: IEEE Access, Vol 10, Pp 78928-78938 (2022)
Témata: FOS: Computer and information sciences, Syntax-based Translation Models, Exploit, Neural Machine Translation, Artificial intelligence, Sentence, Construct (python library), Vietnamese, Epistemology, 02 engineering and technology, Glosbe, Web Data Extraction and Crawling Techniques, Machine Translation, Artificial Intelligence, Computer security, 0202 electrical engineering, electronic engineering, information engineering, Information retrieval, Machine translation, Multilingual Neural Machine Translation, Natural Language Processing, Natural language processing, 4. Education, Chinese-Vietnamese machine translation, Bilingual dictionary, dictionary websites, Linguistics, Construction of a bilingual corpus, Statistical Machine Translation and Natural Language Processing, Statistical Machine Translation, Computer science, TK1-9971, FOS: Philosophy, ethics and religion, Programming language, Philosophy, Computer Science, Physical Sciences, Quality (philosophy), FOS: Languages and literature, Electrical engineering. Electronics. Nuclear engineering, Information Systems
Přístupová URL adresa: https://doaj.org/article/a98bf93663a241d3839a013f22d5dcf9
-
18
Autoři: a další
Zdroj: Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing. :8133-8149
Témata: FOS: Computer and information sciences, Artificial intelligence, Web Data Extraction, Geometry, 02 engineering and technology, Data science, Web Data Extraction and Crawling Techniques, Artificial Intelligence, FOS: Mathematics, 0202 electrical engineering, electronic engineering, information engineering, Information retrieval, Macro, Computer Science - Computation and Language, Natural language processing, Statistical Machine Translation and Natural Language Processing, Computer science, Language Modeling, Process (computing), Programming language, World Wide Web, Computer Science, Physical Sciences, Computation and Language (cs.CL), Semantic Web and Ontology Development, Block (permutation group theory), Software, Mathematics, Information Systems
Přístupová URL adresa: http://arxiv.org/abs/2211.05284
-
19
Autoři: a další
Zdroj: Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing. :2947-2964
Témata: FOS: Computer and information sciences, Web Data Extraction, Economics, Set (abstract data type), 02 engineering and technology, Mathematical analysis, Quantum mechanics, Data science, Web Data Extraction and Crawling Techniques, Task (project management), Artificial Intelligence, 0202 electrical engineering, electronic engineering, information engineering, FOS: Mathematics, Information retrieval, Event (particle physics), Text Classification, Source code, Natural Language Processing, Code (set theory), Computer Science - Computation and Language, Domain (mathematical analysis), Geography, Topic Modeling, Physics, Optics, Focus (optics), Named Entity Recognition, Computer science, Programming language, Management, Automatic Keyword Extraction from Textual Data, Operating system, Computer Science, Physical Sciences, Textual Data, Benchmark (surveying), Computation and Language (cs.CL), Mathematics, Geodesy, Information Systems
Přístupová URL adresa: http://arxiv.org/abs/2211.13896
-
20
Autoři: a další
Zdroj: IEEE Access, Vol 10, Pp 87681-87697 (2022)
Témata: Data Quality Assessment and Improvement, FOS: Computer and information sciences, Artificial intelligence, Economics, FOS: Political science, Social Sciences, End-to-end principle, 02 engineering and technology, Decision Sciences, Data science, Web Data Extraction and Crawling Techniques, Task (project management), Data Cleaning, 11. Sustainability, 0202 electrical engineering, electronic engineering, information engineering, Unstructured data, Political science, Physics, Named Entity Recognition, FOS: Philosophy, ethics and religion, Management, Physical Sciences, Electrical engineering. Electronics. Nuclear engineering, Information Systems, Analytics, Information extraction, Web Data Extraction, Volume (thermodynamics), Vietnamese, FOS: Law, Noise (video), Management Science and Operations Research, Real estate, NLP applications, Quantum mechanics, Big data, Data warehouse, Artificial Intelligence, Field (mathematics), FOS: Mathematics, Image (mathematics), Information retrieval, Data mining, information retrieval and text mining, Natural Language Processing, Pure mathematics, Linguistics, 15. Life on land, Computer science, TK1-9971, Process (computing), Philosophy, Operating system, Computer Science, Named-entity recognition, FOS: Languages and literature, Law, Mathematics
Přístupová URL adresa: https://doaj.org/article/7de9011ea16144db9e0f9a4038993f16
Nájsť tento článok vo Web of Science
Full Text Finder