Search results - Automated Data Collection with R: A Practical Guide to Web Scraping and Text Mining

  1. Automated data collection with R: a practical guide to web scraping and text mining
    Authors: Munzert, Simon; Rubba, Christian; Meißner, Peter; Nyhuis, Dominic

    ISBN: 111883481X, 111883478X, 9781118834817, 9781118834787, 9781118834732, 1118834739, 1118834801, 9781118834800
    Publication details: Chichester: Wiley, 2015
    “…A hands on guide to web scraping and text mining for both beginners and experienced users of R…”
    E-book, Book
  2. Automated Data Collection with R - A Practical Guide to Web Scraping and Text Mining
    Author: Iacus, Stefano M.

    ISSN: 1548-7660
    Publication details: Foundation for Open Access Statistics, 01.11.2015
    Published in: Journal of Statistical Software (01.11.2015)
    Journal Article
  4. Automated Data Collection with R: A Practical Guide to Web Scraping and Text Mining
    Author: Selig, Katharina

    ISSN: 0006-341X, 1541-0420
    Publication details: Wiley-Blackwell, 01.12.2017
    Published in: Biometrics (01.12.2017)
    Book Review
  7. Scraping the Web
    Authors: Munzert, Simon; Rubba, Christian; Meißner, Peter; Nyhuis, Dominic

    ISBN: 111883481X, 9781118834817
    Publication details: Chichester, UK: John Wiley & Sons, Ltd, 28.07.2014
    “…This chapter addresses three main aspects of web scraping with R. The first is how to retrieve data from the Web in different scenarios…”
    Chapter
  9. HTML
    Authors: Munzert, Simon; Rubba, Christian; Meißner, Peter; Nyhuis, Dominic

    ISBN: 111883481X, 9781118834817
    Publication details: Chichester, UK: John Wiley & Sons, Ltd, 28.07.2014
    “…This chapter introduces the fundamentals of Hyper Text Markup Language (HTML) from the perspective of a web data collector…”
    Chapter
  10. Managing Data Projects
    Authors: Munzert, Simon; Rubba, Christian; Meißner, Peter; Nyhuis, Dominic

    ISBN: 111883481X, 9781118834817
    Publication details: Chichester, UK: John Wiley & Sons, Ltd, 28.07.2014
    “…Deploying a successful data collection project requires more than knowledge of web technologies…”
    Chapter
  11. Introduction
    Authors: Munzert, Simon; Rubba, Christian; Meißner, Peter; Nyhuis, Dominic

    ISBN: 9781118834817, 111883481X
    Publication details: Chichester, UK: John Wiley & Sons, Ltd, 28.07.2014
    “…The chapter proposes five steps that help to guide the data collection process. There are three areas that are important for data collection on the Web with R…”
    Chapter
  12. Regular Expressions and Essential String Functions
    Authors: Munzert, Simon; Rubba, Christian; Meißner, Peter; Nyhuis, Dominic

    ISBN: 111883481X, 9781118834817
    Publication details: Chichester, UK: John Wiley & Sons, Ltd, 28.07.2014
    “…One of the central tasks in web scraping is to collect the relevant information for the research problem from heaps of textual data…”
    Chapter
  13. Analyzing Sentiments of Product Reviews
    Authors: Munzert, Simon; Rubba, Christian; Meißner, Peter; Nyhuis, Dominic

    ISBN: 111883481X, 9781118834817
    Publication details: Chichester, UK: John Wiley & Sons, Ltd, 28.07.2014
    “…They start by collecting the reviews from the webpage. Next, the files are downloaded and stored in the previously created database…”
    Chapter
  14. Mapping the Geographic Distribution of Names
    Authors: Munzert, Simon; Rubba, Christian; Meißner, Peter; Nyhuis, Dominic

    ISBN: 111883481X, 9781118834817
    Publication details: Chichester, UK: John Wiley & Sons, Ltd, 28.07.2014
    “…It first discusses a data collection strategy, and describes website inspection and data retrieval and information extraction…”
    Chapter
  15. HTTP
    Authors: Munzert, Simon; Rubba, Christian; Meißner, Peter; Nyhuis, Dominic

    ISBN: 111883481X, 9781118834817
    Publication details: Chichester, UK: John Wiley & Sons, Ltd, 28.07.2014
    “…The chapter elaborates the methods GET and POST, the two most important methods for web scraping…”
    Chapter
  16. Statistical Text Processing
    Authors: Munzert, Simon; Rubba, Christian; Meißner, Peter; Nyhuis, Dominic

    ISBN: 111883481X, 9781118834817
    Publication details: Chichester, UK: John Wiley & Sons, Ltd, 28.07.2014
    “…This chapter offers a brief introduction to statistical text processing. The Internet is predominantly a vast collection of more or less unclassified text…”
    Chapter
  19. AJAX
    Authors: Munzert, Simon; Rubba, Christian; Meißner, Peter; Nyhuis, Dominic

    ISBN: 111883481X, 9781118834817
    Publication details: Chichester, UK: John Wiley & Sons, Ltd, 28.07.2014
    “…It discusses the XMLHttpRequest (XHR), an Application Programming Interface (API) for browser-server communication and important data retrieval mechanism for dynamic web applications…”
    Chapter
  20. XPath
    Authors: Munzert, Simon; Rubba, Christian; Meißner, Peter; Nyhuis, Dominic

    ISBN: 111883481X, 9781118834817
    Publication details: Chichester, UK: John Wiley & Sons, Ltd, 28.07.2014
    “…It helps to build an intuition for querying tree-based data structures like Hyper Text Markup Language (HTML)/XML documents…”
    Chapter