How to do linguistics with R : data exploration and statistical analysis
This book provides a linguist with a statistical toolkit for exploration and analysis of linguistic data. It employs R, a free software environment for statistical computing, which is increasingly popular among linguists. How to do Linguistics with R: Data exploration and statistical analysis is uni...
Gespeichert in:
| 1. Verfasser: | |
|---|---|
| Format: | E-Book Buch |
| Sprache: | Englisch |
| Veröffentlicht: |
Amsterdam
John Benjamins Publishing Company
2015
John Benjamins |
| Ausgabe: | 1 |
| Schlagworte: | |
| ISBN: | 9789027212252, 9027212252, 9789027212245, 9027212244 |
| Online-Zugang: | Volltext |
| Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
| Abstract | This book provides a linguist with a statistical toolkit for exploration and analysis of linguistic data. It employs R, a free software environment for statistical computing, which is increasingly popular among linguists. How to do Linguistics with R: Data exploration and statistical analysis is unique in its scope, as it covers a wide range of classical and cutting-edge statistical methods, including different flavours of regression analysis and ANOVA, random forests and conditional inference trees, as well as specific linguistic approaches, among which are Behavioural Profiles, Vector Space Models and various measures of association between words and constructions. The statistical topics are presented comprehensively, but without too much technical detail, and illustrated with linguistic case studies that answer non-trivial research questions. The book also demonstrates how to visualize linguistic data with the help of attractive informative graphs, including the popular ggplot2 system and Google visualization tools.This book has a companion website: http://doi.org/10.1075/z.195.website |
|---|---|
| AbstractList | This book provides a linguist with a statistical toolkit for exploration and analysis of linguistic data. It employs R, a free software environment for statistical computing, which is increasingly popular among linguists. How to do Linguistics with R: Data exploration and statistical analysis is unique in its scope, as it covers a wide range of classical and cutting-edge statistical methods, including different flavours of regression analysis and ANOVA, random forests and conditional inference trees, as well as specific linguistic approaches, among which are Behavioural Profiles, Vector Space Models and various measures of association between words and constructions. The statistical topics are presented comprehensively, but without too much technical detail, and illustrated with linguistic case studies that answer non-trivial research questions. The book also demonstrates how to visualize linguistic data with the help of attractive informative graphs, including the popular ggplot2 system and Google visualization tools.This book has a companion website: http://doi.org/10.1075/z.195.website |
| Author | Levshina, Natalia |
| Author_xml | – sequence: 1 fullname: Levshina, Natalia |
| BackLink | https://cir.nii.ac.jp/crid/1130282271414332800$$DView record in CiNii |
| BookMark | eNplkEtLAzEUhSM-sOr8hywEcVHNOxl3WqoVCoKI2yGTZDQaJ3WSOuqvd2oFETf3nsP9OBzuHthqY-sAKDA6wUjy088TXPINUJRSlYhIIhTjcvPXY0I42QZ7BGGOsJBI7YBRiVRJOad0FxQpPSE0HCVijI7AbBZ7mCO0EQbfPix9yt4k2Pv8CG_hGbQ6a-jeFyF2OvvYQt1amPKgV6AOg9fhI_l0ALYbHZIrfvY-uL-c3k1m4_nN1fXkfD7WjDIqxkxgq4y1DbWNsrWqayqwsNIQI0xDBG5KS2VNXD0U1Eo7jowh1EljnW4Yp_vgeB2s07Pr02MMOVVvwdUxPqfqz19-2V6H7DrrHrrlxyCqF92Zf-zRml108XXpUq6-I41rc6dDNb2YMKqEQKsGh2uy9b4yfjUxpogoQiRmmFFKFEL0CzyVe_Y |
| ContentType | eBook Book |
| DBID | RYH |
| DEWEY | 410.2855362 |
| DOI | 10.1075/z.195 |
| DatabaseName | CiNii Complete |
| DatabaseTitleList | |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Languages & Literatures |
| EISBN | 9789027268457 9027268452 |
| Edition | 1 |
| ExternalDocumentID | 9789027268457 EBC4386605 BB20264968 |
| GroupedDBID | -VX 20A 38. AABBV AAZEP ABARN ABMRC ABQPQ ACLGV ADVEM AERYV AGWCO AHLVE AHWGJ AIXPE AJFER ALMA_UNASSIGNED_HOLDINGS AZZ BBABE CZZ DUGUG EBSCA ECOWB EKLKH HELXT I4C JJU MTCAI MYL N~5 QD8 RYH XI1 |
| ID | FETCH-LOGICAL-a43436-461d8cddf3df8db8bb3616d7c2c6cf261f9d37b2eb044a8ae50cc23e7cdeaf453 |
| ISBN | 9789027212252 9027212252 9789027212245 9027212244 |
| IngestDate | Thu Oct 09 05:52:06 EDT 2025 Fri Nov 21 19:43:29 EST 2025 Wed Nov 26 04:53:10 EST 2025 Thu Jun 26 22:03:04 EDT 2025 |
| IsPeerReviewed | false |
| IsScholarly | false |
| Keywords | Theoretical linguistics Cognition and language Computational & corpus linguistics |
| LCCN | 2015016708 |
| LCCallNum_Ident | P98.5.S83 |
| Language | English |
| LinkModel | OpenURL |
| MergedId | FETCHMERGED-LOGICAL-a43436-461d8cddf3df8db8bb3616d7c2c6cf261f9d37b2eb044a8ae50cc23e7cdeaf453 |
| Notes | Includes bibliographical references (p. [425]-432) and indexes |
| OCLC | 908935533 |
| PQID | EBC4386605 |
| PageCount | 456 |
| ParticipantIDs | askewsholts_vlebooks_9789027268457 walterdegruyter_marc_9789027268457 proquest_ebookcentral_EBC4386605 nii_cinii_1130282271414332800 |
| PublicationCentury | 2000 |
| PublicationDate | c2015 2015 [2015] 2015-11-25 |
| PublicationDateYYYYMMDD | 2015-01-01 2015-11-25 |
| PublicationDate_xml | – year: 2015 text: c2015 |
| PublicationDecade | 2010 |
| PublicationPlace | Amsterdam |
| PublicationPlace_xml | – name: Amsterdam – name: Philadelphia |
| PublicationYear | 2015 |
| Publisher | John Benjamins Publishing Company John Benjamins |
| Publisher_xml | – name: John Benjamins Publishing Company – name: John Benjamins |
| RestrictionsOnAccess | restricted access |
| SSID | ssj0001570443 |
| Score | 1.959328 |
| Snippet | This book provides a linguist with a statistical toolkit for exploration and analysis of linguistic data. It employs R, a free software environment for... |
| SourceID | askewsholts walterdegruyter proquest nii |
| SourceType | Aggregation Database Publisher |
| SubjectTerms | Cognition and language Computational & corpus linguistics Computational linguistics Computational linguistics -- Methodology Computational linguistics -- Statistical methods LANGUAGE ARTS & DISCIPLINES / Linguistics / General Linguistics -- Software Methodology Theoretical linguistics |
| TableOfContents | 7.2 Putting several factors together: Predicting reaction times in a lexical decision task -- 7.2.1 Data and hypotheses -- 7.2.2 The lm() function and interpretation of its output -- 7.2.3 Selecting the explanatory variables -- 7.2.4 Checking for outliers and overly influential observations -- 7.2.5 Checking the regression assumptions -- 7.2.6 Testing and interpreting interactions -- 7.2.7 Checking for overfitting -- 7.2.8 Robust regression: Bootstrap -- 7.3 Summary -- 8. Finding differences between several groups -- What you will learn from this chapter: -- 8.1 What is ANOVA? -- 8.2 Motion events in Nicaraguan Sign Language: Independent one-way ANOVA -- 8.2.1 Theoretical background and data -- 8.2.2 Exploring the data -- 8.2.3 Assumptions of one-way parametric ANOVA -- 8.2.4 Performing parametric one-way ANOVA -- 8.2.5 Alternative tests -- 8.2.6 Post-hoc tests -- 8.3 Development of spatial modulations in Nicaraguan Sign Language: Independent factorial (two-way) ANOVA -- 8.3.1 The data and hypothesis -- 8.3.2 Descriptive statistics for different groups and interaction plot -- 8.3.3 Assumptions of parametric factorial ANOVA -- 8.3.4 ANOVA and orthogonal contrasts -- 8.3.5 Alternative tests -- 8.3.6 Post-hoc tests -- 8.4 Do native English and native Mandarin Chinese speakers conceptualize time differently? Repeated-measured and mixed-design ANOVA (mixed GLM method) -- 8.4.1 The data and hypothesis -- 8.4.2 Fitting a mixed-design ANOVA with the help of mixed GLM -- 8.4.3 Post-hoc tests -- 8.5 Summary -- 9. Measuring associations between two categorical variables -- What you will learn from this chapter: -- 9.1 Testing independence -- 9.2 The story of over is not over: Metaphoric and non-metaphoric uses in two registers (analysis of a 2-by-2 contingency table) -- 9.2.1 The data and hypothesis 4.3.2 Deviation of proportions as a measure of dispersion -- 4.4 Summary -- 5. Comparing two groups -- What you will learn from this chapter: -- 5.1 Comparing group means (medians): An overview of the tests -- 5.2 Comparing the number of associations triggered by high- and low-frequency nouns with the help of an independent t-test -- 5.2.1 Data and hypothesis -- 5.2.2 Descriptive statistics and visualizations -- 5.2.3 Choosing an appropriate test to compare the measures of central tendency in two groups -- 5.2.4 Confidence intervals and standard errors -- 5.3 Comparing concreteness scores of high- and low-frequency nouns with the help of a two-tailed Wilcoxon test -- 5.3.1 Data and hypotheses -- 5.3.2 Descriptive statistics and visualizations: Strip charts and rug plots -- 5.3.3 Inferential statistics: The two-tailed Wilcoxon test -- 5.4. Comparing associations produced by native and non-native speakers: A paired one-tailed t-test -- 5.4.1 Creating simulation data -- 5.4.2 Performing the paired t-test -- 5.5 Summary -- 6. Relationships between two numerical variables -- What you will learn from this chapter: -- 6.1 What is correlation? -- 6.2 Word length and word recognition: The Pearson product-moment correlation coefficient -- 6.2.1 The data and hypothesis -- 6.2.2 Descriptive statistics and visualizations -- 6.2.3 Testing the significance of the correlation coefficient -- 6.3 Emergence of grammar from lexicon: Spearman's ρ and Kendall's τ. -- 6.3.1 The data and hypothesis -- 6.3.2 Exploring the data and computing correlation coefficients -- 6.4 Visualization of correlations between more than two variables with the help of correlograms -- 6.5 Summary -- 7. More on frequencies and reaction times -- What you will learn from this chapter -- 7.1 The basic principles of linear regression analysis 12.2.2 Fitting a binary logistic regression model: Main functions -- 12.2.3 Selection of variables -- 12.2.4 Testing possible interactions -- 12.2.5 Identifying outliers and overly influential observations -- 12.2.6 Checking the regression assumptions -- 12.2.7 Testing for overfitting -- 12.2.8 Interpretation of the model -- 12.3 Summary -- 13. Multinomial (polytomous) logistic regression models of three and more near synonyms -- What you will learn from this chapter: -- 13.1 What is multinomial regression? -- 13.2 Multinomial models of English permissive constructions -- 13.2.1 Data and hypotheses -- 13.2.2 Contrasting allow and permit with let -- 13.2.3 'One vs. rest' approach -- 13.3 Summary -- 14. Conditional inference trees and random forests -- What you will learn from this chapter: -- 14.1 Conditional inference trees and random forests -- 14.2 Conditional inference trees and random forests of three English causative constructions -- 14.2.1 The data and hypotheses -- 14.2.2 Fitting a conditional inference tree model -- 14.2.3 Random forests -- 14.3 Summary -- 15. Behavioural profiles, distance metrics and cluster analysis -- What you will learn from this chapter: -- 15.1 What are Behavioural Profiles? -- 15.2 Behavioural Profiles of English analytic causatives -- 15.2.1 Data and theoretical background -- 15.2.2 Computation of numeric BP vectors from the categorical data -- 15.2.3 Distance matrix -- 15.2.4 Hierarchical cluster analysis -- 15.2.4.1 Identifying the clusters -- 15.2.4.2 Interpretation of the cluster solution: Snake plots and effect size measures -- 15.2.4.3 Validation of a cluster solution -- 15.2.5 Partitioning methods -- 15.2.5.1 General introduction -- 15.2.5.2 Partitioning around centroids (k-means) -- 15.2.5.3 Partitioning around medoids -- 15.3 Summary -- 16. Introduction to Semantic Vector Spaces 9.2.2 Visualizations, proportions and measures of effect size: Odds ratios, Cramér's V and the ø coefficient -- 9.2.3 Testing statistical significance: The χ2 -test of independence -- 9.3 Metaphorical and non-metaphorical uses of see in four registers (analysis of a 4-by-2 table) -- 9.3.1 The data and hypothesis -- 9.3.2 Descriptive statistics and visualizations -- 9.3.3 Testing the statistical significance and analysing the residuals: The χ2-test and mosaic and association plots -- 9.4 Summary -- 10. Association measures -- What will you learn from this chapter: -- 10.1 Measures of association: A brief typology -- 10.1.1 Frequencies that you will need in order to compute association measures -- 10.1.2 Unidirectional (asymmetric) vs. bidirectional (symmetric) measures -- 10.1.3 Contingency-based vs. non-contingency-based measures -- 10.2 Case study: The Russian ditransitive construction and its collexemes -- 10.2.1 Theoretical background and data -- 10.2.2 Computation of some popular association measures -- 10.3 Summary -- 11. Geographic variation of quite: Distinctive collexeme analysis -- What you will learn from this chapter: -- 11.1 Introduction to distinctive collexeme analysis -- 11.2 Distinctive collexeme analysis of quite + ADJ in different varieties of English -- 11.2.1 Theoretical background and data -- 11.2.2 Simple distinctive collexeme analysis of quite + ADJ in British and American English -- 11.2.3 Multiple distinctive collexeme analysis: Quite + ADJ in the British, American and Canadian varieties of English -- 11.3 Summary -- 12. Probabilistic multifactorial grammar and lexicology -- What you will learn from this chapter: -- 12.1 Introduction to logistic regression -- 12.2 Logistic regression model of Dutch causative auxiliaries doen and laten -- 12.2.1 Theoretical background and data What you will learn from this chapter Intro -- How to do Linguistics with R -- Title page -- LCC data -- Dedication page -- Table of contents -- Acknowledgements -- Introduction -- 1. Who is this book written for? -- 2. The quantitative turn in linguistics -- 3. How to use this textbook -- 1. What is statistics? -- What you will learn from this chapter: -- 1.1 Statistics and statistics -- 1.2 Formulating and testing your hypotheses -- 1.2.1 Null and alternative hypotheses -- 1.2.2 Those mysterious p-values… -- 1.2.3 Type I and Type II errors -- 1.2.4 One-tailed and two-tailed statistical tests -- 1.3 What statistics cannot do for you -- 1.4 Types of variables -- 1.5 Summary -- 2. Introduction to R -- What you will learn from this chapter: -- 2.1 Introduction -- 2.2 Installation of the basic distribution and add-on packages -- 2.3 First steps with R -- 2.3.1 Starting R -- 2.3.2 R syntax -- 2.3.3 Exiting from R or terminating a process -- 2.3.4 Getting help -- 2.4 Main types of R objects -- 2.5 RStudio -- 2.6 Importing and exporting your data and saving your graphs -- 2.6.1 Importing your data to R -- 2.6.2 Exporting your data from R -- 2.6.3 Saving your graphs -- 2.7 Summary -- 3. Descriptive statistics for quantitative variables -- What you will learn from this chapter: -- 3.1 Analysing the distribution of word lengths: Basic descriptive statistics -- 3.1.1 The data -- 3.1.2 Measures of central tendency -- 3.1.3 Measures of dispersion -- 3.2 Bad times, good times: Visualization of a distribution and finding outliers -- 3.3 Zipf's law and word frequency: Transformation of quantitative variables -- 3.4 Summary -- 4. How to explore qualitative variables -- What you will learn from this chapter: -- 4.1 Frequency tables, proportions and percentages -- 4.2 Visualization of categorical data -- 4.3 Basic Colour Terms: Deviations of Proportions in subcorpora -- 4.3.1 The data and hypothesis Chapter 2. Introduction to R -- References -- Chapter 17. Language and space -- Chapter 18. Multidimensional analysis of register variation -- Chapter 8. Finding differences between several groups -- Subject Index -- Chapter 20. Constructional change and motion charts -- Chapter 13. Multinomial (polytomous) logistic regression models of three and more near synonyms -- Chapter 12. Probabilistic multifactorial grammar and lexicology -- Chapter 14. Conditional inference trees and random forests -- Prelim pages -- Acknowledgements -- Chapter 11. Geographic variation of quite: Distinctive collexeme analysis -- Chapter 19. Exemplars, categories, prototypes -- Chapter 6. Relationships between two quantitative variables -- Index of R functions and packages Table of contents -- Chapter 4. How to explore qualitative variables -- Introduction -- Chapter 10. Association measures -- Chapter 5. Comparing two groups -- The most important R objects and basic operations with them -- Chapter 15. Behavioural profiles, distance metrics and cluster analysis -- Epilogue -- Chapter 16. Introduction to Semantic Vector Spaces -- Chapter 3. Descriptive statistics for quantitative variables -- Chapter 1. What is statistics? -- Chapter 7. More on frequencies and reaction times -- Chapter 9. Measuring associations between two categorical variables -- Main plotting functions and graphical parameters in R -- |
| Title | How to do linguistics with R : data exploration and statistical analysis |
| URI | https://cir.nii.ac.jp/crid/1130282271414332800 https://ebookcentral.proquest.com/lib/[SITE_ID]/detail.action?docID=4386605 https://www.degruyterbrill.com/isbn/9789027268457 https://www.vlebooks.com/vleweb/product/openreader?id=none&isbn=9789027268457&uid=none |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1Lb9NAEB6VBCEqVJV3oEWrCnFBBm_W9q6vrYJ6CFVVVVVv1j6r8LBR7ITCr2d247pxuMChl1XsxLvyN9HMfLOzMwBvnUCbrWwSCSZklDhtIxkbGzlDbcqpYjZ0nruY8pMTcXmZn7ZNS-vQToCXpbi-zn_cqajxHgrbH539D3F3k-IN_IxCxxHFjuOGR9xdtm1-fHe46r2pPNG-WrQVmEOgtdvOmdqljznJlWL11Q_lOu-n6QbvD4k1h7b8Ir_7XJq1oFVfkayIYo7s09d1WRWD_kttot-Ar_X7A101vOyXpd4wF10SX2_aezAcJylLBzBEezr5fBvnSnmcJOwBPGoX-hiW2YZtWX9FPY46vqnRsJezWc_J3_kZ0gWMvZovfjU329PB6p_vwtD6oyCPYcuWT-DFtA3t1uQdmXbVqOunsIfIk6YipiJryBOPPDl7BhefJudHx1HbdSKS_pRtFiUZ9S2djGPGCaOEUiyjmeF6rDPtkHG63DCuxlbhi0khbRprPWaWa2OlQxCew6CsSvsSCJJnI2LuuMYvVCxzpmiWCWTdeS5joUdwsIZCsfwWdsjrogftCPYRnELP_Ej9RjM6dZwm1JeeQ39_BOQGtiI836b1FpPDo4SJDAkrrrMBZ-ErpfTXefUvP3oND2__jHswaOYLuw_39bKZ1fM3rfD_AMKyOGk |
| linkProvider | ProQuest Ebooks |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=book&rft.title=How+to+do+Linguistics+with+R&rft.au=Levshina%2C+Natalia&rft.date=2015-01-01&rft.pub=John+Benjamins+Publishing+Company&rft.isbn=9789027268457&rft_id=info:doi/10.1075%2Fz.195&rft.externalDBID=n%2Fa&rft.externalDocID=9789027268457 |
| thumbnail_m | http://cvtisr.summon.serialssolutions.com/2.0.0/image/custom?url=https%3A%2F%2Fwww.degruyterbrill.com%2Fdocument%2Fcover%2Fisbn%2F9789027268457%2Foriginal http://cvtisr.summon.serialssolutions.com/2.0.0/image/custom?url=https%3A%2F%2Fvle.dmmserver.com%2Fmedia%2F640%2F97890272%2F9789027268457.jpg |

