How to do linguistics with R : data exploration and statistical analysis

This book provides a linguist with a statistical toolkit for exploration and analysis of linguistic data. It employs R, a free software environment for statistical computing, which is increasingly popular among linguists. How to do Linguistics with R: Data exploration and statistical analysis is uni...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
1. Verfasser: Levshina, Natalia
Format: E-Book Buch
Sprache:Englisch
Veröffentlicht: Amsterdam John Benjamins Publishing Company 2015
John Benjamins
Ausgabe:1
Schlagworte:
ISBN:9789027212252, 9027212252, 9789027212245, 9027212244
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Abstract This book provides a linguist with a statistical toolkit for exploration and analysis of linguistic data. It employs R, a free software environment for statistical computing, which is increasingly popular among linguists. How to do Linguistics with R: Data exploration and statistical analysis is unique in its scope, as it covers a wide range of classical and cutting-edge statistical methods, including different flavours of regression analysis and ANOVA, random forests and conditional inference trees, as well as specific linguistic approaches, among which are Behavioural Profiles, Vector Space Models and various measures of association between words and constructions. The statistical topics are presented comprehensively, but without too much technical detail, and illustrated with linguistic case studies that answer non-trivial research questions. The book also demonstrates how to visualize linguistic data with the help of attractive informative graphs, including the popular ggplot2 system and Google visualization tools.This book has a companion website: http://doi.org/10.1075/z.195.website
AbstractList This book provides a linguist with a statistical toolkit for exploration and analysis of linguistic data. It employs R, a free software environment for statistical computing, which is increasingly popular among linguists. How to do Linguistics with R: Data exploration and statistical analysis is unique in its scope, as it covers a wide range of classical and cutting-edge statistical methods, including different flavours of regression analysis and ANOVA, random forests and conditional inference trees, as well as specific linguistic approaches, among which are Behavioural Profiles, Vector Space Models and various measures of association between words and constructions. The statistical topics are presented comprehensively, but without too much technical detail, and illustrated with linguistic case studies that answer non-trivial research questions. The book also demonstrates how to visualize linguistic data with the help of attractive informative graphs, including the popular ggplot2 system and Google visualization tools.This book has a companion website: http://doi.org/10.1075/z.195.website
Author Levshina, Natalia
Author_xml – sequence: 1
  fullname: Levshina, Natalia
BackLink https://cir.nii.ac.jp/crid/1130282271414332800$$DView record in CiNii
BookMark eNplkEtLAzEUhSM-sOr8hywEcVHNOxl3WqoVCoKI2yGTZDQaJ3WSOuqvd2oFETf3nsP9OBzuHthqY-sAKDA6wUjy088TXPINUJRSlYhIIhTjcvPXY0I42QZ7BGGOsJBI7YBRiVRJOad0FxQpPSE0HCVijI7AbBZ7mCO0EQbfPix9yt4k2Pv8CG_hGbQ6a-jeFyF2OvvYQt1amPKgV6AOg9fhI_l0ALYbHZIrfvY-uL-c3k1m4_nN1fXkfD7WjDIqxkxgq4y1DbWNsrWqayqwsNIQI0xDBG5KS2VNXD0U1Eo7jowh1EljnW4Yp_vgeB2s07Pr02MMOVVvwdUxPqfqz19-2V6H7DrrHrrlxyCqF92Zf-zRml108XXpUq6-I41rc6dDNb2YMKqEQKsGh2uy9b4yfjUxpogoQiRmmFFKFEL0CzyVe_Y
ContentType eBook
Book
DBID RYH
DEWEY 410.2855362
DOI 10.1075/z.195
DatabaseName CiNii Complete
DatabaseTitleList
DeliveryMethod fulltext_linktorsrc
Discipline Languages & Literatures
EISBN 9789027268457
9027268452
Edition 1
ExternalDocumentID 9789027268457
EBC4386605
BB20264968
GroupedDBID -VX
20A
38.
AABBV
AAZEP
ABARN
ABMRC
ABQPQ
ACLGV
ADVEM
AERYV
AGWCO
AHLVE
AHWGJ
AIXPE
AJFER
ALMA_UNASSIGNED_HOLDINGS
AZZ
BBABE
CZZ
DUGUG
EBSCA
ECOWB
EKLKH
HELXT
I4C
JJU
MTCAI
MYL
N~5
QD8
RYH
XI1
ID FETCH-LOGICAL-a43436-461d8cddf3df8db8bb3616d7c2c6cf261f9d37b2eb044a8ae50cc23e7cdeaf453
ISBN 9789027212252
9027212252
9789027212245
9027212244
IngestDate Thu Oct 09 05:52:06 EDT 2025
Fri Nov 21 19:43:29 EST 2025
Wed Nov 26 04:53:10 EST 2025
Thu Jun 26 22:03:04 EDT 2025
IsPeerReviewed false
IsScholarly false
Keywords Theoretical linguistics
Cognition and language
Computational & corpus linguistics
LCCN 2015016708
LCCallNum_Ident P98.5.S83
Language English
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-a43436-461d8cddf3df8db8bb3616d7c2c6cf261f9d37b2eb044a8ae50cc23e7cdeaf453
Notes Includes bibliographical references (p. [425]-432) and indexes
OCLC 908935533
PQID EBC4386605
PageCount 456
ParticipantIDs askewsholts_vlebooks_9789027268457
walterdegruyter_marc_9789027268457
proquest_ebookcentral_EBC4386605
nii_cinii_1130282271414332800
PublicationCentury 2000
PublicationDate c2015
2015
[2015]
2015-11-25
PublicationDateYYYYMMDD 2015-01-01
2015-11-25
PublicationDate_xml – year: 2015
  text: c2015
PublicationDecade 2010
PublicationPlace Amsterdam
PublicationPlace_xml – name: Amsterdam
– name: Philadelphia
PublicationYear 2015
Publisher John Benjamins Publishing Company
John Benjamins
Publisher_xml – name: John Benjamins Publishing Company
– name: John Benjamins
RestrictionsOnAccess restricted access
SSID ssj0001570443
Score 1.959328
Snippet This book provides a linguist with a statistical toolkit for exploration and analysis of linguistic data. It employs R, a free software environment for...
SourceID askewsholts
walterdegruyter
proquest
nii
SourceType Aggregation Database
Publisher
SubjectTerms Cognition and language
Computational & corpus linguistics
Computational linguistics
Computational linguistics -- Methodology
Computational linguistics -- Statistical methods
LANGUAGE ARTS & DISCIPLINES / Linguistics / General
Linguistics -- Software
Methodology
Theoretical linguistics
TableOfContents 7.2 Putting several factors together: Predicting reaction times in a lexical decision task -- 7.2.1 Data and hypotheses -- 7.2.2 The lm() function and interpretation of its output -- 7.2.3 Selecting the explanatory variables -- 7.2.4 Checking for outliers and overly influential observations -- 7.2.5 Checking the regression assumptions -- 7.2.6 Testing and interpreting interactions -- 7.2.7 Checking for overfitting -- 7.2.8 Robust regression: Bootstrap -- 7.3 Summary -- 8. Finding differences between several groups -- What you will learn from this chapter: -- 8.1 What is ANOVA? -- 8.2 Motion events in Nicaraguan Sign Language: Independent one-way ANOVA -- 8.2.1 Theoretical background and data -- 8.2.2 Exploring the data -- 8.2.3 Assumptions of one-way parametric ANOVA -- 8.2.4 Performing parametric one-way ANOVA -- 8.2.5 Alternative tests -- 8.2.6 Post-hoc tests -- 8.3 Development of spatial modulations in Nicaraguan Sign Language: Independent factorial (two-way) ANOVA -- 8.3.1 The data and hypothesis -- 8.3.2 Descriptive statistics for different groups and interaction plot -- 8.3.3 Assumptions of parametric factorial ANOVA -- 8.3.4 ANOVA and orthogonal contrasts -- 8.3.5 Alternative tests -- 8.3.6 Post-hoc tests -- 8.4 Do native English and native Mandarin Chinese speakers conceptualize time differently? Repeated-measured and mixed-design ANOVA (mixed GLM method) -- 8.4.1 The data and hypothesis -- 8.4.2 Fitting a mixed-design ANOVA with the help of mixed GLM -- 8.4.3 Post-hoc tests -- 8.5 Summary -- 9. Measuring associations between two categorical variables -- What you will learn from this chapter: -- 9.1 Testing independence -- 9.2 The story of over is not over: Metaphoric and non-metaphoric uses in two registers (analysis of a 2-by-2 contingency table) -- 9.2.1 The data and hypothesis
4.3.2 Deviation of proportions as a measure of dispersion -- 4.4 Summary -- 5. Comparing two groups -- What you will learn from this chapter: -- 5.1 Comparing group means (medians): An overview of the tests -- 5.2 Comparing the number of associations triggered by high- and low-frequency nouns with the help of an independent t-test -- 5.2.1 Data and hypothesis -- 5.2.2 Descriptive statistics and visualizations -- 5.2.3 Choosing an appropriate test to compare the measures of central tendency in two groups -- 5.2.4 Confidence intervals and standard errors -- 5.3 Comparing concreteness scores of high- and low-frequency nouns with the help of a two-tailed Wilcoxon test -- 5.3.1 Data and hypotheses -- 5.3.2 Descriptive statistics and visualizations: Strip charts and rug plots -- 5.3.3 Inferential statistics: The two-tailed Wilcoxon test -- 5.4. Comparing associations produced by native and non-native speakers: A paired one-tailed t-test -- 5.4.1 Creating simulation data -- 5.4.2 Performing the paired t-test -- 5.5 Summary -- 6. Relationships between two numerical variables -- What you will learn from this chapter: -- 6.1 What is correlation? -- 6.2 Word length and word recognition: The Pearson product-moment correlation coefficient -- 6.2.1 The data and hypothesis -- 6.2.2 Descriptive statistics and visualizations -- 6.2.3 Testing the significance of the correlation coefficient -- 6.3 Emergence of grammar from lexicon: Spearman's ρ and Kendall's τ. -- 6.3.1 The data and hypothesis -- 6.3.2 Exploring the data and computing correlation coefficients -- 6.4 Visualization of correlations between more than two variables with the help of correlograms -- 6.5 Summary -- 7. More on frequencies and reaction times -- What you will learn from this chapter -- 7.1 The basic principles of linear regression analysis
12.2.2 Fitting a binary logistic regression model: Main functions -- 12.2.3 Selection of variables -- 12.2.4 Testing possible interactions -- 12.2.5 Identifying outliers and overly influential observations -- 12.2.6 Checking the regression assumptions -- 12.2.7 Testing for overfitting -- 12.2.8 Interpretation of the model -- 12.3 Summary -- 13. Multinomial (polytomous) logistic regression models of three and more near synonyms -- What you will learn from this chapter: -- 13.1 What is multinomial regression? -- 13.2 Multinomial models of English permissive constructions -- 13.2.1 Data and hypotheses -- 13.2.2 Contrasting allow and permit with let -- 13.2.3 'One vs. rest' approach -- 13.3 Summary -- 14. Conditional inference trees and random forests -- What you will learn from this chapter: -- 14.1 Conditional inference trees and random forests -- 14.2 Conditional inference trees and random forests of three English causative constructions -- 14.2.1 The data and hypotheses -- 14.2.2 Fitting a conditional inference tree model -- 14.2.3 Random forests -- 14.3 Summary -- 15. Behavioural profiles, distance metrics and cluster analysis -- What you will learn from this chapter: -- 15.1 What are Behavioural Profiles? -- 15.2 Behavioural Profiles of English analytic causatives -- 15.2.1 Data and theoretical background -- 15.2.2 Computation of numeric BP vectors from the categorical data -- 15.2.3 Distance matrix -- 15.2.4 Hierarchical cluster analysis -- 15.2.4.1 Identifying the clusters -- 15.2.4.2 Interpretation of the cluster solution: Snake plots and effect size measures -- 15.2.4.3 Validation of a cluster solution -- 15.2.5 Partitioning methods -- 15.2.5.1 General introduction -- 15.2.5.2 Partitioning around centroids (k-means) -- 15.2.5.3 Partitioning around medoids -- 15.3 Summary -- 16. Introduction to Semantic Vector Spaces
9.2.2 Visualizations, proportions and measures of effect size: Odds ratios, Cramér's V and the ø coefficient -- 9.2.3 Testing statistical significance: The χ2 -test of independence -- 9.3 Metaphorical and non-metaphorical uses of see in four registers (analysis of a 4-by-2 table) -- 9.3.1 The data and hypothesis -- 9.3.2 Descriptive statistics and visualizations -- 9.3.3 Testing the statistical significance and analysing the residuals: The χ2-test and mosaic and association plots -- 9.4 Summary -- 10. Association measures -- What will you learn from this chapter: -- 10.1 Measures of association: A brief typology -- 10.1.1 Frequencies that you will need in order to compute association measures -- 10.1.2 Unidirectional (asymmetric) vs. bidirectional (symmetric) measures -- 10.1.3 Contingency-based vs. non-contingency-based measures -- 10.2 Case study: The Russian ditransitive construction and its collexemes -- 10.2.1 Theoretical background and data -- 10.2.2 Computation of some popular association measures -- 10.3 Summary -- 11. Geographic variation of quite: Distinctive collexeme analysis -- What you will learn from this chapter: -- 11.1 Introduction to distinctive collexeme analysis -- 11.2 Distinctive collexeme analysis of quite + ADJ in different varieties of English -- 11.2.1 Theoretical background and data -- 11.2.2 Simple distinctive collexeme analysis of quite + ADJ in British and American English -- 11.2.3 Multiple distinctive collexeme analysis: Quite + ADJ in the British, American and Canadian varieties of English -- 11.3 Summary -- 12. Probabilistic multifactorial grammar and lexicology -- What you will learn from this chapter: -- 12.1 Introduction to logistic regression -- 12.2 Logistic regression model of Dutch causative auxiliaries doen and laten -- 12.2.1 Theoretical background and data
What you will learn from this chapter
Intro -- How to do Linguistics with R -- Title page -- LCC data -- Dedication page -- Table of contents -- Acknowledgements -- Introduction -- 1. Who is this book written for? -- 2. The quantitative turn in linguistics -- 3. How to use this textbook -- 1. What is statistics? -- What you will learn from this chapter: -- 1.1 Statistics and statistics -- 1.2 Formulating and testing your hypotheses -- 1.2.1 Null and alternative hypotheses -- 1.2.2 Those mysterious p-values… -- 1.2.3 Type I and Type II errors -- 1.2.4 One-tailed and two-tailed statistical tests -- 1.3 What statistics cannot do for you -- 1.4 Types of variables -- 1.5 Summary -- 2. Introduction to R -- What you will learn from this chapter: -- 2.1 Introduction -- 2.2 Installation of the basic distribution and add-on packages -- 2.3 First steps with R -- 2.3.1 Starting R -- 2.3.2 R syntax -- 2.3.3 Exiting from R or terminating a process -- 2.3.4 Getting help -- 2.4 Main types of R objects -- 2.5 RStudio -- 2.6 Importing and exporting your data and saving your graphs -- 2.6.1 Importing your data to R -- 2.6.2 Exporting your data from R -- 2.6.3 Saving your graphs -- 2.7 Summary -- 3. Descriptive statistics for quantitative variables -- What you will learn from this chapter: -- 3.1 Analysing the distribution of word lengths: Basic descriptive statistics -- 3.1.1 The data -- 3.1.2 Measures of central tendency -- 3.1.3 Measures of dispersion -- 3.2 Bad times, good times: Visualization of a distribution and finding outliers -- 3.3 Zipf's law and word frequency: Transformation of quantitative variables -- 3.4 Summary -- 4. How to explore qualitative variables -- What you will learn from this chapter: -- 4.1 Frequency tables, proportions and percentages -- 4.2 Visualization of categorical data -- 4.3 Basic Colour Terms: Deviations of Proportions in subcorpora -- 4.3.1 The data and hypothesis
Chapter 2. Introduction to R --
References --
Chapter 17. Language and space --
Chapter 18. Multidimensional analysis of register variation --
Chapter 8. Finding differences between several groups --
Subject Index --
Chapter 20. Constructional change and motion charts --
Chapter 13. Multinomial (polytomous) logistic regression models of three and more near synonyms --
Chapter 12. Probabilistic multifactorial grammar and lexicology --
Chapter 14. Conditional inference trees and random forests --
Prelim pages --
Acknowledgements --
Chapter 11. Geographic variation of quite: Distinctive collexeme analysis --
Chapter 19. Exemplars, categories, prototypes --
Chapter 6. Relationships between two quantitative variables --
Index of R functions and packages
Table of contents --
Chapter 4. How to explore qualitative variables --
Introduction --
Chapter 10. Association measures --
Chapter 5. Comparing two groups --
The most important R objects and basic operations with them --
Chapter 15. Behavioural profiles, distance metrics and cluster analysis --
Epilogue --
Chapter 16. Introduction to Semantic Vector Spaces --
Chapter 3. Descriptive statistics for quantitative variables --
Chapter 1. What is statistics? --
Chapter 7. More on frequencies and reaction times --
Chapter 9. Measuring associations between two categorical variables --
Main plotting functions and graphical parameters in R --
Title How to do linguistics with R : data exploration and statistical analysis
URI https://cir.nii.ac.jp/crid/1130282271414332800
https://ebookcentral.proquest.com/lib/[SITE_ID]/detail.action?docID=4386605
https://www.degruyterbrill.com/isbn/9789027268457
https://www.vlebooks.com/vleweb/product/openreader?id=none&isbn=9789027268457&uid=none
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1Lb9NAEB6VBCEqVJV3oEWrCnFBBm_W9q6vrYJ6CFVVVVVv1j6r8LBR7ITCr2d247pxuMChl1XsxLvyN9HMfLOzMwBvnUCbrWwSCSZklDhtIxkbGzlDbcqpYjZ0nruY8pMTcXmZn7ZNS-vQToCXpbi-zn_cqajxHgrbH539D3F3k-IN_IxCxxHFjuOGR9xdtm1-fHe46r2pPNG-WrQVmEOgtdvOmdqljznJlWL11Q_lOu-n6QbvD4k1h7b8Ir_7XJq1oFVfkayIYo7s09d1WRWD_kttot-Ar_X7A101vOyXpd4wF10SX2_aezAcJylLBzBEezr5fBvnSnmcJOwBPGoX-hiW2YZtWX9FPY46vqnRsJezWc_J3_kZ0gWMvZovfjU329PB6p_vwtD6oyCPYcuWT-DFtA3t1uQdmXbVqOunsIfIk6YipiJryBOPPDl7BhefJudHx1HbdSKS_pRtFiUZ9S2djGPGCaOEUiyjmeF6rDPtkHG63DCuxlbhi0khbRprPWaWa2OlQxCew6CsSvsSCJJnI2LuuMYvVCxzpmiWCWTdeS5joUdwsIZCsfwWdsjrogftCPYRnELP_Ej9RjM6dZwm1JeeQ39_BOQGtiI836b1FpPDo4SJDAkrrrMBZ-ErpfTXefUvP3oND2__jHswaOYLuw_39bKZ1fM3rfD_AMKyOGk
linkProvider ProQuest Ebooks
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=book&rft.title=How+to+do+Linguistics+with+R&rft.au=Levshina%2C+Natalia&rft.date=2015-01-01&rft.pub=John+Benjamins+Publishing+Company&rft.isbn=9789027268457&rft_id=info:doi/10.1075%2Fz.195&rft.externalDBID=n%2Fa&rft.externalDocID=9789027268457
thumbnail_m http://cvtisr.summon.serialssolutions.com/2.0.0/image/custom?url=https%3A%2F%2Fwww.degruyterbrill.com%2Fdocument%2Fcover%2Fisbn%2F9789027268457%2Foriginal
http://cvtisr.summon.serialssolutions.com/2.0.0/image/custom?url=https%3A%2F%2Fvle.dmmserver.com%2Fmedia%2F640%2F97890272%2F9789027268457.jpg