References
2011. IRIS. https://iris-database.org/.
Abeysooriya, Mandhri, Megan Soria, Mary Sravya Kasu & Mark Ziemann.
2021. Gene name errors: Lessons not learned. PLOS Computational
Biology. Public 17(7). e1008984. https://doi.org/10.1371/journal.pcbi.1008984.
Acheson, Daniel J., Justine B. Wells & Maryellen C. MacDonald. 2008.
New and updated tests of print exposure and reading abilities in college
students. Behavior Research Methods 40(1). 278–289. https://doi.org/10.3758/brm.40.1.278.
Alhazmi, Fahd. 2020. A visual interpretation of the standard deviation.
Medium. https://towardsdatascience.com/a-visual-interpretation-of-the-standard-deviation-30f4676c291c.
Almeida, Alexandre, Adam Loy & Heike Hofmann. 2018. ggplot2
compatible quantile-quantile plots in r. The R Journal 10(2).
248–261. https://doi.org/10.32614/RJ-2018-051.
alvinashcraft, alexbuckgit, ArcticLampyrid & bearmannl. 2022.
Maximum path length limitation. Learn Microsoft. https://learn.microsoft.com/en-us/windows/win32/fileio/maximum-file-path-limitation.
Baayen, R. Harald. 2008. Analyzing linguistic data: A practical
introduction to statistics using r. Cambridge University Press.
Barrett, Malcolm. 2018. Why should i use the here package when i’m
already using projects? - malcolm barrett. https://malco.io/articles/2018-11-05-why-should-i-use-the-here-package-when-i-m-already-using-projects.
Ben-Shachar, Mattan, Daniel Lüdecke & Dominique Makowski. 2020.
Effectsize: Estimation of effect size indices and standardized
parameters. Journal of Open Source Software 5(56). 2815. https://doi.org/10.21105/joss.02815.
Berez-Kroeker, Andrea L., Bradley McDonnell, Eve Koller & Lauren B.
Collister. 2022. The open handbook of linguistic data
management. MIT Press. https://doi.org/10.7551/mitpress/12200.001.0001.
Bochynska, Agata, Liam Keeble, Caitlin Halfacre, Joseph V. Casillas,
Irys-Amélie Champagne, Kaidi Chen, Melanie Röthlisberger, Erin M.
Buchanan & Timo B. Roettger. 2023. Reproducible research practices
and transparency across linguistics. Glossa Psycholinguistics
2(1). https://doi.org/10.5070/G6011239.
Breheny, Patrick & Woodrow Burchett. 2017. Visualization of
Regression Models Using visreg. The R Journal 9(2). 56. https://doi.org/10.32614/RJ-2017-046.
Bryan, Jennifer. 2018. Let’s git started | happy git and GitHub for
the useR. Open Education Resource. https://happygitwithr.com/.
Bryan, Jenny. 2017. Project-oriented workflow. Tidyverse.org.
https://www.tidyverse.org/blog/2017/12/workflow-vs-script/.
Busterud, Guro, Anne Dahl, Dave Kush & Kjersti Faldet Listhaug.
2023. Verb placement in L3 french and L3 german: The role of
language-internal factors in determining cross-linguistic influence from
prior languages. Linguistic Approaches to Bilingualism. John
13(5). 693–716. https://doi.org/10.1075/lab.22058.bus.
Çetinkaya-Rundel, Mine & Johanna Hardin. 2021. Introduction to
modern statistics. Second. Leanpub. https://openintro-ims.netlify.app/.
Cleveland, William S. & Robert McGill. 1987. Graphical perception:
The visual decoding of quantitative information on graphical displays of
data. Journal of the Royal Statistical Society: Series A
(General) 150(3). 192–210. https://doi.org/10.2307/2981473.
Cohen, Jacob. 1988. Statistical power analysis for the behavioral
sciences. 2. ed., reprint. New York, NY: Psychology Press.
Dąbrowska, Ewa. 2019. Experience, aptitude, and individual differences
in linguistic attainment: A comparison of native and nonnative speakers.
Language Learning 69(S1). 72–100. https://doi.org/10.1111/lang.12323.
Dauber, Daniel. 2024. R for non-programmers: A guide for social
scientists. Open Education Resource. https://bookdown.org/daniel_dauber_io/r4np_book/.
Douglas, Alex, Deon Roos, Francesca Mancini & David Lusseau. 2024.
An introduction to R. https://intro2r.com/.
Ekman, Paul & Wallace V Friesen. 1978. Facial action coding system.
Environmental Psychology & Nonverbal Behavior.
Few, Stephen. Save the pies for dessert. August 2007. http://www.perceptualedge.com/articles/08-21-07.pdf.
Field, Andy P., Jeremy Miles & Zoë Field. 2012. Discovering
statistics using r. Sage.
Fox, John & Sanford Weisberg. 2019. An r companion to applied
regression. Third edition. SAGE.
Fricke, Lea, Patrick G Grosz & Tatjana Scheffler. 2024. Semantic
differences in visually similar face emojis. Language and
Cognition. Cambridge University Press 1–15. https://doi.org/10.1017/langcog.2024.12.
Fugate, Jennifer MB & Courtny L Franco. 2021. Implications for
emotion: Using anatomically based facial coding to compare emoji faces
across platforms. Frontiers in Psychology. Frontiers Media SA
12. 605928. https://doi.org/10.3389/fpsyg.2021.605928.
Garnier, Simon, Noam Ross, BoB Rudis, Antoine Filipovic-Pierucci, Tal
Galili, Timelyportfolio, Alan O’Callaghan, et al. 2023.
Sjmgarnier/viridis: CRAN release v0.6.3. Zenodo. https://doi.org/10.5281/ZENODO.4679423.
Gelman, Andrew. 2018. Ethics in statistical practice and communication:
Five recommendations. Significance 15(5). 40–43. https://doi.org/10.1111/j.1740-9713.2018.01193.x.
Gelman, Andrew. 2019. Embracing variation and accepting uncertainty:
Implications for science and metascience. https://www.youtube.com/watch?v=VQCcMP4A5Ks.
Gries, Stefan Th. & Nick C. Ellis. 2015. Statistical measures for
usage-based linguistics. Language Learning 65(S1). 228–255. https://doi.org/10.1111/lang.12119.
Gries, Stefan Thomas. 2021. Statistics for linguistics with r: A
practical introduction (De Gruyter Mouton Textbook). 3rd revised
edition. de Gruyter Mouton.
Grömping, Ulrike. 2006. Relative Importance for Linear Regression
inR: The
Packagerelaimpo. Journal of
Statistical Software 17(1). https://doi.org/10.18637/jss.v017.i01.
Grosz, Patrick Georg, Gabriel Greenberg, Christian De Leon & Elsi
Kaiser. 2023. A semantics of face emoji in discourse. Linguistics
and Philosophy. Springer 46(4). 905–957. https://doi.org/10.1007/s10988-022-09369-8.
Harrell, Frank E. 2015. Regression modeling strategies: With
applications to linear models, logistic and ordinal regression, and
survival analysis (Springer Series in Statistics). Springer
International Publishing. https://doi.org/10.1007/978-3-319-19425-7.
Horst, Allison & Julie Lowndes. 2020. Openscapes - tidy data for
efficiency, reproducibility, and collaboration. https://openscapes.org/blog/2020-10-12-tidy-data/.
Hvitfeldt, Emil. 2021. Paletteer: Comprehensive collection of color
palettes. https://github.com/EmilHvitfeldt/paletteer.
Kaufman, Allison B. & James C. Kaufman (eds.). 2018. The illusion of
causality: A cognitive bias underlying pseudoscience. In
Pseudoscience. The MIT Press. https://doi.org/10.7551/mitpress/10747.003.0007.
Lakens, Daniël. 2022. Improving your statistical inferences.
Zenodo. https://doi.org/10.5281/ZENODO.6409077.
Lausberg, Hedda & Han Sloetjes. 2009. Coding gestural behavior with
the NEUROGES-ELAN system. Behavior Research Methods 41(3).
841–849. https://doi.org/10.3758/BRM.41.3.841.
Le Foll, Elen. 2022. Textbook English: A
corpus-based analysis of the language of EFL textbooks used in secondary
schools in France, Germany and
Spain. Osnabrück University PhD thesis. https://doi.org/10.48693/278.
Lenth, Russell V. 2025. Emmeans: Estimated marginal means, aka
least-squares means. https://rvlenth.github.io/emmeans/.
Levshina, Natalia. 2015. How to do linguistics with r: Data
exploration and statistical analysis. John Benjamins.
Levshina, Natalia. 2022. Comparing Bayesian and Frequentist Models of
Language Variation: The Case of Help + (to-)Infinitive. In Ole Schützler
& Julia Schlüter (eds.), 224–258. 1st edn. Cambridge University
Press. https://doi.org/10.1017/9781108589314.009.
Lindeman, Richard Harold, Peter Francis Merenda & Ruth Z. Gold.
1980. Introduction to bivariate and multivariate analysis.
Scott, Foresman.
Lüdecke, Daniel. 2020. sjPlot: Data visualization for statistics in
social science. https://CRAN.R-project.org/package=sjPlot.
Lüdecke, Daniel, Mattan S. Ben-Shachar, Indrajeet Patil, Philip Waggoner
& Dominique Makowski. 2021. performance:
An r package for assessment, comparison and testing of statistical
models. Journal of Open Source Software 6(60). 3139. https://doi.org/10.21105/joss.03139.
Lüdecke, Daniel, Indrajeet Patil, Mattan S. Ben-Shachar, Brenton M.
Wiernik, Philip Waggoner & Dominique Makowski. 2021. See: An r
package for visualizing statistical models. Journal of Open Source
Software 6(64). 3393. https://doi.org/10.21105/joss.03393.
Maier, Emar. 2023. Emojis as pictures. Ergo 10. https://doi.org/10.3998/ergo.4641.
Matejka, Justin & George Fitzmaurice. 2017. Same stats, different
graphs: Generating datasets with varied appearance and identical
statistics through simulated annealing. In, 12901294. New York, NY, USA:
Association for Computing Machinery. https://doi.org/10.1145/3025453.3025912.
Matute, Helena, Fernando Blanco, Ion Yarritu, Marcos Díaz-Lago, Miguel
A. Vadillo & Itxaso Barberia. 2015. Illusions of causality: How they
bias our everyday thinking and how they could be reduced. Frontiers
in Psychology. Frontiers 6. https://doi.org/10.3389/fpsyg.2015.00888.
Mertzen, Daniela, Sol Lago & Shravan Vasishth. 2021. The benefits of
preregistration for hypothesis-driven bilingualism research.
Bilingualism: Language and Cognition 24(5). 807–812. https://doi.org/10.1017/S1366728921000031.
Mizumoto, Atsushi. 2023. Calculating the relative importance of multiple
regression predictor variables using dominance analysis and random
forests. Language Learning 73(1). 161–196. https://doi.org/10.1111/lang.12518.
Neuwirth, Erich. 2022. Package “RColorBrewer.”
ColorBrewer palettes 991. https://cran.r-project.org/web/packages/RColorBrewer/RColorBrewer.pdf.
Nicenboim, Bruno, Daniel Schad & Shravan Vasishth. 2026.
Introduction to Bayesian Data Analysis for cognitive science
(Chapman & Hall/CRC Statistics in the social and behavioral sciences
series). Boca Raton London New York: CRC Press, Taylor & Francis
Group. https://doi.org/10.1201/9780429342646.
Nimon, Kim F. 2012. Statistical assumptions of substantive analyses
across the general linear model: A mini-review. Frontiers in
Psychology 3. https://doi.org/10.3389/fpsyg.2012.00322.
Ou, Jianhong. 2021. colorBlindness: Safe color set for color
blindness. https://CRAN.R-project.org/package=colorBlindness.
Parsons, Sam, Flávio Azevedo, Mahmoud M. Elsherif, Samuel Guay, Owen N.
Shahim, Gisela H. Govaart, Emma Norris, et al. 2022. A community-sourced
glossary of open scholarship terms. Nature Human
Behaviour. Nature 6(3). 312–318. https://doi.org/10.1038/s41562-021-01269-4.
Pedersen, Thomas Lin. 2024. Patchwork: The composer of plots.
https://patchwork.data-imaginist.com.
Pedersen, Thomas Lin & Maxim Shemanarev. 2024. Ragg: Graphic
devices based on AGG. https://ragg.r-lib.org.
Pfadenhauer, Katrin & Evelyn Wiesinger (eds.). 2024. Romance
motion verbs in language change: Grammar, lexicon, discourse. De
Gruyter. https://doi.org/10.1515/9783111248141.
Pfeifer, Valeria A, Emma L Armstrong & Vicky Tzuyin Lai. 2022. Do
all facial emojis communicate emotion? The impact of facial emojis on
perceived sender emotion and text processing. Computers in Human
Behavior. Elsevier 126. 107016. https://doi.org/10.1016/j.chb.2021.107016.
Plonsky, Luke & Frederick L. Oswald. 2014. How big is
“big”? Interpreting effect sizes in L2 research.
Language Learning 64(4). 878–912. https://doi.org/10.1111/lang.12079.
Prat, Chantel S., Tara M. Madhyastha, Malayka J. Mottarella &
Chu-Hsuan Kuo. 2020. Relating natural language aptitude to individual
differences in learning programming languages. Scientific
Reports. Nature 10(1). 3817. https://doi.org/10.1038/s41598-020-60661-8.
R Core Team. 2024. R: A language and environment for statistical
computing. R Foundation for Statistical Computing. https://www.R-project.org/.
Roettger, Timo B. 2021. Preregistration in experimental linguistics:
Applications, challenges, and limitations. Linguistics. De
59(5). 1227–1249. https://doi.org/10.1515/ling-2019-0048.
Scheffler, Tatjana & Ivan Nenchev. 2024. Affective, semantic,
frequency, and descriptive norms for 107 face emojis. Behavior
Research Methods. Springer 1–22. https://doi.org/10.3758/s13428-024-02444-x.
Schimke, Sarah, Israel de la Fuente, Barbara Hemforth & Saveria
Colonna. 2018. First language influence on second language offline and
online ambiguous pronoun resolution. Language Learning 68(3).
744–779. https://doi.org/10.1111/lang.12293.
Schweinberger, Martin. 2022. Data management, version control, and
reproducibility. https://ladal.edu.au/repro.html.
Seibold, Heidi & Rabea Müller. BERD course: Make your research
reproducible. https://doi.org/10.17605/OSF.IO/RUPT7.
Silge, Julia. 2022. Janeaustenr: Jane Austen’s complete
novels. https://CRAN.R-project.org/package=janeaustenr.
Smith, Gary. 2018. Step away from stepwise. Journal of Big
Data. SpringerOpen 5(1). 1–12. https://doi.org/10.1186/s40537-018-0143-6.
Sonderegger, Morgan. 2023. Regression modeling for linguistic
data. The MIT Press.
Sóskuthy, Márton. Generalised additive mixed models for dynamic analysis
in linguistics: A practical introduction. https://doi.org/10.48550/arXiv.1703.05339.
Stefanowitsch, Anatol & Susanne Flach. 2017. The corpus-based
perspective on entrenchment. In Hans-Jörg Schmid (ed.), Entrenchment
and the psychology of language learning: How we reorganize and adapt
linguistic knowledge, 101–127. De Gruyter. https://doi.org/10.1037/15969-006.
Tabachnick, Barbara G. & Linda S. Fidell. 2014. Using
multivariate statistics (Always Learning). Pearson new
international edition, sixth edition. Pearson.
The Turing Way Community. 2022. The turing way: A handbook for
reproducible, ethical and collaborative research (1.0.2). Zenodo. https://doi.org/10.5281/zenodo.3233853.
Thompson, Bruce. 1995. Stepwise regression and stepwise discriminant
analysis need not apply here: A guidelines editorial. Educational
and Psychological Measurement. SAGE 55(4). 525–534. https://doi.org/10.1177/0013164495055004001.
Van Hulle, Sven & Renata Enghels. 2024a. The category of throw verbs
as productive source of the spanish inchoative construction. In Katrin
Pfadenhauer & Evelyn Wiesinger (eds.), Romance motion verbs in
language change, 213–240. De Gruyter. https://doi.org/10.1515/9783111248141-009.
Van Hulle, Sven & Renata Enghels. 2024b. TROLLing replication data
for: “The category of throw verbs as productive source of the
spanish inchoative construction. DataverseNO, V1.” https://doi.org/10.18710/TR2PWJ.
Vasishth, Shravan & Andrew Gelman. 2021. How to embrace variation
and accept uncertainty in linguistic and psycholinguistic data analysis.
Linguistics 59(5). 1311–1342. https://doi.org/10.1515/ling-2019-0051.
Wickham, Hadley. 2016. ggplot2: Elegant graphics for data
analysis. New York: Springer. https://ggplot2.tidyverse.org.
Wickham, Hadley, Mine Çetinkaya-Rundel & Garrett Grolemund. 2023.
R for data science: Import, tidy, transform, visualize, and model
data. 2nd edition. O’Reilly. https://r4ds.hadley.nz/.
Wickham, Hadley, Romain François & Lucy D’Agostino McGowan. 2024.
Emo: Easily insert ’emoji’. https://github.com/hadley/emo.
Wickham, Hadley, Davis Vaughan & Maximilian Girlich. Tidy messy
data. https://tidyr.tidyverse.org/.
Wilkinson, Leland. 2005. The Grammar of Graphics (Statistics
and Computing). New York: Springer. https://doi.org/10.1007/0-387-28695-0.
Williams, Matt N., Carlos Alberto Gómez Grajales & Dason Kurkiewicz.
2013. Assumptions of multiple regression: Correcting two misconceptions.
Practical Assessment, Research, and Evaluation 18(11).
Winter, Bodo. 2019. Statistics for linguists: An introduction using
R. Routledge. https://doi.org/10.4324/9781315165547.
Ziemann, Mark, Yotam Eren & Assam El-Osta. 2016. Gene name errors
are widespread in the scientific literature. Genome Biology
17(1). 177. https://doi.org/10.1186/s13059-016-1044-7.