References

Vaclav Brezina

References

Published online by Cambridge University Press: 14 September 2018

Vaclav Brezina

Show author details

Vaclav Brezina: Affiliation:
Lancaster University

Book contents

Get access

Type: Chapter
Information: Statistics in Corpus Linguistics
A Practical Guide
, pp. 285 - 293

DOI: https://doi.org/10.1017/9781316410899 [Opens in a new window]

Publisher: Cambridge University Press

Print publication year: 2018

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

Argamon, S., Koppel, M., Fine, J. & Shimoni, A. R. (2003). Gender, genre, and writing style in formal written texts. Text, 23(3), 321–46.Google Scholar

Argamon, S., Koppel, M., Pennebaker, J. W. & Schler, J. (2007). Mining the blogosphere: age, gender and the varieties of self-expression. First Monday, 12(9). http://firstmonday.org/issues/issue12_9/argamon/index.html Google Scholar

Arppe, A. (2008). Univariate, bivariate, and multivariate methods in corpus-based lexicography: a study of synonymy. Helsinki: University of Helsinki. Available at https://helda.helsinki.fi/bitstream/handle/10138/19274/univaria.pdf?sequence=2 (accessed 29/12/2015).Google Scholar

Azen, R. & Walker, C. M. (2011). Categorical data analysis for the behavioral and social sciences. London: Routledge.CrossRef Google Scholar

Baker, H., Brezina, V. & McEnery, T. (2017). Ireland in British parliamentary debates 1803–2005: plotting changes in discourse in a large volume of time-series corpus data. In Säily, T., Nurmi, A., Palander-Collin, M. & Auer, A. (eds.), Exploring future paths for historical sociolinguistics (Advances in Historical Sociolinguistics), pp. 83–107. Amsterdam: John Benjamins.CrossRef Google Scholar

Baker, P. (2009). The BE06 corpus of British English and recent language change. International Journal of Corpus Linguistics, 14(3), 312–37.CrossRef Google Scholar

Baker, P. (2011). Times may change, but we will always have money: diachronic variation in recent British English. Journal of English Linguistics, 39(1), 65–88.Google Scholar

Baker, P. (2014). Using corpora to analyze gender. London: Bloomsbury.Google Scholar

Baker, P. (2017) American and British English: divided by a common language. Cambridge University Press.CrossRef Google Scholar

Baker, P., Gabrielatos, C. & McEnery, T. (2013). Sketching Muslims: a corpus driven analysis of representations around the word ‘Muslim’ in the British press 1998–2009. Applied Linguistics, 34(3), 255–78.CrossRef Google Scholar

Baker, P., Gabrielatos, C., Khosravinik, M., Krzyżanowski, M., McEnery, T. & Wodak, R. (2008). A useful methodological synergy? Combining critical discourse analysis and corpus linguistics to examine discourses of refugees and asylum seekers in the UK press. Discourse & Society, 19(3), 273–306.CrossRef Google Scholar

Balakrishnan, N., Voinov, V. & Nikulin, M. S. (2013). Chi-squared goodness of fit tests with applications. Waltham, MA: Academic Press.Google Scholar

Barlow, M. (2013). Individual differences and usage-based grammar. International Journal of Corpus Linguistics, 18(4), 443–78.Google Scholar

Baroni, M., Bernardini, S., Ferraresi, A. & Zanchetta, E. (2009). The WaCky Wide Web: a collection of very large linguistically processed web-crawled corpora. Language Resources and Evaluation, 43 (3), 209–26.CrossRef Google Scholar

Baroni, M. & Ueyama, M. (2006). Building general-and special-purpose corpora by web crawling. In Proceedings of the 13th NIJL International Symposium, Language corpora: their compilation and application, pp. 31–40.Google Scholar

Benzécri, J. P. (1992). Correspondence analysis handbook. New York: Marcel Dekker.CrossRef Google Scholar

Bestgen, Y. (2014). Inadequacy of the chi-squared test to examine vocabulary differences between corpora. Literary and Linguistic Computing, 29(2), 164–70.CrossRef Google Scholar

Biber, D. (1988). Variation across speech and writing. Cambridge University Press.Google Scholar

Biber, D. & Conrad, S. (2009). Register, genre, and style. Cambridge University Press.Google Scholar

Biber, D., Johansson, S., Leech, G., Conrad, S. & Finegan, E. (1999). Longman grammar of spoken and written English. Harlow: Longman.Google Scholar

Biber, D. & Jones, K. (2009). Quantitative methods in corpus linguistics. In Lüdeling, A. & Kytö, M. (eds.), Corpus linguistics: an international handbook, vol. 2, pp. 1287–1304. Berlin: Walter de Gruyter.Google Scholar

Biber, D., Reppen, R., Schnur, E. & Ghanem, R. (2016). On the (non) utility of Juilland’s D to measure lexical dispersion in large corpora. International Journal of Corpus Linguistics, 21(4), 439–64.Google Scholar

Blythe, R. A. & Croft, W. (2012). S-curves and the mechanisms of propagation in language change. Language, 88(2), 269–304.CrossRef Google Scholar

Boneau, C. A. (1960). The effects of violations of assumptions underlying the t test. Psychological Bulletin, 57(1), 49.CrossRef Google Scholar

Borenstein, M. (2009). Effect sizes for continuous data. In Cooper, H., Hedges, L. & Valentine, J. (eds.), The handbook of research synthesis and meta-analysis, pp. 221–35. New York: Russell Sage Foundation.Google Scholar

Brezina, V. (2013). BNC64 Search & Compare. Available at: http://corpora.lancs.ac.uk/bnc64 (accessed 20/08/2016).Google Scholar

Brezina, V. (2014). Effect sizes in corpus linguistics: keywords, collocations and diachronic comparison. Presented at the ICAME 2014 conference, University of Nottingham.Google Scholar

Brezina, V. & Gablasova, D. (2015). Is there a core general vocabulary? Introducing the New General Service List. Applied Linguistics, 36(1), 1–22.CrossRef Google Scholar

Brezina, V., McEnery, T. & Baker, H. (in prep.) Usage fluctuation analysis: a new way of analysing shifts in historical discourse.Google Scholar

Brezina, V., McEnery, T. & Wattam, S. (2015). Collocations in context. International Journal of Corpus Linguistics, 20(2), 139–73.Google Scholar

Brezina, V. & Meyerhoff, M. (2014). Significant or random? A critical review of sociolinguistic generalisations based on large corpora. International Journal of Corpus Linguistics, 19(1), 1–28.Google Scholar

Brezina, V. & Timperley, M. (2017). How large is the BNC? A proposal for standardized tokenization and word counting. CL2017, Birmingham. Available at: www.birmingham.ac.uk/Documents/college-artslaw/corpus/conference-archives/2017/general/paper303.pdf (accessed 08/03/18).Google Scholar

Brezina, V., Timperley, M., Gablasova, D. & McEnery, T. (in prep.). #LancsBox: a new generation corpus tool for researchers, students and teachers.Google Scholar

Cabin, R. J. & Mitchell, R. J. (2000). To Bonferroni or not to Bonferroni: when and how are the questions. Bulletin of the Ecological Society of America, 81(3), 246–8.Google Scholar

Chernick, M. R. & LaBudde, R. A. (2014). An introduction to bootstrap methods with applications to R. Hoboken, NJ: John Wiley & Sons.Google Scholar

Chomsky, N. (2000). New horizons in the study of language and mind. Cambridge University Press.Google Scholar

Clausen, S. E. (1998). Applied correspondence analysis: an introduction. Thousand Oaks, CA: Sage.Google Scholar

Cleveland, W. S. (1994). The elements of graphing data. Summit, NJ: Hobart Press.Google Scholar

Cohen, J. (1988). Statistical power analysis for the behavioral sciences. Hillsdale, NJ: Erlbaum.Google Scholar

Cohen, M. P. (2000). Note on the odds ratio and the probability ratio. Journal of Educational and Behavioral Statistics, 25(2), 249–52.Google Scholar

Colley, A. & Todd, Z. (2002). Gender-linked differences in the style and content of e-mails to friends. Journal of Language and Social Psychology, 21(4), 380–92.Google Scholar

Conrad, S. & Biber, D. (2001). Multidimensional methodology and the dimensions of register variation in English. In Conrad, S. & Biber, D. (eds.), Variation in English: multidimensional studies, pp. 18–19. Harlow: Pearson Education.Google Scholar

Coupland, N. (2007). Style: language variation and identity. Cambridge University Press.CrossRef Google Scholar

Covington, M. A. & McFall, J. D. (2010). Cutting the Gordian knot: the moving-average type–token ratio (MATTR). Journal of Quantitative Linguistics, 17(2), 94–100.Google Scholar

Crystal, D. (2003). English as a global language. Cambridge University Press.Google Scholar

Cumming, G. (2012). Understanding the new statistics. New York: Routledge.Google Scholar

Cumming, G., Fidler, F. & Vaux, D. L. (2007). Error bars in experimental biology. Journal of Cell Biology, 177(1), 7–11.Google Scholar

Davies, H. T. O., Crombie, I. K. & Tavakoli, M. (1998). When can odds ratios mislead? British Medical Journal, 316(7136), 989–91.Google Scholar

Davies, M. & Gardner, D. (2010). A frequency dictionary of contemporary American English: word sketches, collocates and thematic lists. London: Routledge.Google Scholar

de Winter, J. C. (2013). Using the Student’s t-test with extremely small sample sizes. Practical Assessment, Research & Evaluation, 18(10), 1–12.Google Scholar

Diggle, P. J. & Chetwynd, A. G. (2011). Statistics and scientific method: an introduction for students and researchers. Oxford University Press.Google Scholar

Divjak, D. & Gries, S. Th. (2006). Ways of trying in Russian: clustering behavioral profiles. Corpus Linguistics and Linguistic Theory, 2(1), 23–60.Google Scholar

Dodge, Y. (2008). The concise encyclopedia of statistics. New York: Springer.Google Scholar

Edgell, S. E. & Noon, S. M. (1984). Effect of violation of normality on the t test of the correlation coefficient. Psychological Bulletin, 95(3), 576.Google Scholar

Efron, B. (1979). Computers and the theory of statistics: thinking the unthinkable. SIAM Review, 21(4), 460–80.Google Scholar

Efron, B. & Tibshirani, R. J. (1994). An introduction to the bootstrap. Boca Raton, FL: CRC Press.Google Scholar

Erceg-Hurn, D. M. & Mirosevich, V. M. (2008). Modern robust statistical methods: an easy way to maximize the accuracy and power of your research. American Psychologist, 63(7), 591.Google Scholar

Everitt, B. S., Landau, S., Leese, M. & Stahl, D. (2011). Cluster analysis. New York: John Wiley & Sons.CrossRef Google Scholar

Evert, S. (2008). Corpora and collocations. In Lüdeling, A. & Kytö, M. (eds.), Corpus linguistics: an international handbook, vol. 1, pp. 223–33. Berlin: Walter de Gruyter.Google Scholar

Field, A., Miles, J. & Field, Z. (2012). Discovering statistics using R. London: Sage.Google Scholar

Firth, J. (1957). Papers in linguistics. Oxford University Press.Google Scholar

Francis, W. N. & Kučera, H. (1979). Brown Corpus manual: manual of information to accompany a standard corpus of present-day edited American English for use with digital computers. Brown University, Providence, RI. Available at http://clu.uni.no/icame/brown/bcm.html Google Scholar

Friendly, M. (2002). A brief history of the mosaic display. Journal of Computational and Graphical Statistics, 11(1), 89–107.Google Scholar

Friginal, E. & Hardy, J. (2014). Conducting multi-dimensional analysis using SPSS. In Sardinha, T. B. & Pinto, M. V. (eds.), Multi-dimensional analysis, 25 years on: a tribute to Douglas Biber, pp. 297–316. Amsterdam: John Benjamins.Google Scholar

Fritz, C. O., Morris, P. E. & Richler, J. J. (2012). Effect size estimates: current use, calculations, and interpretation. Journal of Experimental Psychology: General, 141(1), 2–18.CrossRef Google Scholar PubMed

Gablasova, D., Brezina, V. & McEnery, A. M. (2017a). Exploring learner language through corpora: comparing and interpreting corpus frequency information. Language Learning, 67(S1), 130–54.Google Scholar

Gablasova, D., Brezina, V. & McEnery, A. M. (2017b). Collocations in corpus-based language learning research: identifying, comparing and interpreting the evidence. Language Learning, 67(S1), 155–79.Google Scholar

Gablasova, D., Brezina, V., McEnery, T. & Boyd, E. (2017). Epistemic stance in spoken L2 English: the effect of task and speaker style. Applied Linguistics, 38(5), 613–37.Google Scholar

Gabrielatos, C. & Marchi, A. (2012) Keyness: appropriate metrics and practical issues. Presented at CADS International Conference 2012, Corpus-assisted Discourse Studies: More than the sum of Discourse Analysis and computing? University of Bologna, Italy.Google Scholar

Glass, G. V. (1965) A ranking variable analogue of biserial correlation: implications for short-cut item analysis. Journal of Educational Measurement, 2(1), 91–5.Google Scholar

Greenacre, M. (2007). Correspondence analysis in practice. Boca Raton: Chapman & Hall/CRC.Google Scholar

Gries, S. Th. (2008). Dispersions and adjusted frequencies in corpora. International Journal of Corpus Linguistics, 13(4), 403–37.Google Scholar

Gries, S. Th. (2010). Dispersions and adjusted frequencies in corpora: further explorations. In Gries, S. Th., Wulff, S. & Davies, M., Corpus linguistic applications: current studies, pp. 197–212. Amsterdam: Rodopi.Google Scholar

Gries, S. Th. (2013a). Statistics for linguistics with R: a practical introduction. Berlin: Walter de Gruyter.Google Scholar

Gries, S. Th. (2013b). 50-something years of work on collocations: what is or should be next … International Journal of Corpus Linguistics, 18(1), 137–66.Google Scholar

Gries, S. Th. & Hilpert, M. (2008). The identification of stages in diachronic data: variability-based neighbour clustering. Corpora, 3(1), 59–81.Google Scholar

Gries, S. Th. & Hilpert, M. (2010). Modeling diachronic change in the third person singular: a multifactorial, verb- and author-specific exploratory approach. English Language and Linguistics, 14(03), 293–320.Google Scholar

Gries, S. Th., Newman, J., Shaoul, C. & Dilts, P. (2009). N-grams and the clustering of genres. Presented at workshop on Corpus, Colligation, Register Variation at the 31st Annual Meeting of the Deutsche Gesellschaft für Sprachwissenschaft, March.Google Scholar

Grieve-Smith, A. (2007). The envelope of variation in multidimensional register and genre analyses. In Fitzpatrick, E. (ed.), Corpus linguistics beyond the word: corpus research from phrase to discourse, pp. 21–42. Amsterdam: Rodopi.Google Scholar

Gwet, K. (2002). Inter-rater reliability: dependency on trait prevalence and marginal homogeneity. Statistical Methods for Inter-Rater Reliability Assessment Series, 2, 1–9.Google Scholar

Hand, D. J. (2010). Evaluating diagnostic tests: the area under the ROC curve and the balance of errors. Statistics in Medicine, 29(14), 1502–10.Google Scholar

Hardie, A. (2014) Log ratio – an informal introduction. http://cass.lancs.ac.uk/?p=1133 Google Scholar

Harrington, J., Palethorpe, S. & Watson, C. I. (2000). Does the Queen speak the Queen’s English? Nature, 408(6815), 927–8.Google Scholar

Hayton, J. C., Allen, D. G. & Scarpello, V. (2004). Factor retention decisions in exploratory factor analysis: a tutorial on parallel analysis. Organizational Research Methods, 7(2), 191–205.Google Scholar

Healey, A. diPaolo (ed.) (2004). The Complete Corpus of Old English in Electronic Form. Dictionary of Old English Project. Centre for Medieval Studies, University of Toronto.Google Scholar

Hill, T., Lewicki, P. & Lewicki, P. (2006). Statistics: methods and applications: a comprehensive reference for science, industry, and data mining. Tulsa, OK: StatSoft.Google Scholar

Hilpert, M. (2011). Dynamic visualizations of language change: motion charts on the basis of bivariate and multivariate data from diachronic corpora. International Journal of Corpus Linguistics, 16(4), 435–61.CrossRef Google Scholar

Hilpert, M. & Gries, S. Th. (2009). Assessing frequency changes in multistage diachronic corpora: applications for historical corpus linguistics and the study of language acquisition. Literary and Linguistic Computing, 24(4), 385–401.Google Scholar

Hosmer, D. W., Lemeshow, S. & Sturdivant, R. X. (2013). Applied logistic regression, 3rd edn. Hoboken, NJ: John Wiley & Sons.CrossRef Google Scholar

Hudson, T. (2015). Presenting quantitative data visually. In Plonsky, L. (ed.), Advancing quantitative methods in second language research, pp. 78–105. London: Routledge.Google Scholar

Ito, R. & Tagliamonte, S. (2003). Well weird, right dodgy, very strange, really cool: layering and recycling in English intensifiers. Language in Society, 32(02), 257–79.Google Scholar

Jakubíček, M., Kilgarriff, A., Kovář, V., Rychlý, P. & Suchomel, V. (2013). The TenTen corpus family. Proceedings of the International Conference on Corpus Linguistics 2013, pp. 125–7. Lancaster University.Google Scholar

Jarvis, S. (2013). Capturing the diversity in lexical diversity. Language Learning, 63(s1), 87–106.Google Scholar

Johnson, D. E. (2009). Getting off the GoldVarb standard: introducing Rbrul for mixed‐effects variable rule analysis. Language and Linguistics Compass, 3(1), 359–83.Google Scholar

Juilland, A. G., Brodin, D. R. & Davidovitch, C. (1970). Frequency dictionary of French words. The Hague: Mouton.Google Scholar

Juilland, A. G. & Chang-Rodríguez, E. (1964). Frequency dictionary of Spanish words. The Hague: Mouton.Google Scholar

Kaiser, H. F. (1960). The application of electronic computers to factor analysis. Educational and Psychological Measurement, 20(1), 141–51.Google Scholar

Kepes, S., Banks, G. C. & Oh, I. S. (2014). Avoiding bias in publication bias research: the value of ‘null’ findings. Journal of Business and Psychology, 29(2), 183–203.Google Scholar

Kerby, D. S. (2014). The simple difference formula: an approach to teaching nonparametric correlation. Innovative Teaching, 3, 1–9.Google Scholar

Kilgarriff, A. (1997). Putting frequencies in the dictionary. International Journal of Lexicography, 10(2), 135–55.CrossRef Google Scholar

Kilgarriff, A. (2005). Language is never, ever, ever, random. Corpus Linguistics and Linguistic Theory, 1(2), 263–76.Google Scholar

Kilgarriff, A. (2009). Simple maths for keywords. In Proceedings of the Corpus Linguistics Conference, Liverpool, July.Google Scholar

Kilgarriff, A. (2012). Getting to know your corpus. In Proceedings of the 15th International Conference on Text, Speech and Dialogue, pp. 3–15. Berlin: Springer.Google Scholar

Kirk, R. E. (1996). Practical significance: a concept whose time has come. Educational and Psychological Measurement, 56(5), 746–59.Google Scholar

Kirk, R. E. (2005). Effect size measures. Wiley StatsRef: Statistics Reference Online. http://dx.doi.org/10.1002/9781118445112.stat06242.pub2 Google Scholar

Krippendorff, K. (2012 [1980]). Content analysis: an introduction to its methodology. London: Sage.Google Scholar

Kruskal, W. H. & Wallis, W. A. (1952). Use of ranks in one-criterion variance analysis. Journal of the American Statistical Association, 47(260), 583–621.Google Scholar

Kučera, H. & Francis, W. N. (1967). Computational analysis of Present-Day American English. Providence, RI: Brown University Press.Google Scholar

Labov, W. (1966). The social stratification of English in New York City. Washington, DC: Center for Applied Linguistics.Google Scholar

Labov, W. (1972). Sociolinguistic patterns. Philadelphia: University of Pennsylvania Press.Google Scholar

Labov, W. (2010). Principles of linguistic change, vol. 3: Cognitive and cultural factors. Oxford: Wiley-Blackwell.Google Scholar

Lakoff, G. & Johnson, M. (1980). Metaphors we live by. University of Chicago Press.Google Scholar

Lakoff, R. T (1975). Language and woman’s place. New York: Harper & Row.Google Scholar

Lavandera, B. R. (1978). Where does the sociolinguistic variable stop? Language in Society, 7(02), 171–82.CrossRef Google Scholar

Ledesma, R. D. & Valero-Mora, P. (2007). Determining the number of factors to retain in EFA: an easy-to-use computer program for carrying out parallel analysis. Practical Assessment, Research & Evaluation, 12(2), 1–11.Google Scholar

Leech, G. (1992). Corpora and theories of linguistic performance. In Svartvik, J. (ed.), Directions in corpus linguistics, pp. 105–22. Berlin: Mouton de Gruyter.Google Scholar

Leech, G. (2003). Modals on the move: the English modal auxiliaries 1961–1992. In Facchinetti, R., Palmer, F. R. & Krug, M. (eds.), Modality in contemporary English, 223–40. Berlin: Mouton de Gruyter.Google Scholar

Leech, G. (2011). The modals ARE declining. International Journal of Corpus Linguistics, 16(4), 547–64.Google Scholar

Leech, G., Garside, R. & Bryant, M. (1994). CLAWS4: the tagging of the British National Corpus. In Proceedings of the 15th Conference on Computational Linguistics, Kyoto, vol. 1, pp. 622–8.Google Scholar

Leech, G., Rayson, P. & Wilson, A. (2001). Word frequencies in written and spoken English: based on the British National Corpus. London: Routledge.Google Scholar

Leek, J. T. & Peng, R. D. (2015). Statistics: p values are just the tip of the iceberg. Nature, 520(7549), 612.Google Scholar

Lijffijt, J., Nevalainen, T., Säily, T., Papapetrou, P., Puolamäki, K. & Mannila, H. (2016). Significance testing of word frequencies in corpora. Literary and Linguistic Computing, 31(2), 374–97.Google Scholar

Lijffijt, J., Säily, T. & Nevalainen, T. (2012). CEECing the baseline: lexical stability and significant change in a historical corpus. In Studies in Variation, Contacts and Change in English, vol. 10. Helsinki: Research Unit for Variation, Contacts and Change in English (VARIENG).Google Scholar

Love, R., Dembry, C., Hardie, A., Brezina, V. & McEnery, T. (2017). The Spoken BNC2014: designing and building a spoken corpus of everyday conversations. International Journal of Corpus Linguistics, 22 (3).Google Scholar

Lumley, T., Diehr, P., Emerson, S. & Chen, L. (2002). The importance of the normality assumption in large public health data sets. Annual Review of Public Health, 23(1), 151–69.Google Scholar

Malvern, D. & Richards, B. (2002). Investigating accommodation in language proficiency interviews using a new measure of lexical diversity. Language Testing, 19, 85–104.Google Scholar

Mann, H. B. & Whitney, D. R. (1947) On a test of whether one of two random variables is stochastically larger than the other. Annals of Mathematical Statistics, 18(1), 50–60.Google Scholar

Manning, C. D. (2011). Part-of-speech tagging from 97% to 100%: is it time for some linguistics? In Gelbukh, A. F. (ed.), International Conference on Intelligent Text Processing and Computational Linguistics, pp. 171–89. Berlin: Springer.Google Scholar

McEnery, T. (2006). Swearing in English: bad language, purity and power from 1586 to the present. Abingdon: Routledge.Google Scholar

McEnery, T. & Baker, H. (2017). Corpus linguistics and 17th-century prostitution: computational linguistics and history. London: Bloomsbury.Google Scholar

McEnery, T. & Hardie, A. (2011). Corpus linguistics: method, theory and practice. Cambridge University Press.Google Scholar

Mehl, M. R., Vazire, S., Ramírez-Esparza, N., Slatcher, R. B. & Pennebaker, J. W. (2007). Are women really more talkative than men? Science, 317(5834), 82.CrossRef Google Scholar PubMed

Michel, J. B., Shen, Y. K., Aiden, A. P., Veres, A., Gray, M. K., Pickett, J. P. … & Pinker, S. (2011). Quantitative analysis of culture using millions of digitized books. Science, 331(6014), 176–82.Google Scholar

Microsoft (2010). Microsoft Word [software].Google Scholar

Millar, N. (2009). Modal verbs in TIME: frequency changes 1923–2006. International Journal of Corpus Linguistics, 14 (2), 191–220.Google Scholar

Nevalainen, T. (1999). Making the best use of ‘bad’ data: evidence for sociolinguistic variation in Early Modern English. Neuphilologische Mitteilungen, 499–533.Google Scholar

Nevalainen, T. & Raumolin-Brunberg, H. (2003). Historical sociolinguistics: language change in Tudor and Stuart England. London: Routledge.Google Scholar

Newman, M. L., Groom, C. J., Handelman, L. D. & Pennebaker, J. W. (2008). Gender differences in language use: an analysis of 14,000 text samples. Discourse Processes, 45(3), 211–36.Google Scholar

Nini, A. (2015) Multidimensional Analysis Tagger (v. 1.3) – manual. Available at: https://sites.google.com/site/multidimensionaltagger/(accessed 26/08/15).Google Scholar

Osborne, J. W. (2012). Best practices in data cleaning: a complete guide to everything you need to do before and after collecting your data. Thousand Oaks, CA: Sage.Google Scholar

Osborne, J. W. (2015). Best practices in logistic regression. Thousand Oaks, CA: Sage.Google Scholar

Pearson, K. (1920). Notes on the history of correlation. Biometrika, 13(1), 25–45.CrossRef Google Scholar

Pechenick, E. A., Danforth, C. M. & Dodds, P. S. (2015). Characterizing the Google Books corpus: strong limits to inferences of socio-cultural and linguistic evolution. PloS One, 10(10), e0137041.Google Scholar

Phillips, M. (1985). Aspects of text structure: an investigation of the lexical organisation of text. Amsterdam: North-Holland.Google Scholar

Popper, K. (2005 [1935]). The logic of scientific discovery. London: Routledge.Google Scholar

Rayson, P. (2008). From key words to key semantic domains. International Journal of Corpus Linguistics 13(4), 519–49.Google Scholar

Rayson, P., Berridge, D. & Francis, B. (2004). Extending the Cochran rule for the comparison of word frequencies between corpora. Proceedings from 7th International Conference on Statistical Analysis of Textual Data (JADT 2004), pp. 926–36.Google Scholar

Rayson, P., Leech, G. N. & Hodges, M. (1997). Social differentiation in the use of English vocabulary: some analyses of the conversational component of the British National Corpus. International Journal of Corpus Linguistics, 2(1), 133–52.Google Scholar

Richardson, J. T. (2011). Eta squared and partial eta squared as measures of effect size in educational research. Educational Research Review, 6(2), 135–47.Google Scholar

Rosenthal, R. (1995). Writing meta-analytic reviews. Psychological Bulletin, 118(2), 183.Google Scholar

Rothstein, H. & Hopewell, S. (2009). Grey literature. In Cooper, H., Hedges, L. & Valentine, J. (eds.), The handbook of research synthesis and meta-analysis, pp. 103–26. New York: Russell Sage Foundation.Google Scholar

Savický, P. & Hlaváčová, J. (2002). Measures of word commonness. Journal of Quantitative Linguistics, 9(3), 215–31.Google Scholar

Schmider, E., Ziegler, M., Danay, E., Beyer, L. & Bühner, M.. (2010). ‘Is it really robust?’ Reinvestigating the robustness of ANOVA against violations of the normal distribution assumption. Methodology: European Journal of Research Methods for the Behavioral and Social Sciences, 6(4), 147–51.Google Scholar

Schmidt, F. L. & Hunter, J. E. (2015). Methods of meta-analysis: correcting error and bias in research findings. Thousand Oaks, CA: Sage Publications.Google Scholar

Scott, M. (1997). PC analysis of key words – and key key words. System, 25(2), 233–45.Google Scholar

Scott, M. (2004). WordSmith tools version 4. Oxford University Press.Google Scholar

Shadish, W. R. & Haddock, C. K. (2009). Combining estimates of effect size. In Cooper, H., Hedges, L. & Valentine, J. (eds.), The handbook of research synthesis and meta-analysis, pp. 257–78. New York: Russell Sage Foundation.Google Scholar

Shaffer, J. P. (1995). Multiple hypothesis testing. Annual Review of Psychology, 46, 561–84.Google Scholar

Shakespeare, W. (1992 ). The Poems: Venus and Adonis, The Rape of Lucrece, The Phoenix and the Turtle, The Passionate Pilgrim. Cambridge University Press.Google Scholar

Sheskin, D. J. (2007). Handbook of parametric and nonparametric statistical procedures. Boca Raton, FL: Chapman & Hall/CRC.Google Scholar

Siegel, S. (1956). Nonparametric statistics for the behavioral sciences. New York: McGraw-Hill.Google Scholar

Sprent, P. (2011). Fisher Exact Test. In International encyclopedia of statistical science, pp. 524–5. Berlin: Springer.Google Scholar

Stubbs, M. (2001). Words and phrases: corpus studies of lexical semantics. Oxford: Blackwell.Google Scholar

Tagliamonte, S. A. (2006). Analysing sociolinguistic variation. Cambridge University Press.Google Scholar

Theus, M. & Urbanek, S. (2008). Interactive graphics for data analysis: principles and examples. Boca Raton, FL: CRC Press.Google Scholar

Toothaker, L. E. (1993). Multiple comparison procedures. Sage University Paper Series on Quantitative Applications in the Social Sciences, 07–089. Newbury Park, CA: Sage.Google Scholar

Trafimow, D. & Marks, M. (2015) Editorial. Basic and Applied Social Psychology, 37(1), 1–2.Google Scholar

Tufte, E. (1997). Visual explanations. Cheshire, CT: Graphics Press.Google Scholar

Tufte, E. (2001). Visual display of quantitative information. Cheshire, CT: Graphics Press.Google Scholar

Tufte, E. (2006). Beautiful evidence. Cheshire, CT: Graphics Press.Google Scholar

Tweedie, F. & Baayen, R. H. (1998). How variable may a constant be? Measures of lexical richness in perspective. Computers and the Humanities, 32, 323–52.Google Scholar

Upton, G. J. (1992). Fisher’s exact test. Journal of the Royal Statistical Society. Series A (Statistics in Society), 395–402.Google Scholar

Valentine, J. (2009). Judging the quality of primary research. In Cooper, H., Hedges, L. & Valentine, J. (eds.), The handbook of research synthesis and meta-analysis, pp. 129–46. New York: Russell Sage Foundation.Google Scholar

Verma, J. P. (2016). Repeated measures design for empirical researchers. Hoboken, NJ: John Wiley & Sons.Google Scholar

Vine, B. (1999). Guide to the New Zealand component of the International Corpus of English (ICE-NZ). School of Linguistics and Applied Language Studies, Victoria University of Wellington.Google Scholar

Vine, E. W. (2011). High frequency multifunctional words: accuracy of word-class tagging. Te Reo, 54, 71.Google Scholar

Williams, G. (1998). Collocational networks: interlocking patterns of lexis in a corpus of plant biology research articles. International Journal of Corpus Linguistics, 3( 1), 151–71.Google Scholar

Wilson, D. B. (2009). Systematic coding. In Cooper, H., Hedges, L. & Valentine, J. (eds.), The handbook of research synthesis and meta-analysis, pp. 159–76. New York: Russell Sage Foundation.Google Scholar

Xiao, R. (2009). Multidimensional analysis and the study of world Englishes. World Englishes, 28(4), 421–50.Google Scholar

Book contents

References

Access options

References

Save book to Kindle

Save book to Dropbox

Save book to Google Drive