Journal articles using WordSmith Tools

Email mike at lexically dot net to add yours...




Aiello, Kenneth, and Michael Simeone, 2019. "Triangulation of Data Using Textual Data." Isis, Volume 110, No. 3.

Ahn, Jihyun, 2020. A Corpus-Based Classification of Lexical Bundles in Business Email. Journal of Language Sciences 27(4), 147-166.

Ahn, Jihyun, 2021. A Study of the Fixed Expressions of Politeness in Korean Email Corpus. Journal of Korean Language Education 32(4), 139-162.

Allan, Rachel, 2009. Can a graded reader corpus provide ‘authentic’ input? ELT Journal, Vol. 63, No. 1, 23-32.

Andrusenko, Anastasiia, 2015. A Contrastive Analysis of Spanish-Arabic Metadiscourse Use in Persuasive Academic Writing, Procedia - Social & Behavioral Sciences, Vol 178, pp 9-14.

Ayers, D. F., 2013. From governance to competitiveness: A diachronic analysis of the community college discourse of local. Critical Discourse Studies, 10(1), 99-116.

Ayers, D. F. and Palmadessa, Alison, 2015. The Community College and a Rising Global Imaginary 1950-2013, The Journal of Higher Education, Vol 86, Number 6, Nov/Dec 2015, pp. 864-892.

Azar, Ali S & Hashim, Azirah, 2019. The Impact of Attitude Markers on Enhancing Evaluation in the Review Article Genre. GEMA Online Journal of Language Studies, Vol 19, No. 1.

Baker, Paul & Luke Collins, 2023. Creating and analysing a multimodal corpus of news texts with Google Cloud Vision's automatic image tagger. Applied Corpus Linguistics, Vol 3, No. 1.

Baker, P., Gabrielatos, C., Khosravinik, M., Krzyzanowski, M., McEnery, T. & Wodak, R., 2008. A useful methodological synergy? Combining critical discourse analysis and corpus linguistics to examine discourses of refugees and asylum seekers in the UK press. Discourse & Society 19(3), 273-305.

Berber Sardinha, Tony, 1999. Using Key Words in Text Analysis: practical aspects. DIRECT Papers 42, LAEL, Catholic University of São Paulo.

Biel, Łucja, 2014. "The textual fit of translated EU law: a corpus-based study of deontic modality", The Translator 20(2), doi 10.1080/13556509.2014.909675

Bondi M. (1996), "Language Variations across Genres. Quantifiers and worlds of reference in (and around) economics textbooks", Asp (Anglais de Specialité), Università di Bordeaux 2, 1996, 24-47.

Bondi M. (1997), "The Rise of Abstracts. Development of the Genre in the Discourse of Economics", Textus, X, E.Barisone and Hughes eds., 395-418.

Bondi M. (2002), “Attitude and episteme in academic discourse: adverbials of stance across genres and moves”, Textus, P.Evangelisti / E.Ventola (eds.) 249-264.

Bondi, M. & D.Mazzi (2008), “Per concludere veramente: Signalling conclusions in historical research articles in Italian and in English”, La Torre di Babele, 2009, 119-131.

Bondi M. & G.Diani (2009): Linguistica dei corpora e EAP: lingua, pratiche comunicative e contesto d’uso, RILA, Numero monografico a cura di Gianfranco Porcelli (La Linguistica Applicata oggi), XLI(1-2), 2009, 251-270.

Bondi, M. (2010) Metadiscursive practices in introductions. Phraseology and semantic sequences across genres, NJES, 9.2., 99-123.

Bondi, M. & Radighieri S. (2010), I corpora come risorsa per l’apprendimento: integrare corpus, testo e discorso specialistico, RILA, XLII(1), 59-80.

Bondi, M. (2010) “Arguing in economics and business discourse: phraseological tools in research articles”, Bulletin Suisse de Linguistique Appliquée, Centre de linguistique appliquée, Université de Neuchâtel, N° spécial 2010/2, 219-234. ISSN 1023-2044.

Bondi, M. & G. Diani (2011) “Conveying deontic values in English and Italian contracts: a cross-cultural analysis”. In ESP across Cultures, special issue on “Legal English across cultures”, C. Williams & M. Gotti (eds.), 7-24. ISSN 1972-8247.

Bondi, M. & C. Lazzeretti (2012) “A hypnotic viewing experience: Promotional features in the language of exhibition press announcements”, Pragmatics: Quarterly Publication of the International Pragmatics Association, 22:3, 567-589.

Bondi, Marina & Giuliana Diani (2015) I am wild about cabbage: evaluative ‘semantic sequences’ and cross-linguistic (dis)continuities, Nordic Journal of English Studies (NJES), 14(1), 116-151.

Bondi, Marina (2016) The future in reports: prediction, commitment and legitimization in CSR, Pragmatics and Society, 7(1) (special issue on The Pragmatics of Professional Discourse, edited by W. Cheng), 57-81. (ISSN 1878-9714; DOI 10.1075/ps.7.1.3.bon)

Bondi, Marina (2017) What came to be called: evaluative what and authorial voice in the discourse of history, Text & Talk, 37(1), 25-46. (DOI 10.1515/text-2016-0035)

Borlongan, Ariane Macalinga. 2008. Tag Questions in Philippine English. Philippine Journal of Linguistics 39, 109-134.

Bowker, Lynne, 2000. Towards a Methodology for Exploiting Specialized Target Language Corpora as Translation Resources. International Journal of Corpus Linguistics, Vol 5, No. 1. pp 17-52.

Breeze, Ruth, 2007. How Personal is this Text? Researching Writer and Reader Presence in Student Writing Using Wordsmith Tools. CORELL: Computer Resources for Language Learning 1, 14-21.

Breeze, Ruth, 2011. Disciplinary values in legal discourse: a corpus study. IBÉRICA, Vol. 21, Primavera/Spring, 93-116.

Breeze, Ruth, 2013. Lexical bundles across four legal genres. International Journal of Corpus Linguistics, vol. 18, No. 2, 229-253.

Breeze, Ruth, 2014. Perspectives on North and South. The 2012 financial crisis in Spain seen through two major British newspapers. Discourse and Communication 8 (3), pp. 241-259.

Breeze, Ruth, 2015. Teaching the vocabulary of legal documents: a corpus-driven approach. ESP Today 3(1), 134-155. e-ISSN 2334-9050

Bruce, Ian, 2008. Cognitive genre structures in Methods sections of research articles: A corpus study. Journal of English for Academic Purposes, Vol. 7, pp. 38-54.

Buckingham, Louisa, 2013. Mixed messages of solidarity in the Mediterranean: Turkey, the EU and the Spanish press. Discourse & Society, 24(2), 186-207.

Buckingham, Louisa, 2013. Light verb constructions in Latin American newspapers: Creative variants and coinages. Spanish in Context, 10(1), 114-135.

Buckingham, Louisa, 2011. ‘Poniendo de manifiesto’: Spanish light verb constructions in Latin American newspapers. Corpora, 6(2), 201-226.

Buckingham, Louisa, 2008. Spanish support verb constructions from a learner perspective. Estudios de lingüística inglesa aplicada. Universidad de Sevilla, 8, 151-179.

Butler, Christopher S., 2008. Three English adverbs and their formal equivalents in Romance languages: a corpus-based collocational study. Languages in Contrast 8(1), 107-124.

Butler, Christopher S., (in press). Formulaic sequences in functional and cognitive linguistics. Revista Canaria de Estudios Ingleses 57, 67-90.

Caiazzo, Luisa, 2009. 'National', 'international' and 'global' across British and Indian university websites. ESP Across Cultures, 9, 45-60.

Cesiri, Daniela & Colaci, L. A. 2011. Metaphors on the global crisis in economic discourse: a corpus-based comparison of The Economist, Der Spiegel and Il Sole 24 ORE, Rassegna Italiana di Linguistica Applicata, vol. 1-2/Gennaio-Agosto 2011, Monographic Volume "Clavier 09 Corpus Linguistics and Language Variation", ed. by Bondi, M., Cacchiani, S., Palumbo, G., pp. 201-223.

Changsoo Lee, 2012. Using lexical bundle analysis as discovery tool for corpus-based translation research, Perspectives: Studies in Translatology, DOI:10.1080/0907676X.2012.657655

Clark, Caroline, 2010. Evidence of evidentiality in the quality press 1993 and 2005. Corpora, Vol. 2, No. 10, 139-160.

Conoscenti, Michelangelo & Caffarena, Anna, 2017. The Tipping Point. Donald Trump and the Discourse on World (Dis)Order in the Press and Expert Media, Comunicazione Politica, vol. XVIII, n. 3, 2017, 385-406.

Crawford Camiciottoli, B., 2010. Discourse connectives in genres of financial disclosure: earnings calls vs. earnings releases. Journal of Pragmatics, 42, 3, 650-653.

Crawford Camiciottoli, B., 2009. ‘Just wondering if you could comment on that’: indirect requests for information in corporate earnings calls. Text & Talk, 29, 6, 661–681.

Crawford Camiciottoli, B., 2009. The teaching style of the business educator: A corpus-based investigation of the relationship between language and identity. International Journal of Business & Economics, 9, 117-136.

Crawford Camiciottoli, B., 2008. Interaction in academic lectures vs. written text materials: the case of questions, Journal of Pragmatics, 40, 7, 1216-1231.

Crawford Camiciottoli, B., 2005. Adjusting a business lecture for an international audience: a case study. English for Specific Purposes, 24, 2, 183-199.

Crawford Camiciottoli, B., 2004. Interactive discourse structuring in L2 guest lectures: some insights from a comparative corpus-based study, Journal of English for Academic Purposes, 3, 1, 39-54

Curado Fuentes, Alejandro, 2001. "Lexical Behaviour in Academic and Technical Corpora: Implications for ESP Development", Language Learning & Technology, Vol. 5, pp. 106-129.

Dávila-Montes, José, 2019. Enthymeme, metonymy and method: comparing genre-bound rhetorical deviations between languages through corpus studies, Language and Intercultural Communication, 19:5, 407-420, DOI: 10.1080/14708477.2018.1545779

Degani, Marta & Alexander Onysko, 2010. Hybrid compounding in New Zealand English. World Englishes, 29 (2), 209-233.

Del Fante, Dario. 2018. Where Are They From? A Corpus-Assisted Study of the Geographical Representation of Migrants in the UK and Italian Quality Press in 2016 and 2017. I-LanD Journal: Identity, Language and Diversity, Number 1, 2018, pp. 28-52.

Demata, Massimiliano, Conoscenti, Michelangelo & Stavrakakis, Yannis, 2020. Riding the Populist Wave. Metaphors of Populism and Anti-Populism in the Daily Mail and the Guardian. Iperstoria, Issue 15, Spring/Summer 2020, pp. 8-35.

Duguid, Alison, 2010. Newspaper discourse informalisation: a diachronic comparison from keywords. Corpora, Vol. 2, No. 10, 109-138.

Duguid, Alison, 2010. Investigating anti and some reflections on Modern Diachronic Corpus-Assisted Discourse Studies (MD-CADS). Corpora, Vol. 2, No. 10, 191-220.

Ensslin, Astrid & Sally Johnson, 2006, Language in the news: investigating representations of 'Englishness' using WordSmith Tools. Corpora. Vol 1, No. 2, 153-185.

Flowerdew, Lynne, 2003, A combined corpus and systemic-functional analysis of the Problem-Solution pattern in a student and professional corpus of technical writing. TESOL Quarterly, vol. 37 (3), pp. 489-511.

Fonseca-Greber, Bonnie B., 2007. The emergence of emphatic 'ne' in Conversational Swiss French. Journal of French Language Studies, 17 (3), 249-275.

Fonseca-Greber, Bonnie B., 2018. A relative verb for Spoken French: The missing link. Lingua, 210-211 (2018), 1-29.

Fitzsimmons-Doolan, S., 2009. Is public discourse about language policy really public discourse about immigration? A corpus-based study. Language Policy, 8(4), 377-402.

Fromm, Guilherme, 2008. A construção e análise de corpora para alimentação de um banco de dados terminográfico: um exemplo. Revista Eletrônica de Lingüística, Ano 2, nº 1, 1º Semestre de 2008.

Forchini, Pierfranca and Amanda Murphy, 2008. N-grams in comparable specialized corpora: Perspectives on phraseology, translation, and pedagogy. International Journal of Corpus Linguistics 13:3, 351–367.

Gabrielatos, C. & Baker, P., 2008. Fleeing, sneaking, flooding: A corpus analysis of discursive constructions of refugees and asylum seekers in the UK Press 1996-2005. Journal of English Linguistics 36(1), 5-38.

Gabrielatos, C., 2007. Selecting query terms to build a specialised corpus from a restricted-access database. ICAME Journal 31, 5-43.

Gabrielatos, C. & Sarmento, S., 2006. Central modals in an aviation corpus: Frequency and distribution. Letras de Hoje 41(2), 215-240.

Gill, P. S., Swartz, T. B. and Treschow, M., 2007. 'A stylometric analysis of King Alfred’s literary works'. Journal of Applied Statistics, 34, 1251-1258.

Gilquin G. & Paquot M., 2008. Too chatty: Learner academic writing and register variation. English Text Construction 1(1):41-61.

Gledhill, Chris, 2000. ‘The Discourse Function of Collocation in Research Article Introductions’. English for Specific Purposes, Volume 19/2: 115-135. ISSN 0889-4906.

Goded Rambaud, Margarita, 2007. A descriptive algorithm for a wine tasting lexicon corpus. Ibersid: revista de sistemas de información y documentación = journal of information and documentation systems, ISSN 1888-0967, Nº. 1, 2007, pp. 313-321.

Goded Rambaud, Margarita, 2010. A descriptive algorithm for a wine tasting lexicon corpus. Journal of information and documentation systems. Scire: Vol 15, Nº 2, pp. 39-62.

Gómez González-Jover, A. & Vargas Sierra, Ch. 2004: Aspectos metodológicos para la elaboración de diccionarios especializados bilingües destinados al traductor, El español, lengua de traducción. II congreso internacional, Servicio de traducción de la Comisión Europea. Available online.

Gómez, A. & Vargas, Ch. 2002: Utilización de herramientas informáticas para la elaboración de diccionarios especializados bilingües, Interlinguistica, n. 13, pp. 269-289.

Gómez, A. & Vargas, Ch. 2002: Córpora comparables y paralelos para la detección de terminología bilingüe: su explotación y uso con herramientas informáticas, Actas del VIII Simposio Iberoamericano de Terminología, Cartagena, Colombia, published on CD-ROM, ISBN: 958-33-4022-7.

Gooberman-Hill R, French M, Dieppe P, Hawker G, 2009. Expressing Pain and Fatigue: A New Method of Analysis to Explore Differences in Osteoarthritis Experience. Arthritis & Rheumatism (Arthritis Care & Research), Vol. 61 (3), 353-360.

Graham, P. & Rooney, D., 2001. A sociolinguistic approach to applied epistemology: Examining technocratic values in global 'Knowledge' Policy. Journal of Social Epistemology, 15 (3) [for the 2001 special issue on the commercialisation of epistemology]: 155-169.

Graham, P. 2001. Space: Irrealis objects in technology policy and their role in the creation of a new political economy. Discourse & Society, 12 (6): 761-788.

Graham, P. 2002. Predication and propagation: A method for analysing evaluative meanings in technology policy. TEXT, 22 (2): 227-268.

Grundmann, Reiner, and Ramesh Krishnamurthy. 2010. The Discourse of Climate Change: a corpus-based approach. Critical Approaches to Discourse Analysis across Disciplines 4, no. 2.

Haddington, Pentti, Jarmo H. Jantunen & Jari Sivonen 2011. Language and affect: go-say and come-say constructions in Finnish. – SKY Journal of Linguistics 24. 75–117.

Hakam, Jamila, 2009. “The ‘Cartoons Controversy’: A Critical Discourse Analysis of English-language Arab newspaper discourse”, Discourse & Society 20 (1), pp. 33-57

Harvey K.J., Brown B, Crawford P, Macfarlane A, McPherson A. 2007. ‘Am I normal?’ Teenagers, sexual health and the internet. Social Science & Medicine. 65: 771-781.

Harvey, K., Churchill, D., Crawford, P., Brown, B., Mullany, L., Macfarlane, A., & McPherson, A. 2008. 'Health communication and adolescents: what do their emails tell us?' Family Practice, 25: 1-8.

Hathaway, Yulia, 2021. “They Made us into a Race. We Made Ourselves into a People”: A Corpus Study of Contemporary Black American Group Identity in the Non-Fictional Writings of Ta-Nehisi Coates. Corpus Pragmatics (2021).

Hou, Haiyan, Hildrun Kretschmer & Zeyuan Liu, 2008. The structure of scientific collaboration networks in Scientometrics. Scientometrics, Vol. 75, No. 2, 189–202. (used WS in demo mode!)

Huddlestone, K. & Fairhurst, M. 2013. The pragmatic markers anyway, okay, and shame: A South African English corpus study. Stellenbosch Papers in Linguistics Plus, 42, pp. 93–110.

Hyland, Ken. 2008. As can be seen: Lexical bundles and disciplinary variation. English for Specific Purposes. 27 (1): 4-21.

Hyland, Ken. 2008. Academic clusters: text patterning in published and postgraduate writing. International Journal of Applied Linguistics. 18 (1): 41-62.

Hyland, Ken. 2008. 'Small bits of textual material': voice and engagement in Swales' writing. English for Specific Purposes. 27 (2): 143-160.

Ivković, Dejan, 2013. The Eurovision Song Contest on YouTube: A Corpus-based Analysis of Language Attitudes. Language@Internet, vol. 10, article 1.

Inaki, Akiko & Tomoko Okita, 2006. 'A Small-Corpus-Based Approach to Alice's Roles.' Literary & Linguistic Computing. Vol. 21, No. 3. 283-294.

Jantunen, Jarmo H. 2001. Synonymity and Lexical Simplification in Translations: A Corpus-based Approach – Across Languages and Cultures 2 (1). 97–112.

Jantunen, Jarmo H., 2001. Tärkeä seikka ja keskeinen kysymys: mitä korpuslingvistinen analyysi paljastaa lähisynonyymeista? [What can corpus-linguistic analysis reveal about near synonyms.] – Virittäjä 105. 170–192.

Jantunen, Jarmo H., 2004. Suomennosten kieliasu puntarissa [Translated Finnish and mistakes]. – Virittäjä 108. 559–572.

Jantunen, Jarmo H., 2009. ”Minulla on aivan paljon rahaa”. Fraseologiset yksiköt suomen kielen opetuksessa. [‘I have really lots of money’ – Phraseological units in the teaching of Finnish] – Virittäjä 113. 356–381.

Jhang, Se-Eun & Lee, Sung-min, 2012. Key clusters analyses of lexical bundles used in English for academic purposes: The Biomed Corpus. The Linguistic Association of Korea Journal, 20(4), 219-239.

Jeffrey, D. & Hadley, G. 2002. “Balancing Intuition with Insight: Reflective Teaching through Diary Studies.” The Language Teacher 26 (5), 3-6. Online.

Johnson, A. and Clifford, R. 2011. Polite incivility in defensive attack: strategic politeness and impoliteness in cross-examination in the David Irving vs. Penguin Books Ltd. and Deborah Lipstadt trial. Journal of Politeness Research, 7, Special Issue: Im/politeness and facework across legal contexts, D. Archer and J. Luchjenbroers (eds), 43-71.

Johnson, A. and Wright, D. 2014. Identifying idiolect in forensic authorship attribution. Language and Law / Linguagem e Direito, Vol. 1(1), 37-69.

Johnson, Sally, Culpeper, Jonathan & Suhr, Stephanie, 2003, From 'politically correct councillors' to 'Blairite nonsense': discourses of 'political correctness' in three British newspapers. Discourse and Society 14 (1): 29-47.

Johnson, Sally & Astrid Ensslin, 2006. Language in the news: Some reflections on keyword analysis using Wordsmith Tools and the BNC. Leeds Working Papers in Linguistics, No. 11.

Kantner, Cathleen & Amelie Kutter, 2011. How to get rid of the Noise in the Corpus: Cleaning Large Samples of Digital Newspaper Texts. University of Stuttgart, Dept of International Relations & European Integration International Relations Online Working Paper Series.

Karimi, Neda, Lukin, A., Moore, A.R., Walczak, A. and Butow, P. 2018. Advanced cancer patients’ construction of self during oncology consultations: a transitivity concordance analysis. Functional Linguistics 5:6.

Kemppanen, Hannu 2004. Keywords and Ideology in Translated History Texts: A Corpus-based Analysis. Across Languages and Cultures 5 (1), 89-106.

de Klerk, V. 2003. Towards a norm in South African Englishes: the case for Xhosa English. World Englishes 22:4. 463-481.

de Klerk, V. 2002. Starting with Xhosa English ... towards a spoken corpus. International Journal of Corpus Linguistics. 7(1), 21-42.

de Klerk, V. 2002. Towards a corpus of Black South African English. Southern African Journal of Linguistics & Applied Language Studies. 20(1):25-35.

de Klerk, V. 2003. English as an institutionalised variety of English ... In search of evidence. English World-wide 24:2, 221-243.

de Klerk, V. 2004. The use of actually in spoken Xhosa English: a corpus study. World Englishes. 24(3): 275-288.

de Klerk, V. 2005. Procedural meanings of "well" in a corpus of Xhosa English. Journal of Pragmatics. 37(8): 1183-1205.

de Klerk, V. 2005. "Expressing levels of intensity in Xhosa English". English World Wide. 26(1): 77-96.

de Klerk, V. Language and the Law: who has the upper hand? A corpus analysis of the TRC hearings. AILA Review. 16: 78-84.

de Klerk, V. 2003. "The Language of Truth and Reconciliation: Was it fair to all concerned?" Southern African Journal of Linguistics & Applied Language Studies. 21 (1 & 2): 1-14.

Koteyko, Nelya, 2007. A diachronic approach to meaning in corpus linguistics: the use of English loanwords in Russian opposition discourse. Corpora, 3: 65-95.

Koteyko, Nelya, 2010. Mining the Internet for linguistic and social data: An analysis of ‘carbon compounds’ in web feeds. Discourse & Society, 21 (6): 655-674.

Koteyko, Nelya, 2011. "Managing carbon emissions: A discursive presentation of 'market-driven sustainability' in the British media". Language & Communication, doi:10.1016/j.langcom.2011.11.001.

Koteyko, Nelya and R. Carter. 2008. Discourse of ‘transformational leadership’ in infection control, Health: An Interdisciplinary Journal for the Social Study of Health, Illness and Medicine, 12(4), 479-99

Koteyko, Nelya, B. Nerlich, P. Crawford & N. Wright, 2008. ‘Not rocket science’ or ‘No silver bullet’? Media and Government Discourses about MRSA and Cleanliness. Applied Linguistics Vol 29 No 2: 223–243.

Ku, Pei-yu & Yang, Ai-ying, 1999. An Analysis of Key Words in Tourism English, The Proceedings of the 1999 English for Specific Purposes Conference, Taoyuan, Taiwan: Ming Chuan University, pp. 232-241.

Kutter, Amelie, 2012. Corpus-Based Content Analysis: A Method for Investigating News Coverage on War and Intervention. University of Stuttgart, Dept of International Relations & European Integration International Relations Online Working Paper Series.

Lamb, William, 2012. The Storyteller, the Scribe, and a Missing Man: Hidden Influences from Printed Sources in the Gaelic Tales of Duncan and Neil MacDonald. Oral Tradition, Vol 27 No. 1, pp. 109-160.

Laviosa, Sara, 2004. "When Italians talk "business" they mean it". Stella Tagnin (ed.) Corpora and Translation. TradTerm, Special Issue 10, pp. 279-293.

Laviosa, Sara, 2003, "Europe in the making in translational and non-translational English". Keith Harvey (ed.) CTIS Occasional Papers II. Manchester: St. Jerome. (in press).

Laviosa, Sara, 2001, "The Commercial Italian Corpus: A child prodigy". Tuttitalia. N. 23: 15-18.

Laviosa, Sara, 2000, “TEC: A resource for studying what is "in" and "of" translational English” in Across Languages and Cultures. 1(2): 159-178.

Laviosa, Sara, 1998, “The corpus-based approach: A new paradigm in Translation Studies”. Sara Laviosa (ed.) L’Approche basée sur le corpus/The Corpus-Based Approach. Meta, Special Issue. 474-479.

Laviosa, Sara, 1998, "Core patterns of lexical use in a comparable corpus of English narrative prose". Sara Laviosa (ed.) L’Approche basée sur le corpus/The Corpus-Based Approach. Meta, Special Issue. 557-570.

Lee, David & John Swales, 2006. A corpus-based EAP course for NNS doctoral students: Moving from available specialized corpora to self-compiled corpora. English for Specific Purposes, Vol 25, No 1. pp. 56-75.

Lee, Younghee Cheri. 2018. The Hallmarks of L2 Writing Viewed through the Prism of Translation Universals. Linguistic Research, Vol. 35, Special Edition, 171-205.

Lee, Younghee Cheri. 2019. Spotting Non-nativeness in L2 Texts: A Statistical Approach to Translationese. Studies in English Language & Literature, Vol. 45, No. 1, 367-388.

Liang, Liming, Changhong, Hou & Ling, Zhu, 2002, How do the information scientists pay attention to and study modern science (I): Information scientists’ visual angle of attention to modern science. Journal of the China Society for Scientific and Technical Information, 21(6), 656-663.

Liang, Liming & Changhong, Hou, 2003, How do the information scientists pay attention to and study modern science (II): Information scientists’ methods used to analyze science. Journal of the China Society for Scientific and Technical Information, 22(4), 387-392.

Liang, Liming & Xiaoning, Li, 2003, Word Frequency Analysis and Quantitative Study of SPRU’s Research Themes. Science Research Management, 24(3), 97-108.

Lima-Lopes, Rodrigo & Maristella Gabardo, 2019, Ni una menos: a luta pelos direitos das mulheres na Argentina e suas representações no Facebook (Ni una menos: The Struggle for Women’s Rights in Argentina and Its Representations on Facebook), Revista Brasileira de Linguística Aplicada, v. 19, n. 4, p. 801-824.

Llinares Garcia, Ana, 2007. Young learners’ functional use of the L2 in a low-immersion EFL context, ELT Journal, Vol 61, No. 1, 39-45.

Lozano, C. & Mendikoetxea, A., 2010. Interface conditions on postverbal subjects: a corpus study of L2 English. Bilingualism: Language and Cognition, 13(4): 475-497.

Lukač, Morana. 2011. Down to the bone: A corpus-based critical discourse analysis of pro-eating disorder blogs. Jezikoslovlje Vol 12 No 2. pp 187-209.

Maiwald, Patrick, 2011. “Exploring a Corpus of George MacDonald's Fiction.” North Wind: Journal of George MacDonald Studies 30 (2011): 50-84.

Marchi, Anna, 2010. 'The moral in the story': a diachronic investigation of lexicalised morality in the UK press. Corpora, Vol. 2, No. 10, 161-190.

Marín, M.J. (2012). “How relevant are Latin wordforms and clusters in legal English? A corpus-based study on the representativeness and specificity of such elements in UKSCC: an ad hoc legal corpus”. ES. Revista de Filología Inglesa, 33. Valladolid: Servicio de Publicaciones de la Universidad de Valladolid: 161-182.

Marín, M.J. (2014). “Evaluation of five single-word term recognition methods on a legal corpus”. Corpora, 9 (1). Edinburgh: Edinburgh University Press.

Marín, M.J. (2014). “A Proposal to Exploit Legal Term Repertoires Extracted Automatically from a Legal English Corpus”. Miscelánea: A Journal of English and American Studies, 49. Zaragoza: Universidad de Zaragoza.

Marín, M.J., Rea, C. (2014). “Assessing four automatic term recognition methods: Are they domain-dependent?”. English for Specific Purposes World, 42.

Marín, M.J. (2015). “Measuring precision in legal term mining: a corpus-based validation of single and multi-word term recognition methods”. ESP World, 46.

Marín Pérez, M.J. (2016). “Measuring the Degree of Specialisation of Sub-Technical Legal Terms through Corpus Comparison: a Domain-Independent Method”. Terminology, 22(1): 80-102. John Benjamins.

Marko, Georg, 2010. Heart disease and cancer, diet and exercise, vitamins and minerals. The Construction of Lifestyle Risks in Popular Health Discourse. CADAAD Journal (Critical Approaches to Discourse Analysis Across Disciplines) 4 (2) (Special Edition. Risk as Discourse. Ed. by Jens O. Zinn), 147-170.

Martínez, Iliana A. 2005. First person use in native and non-native writers of research articles in Biology. Journal of Second Language Writing. 14:174-190.

Milizia, Denise, 2004. A contrastive analysis of English and Italian financial lexis: the pay-with-plastic system. ESP Across Cultures 1: 66-78. Bari: B.A. Graphis.

Milizia, Denise, 2006. Classifying phraseology in a spoken corpus of political discourse. ESP Across Cultures 3: 41-65. Bari: B.A. Graphis.

Milizia, Denise, 2007. Translating and comparing words and phrases in English and Italian banking system. Englishes, Letterature Inglesi Contemporanee 32. Roma: Pagine, pp. 87-114.

Milizia, Denise, 2007. A corpus-driven study of non-equivalence in financial English: credit or debit? English for Specific Purposes World.

Milizia, Denise & Spinzi, Cinzia, 2008. The terroridiom principle between spoken and written discourse, special issue of International Journal of Corpus Linguistics, Volume 13, No. 3, “Patterns, meaningful units and specialised discourses”, pp. 322-350. Also available in Ute Römer and Rainer Schulze (Eds.), 2010. Patterns, Meaningful Units and Specialized Discourses, Amsterdam: Benjamins.

Milizia, Denise, 2010. A linguistic investigation of the Lisbon Treaty. Studi sull’integrazione europea. Bari: Cacucci, Vol. 5, No. 2, pp. 455-474.

Mulderrig, Jane, 2003. "Consuming Education: a critical discourse analysis of social actors in New Labour's education policy", in Journal of Critical Education Policy Studies 1.

Mulderrig, J., 2011. ‘The Grammar of Governance’, Critical Discourse Studies 8(1), 45-68.

Mulderrig, J., 2011. ‘Manufacturing Consent: a corpus-based critical discourse analysis of New Labour’s educational governance’, Journal of Educational Philosophy and Theory, 43 (6)

Nelson, Mike, 2000, "The Lexis of Business English", EL Gazette, November.

Nelson, Mike, 2003, "Worldly Experience" The Guardian Weekly, March 20.

Nesselhauf, Nadja, 2006. “Diachronic Analysis with the Internet? Will and Shall in ARCHER and in a corpus of e-texts from the web”. Language and Computers, 1 November 2006, vol. 59, no. 1, pp. 287-305.

O'Halloran, K. A. (2009) 'Inferencing and cultural reproduction: a corpus-based critical discourse analysis', Text and Talk 29(1): 21-51.

Partington, Alan, 2010. Modern Diachronic Corpus-Assisted Discourse Studies (MD-CADS) on UK newspapers: an overview of the project. Corpora, Vol. 2, No. 10, 83-108.

Peacock, Michael, 2011. A comparative study of introductory it in research articles across eight disciplines. International Journal of Corpus Linguistics, vol 26, No. 1, 72-100.

Peacock, Michael, 2010. Linking adverbials in research articles across eight disciplines. Ibérica, vol 20 (Autumn), 9-34.

Pearce, Michael, 2005, Informalization in UK party election broadcasts 1966-1997. Language and Literature, vol. 14 (1), pp. 65-90.

Perrotti-Garcia, Ana Julia. 2008. Afinal o que fizeram estes meninos?, University of São Paulo: Revista Crop, Revista de Estudos Lingüísticos e Literários em Inglês, nº 13/2008, pp. 17-26.

Peters P, Smith A, Funk Y, et al., 2016. Language, terminology and the readability of online cancer information. Medical Humanities, 42: 36-41.

Piper, Alison, 2000, "Some have credit cards and others have Giro cheques: 'individuals' and 'people' as lifelong learners in late modernity", Discourse and Society, 11 (3) 515-42.

Piqué-Angordans, J., S. Posteguillo & J.-V. Andreu-Besó, 2002. "Epistemic and deontic modality: A linguistic indicator of disciplinary variation in academic English". LSP & Professional Communication 2,2: 49-65.

Piqué-Noguera, Carmen, 2012. Writing business research abstracts: A genre approach. IBÉRICA, 24, 211-232.

Pontrandolfo G. 2011. “Phraseology in Criminal Judgments: A Corpus Study of Original vs. Translated Italian”. Sendebar, N° 22 (2011), Revista de la Facultad de Traducción e Interpretación de Granada. Universidad de Granada. pp. 209-234.

Rajh, Ivanka, 2012. A corpus-based analysis of Croatian and Slovenian term formation in the marketing domain. Croatian and Slovenian marketing terminology, Vol 1 No. 2.

Rea, C., Marín, M.J. (2015). “A Key Perspective on Specialized Lexis: Keywords in Telecommunication Engineering for CLIL”. Procedia-Social and Behavioral Sciences, 198: Current Work in Corpus Linguistics: Working with Traditionally-conceived Corpora and Beyond. Selected Papers from the 7th International Conference on Corpus Linguistics (CILC2015). Elsevier Ltd: 386-396.

Ren, Jing-Hui & Wang, Li-Xin, 2012. Mining textual-feature based research on translator's styles. Proceedings -- International Conference on Machine Learning and Cybernetics, Vol. 5, 1975-1980.

Roads, Judith, 2017. Key Seventeenth-Century Quaker Lexis, Quaker Studies, vol. 22(1), pp. 99–116.

Rodríguez-Inés, Patricia, 2010. Electronic corpora and other ICT (Information and communication technologies) tools: an integrated approach to translation teaching. The Interpreter and Translator Trainer 4(2), 251-282.

Rodrigues Júnior, Adail Sebastião, 2005. An exploratory study of representation of gay characters in a parallel corpus of short stories: a systemic-functional approach. Cadernos de Tradução 16 - 2005/2.

Rodríguez Martín, Gustavo, 2012. Comparison and other “Modes of Order” in the plays of Bernard Shaw. International Journal of English Studies, Vol 12, No. 2.

Romagnuolo, Anna, 2009. "Political Discourse in Translation: A Corpus-based perspective on Presidential Inaugurals", Translation and Interpreting Studies Journal, vol. 4, n. 1, pp. 1-30.

Römer, Ute. 2005. Shifting Foci in Language Description and Instruction: Towards a Lexical Grammar of Progressives. Arbeiten aus Anglistik und Amerikanistik 30,1 pp. 145-160.

Römer, Ute. 2006. Pedagogical applications of corpora: Some reflections on the current scope and a wish list for future developments. Zeitschrift für Anglistik und Amerikanistik 54,2: 121-134. Special issue: "The scope and limits of corpus linguistics - Empiricism in the description and analysis of English" (ed. Volker Gast).

Römer, Ute. 2008. Identification impossible? A corpus approach to realisations of evaluative meaning in academic writing. Functions of Language 15(1): 115-130. Available here

Römer, Ute. 2009. English in academia: Does nativeness matter? Anglistik: International Journal of English Studies 20,2: 89-100. Available here

Römer, Ute. 2009. The inseparability of lexis and grammar: Corpus linguistic perspectives. Annual Review of Cognitive Linguistics 7: 140-162.

Rühlemann, Christoph, Bagoutdinov, Andrej and Brook O'Donnell, Matthew, 2011. Windows on the mind: Pauses in conversational narrative. In: Gilquin, Gaëtanelle and Sylvie De Cock (eds.), Errors and Disfluencies in Spoken Corpora: Special issue of International Journal of Corpus Linguistics 16(2): 198–230.

Rybicki, Jan, 2006. Character Idiolects in Henryk Sienkiewicz's Trilogy and its Two English Translations, Literary and Linguistic Computing, 21/1, Oxford University Press, Oxford, 91-103

Samson, C. 2010. Boosting the company’s identity. Point of view and promotional discourse in a corpus of business to consumer (B2C) webpages. Textus, 1.

Samson, C. 2007. Academic and professional website identities: anything to share? Quaderni del Dipartimento di Linguistica. Università di Firenze: Unipress. pp. 169-182.

Samson, C. 2006. ' different from...'. A corpus-based study of evaluative adjectives in economics discourse. IEEE Transactions on Professional Communication. Special Issue. Insights from Corpus Linguistics for Professional Communication. Vol. 49, Issue 3, Sept. 2006, pp. 236-245.

de Schryver, Gilles-Maurice, & Gauton, Rachélle, 2002, The Zulu locative prefix ku- revisited: a corpus-based approach. Southern African Linguistics and Applied Language Studies, 20: 201-220.

de Schryver, Gilles-Maurice, 2003, Drawing up the macrostructure of a Nguni dictionary, with special reference to isiNdebele. South African Journal of African Languages, 23.

de Schryver, Gilles-Maurice & Lepota, Biki, 2001, The Lexicographic Treatment of Days in Sepedi, or When Mother-Tongue Intuition Fails. LEXIKOS, 11: 1-37.

de Schryver, Gilles-Maurice & Prinsloo, Daan J., 2000, The Concept of 'Simultaneous Feedback': Towards a New Methodology for Compiling Dictionaries. LEXIKOS, 10: 1-31.

de Schryver, Gilles-Maurice & Prinsloo, Daan J., 2000, The compilation of electronic corpora, with special reference to the African languages. Southern African Linguistics and Applied Language Studies, 18: 89-106.

de Schryver, Gilles-Maurice & Prinsloo, Daan J., 2003, Compiling a lemma-sign list for a specific target user group: The Junior Dictionary as a case in point. Dictionaries: Journal of The Dictionary Society of North America, 24.

de Schryver, Gilles-Maurice, & Taljard, Elsabé, 2006, Locative trigrams in Northern Sotho, preceded by analyses of formative bigrams. Linguistics, An Interdisciplinary Journal of the Language Sciences, 44: 135-193.

Seale C, Charteris-Black J, Ziebland S. 2006. Gender, cancer experience and internet use: a comparative keyword analysis of interviews and online cancer support groups. Social Science and Medicine. 62, 10: 2577-2590

Seale, C. 2006. Gender accommodation in online cancer support groups. Health 10(3): 345-360.

Seale, C. 2008. Mapping the field of medical sociology: a comparative analysis of journals. Sociology of Health and Illness 30, 5, 677-695.

Seale C, Boden S, Lowe P, Steinberg D, Williams S. 2007. Media constructions of sleep: a study of UK national newspapers, Social Science and Medicine. 65(3): 418-430.

Seale C, Charteris-Black J, Dumelow C, Locock J, Ziebland S. 2008. The effect of joint interviewing on the performance of gender, Field Methods 20, 107-128.

Seale, C. and Charteris-Black, J. 2008. The interaction of class and gender in illness narratives. Sociology 42(3): 453-469.

Seale, C. and Charteris-Black, J. 2008. The interaction of age and gender in illness narratives. Ageing and Society 28(7), 1025-1043

Sealey, A. and Thompson, P. 2006. 'Nice things get said'. Corpus evidence and the National Literacy Strategy. Literacy. Vol 40 No 1: 22 - 28 (ISSN 1741-4350)

Sealey, A. and Thompson, P. 2004. 'What do you call the dull words?' Primary school children using corpus-based approaches to learn about language. English in Education. Vol 38 No 1: 80 - 91 (ISSN 0425-0494)

Shin, Dongkwang and Paul Nation, 2008. Beyond single words: the most frequent collocations in spoken English. ELT Journal, Vol. 62, No. 4, 339-348.

Skorczynska Sznajder, H. & J. Piqué-Angordans, 2005. "A corpus-based description of metaphorical marking patterns in scientific and popular business discourse". 09: 112-129

Smith, Nick, Sebastian Hoffmann & Paul Rayson, 2008. "Corpus Tools and Methods, Today and Tomorrow: Incorporating Linguists' Manual Annotations". Literary and Linguistic Computing.

Sours, P. & E. Castello, "The use of corpora for teaching LSP" (legal and journalistic corpora), in Atti del Seminario Esperienze innovative e riflessioni per il futuro (2004), a cura del Centro Linguistico di Ateneo, Università di Padova.

Starke, Bettina, 2006. The Phraseology of Jane Austen's Persuasion: Phraseological units as carriers of meaning. ICAME Journal, No. 30, 87-104.

Taljard, Elsabé & de Schryver, Gilles-Maurice, 2002, Semi-automatic Term Extraction for the African Languages, with special reference to Northern Sotho. LEXIKOS, 12: 44-74.

Tagnin, S. E. O. e Teixeira, E. D. 2004. British vs. American English, Brazilian vs. European Portuguese - how close or how far apart? A corpus-driven study. In: Lewandowska-Tomaszczyk, B. (ed.). Lodz Studies in Language vol. 9: Practical Applications in Language and Computers - PALC 2003. Frankfurt am Main: Peter Lang, Available at:

Tagnin, S. E. O. e Teixeira, E. D. 2004. Lingüística de corpus e tradução técnica – relato da montagem de um corpus multivarietal de culinária. [Corpus linguistics and technical translation – reporting the creation of a multivariate corpus of cooking recipes]. TradTerm vol. 10, São Paulo.

Tarasheva, Elena, 2011. The place of Eastern European researchers in international discourse: Critical discourse analysis applied to corpora from specialized journals. Discourse and Society, 22 (2). pp. 190-208. ISSN 0957-9265.

Тарашева, Елена, 2008. Корпусите в преподаването на чужди езици. (Corpora in Foreign Language Teaching) Чуждоезиково обучение : Двумесечно научно-методическо списание = Foreign language teaching, XXIV (4). pp. 3-12. ISSN 0205-1834

Taylor, Charlotte, 2008. Metaphors of anti-Americanism in a corpus of UK, US and Italian newspapers. ESP Across Cultures, 5: 137-152.

Taylor, Charlotte, 2008. What is corpus linguistics? What the data says. ICAME Journal, 32: 143-164.

Taylor, Charlotte, 2010. Science in the news: a diachronic perspective. Corpora, Vol 5, No. 2: 221-250.

Taylor, Charlotte, 2018. Representing the Windrush generation: metaphor in discourses then and now. Critical Discourse Studies, DOI: 10.1080/17405904.2018.1554535.

Teixeira, Elisa Duarte. 2003. Em busca de um novo modelo tecno-formal para a construção de dicionários técnicos bilíngües – o exemplo da culinária. [In search of a new techno-formal model for the compilation of bilingual technical dictionaries – the example of Culinary]. Intercâmbio, vol. XII, pp. 243-251. São Paulo: LAEL – PUC/SP.

Tennant, Bob, 2010. ‘On the good name of the dead’, Religion in the Age of the Enlightenment (no. 1, 2010)

Tennant, Bob, 2010. ‘Enlightenment Sermon Studies: a multidisciplinary activity’, Religion in the Age of the Enlightenment

Toolan, Michael. 2004. Values are Descriptions; or, from Literature to Linguistics and back again by way of Keywords. Belgian Journal of English Language and Literatures (BELL New Series 2), 11-30.

Toolan, Michael. 2006. Top keyword abridgements of short stories: a corpus linguistic resource? Journal of Literary Semantics 35, 181-194.

Trigg, Lisa J. 2006. Reproducing social inequality in the National Health Information Infrastructure. In Nursing Informatics 2006—9th International Congress in Nursing Informatics, Stud Health Technol Inform. 2006;122:353-8

Trigg, Lisa J. 2007. Social Construction of the Patient through Problems of Safety, Uninsurance, and Unequal Treatment—A Discourse Analysis of Institute of Medicine Executive Summaries. MedInfo 2007- 12th International Congress Medical (Health) Informatics. Poster: Brisbane Convention Centre, Queensland, Australia, August 20-24, 2007

Valipoor, Valeh, Nader Asadi Aidinlu, Haniyeh Davatgari Asl, 2016. Investigating Lexico-grammaticality in Academic Abstracts and Their Full Research Papers from a Diachronic Perspective. The Journal of Applied Linguistics Vol.9, No.19, pp.178-197, Fall & Winter 2016.

Valipouri, Leila, & Nassaji, Hossein, 2013. A corpus-based study of academic vocabulary in chemistry research articles. Journal of English for Academic Purposes, Vol 12, pp. 248-263.

Vargas Sierra, Ch., 2002: «Utilización de los programas de concordancias en la traducción especializada», El español, lengua de traducción. I congreso internacional, Servicio de traducción de la Comisión Europea, pp. 468-483. Available online

van der Wouden, Ton. 2002. Particle research meets corpus linguistics: on the collocational behavior of particles. Belgian Journal of Linguistics 16, 151-174. Available here

Wang, Shih-ping, 2005. Corpus-based approaches and discourse analysis in relation to reduplication and repetition. Journal of Pragmatics, Vol 37/4, pp 505-540. 

Wang, Shih-ping, 2014. Using Technological Tools to Analyze Biblical Texts: Metaphorical Repetition in the Book of Matthew. European Journal of Science and Theology, October 2014, Vol.10, No.5, 147-158.

Wang, Shih-ping, & Chen, Yih-Lan. 2018. Effects of Multimodal Learning Analytics with Concept Maps on College Students’ Vocabulary and Reading Performance. Educational Technology & Society, 21 (4), 12–25.

Wang, Shih-ping, Tseng, Wen-Ta, Johanson, Robert. 2021. To We or Not to We: Corpus-Based Research on First-Person Pronoun Use in Abstracts and Conclusions. SAGE Open, 2021, 1–18.

Weninger, Csilla. 2010. The lexico-grammar of partnerships: Corpus patterns of facilitated agency. Text & Talk 30(5): 591-613.

Wilkinson, Michael, 2005a. Using a Specialized Corpus to Improve Translation Quality. Translation Journal Volume 9 No 3. (online).

Wilkinson, Michael, 2005b. Discovering Translation Equivalents in a Tourism Corpus by Means of Fuzzy Searching, in Translation Journal, Volume 9, No 4.

Wilkinson, Michael, 2006. Legal Aspects of Compiling Corpora to be used as Translation Resources, in Translation Journal, Volume 10, No 2.

Wilkinson, Michael, 2006. "The corpus analysis tool - an under-exploited translation aid", Kääntäjä, issue 7/06. Available here.

Wilkinson, Michael, 2007. "Corpora, Serendipity & Advanced Search Techniques", in The Journal of Specialised Translation, Issue 7. Online at:

Wilkinson, Michael, 2010. Quick Corpora Compiling -- Using Web as Corpus. Translation Journal. Volume 14, No. 3.

Wilkinson, Michael (2011). "WordSmith Tools: The best corpus analysis program for translators?", in Translation Journal, Vol. 15, No 3. Online at:

Wilkinson, Michael (2014). Using the Keyword Tool to Explore Lexical Differences between British and American English in Specialised Corpora. CALL-EJ 15(1), 21-38. Online at

Williams, Christopher & Denise Milizia, 2007. How (un)readable is the European Constitution? A comparison of the English version and Italian version. Studi in Onore di Vincenzo Starace. Napoli: Editoriale Scientifica: 2209-2227.

Williams, Geoffrey, 1998. Collocational Networks : Interlocking Patterns of Lexis in a Corpus of Plant Biology Research Articles. International Journal of Corpus Linguistics. Vol 3/1 : 151-171.

Williams, Geoffrey, 2002. In search of representativity in specialised corpora: categorisation through collocation. International Journal of Corpus Linguistics. Vol. 7/1: 43-64.

Williams, Geoffrey, Sionis, C., Boucher, P., 2003. Single language corpus, multilingual background. AsP 37-38: 47-57.

Williams, Geoffrey, 2003. From meaning to words and back: Corpus linguistics and specialised lexicography. AsP, 39-40, pp. 91-106.

Winters, Marion (2007) F. Scott Fitzgerald's Die Schönen und Verdammten: A corpus-based study of speech-act report verbs as a feature of translators' style, Meta.

Winters, Marion (2004) F. Scott Fitzgerald's Die Schönen und Verdammten: A corpus-based study of loan words and code switches as features of translators' style, Language Matters 35(1):248-58.

Wolsey, T. D. (2010). Complexity in student writing: The relationship between the task and vocabulary uptake. Literacy Research and Instruction, 49(2), 194-208. doi: 10.1080/19388070902947360

Wolsey, T.D., Karkouti, I.M., Hiebert, E.H. et al. (2022). Texts for reading instruction and the most common words in modern standard Arabic: An investigation. Reading and Writing.

Wulff, Stefanie & Ute Römer. 2009. Becoming a proficient academic writer: Shifting lexical preferences in the use of the progressive. Corpora 4(2): 115-133.

Xiao, Z. & A. McEnery. 2005, Two approaches to genre analysis: three genres in modern American English. Journal of English Linguistics 33(1): 62-82. Available here

Yamada, Minako. 2005. Task proficiency and L1 private speech. IRAL 43 (2), 81-108.

Zanettin, Federico. 1998. "Bilingual comparable corpora and the training of translators". In Laviosa, Sara. (ed.) META, 43:4, Special Issue. The corpus-based approach: a new paradigm in translation studies: 616-630.

Chapters, Books, Theses, Online resources