Corpus Studies

Course Code :2002FLWDTA
Study domain:Linguistics and Proficiency
Academic year:2020-2021
Semester:1st semester
Contact hours:12
Credits:3
Study load (hours):84
Contract restrictions: No contract restriction
Language of instruction:English
Exam period:exam in the 1st semester
Lecturer(s)Peter Petré

3. Course contents *

The course 'Corpus Studies' serves as an introduction in the use of corpora, structured collections of texts, to analyse language data with the aim of answering research questions of a linguistic, literary, historical, or interdisciplinary nature. The course consists of a a praticial introduction to how to do corpus research independently. This includes a discussion of tools, retrieval techniques (particularly regular expressions), commonly used corpora as well as how to build your own corpus, and some statistical techniques typically associated with corpus research (e.g., collostructional analysis, dispersion, correlation measures of variables). Much of this part will be hands-on, with students working on their own case towards a short end paper. In addition, the course is intended as a reflection on the more theoretical implications of corpus research, discussing questions such as: What kind of knowledge do we think we are gaining by doing corpus research? Can we look into the heads of people by looking into their texts quantitatively? What is the importance of making the difference between idiolect and population perspectives? What is the complementary nature of qualitative and quantitative corpus research?