CORPUS AND COMPUTATIONAL LINGUISTICS ON THE EXAMPLE OF THE EC QUESTIONNAIRE
Keywords:
corpus, linguistic analysis, computers, the EC QuestionnaireAbstract
This paper deals with the analysis of one part of the European Commission Questionnaire. From the complete analysis in our doctoral thesis, two aspects have been chosen for the purposes of this paper. The first one is the analysis of impersonalised register, while the second aims to find the connexion between hapax legomena and the mistakes we have spotted in the abovementioned corpus. We will show how the use of corpora and computers can shed new light onto traditional linguistic research. Furthermore, this paper should also be understood as a way of promoting the Corpus and Computational linguistics which are yet to find their full application within linguistic research.
References
Agresti 2002: A. Agresti, Categorical Data Analysis. Hoboken, NJ: Wiley.
Aston (ed.) 2001: G. Aston, Learning With Corpora. Bologna. CLUEB.
Baayen 2008: R. Baayen, Analyzing Linguistic Data: A Practical Introduction to Statistics Using R. Cambridge: Cambridge University Press.
Baayen, R. Harald, Richard Piepenbrock, and Leon Gulikers. 1995. The CELEX Lexical Database (Release 2). Philadelphia, PA: Linguistic Data Consortium.
Barnbrook 1996: G. Barnbrook, Language and Computers. Edinburgh: Edinburgh University Press.
Biber 1988: D. Biber, Variation Across Speech and Writing. Cambridge: Cambridge University Press.
Biber 1998: D. Biber, Corpus Linguistics: Investigating Language Structure and Use. Cambridge: Cambridge University Press.
Bod 2003: R. Bod, Probabilistic Linguistics. Cambridge, MA: MIT Press.
Bugarski 1972: R. Bugarski, Jezik i lingvistika, Nolit, Beograd.
Bugarski 1986: R. Bugarski, Terminologija kontrastivne lingvistike, u: Kontrastivna jezička istraživanja, III simpozijum (Novi Sad, 6. i 7. decembar 1985), Zbornik radova, Univerzitet u Novom Sadu, Filozofski fakultet, Novi Sad, 383–390.
Bugarski 1990: R. Bugarski, Integralna kontrastivna analiza, u: Kontrastivna jezička istraživanja, IV simpozijum (Novi Sad, 8. i 9. decembar 1989), Zbornik radova, Filozofski fakultet, Novi Sad, 58–62.
Church 1991: K. Church, Using statistics in lexical analysis. In Lexical Acquisition: Exploiting On-line Resources to Build a Lexicon, ed. Uri Zernik, 115–164. Hillsdale, NJ: Lawrence Erlbaum.
McEnery 2001: T. McEnery, Corpus Linguistics, Edinburgh: Edinburgh University Press.
Sinclair 1991: J. Sinclair, Corpus, Concordance, Collocation, Oxford: Oxford University Press.