Talk: Keynote: Corpora, methods and tools for (German) linguistics
Marc Kupietz Keynote
The talk will give an overview over the basic principles and recent developments of the empirical fundamentals for German (corpus) linguistics that have been developed and made available at the Leibniz-Institute for the German Language (IDS), already since the late 1960s. I will begin with an introduction to the German Reference Corpus DeReKo, discussing its design principles, development considerations and characteristics. Then I will present the tools we offer for querying and analyzing DeReKo, showing useful functionalities of the open-source platform KorAP and demonstrating practically the use of it’s recently published client libraries for R and Python. Finally, I will report on recent work in distributional syntax and semantics to gain new insights into language use from very large corpora, demonstrating also some publicly available tools. The talk will focus on German, but will use English examples where possible.
Info
Day:
2020-11-21
Start time:
14:00
Duration:
01:00
Room:
Agathe Lasch
Track:
Applied Linguistics
Language:
en
Links:
Feedback
Click here to let us know how you liked this event.