Talk: Keynote: Corpora, methods and tools for (German) linguistics

Marc Kupietz Keynote

The talk will give an overview over the basic principles and recent developments of the empirical fundamentals for German (corpus) linguistics that have been developed and made available at the Leibniz-Institute for the German Language (IDS), already since the late 1960s. I will begin with an introduction to the German Reference Corpus DeReKo, discussing its design principles, development considerations and characteristics. Then I will present the tools we offer for querying and analyzing DeReKo, showing useful functionalities of the open-source platform KorAP and demonstrating practically the use of it’s recently published client libraries for R and Python. Finally, I will report on recent work in distributional syntax and semantics to gain new insights into language use from very large corpora, demonstrating also some publicly available tools. The talk will focus on German, but will use English examples where possible.

Info

Day: 2020-11-21
Start time: 14:00
Duration: 01:00
Room: Agathe Lasch
Track: Applied Linguistics
Language: en

Links:

Feedback

Click here to let us know how you liked this event.

Concurrent Events