From CLS Wiki

Main: Corpora

We (the Malex project) will focus on authentic language samples to extract a list of words to be included in the lexicon (headword list) and in order to create grammatical descriptions based on current usage.

The multilingual production of CLS will also open the way to applying corpus techniques to bilingual activities as well as monolingual.

The first seeds of the Centre's corpus can be seen here.

Access system

We are using software from The Institute of Natural Language Processing, University of Stuttgart: IMS Corpus Workbench

Retrieved from
Page last modified on September 05, 2009, at 10:14 AM