Bevezető Kereső Forrásjegyzék Morfoszintaktikai címkék Útmutató a kereséshez Jelmagyarázat A találatok értelmezése Kapcsolat About the project

Lectori salutem

The corpus contains texts belonging to two genres that are supposed to best represent informal language use: private letters and testimonies of witnesses in trials. All the sources predate 1772, the symbolic end of the Middle Hungarian period. As its normalized and morphologically annotated texts are also equipped with sociolinguistic metadata, it is a highly practical tool for conducting research on historical morphology and sociolinguistics, but it can also be used for studies on historical syntax, pragmatics, and lexicology. Its current size is approximately 6 M characters (850 thousand tokens).

The creation of the corpus was funded by grants Nr. 81189 and 116217 of the Hungarian Scientific Research Fund.

For further information on the corpus please refer to the following article:
Attila Novák, Katalin Gugán, Mónika Varga & Adrienne Dömötör: Creation of an annotated corpus of Old and Middle Hungarian court records and private correspondence. Language Resources and Evaluation, 2018,
or, in Hungarian:
Dömötör Adrienne, Gugán Katalin, Novák Attila, Varga Mónika: Kiútkeresés a morfológiai labirintusból – korpuszépítés ó- és középmagyar kori magánéleti szövegekből. NyK. 113 (2017): 85–110.

We kindly ask you to notify us if you publish results that were obtained using this corpus, and to cite the article above.