Bevezető Kereső Forrásjegyzék Morfoszintaktikai címkék Útmutató a kereséshez Jelmagyarázat A találatok értelmezése Kapcsolat About the project
.

Lectori salutem

The corpus contains texts belonging to two genres that are supposed to best represent informal language use: private letters and testimonies of witnesses in trials. All the sources predate 1772, the symbolic end of the Middle Hungarian period. As its normalized and morphologically annotated texts are also equipped with sociolinguistic metadata, it is a highly practical tool for conducting research on historical morphology and sociolinguistics, but it can also be used for studies on historical syntax, pragmatics, and lexicology. Its current size (as of 09. 2017) is approximately 7 M characters (xx tokens), X % of which are letters and X % are deposits of witnesses.

The creation of the corpus was funded by grants Nr. 81189 and 116217 of the Hungarian Scientific Research Fund.

For further information on the corpus please refer to the following article:
Attila Novák, Katalin Gugán, Mónika Varga & Adrienne Dömötör: Creation of an annotated corpus of Old and Middle Hungarian court records and private correspondence. Language Resources and Evaluation, 2017,
or, in Hungarian:
Dömötör Adrienne, Gugán Katalin, Novák Attila, Varga Mónika: Kiútkeresés a morfológiai labirintusból – korpuszépítés ó- és középmagyar kori magánéleti szövegekből. NyK. 113 (2017): 85–110.

We kindly ask you to notify us if you publish results that were obtained using this corpus, and also to cite the given article in that case.

A korpusz létrehozását a következő két pályázat támogatta, illetve támogatja: OTKA K 81189 és NKFI–OTKA K 116217.