The Megrelian Language Corpus (MLC) aims to stimulate various approaches to linguistic and socio-cultural studies of Megrelian texts and to foster a deeper understanding of the context they represent. The MLC consists of a digital collection of texts, audio and video materials.
The digital documentation and archiving process was carried out in two main stages: fieldwork conceptualization and data collection, followed by laboratory analysis and data processing.
The project began with the planning and implementation of expeditions, including route selection and risk assessment. Target informants were identified, ensuring diversity in age and gender to create a balanced corpus covering various topics and genres. Throughout this process, video, audio, and textual data were systematically collected to ensure comprehensive documentation.
Following data collection, a structured process was implemented for analysis and storage. Linguistic transcriptions followed the International Phonetic Alphabet (IPA), while glossing adhered to the Leipzig Glossing Rules and Eurotyp Guidelines.
The finalized data were made available online through The Megrelian Language Corpus (MLC).
Click here to use the Online Converter for Megrelian.
Total number of unique words (types): | 60954 |
Megrelian: | 30578 |
Megrelian (International Phonetic Alphabet): | 30371 |
Total word count (tokens): | 97688 |
Megrelian: | 97688 |
Megrelian (International Phonetic Alphabet): | 97565 |
Total number of sentences (segments): | 9261 |
This work was supported by the Shota Rustaveli National Science Foundation of Georgia (SRNSFG) [FR-21-993-3, Annotated Corpus of the Megrelian Language with Sketch Grammar and Online Dictionary]. Additionally, co-funding from Ilia State University enabled us to host the website online and ensure its IT support. All ideas expressed herein are those of the authors and do not represent the opinions of the Foundation or the University.
The Corpus of Megrelian Language (MLC) by Rusudan Gersamia, Irina Lobzhanidze is licensed under CC BY-NC-SA 4.0. This licence allows users to copy and redistribute the material in any medium or format, as well as remix, transform, and build upon it, provided that their contributions are distributed under the same license. The primary restriction is that the material must not be used for commercial purposes.
In future all contributions to the MLC should be submitted to the editors for review at rgersamia[@]iliauni.edu.ge or irina_lobzhanidze[@]iliauni.edu.ge and will be distributed under the same license.
If you want to cite material from the MLC, please, copy and paste the following information: Gersamia, R., Lobzhanidze, I., ed. The Megrelian Language Corpus. Megrelian Language Corpus, 4 April. 2022(v1). Web, https://xmf.iliauni.edu.ge/
This project, aimed at preserving the endangered Megrelian language, was led by principal investigators Professor Dr. Rusudan Gersamia and Professor Dr. Irina Lobzhanidze.
We also acknowledge the invaluable contributions of project team members Nino Tsulaia, MA and Tamuna Skhulukhia, MA, whose hard work was crucial to the project's progress.
Throughout various phases of the project, several individuals provided their expertise and support: Professor Dr. Zaal Kikvidze, Tamar Gogia, PhD, and Mariam Nadaraia, MA. Their involvement significantly enriched the project's outcomes.
We are grateful to the volunteers — Khatia Danelia, BA, Khatia Kobalia, BA, Lika Jalaghonia, BA, , Sopiko Tsertsvadze, BA, and Lasha Kvlividze, BA — whose enthusiasm and commitment considerably advanced our efforts.