UDLexicons

Multilingual collection of morphological lexicons

Description

UDLexicons is a multilingual collection of 53 morphological lexicons covering 38 languages. The lexicons follow the guidelines and format of the Universal Dependencies (UD) initiative, and were created from existing resources using three different approaches described in (Sagot 2018).

They use the CoNLL-UL extension of UD's CoNLL-U format, described in (More et al. 2018).

They are named using the following naming scheme: UDL-language_name-source_name.conllul, where source_name is the name of the main source of lexical information, sometimes followed by indications about the method used for extracting this information (see Sagot 2018 for details).
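As an illustration of the naming scheme, here is a minimal Python sketch that splits a lexicon file name into its language and source parts. The function name `parse_lexicon_filename` and the file names used below are hypothetical, and the sketch assumes that the language name itself contains no hyphen, so that everything after the first hyphen following `UDL-` is treated as the source name (including any indication about the extraction method).

```python
from typing import NamedTuple


class LexiconName(NamedTuple):
    language: str
    source: str


def parse_lexicon_filename(filename: str) -> LexiconName:
    """Split 'UDL-language_name-source_name.conllul' into its two parts.

    Assumption (not stated by the source): language_name has no hyphen,
    so the first hyphen after the 'UDL-' prefix separates it from
    source_name, which may itself contain further hyphens.
    """
    prefix, suffix = "UDL-", ".conllul"
    if not (filename.startswith(prefix) and filename.endswith(suffix)):
        raise ValueError(f"not a UDLexicons file name: {filename!r}")
    # Drop the fixed prefix and extension, keeping 'language-source'.
    stem = filename[len(prefix):-len(suffix)]
    language, sep, source = stem.partition("-")
    if not sep or not language or not source:
        raise ValueError(f"missing language or source name: {filename!r}")
    return LexiconName(language, source)


# Hypothetical file name, for illustration only.
name = parse_lexicon_filename("UDL-SomeLanguage-SomeSource-method.conllul")
print(name.language, name.source)
```

Splitting on the first hyphen only keeps a source name such as `SomeSource-method` in one piece, which matches the description of source names that carry extra information about the extraction method.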

Download

You can download it here!

UDLexicons is distributed under the following licence: .

Citation and publication(s)

If you use this work, please cite the following:

Benoît Sagot. 2018. A multilingual collection of CoNLL-U-compatible morphological lexicons.
In Eleventh International Conference on Language Resources and Evaluation (LREC 2018). Miyazaki, Japan.
@inproceedings{Sagot_A-multilingual-collection-of_2018,
 address = {Miyazaki, Japan},
 author = {Sagot, Beno{\^i}t},
 title = {{A multilingual collection of CoNLL-U-compatible morphological lexicons}},
 year = {2018},
 booktitle = {{Eleventh International Conference on Language Resources and Evaluation (LREC 2018)}},
 url = {https://inria.hal.science/hal-01798798},
 hal_pdf = {https://inria.hal.science/hal-01798798v2/file/lrec18udlexicons.pdf},
}

Contact

For more information or if you have any questions, please contact Benoît Sagot:

Benoit.Sagot[at]inria.fr