RELATE - Romanian Portal of Language Technologies

RELATE Architecture
(click to zoom)

RELATE is a Romanian language technology platform integrating different state-of-the art tools, algorithms, models and language resources for processing the Romanian language. The modules are developed either in-house or by our partners in different research projects. Please check each page for appropriate references. The modular architecture (see the diagram) allows chaining the available modules into custom pipelines providing advanced language processing capabilities.

The platform allows direct interaction with Romanian language tools for annotating and processing data. For small data sizes it is possible to directly invoke the modules from the web interface in an interactive way. For larger data volumes, the internal platform components allow creating corpora of any size and execute parallel processing pipelines. The platform is open to the public for research purposes (including the internal part, following an account request). The platform is developed for research purposes and may not be suitable for any commercial or production use.

Platform development takes place at GitHub: https://github.com/racai-ai/relate

Featured components

TEPROLIN is a web service providing lemmatization, part-of-speech tagging, dependency parsing. The processing flow can be customized if needed.

Domain-specific Named Entity Recognition (NER) provides the ability to recognize named entities in domain-specific text, such as legal-domain or biomedical text.

Automatic Speech Recognition (ASR) services are available based on different models. The current best model available in RELATE is based on a WAV2VEC2 model adaptation.

CoRoLa is the Representative Corpus of Contemporary Romanian Language. The different query interfaces associated with the corpus can be accessed through RELATE.

Resources and Models for the Romanian language can be accessed or downloaded.

For the full list of components, check the menu on the left.

Scientific papers

The papers listed here are dedicated to the platform architecture and usage scenarios. For details about individual components, check each component's page for appropriate references. When using the platform for research work, please cite one or more of the below papers and one or more papers covering the specific components used in your research.