EUROVOC Classification

Introduceți un document în limba română.


Num Threshold (0.0 - 1.0)

The EUROVOC classification model for Romanian language was trained using FastText using CoRoLa based word embeddings. It is served using a changed version of FastText supporting serving trained models. It can be downloaded from our github: https://github.com/racai-ai/ServerFastText.

The model achieves P=50.93, R=56.40, F1=53.53 on EUROVOC IDs, while conversion to MT labels translates into P=56.05, R=68.95, F1=61.83 and for top-level domains P=64.9, R=77.89, F1=70.80.

The model can be downloaded here: BIN or VEC.