The JRC-Acquis: A multilingual aligned parallel corpus with 20+ languages R Steinberger, B Pouliquen, A Widiger, C Ignat, T Erjavec, D Tufis, ... arXiv preprint cs/0609058, 2006 | 776 | 2006 |
MULTEXT-East Version 3: Multilingual Morphosyntactic Specifications, Lexicons and Corpora. T Erjavec LREC, 2004 | 329 | 2004 |
Universal Dependencies 1.2 J Nivre, Ž Agić, MJ Aranzabe, M Asahara, A Atutxa, M Ballesteros, J Bauer, ... Universal Dependencies Consortium, 2015 | 221 | 2015 |
TEI P5: Guidelines for electronic text encoding and interchange TEI Consortium, L Burnard, CM Sperberg-McQueen | 195 | 2008 |
Sense discrimination with parallel corpora N Ide, T Erjavec, D Tufiş Proceedings of the ACL-02 workshop on Word sense disambiguation: recent …, 2002 | 177 | 2002 |
hrWaC and slWaC: Compiling web corpora for Croatian and Slovene N Ljubešić, T Erjavec Text, Speech and Dialogue: 14th International Conference, TSD 2011, Pilsen …, 2011 | 145 | 2011 |
MULTEXT-East: Parallel and Comparable Corpora and Lexicons for Six Central and Eastern European Languages L Dimitrova, N Ide, V Petkevic, T Erjavec, HJ Kaalep, D Tufis Proceedings of the 36th Annual Meeting of the Association for Computational …, 1998 | 144 | 1998 |
Towards a Slovene Dependency Treebank S Džeroski, T Erjavec, N Ledinek, P Pajas, Z Žabokrtský, A Žele Proceedings of the 5th International Conference on Language Resources and …, 2006 | 133 | 2006 |
MULTEXT-East: morphosyntactic resources for Central and Eastern European languages T Erjavec Language resources and evaluation 46, 131-142, 2012 | 132 | 2012 |
Lemmagen: Multilingual lemmatisation with induced ripple-down rules M Juršič, I Mozetič, T Erjavec, N Lavrač Journal of Universal Computer Science 16 (9), 1190-1214, 2010 | 109 | 2010 |
Machine learning of morphosyntactic structure: Lemmatizing unknown Slovene words T Erjavec, S Džeroski Applied Artificial Intelligence 18 (1), 17-41, 2004 | 107 | 2004 |
Legal framework, dataset and annotation schema for socially unacceptable online discourse practices in Slovene D Fišer, T Erjavec, N Ljubešić Proceedings of the first workshop on abusive language online, 46-51, 2017 | 103 | 2017 |
Massive multi-lingual corpus compilation: Acquis Communautaire and ToTaLe T Erjavec, C Ignat, B Pouliquen, R Steinberger Archives of Control Science 15 (4), 529, 2005 | 92 | 2005 |
The ParlaMint corpora of parliamentary proceedings T Erjavec, M Ogrodniczuk, P Osenova, N Ljubešić, K Simov, A Pančur, ... Language resources and evaluation 57 (1), 415-448, 2023 | 86 | 2023 |
Designing and evaluating a Russian tagset S Sharoff, M Kopotev, T Erjavec, A Feldman, D Divjak | 86 | 2008 |
Uvod v korpusno jezikoslovje V Gorjanc, M Stabej, T Erjavec, I Saksida, S Kranjc Izolit, 2005 | 79 | 2005 |
Korpusi slovenskega jezika Gigafida, KRES, ccGigafida in ccKRES: gradnja, vsebina, uporaba N Logar, M Grčar, M Brakus, T Erjavec, ŠA Holdt, S Krek Znanstvena založba Filozofske fakultete, 2020 | 74 | 2020 |
The JOS Linguistically Tagged Corpus of Slovene. T Erjavec, D Fišer, S Krek, N Ledinek LREC, 2010 | 73 | 2010 |
The MULTEXT-East corpus. T Erjavec, N Ide LREC, 971-974, 1998 | 67 | 1998 |
The MULTEXT-East morphosyntactic specification for Slavic languages T Erjavec, C Krstev, V Petkevic, K Simov, M Tadić, D Vitas Proceedings of the 2003 EACL Workshop on Morphological Processing of Slavic …, 2003 | 63 | 2003 |