4th edition

Nitro NLP Hackathon 2025

29 - 30 March 2025

CMMS wins 2025 edition

Congratulations to the fourth edition winners!

  • Scarlat Marius Ștefan
  • Popescu Ștefan-Alexandru
  • Dinu Matei-Alexandru
  • Cristiana Cocheci

We look forward to working with you to prepare the next edition!

Task & Leaderboard

Editorial

Thank you all for a great edition! We went duck hunting, we went bug hunting, but most importantly, we solved some real-life problems. If you want to see how our best competitors solved this task, check out our closing ceremony and winner presentations.

Watch on YouTube

About the dataset

MoRoCo, the Moldavian and Romanian Dialectal Corpus, was published in the Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019). The corpus contains 33,564 news articles, each belonging to one of six topics: culture, finance, politics, science, sports, and tech. As such, it can be used for binary discrimination of Moldavian versus Romanian text samples, intra-dialect multi-class categorization by topic, and cross-dialect multi-class categorization by topic.
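
To make the three setups concrete, here is a minimal sketch of how each one could be derived from the same labeled samples. The record fields (`text`, `dialect`, `topic`), the dialect codes `"MD"` and `"RO"`, the toy records, and the function names are all assumptions made for illustration; they do not reflect the corpus's actual file format or the hackathon task.

```python
# Hypothetical toy records. Field names and dialect codes are assumptions
# for illustration, not the real MoRoCo layout.
SAMPLES = [
    {"text": "sample one", "dialect": "RO", "topic": "sports"},
    {"text": "sample two", "dialect": "RO", "topic": "finance"},
    {"text": "sample three", "dialect": "MD", "topic": "sports"},
    {"text": "sample four", "dialect": "MD", "topic": "politics"},
]

# The six topics named in the dataset description.
TOPICS = ("culture", "finance", "politics", "science", "sports", "tech")


def dialect_task(samples):
    """Binary discrimination: pair each text with its dialect label."""
    return [(s["text"], s["dialect"]) for s in samples]


def intra_dialect_topic_task(samples, dialect):
    """Topic categorization using only the samples of one dialect."""
    return [(s["text"], s["topic"]) for s in samples if s["dialect"] == dialect]


def cross_dialect_topic_task(samples, train_dialect, test_dialect):
    """Topic categorization: train on one dialect, evaluate on the other."""
    train = intra_dialect_topic_task(samples, train_dialect)
    test = intra_dialect_topic_task(samples, test_dialect)
    return train, test


if __name__ == "__main__":
    print(dialect_task(SAMPLES))
    print(intra_dialect_topic_task(SAMPLES, "RO"))
    print(cross_dialect_topic_task(SAMPLES, "RO", "MD"))
```

The only difference between the intra-dialect and cross-dialect setups in this sketch is which dialect supplies the evaluation samples; the label space (the six topics) stays the same.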

Original Paper

Host

Universitatea din București

Edition Sponsor

JetBrains