Simone Tedeschi's Projects
Official repository for the paper "ALERT: A Comprehensive Benchmark for Assessing Large Language Modelsโ Safety through Red Teaming"
A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
MTEB: Massive Text Embedding Benchmark
Datasets to train supervised classifiers for Named-Entity Recognition in different languages (Portuguese, German, Dutch, French, English)
A collection of notebooks for Natural Language Processing from NLP Town
Data and code for "Nibbling at the Hard Core of Word Sense Disambiguation" (ACL 2022).