Experience
Education
Certificates
Skills
Publications
Projects
Tags
Contact
Light
Dark
Automatic
English
Euskara
Español
Article-Journal
IKER-GAITU: research on language technology for Basque and other low-resource languages
The general objective of the IKER-GAITU project is to research on language technology to increase the presence of Basque in the digital …
Eneko Agirre
,
Itziar Aldabe
,
Xabier Arregi
,
Mikel Artetxe
,
Unai Atutxa
,
Ekhi Azurmendi
,
Iker De La Iglesia
,
Julen Etxaniz
,
Victor García-Romillo
,
Inma Hernaez-Rioja
,
Others
PDF
Cite
XNLIeu: a dataset for cross-lingual NLI in Basque
XNLI is a popular Natural Language Inference (NLI) benchmark widely used to evaluate cross-lingual Natural Language Understanding (NLU) …
Maite Heredia
,
Julen Etxaniz
,
Muitze Zulaika
,
Xabier Saralegi
,
Jeremy Barnes
,
Aitor Soroa
PDF
Cite
Code
Dataset
arXiv
Latxa: An Open Language Model and Evaluation Suite for Basque
We introduce Latxa, a family of large language models for Basque ranging from 7 to 70 billion parameters. Latxa is based on Llama 2, …
Julen Etxaniz
,
Oscar Sainz
,
Naiara Perez
,
Itziar Aldabe
,
German Rigau
,
Eneko Agirre
,
Aitor Ormazabal
,
Mikel Artetxe
,
Aitor Soroa
PDF
Cite
Code
Dataset
arXiv
NLP Evaluation in trouble: On the Need to Measure LLM Data Contamination for each Benchmark
In this position paper, we argue that the classical evaluation on Natural Language Processing (NLP) tasks using annotated benchmarks is …
Oscar Sainz
,
Jon Ander Campos
,
Iker García-Ferrero
,
Julen Etxaniz
,
Oier Lopez De Lacalle
,
Eneko Agirre
PDF
Cite
arXiv
Do Multilingual Language Models Think Better in English?
Translate-test is a popular technique to improve the performance of multilingual language models. This approach works by translating …
Julen Etxaniz
,
Gorka Azkune
,
Aitor Soroa
,
Oier Lopez De Lacalle
,
Mikel Artetxe
PDF
Cite
Code
Dataset
arXiv
Cite
×