Instructing Large Language Models for Low-Resource Languages: A Systematic Study for Basque Paper • 2506.07597 • Published Jun 9, 2025
Lessons from the Trenches on Reproducible Evaluation of Language Models Paper • 2405.14782 • Published May 23, 2024
Truth Knows No Language: Evaluating Truthfulness Beyond English Paper • 2502.09387 • Published Feb 13, 2025
BertaQA: How Much Do Language Models Know About Local Culture? Paper • 2406.07302 • Published Jun 11, 2024
Latxa: An Open Language Model and Evaluation Suite for Basque Paper • 2403.20266 • Published Mar 29, 2024
NLP Evaluation in trouble: On the Need to Measure LLM Data Contamination for each Benchmark Paper • 2310.18018 • Published Oct 27, 2023
Do Multilingual Language Models Think Better in English? Paper • 2308.01223 • Published Aug 2, 2023