In Benchmarks We Trust ... Or Not?
Source
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, p. 23673-23687
WinoWhat: A Parallel Corpus of Paraphrased WinoGrande Sentences with Common Sense Categorization
Source
Proceedings of the 29th Conference on Computational Natural Language Learning, 31 July - 1 August, 2025, Vienna, Austria, p. 68-80
Bag of lies: robustness in continuous Pre-training BERT
Source
Computational Linguistics in the Netherlands Journal, ISSN 2211-4009, 14 (2025), p. 67-84
WinoWhat: A Parallel Corpus of Paraphrased WinoGrande Sentences with Common Sense Categorization
Source
Zenodo, 2025
Bag of Lies: Robustness in Continuous Pre-training BERT
Source
Zenodo, 2025