Is It Smaller Than a Tennis Ball? Language Models Play the Game of Twenty Questions
Bron
Proceedings of the Fifth BlackboxNLP Analyzing and Interpreting Neural Networks for NLP- () p. 80-90
Machine Translation for Multilingual Intent Detection and Slots Filling
Bron
Proceedings of the Massively Multilingual Natural Language Understanding Workshop (MMNLU-22)- (2022) p. 69-82
20Q: Overlap-Free World Knowledge Benchmark for Language Models
Bron
Proceedings of the 2nd Workshop on Natural Language Generation, Evaluation, and Metrics (GEM)- () p. 494-508
What was your name again? Interrogating generative conversational models For factual consistency evaluation
Bron
Proceedings of the 2nd Workshop on Natural Language Generation, Evaluation, and Metrics (GEM), Abu Dhabi, United Arab Emirates (Hybrid)- (2022) p. 509-519
Open-domain dialog evaluation using follow-ups likelihood
Bron
Proceedings of the 29th International Conference on Computational Linguistics,-29 (2022) p. 496-504