VIDEO DOI: https://doi.org/10.48448/fdc4-y857

poster

EMNLP 2021

November 08, 2021

Live on Underline

What happens if you treat ordinal ratings as interval data? Human evaluations in NLP are even more under-powered than you think

Please log in to leave a comment

Downloads

SlidesPaperTranscript English (automatic)

Next from EMNLP 2021

IndoNLG: Benchmark and Resources for Evaluating Indonesian Natural Language Generation
poster

IndoNLG: Benchmark and Resources for Evaluating Indonesian Natural Language Generation

EMNLP 2021

+9Pascale FungSamuel Cahyawijaya
Samuel Cahyawijaya and 11 other authors

08 November 2021

Similar lecture

XTREME-R: Towards More Challenging and Nuanced Multilingual Evaluation
technical paper

XTREME-R: Towards More Challenging and Nuanced Multilingual Evaluation

EMNLP 2021

Sebastian Ruder
Sebastian Ruder

08 November 2021