UNDERLINE DOI: https://doi.org/10.48448/fdc4-y857
poster
What happens if you treat ordinal ratings as interval data? Human evaluations in NLP are even more under-powered than you think
Would you like to see your presentation here, made available to a global audience of researchers?
Add your own presentation or have us affordably record your next conference.
