Evaluation in natural language processing

Abstract

What are the purposes of evaluation. Different kinds of evaluation (of an hypothesis, of a resource, of a system in terms of its requirements, of a system in termos of usability, of model adequacy, of economical impact). Measures and concepts (properties of measures, relationship with desirable properties, statistical remarks). Evaluation of user-visible vs. user-transparent tasks; black-box vs. glass-box evaluation. The evaluation contest paradigm. Evaluation resources (golden resources, pooling, ablation). Baselines, ceilings, inter-annotator agreement. Corpus-based evaluation. Detailed examples: parsing, information retrieval, information extraction, machine translation, morphological analysis, and generation.

Language

English

Author(s)

Diana Santos

Affiliation

SINTEF Digital / Sustainable Communication Technologies

Presented at

19th European Summer School in Logic, Language and Information, ESSLLI 2007

Place

Dublin, England

Date

06.08.2007 - 17.08.2007

Organizer

University of Dublin

Year

2007

External resources

https://www.cs.tcd.ie/esslli2007/content/courses/id17.html

View this publication at Cristin

Contact us

Our services

Career

Sustainability

Management and board

Institutes

Other units

About us

Follow us