To main content

Evaluation in natural language processing

Abstract

What are the purposes of evaluation. Different kinds of evaluation (of an hypothesis, of a resource, of a system in terms of its requirements, of a system in termos of usability, of model adequacy, of economical impact). Measures and concepts (properties of measures, relationship with desirable properties, statistical remarks). Evaluation of user-visible vs. user-transparent tasks; black-box vs. glass-box evaluation. The evaluation contest paradigm. Evaluation resources (golden resources, pooling, ablation). Baselines, ceilings, inter-annotator agreement. Corpus-based evaluation. Detailed examples: parsing, information retrieval, information extraction, machine translation, morphological analysis, and generation.

Category

Academic lecture

Language

English

Author(s)

  • Diana Santos

Affiliation

  • SINTEF Digital / Sustainable Communication Technologies

Presented at

19th European Summer School in Logic, Language and Information, ESSLLI 2007

Place

Dublin, England

Date

06.08.2007 - 17.08.2007

Organizer

University of Dublin

Year

2007

View this publication at Cristin