Gold standard for English-Swedish Europarl data (GES)
SND-ID: ext0283-1.
Access to data via
Contact
Lars Ahrenberg
Creator/Principal investigator(s)
Lars Ahrenberg - Linköping University, Department of Computer and Information Science
Maria Holmqvist - Linköping University, Department of Computer and Information Science
Research principal
Linköping University
- Department of Computer and Information Science
Description
Data are created from the English-Swedish part of the Europarl corpus. For each sentence pair in the selected subset, token correspondences are stated as pairs of integral token identifiers
Responsible department/unit
Department of Computer and Information Science
Research area
Engineering and technology (Standard för svensk indelning av forskningsämnen 2011)
Language technology (computational linguistics) (Standard för svensk indelning av forskningsämnen 2011)
Keywords
Maria Holmqvist and Lars Ahrenberg (2011). A Gold Standard for English-Swedish Word Alignment. In Proceedings of the 18th Nordic Conference on Computational Linguistics, Riga, Latvia, May 11-13, 2011.