2013
pdf
bib
Generating Paths through Cultural Heritage Collections
Samuel Fernando
|
Paula Goodale
|
Paul Clough
|
Mark Stevenson
|
Mark Hall
|
Eneko Agirre
Proceedings of the 7th Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities
pdf
bib
PATHS: A System for Accessing Cultural Heritage Collections
Eneko Agirre
|
Nikolaos Aletras
|
Paul Clough
|
Samuel Fernando
|
Paula Goodale
|
Mark Hall
|
Aitor Soroa
|
Mark Stevenson
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics: System Demonstrations
2012
pdf
bib
Adapting Wikification to Cultural Heritage
Samuel Fernando
|
Mark Stevenson
Proceedings of the 6th Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities
pdf
bib
Comparing Taxonomies for Organising Collections of Documents
Samuel Fernando
|
Mark Hall
|
Eneko Agirre
|
Aitor Soroa
|
Paul Clough
|
Mark Stevenson
Proceedings of COLING 2012
pdf
bib
abs
Mapping WordNet synsets to Wikipedia articles
Samuel Fernando
|
Mark Stevenson
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Lexical knowledge bases (LKBs), such as WordNet, have been shown to be useful for a range of language processing tasks. Extending these resources is an expensive and time-consuming process. This paper describes an approach to address this problem by automatically generating a mapping from WordNet synsets to Wikipedia articles. A sample of synsets has been manually annotated with article matches for evaluation purposes. The automatic methods are shown to create mappings with precision of 87.8% and recall of 46.9%. These mappings can then be used as a basis for enriching WordNet with new relations based on Wikipedia links. The manual and automatically created data is available online.
pdf
bib
abs
Matching Cultural Heritage items to Wikipedia
Eneko Agirre
|
Ander Barrena
|
Oier Lopez de Lacalle
|
Aitor Soroa
|
Samuel Fernando
|
Mark Stevenson
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Digitised Cultural Heritage (CH) items usually have short descriptions and lack rich contextual information. Wikipedia articles, on the contrary, include in-depth descriptions and links to related articles, which motivate the enrichment of CH items with information from Wikipedia. In this paper we explore the feasibility of finding matching articles in Wikipedia for a given Cultural Heritage item. We manually annotated a random sample of items from Europeana, and performed a qualitative and quantitative study of the issues and problems that arise, showing that each kind of CH item is different and needs a nuanced definition of what ``matching article'' means. In addition, we test a well-known wikification (aka entity linking) algorithm on the task. Our results indicate that a substantial number of items can be effectively linked to their corresponding Wikipedia article.