Institutions | About Us | Help | Gaeilge
rian logo


Mark
Go Back
Investigating Entity Linking in Early English Legal Documents
Lawless, Seamus
In this paper we investigate the accuracy and overall suitability of a variety of Entity Linking systems for the task of disambiguating entities in 17 th century depositions obtained during the 1641 Irish Rebellion. The depositions are extremely difficult for modern NLP tools to work with due to inconsistent spelling, use of language and archaic references. In order to assess the severity of difficulty faced by Entity Linking systems when working with these documents we use the depositions to create an evaluation corpus. This corpus is used as an input to the General Entity Annotator Benchmarking Framework, a standard benchmarking platform for entity annotation systems. Based on this corpus and the results obtained from General Entity Annotator Benchmarking Framework we observe that the accuracy of existing Entity Linking systems is lacking when applied to content like these depositions. This is due to a number of issues ranging from problems with existing state-of-the-art systems to poor representation of historic entities in modern knowledge bases. We discuss some interesting questions raised by this evaluation and put forward a plan for future work in order to learn more.
Keyword(s): Named Entity Disambiguation; Digital Humanities; Cultural Heritage; Digital Engagement; Making Ireland; Entity Linking; Knowledge and data engineering; SEMANTIC WEB
Publication Date:
2018
Type: Conference item
Peer-Reviewed: Yes
Language(s): English
Institution: Trinity College Dublin
Citation(s): Gary Munnelly and S?amus Lawless, Investigating Entity Linking in Early English Legal Documents, ACM/IEEE Joint Conference on Digital Libraries, JCDL 2018, Fort Worth, Texas, USA, 3rd-6th June, 2018
Alternative Title(s): ACM/IEEE Joint Conference on Digital Libraries, JCDL 2018
First Indexed: 2019-05-19 06:12:11 Last Updated: 2019-10-04 07:06:53