Show simple item record

Automatic induction of verb classes using clustering

dc.contributorKorhonen, Anna
dc.creatorSun, Lin
dc.date.accessioned2018-11-24T13:11:53Z
dc.date.available2013-07-17T11:45:14Z
dc.date.available2018-11-24T13:11:53Z
dc.date.issued2013-04-30
dc.identifierhttp://www.dspace.cam.ac.uk/handle/1810/244714
dc.identifierhttps://www.repository.cam.ac.uk/handle/1810/244714
dc.identifier.urihttp://repository.aust.edu.ng/xmlui/handle/123456789/3043
dc.description.abstractVerb classifications have attracted a great deal of interest in both linguistics and natural language processing (NLP). They have proved useful for important tasks and applications, including e.g. computational lexicography, parsing, word sense disambiguation, semantic role labelling, information extraction, question-answering, and machine translation (Swier and Stevenson, 2004; Dang, 2004; Shi and Mihalcea, 2005; Kipper et al., 2008; Zapirain et al., 2008; Rios et al., 2011). Particularly useful are classes which capture generalizations about a range of linguistic properties (e.g. lexical, (morpho-)syntactic, semantic), such as those proposed by Beth Levin (1993). However, full exploitation of such classes in real-world tasks has been limited because no comprehensive or domain-specific lexical classification is available. This thesis investigates how Levin-style lexical semantic classes could be learned automatically from corpus data. Automatic acquisition is cost-effective when it involves either no or minimal supervision and it can be applied to any domain of interest where adequate corpus data is available. We improve on earlier work on automatic verb clustering. We introduce new features and new clustering methods to improve the accuracy and coverage. We evaluate our methods and features on well-established cross-domain datasets in English, on a specific domain of English (the biomedical) and on another language (French), reporting promising results. Finally, our task-based evaluation demonstrates that the automatically acquired lexical classes enable new approaches to some NLP tasks (e.g. metaphor identification) and help to improve the accuracy of existing ones (e.g. argumentative zoning).
dc.languageen
dc.publisherUniversity of Cambridge
dc.publisherFaculty of Computer Science and Technology
dc.publisherComputer Laboratory
dc.rightshttp://creativecommons.org/licenses/by-nc-sa/2.0/uk/
dc.rightsAttribution-NonCommercial-ShareAlike 2.0 UK: England & Wales
dc.subjectLexical semantics
dc.subjectVerb classification
dc.subjectVerb clustering
dc.subjectUnsupervised learning
dc.subjectLangauge acquisition
dc.titleAutomatic induction of verb classes using clustering
dc.typeThesis


Files in this item

FilesSizeFormatView
thesis.pdf907.8Kbapplication/pdfView/Open

This item appears in the following Collection(s)

Show simple item record