Language Technology Seminar Series

Department of Computer Science and Software Engineering
The University of Melbourne


Title: Interpretation of Compound Nominalisations using Corpus and Web Statistics

Speaker: Jeremy Nicholson (University of Melbourne)

Location: ICT Building, Room 2.06

Date: 7 July 2006

Time: 1-2pm

Abstract:

We present two novel paraphrase tests for automatically predicting the inherent semantic relation of a given compound nominalisation as one of subject, direct object, or prepositional object. We compare these to the usual verb-argument paraphrase test, using both corpus statistics and frequencies obtained by scraping the Google search engine interface. We also implement a statistical measure that is more robust than maximum likelihood estimation: the confidence interval. A significant reduction in data sparseness was achieved, but this alone is insufficient to provide a substantial performance improvement.
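
As a rough illustration of why a confidence interval can be more robust than maximum likelihood estimation on sparse counts, the sketch below compares the two on made-up frequencies. It is not the speaker's implementation: the abstract does not say which interval was used, so the Wilson score interval is an assumption here, and the function names and counts are hypothetical.

```python
import math


def mle(count, total):
    """Maximum likelihood estimate of a proportion: the raw relative frequency."""
    return count / total if total else 0.0


def wilson_interval(count, total, z=1.96):
    """Wilson score interval for a proportion (z=1.96 gives roughly 95%).

    Assumption: the Wilson interval stands in for the unspecified
    confidence interval mentioned in the abstract.
    """
    if total == 0:
        return (0.0, 1.0)  # no evidence at all: the interval is uninformative
    p = count / total
    denom = 1 + z * z / total
    centre = (p + z * z / (2 * total)) / denom
    half = z * math.sqrt(p * (1 - p) / total + z * z / (4 * total * total)) / denom
    return (max(0.0, centre - half), min(1.0, centre + half))


# Hypothetical paraphrase counts for one compound nominalisation:
# (matches for the relation's paraphrase, total paraphrase matches observed).
evidence = {
    "subject": (3, 4),           # high ratio, but only four observations
    "direct object": (60, 100),  # lower ratio, far more observations
}

for relation, (count, total) in evidence.items():
    low, high = wilson_interval(count, total)
    print(f"{relation}: MLE={mle(count, total):.2f}, interval=({low:.2f}, {high:.2f})")
```

With these invented counts, the raw frequency ranks "subject" first, while the lower bound of the interval ranks "direct object" first, because four observations give little support for a high estimate.
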