ACL/HCSNet Advanced Program in
Natural Language Processing

University of Melbourne, 10-14 July 2006

Steven Bird: Parsing with the Natural Language Toolkit

Abstract

The Natural Language Toolkit is a suite of program modules, data sets and tutorials supporting research and teaching in computational linguistics and natural language processing. NLTK is written in Python and distributed under the GPL open source license. This session will introduce and demonstrate NLTK's modules for weighted grammars grammars and probabilistic parsing.

Biographical Sketch

Steven Bird is Associate Professor in Computer Science at the University of Melbourne, and Senior Research Associate at the Linguistic Data Consortium. His research focusses on formal and computational models for linguistic information, with application to human language technologies and to the description of the world's ~7,000 languages. Before coming to Melbourne University he did doctoral and post-doctoral research at the University of Edinburgh (1987-94). From 1995-97 he conducted linguistic fieldwork on the languages of western Cameroon, published a dictionary, and helped develop several new writing systems. From 1998-2002 he was associate director of the Linguistic Data Consortium at the University of Pennsylvania, where he led an R&D team working on open-source software for linguistic annotation.


ACL/HCSNet Advanced Program in Natural Language Processing