An Intelligent Search Infrastructure
for Language Resources on the Web
Special Research Initiatives - E-Research SR0567353
Amount(s): 2005: AU$49,018; 2006: AU$49,018
Project Activities: Q1 2006
The project plans and objectives for Q1 2006 are:
- Module 1: Language Crawler.
This module will use pre-existing tools to selectively retrieve text from
the web that has been identified as being of potential linguistic interest.
The language of each document will be identified through a combination
of code-string analysis and character n-gram analysis, and documents
authored in languages of economic, scientific and cultural interest to
Australia will be selected. This material will be stored in a centralized
repository for indexing, annotation and preservation.
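
The character n-gram step described above could be sketched roughly as follows. This is only an illustration, not the project's actual tooling: it assumes a cosine-similarity comparison of character trigram counts, and the training samples, the LANGUAGES_OF_INTEREST set and all function names are hypothetical. The code-string analysis step is not covered.

```python
import math
from collections import Counter


def char_ngrams(text, n=3):
    """Count overlapping character n-grams in lowercased, whitespace-normalised text."""
    padded = " " + " ".join(text.lower().split()) + " "
    return Counter(padded[i:i + n] for i in range(len(padded) - n + 1))


def cosine(a, b):
    """Cosine similarity between two n-gram count vectors (0.0 if either is empty)."""
    dot = sum(count * b[gram] for gram, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def identify_language(text, profiles):
    """Return the name of the profile whose n-gram counts are most similar to the text."""
    doc = char_ngrams(text)
    return max(profiles, key=lambda lang: cosine(doc, profiles[lang]))


# Hypothetical toy training samples; a real system would use far larger corpora.
SAMPLES = {
    "english": "the quick brown fox jumps over the lazy dog and the cat sat on the mat",
    "indonesian": "saya sedang membaca buku di perpustakaan dan dia menulis surat di kota",
}
PROFILES = {lang: char_ngrams(sample) for lang, sample in SAMPLES.items()}

# Hypothetical stand-in for the languages the crawler is meant to retain.
LANGUAGES_OF_INTEREST = {"indonesian"}


def should_store(text):
    """Keep a document only if its identified language is in the set of interest."""
    return identify_language(text, PROFILES) in LANGUAGES_OF_INTEREST
```

Under these assumptions, a crawler would call should_store on each fetched document and pass only the retained ones to the repository. With samples this small the classifier is unreliable on short or mixed-language text.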
Last Updated:
Mon Jan 23 13:17:34 EST 2006