Text Mining and Information Retrieval
Project Investigator Professor Paul Watry
Scientific/Technical Objectives To enhance components of the National Centre for Text Mining (based at Manchester).
Role of NW-GRID Information retrieval over a defined text collection requires building the relevant semantic indices for that collection. This process is computationally intensive and can benefit from being run on a distributed compute platform such as NW-GRID.
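As a rough illustration of why index building suits a distributed platform, the sketch below splits a collection into shards, indexes each shard independently, and merges the partial results. It is a minimal example under stated assumptions, not the Cheshire implementation: it builds a plain term index rather than a semantic one, the function names (`index_shard`, `merge_indexes`, `build_index`) and the sample documents are hypothetical, and a local thread pool stands in for the grid's worker nodes.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor


def index_shard(shard):
    """Build a partial inverted index (term -> set of doc ids) for one shard."""
    partial = defaultdict(set)
    for doc_id, text in shard:
        for term in text.lower().split():
            partial[term].add(doc_id)
    return partial


def merge_indexes(partials):
    """Combine partial indexes by taking the union of each term's posting set."""
    merged = defaultdict(set)
    for partial in partials:
        for term, doc_ids in partial.items():
            merged[term] |= doc_ids
    return merged


def build_index(documents, n_shards=4):
    """Split documents into shards, index each shard independently, then merge."""
    shards = [documents[i::n_shards] for i in range(n_shards)]
    # Assumption: a local thread pool stands in for distributed workers.
    with ThreadPoolExecutor(max_workers=n_shards) as pool:
        partials = list(pool.map(index_shard, shards))
    return merge_indexes(partials)


# Hypothetical sample collection of (doc_id, text) pairs.
documents = [
    (1, "text mining over a grid"),
    (2, "information retrieval and text indexing"),
    (3, "grid workflow for indexing"),
]
index = build_index(documents)
print(sorted(index["text"]))  # [1, 2]
```

Because each shard is indexed without reference to the others, the per-shard step can be farmed out to separate nodes, and only the comparatively cheap merge has to run in one place.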
Applications Software Cheshire
Grid Software Kepler (workflow), SRB (data virtualisation), other data-grid components
Progress to date A prototype National Text Mining service is available at NaCTeM, the National Centre for Text Mining.
Publications
Cheshire 3 Framework White Paper: Implementing Support for Digital Repositories in a Data Grid Environment. Paul B. Watry and Ray R. Larson. This white paper on Cheshire3 was included in the International IEEE-Computer Society Symposium Mass Storage Systems and Technologies, June 19-24, 2005, Sardinia, Italy.