The data-mining project contains Peregrine and several supporting modules, e.g. for ontologies and datasets. Peregrine is an indexing engine or tagger: a piece of software that can be used to recognize concepts in human readable text, based on a database (thesaurus) of known terms. Multi-word terms are correctly recognized. If terms can represent multiple concepts, Peregrine will attempt to disambiguate them.