| Class Summary | |
|---|---|
| HTMLLanguageParser | Adds metadata identifying the language of the document, if found. We could also run statistical analysis here, but we'd miss all other formats. |
| LanguageIdentifier | Identifies the language of a piece of content, based on statistical analysis. |
| LanguageIndexingFilter | An IndexingFilter that adds a lang (language) field to the document. |
| LanguageQueryFilter | Handles "lang:" query clauses, causing them to search the "lang" field indexed by LanguageIdentifier. |
| NGramProfile | This class runs an n-gram analysis over submitted text; the results can be used for automatic language identification. |
Text document language identifier.
Language profiles are based on material from http://www.isi.edu/~koehn/europarl/.
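
As a rough illustration of the n-gram analysis mentioned in the class summary, the sketch below counts character n-grams in a text and ranks them by frequency. It is not the NGramProfile implementation: the class name `NGramSketch`, its methods, the `_` word-boundary padding, and the rank-based "out-of-place" distance are all assumptions chosen for the example, and the plugin's actual profile format and similarity measure may differ.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

/** Hypothetical sketch of character n-gram profiling; not part of this package. */
public class NGramSketch {

    /** Counts every character n-gram of length minLen..maxLen in the text. */
    public static Map<String, Integer> countNGrams(String text, int minLen, int maxLen) {
        Map<String, Integer> counts = new HashMap<>();
        // Split on anything that is not a letter, then pad each word with '_'
        // so that word boundaries are represented in the n-grams (an assumption).
        for (String word : text.toLowerCase(Locale.ROOT).split("[^\\p{L}]+")) {
            if (word.isEmpty()) {
                continue;
            }
            String padded = "_" + word + "_";
            for (int n = minLen; n <= maxLen; n++) {
                for (int i = 0; i + n <= padded.length(); i++) {
                    counts.merge(padded.substring(i, i + n), 1, Integer::sum);
                }
            }
        }
        return counts;
    }

    /** Returns up to limit n-grams, most frequent first, ties broken alphabetically. */
    public static List<String> rank(Map<String, Integer> counts, int limit) {
        List<String> grams = new ArrayList<>(counts.keySet());
        grams.sort((a, b) -> {
            int byCount = Integer.compare(counts.get(b), counts.get(a));
            return byCount != 0 ? byCount : a.compareTo(b);
        });
        return new ArrayList<>(grams.subList(0, Math.min(limit, grams.size())));
    }

    /** Sums rank differences between two ranked profiles; lower means more similar. */
    public static int distance(List<String> document, List<String> language) {
        int total = 0;
        for (int i = 0; i < document.size(); i++) {
            int j = language.indexOf(document.get(i));
            // An n-gram missing from the language profile gets the maximum penalty.
            total += (j < 0) ? language.size() : Math.abs(i - j);
        }
        return total;
    }

    public static void main(String[] args) {
        List<String> profile = rank(countNGrams("the quick brown fox", 1, 3), 10);
        System.out.println(profile);
    }
}
```

Under these assumptions, a document would be assigned the language whose ranked profile has the smallest distance to the document's own ranked profile.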