Revision 7588,
861 bytes
checked in by adir, 12 years ago
Ticket #000 - Adding Solr search integration to the base to be inserted into the community
Apache Solr Language Identifier

Introduction
------------
This module is intended to be used while indexing documents.
It is implemented as an UpdateProcessor to be placed in an UpdateChain.
Its purpose is to identify the language of a document and tag the document with a language code.
The module can optionally map field names to their language-specific counterparts,
e.g. if the input is "title" and the language is detected as "en", map to "title_en".
Language may be detected globally for the document, and/or individually per field.
Language detector implementations are pluggable.
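
The tagging and field-mapping behaviour described above can be illustrated with a
small, self-contained Python sketch. This is not the module's code (the module is a
Solr UpdateProcessor) and it does not use any Solr API. The names identify_and_map
and toy_detector, and all parameter names, are hypothetical and chosen only for this
example. The stopword-counting detector is a stand-in for a real pluggable detector.

```python
from typing import Callable, Dict, List, Optional

# Hypothetical detector signature: text in, language code out (None if unsure).
Detector = Callable[[str], Optional[str]]


def toy_detector(text: str) -> Optional[str]:
    """Illustrative stand-in for a real detector: counts a few stopwords."""
    stopwords = {
        "en": {"the", "and", "is", "of"},
        "de": {"der", "und", "ist", "das"},
    }
    words = text.lower().split()
    scores = {lang: sum(w in sw for w in words) for lang, sw in stopwords.items()}
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else None


def identify_and_map(
    doc: Dict[str, str],
    fields: List[str],
    detect: Detector,
    lang_field: str = "language",
    map_fields: bool = True,
    per_field: bool = False,
    fallback: Optional[str] = None,
) -> Dict[str, str]:
    """Tag a document with a language code and optionally rename its fields."""
    out = dict(doc)

    # Global detection: run the detector over all input fields together.
    texts = [str(doc[f]) for f in fields if f in doc]
    doc_lang = detect(" ".join(texts)) or fallback
    if doc_lang is not None:
        out[lang_field] = doc_lang

    if map_fields:
        for f in fields:
            if f not in doc:
                continue
            # Per-field detection, falling back to the document language.
            lang = (detect(str(doc[f])) if per_field else None) or doc_lang
            if lang is not None:
                out[f + "_" + lang] = out.pop(f)  # e.g. "title" -> "title_en"
    return out
```

Because the detector is passed in as a plain callable, a different implementation
can be swapped in without touching the mapping logic, which mirrors the pluggable
design in spirit only.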

Getting Started
---------------
Please refer to the module documentation at http://wiki.apache.org/solr/LanguageDetection

Dependencies
------------
The Tika detector depends on Tika Core (which is part of the extraction contrib).
The Langdetect detector depends on the LangDetect library.