Revision 7588,
861 bytes
checked in by adir, 11 years ago
Ticket #000 - Adding Solr search integration to the base to be inserted into the community

Apache Solr Language Identifier


Introduction
------------
This module is intended to be used while indexing documents.
It is implemented as an UpdateProcessor to be placed in an UpdateChain.
Its purpose is to identify the language of each document and tag the document with a language code.
The module can optionally map field names to their language-specific counterparts,
e.g. if the input is "title" and the language is detected as "en", map to "title_en".
Language may be detected globally for the document, and/or individually per field.
Language detector implementations are pluggable.
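
The tagging and field mapping described above can be illustrated with a small
Python sketch. This is not the module's code or Solr's API: the names
tag_and_map, toy_detector, lang_field and per_field are hypothetical, and the
detector is a toy stand-in that only shows where a pluggable implementation
would fit.

```python
from typing import Callable, Dict, Iterable

# Hypothetical detector signature: text in, language code out.
Detector = Callable[[str], str]


def toy_detector(text: str) -> str:
    """Stand-in detector for illustration only; not a real language model."""
    words = set(text.lower().split())
    return "pt" if words & {"uma", "não", "para"} else "en"


def tag_and_map(doc: Dict[str, str], fields: Iterable[str], detect: Detector,
                lang_field: str = "language",
                per_field: bool = False) -> Dict[str, str]:
    """Tag a document with a language code and rename fields to <name>_<lang>."""
    present = [f for f in fields if f in doc]
    # Copy every field that is not subject to mapping unchanged.
    out = {k: v for k, v in doc.items() if k not in present}
    # Global detection: one language for all selected fields combined.
    doc_lang = detect(" ".join(doc[f] for f in present))
    out[lang_field] = doc_lang
    for f in present:
        # Per-field detection overrides the document language if requested.
        lang = detect(doc[f]) if per_field else doc_lang
        out[f"{f}_{lang}"] = doc[f]
    return out


doc = {"id": "1", "title": "Searching with Solr"}
mapped = tag_and_map(doc, ["title"], toy_detector)
```

Under these assumptions, "title" is renamed to "title_en" and the document
gains a "language" field. Passing per_field=True would let each field carry
its own language suffix while the document-level tag stays global.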

Getting Started
---------------
Please refer to the module documentation at http://wiki.apache.org/solr/LanguageDetection

Dependencies
------------
The Tika detector depends on Tika Core (which is part of the extraction contrib).
The Langdetect detector depends on the LangDetect library.