Re: What is the simpliest text search configuration?
От | Michael Nacos |
---|---|
Тема | Re: What is the simpliest text search configuration? |
Дата | |
Msg-id | 407fa4640911120540x57565291r197d655f0f228bce@mail.gmail.com обсуждение исходный текст |
Ответ на | What is the simpliest text search configuration? (Jérôme Etévé <jerome.eteve@gmail.com>) |
Ответы |
Re: What is the simpliest text search configuration?
|
Список | pgsql-general |
Dear Jerome,
from personal experience full-text searching in PostgreSQL can be quite powerful
but it's not simple, it requires thought, planning and coding. PostgreSQL mainly
provides an efficient token matching mechanism supporting positional information
and weights, but natural language processing and normalization is pretty basic.
If you don't mind writing a couple of user-defined functions to take control of lexeme
normalization, then tsvector/tsquery support can be a very powerful tool for custom
search engines.
regards,
Michael
from personal experience full-text searching in PostgreSQL can be quite powerful
but it's not simple, it requires thought, planning and coding. PostgreSQL mainly
provides an efficient token matching mechanism supporting positional information
and weights, but natural language processing and normalization is pretty basic.
If you don't mind writing a couple of user-defined functions to take control of lexeme
normalization, then tsvector/tsquery support can be a very powerful tool for custom
search engines.
regards,
Michael
2009/11/12 Jérôme Etévé <jerome.eteve@gmail.com>
Hi all,
I'd like to implement a full text search with postgresql, and I can't find
a text search configuration that would just:
map unicode accentuated letters to an un-accentuated equivalent
tokenize the words (and skip any non word characters)
no stopwords
lower case the tokens
How can I achieve this? I'm particularly interested in deactivating
the stopwords filtering.
I tried pg_catalog.simple, but despite its name, it still considers stop words.
Thanks for your help!
Jerome.
В списке pgsql-general по дате отправления: