Re: Clarification of the "simple" dictionary
От | Oleg Bartunov |
---|---|
Тема | Re: Clarification of the "simple" dictionary |
Дата | |
Msg-id | Pine.LNX.4.64.1007222327270.32129@sn.sai.msu.ru обсуждение исходный текст |
Ответ на | Re: Clarification of the "simple" dictionary (Andreas Joseph Krogh <andreak@officenet.no>) |
Список | pgsql-general |
Andreas, I'd create myself copy of dictionary to be independent on system changes. Oleg On Thu, 22 Jul 2010, Andreas Joseph Krogh wrote: > On 07/22/2010 07:44 PM, Oleg Bartunov wrote: >> Don't guess, but read docs >> http://www.postgresql.org/docs/8.4/interactive/textsearch-dictionaries.html#TEXTSEARCH-SIMPLE-DICTIONARY >> >> 12.6.2. Simple Dictionary >> >> The simple dictionary template operates by converting the input token to >> lower case and checking it against a file of stop words. If it is found in >> the file then an empty array is returned, causing the token to be >> discarded. If not, the lower-cased form of the word is returned as the >> normalized lexeme. Alternatively, the dictionary can be configured to >> report non-stop-words as unrecognized, allowing them to be passed on to the >> next dictionary in the list. >> >> d=# \dFd+ simple >> List of text search dictionaries >> Schema | Name | Template | Init options | >> Description >> ------------+--------+-------------------+--------------+----------------------------------------------------------- >> pg_catalog | simple | pg_catalog.simple | | simple >> dictionary: just lower case and check for stopword >> >> By default it has no Init options, so it doesn't check for stopwords. > > Guess what - I *have* read the docs which sais "...and checking it against a > file of stop words". What was unclear to me was whether or not it was > configured with a stopwords-file or not as default, which is not the case I > understand from your reply. Very good, fits my needs like a glove:-) It might > be worth considering updating the docs to make this clearer? > > So - can we rely on "simple" to remain this way forever (no Init options) or > is it better to make a copy of it with the same properties as today? > > It seems "simple" + the unaccent dict. available in 9.0 saves my day, thanks > Mr. Bartunov. > > Regards, Oleg _____________________________________________________________ Oleg Bartunov, Research Scientist, Head of AstroNet (www.astronet.ru), Sternberg Astronomical Institute, Moscow University, Russia Internet: oleg@sai.msu.su, http://www.sai.msu.su/~megera/ phone: +007(495)939-16-83, +007(495)939-23-83
В списке pgsql-general по дате отправления: