Re: Tsearch2 Dutch snowball stemmer in PG8.1
From | Oleg Bartunov |
---|---|
Subject | Re: Tsearch2 Dutch snowball stemmer in PG8.1 |
Date | |
Msg-id | Pine.LNX.4.64.0710031911200.3304@sn.sai.msu.ru |
In response to | Re: Tsearch2 Dutch snowball stemmer in PG8.1 (Alban Hertroys <a.hertroys@magproductions.nl>) |
List | pgsql-general |

On Wed, 3 Oct 2007, Alban Hertroys wrote:

> Alban Hertroys wrote:
>> The only odd thing is that to_tsvector('dutch', 'some dutch text') now
>> returns '|' for stop words...
>>
>> For example:
>> select to_tsvector('nederlands', 'De beste stuurlui staan aan wal');
>>                   to_tsvector
>> ------------------------------------------------
>>  '|':1,5 'bes':2 'wal':6 'staan':4 'stuurlui':3
>
> I found the cause. The stop words list I found contained comments
> prefixed by '|' signs. Removing the contents and recreating the database
> solved the problem. Just updating the reference didn't seem to help...

you need to recreate tsvector field and index, after changing any dicts.

>
> There's undoubtedly some cleaner way to replace the stop words list, but
> at the current stage of our project this was the simplest to achieve.
>

Regards,
Oleg
_____________________________________________________________
Oleg Bartunov, Research Scientist, Head of AstroNet (www.astronet.ru),
Sternberg Astronomical Institute, Moscow University, Russia
Internet: oleg@sai.msu.su, http://www.sai.msu.su/~megera/
phone: +007(495)939-16-83, +007(495)939-23-83
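
For readers hitting the same '|' lexeme: the cause described above is a stop word file that still carries '|' comments. The following is only a rough sketch of how such a file could be cleaned before handing it to tsearch2. It assumes the file uses the Snowball distribution's layout (a word, then an optional '|' followed by a comment) and that the dictionary wants one bare word per line. The function name `strip_snowball_comments` and the sample lines are made up for illustration; they are not part of tsearch2 or Snowball.

```python
def strip_snowball_comments(lines):
    """Return the stop words from Snowball-style lines, dropping '|' comments."""
    words = []
    for line in lines:
        # Assumption: everything from the first '|' to end of line is a comment.
        text = line.split("|", 1)[0]
        # A line may be blank or hold one or more whitespace-separated words.
        words.extend(text.split())
    return words


if __name__ == "__main__":
    # Hypothetical sample in the assumed Snowball layout.
    sample = [
        " | A Dutch stop word list (sample)",
        "",
        "de             |  the",
        "aan            |  on, to",
    ]
    # Emit one word per line, ready to be saved as the new stop word file.
    print("\n".join(strip_snowball_comments(sample)))
```

After swapping in the cleaned file, the tsvector column and its index still have to be rebuilt, as noted in the reply above.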