Re: Wich hardware suits best for large full-text indexed
От | Diogo Biazus |
---|---|
Тема | Re: Wich hardware suits best for large full-text indexed |
Дата | |
Msg-id | 406AFD8A.8090305@ikono.com.br обсуждение исходный текст |
Ответ на | Re: Wich hardware suits best for large full-text indexed (Oleg Bartunov <oleg@sai.msu.su>) |
Ответы |
Re: Wich hardware suits best for large full-text indexed
|
Список | pgsql-general |
Oleg Bartunov wrote: >On Tue, 30 Mar 2004, Diogo Biazus wrote: > > > >>Hi folks, >> >>I have a database using tsearch2 to index 300 000 documents. >>I've already have optimized the queries, and the database is vacuumed on >>a daily basis. >>The stat function tells me that my index has aprox. 460 000 unique words >>(I'm using stemmer and a nice stopword list). >> >> > >460 000 unique words is a lot ! Have you seen on them ? Sometimes it's >very useful to analyze what did you indexed and do you want all of them. >I suggest you to use ispell dictionary and, if you index numbers >(look statistics), use special dictionaries for integer and decimal numbers >http://www.sai.msu.su/~megera/postgres/gist/tsearch/V2/dicts/README.intdict > > I 'll try the ispell dictionaries and dicts for numbers too ;) Could the synonym dictionary help me on this (reducing unique words)? thanks, -- Diogo Biazus diogo@ikono.com.br http://www.ikono.com.br
В списке pgsql-general по дате отправления: