Re: Wich hardware suits best for large full-text indexed
From | Oleg Bartunov |
---|---|
Subject | Re: Wich hardware suits best for large full-text indexed |
Date | |
Msg-id | Pine.GSO.4.58.0404011401120.11543@ra.sai.msu.su |
In response to | Re: Wich hardware suits best for large full-text indexed (Diogo Biazus <diogo@ikono.com.br>) |
List | pgsql-general |
On Wed, 31 Mar 2004, Diogo Biazus wrote:

> Oleg Bartunov wrote:
>
> >On Tue, 30 Mar 2004, Diogo Biazus wrote:
> >
> >>Hi folks,
> >>
> >>I have a database using tsearch2 to index 300 000 documents.
> >>I've already optimized the queries, and the database is vacuumed on
> >>a daily basis.
> >>The stat function tells me that my index has approx. 460 000 unique words
> >>(I'm using a stemmer and a nice stopword list).
> >
> >460 000 unique words is a lot! Have you looked at them? Sometimes it's
> >very useful to analyze what you have indexed and whether you want all of it.
> >I suggest you use an ispell dictionary and, if you index numbers
> >(look at the statistics), use special dictionaries for integer and decimal numbers:
> >http://www.sai.msu.su/~megera/postgres/gist/tsearch/V2/dicts/README.intdict
> >
> I'll try the ispell dictionaries and the dicts for numbers too ;)
> Could the synonym dictionary help me with this (reducing unique words)?

Why not? It is useful for words that are not stemmed correctly.

> thanks,

Regards,
Oleg
_____________________________________________________________
Oleg Bartunov, sci.researcher, hostmaster of AstroNet,
Sternberg Astronomical Institute, Moscow University (Russia)
Internet: oleg@sai.msu.su, http://www.sai.msu.su/~megera/
phone: +007(095)939-16-83, +007(095)939-23-83
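
To illustrate the point about the synonym dictionary, here is a rough Python sketch. It is not tsearch2 code. The stemmer `naive_stem`, the table `SYNONYMS`, and the word list are all hypothetical stand-ins, and it assumes the synonym lookup is consulted before the stemmer. It only shows why mapping badly stemmed words to one canonical form lowers the unique-word count.

```python
# Hypothetical synonym table: irregular forms that a suffix stemmer misses.
SYNONYMS = {
    "mice": "mouse",
    "geese": "goose",
    "went": "go",
}

def naive_stem(word):
    """Toy stand-in for a real stemmer: strips a trailing 's' only."""
    if len(word) > 3 and word.endswith("s"):
        return word[:-1]
    return word

def normalize(word, synonyms=None):
    """Lowercase, try the synonym table first, then fall back to stemming."""
    word = word.lower()
    if synonyms and word in synonyms:
        return synonyms[word]
    return naive_stem(word)

def unique_words(words, synonyms=None):
    """Return the set of distinct normalized words."""
    return {normalize(w, synonyms) for w in words}

words = ["mouse", "mice", "goose", "geese", "go", "went", "cats", "cat"]
without_synonyms = unique_words(words)
with_synonyms = unique_words(words, SYNONYMS)
print(len(without_synonyms), len(with_synonyms))
```

In this toy list the stemmer alone merges only "cats" and "cat", while the synonym table also merges the irregular pairs. How much a real index shrinks depends on how many such words it actually contains.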