Re: Mnogosearch (Was: Re: website doc search is ... )
От | Marc G. Fournier |
---|---|
Тема | Re: Mnogosearch (Was: Re: website doc search is ... ) |
Дата | |
Msg-id | 20040101182954.D913@ganymede.hub.org обсуждение исходный текст |
Ответ на | Re: Mnogosearch (Was: Re: website doc search is ... ) (Tom Lane <tgl@sss.pgh.pa.us>) |
Список | pgsql-general |
On Thu, 1 Jan 2004, Tom Lane wrote: > "Marc G. Fournier" <scrappy@postgresql.org> writes: > > On Thu, 1 Jan 2004, Tom Lane wrote: > >> "Marc G. Fournier" <scrappy@postgresql.org> writes: > >>> what sort of impact does CLUSTER have on the system? For instance, an > >>> index happens nightly, so I'm guessing that I'll have to CLUSTER each > >>> right after? > >> > >> Depends; what does the "index" process do --- are ndict8 and friends > >> rebuilt from scratch? > > > nope, but heavily updated ... basically, the indexer looks at url for what > > urls need to be 're-indexed' ... if it does, it removed all words from the > > ndict# tables that belong to that url, and re-adds accordingly ... > > Hmm, but in practice only a small fraction of the pages on the site > change in any given day, no? I'd think the typical nightly run changes > only a small fraction of the entries in the tables, if it is smart > enough not to re-index pages that did not change. that is correct, and I further restrict it to 10000 URLs a night ... > My guess is that it'd be enough to re-cluster once a week or so. > > But this is pointless speculation until we find out whether clustering > helps enough to make it worth maintaining clustered-ness at all. Did > you get any results yet? Its doing the CLUSTERing right now ... will post results once finished ... ---- Marc G. Fournier Hub.Org Networking Services (http://www.hub.org) Email: scrappy@hub.org Yahoo!: yscrappy ICQ: 7615664
В списке pgsql-general по дате отправления: