Re: Bunch of tsearch fixes and cleanup
From | Tom Lane |
---|---|
Subject | Re: Bunch of tsearch fixes and cleanup |
Date | |
Msg-id | 1618.1187880567@sss.pgh.pa.us |
In reply to | Bunch of tsearch fixes and cleanup ("Heikki Linnakangas" <heikki@enterprisedb.com>) |
Responses | Re: Bunch of tsearch fixes and cleanup; Re: Bunch of tsearch fixes and cleanup; Re: Bunch of tsearch fixes and cleanup |
List | pgsql-patches |
"Heikki Linnakangas" <heikki@enterprisedb.com> writes:
> - readstopwords calls recode_and_lowerstr directly, instead of using the
> "wordop" function pointer in StopList struct. All callers used
> recode_and_lowerstr anyway, so this simplifies the code a little bit. Is
> there any external dictionary implementations that would require
> different behavior?

I don't think eliminating wordop altogether is such a hot idea; some
dictionary could possibly want to do different processing than that.

Something that was annoying me yesterday was that it was not clear
whether we had fixed every single place that uses a tsearch config file
to assume that the file is in UTF8 and should be converted to database
encoding. So I was thinking of hardwiring the "recode" part into
readstopwords, and using wordop just for the "lowercase" part, which
seemed to me like a saner division of labor. That is, UTF8 is a policy
that we want to enforce globally, but lowercasing maybe not, and this
still leaves the door open for more processing besides lowercasing.

Oleg, Teodor, what do you think about this?

			regards, tom lane
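
The division of labor proposed above could be sketched roughly as follows. This is a minimal standalone illustration, not the actual PostgreSQL code: `StopListSketch`, `recode_to_db_encoding`, `lowercase_word`, and `load_stopwords` are hypothetical names, the recode step is a plain copy standing in for a real UTF8-to-database-encoding conversion, and the "file" is an in-memory array of lines. The point is only that recoding happens unconditionally in the loader, while the optional `wordop` hook handles lowercasing or any other per-dictionary processing.

```c
#include <ctype.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical stand-in for the UTF8 -> database encoding conversion.
 * It only copies the string; a real server would transcode here. */
static char *recode_to_db_encoding(const char *utf8)
{
    size_t len = strlen(utf8) + 1;
    char  *copy = malloc(len);

    if (copy != NULL)
        memcpy(copy, utf8, len);
    return copy;
}

/* Per-dictionary hook: processes an already-recoded word in place. */
typedef void (*wordop_fn) (char *word);

/* Hypothetical, simplified stop list (not the real StopList struct). */
typedef struct
{
    int        len;     /* number of stop words loaded */
    char     **stop;    /* the stop words themselves */
    wordop_fn  wordop;  /* optional; NULL means no extra processing */
} StopListSketch;

/* One possible wordop: ASCII-only lowercasing, for illustration. */
static void lowercase_word(char *word)
{
    for (; *word != '\0'; word++)
        *word = (char) tolower((unsigned char) *word);
}

/* Load stop words from an array of lines (standing in for a config file).
 * Recoding is hardwired; wordop is applied only if the caller set one.
 * Returns 0 on success, -1 on allocation failure. */
static int load_stopwords(StopListSketch *s, const char *const *lines, int nlines)
{
    s->len = 0;
    s->stop = calloc(nlines > 0 ? (size_t) nlines : 1, sizeof(char *));
    if (s->stop == NULL)
        return -1;

    for (int i = 0; i < nlines; i++)
    {
        char *word = recode_to_db_encoding(lines[i]);  /* global policy */

        if (word == NULL)
            return -1;
        if (s->wordop != NULL)
            s->wordop(word);                           /* dictionary policy */
        s->stop[s->len++] = word;
    }
    return 0;
}
```

A dictionary that wants lowercased stop words would set `wordop` to `lowercase_word` before calling `load_stopwords`; one that needs the words untouched after recoding would leave `wordop` as NULL, and one with other needs could supply its own function.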