Re: pg_trgm
От | Tatsuo Ishii |
---|---|
Тема | Re: pg_trgm |
Дата | |
Msg-id | 20100528.085439.93468861.t-ishii@sraoss.co.jp обсуждение исходный текст |
Ответ на | Re: pg_trgm (Peter Eisentraut <peter_e@gmx.net>) |
Ответы |
Re: pg_trgm
|
Список | pgsql-hackers |
> I think the problem at hand has nothing at all to do with agglutination > or CJK-specific issues. You will get the same problem with other > languages *if* you set a locale that does not adequately support the > characters in use. E.g., Russian with locale C and encoding UTF8: > > select similarity(E'\u0441\u043B\u043E\u043D', E'\u0441\u043B\u043E > \u043D\u044B'); > similarity > ──────────── > NaN > (1 row) Of course. That's why I started this thread. With my patch: test=# select similarity(E'\u0441\u043B\u043E\u043D', E'\u0441\u043B\u043E\u043D\u044B');similarity ------------ 0.75 (1 row) Or you could just #undef KEEPONLYALNUM in trgm.h. But I'm not sure this is the right thing for you. -- Tatsuo Ishii SRA OSS, Inc. Japan English: http://www.sraoss.co.jp/index_en.php Japanese: http://www.sraoss.co.jp
В списке pgsql-hackers по дате отправления: