ERROR: syntax error in tsquery - for high-unicode whitespace
| От | hubert depesz lubaczewski |
|---|---|
| Тема | ERROR: syntax error in tsquery - for high-unicode whitespace |
| Дата | |
| Msg-id | 20130315000838.GA12142@depesz.com обсуждение исходный текст |
| Ответы |
Re: ERROR: syntax error in tsquery - for high-unicode whitespace
|
| Список | pgsql-bugs |
hi
it was tested on 9.1 and 9.3. Interestingly - it worked without error in
8.2.
$ select to_tsquery('english', E'a\xe2\x80\x86a');
ERROR: syntax error in tsquery: "aâa"
the 3-byte utf8 character is SIX-PER-EM SPACE (based on info from
http://www.fileformat.info/info/unicode/char/2006/index.htm)
Not sure what should happen with it, but generally I thought that
whitespace characters will get ignored (treated as separators) when
building tsquery.
It seems to work that way when building tsvector though:
$ select to_tsvector('english', E'a\xe2\x80\x86a');
to_tsvector
-------------
(1 row)
and for larger example:
$ select to_tsvector('english', E'depesz\xe2\x80\x86whatever');
to_tsvector
-----------------------
'depesz':1 'whatev':2
(1 row)
$ select to_tsquery('english', E'depesz\xe2\x80\x86whatever');
ERROR: syntax error in tsquery: "depeszâwhatever"
Best regards,
depesz
--
The best thing about modern society is how easy it is to avoid contact with it.
http://depesz.com/
В списке pgsql-bugs по дате отправления: