Re: fulltext parser strange behave
От | Tom Lane |
---|---|
Тема | Re: fulltext parser strange behave |
Дата | |
Msg-id | 13471.1194634433@sss.pgh.pa.us обсуждение исходный текст |
Ответ на | Re: fulltext parser strange behave (Andrew Dunstan <andrew@dunslane.net>) |
Ответы |
Re: fulltext parser strange behave
|
Список | pgsql-hackers |
Andrew Dunstan <andrew@dunslane.net> writes: > I've just been looking at the state machine in wparser_def.c. I think > the processing for entities is also a few bob short in the pound. It > recognises decimal numeric character references, but nor hexadecimal > numeric character references. That's fairly silly since the HTML spec > specifically says the latter are "particularly useful". The rules for > named entities are also deficient w.r.t. digits, just like the case of > tags that Tom noticed. This isn't academic: HTML features a number of > named entities with digits in the name (sup2, frac14 for example). > In XML at least, legal names are defined by the following rules from the > spec: > ... > [A-Za-z:_][A-Za-z0-9:_.-]* > I suggest we use that or something very close to it as the rule for > names in these patterns. No objections here. Who wants to patch wparser_def? regards, tom lane
В списке pgsql-hackers по дате отправления: