Re: BUG #5075: Text Search parser does not identify xml tag when attribute name's contains underscore
От | Peter Eisentraut |
---|---|
Тема | Re: BUG #5075: Text Search parser does not identify xml tag when attribute name's contains underscore |
Дата | |
Msg-id | 1258293365.14314.28.camel@vanquo.pezone.net обсуждение исходный текст |
Ответ на | Re: BUG #5075: Text Search parser does not identify xml tag when attribute name's contains underscore (Euler Taveira de Oliveira <euler@timbira.com>) |
Ответы |
Re: BUG #5075: Text Search parser does not identify xml tag
when attribute name's contains underscore
|
Список | pgsql-bugs |
On ons, 2009-09-23 at 20:31 -0300, Euler Taveira de Oliveira wrote: > Marek Lewczuk escreveu: > > Please execute following example: > > select * from ts_debug('english', '<img width="182" height="120" > > align="right" style="margin: 0px 0px 5px 5px;" test_aa="26461"/>') > > > > As the result you will see, that <img/> is not identified as XML tag, but > > rather splitted as words, blank spaces etc. The reason for that is the fact, > > that last attribute "test_aa" contains underscore in its name - when the > > underscore is removed, then img tag is properly identified as XML tag. > > > > XML definition allows using underscore in tag and attribute names. > > > The problem is we already allow it in tag names but not in attribute names. So > the proper fix is to allow underscore when the state is TPS_InTag; according > to XML spec [1], the underscore is a valid character in attribute names. > > A possible downside is that we don't have underscores in HTML attribute names. > In this case, should it fail? I don't think so but... > > The problem exists in 8.3, 8.4 and HEAD. It is a trivial fix so I think there > isn't a problem to back-patch it. Fix committed to 8.3, 8.4, 8.5.
В списке pgsql-bugs по дате отправления: