tsearch2 for alphabetic character strings & codes
From | Ron Mayer |
---|---|
Subject | tsearch2 for alphabetic character strings & codes |
Date | |
Msg-id | Pine.LNX.4.58.0509231355220.1550@greenie.cheapcomplexdevices.com |
Replies | Re: tsearch2 for alphabetic character strings & codes |
List | pgsql-general |
I'm looking for a way to search for substrings within documents in a way very similar to tsearch2, but my strings are codes rather than alphabetic words, so I'm having a tough time using the current tsearch2 configurations with them.

For example, using tsearch to search for codes like '31.03(e)(2)(A)' in a set of documents is tricky because tsearch seems to treat most of the punctuation as word separators.

    fli=# select
    fli-#   to_tsvector('default','31.03(e)(2)(A)'),
    fli-#   to_tsvector('simple','31.03(e)(2)(A)');
          to_tsvector      |         to_tsvector
    -----------------------+-----------------------------
     '2':3 'e':2 '31.03':1 | '2':3 'a':4 'e':2 '31.03':1
    (1 row)

I see that tsearch2 allows different "configurations" that apparently differ in how they parse strings. I guess what I'm looking for is a "configuration" that's even simpler than 'simple': one that only breaks up strings on whitespace and doesn't use any natural language dictionaries.

I was hoping I could download or define such a configuration, but I didn't see any obvious documentation on how to set up my own.

Does this sound like a good approach (and if so, could someone please point me in the right direction), or are there other things I should be looking at?

Ron
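To make the requested behaviour concrete, here is a minimal sketch in Python of what a whitespace-only "configuration" might produce. It is not tsearch2 code and uses no tsearch2 API; the function names `whitespace_tsvector` and `format_tsvector` are hypothetical, and lowercasing each token is an assumption modelled on what the 'simple' output above appears to do.

```python
def whitespace_tsvector(text):
    """Map each whitespace-delimited token to its 1-based positions.

    Assumption: tokens are lowercased, as 'simple' appears to do above.
    Punctuation is left untouched, so a code stays one token.
    """
    positions = {}
    for pos, token in enumerate(text.split(), start=1):
        positions.setdefault(token.lower(), []).append(pos)
    return positions


def format_tsvector(positions):
    """Render the mapping in a tsvector-like 'token':pos,pos form."""
    return " ".join(
        "'{}':{}".format(token, ",".join(str(p) for p in plist))
        for token, plist in sorted(positions.items())
    )


if __name__ == "__main__":
    doc = "see 31.03(e)(2)(A) and 31.03(e)(2)(A) again"
    print(format_tsvector(whitespace_tsvector(doc)))
```

With this kind of splitting, '31.03(e)(2)(A)' survives as a single lexeme instead of being broken into '31.03', 'e', '2' and 'a', which is the property the question is after.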