Re: large document multiple regex

Поиск

Список

Период

Сортировка

От	Jim Nasby
Тема	Re: large document multiple regex
Дата	2 февраля 2007 г. 12:41:28
Msg-id	BF7F6E71-0397-4AB7-B138-9A88B98A8B99@decibel.org обсуждение исходный текст
Ответ на	large document multiple regex ("Merlin Moncure" <mmoncure@gmail.com>)
Ответы	Re: large document multiple regex
Список	pgsql-general

Дерево обсуждения

On Jan 26, 2007, at 9:06 AM, Merlin Moncure wrote:
> I am receiving a large (300k+_ document from an external agent and
> need to reduce a few interesting bits of data out of the document on
> an insert trigger into separate fields.
>
> regex seems one way to handle this but is there any way to avoid
> rescanning the document for each regex.  One solution I am kicking
> around is some C hackery but then I lose the expressive power of
> regex.  Ideally, I need to be able to scan some text and return a
> comma delimited string of values extracted from it.  Does anybody know
> if this is possible or have any other suggestions?

Have you thought about something like ~ '(first_string|second_string|
third_string)'? Obviously your example would be more complex, but I
believe that with careful crafting, you can get regex to do a lot
without resorting to multiple passes.
--
Jim Nasby                                            jim@nasby.net
EnterpriseDB      http://enterprisedb.com      512.569.9461 (cell)

В списке pgsql-general по дате отправления:

Вход в личный кабинет

Восстановление пароля

Подтверждение аккаунта

Изменение пароля

Re: large document multiple regex