Re: extracting the domain from an email address

Поиск

Список

Период

Сортировка

От	Rui DeSousa
Тема	Re: extracting the domain from an email address
Дата	16 января 2019 г. 21:28:08
Msg-id	8B3FC027-0DC2-4A7B-9511-F6A06824B1BE@crazybean.net обсуждение исходный текст
Ответ на	extracting the domain from an email address (Mark Steben <mark.steben@drivedominion.com>)
Список	pgsql-admin

Дерево обсуждения

On Jan 16, 2019, at 8:20 AM, Mark Steben <mark.steben@drivedominion.com> wrote:

Good morning,
I am searching for a better more efficient way to extract the domain portion of an email address. The two I have been using are very expensive.
The one extract I mostly use:  substring(email from '@(.*)$')
also has an index on the email column which is the full email address:
btree (email DESC) WHERE email::text = "substring"(email::text, '@(.*)$'::text) AND length(email::text) > 0,
The other extract:
  substr(e.email,(strpos(e.email, '@') + 1))
Currently has no index.

The referenced table has 72 million rows. The email column can be empty (ergo the length = 0 check. I am running postgresql 9.4.

Any insights/observations welcome.

Not sure of the exact problem you are trying to solve; is it just querying the table via domain name? Does the index that you created work?

A few things; the regex given does not get domain name for some valid emails addresses;

i.e. foo@bar@nowhere.com is a valid email; however, the regex expression would return bar@nowhere.com instead of nowhere.com.

Assuming the emails are valid:

create or replace function get_domainname(_value text)

returns text

as $$

begin

_value := reverse(_value);

return nullif(reverse(substring(_value, 0, strpos(_value, '@'))), '');

end;

$$ language plpgsql

immutable returns null on null input

;

create index table_name_idx1 on table_name (get_domainname(email));

Then in the where clause:

where get_domainname(email) = 'nowhere.com'

p.s.

If you want to validate that the email address conforms to the RFC 2822 you can use this function which makes use of Perl’s Email::Address module. You would need to install plperl and Email::Address module.

create or replace function is_valid_email_address(eaddr text)

returns boolean

$body$

use Email::Address;

return Email::Address->parse($_[0]);

$body$

language 'plperlu' immutable

;

В списке pgsql-admin по дате отправления:

Предыдущее

От: Pepe TD Vo
Дата: 16 января 2019 г., 21:15:00
Сообщение: how to store data file in Postgres

Следующее

От: Achilleas Mantzios
Дата: 16 января 2019 г., 22:26:48
Сообщение: Re: Backup solution over unreliable network

Вход в личный кабинет

Восстановление пароля

Подтверждение аккаунта

Изменение пароля

Re: extracting the domain from an email address

Предыдущее

Следующее