Re: [HACKERS] Announce: Search PostgreSQL related resources

Поиск
Список
Период
Сортировка
От Oleg Bartunov
Тема Re: [HACKERS] Announce: Search PostgreSQL related resources
Дата
Msg-id Pine.GSO.4.58.0401052045540.3406@ra.sai.msu.su
обсуждение исходный текст
Ответ на Re: [HACKERS] Announce: Search PostgreSQL related resources  (Marek Lewczuk <newsy@lewczuk.com>)
Список pgsql-general
On Mon, 5 Jan 2004, Marek Lewczuk wrote:

> Dave Cramer wrote:
> > connection failed :(
> works for me... :-) (poland)
>

We have small downtime because of upgrading server software, so this may
be a reason for the problem. We're in stage of optimizing crawler because
some sites are very-very ugly, for example, our crawler have discovered
2 millions URLs on  http://ems-hitech.com/pgmanager/ ! 99.99 % of URLs are
just 404 (document not found), but server does return 200 code )\:)
So we have to explicitly exclude these pages. btw, archives.postgresql.org
doesn't returns modification date in header. This prevent crawler to
optimize downloading process. So, there are many problems, but we hope
soon we'll tune crawling process. I estimate average time to refresh index
about 1 week.

>
>

    Regards,
        Oleg
_____________________________________________________________
Oleg Bartunov, sci.researcher, hostmaster of AstroNet,
Sternberg Astronomical Institute, Moscow University (Russia)
Internet: oleg@sai.msu.su, http://www.sai.msu.su/~megera/
phone: +007(095)939-16-83, +007(095)939-23-83

В списке pgsql-general по дате отправления:

Предыдущее
От: Thomas Beutin
Дата:
Сообщение: Re: Slow Performance with 7.4.1
Следующее
От: Gregory Stone
Дата:
Сообщение: forking postmaster on my own - not as pguser