Re: large xml database

From: Viktor Bojović
Subject: Re: large xml database
Date:
Msg-id: AANLkTi=Gg_uVB7B-tUNDH6HVs0oYxrBkYAzogMe05SUE@mail.gmail.com
In reply to: Re: large xml database (Lutz Steinborn <l.steinborn@4c-ag.de>)
List: pgsql-sql


On Sun, Oct 31, 2010 at 7:08 AM, Lutz Steinborn <l.steinborn@4c-ag.de> wrote:
On Sat, 30 Oct 2010 23:49:29 +0200
Viktor Bojović <viktor.bojovic@gmail.com> wrote:

>
> Many attempts have failed because 8 GB of RAM and 10 GB of swap were not
> enough. Also, sometimes I get an error that more than 2^32 operations were
> performed, and the functions stop working.
>
We have a similar problem, and we use the Amara XML toolkit for Python. To avoid
the big memory consumption, use pushbind. A 30 GB bme catalog file takes at most
about 20 minutes to import. It could be faster still: we are building complex
objects with an ORM, so the time consumed depends on how complex the catalog is.
If you use Amara only to convert the XML to CSV, the final import can be done
much faster.

regards

--
Lutz

http://www.4c-gmbh.de
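For reference, here is a minimal sketch of the streaming XML-to-CSV conversion
Lutz describes, using Python's standard-library xml.etree.ElementTree.iterparse
in place of Amara's pushbind. The element name "entry", the fields "id" and
"name", and the file paths are hypothetical placeholders, not anything from this
thread:

    import csv
    import xml.etree.ElementTree as ET

    # Stream a huge XML file and emit one CSV row per record element,
    # keeping memory bounded by clearing the tree as records are consumed.
    # "entry", "id" and "name" are placeholders; adapt to the real schema.
    def xml_to_csv(xml_path, csv_path, record_tag="entry"):
        with open(csv_path, "w", newline="") as out:
            writer = csv.writer(out)
            writer.writerow(["id", "name"])  # header row
            context = ET.iterparse(xml_path, events=("start", "end"))
            _, root = next(context)  # grab the root so we can clear it
            for event, elem in context:
                if event == "end" and elem.tag == record_tag:
                    writer.writerow([elem.get("id"), elem.findtext("name")])
                    root.clear()  # drop processed records from memory

    xml_to_csv("catalog.xml", "catalog.csv")

The resulting CSV can then be bulk-loaded with PostgreSQL's COPY, e.g.
COPY catalog FROM '/path/to/catalog.csv' WITH CSV HEADER; (assuming a matching
catalog table exists), which is much faster than row-by-row INSERTs.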


Thanks, Lutz. I will try Amara, and I will also try parsing the file with SAX.
I have tried twig and some other parsers, but they consumed too much RAM.
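
A SAX version of the same conversion would look roughly like the sketch below,
using Python's standard-library xml.sax; as above, the tag and field names
("entry", "id", "name") and file paths are hypothetical placeholders:

    import csv
    import xml.sax

    # Minimal SAX handler that streams records straight to CSV without
    # ever building a tree, so memory use stays flat for any file size.
    class CatalogHandler(xml.sax.ContentHandler):
        def __init__(self, writer):
            super().__init__()
            self.writer = writer
            self.in_name = False
            self.current_id = None
            self.buffer = []

        def startElement(self, name, attrs):
            if name == "entry":
                self.current_id = attrs.get("id")
            elif name == "name":
                self.in_name = True
                self.buffer = []

        def characters(self, content):
            if self.in_name:
                self.buffer.append(content)

        def endElement(self, name):
            if name == "name":
                self.in_name = False
            elif name == "entry":
                # one completed record -> one CSV row
                self.writer.writerow([self.current_id, "".join(self.buffer)])

    with open("catalog.csv", "w", newline="") as out:
        writer = csv.writer(out)
        writer.writerow(["id", "name"])
        xml.sax.parse("catalog.xml", CatalogHandler(writer))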



--
---------------------------------------
Viktor Bojović
---------------------------------------
Wherever I go, Murphy goes with me
