Re: Make COPY extendable in order to support Parquet and other formats

Поиск
Список
Период
Сортировка
От Andrew Dunstan
Тема Re: Make COPY extendable in order to support Parquet and other formats
Дата
Msg-id cad1cec1-a148-c488-bf51-5821cc1a9b16@dunslane.net
обсуждение исходный текст
Ответ на Re: Make COPY extendable in order to support Parquet and other formats  (Andres Freund <andres@anarazel.de>)
Ответы Re: Make COPY extendable in order to support Parquet and other formats  (Aleksander Alekseev <aleksander@timescale.com>)
Список pgsql-hackers
On 2022-06-23 Th 21:45, Andres Freund wrote:
> Hi,
>
> On 2022-06-23 11:38:29 +0300, Aleksander Alekseev wrote:
>>> I know little about parquet - can it support FROM STDIN efficiently?
>> Parquet is a compressed binary format with data grouped by columns
>> [1]. I wouldn't assume that this is a primary use case for this
>> particular format.
> IMO decent COPY FROM / TO STDIN support is crucial, because otherwise you
> can't do COPY from/to a client. Which would make the feature unusable for
> anybody not superuser, including just about all users of hosted PG.
>

+1


Note that Parquet puts the metadata at the end of each file, which makes
it nice to write but somewhat unfriendly for streaming readers, which
would have to accumulate the whole file in order to process it.


cheers


andrew


--
Andrew Dunstan
EDB: https://www.enterprisedb.com




В списке pgsql-hackers по дате отправления:

Предыдущее
От: Tom Lane
Дата:
Сообщение: Re: Unify DLSUFFIX on Darwin
Следующее
От: Matthias van de Meent
Дата:
Сообщение: Pre-installed index access methods cannot be manually installed.