Re: Large files for relations

Поиск
Список
Период
Сортировка
От Dagfinn Ilmari Mannsåker
Тема Re: Large files for relations
Дата
Msg-id 87jzxdu6dn.fsf@wibble.ilmari.org
обсуждение исходный текст
Ответ на Large files for relations  (Thomas Munro <thomas.munro@gmail.com>)
Ответы Re: Large files for relations
Список pgsql-hackers
Thomas Munro <thomas.munro@gmail.com> writes:

> On Fri, May 12, 2023 at 8:16 AM Jim Mlodgenski <jimmy76@gmail.com> wrote:
>> On Mon, May 1, 2023 at 9:29 PM Thomas Munro <thomas.munro@gmail.com> wrote:
>>> I am not aware of any modern/non-historic filesystem[2] that can't do
>>> large files with ease.  Anyone know of anything to worry about on that
>>> front?
>>
>> There is some trouble in the ambiguity of what we mean by "modern" and
>> "large files". There are still a large number of users of ext4 where
>> the max file size is 16TB. Switching to a single large file per
>> relation would effectively cut the max table size in half for those
>> users. How would a user with say a 20TB table running on ext4 be
>> impacted by this change?
[…]
> A less aggressive version of the plan would be that we just keep the
> segment code for the foreseeable future with no planned cut off, and
> we make all of those "piggy back" transformations that I showed in the
> patch set optional.  For example, I had it so that CLUSTER would
> quietly convert your relation to large format, if it was still in
> segmented format (might as well if you're writing all the data out
> anyway, right?), but perhaps that could depend on a GUC.  Likewise for
> base backup.  Etc.  Then someone concerned about hitting the 16TB
> limit on ext4 could opt out.  Or something like that.  It seems funny
> though, that's exactly the user who should want this feature (they
> have 16,000 relation segment files).

If we're going to have to keep the segment code for the foreseeable
future anyway, could we not get most of the benefit by increasing the
segment size to something like 1TB?  The vast majority of tables would
fit in one file, and there would be less risk of hitting filesystem
limits.

- ilmari



В списке pgsql-hackers по дате отправления:

Предыдущее
От: Alexey Gordeev
Дата:
Сообщение: Re: Orphaned files in base/[oid]
Следующее
От: Robert Haas
Дата:
Сообщение: Re: v16 regression - wrong query results with LEFT JOINs + join removal