Re: Speed up COPY FROM text/CSV parsing using SIMD
От | Nazir Bilal Yavuz |
---|---|
Тема | Re: Speed up COPY FROM text/CSV parsing using SIMD |
Дата | |
Msg-id | CAN55FZ2AxiwSah7TiQoMB==r=JKT0bOtooCB7ov4xRrGkVmJ1A@mail.gmail.com обсуждение исходный текст |
Ответ на | Re: Speed up COPY FROM text/CSV parsing using SIMD (Nazir Bilal Yavuz <byavuz81@gmail.com>) |
Ответы |
Re: Speed up COPY FROM text/CSV parsing using SIMD
|
Список | pgsql-hackers |
Hi, On Thu, 7 Aug 2025 at 14:15, Nazir Bilal Yavuz <byavuz81@gmail.com> wrote: > > On Thu, 7 Aug 2025 at 04:49, Shinya Kato <shinya11.kato@gmail.com> wrote: > > > > I have implemented SIMD optimization for the COPY FROM (FORMAT {csv, > > text}) command and observed approximately a 5% performance > > improvement. Please see the detailed test results below. > > Also, I did a benchmark on text format. I created a benchmark for line > length in a table being from 1 byte to 1 megabyte.The peak improvement > is line length being 4096 and the improvement is more than 20% [1], I > saw no regression on your patch. I did the same benchmark for the CSV format. The peak improvement is line length being 4096 and the improvement is more than 25% [1]. I saw a 5% regression on the 1 byte benchmark, there are no other regressions. > What do you think about adding SIMD to CopyReadAttributesText() and > CopyReadAttributesCSV() functions? When I add your SIMD approach to > CopyReadAttributesText() function, the improvement on the 4096 byte > line length input [1] goes from 20% to 30%. I wanted to try using SIMD in CopyReadAttributesCSV() as well. The improvement on the 4096 byte line length input [1] goes from 25% to 35%, the regression on the 1 byte input is the same. CopyReadAttributesCSV() changes are attached as feedback v2. -- Regards, Nazir Bilal Yavuz Microsoft
Вложения
В списке pgsql-hackers по дате отправления: