Re: [FFmpeg-devel] RISC-V vector DSP functions: Motivation for commit 446b009

2024-07-06 Thread Rémi Denis-Courmont
(Updating an old thread) Le perjantaina 19. tammikuuta 2024, 19.14.02 EEST Rémi Denis-Courmont a écrit : > Hi, > > Le perjantaina 19. tammikuuta 2024, 17.30.00 EET Michael Platzer via ffmpeg- > devel a écrit : > > Commit 446b0090cbb66ee614dcf6ca79c78dc8eb7f0e37 by Remi Denis-Courmont has > >

Re: [FFmpeg-devel] RISC-V vector DSP functions: Motivation for commit 446b009

2024-01-23 Thread Rémi Denis-Courmont
Le tiistaina 23. tammikuuta 2024, 19.34.46 EET Michael Platzer via ffmpeg-devel a écrit : > I agree that the indexed and strided loads and stores are certainly slower > than unit-strided loads and stores. However, the vrgather instruction is > unlikely to be very performant either, unless the

Re: [FFmpeg-devel] RISC-V vector DSP functions: Motivation for commit 446b009

2024-01-23 Thread Michael Platzer via ffmpeg-devel
Hi Rémi, Thanks for your reply. > It was faster on what the best approximation of real hardware available at > the time, i.e. a Sipeed Lichee Pi4A board. There are no benchmarks in the > commit because I don't like to publish benchmarks collected from prototypes. > Nevertheless I think the

Re: [FFmpeg-devel] RISC-V vector DSP functions: Motivation for commit 446b009

2024-01-19 Thread Rémi Denis-Courmont
Hi, Le perjantaina 19. tammikuuta 2024, 17.30.00 EET Michael Platzer via ffmpeg- devel a écrit : > Commit 446b0090cbb66ee614dcf6ca79c78dc8eb7f0e37 by Remi Denis-Courmont has > replaced RISC-V vector loads and stores with negative stride with vrgather > (generalized permutation within vector