[FFmpeg-devel] [PATCH 1/3] riscv: add CPU flags for the RISC-V Vector extension

2022-09-03 Thread remi
From: Rémi Denis-Courmont RVV defines a total of 12 different extensions: V, Zvl32b, Zvl64b, Zvl128b, Zvl256b, Zvl512b, Zvl1024b, Zve32x, Zve32f, Zve64x, Zve64f and Zve64d. At this stage, we don't care about the vector length extensions Zvl*, as most or all optimisations will be running in a loo
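As a rough illustration of why one flag per Zve* capability can be enough, here is a minimal sketch of how the element-type extensions nest, following the RVV 1.0 specification. The `EX_RVV_*` names and the `ex_rvv_flags()` helper are hypothetical and are not taken from the patch; the Zvl* extensions deliberately map to no flag, matching the message's stated scope.

```c
#include <assert.h>
#include <string.h>

/* Hypothetical flag names, for illustration only (not from the patch). */
enum {
    EX_RVV_I32 = 1 << 0, /* integer elements up to 32 bits (Zve32x) */
    EX_RVV_F32 = 1 << 1, /* single-precision float elements (Zve32f) */
    EX_RVV_I64 = 1 << 2, /* integer elements up to 64 bits (Zve64x) */
    EX_RVV_F64 = 1 << 3, /* double-precision float elements (Zve64d) */
};

/* Map a lowercase extension name to the set of flags it implies.
 * Returns 0 for names this sketch does not track (e.g. Zvl*). */
static unsigned ex_rvv_flags(const char *ext)
{
    assert(ext != NULL);
    if (!strcmp(ext, "zve32x"))
        return EX_RVV_I32;
    if (!strcmp(ext, "zve32f"))
        return EX_RVV_I32 | EX_RVV_F32;
    if (!strcmp(ext, "zve64x"))
        return EX_RVV_I32 | EX_RVV_I64;
    if (!strcmp(ext, "zve64f"))
        return EX_RVV_I32 | EX_RVV_F32 | EX_RVV_I64;
    /* The full V extension includes everything Zve64d provides. */
    if (!strcmp(ext, "zve64d") || !strcmp(ext, "v"))
        return EX_RVV_I32 | EX_RVV_F32 | EX_RVV_I64 | EX_RVV_F64;
    return 0;
}
```

Under these assumptions, a caller would test a single bit (for example `EX_RVV_F32`) rather than compare extension names.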

Re: [FFmpeg-devel] [PATCH 1/3] riscv: add CPU flags for the RISC-V Vector extension

2022-09-03 Thread Rémi Denis-Courmont
On Saturday, 3 September 2022, 22:01:45 EEST, r...@remlab.net wrote: > +#define ZVE_UP_TO(cap) ((2 * (cap)) - 1) Stray code. Ignore. -- Rémi Denis-Courmont http://www.remlab.net/

Re: [FFmpeg-devel] [PATCH 1/3] riscv: add CPU flags for the RISC-V Vector extension

2022-09-03 Thread Lynne
Sep 3, 2022, 21:01 by r...@remlab.net: > From: Rémi Denis-Courmont > > RVV defines a total of 12 different extensions: V, Zvl32b, Zvl64b, > Zvl128b, Zvl256b, Zvl512b, Zvl1024b, Zve32x, Zve32f, Zve64x, Zve64f and > Zve64d. > > At this stage, we don't care about the vector length extensions Zvl*, >

Re: [FFmpeg-devel] [PATCH 1/3] riscv: add CPU flags for the RISC-V Vector extension

2022-09-03 Thread Rémi Denis-Courmont
On Saturday, 3 September 2022, 22:20:20 EEST, Lynne wrote: > Sep 3, 2022, 21:01 by r...@remlab.net: > > From: Rémi Denis-Courmont > > > > RVV defines a total of 12 different extensions: V, Zvl32b, Zvl64b, > > Zvl128b, Zvl256b, Zvl512b, Zvl1024b, Zve32x, Zve32f, Zve64x, Zve64f and > > Zve64d.

Re: [FFmpeg-devel] [PATCH 1/3] riscv: add CPU flags for the RISC-V Vector extension

2022-09-03 Thread Lynne
Sep 3, 2022, 21:59 by r...@remlab.net: > On Saturday, 3 September 2022, 22:20:20 EEST, Lynne wrote: > >> Sep 3, 2022, 21:01 by r...@remlab.net: >> > From: Rémi Denis-Courmont >> > >> > RVV defines a total of 12 different extensions: V, Zvl32b, Zvl64b, >> > Zvl128b, Zvl256b, Zvl512b, Zvl1024b

Re: [FFmpeg-devel] [PATCH 1/3] riscv: add CPU flags for the RISC-V Vector extension

2022-09-03 Thread Rémi Denis-Courmont
On Sunday, 4 September 2022, 00:38:32 EEST, Lynne wrote: > I need to know the length in C, not assembly. There may be some corner cases where that makes sense, but typically it doesn't. Even if you're dealing in fixed-size macro blocks, you should leverage the larger vectors to unroll and

Re: [FFmpeg-devel] [PATCH 1/3] riscv: add CPU flags for the RISC-V Vector extension

2022-09-03 Thread Lynne
Sep 4, 2022, 07:41 by r...@remlab.net: > On Sunday, 4 September 2022, 00:38:32 EEST, Lynne wrote: > >> I need to know the length in C, not assembly. >> > > There may be some corner cases where that makes sense, but typically it > doesn't. Even if you're dealing in fixed-size macro blocks, y

Re: [FFmpeg-devel] [PATCH 1/3] riscv: add CPU flags for the RISC-V Vector extension

2022-09-04 Thread Rémi Denis-Courmont
On Sunday, 4 September 2022, 09:39:36 EEST, Lynne wrote: > In particular, doing the tail, which consists of 2 equal length transforms. > On AVX we interleave the coefficients from 2x4pt transforms during > lookups since we can do them simultaneously and save on > shuffles. Doing them individu