It's at c908 According to the benchmark results, if vlseg2e64 is used, the speed is almost as slow as C language (dcmul_add_rvv_f64: 86.2), if vsseg2e64 is used, it will be only a bit slower (dcmul_add_rvv_f64: 50.2).
Rémi Denis-Courmont <r...@remlab.net> 于2023年12月22日周五 04:52写道: > Le tiistaina 19. joulukuuta 2023, 4.53.12 EET flow gg a écrit : > > c908: > > dcmul_add_c: 88.0 > > dcmul_add_rvv_f64: 46.2 > > > > Did not use vlseg2e64, because it is much slower than vlse64 > > Did not use vsseg2e64, because it is slightly slower than vsse64 > > Is this about C910 or C908? I have not checked this specific function, but > the > general understanding for C908 has been the exact opposite so far, i.e. > segmented accesses are fast, while strided accesses are (unsurprisingly) > slow. > > See also > https://camel-cdr.github.io/rvv-bench-results/canmv_k230/index.html > > -- > レミ・デニ-クールモン > http://www.remlab.net/ > > > > _______________________________________________ > ffmpeg-devel mailing list > ffmpeg-devel@ffmpeg.org > https://ffmpeg.org/mailman/listinfo/ffmpeg-devel > > To unsubscribe, visit link above, or email > ffmpeg-devel-requ...@ffmpeg.org with subject "unsubscribe". > _______________________________________________ ffmpeg-devel mailing list ffmpeg-devel@ffmpeg.org https://ffmpeg.org/mailman/listinfo/ffmpeg-devel To unsubscribe, visit link above, or email ffmpeg-devel-requ...@ffmpeg.org with subject "unsubscribe".