Le maanantaina 1. heinäkuuta 2024, 19.09.01 EEST flow gg a écrit :
> I reviewed it again, the purpose of is_w is to limit lmul to a maximum of
> 1/4 of vlen,

1/4 of vlen? Do you mean limit to EMUL=1 for EEW=32 and EMUL=1/4 for EEW=8?

Limiting LMUL to less than 1 at maximum EEW is useless from a functional 
standpoint, since fractional registers cannot be addressed individually. (Of 
course it might still be useful for performance reasons.)

> to prevent vector register shortage, which can also be
> considered as vset limiting lmul. I renamed it to quarter_len_limit.

TBH, I don't really understand.

If a lower LMUL limit is reached, then specialisations for the corresponding 
VLEN are simply unncessary/infeasible and the code for lower VLEN should be 
used.

If a higher LMUL limit is reached due to register pressure (or the 8 hard 
limit), then the given VLEN cannot be supported at all, or requires some 
completely different code.

Either way, I don't really follow why vsetvlfixed macros need to be involved.

-- 
雷米‧德尼-库尔蒙
http://www.remlab.net/



_______________________________________________
ffmpeg-devel mailing list
ffmpeg-devel@ffmpeg.org
https://ffmpeg.org/mailman/listinfo/ffmpeg-devel

To unsubscribe, visit link above, or email
ffmpeg-devel-requ...@ffmpeg.org with subject "unsubscribe".

Reply via email to