Re: [PR] [KVCache] Support KVCache decode from forked sequence and pop more tokens [tvm]

2024-05-20 Thread via GitHub
tqchen merged PR #16995: URL: https://github.com/apache/tvm/pull/16995 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@tvm.apache.or

[PR] [KVCache] Support KVCache decode from forked sequence and pop more tokens [tvm]

2024-05-13 Thread via GitHub
cyx-6 opened a new pull request, #16995: URL: https://github.com/apache/tvm/pull/16995 This PR enables the KVCache to decode from a forked sequence and to pop trailing tokens that span multiple blocks. cc: @tqchen @MasterJH5574
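
For readers unfamiliar with the two operations named in the PR description, here is a minimal toy sketch in Python. It is not TVM's KVCache API or implementation: the class `ToyPagedSequence` and its methods `fork` and `popn` are hypothetical names chosen for illustration, and the sketch assumes a sequence is stored as a list of fixed-size blocks. It only shows what "decode from a forked sequence" and "pop trailing tokens across several blocks" could mean at the data-structure level.

```python
class ToyPagedSequence:
    """Toy stand-in for one sequence in a paged KV cache (hypothetical, not TVM's API)."""

    def __init__(self, block_size=4):
        self.block_size = block_size
        self.blocks = []  # each block holds at most block_size tokens

    def append(self, token):
        # Open a new block when there is none yet or the last one is full.
        if not self.blocks or len(self.blocks[-1]) == self.block_size:
            self.blocks.append([])
        self.blocks[-1].append(token)

    def __len__(self):
        return sum(len(block) for block in self.blocks)

    def tokens(self):
        return [tok for block in self.blocks for tok in block]

    def fork(self, fork_pos=None):
        # The child starts with the first fork_pos tokens of the parent.
        # Assumption: a real paged cache would share blocks; this toy copies them.
        n = len(self) if fork_pos is None else fork_pos
        if not 0 <= n <= len(self):
            raise ValueError("fork position out of range")
        child = ToyPagedSequence(self.block_size)
        for tok in self.tokens()[:n]:
            child.append(tok)
        return child

    def popn(self, n):
        # Remove the last n tokens, walking backwards over as many blocks as needed.
        if not 0 <= n <= len(self):
            raise ValueError("cannot pop more tokens than the sequence holds")
        while n > 0:
            last = self.blocks[-1]
            take = min(n, len(last))
            del last[len(last) - take:]
            n -= take
            if not last:
                self.blocks.pop()  # drop a block once it is empty


parent = ToyPagedSequence(block_size=4)
for tok in range(10):
    parent.append(tok)           # blocks: [0..3], [4..7], [8, 9]

child = parent.fork(fork_pos=6)  # child holds tokens 0..5
child.append(100)                # "decode" one new token on the forked sequence

parent.popn(7)                   # removes tokens from three different blocks
```

In this toy, popping 7 tokens from the 10-token parent empties the last two blocks and trims the first one, while the forked child is unaffected and keeps growing independently.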