Suppose I split this bug (i.e., file a new bug) into the Intel-acceleration part and the fork-join part.
Does that makes this a hair easier? It will still contain the assembly language, and I am still attempting to get anywhere at all on Windows (our official instructions don't work, largely because they seem to depend on a specific version of DirectX from Microsoft that is no longer available for download). This would also make the bug filers somewhat happy, since they were specifically interested in the nifty instruction. No love for Sparc or Vintage Intel, but oh well. David