I'm going to put up a better description later. Right now I just want to get this up on the web. It works! (It took a while.) The source HTML was generated by the built-in arrayForth HTML generator.
To run the code, put the following at the top of block 1302:
0 node 200 load 1 node 204 load 2 node 206 load 3 node 210 load exit
Note that the multicore matrix example is in blocks 200 to 208 and uses nodes 0, 1 and 2. Block 210 does the same thing on a single core (node 3). The multicore example finishes in 1479 cycles and the single-core example takes 3040 cycles. I verified that the multiplications are correct using the statistical language R. At some point I plan to make a video explaining how this all works, but again, right now I just want to get the results up.
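To put the two cycle counts side by side, here is a rough Python sketch of the arithmetic, plus a plain reference matrix product of the kind that could be used to cross-check results. This is not the R check mentioned above and not the arrayForth code itself; the function names and the small 2x2 matrices are made up for illustration, and only the cycle counts and node count come from the numbers reported here.

```python
# Cycle counts reported above.
MULTICORE_CYCLES = 1479   # nodes 0, 1 and 2 (blocks 200 to 208)
SINGLECORE_CYCLES = 3040  # node 3 (block 210)
NODES_USED = 3


def speedup(single, multi):
    """How many times faster the multicore run is than the single-core run."""
    return single / multi


def efficiency(single, multi, nodes):
    """Speedup divided by the number of nodes doing the work."""
    return speedup(single, multi) / nodes


def matmul(a, b):
    """Plain reference matrix product (lists of rows), for cross-checking."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


if __name__ == "__main__":
    s = speedup(SINGLECORE_CYCLES, MULTICORE_CYCLES)
    e = efficiency(SINGLECORE_CYCLES, MULTICORE_CYCLES, NODES_USED)
    print(f"speedup: {s:.2f}x, efficiency on {NODES_USED} nodes: {e:.0%}")

    # Hypothetical example matrices, not the ones used in the blocks.
    a = [[1, 2], [3, 4]]
    b = [[5, 6], [7, 8]]
    print(matmul(a, b))
```

By this measure the three-node version is a little over twice as fast as the single-node version, so each node is doing useful work roughly two thirds of the time.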