amd64/cu128: links for matrix-output