在Tech Report (
http://techreport.com/articles.x/17670/2)找到底下這句話:
"Fermi avoids this inefficiency by executing up to 16 different kernels concurrently, including multiple kernels on the same SM. The limitation here is that the different kernels must come from the same CUDA context—so the GPU could process, say, multiple PhysX solvers at once, if needed, but it could not intermix PhysX with OpenCL."
看來只有同樣是CUDA的運算kernel才能同時運算
OpenCL和CUDA的kernel不能混合同時運算
這樣這個功能還得保證應用程式跑的是CUDA,才能發揮它的威力
另外Tech Report還提到:
"Incidentally, AMD tells us its Cypress chip can also run multiple kernels concurrently on its different SIMDs. In fact, different kernels can be inter...