- Fixed problems with timing of kernel functions when using multiple streams per physical GPU.
- Fixed some indexing issues in kernel WaveBoundary (memory accesses of 1D-Arrays were not 0-based). - Closed small memory leaks caused by cuda-calls.
Showing