-
vfdev authored
* [WIP] Fixes #2598 - Adjusted num_kernels and batch_size according to kMaxGridNum * CUDA_NUM_THREADS - Tests to add * - Redefined kMaxGridNum as max grid according to current CUDA device - Added test to check the code from issue and compared grads CPU/CUDA * Fixed static kMaxGridNum evaluation to dynamic
8c32666b