autotuner.hpp
Go to the documentation of this file.
72 OPM_GPU_SAFE_CALL(cudaEventElapsedTime(&candidateBlockSizeTime, events[i].get(), events[i + 1].get()));
82 fmt::format("[Kernel tuning completed] {}: Tuned Blocksize = {}, Fastest Runtime = {}ms.", descriptionOfFunction, bestBlockSize, bestTime));
#define OPM_GPU_SAFE_CALL(expression) OPM_GPU_SAFE_CALL checks the return type of the GPU expression (function call) and throws an exceptio... Definition: gpu_safe_call.hpp:150 Definition: autotuner.hpp:30 int tuneThreadBlockSize(func &f, std::string descriptionOfFunction) Function that tests the best thread block size, assumes the provided function depends on threadblock-... Definition: autotuner.hpp:37 |