The source code and Dockerfile for the GSW2024 AI Lab.
This repo is archived. You can view files and clone it, but cannot push or open issues/pull-requests.

// C linkage: these symbols are exported without C++ name mangling.
extern "C" int cuda_basicAdd(int a, int b);
extern "C" void cuda_arrayFmaOptimized(int * const A, int const N, int const M);
extern "C" void cuda_arrayFmaOptimizedHelper(int * const A, int const N);
extern "C" void cuda_arrayFma(int const * const A, int const * const B, int const * const C, int * const D, int const N);
extern "C" void cuda_arrayFmaHelper(int const * const A, int const * const B, int const * const C, int * const D, int const N);
// C++ linkage (no extern "C"), so this symbol is name-mangled.
void cpp_cuda_bandwidthTest(int entryCount, int N);