The source code and dockerfile for the GSW2024 AI Lab.
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
This repo is archived. You can view files and clone it, but cannot push or open issues/pull-requests.

8 lines
503 B

4 weeks ago
  1. extern "C" int cuda_basicAdd(int a, int b);
  2. extern "C" void cuda_arrayFmaOptimized(int * const A, int const N, int const M);
  3. extern "C" void cuda_arrayFmaOptimizedHelper(int * const A, int const N);
  4. extern "C" void cuda_arrayFma(int const * const A, int const * const B, int const * const C, int * const D, int const N);
  5. extern "C" void cuda_arrayFmaHelper(int const * const A, int const * const B, int const * const C, int * const D, int const N);
  6. void cpp_cuda_bandwidthTest(int entryCount, int N);