Skip to content

Machine learning and AI kernels implemented in various languages and frameworks

License

Notifications You must be signed in to change notification settings

markdewing/AI_kernels

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

56 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

AI_kernels

Machine learning and AI kernels implemented in various languages and frameworks.

General information and links for each framework are located in the vector_add subdirectories.

  • vector_add - The simplest kernel that adds two vectors
  • dot - Dot product of two vectors
  • gemv - Matrix-vector multiplication
  • gemm - Matrix-matrix multiplication
  • latency - Microbenchmark to measure memory latency

There are links to additional technical information about GPU topics in the info directory.

About

Machine learning and AI kernels implemented in various languages and frameworks

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published