Developing a deep learning framework
Early stages of writing up about my project to create a neural network in c++. over time i want to improve training latency of the net via SW optimisations. But also it serves as a foundation to explore CUDA processing to make incredibly fast training loops - or atleast compared to cpu?
Github link: @codegen-cnn
Will create a series on this - currently the training algorithm is taking a while. I think I need to do vectorised updates for matrix processing with SIMD? Currently iโm doing the naive โupdate each cell in the matrix on its ownโ :/
Other notes about AI and/or Deep Learning and/or C++
- ๐ฒPart 1. Thread Pools
Thread pooling with c++20 primitives
- ๐ฒPart 2. Work Stealing Thread Pools
Work Stealing Thread Poools C++20
- ๐ฑMPMC Queue
MPMC Queue
- ๐ฑC++ low-latency design patterns
A brief description of what this note covers
- ๐ฑAtomics
Atomics
- ๐ฟSPSC Queue
SPSC Thread-Safe Queue
- ๐ฟImplementing STL's std::shared_ptr
Implementing STL's std::shared_ptr
- ๐ฟImplementing STL's std::unique_ptr
Implementing STL's std::unique_ptr
- ๐ฟImplementing STL's std::vector
A brief description of what this note covers
- ๐ฟType Erasure in C++
Type Erasure in C++