Stage 05
Custom Kernels & Production
Write CUDA/Triton kernels, master quantization, implement speculative decoding, and deploy with vLLM.
10notebooks
9hestimated
Write CUDA/Triton kernels, master quantization, implement speculative decoding, and deploy with vLLM.