D-Matrix Corsair: delivering low latency batched inference for inference-time-compute

GPU MODE, 986 views, 2 days ago

Speakers:
Gaurav Jain - Kernels
Akhil Arunkumar - Inference Engine
Satyam Srivastava - Architecture