Key Value Cache from Scratch: The good side and the bad side

Vizuara 1,712 lượt xem 1 month ago

Video Not Working? Fix It Now

In this video, we learn about the key-value cache (KV cache): one key concepts which ultimately led to the Multi-Head Latent Attention innovation.

The KV cache speeds up things, but comes with a dark side: memory overload!

We will understand the entire theory, intuition about the KV cache and then run a simple code to demonstrate the benefits of the KV cache.

======================================================

This video is sponsored by invideoAI (https://invideo.io/).

invideoAI is looking for talented engineers, junior research scientists and research scientists to join their team.

Elixir/Rust full stack engineer:
https://invideo.notion.site/Elixir-Rust-full-stack-engineer-158316ee111a8044846be07038d3e481

Research scientist - generative AI:
https://invideo.notion.site/Research-scientist-generative-AI-17c316ee111a8096bae4c7669b602dec

If you want to apply for any of the ML or engineering roles, reach out to them at careers@invideo.io

======================================================

Comment