MENU

Fun & Interesting

Scaling your tuned models with Cloud Run

Google Cloud Tech 674 lượt xem 2 days ago
Video Not Working? Fix It Now

Stop wasting money on idle GPUs! Google Cloud Run offers a cost-effective solution for scaling your tuned AI models. By scaling to zero when inactive, Cloud Run eliminates unnecessary GPU expenses. Simply use a provided image with Ollama to run your model with a well-defined API endpoint accessible through libraries like GenKit and LangChain. Resources spin up in seconds as requests come in, ensuring optimal performance. Learn how to add GPUs to Cloud Run and maximize your AI investment!

Subscribe to Google Cloud Tech → https://goo.gle/GoogleCloudTech

#GoogleCloud

Speakers: Allen Fistenburg
Products Mentioned: Cloud Run

Comment