MENU

Fun & Interesting

Chroma - Vector Database for LLM Applications | OpenAI integration

BugBytes 21,118 9 months ago
Video Not Working? Fix It Now

โ˜•๏ธ ๐—•๐˜‚๐˜† ๐—บ๐—ฒ ๐—ฎ ๐—ฐ๐—ผ๐—ณ๐—ณ๐—ฒ๐—ฒ: To support the channel and encourage new videos, please consider buying me a coffee here: https://ko-fi.com/bugbytes โญTop resource to learn Python - https://datacamp.pxf.io/kOjKkV โญ In this video, we'll take a look at the ChromaDB vector database, which can be used to store embedding data and retrieve embeddings that are most similar to an input query. We'll take a look at loading and. embedding a real-life text dataset, and then querying for similar vectors. We'll also look at different client options for in-memory databases and persistent databases with Chroma, and how to integrate with OpenAI's embeddings API. ๐Ÿ“Œ ๐—–๐—ต๐—ฎ๐—ฝ๐˜๐—ฒ๐—ฟ๐˜€: 00:00 Intro 00:56 ChromaDB introduction 02:18 Creating a ChromaDB client and collections 03:32 Adding documents to a collection 08:45 Passing filters to collection queries 10:24 Reading in real-life dataset with Polars 13:10 Creating Embeddings with OpenAI APIs 18:11 Adding OpenAI vectors to ChromaDB 28:27 Persisting the ChromaDB database ๐—ฆ๐—ผ๐—ฐ๐—ถ๐—ฎ๐—น ๐— ๐—ฒ๐—ฑ๐—ถ๐—ฎ: ๐Ÿ“– Blog: https://bugbytes.io/posts/vector-databases-pgvector-and-langchain/ ๐Ÿ‘พ Github: https://github.com/bugbytes-io/ ๐Ÿฆ Twitter: https://twitter.com/bugbytesio ๐Ÿ“š ๐—™๐˜‚๐—ฟ๐˜๐—ต๐—ฒ๐—ฟ ๐—ฟ๐—ฒ๐—ฎ๐—ฑ๐—ถ๐—ป๐—ด ๐—ฎ๐—ป๐—ฑ ๐—ถ๐—ป๐—ณ๐—ผ๐—ฟ๐—บ๐—ฎ๐˜๐—ถ๐—ผ๐—ป: ChromaDB: https://docs.trychroma.com/ ChromaDB Embeddings: https://docs.trychroma.com/guides/embeddings ChromaDB Integrations: https://docs.trychroma.com/integrations Kaggle News Articles Dataset: https://www.kaggle.com/datasets/asad1m9a9h6mood/news-articles #python #chromadb #datascience

Comment