Ensuring Privacy for Any LLM with Patricia Thaine - 716

The TWIML AI Podcast with Sam Charrington 478 lượt xem 3 months ago

Video Not Working? Fix It Now

Today, we're joined by Patricia Thaine, co-founder and CEO of Private AI to discuss techniques for ensuring privacy, data minimization, and compliance when using 3rd-party large language models (LLMs) and other AI services. We explore the risks of data leakage from LLMs and embeddings, the complexities of identifying and redacting personal information across various data flows, and the approach Private AI has taken to mitigate these risks. We also dig into the challenges of entity recognition in multimodal systems including OCR files, documents, images, and audio, and the importance of data quality and model accuracy. Additionally, Patricia shares insights on the limitations of data anonymization, the benefits of balancing real-world and synthetic data in model training and development, and the relationship between privacy and bias in AI. Finally, we touch on the evolving landscape of AI regulations like GDPR, CPRA, and the EU AI Act, and the future of privacy in artificial intelligence.

🎧 / 🎥 Listen or watch the full episode on our page: https://twimlai.com/go/716.

🔔 Subscribe to our channel for more great content just like this: https://youtube.com/twimlai?sub_confirmation=1

🗣️ CONNECT WITH US!
===============================
Subscribe to the TWIML AI Podcast: https://twimlai.com/podcast/twimlai/
Follow us on Twitter: https://twitter.com/twimlai
Follow us on LinkedIn: https://www.linkedin.com/company/twimlai/
Join our Slack Community: https://twimlai.com/community/
Subscribe to our newsletter: https://twimlai.com/newsletter/
Want to get in touch? Send us a message: https://twimlai.com/contact/

📖 CHAPTERS
===============================
00:00 - Introduction
2:06 - Private AI
3:32 - Data flows
13:13 - User requirements
14:06 - Entity recognition
18:39 - Warranties
19:43 - Generalized model
20:42 - Multilingual challenges
22:13 - OCR
25:31 - Synthetic data
28:06 - Multimodality
37:44 - Managing system and product scope
39:30 - Data catalogs and MDM systems
41:43 - The connection between privacy and ethical responsible AI
45:23 - AI regulations
48:07 - Future directions

🔗 LINKS & RESOURCES
===============================
Private AI - https://private-ai.com/en/redact/
Forum Ventures - https://hubs.ly/Q0346ccs0

📸 Camera: https://amzn.to/3TQ3zsg
🎙️Microphone: https://amzn.to/3t5zXeV
🚦Lights: https://amzn.to/3TQlX49
🎛️ Audio Interface: https://amzn.to/3TVFAIq
🎚️ Stream Deck: https://amzn.to/3zzm7F5

Podcast

Tech

Technology

ML

AI

Machine Learning

Artificial Intelligence

Sam Charrington

data

science

computer science

deep learning

TWIML AI

Comment