MENU

Fun & Interesting

Build Local LLM for OCR, Object Detection & Image Parsing with TOP Precision - LLM Python Project

Machine Learning With Hamza 1,570 lượt xem 5 months ago
Video Not Working? Fix It Now

Hello everyone, I hope you're doing well!
In this video, I will show you how to run the best small VLM locally so that you can perform tasks like OCR, Object Detection, Code generation, document parsing and more. This is the newly introduced model called Mono-InternVL.

Used material links:
Repo: https://github.com/Hmzbo/MonoInternVLTuto/tree/main
Paper: https://arxiv.org/abs/2410.08202
Model on HF: https://huggingface.co/OpenGVLab/Mono-InternVL-2B

Let's connect:
LinkedIn: https://bit.ly/3roXgQ2
GitHub: https://bit.ly/3CrfRRP
Kaggle: https://bit.ly/3C1mqZD
Twitter: https://bit.ly/3UR06e3
--------------------------------------------------------------
♪ A Homey lofi background music
--------------------------------------------------------------

Chapters:
00:00 Intro
00:33 Model presentation
04:44 Run the model locally

If you have any question, suggestion, or remark. Feel free to leave it in a comment below!
Until next time, stay safe!
#MLWH

Comment