Hello everyone, I hope you're doing well!
In this video, I will show you how to run the best small VLM locally so that you can perform tasks like OCR, object detection, code generation, document parsing, and more. The model is the newly introduced Mono-InternVL.
Links to the material used:
Repo: https://github.com/Hmzbo/MonoInternVLTuto/tree/main
Paper: https://arxiv.org/abs/2410.08202
Model on HF: https://huggingface.co/OpenGVLab/Mono-InternVL-2B
Let's connect:
LinkedIn: https://bit.ly/3roXgQ2
GitHub: https://bit.ly/3CrfRRP
Kaggle: https://bit.ly/3C1mqZD
Twitter: https://bit.ly/3UR06e3
--------------------------------------------------------------
♪ A Homey lofi background music
--------------------------------------------------------------
Chapters:
00:00 Intro
00:33 Model presentation
04:44 Run the model locally
If you have any questions, suggestions, or remarks, feel free to leave them in a comment below!
Until next time, stay safe!
#MLWH