MENU

Fun & Interesting

Sentence Tokenization in Transformer Code from scratch!

CodeEmporium 13,564 lượt xem 2 years ago
Video Not Working? Fix It Now

ABOUT ME
β­• Subscribe: https://www.youtube.com/c/CodeEmporium?sub_confirmation=1
πŸ“š Medium Blog: https://medium.com/@dataemporium
πŸ’» Github: https://github.com/ajhalthor
πŸ‘” LinkedIn: https://www.linkedin.com/in/ajay-halthor-477974bb/

RESOURCES
[ 1 πŸ”Ž ] Samanantar: The paper https://paperswithcode.com/paper/samanantar-the-largest-publicly-available
[ 2 πŸ”Ž] Samanantar: Download https://ai4bharat.iitm.ac.in/samanantar
[ 3 πŸ”Ž] Code for video:https://github.com/ajhalthor/Transformer-Neural-Network/blob/main/Sentence_Tokenization.ipynb

PLAYLISTS FROM MY CHANNEL
β­• Transformers from scratch playlist: https://www.youtube.com/watch?v=QCJQG4DuHT0&list=PLTl9hO2Oobd97qfWC40gOSU8C0iu0m2l4
β­• ChatGPT Playlist of all other videos: https://youtube.com/playlist?list=PLTl9hO2Oobd9coYT6XsTraTBo4pL1j4HJ
β­• Transformer Neural Networks: https://youtube.com/playlist?list=PLTl9hO2Oobd_bzXUpzKMKA3liq2kj6LfE
β­• Convolutional Neural Networks: https://youtube.com/playlist?list=PLTl9hO2Oobd9U0XHz62Lw6EgIMkQpfz74
β­• The Math You Should Know : https://youtube.com/playlist?list=PLTl9hO2Oobd-_5sGLnbgE8Poer1Xjzz4h
β­• Probability Theory for Machine Learning: https://youtube.com/playlist?list=PLTl9hO2Oobd9bPcq0fj91Jgk_-h1H_W3V
β­• Coding Machine Learning: https://youtube.com/playlist?list=PLTl9hO2Oobd82vcsOnvCNzxrZOlrz3RiD


MATH COURSES (7 day free trial)
πŸ“• Mathematics for Machine Learning: https://imp.i384100.net/MathML
πŸ“• Calculus: https://imp.i384100.net/Calculus
πŸ“• Statistics for Data Science: https://imp.i384100.net/AdvancedStatistics
πŸ“• Bayesian Statistics: https://imp.i384100.net/BayesianStatistics
πŸ“• Linear Algebra: https://imp.i384100.net/LinearAlgebra
πŸ“• Probability: https://imp.i384100.net/Probability

OTHER RELATED COURSES (7 day free trial)
πŸ“• ⭐ Deep Learning Specialization: https://imp.i384100.net/Deep-Learning
πŸ“• Python for Everybody: https://imp.i384100.net/python
πŸ“• MLOps Course: https://imp.i384100.net/MLOps
πŸ“• Natural Language Processing (NLP): https://imp.i384100.net/NLP
πŸ“• Machine Learning in Production: https://imp.i384100.net/MLProduction
πŸ“• Data Science Specialization: https://imp.i384100.net/DataScience
πŸ“• Tensorflow: https://imp.i384100.net/Tensorflow

TIMESTAMP
0:00 Dataset Source
2:39 Alpha Syllabery Explained
5:33 Reading & Processing Sentences
8:43 Pytorch Dataset & TextDataset
10:13 Batching Sentences
12:05 Character to Number Encoding
14:50 Masking
18:15 Creating a Class

Comment