In this video, we look at how to do tf-idf in Python with Scikit Learn. GitHub repo: https://github.com/wjbmattingly/topic_modeling_textbook/blob/main/lessons/02_tf_idf_official.py Scikit Learn docs: https://scikit-learn.org/stable/modules/generated/sklearn.feature_extraction.text.TfidfVectorizer.html Sources: http://brandonrose.org/clustering https://github.com/Make-School-Courses/DS-2.1-Machine-Learning/blob/master/Lessons/Clustering.md https://towardsdatascience.com/applying-machine-learning-to-classify-an-unsupervised-text-document-e7bb6265f52 If you enjoy this video, please subscribe. I provide all my content at no cost. If you want to support my channel, please donate via PayPal: https://www.paypal.com/cgi-bin/webscr?cmd=_donations&business=AZ73QW52SUX8N¤cy_code=USD&source=url Patreon: https://www.patreon.com/WJBMattingly (its my www.themedievalworld.com account as well). If there's a specific video you would like to see or a tutorial series, let me know in the comments and I will try and make it. If you liked this video, check out www.PythonHumanities.com, where I have Coding Exercises, Lessons, on-site Python shells where you can experiment with code, and a text version of the material discussed here. You can follow me at: https://twitter.com/wjb_mattingly