In this new video series, data science instructor Vincent Warmerdam gets started with spaCy, an open-source library for Natural Language Processing in Python. His mission: building a system to automatically detect programming languages in large volumes of text. Follow his process from the first idea to a prototype all the way to data collection and training a statistical named entity recogntion model from scratch. SPACY ● Website: https://spacy.io ● GitHub: https://github.com/explosion/spaCy ● Free online course: https://course.spacy.io ● Twitter: https://twitter.com/spacy_io THIS VIDEO ● Code: https://github.com/koaning/spacy-youtube-material ● Stack Overflow dataset: https://www.kaggle.com/stackoverflow/stacksample ABOUT VINCENT Vincent Warmerdam is a co-founder of PyData Amsterdam and experienced data science instructor. He has been evangelizing data and open source for the last 5 years. You might know him from his PyData videos where he attempts to defend common sense over hype in data science. ● Follow Vincent on Twitter: https://twitter.com/fishnets88