In this video you will be building a High Performance Real-time Analytics Database using state of the art tools in the Apache Ecosystem like Apache Kafka, Apache Druid, Apache Superset, Docker, and Orbstack.
FOR MORE DATA ENGINEERING COURSES: datamasterylab.com
📚 What You'll Learn:
👉 Understand Apache Frameworks for Data Engineering
👉 Streaming data into Apache Kafka
👉 Using Zookeeper for distributed synchronization
👉 Data processing with Apache Druid
👉 Data storage and Realtime Aggregations with Apache Druid
👉 Containerising your data engineering environment with Docker
✨ Timestamps: ✨
0:00 Introduction
1:47 List of Apache Frameworks for Data Engineering
3:20 System Architecture
10:36 Starting up a project from scratch
13:18 Setting up the containers and services on Docker
28:10 Streaming data into Apache Kafka
42:35 Apache Druid Walkthrough
48:57 Connecting Apache Druid to Apache Kafka
1:00:04 Realtime Queries and Aggregations on Apache Druid
1:07:34 Time Aggregations on Apache Druid
1:09:32 Outro
👦🏻 My Linkedin: https://www.linkedin.com/in/yusuf-ganiyu-b90140107/
🚀 Twitter: https://twitter.com/YusufOGaniyu
📝 Medium: https://medium.com/@yusuf.ganiyu
🌟 Please LIKE ❤️ and SUBSCRIBE for more AMAZING content! 🌟
Like this video? Buy me a coffee ❤️ https://www.buymeacoffee.com/yusuf.ganiyu/
🔗 Useful Links and Resources:
👉 Full Source Code: buymeacoffee.com/yusuf.ganiyu/full-source-code-building-high-performance-realtime-analytics-database
👉 Apache Druid Documentation: https://druid.apache.org/docs/latest/design/
👉 Apache Superset Documentation: https://superset.apache.org/docs/
👉 Docker Documentation: https://docs.docker.com/
👉 Orbstack Documentation: https://orbstack.dev/docs
✨ Tags ✨
Data Engineering, Apache Kafka, Apache Druid, Real-time Analytics, Apache Superset, Docker, Orbstack, High-Performance Databases, Big Data, Streaming Data, Data Processing, Open Source, Data Aggregation
✨ Hashtags ✨
#DataEngineering #ApacheKafka #ApacheDruid #RealTimeAnalytics #ApacheSuperset #Docker #BigData #DataScience #StreamingData #OpenSource #DataAggregation #Orbstack #HighPerformanceDatabases #bigdatatechnologies