Welcome to the 883 Group Onboarding for Newbies! This repository is your gateway to a structured 10-chapter program, designed to seamlessly integrate you into our dynamic Data Ops team. The onboarding program is designed like "100 DAYS OF CODE", each chapter divided into days, and each day is dedicated to a specific topic or skill set that you'll be exploring. Each day will have a set of tasks, goals, and resources to guide you through the learning process. You'll also have the opportunity to engage in discussions, Q&A sessions, and hands-on exercises to reinforce your understanding and apply your knowledge in real-world scenarios.
- Day is not a day, it's just a session.
- Day 00: Welcome & Introduction - Understand the onboarding process and the 883 Group's vision and mission 🎯
- Day 01: Introduction to Big Data - Core Concepts
- Day 02: Customer Interaction and Insight into Data Operations & Data Lifecycle in the 883 Group :bow_and_arrow:
- Day 03: Introduction to Data concept & Hadoop Ecosystem
- Day 04: Hadoop Distributed File System (HDFS)
- Day 05: MapReduce Programming Model, YARN & Oozie
- Day 06: Introduction to Partitioning & Hive
- Day 07: Introduction to Impala
- Day 08: Introduction to HBase
- [Final Exercise 01 - SKILA]: Introduction to Data Ops Team and Hadoop Ecosystem concepts Q&A ❓
- Practical Exercise 01: Meet the Hadoop Ecosystem in Action :computer:
- Showcase 01: Develop a high-level understanding of the Hadoop ecosystem and its role in big data processing. 🔎
- Day 09: Mastering OOP and Design Principles with Test-Driven Development (TDD)
- Day 10: Writing Clean Code
- Day 11: Embarking on the Docker Journey
- Final Exercise 02: Clean Code Refactoring
- Final Exercise 03: Embracing DevOps: GitLab, Docker, and CI/CD Pipeline
- Bonus - 60 Linux Commands you NEED to know (in 10 minutes) 🔥
- Day 12: Openshift Fundamentals for Data Ops using "zero to hero" Openshift labs
- Day 13: Helm for Kubernetes Package Management using "Hero to Master" Openshift labs
- Day 14: Openshift "Best Practises" Openshift labs
- Day 15: Basic Spark Topics
- Day 12: Advanced Spark Topics
- Day 13: Exploring Spark SQL
- Final Exercise 04: Spark Q&A and Discussion
- Final Exercise 05: Spark RDD Exercises with Joker and Batman
- SKILA: 💃
- Showcase 02: In-Depth Exploration of Spark Component(s)
- Day 14: Introduction to Apache Airflow concepts, terminology and architecture
- Day 15: exploring the Airflow web UI for workflow management
- Day 16: Introduction to Trino concepts, terminology and architecture
- Day 17: Introduction to 883 Culture concepts, terminology and architecture.
- BIG Final Exercise: Senior Data Infra Engineer Onboarding Final Exercise 🚀
- Showcase: Prepare a presentation to the group to showcase the story, architecture, insights gained, and troubleshooting strategies of your final exercise solution in just two sentences.
- Day 18: Create documentation for a sample project from the 883 Group and present it to the team.(ask your mentor for the link to the project)
- Day 19: Open Source Contribution
- Day 20: Review the onboarding process and suggest improvements.
If you have any suggestions or ideas to enhance the onboarding for future members, feel free to contribute. Fork, create a feature branch, commit changes, and create a pull request.