data-pipelines
Here are 210 public repositories matching this topic...
Udacity project 4 - salary predictions based on census data
-
Updated
Aug 7, 2022 - Python
EDA ,data-Pipelines, Randamforest, ROC curve
-
Updated
Aug 30, 2022 - Jupyter Notebook
A simple data processing pipeline supporting FIFO, fixed & dynamic worker pools, and broadcast stages.
-
Updated
Sep 23, 2024 - Go
Udacity Data Engeneering Nanodegree Program - My Submission of Project: Data Pipelines
-
Updated
Apr 12, 2021 - Python
🔎 Versionamento de queries para o datalake (ELT) | https://rj-smtr.github.io/maestro-docs/infra/maestro-bq/
-
Updated
Jun 8, 2022 - Python
This project implements a real-time event streaming pipeline for a music streaming service, inspired by Spotify Wrapped and Billboard charts. The pipeline is powered by Apache Airflow, Apache Kafka, dbt, Docker, GCP, Spark-Streaming, and Terraform.
-
Updated
Apr 22, 2023 - Python
Demonstration of Apache Airflow as one of the data science tools
-
Updated
Dec 14, 2023 - TeX
Sense which / how many computers in a local area network (LAN) are on.
-
Updated
Dec 8, 2022 - Python
This repository contains homework solutions and course material for 10 weeks data engineering zoomcamp by DataTalksClub.
-
Updated
Mar 3, 2023 - Jupyter Notebook
Предиктивный анализ оттока клиентов
-
Updated
Nov 7, 2022 - Jupyter Notebook
PhD Technical Paper 1 - Phase 2 - Mahdavi & Siegel (2020) (Aerosol Science & Technology; AS&T) - Sharing all the data pipelines, processing codes, descriptive statistics, statistical modellings, and plotting/visualizations - Project Miestone: 2017 - 2020 - Full-length article is available
-
Updated
Jul 6, 2024 - Jupyter Notebook
This is a repository to document the entire process and learning throughout the Coursera's IBM Data Engineering Professional Certificate program.
-
Updated
Sep 26, 2023
Добыча золота - предсказание коэффициента обогащения
-
Updated
Nov 7, 2022 - Jupyter Notebook
Building Data Pipelines for a data warehouse with Airflow and AWS
-
Updated
Dec 23, 2022 - Python
Build a web application to classify big data of messages into 36 categories that sent to related disaster relief agencies, and help disaster workers to classify new messages.
-
Updated
Jul 29, 2021 - Jupyter Notebook
Ce projet vise au developpement d’une application qui implemente une Data Pipeline afin de collecter des donnees, appliquer sur ces derniers certains pretraitements afin de les consume par un modele Machine Learning , et finalement les consumer par une SPA.
-
Updated
Dec 5, 2021 - Jupyter Notebook
-
Updated
Jul 27, 2023 - Jupyter Notebook
An example project demonstrating how to submit data to Seafowl from a dagster job.
-
Updated
May 17, 2023 - Python
Improve this page
Add a description, image, and links to the data-pipelines topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the data-pipelines topic, visit your repo's landing page and select "manage topics."