Distributed Frequency Count Algorithms for Data Streams
Updated Jun 20, 2022 - Kotlin
HFlow is a platform for I/O forwarding managed elastically, dynamically, and actively
Tool to approximate the frequency of occurrences of different items in a data stream.
PyFlink data stream processing utilities 🐿
Automated deployment of an Apache Flink cluster in your Grid'5000 reserved nodes.
A lightweight, polyglot stream-processing library, to be used as a data backplane, message relay, or pipeline subsystem.
Simulation toolbox for Crimegraph.
Hands-on data streaming
DataStream-SQLServer provides real-time data streaming from SQL Server using Zookeeper, Kafka, and Debezium. This repository contains the necessary configurations, Docker setups, and sample code to get you started.
In a team of 4 people, we implemented a public lighting control and monitoring system for a smart city
Real-time data engineering pipeline for an American hiring platform
This project implements and demonstrates how streams and buffers work together in Node.js.
Geometric Figure Classifier program
This project is a data pipeline that streams data from Meetup, performs real-time analysis, and maps the results back onto Google Maps.
This repo has various work I've done or assisted with in my capstone project for my Bachelor's degree in Computer Engineering at Cal Poly Pomona.
My notes on Databases
Final project for the course 'Architecture for Large Data Volumes', taught in the Bachelor's program in Data Science at ITAM
Apache Flink boilerplate for building performant data streaming applications.
Udacity Data Streaming project based on Apache Kafka