hadoop
Here are 3,363 public repositories matching this topic...
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
-
Updated
Jul 6, 2024 - Java
Scalable data processing pipelines in JavaScript
-
Updated
Jul 5, 2024 - TypeScript
-
Updated
Jul 5, 2024 - Java
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
-
Updated
Jul 5, 2024 - Scala
Alluxio, data orchestration for analytics and machine learning in the cloud
-
Updated
Jul 6, 2024 - Java
A large-scale entity and relation database supporting aggregation of properties
-
Updated
Jul 5, 2024 - Java
Apache Ignite
-
Updated
Jul 5, 2024 - Java
Kafka Connect HDFS connector
-
Updated
Jul 5, 2024 - Java
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
-
Updated
Jul 6, 2024 - Jupyter Notebook
Adding a cool README file
-
Updated
Jul 4, 2024
Improve this page
Add a description, image, and links to the hadoop topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the hadoop topic, visit your repo's landing page and select "manage topics."