#

hive-metastore

Here are 53 public repositories matching this topic...

tomkat-cr / data_lakehouse_local_stack

Data Lakehouse local stack with PySpark, Trino, and Minio. Includes an example to process Raygun error data and the IP address occurrence.

python spark hive docker-compose minio spark-sql trino hive-metastore minio-storage

Updated Jul 3, 2024
Python

hienduyph / docker-hive-metastore

Apache Hive Standalone Metastore

hive hive-metastore hive-standalone-metastore

Updated Jun 28, 2024
Dockerfile

beekeeper

ExpediaGroup / beekeeper

Service for automatically managing and cleaning up unreferenced data

java big-data hive s3 maintenance cleanup metastore hive-metastore oss-portal-featured

Updated Jun 14, 2024
Java

OKDP / charts

Collection of OKDP helm charts

kubernetes superset helm-charts trino hive-metastore spark-kubernetes spark-history-server okdp

Updated Jun 6, 2024
Smarty

ExpediaGroup / waggle-dance

Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.

hive federation metastore hive-metastore oss-portal-listed

Updated Jul 8, 2024
Java

rishuatgithub / hive-custom-udfs

This is a repository for custom user defined functions used in Apache Hive

sql hive jar apache hacktoberfest hive-udfs apache-hive hiveql hive-metastore hacktoberfest2020 custom-udfs

Updated May 28, 2024
Java

GoogleCloudPlatform / datacatalog-connectors-hive

Sample code with integration between Data Catalog and Hive data source.

python hive analytics gcp data-warehouse metadata-management hive-metastore apache-atlas datacatalog

Updated May 1, 2024
Python

cloudera-labs / hms-mirror

"hms-mirror" is a utility used to bridge the gap between two clusters and migrate hive metadata.

hive hive-metastore

Updated May 1, 2024
Java

recap-build / hive-metastore-standalone

Apache Hive Metastore in Standalone Mode With Docker

docker presto hive hadoop prestodb trino hcatalog hive-metastore github-workflow github-workflows trinodb

Updated Apr 23, 2024
Dockerfile

harrydevforlife / building-lakehouse

Building Data Lakehouse by open source technology. Support end to end data pipeline, from source data on AWS S3 to Lakehouse, visualize and recommend app.

python airflow spark s3 metabase minio dbt flask-api hive-metastore delta-lake lakehouse

Updated Apr 20, 2024
Python

aaliashraf / airflow-spark-hive-azure-docker-workflow

Foundation Workspace for Airflow, Spark, Hive, and Azure Data Lake Gen2 via Docker

python docker airflow spark apache-spark hive pyspark azure-storage apache-airflow hive-metastore bitnami-image azuredatalakegen2

Updated Mar 31, 2024
Python

hive-metastore-client

quintoandar / hive-metastore-client

A client for connecting and running DDLs on hive metastore.

python package hive etl data-engineering hive-metastore-client metastore hive-metastore ddls

Updated Mar 20, 2024
Thrift

AhmetFurkanDEMIR / minio-hive-example

Kubernetes Hive Minio connection example

kubernetes hive hadoop postgresql s3 s3-bucket kubernetes-cluster minio k8s kubernetes-deployment apache-hive hive-metastore hive-server

Updated Mar 19, 2024
Shell

ExpediaGroup / circus-train

Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.

bigquery big-data hive replication s3 replicate-data hive-metastore hive-table

Updated Mar 5, 2024
Java

naushadh / hive-metastore

Apache Hive Metastore as a Standalone server in Docker

docker spark presto trino hive-metastore localstack

Updated Feb 29, 2024
Shell

recap-build / pymetastore

A Python Client for Hive Metastore

python hive thrift data-engineering hcatalog hive-metastore

Updated Dec 19, 2023
Python

gmrqs / lasagna

A Docker Compose template that builds a interactive development environment for PySpark with Jupyter Lab, MinIO as object storage, Hive Metastore, Trino and Kafka

docker spark jupyter docker-compose pyspark minio spark-streaming jupyterlab trino hive-metastore

Updated Dec 8, 2023
Jupyter Notebook

dominikhei / Local-Data-LakeHouse

Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testing.

data-lake minio trino hive-metastore apache-iceberg lakehouse data-lakehouse

Updated Sep 2, 2023
Dockerfile

drone-fly

ExpediaGroup / drone-fly

A service which allows Hive Metastore Listeners to be deployed outside of the Hive Metastore Service

hive hive-metastore

Updated Mar 5, 2024
Java

BhagiaSheri / apache-spark-SQL

Big Data Pipeline | Querying Data from Hive Table Phase

spark hive java-8 spark-sql big-data-analytics hive-metastore

Updated Jun 17, 2023
Java

Improve this page

Add a description, image, and links to the hive-metastore topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the hive-metastore topic, visit your repo's landing page and select "manage topics."