Skip to content
View jalvarezcabada's full-sized avatar
๐Ÿ’ป
๐Ÿ’ป

Block or report jalvarezcabada

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
jalvarezcabada/README.md

Hi there, I'm Joaquin!

I'm a Data Engineer with a passion for exploring and applying new technologies.

I enjoy researching innovative solutions and incorporating them into my projects to improve data processes and infrastructure.

๐Ÿ› ๏ธ Technologies & Tools

  • Languages: Python, SQL
  • Big Data: PySpark, Spark, Databricks
  • Cloud: AWS, GCP
  • Containerization: Docker
  • Orchestration: Airflow

๐Ÿ’ผ What I Do

I primarily work on building data pipelines and ETLs, extracting data from various sources, and processing it through all stages of a data lake. My expertise includes:

  • Data Pipeline/ETL Creation: Designing and implementing efficient data pipelines to move and transform data across various systems.
  • Lakehouse Architecture: Building data solutions using the Lakehouse architecture, integrating the best of data lakes and data warehouses for efficient and scalable data storage and analytics.
  • Complex Process Orchestration: Managing the orchestration of complex workflows using tools like Airflow to ensure smooth and efficient execution of multi-step data processes.
  • Optimization & Performance: Continuously improving the performance and optimization of data processes within the data lake, ensuring faster and more efficient data retrieval and processing.
  • Data Quality: Ensuring high standards of data quality through validation and monitoring processes.
  • CI/CD for Pipelines: Setting up and deploying data pipelines using automated CI/CD workflows.

Popular repositories Loading

  1. apache-airflow-docker apache-airflow-docker Public

    Python

  2. scraping-videogames scraping-videogames Public

    Python

  3. data-quality-library data-quality-library Public

    Python

  4. jalvarezcabada jalvarezcabada Public