Welcome to my GitHub profile! I’m a data scientist with a strong background in handling complex, multi-dimensional datasets. Below you’ll find an overview of my skills and expertise.
I am currently maintaining sed @ OpenCOMPES , a specialized open-source project for the MPES community. My contributions include:
- Developing efficient data loading and processing pipelines.
- Ensuring data provenance and integrity through metadata management.
- Implementing high-performance data reduction techniques.
- Creating and maintaining comprehensive user documentation and API modules.
- Setting up CI/CD pipelines for automated testing, linting, and deployment using GitHub Actions.
- Languages: Python
- Data Processing: Pandas, NumPy, Dask, Numba
- Machine Learning: Scikit-Learn, TensorFlow, PyTorch, Optuna
- Computer Vision: OpenCV, Scikit-Image
- Project Management: Poetry, Git, GitHub Actions, Conda
- CI/CD: GitHub Actions, Coveralls, MyPy, Linters, Ruff, PyTest
- Development Environments: Jupyter Notebooks/Labs
Feel free to connect with me on LinkedIn or reach out via email.