karinakozlowski/README.md

Karina Kozlowski

Data Scientist | Machine Learning Specialist | Finance & Business Intelligence Expert

Data Scientist with a background in Computer Engineering from MIT and Harvard, specializing in Machine Learning for optimizing supply chain and sales performance. With over 8 years of experience, I combine critical analysis, KPI evaluation, and process automation at Siemens to turn data into high-impact strategies and decisions.

Experience in Corporate Data Science

I am passionate about using data analysis to develop predictive models, extract actionable insights, and apply Natural Language Processing (NLP) models like ChatGPT to improve communication and interaction within information systems. I work with Python, SQL, Power BI, and Snowflake to build solutions that improve efficiency and business performance in the finance, sales, and logistics sectors. I also teach data science and machine learning, with experience at Harvard, MIT, and ICARO, where I cover statistical modeling, data processing, and ML applications in time series.

Evaluation of Financial KPIs and Metrics

My approach to KPI evaluation is comprehensive and strategic. I design and monitor metrics that guide current performance and project future success, using advanced tools to identify improvement opportunities in P&L reports, profit margins, and order analysis. My reports for Latin America and collaboration with teams in Germany ensure that decisions are based on solid, relevant data.

Process Automation and Digitalization

I automate complex processes, from data cleaning and preparation to report generation and visualization. I have developed Python applications and Power Automate flows that streamline repetitive tasks, ensuring consistency and reducing errors. Additionally, I orchestrate data pipelines in dbt and GitLab, connected to Snowflake, to guarantee efficient and scalable data analytics workflows.

Highlighted Projects

  • NLP for Communication Enhancement: Implemented ChatGPT models at Siemens to enhance interaction and communication within information systems.
  • Migration to Snowflake: Automated balance sheet consolidation and reporting in Big Data environments.
  • Regional KPI Development: Implemented finance reports for Siemens in Latin America, including sales and logistics analysis.
  • Data Science Teaching and Mentorship: Created curricula at ICARO and MIT for diploma programs in Machine Learning, Python, and time series analysis.
  • Automation with Python and Power BI: Implemented SAP robots and Power Automate workflows for account statements and automated email deliveries.

Technical Skills

  • Programming Languages: Python, SQL, DAX
  • BI and Data Warehousing Tools: Power BI, Snowflake, dbt, Mendix
  • Machine Learning: Predictive models, regression, classification, anomaly detection
  • NLP and Chatbot Development: ChatGPT models, Siemens-specific NLP solutions
  • Automation and Orchestration: Power Automate, IPy, GitLab
  • Documentation and Presentation: Financial reports, presentations in English, schematic process documentation

Always looking for opportunities to innovate, enhance performance, and create value through data science.

📫 Contact

Feel free to contact me if you want to discuss exciting projects or collaborate on data initiatives!

My Skills

  • Programming languages: Python, R (Statistics), SQL, Markdown
  • Libraries: TensorFlow, Keras, Pandas, NumPy, Matplotlib, Seaborn, scikit-learn, FastAPI, Streamlit
  • Data Engineering tools: MySQL, Postgres, BigQuery, MongoDB
  • Big Data: Docker, Apache Hadoop, Apache Hive, Apache Spark
  • BI Analysis: Tableau, Power BI
  • IDE & version control: Git, GitHub, Jupyter, Colab, Visual Studio Code, RStudio
  • Cloud technologies: Render, Clever Cloud

📁 Featured Projects

| # | Language | Role | Deployment | Project | Client | Link |
|---|----------|------|------------|---------|--------|------|
| 1 | Python | Data Engineer | AWS | App | NYC2050 (In Progress) | Link |
| 2 | Python | Machine Learning Engineer | Streamlit | Fintech Fraud Transactions | No Country | Link |
| 3 | Python | Machine Learning Engineer | FastAPI | API | Steam Games | Link |
| 4 | Python | Data Analyst | Clever Cloud - Streamlit | Dashboard | Gobierno de la Ciudad BA | Link |
| 5 | Python | Data Analyst | Power BI | Sales Dashboard | Walmart - Retail | Link |
| 6 | Python | Data Analyst | Streamlit | Dashboard | Johnson & Johnson | Link |

Pinned

  1. Data_Siniestros_Viales Public

    Role: Data Analyst | App Siniestros Viales

    Jupyter Notebook 6 1

  2. MLOPS_API Public

    STEAM PROJECT - Machine Learning Engineering for Production

    Jupyter Notebook 2

  3. Data-Synergy/EcoDriverNY Public

    Eco Driver NYC

    Jupyter Notebook 4 6

  4. Traffic_Sign_Classification Public

    Role: Machine Learning | Project: Traffic Sign Classification

    Jupyter Notebook

  5. Online_Retail Public

    Jupyter Notebook