Overview

This is an optional honors project for the IBM Exploratory Data Analysis for Machine Learning course on Coursera. The aim is to demonstrate the application of skills and knowledge gained from the course, such as data cleaning, feature engineering, exploratory data visualization, and hypothesis testing.

Project Deliverables

Select a dataset that you are curious about.
Provide a brief description of the data set and a summary of its attributes.
Provide an initial plan for data exploration.
Describe actions taken for data cleaning and feature engineering.
Provide key findings and insights that synthesize the results of the exploratory data analysis in an insightful and actionable manner.
Formulate at least 3 hypotheses about this data.
Conduct a formal significance test for one of the hypotheses and discuss the results.
Provide suggestions for next steps in analyzing this data.
Include a paragraph that summarizes the quality of this data set and a request for additional data if needed.
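
As an illustration of the "formal significance test" deliverable above, here is a minimal sketch of a two-sided permutation test for a difference in mean grades between two groups of students. It is not the test used in this project. The function name `permutation_test`, the group names, and the grade values are hypothetical placeholders (the numbers are made up and do not come from the Kaggle dataset), and only the Python standard library is assumed.

```python
import random
from statistics import mean


def permutation_test(group_a, group_b, n_permutations=10_000, seed=0):
    """Two-sided permutation test for a difference in means.

    Returns the observed difference (mean of group_a minus mean of group_b)
    and the estimated p-value.
    """
    rng = random.Random(seed)
    observed = mean(group_a) - mean(group_b)
    pooled = list(group_a) + list(group_b)
    n_a = len(group_a)

    extreme = 0
    for _ in range(n_permutations):
        # Reassign group labels at random by shuffling the pooled values.
        rng.shuffle(pooled)
        diff = mean(pooled[:n_a]) - mean(pooled[n_a:])
        if abs(diff) >= abs(observed):
            extreme += 1

    # Add-one correction keeps the estimated p-value strictly above zero.
    p_value = (extreme + 1) / (n_permutations + 1)
    return observed, p_value


# Made-up placeholder grades, NOT taken from the Kaggle dataset.
low_consumption = [14, 12, 15, 11, 13, 16, 12, 14]
high_consumption = [10, 12, 9, 11, 13, 8, 10, 11]

observed, p_value = permutation_test(low_consumption, high_consumption)
print(f"difference in means: {observed:.2f}, p-value: {p_value:.4f}")
```

A permutation test is used here only because it needs no third-party packages; a t-test or chi-squared test from a statistics library would fit the same deliverable.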

Dataset

This project uses the Kaggle dataset High School Alcoholism and Academic Performance.

Motivation

To explore what causes teenage alcoholism and its impact on academic performance, as well as factors that could reduce it.

Installation and Setup

Download the Kaggle dataset and extract its contents into ./data.
Create and activate a virtual environment following this tutorial: https://docs.python.org/3/tutorial/venv.html
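
To confirm that the dataset was extracted into ./data as described above, a short check like the following can help. This is a sketch under assumptions: it assumes the extracted files are comma-delimited CSV files with a header row, and the helper name `summarize_csv_files` is hypothetical, not part of this repository.

```python
import csv
from pathlib import Path


def summarize_csv_files(data_dir="data"):
    """Return {file name: (column names, row count)} for each CSV in data_dir."""
    directory = Path(data_dir)
    if not directory.is_dir():
        # Nothing to report if the dataset has not been extracted yet.
        return {}

    summary = {}
    for path in sorted(directory.glob("*.csv")):
        with path.open(newline="", encoding="utf-8") as handle:
            reader = csv.reader(handle)
            header = next(reader, [])          # first row holds the column names
            row_count = sum(1 for _ in reader)  # remaining rows are data
        summary[path.name] = (header, row_count)
    return summary


if __name__ == "__main__":
    for name, (columns, rows) in summarize_csv_files().items():
        print(f"{name}: {rows} rows, {len(columns)} columns")
```

If the script prints nothing, the ./data directory is missing or contains no .csv files.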

Install requirements

On Windows

pip install -r .\requirements.txt

On Linux

pip install -r ./requirements.txt

Run File

On Windows

python .\src\exploratory_data_analysis.py

On Linux

python src/exploratory_data_analysis.py

Code Structure

Results and Evaluation

https://medium.datadriveninvestor.com/how-to-write-a-good-readme-for-your-data-science-project-on-github-ebb023d4a50e

qasimza/highschool-alchoholism

Folders and files

Latest commit

History

Repository files navigation

Overview

Project Deliverables

Dataset

Motivation

Installation and Setup

On Windows

On Linux

On Windows

On Linux

Code Structure

Results and Evaluation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages