Welcome to my Data Science Projects Repository! This repository contains a collection of my data science projects, showcasing my skills and expertise in the field. Each project demonstrates different aspects of data analysis, machine learning, and visualization.
- Breast Cancer Prediction
- Description: The project predicts the diagnosis (M = malignant, B = benign) of the Breast Cancer
- Technologies Used: The notebooks uses Decision Tree Classification and Logistic Regression
- Results: The logistic regression gave 97% accuracy and decision tree gave 93.5% accuracy
- Red Wine Quality Prediction
- Description: The project predicts the quality of the wine in the value 0 or 1. 1 for good quality and 0 for bad quality
- Technologies Used: The notebooks uses logistic regression, support vector machine, decision tree and knn
- Results: The logistic regression model performs the best with accuracy of 86.67%
- Heart Stroke Prediction
- Description: The project predicts the risk of heart stroke on studying the person's demographics and medical info
- Technologies Used: The notebooks uses logistic regression, support vector machine, decision tree and knn
- Results: The logistic regression, SVM and KNN performs the best with 93.8 % accuracy
- House Price Prediction
- Description: The project predicts the house price after studying the variables such as location, area, bredroom, bathroom count and many more.
- Technologies Used: The notebooks uses Linear Regression, Ridge Regression and Random Forest Regressor
- Results: The Random Forest Regressor performed best with accuracy of 87.89%
- Titanic Survival Prediction
- Description: The project predicts the survival during the titanic disaster based on socio-economic measures
- Technologies Used: The notebooks uses Descision Tree Classifier
- Results: The Decision Tree Classifer performed well on the test dataset with an accuracy of 89.5%
- Diamond Price Prediction
- Description: The project predicts the price (in US dollars) of the diamonds based on their features
- Technologies Used: The notebooks uses Descision Tree Regressor and Random Forest Regressor
- Results: The Decision Tree Regresor performed well on the test dataset with an accuracy of 96%
- Medical Cost Prediction
- Description: The project predicts the medical treatment cost by analysing the patients age, gender, bmi, smoking habits etc.
- Technologies Used: The notebooks uses Linear and Polynomial Regression, Decision Tree and Random Forest Regressor
- Results: The Decision Tree Regressor and Random Forest Regressor performed well
- Room Occupancy Detection
- Description: The project predicts the room occupancy by analyzing the sensor data such as temperature, light and co2 level.
- Technologies Used: The notebooks uses Random Forest Classifier
- Results: The Random Forest Classifier performed well with an accuracy of 98%
- Sleep Disorder Prediction
- Description: The project aims to predict sleep disorders and their types by analyzing lifestyle and medical variables, such as age, BMI, sleep duration, blood pressure, and more
- Technologies Used: The notebooks uses Random Forest Classifier and Decision Tree cLassifier
- Results: The Random Forest Classifier performed well with an accuracy of 89%
- Pima Indians Diabetes Prediction
- Description: The primary objective of the Pima Indian Diabetes Prediction project is to analyze various medical factors of female patients, to predict whether they have diabetes or not.
- Technologies Used: The notebooks uses Logistic Regression, Random Forest Classifier and Support Vector Machine
- Results: The Logistic Regression performed with an accuracy of 78%.
- Bank Customer Churn Prediction
- Description: The main objective of the Bank Customer Churn Prediction project is to analyze the demographics in order to predict whether a customer will leave the bank or not.
- Technologies Used: The notebooks uses Random Forest Classifier and Decision Tree Classifier
- Results: The Random Forest Classifier and Decision Tree Classifier performed equally well with an accuracy of 87%
- Salary Prediction
- Description: The main objective of the Salary Prediction project is analyze the employee's demographics such as age, experience job title, country and race to predicts the salary.
- Technologies Used: The notebooks uses Descision Tree Regressor and Random Forest Regressor
- Results: The Random Forest Regressor performed best with 94.6% accuracy
- Delhi House Price Prediction
- Description: he primary objective is to develop a predictive model that can accurately estimate the prices of houses based on several key features present in the dataset.
- Technologies Used: The notebooks uses Descision Tree Regressor and Random Forest Regressor
- Results: The Random Forest Regressor performed best with 84.98% accuracy
- Loan Approval Prediction
- Description: The Loan Approval Prediction project aims to predict whether a loan application will be approved by a bank.
- Technologies Used: The notebooks uses Random Forest Classifier and Decision Tree Classifier
- Results: The Decision Tree Classifier performed well with an accuracy of 91.4%
- Cardiovascular Disease Prediction
- Description: The Cardiovascular Disease Prediction project aims to predict the occurrence of cardiovascular disease in patients based on their medical records and history.
- Technologies Used: The notebooks uses Random Forest Classifier, Decision Tree Classifier and Logistic Regression
- Results: The Logistic Regression performed well with an accuracy of 91.4%
- Belarus Car Price Prediction
- Description: The Belarus Car Price Prediction project aims to predict the price of car in Belarus based on car features.
- Technologies Used: The notebooks uses Decision Tree Regressor
- Results: The Decision Tree Regressor gave an accuracy of 86.29%
- Warranty Claims Fraud Prediction
- Description: The aim of this data science project is to predict the authenticity of warranty claims by analyzing various factors such as region, product category, claim value, and more.
- Technologies Used: The notebooks uses Decision Tree Classifier, Random Forest Classifier and Logistic Regression
- Results: All three models gave an accuracy of 91-92%
- E-Commerce Product Delivery Prediction
- Description: The aim of this project is to predict whether products from an international e-commerce company will reach customers on time or not.
- Technologies Used: The notebooks uses Decision Tree Classifier, Random Forest Classifier, Logistic Regression and KNN Classifier
- Results: The decision tree classifier model performed best with 69% accuracy
- Hotel Reservations Cancellation Prediction
- Description: The aim of this project to predict the possible reservations that are going to cancelled by the customers by analyzing various features and variables associated with the reservation.
- Technologies Used: The notebooks uses Decision Tree Classifier, Random Forest Classifier and Logistic Regression.
- Results: The decision tree classifier model performed best with 85% accuracy
- Telecom Customer Churn Prediction
- Description: The aim of this project is to analyze customer demographics, services, tenure and other variables to predict whether a particular customer will churn or not.
- Technologies Used: The notebooks uses Decision Tree Classifier, Random Forest Classifier and K Nearest Neighbor Classifier.
- Results: The random forest classifier model performed best with 82% accuracy
- SFR Analysis
- Description: The objective of this project is to analyze the SFR (SpaceFund Realty) of the aerospace companies and their missions in order to help the investors to make better decisions.
- Technologies Used: The notebooks uses Decision Tree Classifier, Random Forest Classifier.
- Results: The random forest classifier and decision tree classifier gave 87% accuracy.
- Indian Used Car Price Prediction
- Description: The aim of this data science project is to predict the price of used cars in major Indian metro cities.
- Technologies Used: The notebooks uses Decision Tree Regressor and Random Forest Regressor.
- Results: The random forest regressor gave 87.8% accuracy
- Crop Yield Prediction
- Description: The aim of this data science project is to predict crop yield using the dataset provided from Crop Yield Prediction..
- Technologies Used: The notebooks uses Decision Tree Regressor and Random Forest Regressor.
- Results: The random forest regressor gave 80.2% accuracy
- Osteoporosis Risk Prediction
- Description: The aim of this project is to predict the risk of osteoporosis in patients using a dataset of patients' medical records.
- Technologies Used: The notebooks uses Logistic Regression, Random Tree, Decision Tree and Support Vector Classifier.
- Results: The Decision Tree Classifier gave 87% accuracy
- Calgary Crime Data Analysis and Neural Network Prediction
- Description: The aim of this project is to analysis the crime reportings from 2018 to April 2024 in the city of Calgary and predict the crime count using neural network.
- Technologies Used: The notebooks uses LSTM Neural Network to predict the crime count by using adam optimizer.
This project is licensed under the MIT License. You are free to use the code and resources for educational or personal purposes with citation or reference to the original code and resources used.
Contributions are welcome! If you would like to contribute to this repository, please follow the guidelines outlined in CONTRIBUTING.md. Any improvements, bug fixes, or additional projects are greatly appreciated.
We believe in the power of community and collaboration. Head over to our Discussion Page to engage with fellow data enthusiasts, share your ideas, ask questions, and contribute to our vibrant community. Whether you're a seasoned data scientist or just starting out, your voice matters! Let's learn, grow, and innovate together. See you there! 🚀
I welcome any feedback, suggestions, or questions you may have about the projects or any kind of sponsorships for the repository. Feel free to reach out to me via email at [email protected]
Enjoy exploring my data science projects!