Skip to content

Dataset on property sales price in Melbourne, Australia

Notifications You must be signed in to change notification settings

sofianieva/EDA_and_data_cleaning

Repository files navigation

Diploma in Data Science and Machine Learning

Data Exploration and Data Cleaning Assignment (in spanish)

We worked with the Kaggle competition's dataset on property sales price estimation in Melbourne, Australia, specifically with the reduced data set produced by DanB. In the first part, we performed exploratory data analysis, outlier detection, feature selection and creation of new features by merging with an AirBnB dataset. In the second part, we encoded the variables, tried different imputation methods for the columns with missing values and applied PCA to reduce the dimensionality