Skip to content

alysiayx/Melbourne-Footfalls

Repository files navigation

Melbourne Footfall Analysis and Modelling

Dependencies

Setting Up the Environment:

Open your terminal or command prompt. Use the cd command to change directory to the location where you downloaded the "Melbourne-Footfalls" project. For example:

    cd /path/to/Melbourne-Footfalls

Replace /path/to/Melbourne-Footfalls with the actual path to the directory where you downloaded the project.

  • Option 1: Using requirements.txt
    • Create a virtual environment

      • using venv or virtualenv:
      python -m venv <env_name>
      source <env_name>/bin/activate   # On Unix/Linux
      <env_name>\Scripts\activate.bat  # On Windows
      • using conda:
        conda create --name <env_name> python=3.9 --file requirements.txt
        conda activate <env_name>

      replace <env_name> with the name of virtual environement you specified.

    • Execute the following command in the terminal to install the project dependencies:

      pip install -r requirements.txt
  • Option 2: Using pyproject.toml
    • Execute the following command in the terminal to install the project dependencies:
      pip install poetry
      poetry install
      poetry shell

Basic Data Analysis

Data Format Analysis

📊 Notebook: Access the analysis via data_format_analysis.ipynb.

  • Understand the chosen data format for our analysis.

Imputation Analysis

📈 Notebook: Explore through imputation_analysis.ipynb.

  • Conduct experiments related to data imputation.

Data Pre-Processing

How to Execute:

🔄 Notebook: Download the dataset Data (20230918) and process it using Melbourne_footfall_data_preprocessing.ipynb.

Additional Data Sources:

Pre-Processing Steps:

  1. Duplicate Data Handling:
    • Objective: Ensure data integrity and accuracy.
    • Action: Remove records with duplicated sensor IDs, location IDs, or geo-locations.
  2. Sensor ID Unification:
    • Objective: Standardize datasets, focusing on 2023 data.
    • Action: Complement 2023 records, which have only Location IDs, with corresponding sensor names and geo-locations.

Preprocessed Data Storage:

Footfall Modelling

Objective: Analyze pedestrian traffic patterns in Melbourne.

Guided Modelling:

📘 For Beginners or Detailed Guidance:

Direct Modelling:

🚀 For Advanced Users:

Future Developments:

  • Integration of AutoML is pending.
  • Plans to combine data preparation with model training and execution, rather than pre-processing for online learning separately.