
AiCity24 DeepDrivePL

This repository presents the training and testing code we prepared as part of the AiCityChallenge2024 (Track 4: Road Object Detection in Fish-Eye Cameras). We focused on utilizing additional traffic-oriented datasets and transforming them so the images resemble FishEye8K. You can:

  • Download the original dataset, COCO-style annotations, and our pre-trained model to run inference and compute metrics.
  • Download the entire augmented training set and run training according to our strategy.
  • Download only the original data and generate the transformed images yourself.

[Image: TOP6 leaderboard result]

Originally, the FishEye8K dataset is divided into train (5288 images) and validation (2712 images) splits. We kept all 2712 validation images as a local test set and extracted 2 whole sequences (621 images) from the train split for validation. So in our code and data, the following terms are used:

  • train - 4667 images
  • validation - 621 images (originally part of the train set)
  • test - 2712 images
  • test-challenge - 1000 images (without annotations)
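
As a quick sanity check after downloading, the split sizes above can be verified from the COCO-style annotation files with pycocotools. A minimal sketch, assuming one annotation JSON per split (the paths below are placeholders; adjust them to the actual layout):

from pycocotools.coco import COCO

# Hypothetical annotation paths -- adjust to where the dataset was extracted.
for split, expected in [("train", 4667), ("validation", 621), ("test", 2712)]:
    coco = COCO(f"../data/annotations/{split}.json")
    print(f"{split}: {len(coco.getImgIds())} images (expected {expected})")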

Dependencies

We provide a Dockerfile that installs all the dependencies needed by this repository.

cd scripts/mmdet
./build_and_run.sh

Run inference using a trained Co-DETR model

To reproduce the values presented on the leaderboard, you need to download the FishEye8K dataset along with the annotations in COCO format, as well as our best Co-DETR model.

Download data

Download and extract the original (non-augmented) FishEye8K dataset

cd /aicity/scripts
./download_FE8K.sh

Run inference

Run inference on validation and test-challenge data (inside a container).

cd /aicity/scripts/mmdet/
./test.sh

Filter detections & evaluate

For evaluation we use a modified pycocotools library.
Our repository provides a pip-installable version of https://github.com/MoyoG/FishEye8K/tree/main/evaluation_Linux.

cd /aicity/scripts
python run_filtering_and_eval.py --detections mmdet/006-ep7-val.bbox.json
python run_filtering_and_eval.py --detections mmdet/006-ep7-test-challenge.bbox.json --split test-challenge

The provided script will filter detections according to class-dependent confidence thresholds. These thresholds were chosen experimentally to maximize the F1 score on the test set (the original validation split) and are as follows:

{'Bike': 0.4, 'Bus': 0.5, 'Car': 0.5, 'Pedestrian': 0.4, 'Truck': 0.45}

If --split is different from "test-challenge" and --skip_metrics is not specified, the script will also calculate metrics for the detection file passed via --detections.
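
Conceptually, the filtering just drops every detection whose confidence falls below the threshold of its class. A minimal sketch of that step, assuming COCO-style detection JSON; the category-id mapping below is a guess, and run_filtering_and_eval.py remains the authoritative implementation:

import json

THRESHOLDS = {"Bike": 0.4, "Bus": 0.5, "Car": 0.5, "Pedestrian": 0.4, "Truck": 0.45}
# Hypothetical category_id -> name mapping; check the annotation file for the real one.
CAT_NAMES = {0: "Bus", 1: "Bike", 2: "Car", 3: "Pedestrian", 4: "Truck"}

with open("mmdet/006-ep7-val.bbox.json") as f:
    detections = json.load(f)  # list of COCO-style detection dicts

kept = [d for d in detections
        if d["score"] >= THRESHOLDS[CAT_NAMES[d["category_id"]]]]

with open("006-ep7-val.filtered.json", "w") as f:
    json.dump(kept, f)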

Run training

In order to train the Co-DETR model, you can either download the already augmented dataset or generate it yourself.

Download augmented dataset

cd /aicity/scripts
./download_augmented.sh

Generate augmented dataset

  1. Download non-augmented datasets

    cd /aicity/scripts
    ./download_original.sh
  2. Run distortion (VisDrone, UAVDT); a sketch of the idea appears after this list

    cd /aicity/scripts
    python gen_distorted.py --debug
  3. Run pixel-level data augmentation (WoodScape, VisDrone, UAVDT)

    cd /aicity/scripts
    python gen_augmented.py --debug
  4. Run generative data augmentation

    InstructPix2Pix: Learning to Follow Image Editing Instructions

    cd /aicity/3rdparty/instruct-pix2pix
    ./scripts/download_checkpoints.sh
    python generate-FE8K.py
    [pix2pix example image]

    Image Style Transfer Using Convolutional Neural Networks

    cd /aicity/3rdparty/pytorch-neural-style-transfer
    python style_transfer_FE8K.py
    [style transfer example image]
  5. Generate COCO-style annotations

    The script will copy all the augmented images to the specified --out_dir, along with the annotations in COCO format.

    python convert2coco.py --out_dir ../data/augmented-all
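
For reference, step 2 boils down to warping rectilinear images with a radial (barrel) distortion so they look more like fisheye footage. A minimal sketch of that idea, assuming a simple one-coefficient radial model; the model and coefficient are illustrative, and gen_distorted.py is the authoritative implementation:

import cv2
import numpy as np

def barrel_distort(img: np.ndarray, k: float = 0.4) -> np.ndarray:
    # k is an illustrative distortion coefficient, not the value we used.
    h, w = img.shape[:2]
    ys, xs = np.mgrid[0:h, 0:w].astype(np.float32)
    # Normalized coordinates in [-1, 1], centered on the image.
    x = (xs - w / 2) / (w / 2)
    y = (ys - h / 2) / (h / 2)
    factor = 1 + k * (x ** 2 + y ** 2)  # sample further out as the radius grows
    map_x = (x * factor * (w / 2) + w / 2).astype(np.float32)
    map_y = (y * factor * (h / 2) + h / 2).astype(np.float32)
    # Pixels near the border are pulled inward, compressing the periphery
    # the way a fisheye lens does.
    return cv2.remap(img, map_x, map_y, interpolation=cv2.INTER_LINEAR,
                     borderMode=cv2.BORDER_CONSTANT)

distorted = barrel_distort(cv2.imread("example.jpg"))
cv2.imwrite("example_distorted.jpg", distorted)

Note that in the real pipeline the bounding-box annotations have to be remapped together with the pixels, which is part of what gen_distorted.py handles.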

Run training

Our training strategy is as follows:

  1. Train Co-DINO Swin-L for 16 epochs with the augmented dataset.
  2. Fine-tune the model with the best F1 score for 5 epochs using the train-test dataset.
  3. Fine-tune once more for 16 epochs using the train-test dataset.

Run /aicity/scripts/mmdet/train-mmdet-codetr.sh to reproduce the above training strategy, but keep in mind that you may need to adjust the load_from path in each of the config files (as well as dataset paths if you generated the dataset yourself). We selected the best checkpoints for fine-tuning based on F1 score.
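
In MMDetection, load_from is a top-level config key, so the adjustment is a one-line change per stage. An illustrative fragment (the base config and checkpoint names below are placeholders, not the files used in this repository; pick the checkpoint with the best F1 score from the previous stage):

# Hypothetical config fragment -- file names are placeholders.
_base_ = ['./co_dino_swin_l_fisheye.py']
load_from = 'work_dirs/stage1/best_f1_epoch_12.pth'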

Submitted model

Our final submitted model can be downloaded from https://archive.org/details/006-epoch-7
