
[CVPR2024] Learning CNN on ViT: A Hybrid Model to Explicitly Class-specific Boundaries for Domain Adaptation


Learning CNN on ViT: A Hybrid Model to Explicitly Class-specific Boundaries for Domain Adaptation

This repository contains the code of the ECB method for Classification in Domain Adaptation.

Ba-Hung Ngo*, Nhat-Tuong Do-Tran*, Tuan-Ngoc Nguyen, Hae-Gon Jeon and Tae Jong Choi†
Accepted to the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024).


Proposed Method



  • Supervised Training: We train both the ViT and CNN branches on labeled samples.
  • Finding To Conquering (FTC) Strategy: We find class-specific boundaries based on the fixed ViT encoder E1 by maximizing the discrepancy between the classifiers F1 and F2. Subsequently, the CNN encoder E2 clusters the target features around those class-specific boundaries by minimizing the discrepancy.
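Both FTC steps revolve around a discrepancy measure between the predictions of the two classifiers. The sketch below is an illustration only, not the repository's exact loss: the row-wise softmax and L1 discrepancy are common choices in discrepancy-based adaptation, and the function names here are our own.

```python
import numpy as np

def softmax(logits):
    """Row-wise softmax over class logits (numerically stable)."""
    e = np.exp(logits - logits.max(axis=1, keepdims=True))
    return e / e.sum(axis=1, keepdims=True)

def discrepancy(p1, p2):
    """Mean absolute difference between two classifiers' class probabilities.

    In the FTC strategy, this quantity is first *maximized* w.r.t. the
    classifiers F1/F2 (with the ViT encoder E1 fixed) to expose
    class-specific boundaries, then *minimized* w.r.t. the CNN encoder E2
    so that target features cluster inside those boundaries.
    """
    return np.abs(p1 - p2).mean()

# Toy illustration: identical predictions give zero discrepancy,
# disagreeing predictions give a positive one.
p1 = softmax(np.array([[2.0, 0.5], [0.1, 1.0]]))
p2 = softmax(np.array([[0.5, 2.0], [1.0, 0.1]]))
print(discrepancy(p1, p1))  # 0.0
print(discrepancy(p1, p2) > 0)  # True
```

In practice the two optimization steps alternate each iteration, playing a minimax game between the classifiers and the CNN encoder.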

Prepare

Dataset

Please follow the instructions in DATASET.md to download datasets.

Installation

conda env create -f environment.yml

Training

  • train.yaml is the config file for training our method. You can change its arguments to train Semi-Supervised Domain Adaptation (SSDA) or Unsupervised Domain Adaptation (UDA).
python train.py --cfg configs/train.yaml
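For illustration only, such a config might look like the sketch below. The key names here are hypothetical; consult CONFIG.md for the actual arguments the repository uses.

```yaml
# Hypothetical sketch of configs/train.yaml -- see CONFIG.md for real keys
dataset: DomainNet
source: real
target: sketch
setting: SSDA      # or UDA
num_shots: 3       # labeled target samples per class (SSDA only)
```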

Evaluation

  • If you want to evaluate the test dataset with our pretrained models, you need to download the checkpoints:
sh download_pretrain.sh
  • For evaluation, modify the configuration arguments in test.yaml in the configs folder. These arguments are described in CONFIG.md.
python test.py --cfg configs/test.yaml

Visualization

  • The visualization compares features from the two networks (CNN, ViT) on the real --> sketch task of the DomainNet dataset in the 3-shot scenario, before and after adaptation with the FTC strategy.


  • Grad-CAM visualizations on a few samples show the performance of the CNN and ViT branches when the ECB method is applied.

Citation

@InProceedings{Ngo_2024_CVPR,
    author    = {Ngo, Ba Hung and Do-Tran, Nhat-Tuong and Nguyen, Tuan-Ngoc and Jeon, Hae-Gon and Choi, Tae Jong},
    title     = {Learning CNN on ViT: A Hybrid Model to Explicitly Class-specific Boundaries for Domain Adaptation},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
    month     = {June},
    year      = {2024},
    pages     = {28545-28554}
}

License

This project is licensed under the MIT License - see the LICENSE file for details.