Cross-Domain Data Augmentation with Domain-Adaptive Language Modeling for Aspect-Based Sentiment Analysis
The code for our ACL 2023 paper (https://aclanthology.org/2023.acl-long.81/):
Jianfei Yu, Qiankun Zhao, Rui Xia. "Cross-Domain Data Augmentation with Domain-Adaptive Language Modeling for Aspect-Based Sentiment Analysis"
The training data comes from four domains: Restaurant (R), Laptop (L), Service (S), and Devices (D).
The in-domain corpus (used for training BERT-E) comes from Yelp and Amazon reviews.
Click here to get BERT-E (BERT-Extended); the extraction code is by0i. (Please specify the directory where BERT is stored in modelconfig.py.)
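A minimal sketch of how the BERT-E path might be registered (the dictionary name and paths below are placeholders; use whatever names modelconfig.py actually defines):

```python
# modelconfig.py -- illustrative only; adapt the keys and paths to the real file.
# Map each pretrained model name used by the scripts to its local directory.
MODEL_ARCHIVE_MAP = {
    'bert-base-uncased': '/path/to/bert-base-uncased/',
    'bert-extended': '/path/to/BERT-E/',  # directory holding the downloaded BERT-E checkpoint
}
```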
1. To assign pseudo labels to the unlabeled data in the target domain, run the following command (an illustrative sketch of this step is shown after the command):
bash pseudo_label.sh
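The script above handles this end to end; as a rough illustration of the pseudo-labeling idea only (the file names, paths, and the token-classification head below are assumptions, not the exact code in this repo):

```python
# Illustrative pseudo-labeling loop: tag target-domain unlabeled sentences
# with a model trained on the labeled source domain.
import torch
from transformers import BertTokenizerFast, BertForTokenClassification

tokenizer = BertTokenizerFast.from_pretrained('/path/to/BERT-E/')
model = BertForTokenClassification.from_pretrained('/path/to/source_trained_absa_model/')
model.eval()
id2label = model.config.id2label  # e.g. BIO aspect tags with sentiment

with open('target_unlabeled.txt') as fin, open('target_pseudo_labeled.txt', 'w') as fout:
    for line in fin:
        words = line.strip().split()
        if not words:
            continue
        enc = tokenizer(words, is_split_into_words=True, return_tensors='pt', truncation=True)
        with torch.no_grad():
            pred_ids = model(**enc).logits[0].argmax(-1).tolist()
        # Keep the prediction of the first sub-token of each word.
        tags, seen = [], set()
        for idx, wid in enumerate(enc.word_ids()):
            if wid is not None and wid not in seen:
                seen.add(wid)
                tags.append(id2label[pred_ids[idx]])
        fout.write(' '.join(f'{w}={t}' for w, t in zip(words, tags)) + '\n')
```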
2. Train a domain-adaptive language model (DALM), generate target-domain labeled data, and finally use the generated data for the main task. We use LSTM and GPT-2 as the decoder in language modeling, respectively.
2.1 To train the GPT-2-based DALM for data generation and evaluation, run the following command (an illustrative sketch of the generation step follows):
bash GPT2.sh
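For intuition, here is a minimal sketch of what sampling data from a GPT-2 DALM can look like (the checkpoint path, the `<target>` control token, and the decoding settings are assumptions; GPT2.sh performs the actual training and generation):

```python
# Illustrative sampling step from a fine-tuned GPT-2 DALM.
# The DALM is trained on linearized sequences that interleave labels and words,
# so sampled sequences are new target-domain sentences with inline aspect/sentiment tags.
from transformers import GPT2LMHeadModel, GPT2TokenizerFast

tokenizer = GPT2TokenizerFast.from_pretrained('/path/to/gpt2_dalm_checkpoint/')
model = GPT2LMHeadModel.from_pretrained('/path/to/gpt2_dalm_checkpoint/')
model.eval()

prompt = tokenizer('<target>', return_tensors='pt')  # hypothetical domain-control prefix
outputs = model.generate(
    **prompt,
    do_sample=True,
    top_k=50,
    top_p=0.95,
    max_length=64,
    num_return_sequences=5,
    pad_token_id=tokenizer.eos_token_id,
)
for seq in outputs:
    print(tokenizer.decode(seq, skip_special_tokens=True))
```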
2.2 To train the LSTM-based DALM for data generation and evaluation, run the following command (a minimal model sketch is shown after the command):
bash LSTM.sh
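As a rough reference, a DAGA-style LSTM language model boils down to a standard next-token predictor over the linearized word/tag sequences (the hyperparameters below are illustrative, not the paper's):

```python
# Minimal LSTM language model for linearized labeled sentences.
import torch.nn as nn

class LSTMLanguageModel(nn.Module):
    def __init__(self, vocab_size, embed_dim=300, hidden_dim=512, num_layers=1, dropout=0.5):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_dim)
        self.lstm = nn.LSTM(embed_dim, hidden_dim, num_layers, batch_first=True)
        self.drop = nn.Dropout(dropout)
        self.proj = nn.Linear(hidden_dim, vocab_size)

    def forward(self, input_ids, hidden=None):
        # input_ids: (batch, seq_len) token ids of a linearized word/tag sequence
        emb = self.drop(self.embed(input_ids))
        out, hidden = self.lstm(emb, hidden)
        return self.proj(self.drop(out)), hidden  # next-token logits at every position

# Training uses standard cross-entropy between logits[:, :-1] and input_ids[:, 1:];
# generation samples one token at a time from the softmax over the logits.
```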
- Some code in the LSTM-based language modeling is based on the code of DAGA, many thanks!