Commit

Update 2021-09-01-self-refine.md
guotong1988 authored Jul 4, 2023
1 parent 90a591b commit 35327a1
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion research/_posts/2021-09-01-self-refine.md
@@ -13,7 +13,7 @@ description: "Self-Refine Learning For Data-Centric Deep Learning"

### Abstract

- In industry NLP application, our manually labeled data has a certain number of noise data. We present a simple method to find the noise data and remove them. We select the noise data whose human label is not contained in the top-K model's predictions. The experiment result shows that our method works. For industry deep learning application, our method improve the text classification accuracy from 80.5% to 90.6% in dev dataset, and improve the human-evaluation accuracy from 83.2% to 90.1%. The conclusion is: the self-predict and self-drop method of this paper can not improve the accuracy to more than 95%, without human labeling again for the training dataset.
+ In industry NLP application, our manually labeled data has a certain number of noise data. We present a simple method to find the noise data and remove them. We select the noise data whose human label is not contained in the top-K model's predictions. The experiment result shows that our method works. For industry deep learning application, our method improve the text classification accuracy from 80.5% to 90.6% in dev dataset, and improve the human-evaluation accuracy from 83.2% to 90.1%. The conclusion is: The self-predict and self-drop method of this paper can not improve the accuracy to more than 95%, without human labeling again for the training dataset.
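
The selection rule in the abstract (flag a sample as noise when its human label is not among the model's top-K predictions) can be sketched roughly as below. This is an illustration only, not the post's implementation: the function name `split_by_top_k`, the use of NumPy, the value of `k`, and the toy probabilities are all assumptions, since the abstract gives neither K nor how the predictions are produced.

```python
import numpy as np

def split_by_top_k(probs, labels, k=3):
    """Split sample indices into (keep, drop).

    A sample is dropped when its human label is NOT among the k classes
    with the highest predicted probability. Hypothetical helper; the value
    of k is an assumption, not taken from the post.
    """
    probs = np.asarray(probs, dtype=float)   # shape: (n_samples, n_classes)
    labels = np.asarray(labels)              # shape: (n_samples,)
    # Indices of the k highest-probability classes for each sample.
    top_k = np.argsort(-probs, axis=1)[:, :k]
    # True where the human label appears among that sample's top-k classes.
    in_top_k = (top_k == labels[:, None]).any(axis=1)
    return np.flatnonzero(in_top_k), np.flatnonzero(~in_top_k)

# Toy example: 3 samples, 3 classes, k=2 (made-up numbers).
toy_probs = [
    [0.7, 0.2, 0.1],   # human label 0 is in the top-2 -> keep
    [0.1, 0.3, 0.6],   # human label 0 is not in the top-2 -> drop
    [0.5, 0.4, 0.1],   # human label 1 is in the top-2 -> keep
]
toy_labels = [0, 0, 1]
keep, drop = split_by_top_k(toy_probs, toy_labels, k=2)
print("keep:", keep.tolist(), "drop:", drop.tolist())
```

In a self-predict and self-drop loop, the retained indices would presumably be used to retrain the model; whether and how often that step is repeated is not stated in the abstract.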


#### Keywords
