ELECTRA

中文预训练 ELECTREA 模型: 基于对抗学习 pretrain Chinese Model

code Repost from google official code: https://github.com/google-research/electra

具体使用说明：参考官方链接

Electra Chinese tiny模型路径

google drive

electra-tiny google-drive

baidu drive

electra-tiny baidu-pan code:rs99

模型说明

与 tinyBERT 的配置相同
generator 为 discriminator的 1/4

How to use official code

Steps

修改 configure_pretraining.py 里面的数据路径、tpu、gpu 配置
修改 model_size：可在 code/util/training_utils.py 里面自行定义模型大小
数据输入格式：原始的input_ids, input_mask, segment_ids，训练过程中会在线做 uniform mask sampling（不需要离线生成 masked input ids）

Performance

gen+disc:

electra-tiny

metric	value
disc_accuracy	0.95093095
disc_auc	0.9762006
disc_loss	0.14071295
disc_precision	0.8018275
disc_recall	0.6088053
loss	9.516352
masked_lm_accuracy	0.46732807
masked_lm_loss	2.8209455
sampled_masked_lm_accuracy	0.3504382

The model are trained on CLUE 10G Chinese Corpus with 1M-steps

Downstream finetuning on CLUE benchmark:

注：only use pretrained electra-tiny with layer-wise learning rate decay without any distilaltion、data-augmentation. learning rate is set to 1e-4 for each task and run 10-epochs. (According to official results, the results may have large variance)

	AFQMC	TNEWS	IFLYTEK	CMNLI	WSC	CSL
Metrics	Acc	Acc	Acc	Acc	Acc	Acc
ELECTRA-tiny	70.319	54.280	53.538	73.745	64.336	78.700
Roberta-tiny	69.904	54.150	56.808	74.037	64.336	74.133

注：

electra 在多分类问题上面可能会有 performance 下降
gen、disc的规模配比比较hacky，与 mask的方法等相关

报名NLPCC-高性能小模型测评

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
.DS_Store		.DS_Store
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ELECTRA

Electra Chinese tiny模型路径

google drive

baidu drive

模型说明

How to use official code

Steps

Performance

gen+disc:

Downstream finetuning on CLUE benchmark:

About

Releases

Packages

CLUEbenchmark/ELECTRA

Folders and files

Latest commit

History

Repository files navigation

ELECTRA

Electra Chinese tiny模型路径

google drive

baidu drive

模型说明

How to use official code

Steps

Performance

gen+disc:

Downstream finetuning on CLUE benchmark:

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages