This NCE module is forked from the pytorch/examples repo.
Please run `pip install -r requirements` first to make sure you have the required Python libraries. `tqdm` is used for the progress bar during training.
- `--nce`: whether to use NCE as the approximation (see the sketch after this list)
- `--noise-ratio <10>`: number of noise samples per data sample
- `--norm-term <9>`: the constant normalization term `ln(Z)`
- `--index-module <linear>`: index module to use for the NCE module (currently `linear` and `gru` are available; `gru` does not support PPL calculation)
- `--train`: train a new model, or just evaluate an existing one
- `--vocab <None>`: use the given vocabulary file if specified, otherwise use the words in train.txt
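To make the roles of `--noise-ratio` and `--norm-term` concrete, here is a minimal, hypothetical sketch of the NCE objective (the function name and signature are illustrative, not the actual `nce.py` interface): with `k` noise samples per data sample and a constant normalization term `ln(Z)`, the model is trained to classify whether each word came from the data or from the noise distribution.

```python
import torch

def nce_loss(target_scores, noise_scores, target_noise_probs, noise_noise_probs,
             noise_ratio=10, norm_term=9.0):
    """Hypothetical sketch of the NCE objective, not the repo's nce.py interface.

    target_scores:      (batch,) unnormalized model scores of the true next words
    noise_scores:       (batch, noise_ratio) scores of words sampled from the noise distribution
    target_noise_probs: (batch,) noise-distribution probabilities of the true words
    noise_noise_probs:  (batch, noise_ratio) noise-distribution probabilities of the samples
    """
    k = noise_ratio
    # exp(score - ln(Z)) plays the role of the model probability; under NCE the
    # normalization term is treated as a constant instead of a softmax sum.
    p_model_target = torch.exp(target_scores - norm_term)
    p_model_noise = torch.exp(noise_scores - norm_term)

    # Posterior probability that each word came from the data rather than the noise
    p_data = p_model_target / (p_model_target + k * target_noise_probs)
    p_noise = (k * noise_noise_probs) / (p_model_noise + k * noise_noise_probs)

    # Binary-classification log-likelihood, averaged over the batch
    return -(torch.log(p_data) + torch.log(p_noise).sum(dim=1)).mean()
```

Because the normalization term is a constant, this objective avoids computing a softmax over the whole vocabulary, which is what makes NCE cheaper than the conventional CE criterion.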
Run the NCE criterion with the `linear` index module:

```bash
python main.py --cuda --noise-ratio 10 --norm-term 9 --nce --train
```

Run the NCE criterion with the `gru` index module:

```bash
python main.py --cuda --noise-ratio 10 --norm-term 9 --nce --train --index-module gru
```

Run the conventional CE criterion:

```bash
python main.py --cuda --train
```
- `log/`: some log files of this script
- `nce.py`: the NCE module wrapper
- `index_linear.py`: an index module used by NCE, as a replacement for the normal Linear module
- `index_gru.py`: an index module used by NCE, as a replacement for the whole language model module
- `model.py`: the wrapper of all `nn.Module`s
- `main.py`: the entry point
- `utils.py`: some utility functions for better abstraction
This example trains a multi-layer RNN (Elman, GRU, or LSTM) on a language modeling task. By default, the training script uses the PTB dataset, which is provided. The trained model can then be used by the generate script to generate new text.
```bash
python main.py --cuda --epochs 6  # Train an LSTM on PTB with CUDA
```
The model uses the `nn.LSTM` module, which will automatically use the cuDNN backend if run on CUDA with cuDNN installed.
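As a quick illustration (a standalone sketch, not `model.py` itself), moving the module and its inputs to the GPU is all that is needed for the cuDNN dispatch:

```python
import torch
import torch.nn as nn

# Standalone sketch: nn.LSTM dispatches to cuDNN automatically when the module
# and its inputs live on a CUDA device and cuDNN is available.
lstm = nn.LSTM(input_size=200, hidden_size=200, num_layers=2)
x = torch.randn(35, 20, 200)  # (seq_len, batch, emsize)

if torch.cuda.is_available():
    lstm, x = lstm.cuda(), x.cuda()
    print('cuDNN enabled:', torch.backends.cudnn.enabled)

output, (h_n, c_n) = lstm(x)
print(output.shape)  # torch.Size([35, 20, 200])
```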
During training, if a keyboard interrupt (Ctrl-C) is received, training is stopped and the current model is evaluated against the test dataset.
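The usual pattern behind this behavior looks like the following (a minimal stand-in, with a sleep in place of the real epoch loop in `main.py`):

```python
import time

try:
    for epoch in range(1, 7):
        print(f'epoch {epoch}')
        time.sleep(1)  # stands in for one epoch of training
except KeyboardInterrupt:
    print('Exiting from training early')

# Evaluation still runs with whatever model exists so far
print('evaluating on the test set')
```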
The `main.py` script accepts the following arguments:
```
optional arguments:
  -h, --help         show this help message and exit
  --data DATA        location of the data corpus
  --emsize EMSIZE    size of word embeddings
  --nhid NHID        number of hidden units per layer
  --nlayers NLAYERS  number of layers
  --lr LR            initial learning rate
  --lr-decay         learning rate decay when no progress is observed on the validation set
  --weight-decay     weight decay (L2 regularization)
  --clip CLIP        gradient clipping
  --epochs EPOCHS    upper epoch limit
  --batch-size N     batch size
  --dropout DROPOUT  dropout applied to layers (0 = no dropout)
  --seed SEED        random seed
  --cuda             use CUDA
  --log-interval N   report interval
  --save SAVE        path to save the final model
```
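For example, several of these flags can be combined in a single run (the values below are illustrative, not recommended settings):

```bash
python main.py --cuda --train --emsize 200 --nhid 200 --nlayers 2 \
    --lr 20 --clip 0.25 --epochs 6 --batch-size 20 --dropout 0.5 --save model.pt
```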