Skip to content

Latest commit

 

History

History
31 lines (26 loc) · 517 Bytes

README.md

File metadata and controls

31 lines (26 loc) · 517 Bytes

NameVector

A name finder using machine learning techniques

#Links word2vec spark

#Dependencies

  • jdk-8
  • Gson
  • spark

#Installation

cd word2vec
make
chmod +x *.sh

#Word2vec ##Training

word2vec/./word2vec -train <<input_file>> -output <<output_file>> -cbow 0 -size 200 -window 5" +
  			" -negative 0 -hs 1 -sample 1e-3 -threads 12 -binary 1

##Distance

distance <<Trained_model_file>>