I try to load word vectors from a bin file in the directory but I get an error #10078

Are you using the vectors listed here? http://evexdb.org/pmresources/vec-space-models/

One thing to try is converting the vectors from the binary format to the text format. According to this Stack Overflow post, you'll need gensim for this.

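from gensim.models.keyedvectors import KeyedVectors

# Load the vectors from the word2vec binary format.
model = KeyedVectors.load_word2vec_format('PubMed-and-PMC-w2v.bin', binary=True)
# Write them back out in the plain-text word2vec format.
model.save_word2vec_format('PubMed-and-PMC-w2v.txt', binary=False)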
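
If the conversion itself fails, it may help to first confirm that the file really is in word2vec binary format. The sketch below uses only the standard library and assumes the layout written by the original word2vec tool: an ASCII header line holding the vocabulary size and vector dimension, followed by the words and their float32 values. The helper name `read_w2v_bin_header` is hypothetical; it is not part of gensim or spaCy.

```python
def read_w2v_bin_header(path):
    """Return (vocab_size, dim) from the first line of a word2vec .bin file.

    Assumes the original word2vec binary layout; raises ValueError otherwise.
    """
    with open(path, "rb") as f:
        # The header is short, so cap the read in case the file has no newline.
        header = f.readline(100)
    try:
        vocab_size, dim = (int(x) for x in header.split())
    except ValueError as exc:
        raise ValueError(
            f"{path!r} does not start with a 'vocab_size dim' header"
        ) from exc
    return vocab_size, dim
```

Calling `read_w2v_bin_header('PubMed-and-PMC-w2v.bin')` should return two positive integers. A `ValueError` here suggests the file is truncated, compressed, or in some other format.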

Then, try the same command, but point it at the new .txt file, like so:

python -m spacy init vectors en PubMed-and-PMC-w2v.txt output/
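
Before running `init vectors` on a very large file, a quick sanity check of the converted text file can save time. This is a minimal sketch, assuming the word2vec text layout that `save_word2vec_format(..., binary=False)` writes: a `vocab_size dim` header line, then one word per line followed by its `dim` values. `check_w2v_txt` and its `max_rows` parameter are hypothetical names used for illustration.

```python
from itertools import islice


def check_w2v_txt(path, max_rows=1000):
    """Check the header and the first `max_rows` rows of a word2vec .txt file.

    Returns (vocab_size, dim) from the header; raises ValueError on a bad row.
    """
    with open(path, encoding="utf8") as f:
        vocab_size, dim = (int(x) for x in f.readline().split())
        for lineno, line in enumerate(islice(f, max_rows), start=2):
            # Split from the right so the last `dim` fields are the vector.
            pieces = line.rstrip().rsplit(" ", dim)
            if len(pieces) != dim + 1:
                raise ValueError(
                    f"line {lineno}: expected a word plus {dim} values"
                )
            for value in pieces[1:]:
                float(value)  # raises ValueError if a field is not numeric
    return vocab_size, dim
```

For example, `check_w2v_txt('PubMed-and-PMC-w2v.txt')` only reads the first thousand rows, so it returns quickly even on a multi-gigabyte file.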

I'm still downloading the vectors myself to see if this is indeed the solution, but I'm inferring from the docs that the txt format i…

Answer selected by noam267
Labels
feat / vectors Feature: Word vectors and similarity
3 participants