Skip to content

duke - Dataset Understanding via Knowledge-base Embeddings

License

Notifications You must be signed in to change notification settings

uncharted-distil/duke

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

duke -- Dataset Understanding via Knowledge-base Embeddings

  1. First, you should download wiki2vec model (English Wikipedia) using a torrent as described on this link: https://github.com/idio/wiki2vec -- untar it at a preferred location and note it.

  2. Clone this repository, open main.py in scripts/ subdirectory and modify embedding_path variable to the aforementioned wiki2vec model location.

  3. Using python3.5+, pip3 install required libraries in requirements.txt.

  4. pip install the program Duke as

pip3 install git+https://github.com/NewKnowledge/duke
  1. Identify a dataset of interest, and specify its path in main.py (via dataset_path variable).

  2. Finally, run it as follows, and enjoy the magic:

python3 main.py

About

duke - Dataset Understanding via Knowledge-base Embeddings

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 94.8%
  • Dockerfile 5.0%
  • Shell 0.2%