Skip to content

Commit

Permalink
LODlit package updates
Browse files Browse the repository at this point in the history
  • Loading branch information
AndreiNesterov committed Sep 11, 2023
1 parent 4dded71 commit b14dc89
Show file tree
Hide file tree
Showing 15 changed files with 49 additions and 14 deletions.
7 changes: 7 additions & 0 deletions LODlit_package/README.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,7 @@
LODlit simplifies retrieval of literals from Linked Open Data.
Different LOD-datasets are available online in different formats with diffferent user-friendliness levels.
LODlit allows you to search over different linked open datasets in one place using keywords and outputs the search results in the same json structure convinient for further processing.
For example, LODlit retrieves labels, aliases, and descriptions of Wikidata entities by search terms in a specific language with optional search filtering. It is also possible to get literals in different languages by entity identifiers.
Additionally, LODlit provides the functionality to make bag-of-words from literals for natural language processing, for example, to calculate cosine similarity between literals.
Currently, LODlit supports parsing from Wikidata, Getty Art & Architecture Thesaurus (AAT), Princeton WordNet (3.1), and Open Dutch WordNet (1.3).
The license is CC BY-SA 4.0 (https://creativecommons.org/licenses/by-sa/4.0/).
Binary file removed LODlit_package/dist/LODlit-0.0.0-py3-none-any.whl
Binary file not shown.
Binary file removed LODlit_package/dist/LODlit-0.0.0.tar.gz
Binary file not shown.
Binary file added LODlit_package/dist/LODlit-0.5.0-py3-none-any.whl
Binary file not shown.
Binary file added LODlit_package/dist/LODlit-0.5.0.tar.gz
Binary file not shown.
12 changes: 3 additions & 9 deletions LODlit_package/setup.py
Original file line number Diff line number Diff line change
Expand Up @@ -2,14 +2,13 @@

setup(
name='LODlit',
version='0.1.0',
version='0.5.0',
description = "Retrieving literal values from LOD",
author = "Andrei Nesterov",
author_email = "[email protected]",
license = "CC BY 4.0",
keywords = ["LOD", "literals", "linked open data", "strings", "NLP"],
url='https://github.com/cultural-ai/LODlit/LODlit_package',
# packages=['LODlit'],
url='https://github.com/cultural-ai/LODlit/tree/main/LODlit_package',
install_requires=[
'nltk==3.8.1',
'pandas==2.0.3',
Expand All @@ -18,9 +17,4 @@
'simplemma==0.9.1',
'spacy==3.6.1',
'SPARQLWrapper==2.0.0',
'lxml==4.9.3'
],

classifiers=[
],
)
'lxml==4.9.3'])
16 changes: 15 additions & 1 deletion LODlit_package/src/LODlit.egg-info/PKG-INFO
Original file line number Diff line number Diff line change
@@ -1,4 +1,18 @@
Metadata-Version: 2.1
Name: LODlit
Version: 0.0.0
Version: 0.5.0
Summary: Retrieving literal values from LOD
Home-page: https://github.com/cultural-ai/LODlit/tree/main/LODlit_package
Author: Andrei Nesterov
Author-email: [email protected]
License: CC BY 4.0
Keywords: LOD,literals,linked open data,strings,NLP
License-File: LICENSE
Requires-Dist: nltk==3.8.1
Requires-Dist: pandas==2.0.3
Requires-Dist: numpy==1.21.0
Requires-Dist: requests==2.31.0
Requires-Dist: simplemma==0.9.1
Requires-Dist: spacy==3.6.1
Requires-Dist: SPARQLWrapper==2.0.0
Requires-Dist: lxml==4.9.3
4 changes: 4 additions & 0 deletions LODlit_package/src/LODlit.egg-info/SOURCES.txt
Original file line number Diff line number Diff line change
@@ -1,6 +1,9 @@
LICENSE
README.md
README.txt
pyproject.toml
setup.py
src/LODlit/__init__.py
src/LODlit/aat.py
src/LODlit/bows.py
src/LODlit/odwn.py
Expand All @@ -9,4 +12,5 @@ src/LODlit/wd.py
src/LODlit.egg-info/PKG-INFO
src/LODlit.egg-info/SOURCES.txt
src/LODlit.egg-info/dependency_links.txt
src/LODlit.egg-info/requires.txt
src/LODlit.egg-info/top_level.txt
8 changes: 8 additions & 0 deletions LODlit_package/src/LODlit.egg-info/requires.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,8 @@
nltk==3.8.1
pandas==2.0.3
numpy==1.21.0
requests==2.31.0
simplemma==0.9.1
spacy==3.6.1
SPARQLWrapper==2.0.0
lxml==4.9.3
Empty file.
2 changes: 1 addition & 1 deletion LODlit_package/src/LODlit/aat.py
Original file line number Diff line number Diff line change
Expand Up @@ -12,7 +12,7 @@
import simplemma
import pandas as pd
import requests
import bows
from . import bows

def main():
if __name__ == "__main__":
Expand Down
4 changes: 4 additions & 0 deletions LODlit_package/src/LODlit/bows.py
Original file line number Diff line number Diff line change
Expand Up @@ -13,6 +13,10 @@
import simplemma
import spacy # numpy version 1.21

def main():
if __name__ == "__main__":
main()

def make_bows(text:list, lang:str, merge_bows=False) -> list:
'''
Makes a BoW from a list of str:
Expand Down
6 changes: 5 additions & 1 deletion LODlit_package/src/LODlit/odwn.py
Original file line number Diff line number Diff line change
Expand Up @@ -12,7 +12,11 @@
from nltk.corpus import stopwords
from nltk.stem import WordNetLemmatizer
import simplemma
import bows
from . import bows

def main():
if __name__ == "__main__":
main()

def _set_odwn(path_odwn:str):
"""
Expand Down
2 changes: 1 addition & 1 deletion LODlit_package/src/LODlit/pwn31.py
Original file line number Diff line number Diff line change
Expand Up @@ -13,7 +13,7 @@
from nltk.stem import WordNetLemmatizer
import warnings
import pandas as pd
import bows
from . import bows

def main():

Expand Down
2 changes: 1 addition & 1 deletion LODlit_package/src/LODlit/wd.py
Original file line number Diff line number Diff line change
Expand Up @@ -11,7 +11,7 @@
import pandas as pd
import gzip
import warnings
import bows
from . import bows

def main():
if __name__ == "__main__":
Expand Down

0 comments on commit b14dc89

Please sign in to comment.