Automated Phrase Mining from Massive Text Corpora in Python.
-
Updated
May 23, 2021 - Python
Automated Phrase Mining from Massive Text Corpora in Python.
A simplified version of the ECON pipeline from the Concept Mining via Embedding paper
For a corpus linguistics project, I created an information retrieval program called "You Are Not Alone". My phrase_finder() function searches for a self-identifying phrase in 4 large classic texts (The Souls of Black Folk, Jane Eyre, The Strange Case of Dr. Jekyll & Mr. Hyde, and Frankenstein). Standpoint: "So Matilda’s strong young mind continu…
Add a description, image, and links to the phrase-mining topic page so that developers can more easily learn about it.
To associate your repository with the phrase-mining topic, visit your repo's landing page and select "manage topics."