

lexiconPT: An R package that provides lexicons for Portuguese Text Analysis

lexiconPT was developed to make text analysis and mining of Portuguese texts easier for analysts and researchers working in R. Its main contribution is providing cleaned versions of some well-known Portuguese lexicons, such as OpLexicon and SentiLex. These datasets can be loaded into R using the data() function:

# Attach the package so that data() can find its datasets
library(lexiconPT)
data("sentiLex_lem_PT02")
data("oplexicon_v2.1")
data("oplexicon_v3.0")

You can also look up the sentiment of a single word with the function lexiconPT::get_word_sentiment():

get_word_sentiment("temer")
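
Beyond single-word lookups, a lexicon is usually matched against the tokens of a whole text. The following is only a minimal sketch of that idea in base R, not the package's own method: `toy_lexicon`, `tokenize()` and `score_text()` are hypothetical names introduced for illustration, and the `term` and `polarity` column names are assumptions, so check the structure of the real datasets with str() before adapting it.

```r
# Hypothetical stand-in for a lexicon; the real datasets may use
# different column names and contain many more terms.
toy_lexicon <- data.frame(
  term = c("bom", "ruim", "feliz", "triste"),
  polarity = c(1, -1, 1, -1),
  stringsAsFactors = FALSE
)

# Split a text into lowercase tokens on whitespace and punctuation.
tokenize <- function(text) {
  tokens <- unlist(strsplit(tolower(text), "[[:space:][:punct:]]+"))
  tokens[nzchar(tokens)]
}

# Sum the polarity of every token found in the lexicon;
# tokens missing from the lexicon contribute nothing.
score_text <- function(text, lexicon) {
  tokens <- tokenize(text)
  matched <- lexicon$polarity[match(tokens, lexicon$term)]
  sum(matched, na.rm = TRUE)
}

score_text("O dia foi bom e eu estou feliz", toy_lexicon)
# 2
```

Summing polarities is the simplest possible scoring rule; it ignores negation and word order, so treat the result as a rough signal only.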

The cleaning process can be seen in data-raw/setup.R. Note that cleaning the SentiLex dataset into a tidy format required some assumptions (I plan to write a blog post about them). Both OpLexicon versions, on the other hand, were already clean.

To learn how the lexicons were developed, please see the references listed in the help pages of the datasets.

To install lexiconPT, please run devtools::install_github("sillasgonzaga/lexiconPT").
