
Creating a word cloud in R from the definition of 'Big Data' as provided on Wikipedia. The final result was used as a cover photo for LinkedIn.


anurima-saha/WordCloud


WordCloud

OBJECTIVE:

Creating a word cloud from the definition of "Big Data" as provided by Wikipedia.

OVERVIEW:

INPUT - WEB SCRAPING

For this project, the definition of “Big Data”, along with its characteristics, was used to generate a word cloud after removing certain stop words. To keep the input precise, we used only the introduction, definition and characteristics of Big Data as given in the aforementioned Wikipedia article. Sections such as “Architecture” and “Applications” were not used in this analysis.
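As a rough illustration of the scraping step, the sketch below uses rvest to pull paragraph text out of a parsed page. The helper name `extract_paragraphs`, the inline demo page and the commented-out article URL are assumptions made for this example; the repository's actual selectors and the way it limits the text to the chosen sections are not described here.

```r
library(rvest)

# Hypothetical helper: return the text of every non-empty <p> element in a parsed page.
extract_paragraphs <- function(page) {
  paragraphs <- html_text2(html_elements(page, "p"))
  paragraphs[nzchar(paragraphs)]
}

# Small inline document so the sketch runs without network access.
demo_page <- read_html(
  "<html><body><h2>Definition</h2><p>Big data refers to large data sets.</p><p></p></body></html>"
)
demo_text <- extract_paragraphs(demo_page)

# For the live article (assumed URL, needs network access):
# page <- read_html("https://en.wikipedia.org/wiki/Big_data")
# article_text <- extract_paragraphs(page)
```

Restricting the result to the introduction, definition and characteristics would still need an extra filtering step, for example keeping only the paragraphs that fall under those headings.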

DATA CLEANING:

We converted the data into a text corpus after removing stop words and unnecessary text left over from scraping Wikipedia, such as: [image]
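A minimal sketch of this kind of cleaning with tm is shown below. The sample sentences, the removal of bracketed citation markers such as `[1]`, and the exact sequence of transformations are assumptions for illustration, not necessarily the steps used in this project.

```r
library(tm)

# Illustrative input; in the project this would be the scraped article text.
raw_text <- c(
  "Big data refers to data sets that are too large[1] to be processed.",
  "The characteristics of big data include volume, variety and velocity.[2]"
)

# Assumed example of scraping residue: Wikipedia-style citation markers like [1].
raw_text <- gsub("\\[[0-9]+\\]", "", raw_text)

# Build the corpus and apply standard tm transformations.
corpus <- VCorpus(VectorSource(raw_text))
corpus <- tm_map(corpus, content_transformer(tolower))
corpus <- tm_map(corpus, removePunctuation)
corpus <- tm_map(corpus, removeNumbers)
corpus <- tm_map(corpus, removeWords, stopwords("english"))
corpus <- tm_map(corpus, stripWhitespace)

# Count how often each remaining term occurs across the corpus.
tdm <- TermDocumentMatrix(corpus)
word_freq <- sort(rowSums(as.matrix(tdm)), decreasing = TRUE)
```

Lower-casing before `removeWords` matters here, because the English stop word list in tm is lower case.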

PACKAGES USED -

  • tm
  • NLP
  • rvest
  • RColorBrewer
  • SnowballC
  • stringr
  • wordcloud

FINAL OUTPUT -

The word cloud is saved as a PNG file. [image]
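One way to write a word cloud to a PNG file is sketched below using wordcloud and RColorBrewer. The frequency values, the file name `wordcloud.png`, the image size and the colour palette are all placeholders chosen for this example, not the settings used for the final image.

```r
library(wordcloud)
library(RColorBrewer)

# Illustrative frequencies; in the project these would come from the cleaned corpus.
word_freq <- c(data = 12, big = 9, volume = 5, variety = 4, velocity = 4, analysis = 3)

out_file <- "wordcloud.png"  # hypothetical output path

# Open a PNG graphics device, draw the cloud, then close the device to write the file.
png(out_file, width = 800, height = 600)
set.seed(42)  # fix the random layout so reruns give the same picture
wordcloud(
  words = names(word_freq),
  freq = word_freq,
  min.freq = 1,
  random.order = FALSE,            # place the most frequent words first, near the centre
  colors = brewer.pal(8, "Dark2")
)
dev.off()
```

Calling `dev.off()` is what actually flushes the plot to disk, so the file is only complete after that line runs.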
