Creatig a word cloud from the definition of "Big Data" as provided by Wikipedia.
For this project, the definition of “Big Data” along with its characteristics has been used to generate a word could after removing certain stop words. To establish a precise model for model comparison we have only used the introduction, definition and characteristics of Big Data as provided in the aforementioned article from Wikipedia. We have not used any information on “Architecture”, “Applications” etc for this analysis.
We have converted the data into a text corpus after removing Stopwords and unnecessary text resulting from scrapping Wikipedia like:
- tm
- NLP
- rvest
- RColorBrewer
- SnowballC
- Stringr
- worldcloud