Skip to content

This project was run in DataBricks using spark to analyze the recent news in 'cancer' for sentiment evaluation. The goal of this project is to practice traditional NLP like tokenization, stopwords, CV and TF-IDF, N-grams. Also, this project applied tools like AWS S3, athena, QuickSight etc. to address big data.

Notifications You must be signed in to change notification settings

MaggieUBC/Cancer-Sentiment-Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 

Repository files navigation

Cancer-Sentiment-Analysis

Summary

  • This project aims to practice NLP using spark and process bigdata using Amazon Web Service.
  • The workflow showed below:

workflow

About

This project was run in DataBricks using spark to analyze the recent news in 'cancer' for sentiment evaluation. The goal of this project is to practice traditional NLP like tokenization, stopwords, CV and TF-IDF, N-grams. Also, this project applied tools like AWS S3, athena, QuickSight etc. to address big data.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published