Skip to content

End to End NLP text similarity project, using kaggle Quora Dataset , served as a REST API via Flask

License

Notifications You must be signed in to change notification settings

R-aryan/Text-Similarity-Using-BERT

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

64 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Text Similarity Using BERT

  • End to End NLP text similarity project , served as a REST API via Flask App.
  • The Kaggle dataset can be found Here Click Here

Steps to run the project Click Here

Dataset Description

  • id - the id of a training set question pair
  • qid1, qid2 - unique ids of each question (only available in train.csv)
  • question1, question2 - the full text of each question
  • is_duplicate - the target variable, set to 1 if question1 and question2 have essentially the same meaning, and 0 otherwise.

Goal

The goal of this competition is to predict which of the provided pairs of questions contain two questions with the same meaning.

Following are the screenshots for the sample request and sample response.

  • Request sample

Sample request

  • Response Sample

Sample response

sample request and response