Solving the Story Clozed Task

The target of this NLP project was to successfully solve the story cloze task. This task was developed to measure advanced commonsense understanding within everyday written stories [1]. To solve the task, one has to determine the correct ending sentence out of two available options, given the four initial context sentences of the story.

In the LSDSem 2017 competition, in which different teams competed to solve the story cloze task, validation accuracies between 0.595 and 0.752 were reached [2]. The major difficulty of this task is to extract semantic information from the story context and use it to determine the correct ending.

Our ﬁnal model relies on a combination of features derived from word and sentence embeddings, as well as from predicted sentence embeddings from a generative RNN. The features are fed into a logistic regression model to perform the classiﬁcation. The ﬁnal implementation was able to reach a validation accuracy of 71% which is near the state-of-the art solutions scores [4].

The following datasets were provided:

Training Set

the frist consisted of the context sentences with the correct ending only: train
the second included an additional incorrect ending to the correct one: validation 1

Evaluation Set

an evaluation set validation 2 with context sentences and labeled endings to determine the accuracy of the classifier

Test Set

a test set test to participate in the competition by classifying unlabeled sentences

Sources:

[1] Nasrin Mostafazadeh et al. A corpus and cloze evaluation for deeper understanding commonsense stories. 2016. [2] Nasrin Mostafazadeh et al. Lsdsem 2017 shared task: The story cloze test. 2017. [3] Story Cloze Test and ROCStories Corpora Description [4] Final Project Report

(Date: 30.05.2018)

Getting Started

Install dependencies by running: python3 setup.py install
Run word2vec.py to create the word emebeddings. They will be saved and reloaded for further task. The code is taken from: https://github.com/tensorflow/tensorflow/blob/r1.8/tensorflow/examples/tutorials/word2vec/word2vec_basic.py
Run main_story_clozed_task.py with the training_mode rnn option set to true. The first run will take a couple minutes because multiple embeddings will have to be loaded and calculated.
Run 'main_story_clozed_task.py' with the training_mode rnn option set to false. The model will predict on the validation set and display the validation accuracy. A output file containing predictions on the test set is saved in the current folder.

Authors

Mauro Luzzatto - Maurol
Dario Kneubuehler - Darkneu
Thomas Brunschwiler

Name		Name	Last commit message	Last commit date
Latest commit History 50 Commits
Report		Report
data		data
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
feature_extraction.py		feature_extraction.py
main_story_clozed_task.py		main_story_clozed_task.py
recurrent_neural_network.py		recurrent_neural_network.py
setup.py		setup.py
word2vec.py		word2vec.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Solving the Story Clozed Task

Training Set

Evaluation Set

Test Set

Sources:

Getting Started

Authors

About

Releases

Packages

Contributors 2

Languages

MauroLuzzatto/story-cloze-task

Folders and files

Latest commit

History

Repository files navigation

Solving the Story Clozed Task

Training Set

Evaluation Set

Test Set

Sources:

Getting Started

Authors

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages