This repository contains the dataset and code for our paper.
Human action co-occurrence in lifestyle vlogs: two actions co-occur if they appear in the same 10-second interval of a video. Actions are represented as nodes in a graph, the co-occurrence relation between two actions is represented as a link between their nodes, and the action co-occurrence identification task is framed as a link prediction task.
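To make the definition concrete, here is a minimal sketch of extracting co-occurring action pairs from timestamped annotations. The `actions` structure, its field layout, and the interpretation of the 10-second interval as a timestamp difference are illustrative assumptions, not the repository's exact implementation:

```python
from itertools import combinations

# Hypothetical input: per-video list of (action_label, start_time_in_seconds).
actions = {
    "video_1": [("chop onion", 3.0), ("stir pan", 8.5), ("wash hands", 41.0)],
}

WINDOW = 10.0  # assumed reading: two actions co-occur if their timestamps are within 10 s

def co_occurring_pairs(video_actions, window=WINDOW):
    """Yield unordered pairs of actions whose timestamps fall within `window` seconds."""
    for (a, t_a), (b, t_b) in combinations(video_actions, 2):
        if abs(t_a - t_b) <= window:
            yield tuple(sorted((a, b)))

edges = {pair for acts in actions.values() for pair in co_occurring_pairs(acts)}
print(edges)  # {('chop onion', 'stir pan')}
```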
- The resources for creating the data are in `data/utils`. More details in `data_processing.py`.
- The graph is saved in `data/graph/edges.csv`.
- The node embeddings are saved in `data/graph/{default_feat_nodes}_nodes.csv`, where `default_feat_nodes` can be: `"txt_action"`, `"txt_transcript"`, `"vis_action"`, `"vis_video"`, `"vis_action_video"`. More details in `link_prediction.py`.
- Sample frames (4 frames per video) and their action labels are in `frames_sample`.
- The visual features are in `data/clip_features.pt`.
- The textual and graph features are in `data/graph` and are computed in `data_processing.py`, in the `save_nodes_edges_df` function. See the loading sketch after this list for a quick check of these files.
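As a quick sanity check that the released files load as expected, here is a hedged sketch. It assumes only the file paths listed above; nothing about the CSV schemas is assumed, the columns are simply printed:

```python
import pandas as pd
import torch

# Co-occurrence graph edges.
edges = pd.read_csv("data/graph/edges.csv")
print("edges:", edges.shape, list(edges.columns))

# Node embeddings for one feature type, e.g. textual action embeddings.
nodes = pd.read_csv("data/graph/txt_action_nodes.csv")
print("nodes:", nodes.shape, list(nodes.columns))

# Visual CLIP features, saved as a PyTorch object.
clip_features = torch.load("data/clip_features.pt")
print("clip features:", type(clip_features))
```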
```bash
conda env create
conda activate action_order
pip install -r requirements.txt
spacy download en_core_web_sm
spacy download en_core_web_trf
```
- Run data collection and processing from `data_processing.py`
- Run action co-occurrence / link prediction models from `link_prediction.py` (a minimal link prediction sketch follows this list)
- Run downstream task experiments from `action_downstream.py` and from the `get_nearest_neighbours` function in `link_prediction.py`
- Run data analyses from `data_analysis.ipynb`
- Run video-related scripts from `utils`
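For orientation, here is a minimal link prediction baseline in the spirit of the task above. It is a sketch, not the pipeline in `link_prediction.py`: the synthetic embeddings, the Hadamard-product edge features, and the logistic regression classifier are all illustrative assumptions:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

# Synthetic stand-ins for the node embeddings; in the repo they would come
# from the data/graph/*_nodes.csv files.
rng = np.random.default_rng(0)
embeddings = {f"action_{i}": rng.normal(size=32) for i in range(50)}

# Toy positive (co-occurring) and negative (non-co-occurring) action pairs.
positive_edges = [(f"action_{i}", f"action_{i + 1}") for i in range(49)]
negative_edges = [(f"action_{i}", f"action_{(i + 25) % 50}") for i in range(49)]

def edge_features(u, v):
    # Hadamard product: a standard way to turn two node vectors into one edge vector.
    return embeddings[u] * embeddings[v]

X = np.stack([edge_features(u, v) for u, v in positive_edges + negative_edges])
y = np.array([1] * len(positive_edges) + [0] * len(negative_edges))

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y
)
clf = LogisticRegression(max_iter=1000).fit(X_train, y_train)
print("held-out accuracy:", clf.score(X_test, y_test))
```

With real embeddings in place of the random ones, this edge-feature plus classifier recipe is a common link prediction baseline; the toy accuracy here is near chance because the synthetic embeddings carry no signal.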