Skip to content

This repository contains the source code for my MSc thesis named: Video summarization leveraging Contrastive Learning

Notifications You must be signed in to change notification settings

ceeskaan/Contra-SUM

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Contra-SUM: Video Summarization using Contrastive Learning

This repository includes the source code for my MSc thesis named: Video summarization leveraging Contrastive Learning.

Video Summarization Pipeline

Description

We use the preprocessed datasets provided by: https://github.com/KaiyangZhou/pytorch-vsumm-reinforce
We fetch the input features for the video frames and the ground truth scores from the preprocessed datasets.
These datasets are HDF5 files structured in the following way:

***********************************************************************************************************************************************
/key
    /features                 2D-array with shape (n_steps, feature-dimension)
    /gtscore                  1D-array with shape (n_steps), stores ground truth improtance score (used for training, e.g. regression loss)
    /user_summary             2D-array with shape (num_users, n_frames), each row is a binary vector (used for test)
    /change_points            2D-array with shape (num_segments, 2), each row stores indices of a segment
    /n_frame_per_seg          1D-array with shape (num_segments), indicates number of frames in each segment
    /n_frames                 number of frames in original video
    /picks                    posotions of subsampled frames in original video
    /n_steps                  number of subsampled frames
    /gtsummary                1D-array with shape (n_steps), ground truth summary provided by user (used for training, e.g. maximum likelihood)
    /video_name (optional)    original video name, only available for SumMe dataset
***********************************************************************************************************************************************
Note: OVP and YouTube only contain the first three keys, i.e. ['features', 'gtscore', 'gtsummary']

In addition to this, we utilize the data splitfiles provided by: https://github.com/ok1zjf/VASNet
These files are used for n-fold cross validation, and are structured in the following way:

[
    {
        "test_keys": [
            "eccv16_dataset_tvsum_google_pool5/video_10",
            "eccv16_dataset_tvsum_google_pool5/video_20",
            "eccv16_dataset_tvsum_google_pool5/video_23",
            ...
        ],
        "train_keys": [
            "eccv16_dataset_tvsum_google_pool5/video_1",
            "eccv16_dataset_tvsum_google_pool5/video_11",
            "eccv16_dataset_tvsum_google_pool5/video_12",
            ...

About

This repository contains the source code for my MSc thesis named: Video summarization leveraging Contrastive Learning

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages