GitHub - ceeskaan/Contra-SUM: This repository contains the source code for my MSc thesis named: Video summarization leveraging Contrastive Learning

Contra-SUM: Video Summarization using Contrastive Learning

This repository includes the source code for my MSc thesis named: Video summarization leveraging Contrastive Learning.

Video Summarization Pipeline

Description

We use the preprocessed datasets provided by: https://github.com/KaiyangZhou/pytorch-vsumm-reinforce
We fetch the input features for the video frames and the ground truth scores from the preprocessed datasets.
These datasets are HDF5 files structured in the following way:

***********************************************************************************************************************************************
/key
    /features                 2D-array with shape (n_steps, feature-dimension)
    /gtscore                  1D-array with shape (n_steps), stores ground truth improtance score (used for training, e.g. regression loss)
    /user_summary             2D-array with shape (num_users, n_frames), each row is a binary vector (used for test)
    /change_points            2D-array with shape (num_segments, 2), each row stores indices of a segment
    /n_frame_per_seg          1D-array with shape (num_segments), indicates number of frames in each segment
    /n_frames                 number of frames in original video
    /picks                    posotions of subsampled frames in original video
    /n_steps                  number of subsampled frames
    /gtsummary                1D-array with shape (n_steps), ground truth summary provided by user (used for training, e.g. maximum likelihood)
    /video_name (optional)    original video name, only available for SumMe dataset
***********************************************************************************************************************************************
Note: OVP and YouTube only contain the first three keys, i.e. ['features', 'gtscore', 'gtsummary']

In addition to this, we utilize the data splitfiles provided by: https://github.com/ok1zjf/VASNet
These files are used for n-fold cross validation, and are structured in the following way:

[
    {
        "test_keys": [
            "eccv16_dataset_tvsum_google_pool5/video_10",
            "eccv16_dataset_tvsum_google_pool5/video_20",
            "eccv16_dataset_tvsum_google_pool5/video_23",
            ...
        ],
        "train_keys": [
            "eccv16_dataset_tvsum_google_pool5/video_1",
            "eccv16_dataset_tvsum_google_pool5/video_11",
            "eccv16_dataset_tvsum_google_pool5/video_12",
            ...

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
figures		figures
models		models
utils		utils
.gitignore		.gitignore
README.md		README.md
randomized_experiment.py		randomized_experiment.py
requirements.txt		requirements.txt
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Contra-SUM: Video Summarization using Contrastive Learning

Video Summarization Pipeline

Description

About

Releases

Packages

Languages

ceeskaan/Contra-SUM

Folders and files

Latest commit

History

Repository files navigation

Contra-SUM: Video Summarization using Contrastive Learning

Video Summarization Pipeline

Description

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages