The aim of this project is to automatically recognize human actions by analysing body landmarks obtained through pose estimation.
Analysing people's actions and activities in public and private environments is essential for security. Doing this manually is impractical, since surveillance cameras produce many hours of video footage every day, and detecting and alerting on suspicious activities in real time is equally challenging. Deep learning-based action recognition addresses this problem.
The following are the major tasks performed:
- Implementation of Convolutional Neural Network-based pose estimation for body landmark detection
- Implementation of pose-feature-based action recognition, improved through graphical feature representation and data augmentation of body landmarks
- Preparation and preprocessing of the image datasets
- Fine-tuning of the action recognition model with better feature representation and data augmentation
- Development, error analysis and iterative improvement of the deep learning models
The data is a subset of Frames Labeled in Cinema (FLIC). The training images and their annotations (train_joints_coords.csv and action_joints.csv) cover nine body landmarks: 'left shoulder', 'left elbow', 'left wrist', 'right shoulder', 'right elbow', 'right wrist', 'left eye', 'right eye' and 'nose'. The datasets are available under '/Action_dataset' and '/Pose_dataset'.
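The snippet below is a minimal, hypothetical loading sketch; the exact column layout of the CSV files and their locations inside the dataset folders are assumptions.

```python
# Hypothetical loading sketch; the exact CSV column layout is an assumption
# and may differ from the actual annotation format.
import numpy as np
import pandas as pd

LANDMARKS = ['left shoulder', 'left elbow', 'left wrist',
             'right shoulder', 'right elbow', 'right wrist',
             'left eye', 'right eye', 'nose']

# Assumed layout: one row per training image, an image identifier followed by
# an (x, y) pair per landmark.
joints = pd.read_csv('Pose_dataset/train_joints_coords.csv')
coords = joints.iloc[:, 1:].to_numpy(dtype=np.float32)
coords = coords.reshape(-1, len(LANDMARKS), 2)   # (num_images, 9, 2)

# Action labels paired with joint coordinates for the recognition task
actions = pd.read_csv('Action_dataset/action_joints.csv')
```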
Pose estimation is implemented as a CNN regression model using transfer learning: the pre-trained VGG16 network serves as the convolutional base and a custom top model predicts the joint coordinates. Training takes place in two stages. The model achieved an R² score of 0.92 on the test data, which is accurate enough for the downstream action recognition task.
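Below is a minimal sketch of this transfer-learning setup, assuming Keras/TensorFlow, 224x224 RGB inputs and nine landmarks; the layer sizes, learning rates and the exact split of the two training stages are assumptions rather than the settings actually used.

```python
# Sketch of VGG16-based transfer learning for joint-coordinate regression.
from tensorflow.keras.applications import VGG16
from tensorflow.keras import layers, models, optimizers

NUM_LANDMARKS = 9

base = VGG16(weights='imagenet', include_top=False, input_shape=(224, 224, 3))

model = models.Sequential([
    base,
    layers.Flatten(),
    layers.Dense(256, activation='relu'),
    layers.Dropout(0.5),
    layers.Dense(NUM_LANDMARKS * 2)          # (x, y) for each landmark
])

# Stage 1: train only the custom top with the convolutional base frozen
base.trainable = False
model.compile(optimizer=optimizers.Adam(1e-3), loss='mse')
# model.fit(x_train, y_train, validation_data=(x_val, y_val), epochs=10)

# Stage 2: unfreeze the last convolutional block and fine-tune at a low rate
base.trainable = True
for layer in base.layers[:-4]:
    layer.trainable = False
model.compile(optimizer=optimizers.Adam(1e-5), loss='mse')
# model.fit(x_train, y_train, validation_data=(x_val, y_val), epochs=10)
# model.save('pose_estimation_model.h5')
```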
I created two action recognition models:
- xy_actions_model.h5: trains on the (X, Y) coordinates of the joints.
  This neural network uses the x and y coordinates directly as features and the action label as the target. It reached a validation accuracy of 100%. However, since people can appear at any scale in a real scene, raw coordinates are not a robust feature representation, so I developed the model below.
- dist_actions_model.h5: trains on the Euclidean distances between joint coordinates.
  For this model I extracted the distance between every pair of joints and normalized these distance features before training the action recognition model (both feature representations are sketched after this list). It also reached a validation accuracy of 100%, but it was not stable, i.e. it occasionally dropped to 50% accuracy.
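The following sketch illustrates the two feature representations, assuming NumPy and Keras; the classifier architecture, the number of action classes and the normalization scheme are assumptions, not the exact ones stored in the saved models.

```python
# Sketch of the two pose-feature representations and a small dense classifier.
import numpy as np
from tensorflow.keras import layers, models

NUM_LANDMARKS = 9
NUM_ACTIONS = 2  # assumption: set to the actual number of action classes

def xy_features(coords):
    """Flatten the raw (x, y) joint coordinates (xy_actions_model.h5)."""
    return coords.reshape(len(coords), -1)                       # (N, 18)

def distance_features(coords):
    """Pairwise Euclidean distances between joints, normalized per sample
    (dist_actions_model.h5)."""
    diffs = coords[:, :, None, :] - coords[:, None, :, :]        # (N, 9, 9, 2)
    dists = np.linalg.norm(diffs, axis=-1)                       # (N, 9, 9)
    iu = np.triu_indices(NUM_LANDMARKS, k=1)                     # upper triangle
    feats = dists[:, iu[0], iu[1]]                               # (N, 36)
    return feats / (feats.max(axis=1, keepdims=True) + 1e-8)     # scale-invariant

def build_classifier(input_dim):
    """Small dense network mapping pose features to action probabilities."""
    model = models.Sequential([
        layers.Input(shape=(input_dim,)),
        layers.Dense(64, activation='relu'),
        layers.Dense(32, activation='relu'),
        layers.Dense(NUM_ACTIONS, activation='softmax'),
    ])
    model.compile(optimizer='adam', loss='categorical_crossentropy',
                  metrics=['accuracy'])
    return model

# coords has shape (num_samples, 9, 2)
# xy_model   = build_classifier(NUM_LANDMARKS * 2)   # -> xy_actions_model.h5
# dist_model = build_classifier(36)                  # -> dist_actions_model.h5
```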
Therefore, for action recognition on video, I used the first model (xy_actions_model.h5).
'pose_estimation_model.h5' estimates the pose fairly accurately, and 'xy_actions_model.h5' correctly recognizes the action as 'Namaste'.
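A minimal sketch of how the two models could be chained for video inference is shown below, assuming OpenCV and Keras; the input size, label names and file paths are assumptions.

```python
# Sketch: run pose estimation per frame, then classify the action from the
# predicted joint coordinates.
import cv2
import numpy as np
from tensorflow.keras.models import load_model

pose_model = load_model('pose_estimation_model.h5')
action_model = load_model('xy_actions_model.h5')
ACTIONS = ['namaste', 'other']   # assumption: actual label order may differ

cap = cv2.VideoCapture('input_video.mp4')  # hypothetical input path
while True:
    ok, frame = cap.read()
    if not ok:
        break
    # Pose estimation: predict the nine (x, y) joint coordinates for the frame
    img = cv2.resize(frame, (224, 224)).astype(np.float32) / 255.0
    joints = pose_model.predict(img[None, ...])        # shape (1, 18)
    # Action recognition on the raw coordinates (xy_actions_model.h5)
    probs = action_model.predict(joints)               # shape (1, NUM_ACTIONS)
    label = ACTIONS[int(np.argmax(probs))]
    cv2.putText(frame, label, (20, 40), cv2.FONT_HERSHEY_SIMPLEX,
                1.0, (0, 255, 0), 2)
    cv2.imshow('action', frame)
    if cv2.waitKey(1) & 0xFF == ord('q'):
        break
cap.release()
cv2.destroyAllWindows()
```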