VMD-Lifting

VMD-Lifting is a fork of 'Lifting from the Deep' that outputs estimated 3D pose data to a VMD file

The authers of 'Lifting from the Deep' are Denis Tome', Chris Russell and Lourdes Agapito. Please refer 'README-original.md' and http://visual.cs.ucl.ac.uk/pubs/liftingFromTheDeep/ for more information about the original 'Lifting from the Deep'.

This project is licensed under the terms of the GNU GPLv3 license. By using the software, you are agreeing to the terms of the license agreement (see LICENSE file).

Note: 'shape_predictor_68_face_landmarks.dat', the default trained model for head pose estimation, was trained on the iBUG 300-W face landmark dataset. And the license for the iBUG 300-W dataset excludes commercial use. So you should contact Imperial College London to find out if it's OK for you to use this model file in a commercial product.

概要

写真から人のポーズを推定し、VMDフォーマットのモーション(ポーズ)データを出力するプログラムです。ポーズ推定には Lifting from the Deep (https://github.com/DenisTome/Lifting-from-the-Deep-release) のプログラムを使用しています。

必要なもの

python (3.x or 2.7)
Tensorflow
OpenCV
python-tk (Tkinter)
PyQt5
dlib

Ubuntu や Debian GNU/Linux の環境では、rootになって下記のコマンドを実行すると必要なものが揃います。

# apt-get install python-pip
# pip install tensorflow-gpu
# apt-get install python-opencv
# apt-get install python-tk
# apt-get install python-pyqt5
# pip install dlib

Windowsの場合は次の手順で必要なものをインストールします。

cygwin をインストール: https://cygwin.com/install.html
https://www.tensorflow.org/install/install_windows に従って、CUDA、cuDNN、Python 3.6 をインストール
cygwin の pythonでなく、上記でインストールしたpythonを使うように環境変数PATHを設定
tensorflowをpipでインストール

$ pip install tensorflow-gpu

OpenCVをインストール

$ pip install opencv-python

PyQt5をインストール

$ pip install PyQt5

dlibをインストール

$ pip install dlib

準備

まず setup.sh を実行します。このスクリプトは必要なデータを取得し、外部ユーティリティをインストールします。
(次に、Lifting from the Deep 本体の動作を確認したい場合は、application ディレクトリで demo.py を実行します。)
dlib + OpenCVによるHead Pose Estimation(頭部姿勢推定)を行う場合は、http://dlib.net/files/shape_predictor_68_face_landmarks.dat.bz2 をダウンロードし、展開してできたファイル shape_predictor_68_face_landmarks.dat を applications/predictor/ に置きます。

$ mkdir applications/predictor
$ cd applications/predictor
$ wget http://dlib.net/files/shape_predictor_68_face_landmarks.dat.bz2
$ bunzip2 shape_predictor_68_face_landmarks.dat.bz2

使用方法

cd application

./vmdlifting.py IMAGE_FILE VMD_FILE [POSITION_FILE]

IMAGE_FILE: 入力元画像ファイル(JPEG, PNGなど)
VMD_FILE: 出力先VMDファイル
POSITION_FILE(オプション): 関節の位置を出力するテキストファイル。デバッグ用。

使用例:

./vmdlifting.py photo.jpg estimated.vmd

Lifting from the Deep について

Lifting from the Deep 畳み込みニューラルネットワーク(CNN)を用いて、単一のRGB画像から3Dのポーズ推定を行う手法(の論文)および、それを実装したプログラムです。著者は Denis Tome', Chris Russell and Lourdes Agapito です。詳しくはプロジェクトのWebページ( http://visual.cs.ucl.ac.uk/pubs/liftingFromTheDeep/ )の論文や動画を参照してください。

ライセンスについて

(はじめに英語で書いたとおり)GNU GPLv3 licenseです。詳しくはLICENSEファイルを読んでください。なお、顔の向きを推定するのに使う学習済みモデル shape_predictor_68_face_landmarks.dat は、学習に用いられた iBUG 300-W データセットが商用利用を許可されていません。もし商用利用する場合は Imperial College London に許諾を得るか、別の学習済みモデルを用意してください。

参考文献

D. Tome, C. Russell and L. Agapito. Lifting from the Deep: Convolutional 3D Pose Estimation from a Single Image. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README-errno-mmd.md

README-errno-mmd.md

VMD-Lifting

概要

必要なもの

準備

使用方法

Lifting from the Deep について

ライセンスについて

参考文献

Files

README-errno-mmd.md

Latest commit

History

README-errno-mmd.md

File metadata and controls

VMD-Lifting

概要

必要なもの

準備

使用方法

Lifting from the Deep について

ライセンスについて

参考文献