Skip to content

Latest commit

 

History

History
132 lines (100 loc) · 5.89 KB

human-model-generation.md

File metadata and controls

132 lines (100 loc) · 5.89 KB

human_model_generation module

The human_model_generation module contains the PIFuGeneratorLearner class, which inherits from the abstract class Learner.

Class PIFuGeneratorLearner

Bases: engine.learners.Learner

The PIFuGeneratorLearner class is a wrapper of the PIFu [1] object detection algorithm based on the original PIFu implementation. It can be used to perform human model generation from a single image (inference). In addtion, the PIFuGeneratorLearner enables the 3D pose approximation of a generated human model as well as the generation of multi-view renderings of the human model.

The PIFuGeneratorLearner class has the following public methods:

PIFuGeneratorLearner constructor

PIFuGeneratorLearner(self, device, checkpoint_dir)

Constructor parameter explanation:

  • device: {'cuda', 'cpu'}, default='cpu'
    Specifies the device to be used.
  • checkpoint_dir: str, default='utilities/PIFu/checkpoints'
    Specifies a path to be used for loading the checkpoints for inference.

PIFuGeneratorLearner.infer

PIFuGeneratorLearner.infer(self, imgs_rgb, imgs_msk, obj_path, extract_pose)

This method generates a 3D human model from a single image. A future release will allow the use of multiple images as input. The joints of the 3D model in the 3D space can be optionally approximated. The method returns the 3D human model as an object of type simulation.human_model_generation.utilities.model_3D.Model_3D. The locations of the model's joints are also returned as a List, in case were computed.

Parameters:

  • imgs_rgb: list, default=None
    List of images of type engine.data.Image. These images will be used as input. At the current release, the list's length must be 1.
  • imgs_msk: list, default=None
    List of images of type engine.data.Image. These images will be used as masks, depicting the silhouette of the portrayed human. At the current release, the list's length must be 1.
  • obj_path: str, default=None
    Specifies a path for saving the generated 3D human model in OBJ format.
  • extract_pose: bool, default=False
    Specifies whether the joints of the 3D model in the 3D space will be approximated or not.

PIFuGeneratorLearner.load

PIFuGeneratorLearner.load(self, path)

This method loads a pretrained model.

Parameters:

  • path: str, default=None*
    Specifies the folder where the model will be loaded from.

PIFuGeneratorLearner.download

PIFuGeneratorLearner.download(self, path, url)

This method downloads the needed pretrained models.

Parameters:

  • path: str, default=None
    Specifies the folder where data will be downloaded.
  • url: str, default=OpenDR FTP URL
    URL of the FTP server.

ROS Node

A ROS client node and a ROS service node are available for performing inference on an image stream. Documentation on how to use this node can be found here.

Tutorials and Demos

A demo in the form of a Jupyter Notebook is available here.

Example

  • Generation of a 3D human model from a single image using the PIFuGeneratorLearner.

    This example shows how to perform inference on an RGB image, using along an image of the silhouette of the depicted human, and generate a 3D human model.

    import sys
    import os
    from opendr.engine.data import Image
    from opendr.simulation.human_model_generation import PIFuGeneratorLearner
    import matplotlib.pyplot as plt
    import numpy as np
    OPENDR_HOME = os.environ["OPENDR_HOME"]
    
    # We load a full-body image of a human as well as an image depicting its corresponding silhouette. 
    rgb_img = Image.open(os.path.join(OPENDR_HOME, 'projects/python/simulation/human_model_generation/demos', 'imgs_input/rgb/result_0004.jpg'))
    msk_img = Image.open(os.path.join(OPENDR_HOME, 'projects/python/simulation/human_model_generation/demos', 'imgs_input/msk/result_0004.jpg'))
    
    # We initialize learner. Using the infer method, we generate human 3D model. 
    model_generator = PIFuGeneratorLearner(device='cuda', checkpoint_dir='./temp')
    model_3D = model_generator.infer(imgs_rgb=[rgb_img], imgs_msk=[msk_img], extract_pose=False)

Performance Evaluation

TABLE-1: OpenDR 3D human model generation speed evaluation.

Method CPU i7-9700K (ms) RTX 2070 (ms)
Human Model Generation only 488.2 212.3
Human Model Generation + 3D pose approximation 679.8 531.6

TABLE-2: 3D Human Model Generation platform compatibility evaluation.

Platform Test results
x86 - Ubuntu 20.04 (bare installation - CPU) Pass
x86 - Ubuntu 20.04 (bare installation - GPU) Pass
x86 - Ubuntu 20.04 (pip installation) Pass
x86 - Ubuntu 20.04 (CPU docker) Pass*
x86 - Ubuntu 20.04 (GPU docker) Pass*
NVIDIA Jetson TX2 Not tested
NVIDIA Jetson Xavier NX Not tested

*On docker installation, the skeleton approximation of the 3D human models is not available.

References

[1] PIFu: Pixel-Aligned Implicit Function for High-Resolution Clothed Human Digitization, arXiv.