The Menpo benchmark for multi-pose 2D and 3D facial landmark localisation and tracking

Deng, Jiankang, Roussos, Anastasios, Chrysos, Grigorios, Ververas, Evangelos, Kotsia, Irene ORCID logoORCID: https://orcid.org/0000-0002-3716-010X, Shen, Jie and Zafeiriou, Stefanos (2019) The Menpo benchmark for multi-pose 2D and 3D facial landmark localisation and tracking. International Journal of Computer Vision, 127 . pp. 599-624. ISSN 0920-5691 [Article] (doi:10.1007/s11263-018-1134-y)

[img]
Preview
PDF - Published version (with publisher's formatting)
Available under License Creative Commons Attribution 4.0.

Download (5MB) | Preview

Abstract

In this article, we present the Menpo 2D and Menpo 3D benchmarks, two new datasets for multi-pose 2D and 3D facial landmark localisation and tracking. In contrast to the previous benchmarks such as 300W and 300VW, the proposed benchmarks contain facial images in both semi-frontal and profile pose. We introduce an elaborate semi-automatic methodology for providing high-quality annotations for both the Menpo 2D and Menpo 3D benchmarks. In Menpo 2D benchmark, different visible landmark configurations are designed for semi-frontal and profile faces, thus making the 2D face alignment full-pose. In Menpo 3D benchmark, a united landmark configuration is designed for both semi-frontal and profile faces based on the correspondence with a 3D face model, thus making face alignment not only full-pose but also corresponding to the real-world 3D space. Based on the considerable number of annotated images, we organised Menpo 2D Challenge and Menpo 3D Challenge for face alignment under large pose variations in conjunction with CVPR 2017 and ICCV 2017, respectively. The results of these challenges demonstrate that recent deep learning architectures, when trained with the abundant data, lead to excellent results. We also provide a very simple, yet effective solution, named Cascade Multi-view Hourglass Model, to 2D and 3D face alignment. In our method, we take advantage of all 2D and 3D facial landmark annotations in a joint way. We not only capitalise on the correspondences between the semi-frontal and profile 2D facial landmarks but also employ joint supervision from both 2D and 3D facial landmarks. Finally, we discuss future directions on the topic of face alignment.

Item Type: Article
Research Areas: A. > School of Science and Technology > Computer Science
Item ID: 25830
Notes on copyright: © The Author(s) 2018.
Open Access: This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
Useful Links:
Depositing User: Irene Kotsia
Date Deposited: 12 Dec 2018 12:26
Last Modified: 29 Nov 2022 19:05
URI: https://eprints.mdx.ac.uk/id/eprint/25830

Actions (login required)

View Item View Item

Statistics

Activity Overview
6 month trend
271Downloads
6 month trend
486Hits

Additional statistics are available via IRStats2.