Automatic generation of a 3D sign language avatar on AR glasses given 2D videos of human signers

Lan Thao Nguyen, Florian Schicktanz, Aeneas Stankowski, Eleftherios Avramidis


Abstract
In this paper we present a prototype implementation of a pipeline that automatically generates a German Sign Language avatar from 2D video material. The presentation is accompanied by the source code. We use computer vision models to record human pose movements during signing. The joint coordinates of the hands and arms are imported as landmarks to control the skeleton of our avatar. Because the landmarks are anatomically independent, we build a second skeleton from them, based on the avatar's skeletal bone architecture, and use it to calculate the bone rotation data. This data is then used to control our human 3D avatar. The avatar is displayed on AR glasses and can be placed virtually in the room so that it can be perceived at the same time as the person speaking. In future work, we aim to enhance it with speech recognition and machine translation methods so that it can serve as a sign language interpreter. The prototype was shown to members of the deaf and hard-of-hearing community to assess its comprehensibility. Problems emerged with the transferred hand rotations: hand gestures were hard to recognize on the avatar due to deformations such as twisted finger meshes.
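The step from landmarks to bone rotations can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes that each bone has a known rest direction and that its rotation is the shortest-arc rotation aligning that rest direction with the vector between two tracked landmarks. The function names (`bone_rotation`, `rotate`) and the example coordinates are hypothetical.

```python
import math

def normalize(v):
    """Return v scaled to unit length."""
    n = math.sqrt(sum(c * c for c in v))
    if n == 0.0:
        raise ValueError("zero-length vector")
    return tuple(c / n for c in v)

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def bone_rotation(rest_dir, parent, child):
    """Shortest-arc unit quaternion (w, x, y, z) that rotates the bone's
    rest direction onto the direction from the parent landmark to the
    child landmark. Hypothetical helper, not taken from the paper."""
    u = normalize(rest_dir)
    v = normalize(tuple(c - p for p, c in zip(parent, child)))
    d = dot(u, v)
    if d < -1.0 + 1e-9:
        # Opposite directions: rotate 180 degrees about any axis
        # perpendicular to u.
        helper = (1.0, 0.0, 0.0) if abs(u[0]) < 0.9 else (0.0, 1.0, 0.0)
        axis = normalize(cross(u, helper))
        return (0.0, axis[0], axis[1], axis[2])
    axis = cross(u, v)
    q = (1.0 + d, axis[0], axis[1], axis[2])
    n = math.sqrt(sum(c * c for c in q))
    return tuple(c / n for c in q)

def rotate(q, v):
    """Apply unit quaternion q = (w, x, y, z) to vector v."""
    w, x, y, z = q
    r = (x, y, z)
    t = tuple(2.0 * c for c in cross(r, v))
    rt = cross(r, t)
    return tuple(v[i] + w * t[i] + rt[i] for i in range(3))

# Example: a bone resting along +Y, with landmarks placing it along +X.
q = bone_rotation((0.0, 1.0, 0.0), (0.0, 0.0, 0.0), (1.0, 0.0, 0.0))
aligned = rotate(q, (0.0, 1.0, 0.0))
```

A shortest-arc rotation fixes only the direction of a bone and leaves the twist around the bone's own axis undetermined, so a complete solution would need additional landmarks or constraints to recover that twist.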
Anthology ID:
2021.mtsummit-at4ssl.8
Volume:
Proceedings of the 1st International Workshop on Automatic Translation for Signed and Spoken Languages (AT4SSL)
Month:
August
Year:
2021
Address:
Virtual
Editor:
Dimitar Shterionov
Venue:
MTSummit
Publisher:
Association for Machine Translation in the Americas
Pages:
71–81
URL:
https://aclanthology.org/2021.mtsummit-at4ssl.8
Cite (ACL):
Lan Thao Nguyen, Florian Schicktanz, Aeneas Stankowski, and Eleftherios Avramidis. 2021. Automatic generation of a 3D sign language avatar on AR glasses given 2D videos of human signers. In Proceedings of the 1st International Workshop on Automatic Translation for Signed and Spoken Languages (AT4SSL), pages 71–81, Virtual. Association for Machine Translation in the Americas.
Cite (Informal):
Automatic generation of a 3D sign language avatar on AR glasses given 2D videos of human signers (Nguyen et al., MTSummit 2021)
PDF:
https://preview.aclanthology.org/emnlp-22-attachments/2021.mtsummit-at4ssl.8.pdf