CONTENT4ALL, at the end of the project, produced an innovative technical system that allows a virtual human (originated and animated by a professional sign language interpreter) to present broadcasted television content as a natural sign language interpreter via HbbTV or WebApp. In addition to that, a proof-of-concept of automatic sign-language generation via the virtual human is created for a limited domain of news, for example, sport or programs reporting about COVID.
To deploy such an innovative system, CONTENT4ALL makes use of Hardware and Software tools to capture a human signer in real-time in a studio which can be deployed in many locations even at home, thus called “Remote Studio”. This is cost-effective compared to a regular broadcast studio and is based on the state-of-the-art technologies developed to photo-realistically reproduce posture, gesture and facial expressions of the human sign interpreter via a 3D photorealistic virtual human. The model of a human is first recorded in a dedicated volumetric studio and then associated to a set of mathematical algorithms which allows the virtual human to exactly reproduce the real one. The creation and animation of such virtual human constitutes an advancement in the state of the art: first, the incorporation into a single human model of different elements at a high level of details e.g. hands at a precision of fingers or facial expressions including cheeks and forehead, then the challenge of animating such a model in real-time in the most precise way and without markers of obstructive elements. The generated stream of the virtual human is combined with the original broadcaster one and can be transmitted to watchers as a separate stream, to be watched on-demand on HbbTV 1.5 and 2.x compliant devices.
Another goal reached by the project is the creation of a repository of annotated data based on broadcast-quality video which is intended to be open to other research groups and within CONTENT4ALL will be used for training Artificial Intelligence models for experimenting with automatic translation into sign language and rendered with the improved 3D virtual human. The design of these algorithms, their implementation and testing constitute an advanced in the state-of-the-art, as they are among the first ones published in the scientific literature.
Finally, in the medium to long term, automated sign interpretation technologies are explored and tested. If this goal can be fulfilled, the sign language-speaking community in Europe will overcome the current state of media poverty closing that gap and explores avenues to enable a better society with equal opportunities.