

Frames of Reference Transformations in a Mimickry Task
Introduction
In this work, we aim at exploring the mechanisms underlying simple forms of imitation such as mimicry. Specifically, we focus on the problem of how to map an allocentric representation of motions performed by others onto an egocentric representation of self-generated motions. Indeed, this process needs the visual representation of others, gathered in retino-centered coordinates, to transferred into an ego-centric frame of reference.
|
Illustration of the need of an other-centered representation of movements. Each observer in green imitates the demonstrator in red. On the left, a viewer-centered representation leads to a bad reproduction. However, on the right, a common demonstrator-centered frame of reference allows each demonstrator to perfectly reproduce the observed movement. |
The frame of reference problem |
|
Biological motivations First, the discovery in the monkey brain of the mirror neurons system (MNS) has suggested the existence of a direct-mapping mechanism between visual and motor systems. This link between action observation with self motor execution underlies a strong need for a common visuomotor representation, a common frame of reference (FR). Moreover, we know from neurophysiology, that along the ventral visual pathway, the information flows from the primary visual cortex to the superior temporal sulcus (STS). This region contains populations of neurons that separately exhibit sensitivity to a variety of body parts, and also to their locations, sizes and orientations relative to a viewer, object or goal-centered FR. Therefore, being indirectly connected to the MNS, STS appears clearly to be a candidate where this viewer to body-centered transformation occurs. The model presented here, proposes a biologically plausible mechanism of how such transformation might be performed in this brain region. |
|
Robotic implementation We implemented this network in a humanoid robot. The visual inputs are given by a color-based stereo vision system that allows the simultaneous 3D tracking of a human demonstrator's hand, body and principal axes. These information are fed into the neural network that computes the target location in the demonstrator's body centered reference frame. It is then directly mapped to the robot egocentric frame of reference, so that it can immediately imitate the human's hand trajectory using a classical inverse kinematic algorithm. As illustrated here on the right, the robot is able to mimic gestures shown by a human demonstrator that is not always perfectly facing the cameras. |
![]() |
Selected publications
People involved in this project