Impact of Personalized Avatars and Motion Synchrony on Embodiment and Users’ Subjective Experience: Empirical Study


Introduction

Recent developments in technology have made humans citizens of the virtual world (widely known as the “metaverse”). People use virtual reality (VR) devices, including head-mounted displays (HMDs), to wander around the digital world. Additionally, people can inhabit an entity such as a virtual avatar in VR to go beyond the limits of their physical body. However, it is difficult for people to feel that they are the owner of this newly created body. This feeling of owning the body is called embodiment []. Embodiment consists of 3 components: body ownership (the feeling that the body undergoing an experience belongs to oneself []), agency (the feeling that “I” am the cause of the body’s motion []), and self-location (the feeling that “my view is located at the place where it should be” []).

Previous studies have found that it is possible to form a sense of embodiment toward external objects such as a rubber hand [], a virtual arm [], and a virtual body []. To induce this illusory feeling of embodiment, several studies [,-] have used a motion-capture device to measure body movement and an HMD to observe a virtual avatar’s actions from a first-person perspective, synchronizing the human’s movement with that of the virtual avatar. These studies showed that motion synchrony increased the degree of agency and that people felt ownership of the virtual body used in the experiments. In addition, increased agency can make people feel greater presence with less simulator sickness in VR []. Various studies have used the motion synchronization method to modulate size perception [] and emotion [], as well as to reduce racial bias [].

Other studies have developed methods of enhancing embodiment by increasing the appearance similarity between the real body and the virtual body [-]. For instance, Kim et al [] demonstrated a body size–matching method using 4 measurements: height, shoulder width, belly width, and hip width. Likewise, Gorisse et al [] attempted to increase appearance similarity by attaching a 3D-scanned head to a virtual body, and Waltemate et al [] presented a “personalized avatar” by attaching a 3D-scanned head to a participant-sized body. These studies found that the more the avatar’s body resembled the participant’s, the more ownership people felt of the virtual avatar’s body [-]. This increase in ownership resulted in higher presence [,] and decreased simulator sickness [], which can enrich people’s virtual experience. In line with this, Jung et al [] showed that it is possible to measure one’s body-related perception by using a virtual body resembling the individual’s physical characteristics.

Although embodiment can be enhanced with motion synchrony (factor 1) and appearance similarity (factor 2), little research has examined the interaction between these 2 factors. Kim et al [] demonstrated, using a dot avatar and a body size–matched avatar, that the 2 factors affected body ownership independently. Moreover, they found that factors affecting the subcomponents of embodiment could boost each other, which can be beneficial to future applications of VR. As more personalized avatars can now be created using technologies such as 3D scanning [-], it is worth investigating whether this boosting effect between the 2 factors is maintained with more individualized avatars.

This study aimed to examine the effects of 2 factors (motion synchrony and appearance similarity) on embodiment and their secondary effects on the virtual experience. Our first hypothesis was that motion synchronization would increase the feelings of agency and body ownership toward the virtual body. Our second hypothesis was that appearance similarity would increase body ownership toward the virtual body; we also speculated that the 2 factors could interact. Our third hypothesis was that, in higher-embodiment conditions, participants would have a more positive subjective experience of VR, assessed as presence, simulator sickness, and emotion.


Methods

Participants

A power analysis using PASS power analysis and sample size software (version 2019; NCSS) was run to determine the sample size. In all, 24 participants (age: mean 23.29, SD 1.97 years; female: n=12, 50%) were recruited for the study. All participants provided written informed consent and were compensated with US $40 for their cooperation.

Ethics Approval

All experimental protocols were approved by the Hanyang University Institutional Review Board (HYUIRB-202107-014-1). Photographed individuals in the figures provided written informed consent to publish the case details.

Hardware Setup

To generate synchrony between the participant and the avatar, a motion-capture system (Motive, version 2.0.2; Natural Point) with 18 Flex 13 cameras (Natural Point) was used. Participants wore a full-body motion-capture suit with 37 reflective markers. To generate a 3D-scanned face, images of the participant’s face were captured using a mobile device (iPhone SE2; Apple Inc). With the captured images, the participant’s 3D face model was generated using the Metashape (Agisoft LLC) and Blender (Blender Institute) software. The virtual environment of the experiment was implemented using Unity software (version 2018.3.0f2; Unity Technologies). The software was run on a desktop PC (Windows 10 OS; Microsoft) with an Intel Core i7-6700 (Intel) CPU, 16GB RAM (Samsung), and an NVIDIA GeForce RTX 3070 (Nvidia) GPU.

Virtual Environment

Participants wore a motion-capture suit and an HMD (HTC Vive Pro Eye; HTC) to experience the virtual environment. The virtual environment was a small room, about 4 m (width) × 4 m (length) × 2.5 m (height), with a virtual mirror set in front of the participant’s location. A participant could observe the virtual avatar’s body either directly from a first-person perspective or by looking in the mirror. The virtual environment and experimental settings are presented in Figure 1.

Figure 1. Virtual environment (A) and experimental settings (B) of this study. (A) illustrates the virtual environment from the participant’s perspective. A virtual mirror was set in front of the participant’s location. (B) illustrates the experimental setting, where each participant wears a motion-capture suit with 37 reflective markers attached and a head-mounted display for observing their virtual avatar’s movement from a first-person perspective.

Procedures

Avatar Creation Phase

In this phase, a personalized avatar was created by 3D scanning the participant’s face. Before the scanning, each participant completed a demographic questionnaire. The experimenter then measured the participant’s height, shoulder width, belly width, and hip width. Next, the participant underwent face scanning: the experimenter recorded the participant’s face for approximately 1 minute while rotating around them to capture diverse angles. A total of 300 frame images (3840 × 2160 pixels resolution) of the participant’s face at different angles were selected. Subsequently, the images were processed with the Metashape program to build 3D point-cloud data. The point-cloud data were then converted into the 3D geometry and texture of each participant’s face, and the resulting 3D face model was merged with an existing virtual avatar body. Finally, the participant’s body sizes were applied to the virtual avatar’s body to produce a personalized avatar. Figure 2 shows the process of avatar creation.
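The image-to-mesh steps described above can be scripted. The following is a minimal sketch using Agisoft Metashape’s standalone Python API; the call names follow recent Metashape releases, and the file paths, parameter values, and export format are illustrative assumptions rather than the authors’ actual pipeline.

```python
import Metashape  # Agisoft Metashape standalone Python module

# Hypothetical input: ~300 4K frames of the participant's face.
photos = [f"frames/face_{i:03d}.jpg" for i in range(300)]

doc = Metashape.Document()
chunk = doc.addChunk()
chunk.addPhotos(photos)

# Detect and match features, then estimate camera poses.
chunk.matchPhotos(downscale=1)
chunk.alignCameras()

# Build per-image depth maps and fuse them into 3D geometry.
chunk.buildDepthMaps(downscale=2)
chunk.buildModel(source_data=Metashape.DepthMapsData)

# Unwrap UVs and bake a texture for the face mesh.
chunk.buildUV(mapping_mode=Metashape.GenericMapping)
chunk.buildTexture(blending_mode=Metashape.MosaicBlending, texture_size=4096)

# Export for merging with the avatar body (done in Blender in this study).
chunk.exportModel("face_model.obj")
doc.save("face_scan.psx")
```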

Figure 2. Procedure for avatar creation. (A) 300 facial images of diverse angles in 4K resolution are acquired through a mobile device. (B) Point-cloud data are extracted from the images and processed into a face model by the Metashape program. (C) Each participant’s face model is merged with an avatar body, and the avatar’s body size is adjusted to that of the participant.

Experimental Phase

This experiment had a 2 (motion: sync vs async) × 2 (appearance: personalized vs generic) within-subject design to examine the main effects of motion synchrony and appearance similarity and the interaction between them. In the motion “sync” condition, the virtual avatar’s body moved according to the participant’s movement, whereas in the motion “async” condition, the virtual avatar moved according to prerecorded movements, regardless of the participant’s movement. In the personalized appearance condition, a virtual avatar consisting of the participant’s 3D-scanned head and a size- and gender-matched body was used; in the generic appearance condition, only a gender-matched avatar was used. Figure 3 shows examples of the personalized and generic avatars used for a male participant.
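For the size-matched body, one straightforward approach, sketched below, is to scale the template avatar’s rig by the ratio of each of the 4 measured dimensions to the corresponding template dimension. The paper does not specify its exact scaling scheme, so the template values and region names here are hypothetical.

```python
from dataclasses import dataclass

@dataclass
class BodyMeasurements:
    height: float           # all values in meters
    shoulder_width: float
    belly_width: float
    hip_width: float

# Hypothetical dimensions of the generic template avatar.
TEMPLATE = BodyMeasurements(1.75, 0.45, 0.30, 0.35)

def scale_factors(person: BodyMeasurements) -> dict[str, float]:
    """Ratios applied to the avatar rig: a global vertical scale from
    height and local widthwise scales for the shoulder, belly, and hip
    regions (region names are illustrative)."""
    return {
        "height": person.height / TEMPLATE.height,
        "shoulder": person.shoulder_width / TEMPLATE.shoulder_width,
        "belly": person.belly_width / TEMPLATE.belly_width,
        "hip": person.hip_width / TEMPLATE.hip_width,
    }

# Example: a participant 1.68 m tall with a 0.42 m shoulder width.
print(scale_factors(BodyMeasurements(1.68, 0.42, 0.28, 0.36)))
```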

During the experimental phase, participants underwent 4 blocks of VR experiences (ie, sync-personalized, sync-generic, async-personalized, and async-generic); the order of the blocks was counterbalanced using the Latin-square method. Participants were asked to move freely and observe the virtual avatar for 5 minutes during each block. After each block, the participants completed the similarity, embodiment, and virtual experience questionnaires listed below. Between blocks, participants rested until they felt ready to start the next block. The entire procedure lasted approximately 90 minutes.
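For illustration, a balanced Latin square for the 4 blocks can be generated as follows (a sketch, not the authors’ code); each participant is assigned a row cyclically, so every condition appears in every serial position equally often across participants.

```python
CONDITIONS = ["sync-personalized", "sync-generic",
              "async-personalized", "async-generic"]

def balanced_latin_square(n):
    """Balanced Latin square for even n: each condition occupies every
    serial position once per column, and first-order carryover effects
    are balanced across rows (standard construction: first row
    0, 1, n-1, 2, n-2, ...; each later row adds r modulo n)."""
    first, lo, hi = [0], 1, n - 1
    while len(first) < n:
        first.append(lo)
        lo += 1
        if len(first) < n:
            first.append(hi)
            hi -= 1
    return [[(x + r) % n for x in first] for r in range(n)]

def block_order(participant_id):
    """Condition order for one participant; rows are reused cyclically."""
    square = balanced_latin_square(len(CONDITIONS))
    return [CONDITIONS[i] for i in square[participant_id % len(square)]]

# Example: block orders for the first 4 participants.
for pid in range(4):
    print(pid, block_order(pid))
```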

Figure 3. Example of a (A) generic avatar and (B) personalized avatar for the (C) real person.

Dependent Measures

Similarity Questionnaire

The Similarity Questionnaire (SQ) from a recent study [] was applied in this study. The SQ consists of four 7-point Likert scale questions assessing how similar participants felt each body part of the virtual avatar was to their own. The scale ranges from –3 (fully disagree) to +3 (fully agree); Table 1 shows the 4 questions in detail. The average score of the questions was used as the similarity score.

Table 1. List of items used in the Similarity Questionnaire^a.

Question   Statement
Face       I felt the face of virtual body was similar to mine.
Torso      I felt the torso of virtual body was similar to mine.
Arms       I felt the arm of virtual body was similar to mine.
Legs       I felt the leg of virtual body was similar to mine.

^a Scale ranges from –3 (fully disagree) to +3 (fully agree).

Embodiment Questionnaire

The Embodiment Questionnaire (EQ) from recent studies [,,,,] was modified and used in this study. The EQ consists of three 7-point Likert scale questions on agency, body ownership, and self-location, each ranging from –3 (fully disagree) to +3 (fully agree). The questions are detailed in Table 2.

Table 2. List of items used in the Embodiment Questionnaire^a.

Question         Statement
Agency           I felt that the movement of the virtual body was caused by my own movements.
Body ownership   I felt that the virtual body I saw when looking at myself in the mirror was my own body.
Self-location    I felt that I was inside the virtual body.

^a Scale ranges from –3 (fully disagree) to +3 (fully agree).

Virtual Experience Questionnaire

The Virtual Experience Questionnaire covered 3 aspects of the subjective virtual experience: presence, simulator sickness, and emotion. Presence was assessed with the Presence Questionnaire [], which consists of 21 questions using a 7-point Likert scale ranging from 1 (not at all) to 7 (completely). Simulator sickness was assessed with the Simulator Sickness Questionnaire [], which has 16 questions checking symptoms, each scored on a 4-point Likert scale ranging from 0 (not at all) to 3 (severe). Emotion was assessed with the Self-Assessment Manikin [], which consists of 2 subscales: arousal and valence. A visual representation of a manikin with varying levels of emotional expression was used to rate arousal, from 1 (extremely calm) to 9 (extremely excited), and valence, from 1 (extremely negative) to 9 (extremely positive).
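To make the scoring concrete, the sketch below summarizes how each dependent measure could be computed (hypothetical helpers, not the authors’ code). The summed Presence Questionnaire total and the raw Simulator Sickness Questionnaire sum are assumptions inferred from the score ranges reported in the Results; note that Kennedy et al’s weighted subscale scoring is the usual SSQ convention.

```python
import numpy as np

def score_sq(items):
    """Similarity Questionnaire: mean of the 4 items, each rated -3..+3."""
    assert len(items) == 4
    return float(np.mean(items))

def score_eq(agency, ownership, location):
    """Embodiment Questionnaire: the 3 items are analyzed separately."""
    return {"agency": agency, "body_ownership": ownership,
            "self_location": location}

def score_pq(items):
    """Presence Questionnaire: 21 items rated 1..7; a summed total
    (possible range 21-147) is assumed from the reported score range."""
    assert len(items) == 21
    return int(np.sum(items))

def score_ssq_raw(items):
    """Simulator Sickness Questionnaire: 16 symptoms rated 0..3. Raw sum
    shown here; Kennedy et al's weighted subscale scoring is the usual
    convention for the published total."""
    assert len(items) == 16
    return int(np.sum(items))

def score_sam(arousal, valence):
    """Self-Assessment Manikin: direct 1..9 ratings, no transformation."""
    return {"arousal": arousal, "valence": valence}
```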

Data Analysis

All data were analyzed using SPSS statistical software (version 27.0; IBM Corp). Normality was evaluated using skewness, kurtosis, and Kolmogorov-Smirnov tests. A 2 (sync vs async) × 2 (personalized vs generic) repeated measures ANOVA was conducted on all dependent measures to examine the main effects of motion synchrony and appearance similarity and the interaction between the 2 variables, followed by post hoc analyses. The threshold for statistical significance was set at P=.05.
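As an illustration, the same 2 × 2 repeated measures ANOVA can be reproduced outside SPSS, for example with statsmodels in Python. The sketch below runs it on synthetic long-format data standing in for the real agency scores (the effect sizes in the synthetic data are arbitrary assumptions).

```python
import numpy as np
import pandas as pd
from statsmodels.stats.anova import AnovaRM

# Synthetic long-format data: one row per participant x motion x
# appearance cell (24 participants x 4 conditions = 96 rows).
rng = np.random.default_rng(0)
rows = []
for pid in range(24):
    for motion in ("sync", "async"):
        for appearance in ("personalized", "generic"):
            base = 2.0 if motion == "sync" else -1.0  # arbitrary effect
            rows.append({"participant": pid, "motion": motion,
                         "appearance": appearance,
                         "agency": base + rng.normal(0, 1)})
df = pd.DataFrame(rows)

# 2 (motion) x 2 (appearance) repeated measures ANOVA on one measure.
res = AnovaRM(df, depvar="agency", subject="participant",
              within=["motion", "appearance"]).fit()
print(res)  # F and P values for both main effects and the interaction
```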


Results

Similarity

The average similarity score from the SQ did not show a significant main effect for motion (F1,23=1.448; P=.24; sync mean .14; async mean –.01), whereas a significant main effect was demonstrated for appearance (F1,23=32.920; P<.001; η2=.589; personalized mean .92; generic mean –.79). The interaction effect between the 2 factors was not significant (F1,23=1.178; P=.29).

Embodiment

The embodiment results are presented in Table 3.

Table 3. Embodiment scores of agency, body ownership, and self-location.

Embodiment, appearance             Sync, mean (SE)   Async, mean (SE)   P value
Agency
  Personalized                     2.38 (.13)        –.79 (.42)         <.001
  Generic                          2.25 (.22)        –1.75 (.36)        <.001
  Sync (personalized vs generic)   N/A^a             N/A                .59
  Async (personalized vs generic)  N/A               N/A                .02
Body ownership
  Personalized                     1.54 (.20)        .00 (.41)          <.001
  Generic                          .96 (.33)         –1.08 (.42)        .002
  Sync (personalized vs generic)   N/A               N/A                .08
  Async (personalized vs generic)  N/A               N/A                .01
Self-location
  Personalized                     1.58 (.26)        –.29 (.42)         <.001
  Generic                          1.25 (.34)        –.29 (.42)         <.001
  Sync (personalized vs generic)   N/A               N/A                .22
  Async (personalized vs generic)  N/A               N/A                >.99

^a N/A: not applicable.

Agency

The agency score displayed significant main effects for motion (F1,23=91.653; P<.001; η2=.799) and appearance (F1,23=4.442; P=.046; η2=.162), as well as a significant interaction between the 2 factors (F1,23=5.867; P=.02; η2=.203). Post hoc analysis revealed that participants reported higher agency in the sync condition than in the async condition, regardless of appearance (personalized-sync vs personalized-async: P<.001; generic-sync vs generic-async: P<.001). Moreover, although the virtual avatar moved regardless of the participant’s body motion in the async condition, participants felt higher agency toward the personalized avatar’s movement than toward the generic avatar’s (P=.02).

Body Ownership

The body ownership score exhibited a significant main effect for motion (F1,23=22.876; P<.001; η2=.499) and appearance (F1,23=10.047; P=.004; η2=.312). The interaction effect between the 2 factors was not significant (F1,23=1.078; P=.31).

Self-location

The self-location score showed a significant main effect for motion (F1,23=31.306; P<.001; η2=.576); however, no main effect for appearance was found (F1,23=.416; P=.53). There was no significant interaction effect between the 2 factors (F1,23=.358; P=.56).

Virtual Experience

The virtual experience results from the Virtual Experience Questionnaire are presented in Table 4.

Table 4. Virtual experience scores of presence, simulator sickness, and emotion (arousal and valence).

Virtual experience, appearance     Sync, mean (SE)   Async, mean (SE)   P value
Presence
  Personalized                     74.21 (2.42)      55.71 (3.68)       <.001
  Generic                          74.71 (3.01)      54.92 (4.00)       <.001
  Sync (personalized vs generic)   N/A^a             N/A                .77
  Async (personalized vs generic)  N/A               N/A                .71
Simulator sickness
  Personalized                     23.53 (5.62)      24.00 (5.46)       .86
  Generic                          25.09 (6.18)      27.12 (6.29)       .52
  Sync (personalized vs generic)   N/A               N/A                .58
  Async (personalized vs generic)  N/A               N/A                .43
Arousal
  Personalized                     4.13 (.39)        4.08 (.40)         .86
  Generic                          3.96 (.41)        3.96 (.33)         >.99
  Sync (personalized vs generic)   N/A               N/A                .57
  Async (personalized vs generic)  N/A               N/A                .61
Valence
  Personalized                     6.38 (.27)        5.63 (.37)         .004
  Generic                          5.75 (.31)        5.54 (.29)         .41
  Sync (personalized vs generic)   N/A               N/A                .07
  Async (personalized vs generic)  N/A               N/A                .81

^a N/A: not applicable.

Presence

The presence score showed a significant main effect for motion (F1,23=40.126; P<.001; η2=.636) but not for appearance (F1,23=.009; P=.93). Furthermore, the interaction effect between the 2 factors was not significant (F1,23=.333; P=.57).

Simulator Sickness

Neither the main effect of motion (F1,23=.247; P=.62) nor the main effect of appearance (F1,23=.608; P=.44) was significant. Additionally, the interaction effect between the 2 factors was not significant (F1,23=.269; P=.61).

Emotion

Arousal did not indicate a significant main effect for motion (F1,23=.023; P=.88) or appearance (F1,23=.495; P=.49). There was no significant interaction effect between the 2 factors (F1,23=.016; P=.90).

Valence showed a significant main effect for motion (F1,23=6.111; P=.02; η2=.210) and no main effect for appearance (F1,23=1.325; P=.26). There was no significant interaction effect between the 2 factors (F1,23=3.524; P=.07).


Discussion

Principal Findings

This study investigated the effects of motion synchronization and appearance similarity on participants’ embodiment, perceived similarity, and subjective experience in VR. The results showed that participants experienced a higher level of agency and body ownership when the avatar’s body motion was synchronized with theirs and its appearance was similar to theirs. Notably, when the avatar’s appearance was personalized, participants reported that its body motion was driven by them even when the avatar moved independently, suggesting that a personalized appearance can create illusory agency over avatar movements that the participant did not perform. Furthermore, our results indicated that motion synchronization contributed to higher presence and induced more positive emotion.

Two novel findings on embodiment emerge from the results of this study. First, there was a statistically significant interaction between motion and appearance on agency (a significant difference between the async-personalized and async-generic conditions). Prior studies have reported that synchronizing motion [,-] contributes to forming a higher level of agency. The results from this study support such findings, showing that visual-motion synchronization, which satisfies proprioception [], forms the feeling that “I am the only cause of the avatar’s body motion” []. In addition, this study demonstrated that agency was associated with body motion that was not caused by the participant; that is, the body motion could be felt as the participant’s own when the virtual avatar resembled the participant’s appearance. This kind of self-attribution of behavior has been shown in several previous studies. For example, Aymerich-Franch et al [] reported that participants felt shame and guilt for a humanoid robot’s misbehavior, which they did not perform. Likewise, Jun et al [] reported that participants’ emotions changed to happy, neutral, or sad according to a virtual avatar’s emotional state. This study extends these findings by showing that body movement can be attributed to oneself with a personalized avatar. In the motion sync condition, however, the agency difference between the personalized and generic avatars was not significant. We speculate that this reflects a ceiling effect caused by the high agency scores (average 2.31 out of 3). The current result can be applied in cognitive behavioral therapy, such as behavior modeling, which involves observing others to learn desirable behaviors: a personalized avatar, which facilitates more agency toward the avatar’s actions, may be more effective than a generic avatar in behavior modeling.

Second, it was evident that the 2 factors (ie, motion and appearance) affected body ownership independently. Prior studies have found that synchronizing motion [] and personalizing avatars [,-] contribute to forming a higher level of body ownership. The current findings support these studies by showing that the integration of motion synchronization [] and higher-fidelity visual appearance [] forms the feeling that the body is one’s own. Prior studies have revealed that visual fidelity increases in the order of point-light avatar [], robot avatar [], human avatar [], size-matched avatar [], and 3D-scanned avatar []. This study extends these findings by revealing that body ownership can be further increased with an avatar that has a 3D-scanned face and a size-matched body.

In addition, it was evident that the subjective virtual experience factors (ie, presence, simulator sickness, and emotion) were affected by motion synchrony. This result supports previous studies that reported higher presence [,] and more positive emotion [,] with motion synchronization. Furthermore, a moderate interaction between appearance similarity (personalized vs generic) and motion synchrony (sync vs async) in emotional valence (a significant difference between sync and async for the personalized avatar but not for the generic avatar; Table 4) showed that motion synchronization with a personalized avatar could induce more positive emotions than with a generic avatar. In contrast, the effects on arousal and simulator sickness were not significant. Low arousal scores across the conditions suggest that participants were in a calm state during the experimental tasks. We speculate that low arousal might have affected simulator sickness, resulting in a floor effect across conditions.

In addition to the main findings addressed above, it is worth noting the methodology we used for generating personalized avatars. Many prior studies have attempted to create personalized avatars by 3D scanning a participant’s face and body [,,,]. However, these approaches require complex setups, such as a red-green-blue-depth camera [] or multiple time-synchronized red-green-blue cameras [,], which makes them challenging to use and inappropriate for general users. Inspired by Gorisse et al [] and Jung et al [], our study proposed a more convenient method for creating a personalized avatar using a single smartphone and body-size measurements. The results from the SQ showed that our method enhanced the feeling of appearance similarity between the person and the avatar. Thus, in future metaverse applications, users could easily generate personalized avatars with their smartphones before diving into VR.

This study has a few limitations. First, the participants were restricted to healthy young adults; future studies should recruit participants across diverse age groups and health statuses. Second, because the prerecorded animation in the async condition consisted of simple body movements that participants could easily perform, follow-up studies could use body motions that are “impossible” for the participant to perform to test whether the effect of appearance similarity on agency generalizes. Third, this study asked participants to move freely in the main task and measured arousal and valence as emotional variables. Although appearance similarity did not affect participants’ emotions, future studies should consider participants who have negative feelings toward their body, such as individuals with eating disorders []; attitudes toward one’s own body should be considered in future medical applications.

Conclusion

This study proposed a method for creating personalized avatars using a single smartphone camera and manipulation of the avatar’s body size. The results indicated that participants perceived the virtual avatar’s appearance as similar to their own and established a higher sense of embodiment toward a virtual avatar that matched their body motion and appearance. Moreover, we discovered that appearance similarity alone could induce a sense of agency toward an avatar’s movement. We hope that these findings contribute to fields such as physical activity promotion [], social cognition training [], and pain intervention [], suggesting that matching body motion and appearance can enhance the VR experience.

This work was supported by the National Research Foundation of Korea (NRF) and the Institute of Information & Communications Technology Planning & Evaluation (IITP) grants funded by the Korea government (Ministry of Science and ICT [Information and Communications Technology]; grants 2021R1A2C2013479 and 2021-0-00590, Decentralized High Performance Consensus for Large-Scale Blockchains). The funders were not involved in the design or conduct of the study; data collection, management, analysis, or interpretation; or the preparation, review, or approval of the manuscript.

None declared.

Edited by N Zary; submitted 07.06.22; peer-reviewed by S Han, C Nöthiger; comments to author 01.09.22; revised version received 07.09.22; accepted 11.10.22; published 08.11.22

©Myeongul Jung, Sangyong Sim, Jejoong Kim, Kwanguk Kim. Originally published in JMIR Serious Games (https://games.jmir.org), 08.11.2022.

This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Serious Games, is properly cited. The complete bibliographic information, a link to the original publication on https://games.jmir.org, as well as this copyright and license information must be included.
