Attributes, Methods, and Frameworks Used to Evaluate Wearables and Their Companion mHealth Apps: Scoping Review

IntroductionBackground

Wearable technology, also known as wearable devices, includes smart electronic devices worn in close proximity to the surface of the human body. These devices can detect, analyze, and transmit information concerning body signals such as vital signs and physiological data, including step count and heartbeat [-]. Smart wearable technologies and their high-performance microsensors are of growing importance for patient health monitoring and are being widely accepted into clinical use and trials [-]. These technologies have the capability to amplify personal wellness and raise awareness in the spectrum of preventive health care. Consumers continue to rely on smart devices such as mobile phones and smartwatches to engage in healthy behavior [-]. They also assist in the self-management of chronic conditions, preventive measures, and aftercare, for example, diabetes monitoring [], rehabilitation [-], fall detection [,], wound healing [], and even monitoring symptoms of long-term illness [-]. Wearable technology further enhances the continuum of care within interdisciplinary communication and improves individuals’ health and well-being, all in their natural mobile environment [,].

Commercial Wearables Versus Medical Wearables

The growing demand for health care technology, particularly wearable devices, has led to the proliferation of various medical and smart health care wearables. However, there is ambiguity in distinguishing between commercial consumer wearable devices and wearable medical devices. The European Union regulations [] define medical devices as those intended for medical purposes such as disease diagnosis, monitoring, treatment, injury management, and physiological process modification; however, this scope does not include wearable technologies, such as smartwatches, smart bands, and mobile phone–based devices, designed primarily to provide users as tools for health monitoring and management. Fotiadis et al [] defined wearable medical devices as self-contained, noninvasive devices with specific medical functions. Although a clear definition of wearable medical wearables remains elusive, these devices serve as a convergence point for both conventional medical device manufacturers and consumer-oriented companies aiming to enter the profitable medical market. The traditional distinction between medical and consumer-grade devices relies on the primary intention; however, we found that many commercial devices are being used opportunistically in health care and clinical trials.

Despite the expanding scope of wearable devices, unresolved concerns persist among general consumers regarding the safety, security, and usability of these devices [-]. Therefore, ensuring the fit-for-use of these technologies for specific users in clinical settings must be ascertained. The assessment of the usability of these technologies is critical to the success and adoption of wearable and mobile technology or the combination thereof. The identification and consideration of the appropriate attributes and methods for the measurement of usability as early on in the product development process can increase productivity, reduce errors, reduce user training and user support, and improve efficacy, thereby further broadening the acceptance of wearable and mobile technology by users [-].

Definition of Usability

In the literature, the definition of usability varies, with some studies equating it to assess a device’s functionality, whereas others focus on aspects such as feasibility or performance. This ambiguity highlights the need for a comprehensive approach in measuring usability, considering users, devices, environment, and the actions users perform. The International Organization for Standardization (ISO) 9241 clarifies usability as “the extent to which a system, product or service can be used by specified users to achieve specified goals with effectiveness, efficiency, and satisfaction in a specified context of use” []. It is important to recognize that usability extends beyond the immediate outcomes of use. Established standards such as ISO 9241 or other regulatory frameworks primarily view usability as a result of use, emphasizing attributes such as effectiveness, efficiency, and satisfaction. However, a holistic evaluation of usability attributes that goes beyond immediate outcomes contributes to a deeper understanding of user interactions with these technologies and their acceptance in everyday use.

For better adoption of wearables in combination with their companion mobile health (mHealth) apps in clinical settings, usability needs to be considered to safeguard the effectiveness, functionality, and ease of use of these technologies. Concerning the acceptance of wearable technologies, it has been advocated that the devices must be easy to wear, affordable, possess suitable functions, and be appealing to users [-]. In such circumstances, designers, developers, and interdisciplinary researchers need to consider the development and use of such devices in a user-centered manner [-], thereby affirming that wearable technology is relative to the requirements of the users as it is a vital factor in the adoption of digital health apps and devices because it can be challenging for users owing to their health conditions. Furthermore, usability testing of these technologies allows researchers to understand how the wearable being developed meets users’ requirements before being used in health interventions.

Exploring Key Attributes of Usability

Previous scientific literature disclosed the measurements of usability using different entities. These entities are defined as dimensions, components, scales, or factors of usability. According to Folmer and Bosch [], these terms are analogous and hold the same meaning. Therefore, as defined by Wixon and Wilson [], the term usability attribute is the characteristics of a product that can be measured. The most consistently reported usability attributes are effectiveness, efficiency, and satisfaction, which are part of the usability definition of ISO 9241-11:2018 [,]. Existing reviews have focused on reporting the usability attributes of mobile apps in health care; however, the shortcoming of applicable attributes for wearables and their companion app poses a challenge in assessing the usability of these technologies.

Exploring Evaluation Methods of Usability

Usability assessment is instrumental in determining how well users learn and use technology to meet their goals. This includes the effectiveness and efficiency of a device and how satisfied the users are with the process. Therefore, different usability evaluation methods should be used to gather this information. Existing literature shows that different methods have been used for the testing of wearables such as field studies and laboratory experiments [-]. Although laboratory experiments, field studies, and hands-on measurement are some of the most commonly used methodologies, these are sometimes difficult to apply and have drawbacks. The prevailing usability methods assess different facets of usability, each providing different data. Accordingly, the selection of methods plays a pivotal role in evaluating the desired attributes of usability. Previous reviews have investigated the usability of wearables [-] and mHealth apps independently [,-]. These reviews examined the usability assessment of wearables or mobile apps according to specific use cases in the health care domain [-]. Moreover, evaluation studies on the combination of the aforementioned devices were not taken into consideration. Research in this area continues to be fragmented, which demonstrates the importance of exploring further the requirements, functionalities, and capabilities of such wearable devices to enhance our comprehensive understanding of their use and acceptance.

Objectives

This study aims to survey the existing literature in the field of medicine and health care that reports on the usability of wearable technology, mHealth apps, or their combination. Our scoping review seeks to analyze the literature in three ways: (1) type (commercial or medical) and category (stand-alone or paired) of wearable devices and their frequency of use in studies; (2) medical use cases; and (3) usability evaluation of these technologies, specifically usability attributes, methods, and frameworks.

MethodsFramework

This study uses the framework developed by Arksey and O’Malley [] for reporting on scoping reviews, following the recommendations for enhancement of this approach by Levac et al []. We followed the five stages of the framework: (1) identifying the research question (RQ), (2) identifying relevant studies, (3) selecting studies, (4) charting the data, and (5) summarizing and reporting the results. In addition, the review followed the PRISMA-ScR (Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews) checklist to report the study selection process of the scoping review () [,].

Stage 1: Identifying the RQs

For our scoping review, we used the PICO (Population, Intervention, Comparison, and Outcome) model [,] shown in to help us regulate our RQs, outline the search strategy, and identify relevant studies within the health care domain. However, for our scoping study, the control or comparison aspect of the PICO methods was eliminated because our focus was not on comparative studies or controlled exposure.

Table 1. The PICO (Population, Intervention, Comparison, and Outcome) method applied to our review.AspectDescriptionOur reviewPatient, population, or problem Problem to be addressed Wearables
Fitness trackers
Physiological data
Intervention, prognostic factor, or exposure Situation or condition or a characteristic of a patient (technology savvy). Exposure to be considered in treatments and tests Mobile devices or smartphones
Mobile apps
Control or comparison Control or comparison intervention treatment or placebo or standard of care Comparison is eliminated as the focus is not on comparative studies or controlled exposure
Outcome to measure or achieve Outcome of interest—what can be accomplished, ensured, improved, or affected? Usability and human factors

This scoping review aims to accomplish its objectives by answering the following RQs:

RQ1: What type (commercial or medical) and category (stand-alone or paired) of wearable devices and their companion mHealth apps were implemented and how frequently were they used in the studies?RQ2: What medical use cases and medical data were reported?RQ3: What usability methods, frameworks, and attributes were used for the usability evaluation?Stage 2: Identifying Relevant Studies

The focus of the second stage of the Arksey and O’Malley framework [] was to find the relevant studies that match the RQs and the purpose of the scoping review. We began the review with an extensive search using keywords related to the PICO model. However, this raised questions about the sensitivity and specificity of the articles, that is, retrieving and identifying relevant research topic publications. Therefore, redefining the search terms after the initial search added the advantage of prioritizing the sensitivity of the relevant article.

We conducted our search in 4 electronic databases, including ACM Digital Library, IEEE Xplore, PubMed, and Web of Science (Clarivate Analytics), resulting in relevant studies covering the last 30 years up to December 2021. Relevant additional literature was also identified through other resources such as citations and expert recommendations. The search strategy was developed in association with the university librarian at the Medical Faculty Mannheim, Heidelberg University. The search integrated both search terms and Medical Subject Headings associated with the topics of health care, wearables, mHealth apps and terms used under the umbrella term user experience. The search strategy for the respective databases can be found in .

Stage 3: Study Selection

For our scoping review, all types of articles ranging from journal articles to conference papers were considered, without restrictions on the period of publication. In line with the systematic review methodology, we formulated inclusion and exclusion criteria for this scoping review. The inclusion and exclusion criteria are provided in . This allowed us to reduce the number of papers that were included in the screening of titles and abstracts. Citavi (version 6; Swiss Academic Software GmbH) was used for the collection of the articles.

Textbox 1. Inclusion and exclusion criteria for the scoping review.

Inclusion criteria

Language: EnglishPapers focused on wearables or a prototype of wearables that have used usability testing for their evaluation, where methods such as questionnaires, observations, experimental testing, or surveys were usedPapers focused on a mobile health (mHealth) app or a prototype of the mHealth app that used usability testing for its evaluation, where methods such as questionnaires, observations, experimental testing, or surveys were usedPapers that use either a wearable, an mHealth app, or both in a medical use case, for example, chronic diseasesPapers that use a user-centered design approach for developing wearables or mHealth apps

Exclusion criteria

Inclusion criteria not fulfilled, for example, papers not written in English or not matching any of the secondary inclusion criteriaPapers with the theme or topic of augmented reality and virtual reality, which may also include usability studies (eg, Google Glass)Papers that have only used audio and visual wearable aids (ie, without additional support from smartphones)Papers purely focused only on the technical aspects, technical descriptions, or features of wearables or mobile apps or PDAs in the development and testing processes of materials; self-developed sensors; or wearable sensors such as accelerometers, gyroscopes, or inertial measurement unitPapers that are focused mainly on medical professionals rather than patients, for example, describing algorithms or methods used for the optimization of viewing medical data (such as electrocardiogram and electroencephalogram)

We followed the recommendations from Daudt et al [] for interdisciplinary teamwork in scoping reviews: we incorporated reviewers from different disciplines and backgrounds such as health services research, usability engineering, and medical informatics. The reviewers were divided into 2 groups such that each group had members with diverse backgrounds and expertise. Furthermore, an expert not involved in the screening reviewed mismatched publications from the groups and made discrete decisions for the inclusion and exclusion of articles. Each team member independently reviewed the titles, abstracts, and full text of the publications assigned to them. Studies were considered for the full-text reading if the inclusion criteria were met and cross-verified among team members.

Stage 4: Charting the Data

At this stage, the data from the included studies were extracted. The review team collectively designed a structured data-charting format aligned with the RQs of the scoping review. Each team member individually extracted relevant characteristics from the included studies and adapted them to the data-charting format. Disagreements between the reviewers were resolved through discussions and feedback. The characteristics extracted from the included studies that are associated with the aim of this scoping review are as follows: (1) classification of wearable devices and mHealth apps (only wearables or paired); (2) type of wearable devices; (3) type of mHealth app (stand-alone or interactive); (4) medical use cases (if wearables or mobile apps or combination of both were used in a specific medical use case); (5) physiological data; (6) type of connections between wearables and apps (Wi-Fi or cables); (7) duration of usability studies; and (8) usability evaluation—usability attributes, frameworks, and methods. Excel (Microsoft Corp) was used to facilitate this process.

Stage 5: Summarizing and Reporting the Results

To effectively summarize and organize the extracted data, a comprehensive search of the relevant literature was conducted to identify suitable articles to structure the examination and analysis. Two relevant literature sources were identified to fulfill our objectives. We aimed to find a suitable classification system that could effectively categorize various wearable devices. The classification proposed by Seneviratne et al [], which provides a comprehensive survey of commercial wearable products grouped into 3 categories—accessories, e-textiles, and e-patches—served as a helpful tool for our analysis. In addition, we used the usability taxonomy hierarchy proposed by Alonso-Ríos et al [] for our analysis. This taxonomy provided a comprehensive framework for organizing usability attributes in a logical and meaningful order.

We present our findings by integrating descriptive tables and graphical illustrations of the outcomes. These figures and graphics helped our analysis to directly connect the findings to the objectives of our review and identify the gaps in the literature. In our study, we illustrate the frequencies and percentages of the findings in coherent data visualizations, emphasizing the analysis and reporting of data and giving them a comprehensive meaning.

ResultsEligible Studies

Our search yielded 382 records, including articles about wearables, mHealth apps, or their combination; research about the implementation of these technologies in medical use cases; and evaluations of their usability. Overall, 323 records were evaluated for the initial screening of titles and abstracts after eliminating duplicates. From these, 132 full-text papers were found, of which 62 were excluded, resulting in 69 studies whose data were charted per the study questions. Following the final text reading, a single study had to be excluded from this scoping review because it did not meet the predefined inclusion criteria, despite the presence of relevant keywords in the paper. The process of selection of articles for the scoping review can be seen in the PRISMA-ScR diagram shown in .

‎

Figure 1. PRISMA-ScR (Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews) flow diagram of the selection process for the scoping review. Wearable Devices and Their Frequency of Use

Most studies (57/68, 84%) used a combination of wearables and mobile apps. Overall, 12% (8/68) of the studies used wearables in conjunction with other technological devices such as smartphones, computers, recording devices, or PDAs; however, these devices were used independently from the wearables. Furthermore, the data extraction indicated that two-thirds of the studies (45/68, 66%) used commercially available wearables for their evaluation studies, of which approximately half of the studies (21/45, 47%) used Fitbit (Fitbit Inc.) devices as their source of data tracking and collection. Our data further showed that only 11% (5/45) of the studies used wearable devices that were certified as medical devices. In addition, only 18% (12/68) of the studies used a self-developed wearable prototype and a mobile app for data tracking and monitoring.

From the included studies, 9% (6/68) of the studies reported using only wearables in their studies. Of these 6 studies, 5 (83%) used commercially available wearable devices such as Fitbit and Samsung Gear S3 (Samsung Electronics Co, Ltd), whereas 1 (1%) study reported using a self-developed wearable prototype. However, 7% (5/68) of outliers were detected where 60% (3/5) of the studies mentioned using only a mobile app; 20% (1/5) of the studies reported using only a smartphone; and in 20% (1/5) of the studies, smartphone was used as a wearable device by attaching a 3D-printed phone holder around the user’s neck []. Although this is typically not defined as a wearable, the outcome of this study proved imperative in determining the different devices and variables used for evaluation. We used a classification system consisting of 6 distinct groups to categorize the diverse use of wearable technologies and mobile apps (with, without, or a combination thereof). They are (1) only wearables (eg, stand-alone wearables such as Fitbit and Garmin [Garmin Ltd]), (2) wearables+companion apps (eg, Garmin tracker+corresponding Garmin mobile app), (3) smartphone as wearables (eg, smartphone used in close proximity to the skin to track physiological data), (4) wearables+connectivity (not companion) apps (eg, wearables paired with connectivity apps for Bluetooth connection and not for presenting data), (5) wearables+other technologies (eg, Garmin+laptops, recording units, and PDAs), and (6) others (only smartphone or only app). The specified categories and the corresponding studies included in this review are presented in .

In most studies, the validation, accuracy, and certification of the used wearables were not thoroughly discussed, despite these aspects being considered essential in good research practices. Although some studies briefly touched upon validation or accuracy, they did not necessarily indicate that the wearables had undergone certification, such as Food and Drug Administration approval or Conformité Européenne mark. Authors of the included studies often omitted reporting the inaccuracies and validation limitations of consumer-grade wearables, particularly when usability was of significant importance. Instead, the accuracy of wearables was often assumed based on the authors’ validation of wearables’ selection through peer-reviewed research and their alignment with traditional instruments for measuring health data. The list of included studies along with the extracted information can be found in [,-].

Most of the wearables (35/68, 51%) covered in the review were wrist worn, such as wristbands or smartwatches. However, only 9% (6/68) of the studies used multiple wearables, such as a smartwatch and chest belt or multiple wrist-worn wearables. presents the data on the different categories of the types of wearable devices extracted from the included studies. More than half of the studies (48/68, 71%) deployed stand-alone mobile apps, which implies that users or patients collected health information using wearables and apps without sharing it with their professionals. Overall, 68% (46/68) of the studies determined that Bluetooth connections were the primary means of connectivity for wearables and the mobile apps that accompanied them. Data from the extracted studies revealed no linkage between the technologies; hence, outliers (4/68, 6%) were also recognized.

Table 2. Wearable devices, mobile apps, and their combination along with their frequency of use in studies (n=68).Category of wearable devices and mobile appsFrequency of use in studies, n (%)ReferencesOnly wearables6 (9)[-]Wearable devices+companion apps49 (72)[,-]Wearable devices+connected with apps (not companion app)5 (7)[-]Wearables+other technologies (eg, laptops, recording units, and PDAs)3 (4)[-]Smartphone as wearables1 (1)[]Others4 (6)[] (smartphone only), [-] (app only)Table 3. Categorization of the type of wearable devices according to the classification of wearable devices by Seneviratne et al [] ordered by their frequency of use in studies (n=68).Categorization of the type of wearablesDescriptionDevicesStudies, n (%)Wrist wornWrist-worn devices with fitness tracking capabilities or other functionalities, generally without a touchscreenWrist bands26 (38)Wrist wornWrist-worn devices with a touchscreen displaySmartwatches9 (13)>1 wearable device in the studyStudy includes >1 wearable device (any type)Wrist bands, Upper arm bands, e-Patches, Sensor patches6 (9)Other accessoriesChest straps, belts, upper arm bands (in contrast to wrist-worn bands), or knee straps equipped with sensors for health tracking or other functionalitiesStraps5 (7)e-TextilesMain clothing items that also serve as wearables, such as shirts, pants, and undergarmentsSmart garments5 (7)HearablesFits in or on an ear that contains a wireless linkHearing devices4 (6)Other accessoriesClip-onClip-on3 (4)e-PatchesSensor patches that can be adhered to the skin for either fitness tracking or haptic applicationsSensor patches3 (4)OutliersDoes not fit the categoriesWrist bands, Upper arm bands, e-Patches, Sensor patches3 (4)e-TextilesShoes, socks, insoles, or gloves embedded with sensorsFoot or hand-worn2 (3)Other accessoriesJewelry designed with features such as health monitoring and handless controlSmart jewelry1 (1)e-PatchesTattoos with flexible and stretchable electronic circuits to realize sensing and wireless data transmissione-tattoo or e-skin—aHead-mounted devicesSpectacles or contact lenses with sensing, wireless communication, or other capabilitiesSmart eyewear—Head-mounted devicesBluetooth enables headsets or earplugs. Sensor-embedded hats and neck-work devices are also found in research productsHeadsets or earbuds—

aNot available.

Medical Use Cases and Reported Data

Our data showed that approximately half of the studies (32/68, 47%) focused on participants with chronic illnesses or disorders, indicating the importance of wearable technologies in managing and monitoring chronic conditions. The remaining 53% (36/68) of the studies encompassed various other medical use cases such as wellness, mental health, rehabilitation, sleeping disorders, otolaryngology, and preventive measures.

Most studies (40/68, 59%) routinely collected physiological data from users or patients. The most commonly collected health data revolved around physical activity, encompassing metrics such as steps taken, stairs climbed, and inertial measurement units. Approximately 35% (24/68) of the health data gathered in the studies focused on cardiac measurements, including electrocardiogram, heart rate variability, heart rate, or blood pressure. In addition to physical activity and cardiac measurements, other data types were also collected, albeit to a lesser extent. Sleep data accounted for 25% (17/68) of the collected information, and brain activity data, such as electroencephalogram recordings, constituted 4% (3/68) of the data. Furthermore, biosignals, including measurements such as skin conductance and respiration rate, were captured in 18% (12/68) of the studies. Other health data and observations such as acoustics, posture, blood glucose, and weight were also monitored in approximately 21% (14/68) of the cases, indicating the broad range of parameters that wearables can track and analyze.

In addition to physiological data, a small proportion of the studies (4/68, 6%) included in our analysis also collected nonphysiological data that were not directly linked to health parameters. These data contained various variables such as the number of cigarettes consumed per day, location data, and dietary intake. Although not directly related to traditional health measurements, the inclusion of such data provides a broader context and enables a more holistic understanding of individuals’ behaviors and lifestyle factors.

Usability Attributes

The studies included in this review used various terms such as usability characteristics and attributes, which we consider to be synonymous. Therefore, we applied the usability attributes from the extracted data to the usability taxonomy []. We found that satisfaction (34/68, 50%), ease of use (27/68, 40%), user experience (16/68, 24%), perceived usefulness (18/68, 26%), and effectiveness (15/68, 22%) were the most commonly used attributes for assessing usability. Although user experience is acknowledged as the overarching term encompassing usability, conceptualizing user experience as a facet of usability captures the comprehensive perception arising from interactions with devices. This extends beyond the mere use of the device, encapsulating the entirety of the experience or the anticipated use of the technologies. Furthermore, we identified 32% (22/68) of the studies that simply reported usability or perceived usability. Moreover, our findings further indicate that out of the 29 identified usability attributes, 6 (21%) can be classified as quality attributes. shows the mapping of the attributes identified in the review to the attributes defined in the usability taxonomy. These particular usability characteristics possess qualities that are directly related to the overall quality and performance of the technologies used in the studies. presents the quality and product attribute matrix of the attributes ascertained in the review and the defined attributes from the ISO norm 25010 []. A detailed explanation of the different attributes in and can be found in [].

Table 4. Matrix mapping of the attributes identified in the scoping review and the usability taxonomy []a.Usability attributes from scoping reviewUsability taxonomy
KnowabilityOperabilityEfficiencyRobustnessSafetySubjective satisfactionNumber of studies, n (%)Accuracy
✓

5 (7)Aesthetics

✓7 (10)Attitude

✓4 (6)Attractiveness

✓2 (3)Clarity✓

1 (1)Cognitive load

✓

2 (3)Controllability
✓

1 (1)Data quality
✓

1 (1)Ease of use or perceived ease of use (effort expectancy, easiness, self-descriptiveness, and self-efficacy)✓

27 (40)Effectiveness (user errors, ease of executing a task, task completion, and task completeness)
✓

15 (22)Efficiency (task time)

✓

6 (9)Engagement

✓7 (10)Error tolerance

✓

1 (1)Functionality
✓

7 (10)Hedonic motivation

✓2 (3)Learnability✓

4 (6)Likes and dislikes

✓1 (1)Perceived usefulness (usefulness, utility, performance expectancy, and system usefulness)✓

18 (26)Satisfaction (subjective app quality, survey and ratings, positive and negative feedback, opinions and reactions, and participants’ experience)

✓34 (50)Technical difficulties

✓

2 (3)Trust

✓
1 (1)User control
✓

1 (1)User experience (experience, overall subjective quality, and user-friendliness)

✓16 (24)

aThe rows list all the usability attributes identified in the scoping review and the columns list the first-level usability attributes from Alonso-Ríos et al [], with each checkmark symbol indicating a match based on their description and their sublevel attributes. The last column lists the number of studies using the term from that row.

Table 5. Quality and product attributes identified in the scoping review that match the attributes from the International Organization for Standardization (ISO) norm ISO/International Electrotechnical Commission (IEC) 25010 []a.Attributes identified in the scoping reviewQuality attributeProduct attribute
EffectivenessEfficiencySatisfactionSafetyUsabilityFunctional suitabilityPerformance efficiencyCompatibilityUsabilityReliabilitySecurityMaintainabilityPortabilityNumber of studies (%)Comfort

✓

8 (12)Design—app interface and design and interface quality

✓

3 (4)Effectivenessb✓

15 (22)Efficiencyb
✓

✓

6 (9)Facilitating conditions

✓

1 (1)Information quality

✓

3 (4)Interface quality

✓

2 (3)Reliability

✓

1 (1)Trustb

✓

1 (1)Satisfactionb

✓

34 (50)

aThe rows list all the attributes identified in the scoping review and the columns list the quality and product attributes from ISO norm 25010 [], with each checkmark symbol indicating a match based on the description of the attributes. The last column lists the number of studies using the term from that row.

bOverlapping attributes also identified as usability attributes.

In addition, during the data extraction process, we obtained insights into the elements and factors that affect the evaluation of usability. Among the 68 included studies, 52 (76%) reported these elements and factors, which played a crucial role in shaping and guiding the measurement of usability. Notably, acceptance (21/68, 31%) emerged as the most commonly used element or factor in assessing usability, indicating its significance in understanding users’ acceptance and adoption of wearable technologies.

Usability Evaluation Methods and Frameworks

Only 12% (9/68) of the studies outlined using some sort of framework for usability evaluation. User- or human-centered design (7/68, 10%) was the most commonly used framework. Our findings revealed that more than half of the studies (37/68, 54%) collected data using the mixed methods approach. Only 15% (10/68) of the articles used only qualitative methods. These data collection methods included interviews, focus group discussions, thinking-aloud protocols, cognitive walkthroughs, open-ended discussions, Wizard of Oz, and free-text writing. Approximately two-thirds of the studies (21/68, 31%) used quantitative approaches for data collection during evaluation studies. The System Usability Scale (SUS) outnumbered other usability questionnaires (17/68, 25%) such as the Mobile Application Rating Scale, Net Promoter Score, Single Ease Question, NASA Task Load Index, and Technology Acceptance Model. However, a large percentage (21/68, 31%) of the articles used self-developed surveys or self-reporting questionnaires. Only one-fourth of the articles (16/68, 24%) further implemented statistical analysis including task completion, number of errors, descriptive statistics, or Google Analytics for the assessment of usability. Consequently, only a small proportion (3/68, 4%) of the included studies performed heuristic evaluation as a form of expert evaluation.

DiscussionPrincipal Findings

Our data suggest that the evaluation of wearables for medical purposes was largely conducted without direct integration with mobile apps. Although some studies used smartphones as a means of connecting with the wearables, users were not assigned companion apps for data viewing. This limits the analysis and data visualization capabilities of the data collected within the studies. Wrist-worn devices were the most common type of wearables identified in the studies, indicating the convenience of using this type of wearables for measuring physical activities in a research setting. We also found that most studies (21/68, 47%) reported the use of consumer-grade fitness and activity trackers from Fitbit, and only a handful of the studies (5/68, 7%) implemented medical-grade wearable devices. A small number of the studies (10/68, 15%) investigated the data collection and use of the wearable in the aspects of aftercare of patients, wellness, and rehabilitation.

We noted the absence of standards or guidelines to facilitate the analysis of the usability of wearables, mobile apps, or their combination. Although user- and human-centered design frameworks were mentioned in a few studies (7/68, 10%), they are guiding the design and development of systems and devices focusing on users and their needs but not the usability of these devices. Despite the fact that many wearable technologies were included in the study, no usability evaluations of multiple devices, the combination of devices, or multidevice interfaces were reported.

Only a little more than one-third of the included publications (22/68, 32%) in our review explicitly reported the measurement of usability or perceived usability. Some studies (8/68, 12%) primarily focused on assessing the measurement of usability or perceived usability as user perceptions of the devices, attitudes, and compliance using different qualitative or quantitative methods [,,,,,,,]. However, it is worth noting that these studies encountered a challenge in clearly differentiating the evaluation of the usability of the wearable device from that of the accompanying mobile app. Consequently, the intended purpose of the evaluation may have been limited in these studies. Therefore, we address the term usability, a broad term that encompasses various factors including technology and user acceptability. Studies might have reported on usability alone either due to missing expertise or due to a high-level summarization of various aspects they investigated. To help mitigate this, we incorporated and synthesized a set of attributes or subattributes to measure the capability and performance of a system based on the extracted data.

Our analysis showed that a subset of usability studies lacked testing with the intended target group and instead relied on healthy adults as participants. Despite the absence of explicit acknowledgment of this limitation in the included studies, it raises concern regarding the extent to which the devices and apps under investigation adequately address the unique needs and requirements of the target users, particularly individuals with chronic health conditions. This observation is in line with the findings reported in previous studies [-], emphasizing the concern regarding the devices and apps under investigation that adequately meet the unique needs and requirements of the target users.

Comparison With Prior WorkWearables

Most of the studies (45/68, 66%) in our review used consumer-grade wearables. This matches the observations of other studies [-] that investigated commercially available wearables and reported a wide variety of purposes, ranging from digital diagnostic tools to sports tracking to remote monitoring. Niknejad et al [] and Ferreira et al [] reported that consumer-grade wearables have been used to foster self-awareness among users. In contrast, we observed that the studies using consumer-grade wearables in our corpus focused on their use for monitoring chronic conditions such as diabetes, obesity, cardiology, and cancer.

We found that wearables are gradually being used more widely in health care and clinical settings. As stated in the previous paragraph, this is true for consumer-grade wearables such as Fitbit, Jawbone, Apple, and Garmin [-]. However, concerns about the safety, reliability, and accuracy of these devices persist. In their work, Piwek et al [] raised concerns about wearables in terms of user safety, emphasizing the need to better address the reliability and security of the data collected from these devices. Considering the inherent lack of emphasis on user safety in consumer wearables, it is imperative to acknowledge the importance of adhering to standard safety and privacy protocols. This includes ensuring ethical transparency and providing appropriate education to users regarding the privacy and information security risks they may encounter when using such devices [,,]. In addition, the use of consumer wearables in health care settings remains somewhat ambivalent at present [,]. Although our review did not specifically address these concerns, we acknowledge that these factors significantly affect the usability of wearable technologies, whether used independently or with companion mHealth apps. Similar to Piwek et al [], who pointed out acceptance challenges of wearables concerning safety and security, we believe that a structured framework with clear definitions and well-defined methods would allow bringing wearables into more diverse practices in the health care system, encouraging a broader adoption and implementation of use cases with high and tested usability.

Studies by Niknejad et al [], Dimou et al [], and Yang et al [] proposed different categorization approaches for wearables, considering factors such as industry relevance (eg, health care or fashion) or wearable placement on the body. However, owing to the wide variety of wearables available in the market, establishing a standardized classification or hierarchy for these different types of wearables becomes challenging. Thus, to help designers and developers, a standardized classification or hierarchy would be helpful when selecting wearables for specific use cases.

Usability Attributes

Many studies in our corpus did not explicitly state the usability attributes they evaluated. Some mentioned generic terms such as usability or user experience, but did not define them further for their specific cases. As we have argued, more specific usability attributes can facilitate the development of more appropriate requirements and clearer identification of problems in usability studies. This matches observations in related areas. Meyer et al [] analyzed usability evaluation practices in wearable robotics and recommended better distinguishing between the different usability dimensions and including qualitative measures for identifying a wider range of usability issues. Chiauzzi et al [] examined the use of wearable devices for long-term chronic disease management. Patient concerns regarding technical difficulties and the appeal of the devices were identified, but their investigation did not address usability attributes such as device comfort and usefulness. Furthermore, the authors emphasized the importance of wearables being perceived as usable and generating comprehensible data to facilitate wider adoption among patients.

Among the 29 identified attributes in our review, 6 (21%) attributes were found to be more suitable for capturing the quality or product-related aspects of the technologies investigated in this study (). However, determining the most appropriate attributes for wearables and their associated mobile apps can be challenging because of the potential overlap between the product and inherent attributes, such as effectiveness and satisfaction (). This finding is consistent with Bakhshian and Lee [], who argued that consumers’ attitudes and purchase intentions toward wearable technology are influenced by both product attributes and inherent attribut

View original article

JMIR MHEALTH AND UHEALTH

分享书签

0 0 0 0 0 0 0

More from this channel

Attributes, Methods, and Frameworks Used to Evaluate Wearables and Their Companion mHealth Apps: Scoping Review

留言 (0)