Evaluation of Appendicular Muscle Mass in Sarcopenia in Older Adults Using Ultrasonography: A Systematic Review and Meta-Analysis

Background: The measurement of appendicular muscle mass is essential for the diagnosis of sarcopenia. Ultrasonography is an accurate and convenient method used to evaluate muscle mass. Objective: The aim of the study was to evaluate the diagnostic value of ultrasonography for appendicular muscle mass in sarcopenia in older adults and find out proper ultrasound parameters. Methods: Medline, Embase, Cochrane, and Web of Science databases were searched for relevant articles. Published studies on the validity and/or reliability of ultrasonography for quantifying muscle mass of the limbs in sarcopenia in the older population were included. A systematic review was conducted based on specific muscles and reference methods. A meta-analysis was conducted to assess the validity and reliability of the ultrasonography. Results: Forty articles were included in this review. There were nine, nine, nine, and four studies included in the qualitative synthesis for a diagnostic test, correlation coefficient, intra-class reliability, and inter-class reliability, respectively. The diagnostic value of rectus femoris (RF) or gastrocnemius (GM) thickness on ultrasonography for sarcopenia or low muscle mass was moderate (the area under summary receiver operating characteristic curve [SROC] = 0.76, 95% confidence interval [CI]: 0.72–0.79, SROC = 0.80, 95% CI: 0.76–0.83, respectively). The pooled correlation between muscle mass on dual-energy X-ray (DXA) or bioelectrical impedance analysis (BIA) and muscle thickness (MT) on ultrasound was moderate (r = 0.56, 95% CI: 0.49–0.62). There was a low-to-moderate correlation between muscle mass on DXA or BIA and cross-sectional area (CSA) on ultrasound (r = 0.267–0.584). The correlation was high to very high between muscle mass from DXA and the ultrasound-predicted formula (r = 0.85–0.963). The CSA from ultrasound had a high or very high correlation with that from computed tomography or magnetic resonance imaging (r = 0.826, intra (inter)-correlation coefficient [ICC] = 0.998–0.999). The respective meta-analyses showed good inter-rater and intra-rater reliabilities (ICC > 0.9). Conclusion: Ultrasonography is a reliable and valid diagnostic method for the quantitative assessment of appendicular muscle mass in sarcopenia in older people. The thickness and CSA of the RF or GM seem to be proper ultrasound parameters to predict muscle mass in sarcopenia. Multicenter studies with large samples and the application of new ultrasonic techniques will be the future research directions.

© 2022 The Author(s). Published by S. Karger AG, Basel

Introduction

The term sarcopenia was first proposed by Rosenberg in 1988 and was recognized as an independent muscle disease with an International Classification of Diseases-10 Code in 2016 [1, 2]. The definition is controversial and still evolving due to scientific development. The disputed issues include age-related, the variables (e.g., muscle mass, muscle strength, and physical performance) to be included, and cutoff points. It was defined as “age-related loss of skeletal muscle mass plus loss of muscle strength and/or reduced physical performance” according to the 2019 Consensus by the Asian Working Group for Sarcopenia (AWGS2019) [3]. The pathophysiologic mechanisms of sarcopenia remained unclear. It was reported that potential molecular pathophysiology may include protein synthesis and degradation, autophagy, impaired satellite cell activation, mitochondria dysfunction, and other factors associated with muscle weakness and muscle degeneration [4]. Age was the most important risk factor independently associated with sarcopenia, and studies showed that incidence of sarcopenia increased with age [3, 5, 6]. A study on Chinese participants showed that the incidence of sarcopenia was increased during follow-up [7]. Aging seemed to disturb the balance between muscle anabolic and catabolic pathways, resulting in the loss of skeletal muscle. The size and number of myofibers decreased at the cytology level, especially type II fibers. Because type II fibers transformed into type I fibers, intramuscular and intermuscular fat infiltration occurred and the number of type II fiber satellite cells decreased with age [8-11].

With the life expectancy generally extended, the world population older than 60 years has continuously increased, and the prevalence of sarcopenia is destined to increase [12]. The prevalence of sarcopenia differs according to different definitions, settings, populations, and other factors [5, 13]. A systematic review of 58,404 older adults aged 60 years and older in community found that the overall prevalence of sarcopenia was 10% in the world [14]. The prevalence of sarcopenia ranges from 5.5% to 25.7% in Asian countries, with 5.1–21.0% in males and 4.1–16.3% in females [3]. It is estimated that 50 million individuals worldwide currently suffer from sarcopenia, and the number is expected to quadruple over the next 40 years [15].

Sarcopenia, associated with functional decline, can result in multiple adverse consequences, such as an increased incidence of falls and fractures, impaired daily life capabilities, low quality of life, and death [6, 16, 17]. Furthermore, sarcopenia may be associated with postoperative complications and mortality, resulting in a worse prognosis in patients undergoing major surgery, prolonging the length of hospitalization, etc. [16, 17]. Thus, sarcopenia causes high personal, social, and economic burdens, and its early diagnosis and intervention are recommended [3].

Sarcopenia is diagnosed when low muscle mass coexists with low muscle strength or low physical performance [3, 5, 6]. Currently, methods to evaluate muscle mass include magnetic resonance imaging (MRI), computed tomography (CT), dual-energy X-ray (DXA), bioelectrical impedance analysis (BIA), anthropometry, and ultrasonography (US) [15, 18]. MRI and CT were considered as gold standards for evaluating muscle quantity and quality simultaneously; however, they cannot be applied to large-scale community screening and long-term follow-up work, due to high costs, ionizing radiation, time-consuming procedures, and not being portable [6, 16, 17, 19, 20]. Therefore, MRI and CT have been used in scientific research rather than clinical practice. DXA and BIA, with high accuracy and reliability, were recommended in clinical practice by the European Working Group on Sarcopenia in Older People in 2018 (EWGSOP2) and AWGS2019 [3, 15, 20]. However, DXA lacks portability, has device specificity, and may be influenced by the hydration status of patients [6, 16, 17, 19]. BIA is portable but device-specific and population-specific, and its results are greatly affected by the body hydration state, which results in underestimation of muscle mass, and applicability of this device in people with chronic diseases needs to be further verified [5, 6, 19, 21, 22]. Anthropometry is a noninvasive, inexpensive, and convenient method that can be applied in large surveys; however, it requires training, does not generate precise results in the obese population, and is difficult to use for measurements in the elderly [23]. Thus, it was not an ideal method to measure muscle mass and was used when no other methods were available.

US, characterized as real-time, portable, affordable, free of radiation exposure, is an excellent imaging technique for evaluating and long-term monitoring of musculoskeletal disorders; more importantly, it can be performed at the bedside, making it more suitable for older adults [15, 17, 19, 20, 24]. US can assess muscle mass via muscle thickness (MT), cross-sectional area (CSA), and muscle volume [15]. Moreover, the measurement of EI on US can evaluate muscle quality because the echo intensity (EI) of muscles in sarcopenia increase due to fat infiltration. Other US parameters associated with sarcopenia include pennation angle and muscle fascicle length [16]. Indeed, the applicability of ultrasound in evaluating muscle quantity and quality has been proven [16, 25-27]. Most of the studies supported that ultrasound may be a potential, accurate, and promising method for estimating muscle mass in the future [6, 16, 22, 24, 28].

A few systematic reviews have reported the assessment of muscle mass with ultrasound [26, 29]. One study published in 2018 evaluated the evaluation of US in sarcopenia but was restricted to standardized ultrasound measurements for muscle assessment, not involving the reliability and validity of US to evaluate muscles [29]. Another study published in 2017 reviewed the reliability and validity of US to evaluate muscles in older adults, but meta-analysis was not performed [26]. Nowadays, the clinical use of US to assess muscle mass in sarcopenia remains controversial for the lack of standardized protocols and validated cutoff points [3, 6, 16, 22, 24, 28]. We were concerned with the evaluation of appendicular muscles in sarcopenia because appendicular muscles are easily accessible, time-saving, and suitable for community screening and follow-up, and lower limbs are load-bearing muscles, which are affected earlier and more commonly by sarcopenia. New studies on the diagnosis of sarcopenia using US have emerged in recent years. To evaluate the diagnostic value of US for appendicular muscle mass in sarcopenia in older adults and find out the proper muscle and ultrasound parameters, we conducted a systematic review and meta-analysis.

Materials and Methods

This systematic review and meta-analysis were performed according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) and registered at the International Prospective Register of Systematic Reviews (PROSPERO) (number CRD42021181693).

Search Strategy

We searched Embase, Medline, Cochrane, and Web of Science databases from inception to December 11, 2020. The terms relevant to sarcopenia, US, and older adults were combined in the search strategy. Using the Medline database as an example, “Ultrasonography,” “sarcopenia,” and “aged” were searched using Medical Subject Headings (MeSH). Their synonyms were searched in the title/abstract. The restriction was “human”; the operator “OR” was used to combine every term with their synonym, respectively, then the operator “AND” was used to combine the three queries above. Manual retrieval and cross-referencing from original articles and reviews were also performed. An example of the search query in the Embase database is presented (shown in online suppl. Table S1; see www.karger.com/doi/10.1159/000525758 for all online suppl. material). Similar search strategies were performed using the other databases.

Study Search

Published studies concerning the validity and reliability of US to evaluate appendicular muscle mass in the older adults (average age or median age ≥50 years) were qualified for inclusion in this study. Only the sections on the limb muscles were selected in the studies involving the limb and trunk muscles. Additionally, only the sections on the elderly were selected in studies involving older and young adults.

Exclusion criteria: (1) studies of animals or cadavers, studies of non-appendicular muscles (e.g., abdominal muscles, masseter muscles, lingual muscles, digastric muscle, and temporal muscle), studies with a mean age or median age of less than 50 years, and studies on critically ill patients; (2) case reports, reviews or systematic reviews, editorials, letters to the editor, consensus, and conference paper; (3) studies not providing sufficient data to calculate; and (4) abstract/title only publications.

EndNote software was used for document management. After removing duplicates, titles and abstracts were screened independently by two reviewers (R Zhao, X Li) who were blinded to each other’s decisions. According to the inclusive and exclusive criteria, the studies were independently classified as relevant or irrelevant. The full text of the relevant study was assessed for eligibility independently by the two reviewers. Disagreements were resolved by negotiation. The study selection process is presented in Figure 1.

Fig. 1.

Flow diagram of the study selection process. * The searches were re-run just before the final analyses and a new study was included for the diagnostic tests.

/WebMaterial/ShowPic/1446892Data Extraction

Two reviewers extracted data from the included studies using an Excel spreadsheet independently. Any discrepancies were determined by consensus. The extracted data included the authors, year, country, sample size, age, sex, ethnicity, setting, health status, transducer type, scanning plane, muscles, imaging modality, ultrasound parameters (MT, CSA), the definition of sarcopenia, reference methods, validity and/or reliability estimates, and study results including statistical findings and overall conclusions.

Quality Assessment

Selected studies were assessed independently by two reviewers (R Zhao, X Li) using a methodological quality assessment scale by Pretorius and Keating [25]. Ten items were contained to assess the value of US to evaluate muscles (shown in online suppl. Table S2). Each item was scored as “Yes” (1 point), “No” (0 points), or “Not available” (0 points). A high total score suggested good methodological quality.

Statistical Analysis

A systematic review was conducted according to various muscles and reference methods. The validity included diagnostic tests and correlations. The diagnostic test was measured with sensitivity and specificity. The predictive power was measured with the area under the curve (AUC) or adjusted coefficient of determination and standard error of the estimate. Association was measured with odds ratio or correlation coefficient (r), according to the data reported. Reliability was measured with intra (inter)-correlation coefficient (ICC) or kappa values (κ). For AUC, the values of 0–0.5, 0.5–0.7, 0.7–0.9, 0.9–1, and 1 were regarded as none, low, moderate, high, and perfect predictive power, respectively [30]. For correlation coefficient, the values of 0–0.25, 0.26–0.49, 0.50–0.69, 0.70–0.89, and 0.90–1.00 were graded as little, low, moderate, high, and very high correlation, respectively [31]. For ICC, the values of 0–0.50, 0.51–0.75, 0.76–0.90, and 0.91–1.0 were graded as poor, moderate, good, and excellent, respectively [31].

MedCalc Statistical Software version 15.2.2 (MedCalc Software bvba, Ostend, Belgium; http://www.medcalc.org; 2015) was applied to compare the correlation coefficient of muscle mass between bilateral sides or ultrasonographic parameters among different contrast parameters. Data on validity and reliability were pooled with a meta-analysis using random-effects methods, when possible. We performed I2 testing to assess heterogeneity between studies. Statistical heterogeneity exists when I2 values are greater than 50%. Subgroup analysis was then performed according to muscle type when possible. A funnel plot of each study was conducted to assess publication bias. We conducted Begg and Egger tests simultaneously to assess funnel plot asymmetry. Sensitivity analyses with a one-by-one elimination method were used to evaluate the reliability of the meta-analysis. The effect of publication bias was estimated with trim-and-fill computation. Statistical significance was defined as p < 0.05. The Stata Software version 15.1 was used for statistical analyses.

ResultsSummary of Findings

A total of 40 articles were included [32-71]. The general characteristics of all included studies are presented in Table 1. There were 20 (50%) studies conducted in Asia [32, 33, 38, 40-42, 44, 46, 48, 51, 52, 59, 60, 62, 64-68, 71], 12 (30%) in Europe [35-37, 43, 45, 47, 49, 50, 54, 56, 61, 69], 4 (10%) in North America, 2 (5%) in South America, and 2 (5%) in Oceania. The settings were community or laboratory in 22 (55%) studies, outpatient or inpatient in 16 (40%) studies, and not available in 2 (5%) studies. The subjects were relatively healthy adults free of overt chronic disease (e.g., neuromuscular, diabetes, severe liver disease, kidney disease, chronic infections, cardiovascular disease, cancer, stroke, Parkinson’s disease, paresis, amputations, autoimmune disorders affecting the musculoskeletal system, etc.) in 28 (70%) studies, chronic kidney disease patients, chronic obstructive pulmonary disease patients, and heart failure patients in 2 studies, respectively, and coronary artery disease, diabetes, liver cirrhosis, stroke, frailty, and acute kidney injury in one study, respectively. The mean or median ages ranged from 57.5 to 88 years. The reliability and validity of US were evaluated in 12 and 32 studies, respectively. Of the 32 studies on validity, the comparison with US was BIA, DXA, and CT in 13 (40.6%), 12 (37.5%), and 4 (12.5%) studies, respectively, and MRI, CT or MRI, mid-arm muscle circumference in each study; the most commonly evaluated muscle was the rectus femoris (RF) (n = 13), followed by the quadriceps femoris (QF) (n = 10) and gastrocnemius (GM) (n = 9); the MT was the most commonly used ultrasonic parameter (n = 29) followed by CSA (n = 6).

Table 1.

The general characteristics of all the included studies

/WebMaterial/ShowPic/1446898Methodological Quality Assessment

A methodological quality assessment is shown in Figure 2. There were 34 (87.5%) studies that scored 6 points or higher, indicating the good quality of the methodology. Only one (2.5%) study [50] scored 4 points, and five (12.5%) studies [37, 53, 58, 59, 66] scored 5 points. The study that was scored as 4 and three of the five studies that scored as 5 only assessed reliability of the US [37, 50, 53, 58]. Furthermore, some of the items, such as “time frame” and “adequate detail reported,” did not apply to them, so the low scores did not demonstrate a low methodological quality of the studies.

Fig. 2.

Quality assessment of the included studies.

/WebMaterial/ShowPic/1446890Meta-Analysis of the Diagnostic Value of US for Sarcopenia

Twelve studies assessed the diagnostic value of US to diagnose low muscle mass or sarcopenia, with DXA, BIA, CT, MRI, or anthropometric measures as the gold standard (shown in Table 2). Five studies assessed the diagnosis of low muscle mass [32, 36, 41, 64, 66], and seven examined the diagnosis of sarcopenia [42, 43, 47, 48, 61, 63, 68]. The diagnostic criteria were AWGS 2014 [72] in four studies [41, 42, 66, 68], AWGS 2019 [3] in two studies [32, 64], EWGSOP 2010 [15] in one studies [43], EWGSOP 2019 [6] in two study [36, 47], and FNIH [73] in two studies [36, 61]. The reference method was BIA in seven studies [32, 36, 42, 48, 64, 66, 68], DXA in three studies [41, 43, 61], CT or MRI in one study [63], and anthropometric measures in one study [47].

Table 2.

The diagnostic value of ultrasonography to predict low muscle mass or sarcopenia by muscle mass measurement in older individuals

/WebMaterial/ShowPic/1446896

MT was measured in 10 studies [32, 41-43, 47, 48, 63, 64, 66, 68]. One evaluated the accuracy of the ultrasound-derived prediction equation with QF thickness and showed a moderate diagnostic performance for sarcopenia in men (AUC = 0.78) and women (AUC = 0.89). Nine studies provided data on the accuracy of MT obtained by the US compared to BIA, DXA, or anthropometric measures as the gold standard. We performed a meta-analysis according to muscle type. There were RF in four studies, GM in five, QF in four, and vastus intermedius (VI) in three.

For the MT of RF obtained by the US, the diagnostic value was moderate (the area under summary receiver operating characteristic curve [SROC]: 0.76, 95% CI: 0.72–0.79), with the pooled sensitivity of 0.78 (95% confidence interval [CI]: 0.57–0.90) and specificity 0.70 (95% CI: 0.62–0.76). The heterogeneity for SROC was substantial (I2: 81.95%, 95% CI: 59–100) (shown in Fig. 3, 4). Deeks’ funnel plot asymmetry test indicated no significant publication bias (p = 0.58) (shown in online suppl. Fig. S1).

Fig. 3.

Sensitivity and specificity of rectus femoris thickness on ultrasound for the diagnosis of sarcopenia.

/WebMaterial/ShowPic/1446888Fig. 4.

Summary receiver-operating characteristic curve: rectus femoris thickness by ultrasound ①: Fukumoto 2021 [32]/female, ②: Fukumoto 2021 [32]/male, ③: Yamada 2017 [66]/female, ④: Yamada 2017 [66]/male, ⑤: Sato 2019 [68]/total, and ⑥: Rustani 2019 [43]/total.

/WebMaterial/ShowPic/1446886

For the MT of GM obtained by the US, the diagnostic value was moderate (SROC: 0.80, 95% CI: 0.76–0.83), with the pooled sensitivity of 0.79 (95% CI: 0.69–0.87) and specificity 0.69 (95% CI: 0.63–0.75). The heterogeneity for SROC was moderate (I2: 51.95%, 95% CI: 0–100) (shown in Fig. 5, 6). Deeks’ funnel plot asymmetry test indicated no significant publication bias (p = 0.70) (shown in online suppl. Fig. S2).

Fig. 5.

Sensitivity and specificity of gastrocnemius thickness on ultrasound for the diagnosis of sarcopenia.

/WebMaterial/ShowPic/1446884Fig. 6.

Summary receiver-operating characteristic curve: gastrocnemius thickness by ultrasound ①: Fukumoto 2021 [32]/female; ②: Fukumoto 2021 [32]/male; ③: Barotsis 2020 [47]/total; ④: Yuguchi 2020 [64]/total; ⑤: Wang 2018 [41]/total; ⑥: Kuyumcu 2016 [48]/total.

/WebMaterial/ShowPic/1446882

For the MT of QF obtained by the US, the diagnostic value was low (SROC: 0.69, 95% CI: 0.65–0.73), with the pooled sensitivity of 0.68 (95% CI: 0.60–0.75) and specificity 0.68 (95% CI: 0.61–0.74). The heterogeneity for SROC was mild (I2: 29.95%, 95% CI: 0–100) (shown in Fig. 7, 8). Deeks’ funnel plot asymmetry test indicated a significant publication bias (p = 0.03) (shown in online suppl. Fig. S3).

Fig. 7.

Sensitivity and specificity of quadriceps femoris thickness on ultrasound for the diagnosis of sarcopenia.

/WebMaterial/ShowPic/1446880Fig. 8.

Summary receiver-operating characteristic curve: quadriceps femoris thickness by ultrasound ①: Fukumoto 2021 [32]/female; ②: Fukumoto 2021 [32]/male; ③: Yamada 2017 [66]/female; ④: Yamada 2017 [66]/male; ⑤: Hida 2018 [42]/female; ⑥: Barotsis 2020 [47]/total; ⑦: Hida 2018 [42]/male.

/WebMaterial/ShowPic/1446878

For the MT of VI obtained by the US, the diagnostic value was low (SROC: 0.64, 95% CI: 0.60–0.68), with the pooled sensitivity of 0.61 (95% CI: 0.53–0.69) and specificity 0.68 (95% CI: 0.58–0.76). The heterogeneity for SROC was moderate (I2: 61.95%, 95% CI: 11–100) (shown in Fig. 9, 10). Deeks’ funnel plot asymmetry test indicated a significant publication bias (p = 0.01) (shown in online suppl. Fig. S4).

Fig. 9.

Sensitivity and specificity of vastus intermedius thickness on ultrasound for the diagnosis of sarcopenia.

/WebMaterial/ShowPic/1446876Fig. 10.

Summary receiver-operating characteristic curve: vastus intermedius thickness by ultrasound ①: Barotsis 2020 [47]/total; ②: Fukumoto 2021 [32]/female; ③: Fukumoto 2021 [32]/male; ④: Yamada 2017 [66]/female; ⑤: Yamada 2017 [66]/male.

/WebMaterial/ShowPic/1446874

CSA was measured in two studies [36, 61]. One study evaluated the accuracy of an ultrasound-derived prediction equation and showed that the appendicular lean mass (ALM) derived from muscle ultrasound measurements had a very high correlation with that measured by DXA (adjusted R2 = 0.92). The other study evaluated the diagnostic value of RF-CSA for low muscle mass with BIA and showed moderate discriminative ability (AUC: 0.7 in males, and 0.7–0.9 in females).

The Association of Muscle Mass with Ultrasonographic ParametersThe Comparison of Correlation Coefficient of Muscle Mass with Ultrasonographic Parameters between Bilateral Sides

The results of Kuyumcu et al. [48] showed no significant difference in the correlation coefficient for fat-free mass index (FFMI) and GM thickness, fascicle length, and pinnation angle on US between the left and right sides (p = 0.19, p = 0.45, p = 0.23). Jung et al. [51] discovered no significant difference in the correlation coefficient of the appendicular skeletal muscle index and the thicknesses of the VI, RF, GM, tibialis anterior (TA), and biceps brachii (BB) between the paretic and unaffected sides in stroke patients (p = 0.38, p = 0.69, p = 0.27, p = 0.71, p = 0.61) [51]. Therefore, we chose the correlation coefficient of the right side, the dominant side, or the average of the bilateral sides in articles involving bilateral sides for quantitative analysis.

However, Barotsis et al. [47] found significant differences in the thickness values between the dominant side and nondominant side concerning TA (longitudinal and transverse sections), VI (longitudinal section), GM (longitudinal and transverse sections), and the anterior arm muscles (transverse section), and no significant differences concerning VI (transverse section), anterior arm muscles (long section), and RF (transverse and longitudinal sections).

Meta-Analysis of the Correlation Coefficient of Muscle Mass and Ultrasonographic Thickness

There was a significantly positive correlation between MT and muscle mass (r = 0.56, 95% CI: 0.49–0.62), with statistically significant heterogeneity (I2 = 85.5%, p = 0.000) (shown in Fig. 11). Subgroup analysis by muscle showed that there was a significantly positive correlation between MT of QF and muscle mass (r = 0.43, 95% CI: 0.38–0.48), with no significant heterogeneity (I2 = 18.1%, p = 0.299) (shown in Fig. 11). The Egger test showed a publication bias (p = 0.008) (shown in online suppl. Fig. S5). The trim-and-fill test showed that the estimates were not influenced by publication bias (shown in online suppl. Fig. S6). Sensitivity analyses with a one-by-one elimination method showed that the primary meta-analysis was reliable (shown in online suppl. Fig. S7).

Fig. 11.

Forest plot of the correlation between muscle thickness on ultrasound and muscle mass on reference techniques.

/WebMaterial/ShowPic/1446872The Correlation Coefficient of Muscle Mass with CSA of Muscles

The studies found a low-to-moderate correlation between muscle mass from DXA or BIA and CSA of RF or BB on ultrasound (r = 0.267–0.584, R2 = 0.426–0.438, ICC = 0.546–0.685) [36, 44, 45, 55]. The CSA of the RF or vastus lateralis (VL) from ultrasound had a high or very high correlation with that from CT or MRI (r = 0.826, ICC = 0.998–0.999) [49, 55].

Odds Ratio of Muscle Mass and Ultrasonographic Parameters

Five studies measured the association between ultrasound parameters and low muscle mass or sarcopenia via odds ratio [38, 41, 44, 47, 64]. Wang et al. [41] and Yuguchi et al. [64] found that GM thickness on US was associated with low skeletal muscle mass. Isaka et al. [38] found that the MT of TA was associated with sarcopenia. Barotsis et al. [47] found that the MT of VI, RF, QF, and GM was associated with sarcopenia. Li et al. [44] showed that the CSA of BB was a significant risk factor of sarcopenia (shown in Table 3).

Table 3.

The association of muscle ultrasound parameters with reference techniques of muscle mass measurement in older individuals

/WebMaterial/ShowPic/1446894The Correlation Coefficient of Muscle Mass from DXA with Ultrasound-Predicted Muscle Mass

Gulyaev et al. [61] showed a very high correlation between ALM from DXA and prediction equations derived from the CSA of BB, FA, and RF on US (r = 0.961–0.963). Abe et al. [65] found a high or very high correlation between ALM from DXA and prediction equations derived from MT of different muscles on US (r = 0.85–0.944, R2: 0.877–0.972, ICC = 0.808–0.963), between ALM-minus-appendicular-fat-free adipose tissue from DXA and prediction equations derived from MT of different muscles on US (r = 0.935, R2: 0.868–0.902) [33, 40, 59, 60, 65] (shown in Table 3).

The Reliability of US to Assess Muscle Mass in Older AdultsThe Intra-Class Reliability of US to Assess Muscle Mass

In terms of intra-class reliability, seven studies referred to the thickness of muscles involving the RF, VI, QF, VL, and GM. The meta-analysis of MT showed that the intra-class reliability was excellent (ICC = 0.96, 95% CI: 0.94–0.97), with statistically significant heterogeneity (I2 = 53.9%, p = 0.017). Subgroup analysis by muscle showed that the intra-class reliability of the thickness of the QF was excellent (ICC = 0.95, 95% CI: 0.90–0.97), with statistically significant heterogeneity (I2 = 67.2%, p = 0.009), and the intra-class reliability of the thickness of the RF was excellent (ICC = 0.97, 95% CI: 0.94–0.98), with no heterogeneity (I2 = 0.0%, p = 0.669) (shown in Fig. 12). The intra-class reliability of MT for VL, GM, VI was excellent (ICC = 0.96 for VL, 95% CI: 0.90–0.98, ICC = 0.97 for GM, 95% CI: 0.92–0.99, ICC = 0.98 for VI, 95% CI: 0.95–0.99, respectively).

Fig. 12.

Forest plot of intra-class reliability of muscle thickness on ultrasound by muscles. * The Wilson 2019-1 means that the subject were 39 healthy older patients, and the Wilson 2019-2 means that the subjects were 31 frail older patients in this article.

/WebMaterial/ShowPic/1446870

Four studies referred to CSA of muscles involving RF, BB, and VL. In the three studies involved RF, the intra-class reliability was excellent (ICC = 0.91, 95% CI: 0.76–0.97), with statistically significant heterogeneity (I2 = 87.5%, p = 0.000) (shown in Fig. 13). The intra-class reliability of CSA for VL and BB was excellent (ICC = 0.998 for VL, ICC = 0.99 for BB).

Fig. 13.

Forest plot of intra-class reliability of rectus femoris cross-section area on ultrasound. * The Bemben 2002-1 means that the subjects were 38 postmenopausal women, and the Bemben 2002-2 means that the subjects were 85 older men and women in this article.

/WebMaterial/ShowPic/1446868The Inter-Class Reliability of US to Evaluate Muscle Mass in Older Adults

Regarding inter-rater reliability, four studies referred to the thickness of the QF. In the three studies involving ICC of QF, the meta-analyses of MT showed that the inter-class reliability was excellent (ICC = 0.97, 95% CI: 0.92–0.99), with statistically significant heterogeneity (I2 = 79.1%, p = 0.008) (shown in Fig. 14). Another study evaluated the reliability with the κ value, and the result also indicated high reliability (κ = 0.936) [68].

Fig. 14.

Forest plot of inter-class reliability of quadriceps femoris thickness on ultrasound.

/WebMaterial/ShowPic/1446866

Two studies referred to CSA of RF, the meta-analyses showed that the inter-class reliability was excellent (ICC = 0.97, 95% CI: 0.52–1.0), with statistically significant heterogeneity (I2 = 97.2%, p = 0.000) (shown in Fig. 15). One study referred to MT of RF and VI, the results showed excellent inter-class reliability for MT (ICC = 0.93 for RF 95% CI: 0.85–0.97, ICC = 0.92 for VI, 95% CI: 0.84–0.97, respectively) [58]. A study involving inter-transducer reliability showed excellent reliability (ICC = 0.982) [34].

Fig. 15.

Forest plot of inter-class reliability of rectus femoris cross-section area on ultrasound. * The Hammond 2014-1 means the inter-class reliability between novice operator and experienced operator 1, and the Hammond 2014-2 means the inter-class reliability between experienced operator 1 and experienced operator 2.

/WebMaterial/ShowPic/1446864Discussion/Conclusion

As an affordable and feasible technique, US can be used in the community and different clinical settings, such as outpatient, inpatient, and bedside [74]. In our review, 55% of the studies were conducted in the community or laboratory and 40% in clinical practice. More research on the evaluation of muscle mass using ultrasound in sarcopenia has emerged. This review aimed to update the evidence regarding the validity and reliability of ultrasound to assess muscle mass in sarcopenia in older adults, to better understand the current state, and find out the future research directions.

Our results found that the diagnostic value of US for low muscle mass or sarcopenia was moderate with RF or GM thickness on US and low with QF or VI thickness. The ultrasound-derived prediction equation with QF thickness had a moderate diagnostic performance for sarcopenia. The CSA of the RF had a moderate diagnostic performance for low muscle mass. The ultrasound-derived prediction equation based on the combination of multiple limb muscle CSA adjusted for sex, body weight, and height had a very high predictive power for sarcopenia. When the comparison was DXA or BIA, there was a low-to-moderate correlation between muscle mass and MT or CSA on ultrasound. When the comparison was CT or MRI, or when the muscle mass from DXA or BIA was compared with ultrasound-predicted muscle mass, the correlation was high to very high. The meta-analyses showed that the inter-and intra-class reliability was good. Ultrasound is a reliable and valid imaging method that quantitatively assesses muscle mass in sarcopenia in older people. In the included studies, three studies compared the association of muscle mass and ultrasound parameters between bilateral sides, and the results were controversial. Therefore, further studies are required.

US is a potentially promising method to diagnose sarcopenia. The clinical use is limited by the lack of a standardized protocol and validated cutoff points. Therefore, multicenter studies with large samples are needed. Moreover, new ultrasonic technology, such as elastosonography and artificial intelligence, has been widely used in the diagnosis of diseases. Elastosonography can estimate muscle stiffness, predicting increased fibrosis or intramuscular adipose tissue in sarcopenia. Artificial intelligence can detect changes in muscle architecture and composition and might be helpful for the standardized measurement of muscle mass. Therefore, we believed new ultrasonic technology might be the future direction in the diagnosis of sarcopenia.

Sarcopenia is a relatively new term, and its interpretation is in progress. The diagnostic methods varied in different studies, and the most common gold standards were BIA and DXA. CT, MRI, or mid-arm muscle circumference was used in a few studies. In this review, the most commonly used parameter was MT. The most commonly used muscles were the lower appendicular skeletal muscles, especially the quadriceps femoris. The reasons may be that they are convenient to expose, accessible, and save time, which is conducive to some occasions such as community screenings. More importantly, as the main weight-bearing muscles, lower appendicular skeletal muscles are more accurate and reliable indicators of low muscle mass or sarcopenia. There were many diagnostic criteria for sarcopenia, and the most commonly used diagnostic criteria were AWGS and EWGSOP in the review.

As far as we know, one systematic review has focused on the reliability and validity of US to evaluate muscles in older adults [26]. However, that review did not focus on sarcopenia, and the validity of equations derived from ultrasound to predict muscle mass in older populations could not be conformed restricted to the number of studies. Our study included six relevant studies and supported the validity of the prediction equations derived from ultrasound. Most of the studies in the review had high methodological quality scores. We found that the issues in included studies were mainly related to the blind final assessor, inception cohort, and timeframe. The application of ultrasound should be standardized further.

A limitation of our review was the high heterogeneity in different studies, such as the definition of sarcopenia, reference methods, muscle evaluation, ultrasound modality, statistical analysis, etc. Although we performed the meta-analysis, subgroup analysis could not be conducted restricted to the number of studies on some occasions, even if statistical heterogeneity existed. Second, we focused on the muscle mass without muscle quality. The quantity and quality of the muscles are involved in sarcopenia; therefore, studies on muscle quality need to be conducted in the future. Third, the muscle mass showed a significant gender difference; experts from Asian and European countries had different opinions on the definition and evaluation of sarcopenia, and studies from different regions may be inconsistent; considering different conditions in the community and clinical practice, it remains a question whether there was a difference between the results from community and clinical practice. We did not perform subgroup analysis on these issues owing to insufficient number of studies in some subgroups. Further analyses based on different genders, regions, and settings are also needed.

In conclusion, ultrasonography is a reliable and valid diagnostic method for the quantitative assessment of appendicular muscle mass in sarcopenia in older people. The thickness and CSA of RF or GM seem to be proper ultrasound parameters to predict muscle mass in the sarcopenia. Multicenter studies with large samples and the application of new ultrasonic techniques will be the future research directions.

Acknowledgments

We thank Professor Tao Xu for his support in the statistical analysis. We would also like to thank Editage (www.editage.cn) for English language editing.

Statement of Ethics

An ethics statement is not applicable because this study is based exclusively on published literature.

Conflict of Interest Statement

The authors have no conflicts of interest to declare.

Funding Sources

The study was funded by the National Natural Science Foundation of China (61971447), the Beijing Natural Science Foundation (JQ18023), and CAMS Innovation Fund for Medical Sciences (2020-I2M-C&T-B-035).

Author Contributions

Research design: Meng Yang; data collection: Ruina Zhao, Xuelan Li, and Na Su; data analysis: Ruina Zhao and Xuelan Li; manuscript drafting: Ruina Zhao and Xuelan Li; consultation (including review of manuscript before submitting): Yuxin Jiang, Jianchu Li, Lin Kang, and Yewei Zhang.

Data Availability Statement

All data generated or analyzed during this study are included in this article and its supplementary material files. Further inquiries can be directed to the corresponding author.

This article is licensed under the Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC). Usage and distribution for commercial purposes requires written permission. Drug Dosage: The authors and the publisher have exerted every effort to ensure that drug selection and dosage set forth in this text are in accord with current recommendations and practice at the time of publication. However, in view of ongoing research, changes in government regulations, and the constant flow of information relating to drug therapy and drug reactions, the reader is urged to check the package insert for each drug for any changes in indications and dosage and for added warnings and precautions. This is particularly important when the recommended agent is a new and/or infrequently employed drug. Disclaimer: The statements, opinions and data contained in this publication are solely those of the individual authors and contributors and not of the publishers and the editor(s). The appearance of advertisements or/and product references in the publication is not a warranty, endorsement, or approval of the products or services advertised or of their effectiveness, quality or safety. The publisher and the editor(s) disclaim responsibility for any injury to persons or property resulting from any ideas, methods, instructions or products referred to in the content or advertisements.

留言 (0)

沒有登入
gif