The Applications of Artificial Intelligence for Assessing Fall Risk: Systematic Review


IntroductionBackground

According to alarming figures reported by the World Health Organization in 2021, falls cause 37.3 million injuries annually that require medical attention and result in 684,000 deaths []. These figures indicate a significant impact of falls on the health care system and on society, both directly and indirectly [,].

Life expectancy has progressively increased over the years, leading to an aging population []. By 2050, it is estimated that 16% of the population will be >65 years of age. In this group, the incidence of falls has steadily risen, becoming the leading cause of accidental injury and death (accounting for 55.8% of such deaths, according to some research) [,]. It is estimated that 30% of this population falls at least once a year, negatively impacting their physical and psychological well-being [,].

Physically, falls are often associated with severe complications that can lead to extended hospitalizations []. These hospitalizations are usually due to serious injuries, often cranioencephalic trauma, fractures, or soft tissue injuries [,]. Psychologically, falls among the older adult population tend to result in self-imposed limitations due to the fear of falling again [,]. These limitations lead to social isolation as individuals avoid participating in activities or even individual mobility []. Consequently, falls can lead to psychological conditions such as anxiety and depression [,]. Numerous research studies on the risk of falls are currently underway, with ongoing investigations into various innovations and intervention ideas [-]. These studies encompass the identification of fall risk factors [,], strategies for prevention [,], and the outcomes following rehabilitation [,].

In the health care field, artificial intelligence (AI) is characterized by data management and processing, offering new possibilities to the health care paradigm []. Some applications of AI in the health care domain include assessing tumor interaction processes [], serving as a tool for image-based diagnostics [,], participating in virus detection [], and, most importantly, as a statistical and predictive method [-].

Several publications have combined AI techniques to address health care issues [-]. Within the field of predictive models, it is important to understand certain differentiations. In AI, we have machine learning and deep learning [-]. Machine learning encompasses a set of techniques applied to data and can be done in a supervised or unsupervised manner [,]. On the other hand, deep learning is typically used to work with larger data sets compared to machine learning, and its computational cost is higher [,].

Some examples of AI techniques include the gradient boosting machine [], learning method, and the long short-term memory (LSTM) [] and the convolutional neural network (CNN) [], all of them are deep learning methods.

Objectives

For all the reasons mentioned in the preceding section, it was considered necessary to conduct a systematic review to analyze the scientific evidence of AI applications in the analysis of data related to postural control and the risk of falls.


MethodsData Sources and Searches

This systematic review and meta-analysis were prospectively registered on PROSPERO (ID CRD42023443277) and followed the Meta-Analyses of Observational Studies in Epidemiology checklist [] and the recommendations of the Cochrane Collaboration [].

The search was conducted in January 2024 on the following databases: PubMed, Scopus, ScienceDirect, Web of Science, CINAHL, and Cochrane Library. The Medical Subject Headings (MeSH) terms used for the search included machine learning, artificial intelligent, accidental falls, rehabilitation, and physical therapy specialty. The terms “predictive model” and “algorithms” were also used. These terms were combined using the Boolean operators AND and OR ().

Textbox 1. Search strategy according to the focused question.

PubMed

(“machine learning”[MeSH Terms] OR “artificial intelligent”[MeSH Terms]) AND “accidental falls”[MeSH Terms](“machine learning”[MeSH Terms] OR “artificial intelligent”) AND (“rehabilitation”[MeSH Terms] OR “physical therapy specialty”[MeSH Terms])“accidental falls” [Title/Abstract] AND “algorithms” [Title/Abstract]“accidental falls”[Title/Abstract] AND “predictive model” [Title/Abstract]

Scopus

TITLE-ABS-KEY (“machine learning” OR “artificial intelligent”) AND TITLE-ABS-KEY (“accidental falls”)TITLE-ABS-KEY (“machine learning” OR “artificial intelligent”) AND TITLE-ABS-KEY (“rehabilitation” OR “physical therapy specialty”)TITLE-ABS-KEY (“accidental falls” AND “algorithms”)TITLE-ABS-KEY (“accidental falls” AND “predictive model”)

ScienceDirect

Title, abstract, keywords: (“machine learning” OR “artificial intelligent”) AND “accidental falls”Title, abstract, keywords: (“machine learning” OR “artificial intelligent”) AND (“rehabilitation” OR “physical therapy specialty”)Title, abstract, keywords: (“accidental falls” AND “algorithms”)Title, abstract, keywords: (“accidental falls” AND “predictive model”)

Web of Science

TS=(“machine learning” OR “artificial intelligent”) AND TS=“accidental falls”TS=(“machine learning” OR “artificial intelligent”) AND TS= (“rehabilitation” OR “physical therapy specialty”)AB= (“accidental falls” AND “algorithms”)AB= (“accidental falls” AND “predictive model”)

CINAHL

(MH “machine learning” OR MH “artificial intelligent”) AND MH “accidental falls”(MH “machine learning” OR MH “artificial intelligent”) AND (MH “rehabilitation” OR MH “physical therapy specialty”)(AB “accidental falls”) AND (AB “algorithms”)(AB “accidental falls”) AND (AB “predictive model”)

Cochrane Library

(“machine learning” OR “artificial intelligent”) in Title Abstract Keyword AND “accidental falls” in Title Abstract Keyword(“machine learning” OR “artificial intelligent”) in Title Abstract Keyword AND (“rehabilitation” OR “physical therapy specialty”) in Title Abstract Keyword“accidental falls” in Title Abstract Keyword AND “algorithms” in Title Abstract Keyword“accidental falls” in Title Abstract Keyword AND “predictive model” in Title Abstract KeywordStudy Selection

After removing duplicates, 2 reviewers (AGC and RLR) independently screened articles for eligibility. In the case of disagreement, a third reviewer (JABA) finally decided whether the study should be included or not. We calculated the κ coefficient and percentage agreement scores to assess reliability before any consensus and estimated the interrater reliability using κ. Interrater reliability was estimated using κ>0.7 indicating a high level of agreement between the reviewers, κ of 0.5 to 0.7 indicating a moderate level of agreement, and κ<0.5 indicating a low level of agreement [].

For the selection of results, the inclusion criteria were established as follows: (1) articles should have been published in the last 5 years (from 2018 to the present); (2) they must apply some AI method; (3) AI analyses should be applied to data from samples of humans; and (4) the sample analyzed should consist of people with independent walking, with or without the use of external orthopedic devices.

After screening the data, extracting, obtaining, and screening the titles and abstracts for inclusion criteria, the selected abstracts were obtained in full texts. Titles and abstracts lacking sufficient information regarding inclusion criteria were also obtained as full texts. Full-text articles were selected in case of compliance with inclusion criteria by the 2 reviewers using a data extraction form.

Data Extraction and Quality Assessment

The 2 reviewers mentioned independently extracting data from the included studies using a customized data extraction table in Excel (Microsoft Corporation). In case of disagreement, both reviewers debated until an agreement was reached.

The data extracted from the included articles for further analysis were: demographic information (title, authors, journal, and year), characteristics of the sample (age, inclusion and exclusion criteria, and number of participants), study-specific parameters (study type, AI techniques applied, and data analyzed), and the results obtained. Tables were used to describe both the studies’ characteristics and the extracted data.

Assessment of Risk of Bias

The methodological quality of the selected articles was evaluated using the Critical Review Form for Quantitative Studies []. The ROBINS-E (Risk of Bias in Nonrandomized Studies of Exposures) tool was used to evaluate the risk of bias [].


ResultsCharacteristics of the Selected Studies

A total of 3858 articles were initially retrieved, with 1563 duplicates removed. From the remaining 2295 articles, 2271 were excluded based on the initial selection criteria, leaving 24 articles for the subsequent analysis. In this second analysis, 2 articles were removed as they were systematic reviews, and 22 articles were finally selected [-] (). After the first reading of all candidate full texts, the kappa score for inclusion of the results of reviewers 1 and 2 was 0.98, indicating a very high level of agreement.

The methodological quality of the 22 analyzed studies (Table S1 in [,,,,,,,,, ,,]) ranged from 11 points in 2 (9.1%) studies [,] to 16 points in 7 (32%) studies [,,,,-].

Figure 1. Identification of studies via databases and registers. Study Characteristics and Risk of Bias

All the selected articles were cross-sectional observational studies ().

In total, 34 characteristics affecting the risk of falls were extracted and classified into high fall-risk and low fall-risk groups with the largest sample sizes significantly differing from the rest. Studies based on data collected from various health care systems had larger sample sizes, ranging from 22,515 to 265,225 participants [,,]. In contrast, studies that applied some form of evaluation test had sample sizes ranging from 8 participants [] to 746 participants [].

It is worth noting the various studies conducted by Dubois et al [,], whose publications on fall risk and machine learning started in 2018 and progressed until 2021. A total of 9.1% (2/22) of the articles by this author were included in the final selection [,]. Both articles used samples with the same characteristics, even though the first one was composed of 43 participants [] and the last one had 30 participants []. All 86.4% (19/22) of the articles used samples of individuals aged ≥65 years [-,-,-]. In the remaining 13.6% (3/22) of the articles, the ages ranged between 16 and 62 years [,,].

Althobaiti et al [] used a sample of participants between the ages of 19 and 35 years for their research, where these participants had to reproduce examples of falls for subsequent analysis. In 2022, Ladios-Martin et al [] extracted medical data from participants aged >16 years for their research. Finally, in 2023, the study by Maray et al [] used 3 types of samples, with ages ranging from 21 to 62 years. Among the 22 selected articles, only 1 (4.5%) of them did not describe the characteristics of its sample [].

Finally, regarding the sex of the samples, 13.6% (3/22) of the articles specified in the characteristics of their samples that only female individuals were included among their participants [,,].

Table 1. Methodological characteristics of results analyzed.StudyAge range (mean; years)Assessment proceduresAIa techniquesResultsNait Aicha et al [], (2018)65-99 (75.3)Over the following 6 months, they had monthly consultations to monitor the incidence of falls.Classification of participants into fall and nonfall categories and creation of an MLb model that predicts falls from the accelerometer data.A total of 34.1% of the participants experienced at least one fall. ML applied to the accelerometric data showed a predictive model of comparable accuracy to conventional fall risk assessment models.Althobaiti et al [], (2020)19-34 (ndc)Participants performed 6 ADLsd and 9 falls on a foam landing mat. Accelerometer data were transmitted via Bluetooth and recorded continuously.After extracting the signals from the accelerometers, the data were used to train ML models.The ML models created can predict and detect falls with high accuracy. The supervised classification experiments achieved an F1-score of 98.4% for distinguishing between falls and nonfalls, and an F1-score of 88.1% for distinguishing between ADLs.Dubois et al [], (2018)61-93 (83)The TUGe test was administered 3 times, with 3-5 minutes of rest between attempts. The best result was used to categorize participants into low and high risk of falling.The images were analyzed with a processing algorithm, and ML techniques were applied to predict the risk of falls.All participants were classified using a combination of walking speed, stride length, and sitting speed. The algorithm used was used to correctly classify them according to their risk of falling and to assess their progress.Dubois et al [], (2021)61-93 (83.3)A first assessment, including the Tinetti and TUG tests, was conducted by a physiotherapist to identify patients who are “high risk.” Daily activity was monitored for 8 hours.Different parameters, such as walking speed, sitting speed, and total sitting time, were analyzed. The data were analyzed by combining statistics and ML algorithms.The results obtained from the algorithms were compared with those obtained by physiotherapists using the Tinetti and TUG tests. Step length and sit-to-stand time could complement the assessment tools and improve the risk of falls.Eichler et al [], (2022)65-nd (nd)BBSf (consisting of 14 tasks). A total of 2 physiotherapists independently scored each participant on the 14 tasks. The number of participants varied across tasks due to differences in performance abilities.ML to predict the risk of falling by categorizing it as “high,” “medium,” or “low.” Initially, the movements of the participants were tracked using deep learning applied to the camera while they performed the BBS. Then, an algorithm predicted the BBS score, leading to the final risk classification.The results obtained through deep learning were compared with the scores given by the physiotherapists, and the overall accuracy was 75.5% correct. The ML algorithm reduced the false-negative rate by adjusting the thresholds. The most predictive features were: 360° turn, alternating feet on the step, chair transfer, and reaching forward with the arm extended.Ferrete et al [], (2019)ndParticipants were instructed to act out falls.A total of 2 deep learning approaches were explored: the classifier based on ASMg and CNNh.The CNN demonstrated 92.7% accuracy in differentiating normal gait, prefall condition, and fall situation, considering the prefall step, and 100% when not considering the prefall step.Gillain et al [], (2019)65-89 (71.3)Participants underwent a physical and functional assessment, which included the evaluation of gait speed, stride length, frequency, symmetry, regularity, and minimum toe separation. These variables were recorded under conditions of comfortable walking, fast walking, and dual-task walking.A supervised ML algorithm (J48) was applied to the recorded data.A classification tree was obtained that correctly identifies 80% of future falls based on gait parameters, gender, leg length, and age. Accuracy was 84%, sensitivity was 80%, specificity was 87%, positive predictive value was 78%, and negative predictive value was 88%.Greene et al [], (2021)nd (74.7)The test lasted 30 seconds and was conducted with both eyes open and closed. A comprehensive geriatric assessment and the BBS were used to obtain risk factors for each participant. Data were collected using the Kinesis Balance FPi application, and a questionnaire on the socio-demographic data of the participants was administered through the app.The ML algorithm assessed balance and the risk of falling based on previously collected data.The use of ML models for balance and fall risk assessment could improve the approach to falls, including prescribing personalized prevention programs and monitoring progress. This would improve the remote care system with more targeted and digitized initiatives.Hauth et al [], (2021)69-82 (nd)IMUsj measured linear and angular accelerations in the mediolateral, anteroposterior, and longitudinal axes. Participants wore a wrist-mounted voice recorder to indicate when the event occurred and to provide context descriptions.Customized algorithms were used to integrate the data from the IMUs in 6 dimensions to obtain the position, orientation, and velocity of the foot. Model performance was evaluated in terms of AUROCk and AUPRl.Algorithms for data integration could be used retrospectively to track the occurrence of falls or loss of balance events. The model showed discriminatory ability to differentiate a fall from a normal walking event. The best model achieved an AUROC ≥0.87 for nonparticipants.Hsu et al [], (2020)ndAll participants were assessed for fall risk using the MFSm upon admission.The XGBn predictive model was applied, and its performance was compared with that of the MFS.The XGB model is more sensitive than the MFS, and therefore it is more appropriate for assessing the risk of falls.Hu et al [], (2020)ndParticipants underwent treadmill walking and MCTo with small, medium, and large perturbations. They were classified into a high-balance group and a low-balance group based on the MCT results.ML analysis: 5-fold cross-validation where the algorithms were repeatedly randomly trained. Using the GBMp algorithm, participants were classified into high and low equilibrium through analysis of the accelerometric data.The feasibility of ML was demonstrated by matching accelerometric data and predicting balance dysfunctions. This method was comparable to the data obtained from instrumented balance tests and was also less expensive.Kim et al [], (2019)ndAll participants simulated various types of falls, resulting in a total of 282 scenes constituting the database. Falls, including forward, backward, and sideways motions, were replicated. Additionally, images without falls depicted actions such as sitting on a chair or lying on a bed.The application of ML: The RFq system was used as a data management and predictive tool. A total of 200 decision trees were used, with 25 trees deep for each tree.Improved detection of false fall data. The model combining accelerometric data and vision data gave promising results. The application of ML achieved good generalization of the obtained data. Accuracy of the proposed method was 90%.Ladios-Martin et al [], (2022)16-106 (nd)In total, 91 variables related to the risk of falls were selected. A total of 2 models were created: model A included the variable FP, while model B did not.The performance of 10 algorithms was evaluated. Once the final models were created, they were compared with each other by evaluating their performances.The performance of the model that included the FP variable performed better than that of the one that did not include it. Using AI techniques and the FP variable, a high-performance model is obtained that can be exported to other contexts.Lo et al [], (2019)65-nd (nd)Researchers established which patients were at risk of falls based on existing medical record data.ML techniques (RF algorithm) were used to build a predictive model of fall risk.The risk of falls is affected by a large number of factors (age, clinical diagnoses, daily habits, environment, and hygiene). ML makes it possible to predict the risk of falls and rank the importance of each risk factor.Lockhart et al [], (2021)56-9 (74.3)Participants conducted 10-meter walks. Out of 127 participants, 79.5%) did not experience falls, while 20.5%) did.A total of 3 types of deep learning techniques were applied, and a predictive model was created with the data from the 127 participants (RF, RF with feature engineering, and RF with feature engineering and linear and nonlinear variables). The accuracy of the model was tested with the remaining 34.6%) participants with a 6-month follow-up of their fall history.The IMU was a good tool for the identification of falls risk. Data analysis through AI obtained very good predictive results.Maray et al [], (2023)21-65 (nd)All participants performed fall representations.Deep learning was used to match the data extracted from devices.A deep learning model was built using data obtained from different handheld devices that gave better results than the initial creation of a new model.Noh et al [], (2021)63-89 (73)In total, 3 walking tests were conducted on a 20-meter straight walkway, involving walking at slower, faster, or the participant\'s preferred speed. Participants were categorized into low fall-risk and high fall-risk groups based on their fall history and fear of falling.The ML XGB algorithm was applied to perform the predictive analysis. The predictive analysis sought to predict factors affecting the risk of falls. The parameters were spatiotemporal parameters extracted from the gait tests.In total, 34 characteristics affecting the risk of falls were extracted and classified into high fall-risk and low fall-risk groups. The study demonstrated >70% accuracy. Stride length, speed, and posture accurately classified the risk of falling.Qiu et al [], (2018)ndParticipants underwent 7 tests: Sensory Integration Test, Limits of Stability Test, Sit-to-Stand test (5 times), TUG test, motor function test, Choice Reaction test, and Fear of Falling test.A total of 38 algorithms were developed to extract meaningful measurements from the data extracted from the sensors. In total, 38 meaningful measures were obtained and incorporated into a supervised ML model. A prediction of “fallers” and “nonfallers” was made based on this data. The accuracy of the prediction was assessed through a 10-fold cross-validation.The data obtained distinguished between people who fall and those who do not, with overall accuracy of 89.4% (92% sensitivity and 84.9% specificity). The model created from the inertial sensors in the test battery and the subsequent analysis of the data through AI are promising.Roshdibenam et al [], (2021)65-95 (74.4)A physiotherapist conducted various assessments, including the TUG test, 30-Second Stand Test, and 4-Stage Balance tests, and classified participants into “high” and “low” fall risk categories.The CNN algorithm was used for the data for each individual sensor location. Algorithms were applied to evaluate the data obtained by the portable sensors.Deep learning had high similarities to those obtained by the physiotherapist. Deep learning techniques combined with nonintrusive wearable sensors could provide a new tool for measuring risk factors and predicting fall risk.Tang et al [], (2022)nd (66)Participants were divided into 2 groups: “Normal walking” and “Abnormal gait.” The “abnormal” group had a higher average age and a greater proportion of men compared to the “normal” group. The TUG test videos of all participants were observed.ML with the AlphaPose algorithm and the naïve Bayes classifier to identify normal or abnormal gait.The naïve Bayes classifier obtained a complete data accuracy of 90.1% and a LOOCVr accuracy of 89.1% for the detection of abnormal gait performance.Tunca et al [], (2020)nd (76.8)Participants were divided into 2 groups: “high fall risk” and “low fall risk.” A total of 2 intervention groups were created: the validation set (30 low risk+30 high risk) and the test set (9 low risk+7 high risk). Participants were instructed to walk 8 m in a straight line at least 3 times.The applicability of deep learning LSTMs networks were compared with traditional ML with respect to fall risk.The LSTM achieved classification accuracy of 89% compared to accuracy of 82.8% (the highest accuracy achieved by the traditional ML models used).Ye et al [], (2020)65-nd (nd)A record of falls was compiled from April 2017 to March 2018. A total of 4361 falls were recorded.The XGB algorithm was used to create the crash prediction model.The fall prediction model achieved improved discriminatory ability. It may be able to identify risk factors and facilitate prevention interventions.

aAI: artificial intelligence.

bML: machine learning.

cnd: none described.

dADL: activities of daily living.

eTUG: Timed Up and Go.

fBBS: Berg Balance Scale.

gASM: associative skill memories.

hCNN: convolutional neural network.

iFP: fall prevention.

jIMU: inertial measurement unit.

kAUROC: area under the receiver operating characteristic curve.

lAUPR: area under the precision-recall curve.

mMFS: Morse Fall Scale.

nXGB: extreme gradient boosting.

oMCT: motor control test.

pGBM: gradient boosting machine.

qRF: random forest.

rLOOCV: leave-one-out cross-validation.

sLSTM: long short-term memory.

Applied Assessment Procedures

All articles initially analyzed the characteristics of their samples to subsequently create a predictive model of the risk of falls. However, they did not all follow the same evaluation process.

Regarding the applied assessment procedures, 3 main options stood out: studies with tests or assessments accompanied by sensors or accelerometers [-,,-,,-], studies with tests or assessments accompanied by cameras [,], or studies based on medical records [,,,] (). Guillan et al [] performed a physical and functional evaluation of the participants. In their study, they evaluated parameters such as walking speed, stride frequency and length, and the minimum space between the toes. Afterward, they asked them to record the fall events they had during the past 2 years in a personal diary.

Figure 2. Infographic about the characteristics of the analyzed research.

In total, 22.7% (5/22) of the studies used the Timed Up and Go test [,,,,]. In 18.2% (4/22) of them, the participants performed the test while wearing a sensor to collect data [,,,]. In 1 (4.5%) study, the test was recorded with a camera for later analysis []. Another commonly used method in studies was to ask participants to perform everyday tasks or activities of daily living while a sensor collected data for subsequent analysis. Specifically, 18.2% (4/22) of the studies used this method to gather data [,,,].

A total of 22.7% (5/22) of the studies asked participants to simulate falls and nonfalls while a sensor collected data [,-,]. In this way, the data obtained were used to create the predictive model of falls. As for the tests used, Eichler et al [] asked participants to perform the Berg Balance Scale while a camera recorded their performance.

Finally, other authors created their own battery of tests for data extraction [,,,]. Gillain et al [] used gait records to analyze speed, stride length, frequency, symmetry, regularity, and foot separation. Hu et al [] asked their participants to perform normal walking, the postural reflexive response test, and the motor control test. In the study by Noh et al [], gait tests were conducted, involving walking 20 m at different speeds. Finally, Greene et al [] created a 12-question questionnaire and asked their participants to maintain balance while holding a mobile phone in their hand.

AI Techniques

The selected articles used various techniques within AI. They all had the same objective in applying these techniques, which was to achieve a predictive and classification model for the risk of falls [-].

In chronological order, in 2018, Nait Aicha et al [] compared single-task learning models with multitask learning, obtaining better evaluation results through multitask learning. In the same year, Dubois et al [] applied AI techniques that analyzed multiple parameters to classify the risk of falls in their sample. Qiu et al [], also in the same year, used 6 machine learning models (logistic regression, naïve Bayes, decision tree, RF, boosted tree, and support vector machine) in their research.

In contrast, in 2019, Ferrete et al [] compared the applicability of 2 different deep learning models: the classifier based on associative skill memories and a CNN classifier. In the same year, after confirming the applicability of AI as a predictive method for the risk of falls, various authors investigated through methods such as the RF to identify factors that can predict and quantify the risk of falls [,].

Among the selected articles, 5 (22.7%) were published in 2020 [-]. The research conducted by Tunca et al [] compared the applicability of deep learning LSTM networks with traditional machine learning applied to the risk of falls. Hu et al [] first used cross-validation, where algorithms were trained randomly, and then used the gradient boosting machine algorithm to classify participants as high or low risk. Ye et al [] and Hsu et al [] both used the extreme gradient boosting (XGBoost) algorithm based on machine learning to create their predictive model. In the same year, Althobaiti et al [] trained machine learning models for their research.

In 2021, Lockhart et al [] started using 3 deep learning techniques simultaneously with the same goal as before: to create a predictive model for the risk of falls. Specifically, they used the RF, RF with feature engineering, and RF with feature engineering and linear and nonlinear variables. Noh et al [], again in the same year, used the XGBoost algorithm, while Roshdibenam et al [], on the other hand, used the CNN algorithm for each location of the wearable sensors used in their research. Various machine learning techniques were used for classifying the risk of falls and for balance loss events in the research by Hauth et al []. Dubois et al [] used the following algorithms: decision tree, adaptive boosting, neural net, naïve Bayes, k-nearest neighbors, linear support vector machine, radial basis function support vector machine, RF, and quadratic discriminant analysis. Hauth et al [], on the other hand, used regularized logistic regression and bidirectional LSTM networks. In the research conducted by Greene et al [], AI was used, but the specific procedure that they followed is not described.

Tang et al [] published their research with innovation up to that point. In their study, they used a smart gait analyzer with the help of deep learning techniques to assess the diagnostic accuracy of fall risk through vision. Months later, in August 2022, Ladios-Martin et al [] published their research, in which they compared 2 deep learning models to achieve the best results in terms of specificity and sensitivity in detecting fall risk. The first model used the Bayesian Point Machine algorithm with a fall prevention variable, and the second one did not use the variable. They obtained better results when using that variable, a mitigating factor defined as a set of care interventions carried out by professionals to prevent the patient from experiencing a fall during hospitalization. Particularly controversial, as its exclusion could obscure the model’s performance. Eichler et al [], on the other hand, used machine learning–based classifier training and later tested the performance of RFs in score predictions.

Finally, in January 2023, Maray et al [] published their research, linking the previously mentioned terms (AI and fall risk) with 3 wearable devices that are commonly used today. They collected data through these devices and applied transfer learning to generalize the model across heterogeneous devices.

Findings

The results of the 22 articles provided promising data, and all of them agreed on the feasibility of applying various AI techniques as a method for predicting and classifying the risk of falls. Specifically, the accuracy values obtained in the studies exceed 70%. Noh et al [] achieved the “lowest” accuracy among the studies conducted, with a 70% accuracy rate. Ribeiro et al [] obtained an accuracy of 92.7% when using CNN to differentiate between normal gait and fall events. Hsu et al [] further demonstrated that the XGBoost model is more sensitive than the Morse Fall Scale. Similarly, in their comparative study, Nait Aicha et al [] also showed that a predictive model created from accelerometer data with AI is comparable to conventional models for assessing the risk of falls. More specifically, Dubois et al [] concluded that using 1 gait-related parameter (excluding velocity) in combination with another parameter related to seated position allowed for the correct classification of individuals according to their risk of falls.


DiscussionPrincipal Findings

The aim of this research was to analyze the scientific evidence regarding the applications of AI in the analysis of data related to postural control and the risk of falls. On the basis of the analysis of results, it can be asserted that the following risk factors were identified in the analyzed studies: age [], daily habits [], clinical diagnoses [], environmental and hygiene factors [], sex [], stride length [,], gait speed [], and posture []. This aligns with other research that also identifies sex [,], age [], and gait speed [].

On the other hand, the “fear of falling” has been identified in various studies as a risk factor and a predictor of falls [,], but it was not identified in any of the studies included in this review.

As for the characteristics of the analyzed samples, only 9.1% (2/22) of the articles used a sample composed exclusively of women [,], and no article used a sample composed exclusively of men. This fact is incongruent with reality, as women have a longer life expectancy than men, and therefore, the number of women aged >65 years is greater than the number of men of the same age []. Furthermore, women experience more falls than men []. The connection between menopause and its consequences, including osteopenia, suggests a higher risk of falls among older women than among men of the same age [,].

Within the realm of analysis tools, the most frequently used devices to analyze participants were accelerometers [-,,-,,-]. However, only 36.4% (8/22) of the studies provided all the information regarding the characteristics of these devices [,,,,,,,]. On the other hand, 18.2% (4/22) of the studies used the term “inertial measurement unit” as the sole description of the devices used [-,].

The fact that most of the analyzed procedures involved the use of inertial sensors reflects the current widespread use of these devices for postural control analysis. These sensors, in general (and triaxial accelerometers in particular), have demonstrated great diagnostic capacity for balance []. In addition, they exhibit good sensitivity and reliability, combined with their portability and low economic cost []. Another advantage of triaxial accelerometers is their versatility in both adult and pediatric populations [-], although the studies included in this review did not address the pediatric population.

The remaining studies extracted data from cameras [,], medical records [,,,], and other functional and clinical tests [,,]. Regarding the AI techniques used, out of the 18.2% (4/22) of articles that used deep learning techniques [,,,], only 4.5% (1/22) did not provide a description of the sample characteristics []. In this case, the authors focused on the AI landscape, while the rest of the articles strike a balance between AI and health sciences.

Regarding the validity of the generated models, only 40.9% (9/22) of the articles assessed this characteristic [,,,-,,]. The authors of these 9 (N=22, 40.9%) articles evaluated the validity of the models through accuracy. All the results obtained reflected accuracies exceeding 70%, with Ribeiro et al [] achieving a notable accuracy of 92.7% and 100%. Specifically, they obtained a 92.7% accuracy through the CNN model for distinguishing normal gait, the prefall condition, and the falling situation, considering the step before the fall, and 100% when not considering it [].

The positive results of sensitivity and specificity can only be compared between the studies of Qiu et al [] and Gillain et al [], as they were the only ones to take them into account, and in both investigations, they were very high. Similarly, in the case of the F1-score, only Althobaiti et al [] examined this validity measure. This measure is the result of combining precision and recall into a single figure, and the outcome obtained by these researchers was promising.

Despite these differences, the 22 studies obtained promising results in the health care field [-]. Specifically, their outcomes highlight the potential of AI integration into clinical settings. However, further research is necessary to explore how health care professionals can effectively use these predictive models. Consequently, future research should focus on studying the application and integration of the already-developed models. In this context, fall prevention plans could be implemented for the target populations identified by the predictive models. This approach would allow for a retrospective analysis to determine if the combination of predictive models with prevention programs effectively reduces the prevalence of falls in the population.

Limitations

Regarding limitations, the articles showed significant variation in the sample sizes selected. Moreover, even in the study with the largest sample size (with 265,225 participants []), the amount of data analyzed was relatively small. In addition, several of the databases used were not generated specifically for the published research but rather derived from existing medical records [,,,]. This could explain the significant variability in the variables analyzed across different studies.

Strengths

Despite the limitations, this research has strengths, such as being the first systematic review on the use of AI as a tool to analyze postural control and the risk of falls. Furthermore, a total of 6 databases were used for the literature search, and a comprehensive article selection process was carried out by 3 researchers. Finally, only cross-sectional observational studies were selected, and they shared the same objective.

Conclusions

The use of AI in the analysis of data related to postural control and the risk of falls proves to be a valuable tool for creating predictive models of fall risk. It has been identified that most AI studies analyze accelerometer data from sensors, with triaxial accelerometers being the most frequently used.

For future research, it would be beneficial to provide more detailed descriptions of the measurement procedures and the AI techniques used. In addition, exploring larger databases could lead to the development of more robust models.

None declared.

Edited by A Mavragani; submitted 28.11.23; peer-reviewed by E Andrade, M Behzadifar, A Suárez; comments to author 09.01.24; revised version received 30.01.24; accepted 13.02.24; published 29.04.24.

©Ana González-Castro, Raquel Leirós-Rodríguez, Camino Prada-García, José Alberto Benítez-Andrades. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 29.04.2024.

This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.

留言 (0)

沒有登入
gif