Explainable AI Method for Tinnitus Diagnosis via Neighbor-Augmented Knowledge Graph and Traditional Chinese Medicine: Development and Validation Study


Introduction

Tinnitus is a common refractory disease in the field of otolaryngology, and its diagnosis has always been a cutting-edge research topic in audiology. With changes in the social environment and an accelerated pace of life, an increasing number of patients, particularly among the younger generation, have sought medical assistance for tinnitus as their primary complaint in the last decade. Globally, approximately 14% (95% CI 0.8%-1.6%) of adults are affected by tinnitus [,], which can cause stress, anxiety, and depression []. Distress and hearing impairment brought on by the disease can affect cognitive abilities and lead to suicidal tendencies in severe cases, greatly affecting the work and daily lives of patients [].

The pathogenesis of tinnitus is extremely complex and not fully understood. Currently, no effective objectification methods are available. Traditional Chinese medicine (TCM) classifies tinnitus into 5 different syndrome patterns: wind fire attacking internally (WFAI), liver fire bearing upward (LFBU), phlegm fire stagnation internally (PFSI), Qi deficiency of the spleen and stomach (QDSS), and kidney essence deficiency (KED). The diagnosis of tinnitus remains a challenge in medical science because it is influenced by several complex factors [,], including individual differences among patients and atypical symptom presentations. Clinical diagnosis relies heavily on the personal knowledge and clinical experience of doctors, thereby introducing subjectivity, uncertainty, and ambiguity. Consequently, achieving a high tinnitus diagnostic accuracy becomes difficult. Therefore, tinnitus diagnosis remains an urgent issue requiring further exploration and resolution by medical researchers.

Previous studies have focused on the use of artificial intelligence (AI) to assist doctors in diagnosing tinnitus and improving diagnostic accuracy. Liu et al [] proposed a meta-learning method based on lateral perception for cross–data set tinnitus diagnosis. Sun et al [] used a support vector machine classifier to distinguish between patients with tinnitus and healthy individuals. Shoushtarian et al [] used a naive Bayes algorithm to classify patients with tinnitus and control groups. Sanders et al [] used a spiking neural network model to classify patients with tinnitus into 2 groups based on different classification criteria. Manta et al [] used clinical data and patient features to build a machine learning (ML) model for classifying the degree of tinnitus-related distress in individuals and their ears. Allgaier et al [] used a gradient-boosting engine to classify transient tinnitus. Rodrigo et al [] used a decision tree model to identify variables related to the success of internet-based cognitive behavioral therapy for tinnitus. Liu et al [] used a support vector machine model to explore cortical or subcortical morphological neuroimaging biomarkers that effectively distinguished patients with tinnitus from healthy individuals. Niemann et al [] proposed a LASSO model to predict the severity of depression in patients with tinnitus. Although previous studies have achieved success using their respective data sets, the developed ML- or deep learning–based methods are entirely data-driven modeling approaches that do not make full use of existing medical knowledge. Models built using such methods are equivalent to “black boxes” for doctors, lack interpretability, and are not conducive to clinical promotion and application.

In this study, the aim is to incorporate clinical medical knowledge into a diagnostic model, enabling the integration of knowledge and data for interpretable results. Knowledge graph–based modeling methods offer solutions to such issues by using a novel knowledge representation format that connects entities and concepts in an objective world using semantic relationships. Such methods offer reasoning and interpretability that are highly sought after by both medical practitioners and academia. Li et al [] used a knowledge graph to predict diabetic macular edema, overcoming the limitations of traditional ML and data-mining techniques that deal with missing feature values. Zhou et al [] used 124 medical records to construct a knowledge graph for recommending hypertension medication. Lyu et al [] created a knowledge graph for diabetic nephropathy diagnosis using patient data. Lin et al [] extracted knowledge from medical texts and historical prescription data to construct a medical knowledge graph and accurately detect clinical prescription risks. Recently, knowledge graph applications have expanded to TCM; for instance, Yang et al [] built a knowledge graph to extract medical information from TCM case records. Xie et al [] constructed a knowledge graph using ancient Chinese medical books to infer symptoms and syndromes. Yang et al [] used electronic medical records (EMRs) to build a knowledge graph, transforming TCM diagnostic issues into multilabel classification problems. Lan et al [] integrated knowledge graphs with graph neural networks to introduce graph-based supervised contrastive learning, effectively enabling the classification of TCM texts. However, no previous studies have used knowledge graphs in the complex medical field of tinnitus diagnosis. Therefore, this study focuses on knowledge graph technology to assist doctors in tinnitus diagnosis and improve diagnostic accuracy.

This paper aims to establish a comprehensive knowledge graph in TCM specifically tailored for tinnitus. Leveraging this knowledge graph, we propose a novel method for calculating patient similarity. This method takes into account the weighting of symptom-syndrome type relationships, thereby facilitating the inference of syndrome types in patients with tinnitus according to TCM principles. By implementing this approach, clinicians can increase the accuracy of tinnitus diagnosis within the realm of TCM.

In general, we make several noteworthy contributions as follows:

We propose a method for tinnitus knowledge graph construction based on heterogeneous patient EMRs and TCM clinical knowledge.We introduce weights to measure patient similarity into the tinnitus knowledge graph using a method based on prior probabilities and mutual information values.A collaborative neighbor algorithm that uses patient similarity scores to obtain recommended diagnostic results is proposed to assist doctors in understanding the model-generated conclusions, thereby improving the accuracy of tinnitus diagnosis.
MethodsPatients

For this study, we collected the EMRs of 1267 patients with tinnitus who visited the ear, nose, and throat departments of 11 medical institutions in Shanghai, China, from November 2019 to July 2023. The inclusion criteria included (1) tinnitus as the primary complaint and (2) the ability to communicate normally. The exclusion criteria included (1) objective tinnitus, (2) nonotogenic tinnitus caused by factors such as endocrine and blood disorders, (3) tinnitus caused by head or ear trauma, and (4) difficulties in communication or severe psychiatric history that could hinder follow-up compliance. After screening the data for quality, 1265 cases were included for further analysis.

The clinical EMR data set recorded medical data of real patients including the relationship between patient symptoms and disease, which was crucial for disease diagnosis. The data set contained patient information such as age, sex, inducement, medical history, tinnitus sound, accompanying symptoms, tongue coating, pulse condition, TCM syndrome differentiation, and sleep status. Each patient had a clear diagnosis that could be classified into 1 of 5 categories: WFAI, LFBU, PFSI, QDSS, and KED. Statistical data are presented in - .

Figure 1. Age distribution of different syndromes by sex. KED: kidney essence deficiency; LFBU: liver fire bearing upward; PFSI: phlegm fire stagnation internally; QDSS: Qi deficiency of the spleen and stomach; WFAI: wind fire attacking internally. Figure 2. The tongue body distribution of different syndrome types. KED: kidney essence deficiency; LFBU: liver fire bearing upward; PFSI: phlegm fire stagnation internally; QDSS: Qi deficiency of the spleen and stomach; WFAI: wind fire attacking internally. Figure 3. The tongue fur distribution of different syndrome types. KED: kidney essence deficiency; LFBU: liver fire bearing upward; PFSI: phlegm fire stagnation internally; QDSS: Qi deficiency of the spleen and stomach; WFAI: wind fire attacking internally. Figure 4. The pulse condition distribution of different syndrome types. KED: kidney essence deficiency; LFBU: liver fire bearing upward; PFSI: phlegm fire stagnation internally; QDSS: Qi deficiency of the spleen and stomach; WFAI: wind fire attacking internally. Ethical Considerations

This study’s protocol was approved by the ethics committee of the Shanghai Municipal Hospital of Traditional Chinese Medicine, Shanghai, China (2021SHL-KY-70).

The data was anonymized in order to protect patient privacy. Patients could receive free examinations and treatments throughout the entire process, so no compensation was provided.

Clinical Decision Support for TinnitusOverview

To integrate patient EMRs with diagnostic knowledge from TCM textbooks, we constructed a knowledge graph using a combined “top-down” and “bottom-up” approach []. First, a patient-centered knowledge graph was developed using EMRs. Then, the knowledge graph was enriched with tinnitus diagnostic knowledge from TCM textbooks. Finally, we used a mutual information–based weight calculation method to enhance the knowledge graph by fusing patient case data with diagnostic knowledge. The resulting knowledge graph simulated the diagnostic reasoning processes of experienced physicians. The entire method consisted of three steps: (1) building a weighted tinnitus knowledge graph, (2) finding and scoring common neighbors, and (3) predicting syndrome patterns based on patient similarity. The overall framework is illustrated in .

Figure 5. Overall framework of the proposed method. Knowledge Graph of Tinnitus Based on Heterogeneous Sources

In response to the diagnostic needs of tinnitus in TCM, the ontology structure of a tinnitus medical knowledge graph should revolve around symptoms, syndrome patterns, diseases, drugs, and treatment methods. For this study, we extracted such common concepts from expert-reviewed EMRs and classic medical textbooks, constructed a conceptual knowledge system, and built a top-level ontology structure. Natural language processing techniques [] were used to extract entities and relationships from the patient EMRs based on a defined conceptual knowledge system for tinnitus. By applying certain rules and conducting string matching within the text, we extracted 15 and 10 categories of entities and relationships from the 1265 EMR records, respectively. Once the entity types and hierarchy were determined, we embedded the data into the conceptual knowledge system and established a patient-centric tinnitus knowledge graph in the form of a triple, which maximized the retention of both explicit and implicit diagnostic information.

Furthermore, we enhanced the constructed tinnitus knowledge graph using knowledge extracted from authoritative medical textbooks to supplement tinnitus knowledge information that was not fully expressed in EMRs. Together with the EMR knowledge graph, a complete tinnitus knowledge graph was developed. The knowledge we selected came from 2 classic Chinese medicine textbooks [,], from which we extracted basic concepts related to tinnitus including TCM syndromes, prescriptions, Chinese medicinal herbs, and treatment methods to construct the TCM knowledge graph.

Heterogeneous Knowledge Fusion

Redundancy in the entities and relationships extracted from heterogeneous sources was observed owing to the different sources of data and knowledge. Therefore, knowledge fusion was required. First, data normalization and entity alignment were performed to standardize the named entities extracted from multiple data sources. The entities were associated using string-matching and similarity-calculation methods. As entity and attribute texts were relatively short, a lower similarity threshold was more appropriate; therefore, the similarity judgment threshold was set as 0.6 to prevent errors and omissions. The entity similarity calculation results are listed in . As the knowledge graph was established in Chinese, we calculated the similarity of the Chinese strings.

Then, a matching path was built from the tinnitus ontology–based knowledge graph entity to the EMR-based knowledge graph entity. Patient data were linked to diagnostic knowledge through an ontology. The 2 knowledge graphs were linked by unifying entities with duplicate meanings in the 2 graphs. Manual verification was performed to ensure the accuracy of the knowledge graph. The specific method is illustrated in . Finally, the tinnitus knowledge graph consisted of 1247 entities and 9234 relationships.

Table 1. Entity similarity calculation results.Standardized and ambiguous entities (Chinese)SimilarityWFAIa
风热外侵证 (wind-heat invasion syndrome)0.8
风热外犯证 (wind-heat exterior syndrome)0.6
风热外侵证 (wind-heat exterior assault syndrome)0.8LFBUb
肝火上炎证 (liver fire flaming upward syndrome)0.8
肝热上扰证 (liver heat disturbing upward syndrome)0.8
肝火上扰清窍证 (liver fire disturbing upward and disturbing clearing orifices syndrome)0.83QDSSc
脾胃虚证 (spleen and stomach deficiency syndrome)0.89
脾胃虚弱证 (spleen and stomach weakness syndrome)0.8PFSId
痰火壅结证 (phlegm-fire concretions syndrome)0.8KEDe
肾精不足证 (kidney essence insufficiency syndrome)0.6
肾精亏虚证 (kidney essence deficiency syndrome)0.8
肾虚精亏证 (kidney deficiency and essence deficiency syndrome)0.99
肾精亏耗证 (kidney essence consumption syndrome)0.8

aWFAI: wind fire attacking internally.

bLFBU: liver fire bearing upward.

cQDSS: Qi deficiency of the spleen and stomach.

dPFSI: phlegm fire stagnation internally.

eKED: kidney essence deficiency.

Figure 6. Tinnitus knowledge graph fusion flowchart. EMR: electronic medical record. Calculation of Knowledge Graph Relationship Weights Based on Mutual Information

Considering the varying importance of different entities for different syndrome patterns, the imbalance in data categories, and the varying amount of information carried by symptoms, the calculation of weights required consideration of entities’ importance for diagnostic pattern identification and information content carried by the entities themselves. The data used for weight calculation were derived from real clinical case data used for constructing the knowledge graph. First, the mutual information value (wif) possessed by each entity was obtained using the mutual information method. The obtained value represented the extent to which a variable could acquire diagnostic pattern information.

For a given set of entities X = with corresponding probabilities P = , the target variable to be measured was the diagnostic pattern Y. By calculating the overall entropy H(), conditional entropy H(Y|X), and mutual information value Gain(S,x), the degree to which the diagnostic pattern was determined based on the entity values or the weight value wif of the entity was calculated. The calculations were performed using equations 1-3.

(1) (2) wif = Gain(Y,X) = H(Y) – H(Y|X) (3)

Further, the feature weights were calculated based on the syndrome patterns under the prior conditions. The probability of each symptom appearing under different syndrome patterns was obtained using statistical methods such as:

wsd = p(symi|sdj) (4)

where sym = represents the symptom set and sd = represents the diagnostic pattern set. Finally, the edge weight from node u to node v was defined using equation 5.

Weight(u,v) = wif + wsd (5)

The weights of various symptoms under different syndrome patterns are presented in .

Table 2. Partial weight value of symptom-syndrome type.SymptomWeightKEDa
Spermatorrhea1.435
Soreness of loins1.4213
Dreaminess1.4104
Wake up early in the morning1.3868
Deficiency and insomnia1.3856
Aggravation at night1.167
Cicadas chirp1.1559
Fine pulse1.1448
Scanty fur0.7142
Duration0.6991LFBUb
Irritable1.2376
Restlessness and insomnia1.1196
Wind sound1.0271
String-like pulse1.0056
Tide sound1.0030
Yellow fur0.9118
Reddish tongue0.8992
Duration0.7036
Dry mouth0.6855
Bitter taste in mouth0.6558PFSIc
Tastelessness1.1953
Dizziness and heaviness1.1488
Aural fullness1.1216
Ear distension1.0899
Slippery pulse0.9121
Slimy fur0.8342
Duration0.7113
Yellow fur0.6895
Hearing loss0.6495
Reddish tongue0.6440WFAId
Cold or rhinitis1.2089
Tinnitus onset within a month1.1398
Low voice1.1398
Thin fur1.0286
Floating pulse0.9563
Duration0.6903
Light color of tongue0.6664
Yellow fur0.5082
Hearing loss0.5032
Dreaminess0.4993QDSSe
Feeling emptiness in ear1.2615
Aggravation after work1.1813
Aggravation when standing up1.1562
Fine pulse1.0782
Duration0.7370
Thin fur0.7022
Light color of tongue0.6745
Anxiety0.6596
Hearing loss0.6444
Dreaminess0.4865

aKED: kidney essence deficiency.

bLFBU: liver fire bearing upward.

cPFSI: phlegm fire stagnation internally.

dWFAI: wind fire attacking internally.

eODSS: Qi deficiency of the spleen and stomach.

Patient Similarity Scoring Based on Weighted Common Neighbor Algorithm

By transforming the TCM syndrome diagnostic problem into a prediction problem of linked patient nodes to TCM syndrome nodes, the similarity between 2 patients was calculated to obtain TCM syndrome similarity. For 2 patients, the higher the similarity, the greater the likelihood of having the same diagnostic result. This study measured the similarity using common features. In the knowledge graph, the higher the number of common neighbors to 2 patient nodes, the greater the likelihood of them belonging to the same community (linked to the same TCM syndrome node). The common neighbor graph of patients with different TCM syndromes is shown in , where fewer common neighbors were observed. The common neighbor graph of patient 1 and patient 2 with the same TCM syndrome is shown in , where more common neighbors were observed; however, different nodes had different importance. In TCM, the importance of pulse condition is greater than that of tinnitus duration while diagnosing tinnitus. The edge weight values of continuous tinnitus and thin pulse-to-kidney deficiency syndrome were 0.6991 and 1.1448, respectively, as shown in ; however, even for the same pulse condition, the importance varied for different TCM syndromes. In , the edge weight values of thin pulse to QDSS and KED syndromes were 1.078 and 1.1447, respectively. Therefore, considering the edge weights of common neighbors to the patient nodes and calculating the score of common neighbors based on the edge weight values were essential when counting the number of common neighbors between patient nodes.

The similarity scoring function between patients x and y was defined by equation 6.

(6)

where X = and Y = represent the sets of neighboring nodes for patients x and y, respectively; Pathu,h,v = (u, h, v) denotes the 2-hop path from node u to node v, where h represents the common neighbor of nodes u and v; Pathu,h = (u, h) represents the path from node u to the common neighbor h; and weight(pathu,h) indicates the weight of the path.

When 2 paths with a hop count of 2 between the patient nodes existed, the weights of the paths were calculated to obtain a similarity score list for the patients. The list was then sorted in descending order, and the top 20 patient node syndromes with the highest scores were counted, which represented the most frequently occurring syndrome. Finally, the recommended syndrome was obtained.

Sn = G(f20(score(X,Y))) (7)

where G denotes a frequency-counting method in which X and Y represent sets of patient nodes. f20() was used to obtain the top 20 patient syndromes based on the scores.

Figure 7. Sketch map of common neighbors between different syndromes. Figure 8. Sketch map of common neighbors between same syndromes. Experimental Design

In total, 2 experiments were conducted to verify the effectiveness of the proposed method. The first experiment was performed to compare the proposed method with similar graph algorithms, while the second experiment was performed to compare the proposed method with other common explainable ML methods. The evaluation metrics of the algorithm are accuracy, precision, sensitivity, specificity, F1-score, area under receiver operating characteristic curve (AUC), etc. To demonstrate the interpretability of our method, we selected a tinnitus case for result interpretation to showcase the inference process and interpretability of our method.


ResultsPerformance Verification

For a given knowledge graph, we extracted the patient nodes and their neighboring nodes to form a knowledge network. The node and edge sets in the knowledge network were divided into training and testing sets. The testing set did not contain syndrome entities. To reasonably divide the training and testing sets, we used a stratified sampling cross-validation method of randomly dividing the network node and edge sets into 5 subsets: 1 subset as the testing set, and the other 4 subsets as the training set. The training set served as a known network, whereas the testing set was used to verify the syndrome prediction results and evaluate the accuracy of the syndrome prediction algorithm.

Evaluation OutcomesComparison With Similar Graph Algorithms

The proposed method was compared with similar graph algorithms such as CommonNeighbors and Adamic-Adar. CommonNeighbors is a common graph algorithm used to infer the potential relationships and proximity between 2 nodes []; however, the differences between common neighbors are not considered. Adamic-Adar is a typical algorithm for determining the closeness of 2 points by measuring the outdegree of common neighbors []. ResourceAllocation calculates the closeness between 2 nodes using a set of neighboring nodes near the target node []. We added common neighbor edge weights based on CommonNeighbors. Unlike Adamic-Adar and ResourceAllocation, our weight calculation method considered each syndrome, which had a higher adaptability to TCM diagnosis by the doctors. The experimental results are listed in ; our method outperformed similar graph algorithms in diagnosing each syndrome.

Table 3. Experimental results of graph algorithm comparison.Evaluation indicators and modelsKEDa (n=339)LFBUb (n=307)PFSIc (n=194)QDSSd (n=270)WFAIe (n=155)Value, mean (SD)Average accuracy
Common neighbors0.9780.9780.9820.9830.9880.982 (0.004)
Adamic-Adar0.9790.9790.9780.9830.9890.982 (0.004)
Resource allocation0.9180.9440.9610.9360.9740.947 (0.019)
WeightedCommonNeighbors0.9900.9940.9950.9920.9980.994 (0.003)Average precision
Common neighbors0.9390.9410.9520.9820.9710.957 (0.017)
Adamic-Adar0.9400.9490.9320.9810.9710.955 (0.019)
Resource allocation0.7940.8930.9300.8600.9480.885 (0.055)
WeightedCommonNeighbors0.9700.9870.9930.9861.0000.987 (0.010)Average sensitivity
Common neighbors0.9810.9710.9220.9430.9290.949 (0.023)
Adamic-Adar0.9840.9650.9170.9420.9350.949 (0.023)
Resource allocation0.9330.8770.8010.8400.8370.857 (0.045)
WeightedCommonNeighbors0.9900.9900.9760.9790.9870.985 (0.006)Average F1-score
Common neighbors0.9590.9560.9360.9610.9490.952 (0.009)
Adamic-Adar0.9610.9570.9240.9610.9520.951 (0.014)
Resource allocation0.8560.8840.8590.8490.8850.866 (0.015)
WeightedCommonNeighbors0.9800.9890.9840.9820.9940.986 (0.005)Average specificity
Common neighbors0.9780.9800.9930.9950.9960.988 (0.008)
Adamic-Adar0.9780.9830.9890.9950.9960.988 (0.007)
Resource allocation0.9140.9660.9900.9630.9940.965 (0.029)
WeightedCommonNeighbors0.9890.9960.9990.9961.0000.996 (0.004)Average AUCf
Common neighbors0.9790.9760.9580.9690.9630.969 (0.008)
Adamic-Adar0.9810.9740.9530.9690.9660.968 (0.009)
Resource allocation0.9230.9220.8950.9010.9150.911 (0.011)
WeightedCommonNeighbors0.9900.9930.9870.9880.9940.990 (0.003)

aKED: kidney essence deficiency.

bLFBU: liver fire bearing upward.

cPFSI: phlegm fire stagnation internally.

dQDSS: Qi deficiency of the spleen and stomach.

eWFAI: wind fire attacking internally.

fAUC: area under receiver operating characteristic curve.

Comparison With Other Interpretable ML Methods

The proposed method was compared with common ML classification algorithms including decision tree, random forest, naive Bayes, logistic regression, and k-nearest neighbors algorithms. The results are presented in . The graph algorithm based on WightedCommonNeighbor outperformed other models in the comprehensive diagnosis of each syndrome on the same data set but was lower than the random forest model in terms of the AUC metric. Although the random forest model had a certain degree of interpretability, the overall complexity of model interpretation increased when a large number of decision trees were included. The higher the number of decision trees in the random forest model, the greater the difficulty of interpreting the relationships and decision processes within the model. Compared to the random forest model, our proposed method had higher interpretability and was more readily accepted by doctors.

Table 4. Experimental results of machine learning classification algorithm comparison.Evaluation indicators and modelsKEDaLFBUbPFSIcQDSSdWFAIeValue, mean (SD)Average accuracy
WeightedCommonNeighbors0.9900.9940.9950.9920.9980.994 (0.003)
Decision tree0.9750.9750.9780.9700.9840.976 (0.005)
Random forest0.9870.9820.9850.9870.9940.987 (0.004)
Naive Bayes0.9790.9760.9790.9810.9910.981 (0.005)
Logistic regression0.9860.9830.9830.9840.9940.986 (0.004)
KNNf0.9860.9800.9820.9860.9940.985 (0.005)Average precision
WeightedCommonNeighbors0.9700.9870.9930.9861.0000.987 (0.010)
Decision tree0.9500.9510.9170.9430.9370.939 (0.012)
Random forest0.9740.9500.9700.9820.9630.968 (0.011)
Naive Bayes0.9710.9230.9530.9560.9800.957 (0.019)
Logistic regression0.9710.9610.9500.9640.9810.965 (0.010)
KNN0.9740.9380.9580.9780.9800.966 (0.016)Average sensitivity
WeightedCommonNeighbors0.9900.9900.9760.9790.9870.985 (0.006)
Decision tree0.9590.9450.9430.9150.9360.939 (0.014)
Random forest0.9760.9770.9330.9560.9870.966 (0.019)
Naive Bayes0.9530.9810.9120.9560.9480.950 (0.022)
Logistic regression0.9760.9670.9380.9630.9680.963 (0.013)
KNN0.9730.9840.9230.9560.9680.961 (0.021)Average F1-score
WeightedCommonNeighbors0.9800.9890.9840.9820.9940.986 (0.005)
Decision tree0.9530.9480.9290.9280.9360.939 (0.010)
Random forest0.9750.9630.9500.9680.9750.966 (0.009)
Naive Bayes0.9610.9510.9320.9550.9640.953 (0.011)
Logistic regression0.9740.9640.9430.9630.9740.964 (0.011)
KNN0.9730.9600.9400.9660.9740.963 (0.012)Average specificity
WeightedCommonNeighbors0.9890.9960.9990.9961.0000.996 (0.004)
Decision tree0.9810.9840.9840.9850.9910.985 (0.003)
Random forest0.9900.9830.9940.9950.9950.992 (0.005)
Naive Bayes0.9890.9740.9920.9880.9970.988 (0.008)
Logistic regression0.9890.9880.9910.9900.9970.991 (0.003)
KNN0.9900.9790.9930.9940.9970.991 (0.006)Average AUCg
WeightedCommonNeighbors0.9900.9930.9870.9880.9940.990 (0.003)
Decision tree0.9700.9640.9640.9500.9630.962 (0.007)
Random forest0.9950.9980.9960.9971.0000.997 (0.002)
Naive Bayes0.9960.9960.9930.9950.9970.995 (0.001)
Logistic regression0.9970.9970.9940.9950.9970.996 (0.001)
KNN0.9930.9930.9770.9880.9930.989 (0.006)

aKED: kidney essence deficiency.

bLFBU: liver fire bearing upward.

cPFSI: phlegm fire stagnation internally.

dQDSS: Qi deficiency of the spleen and stomach.

eWFAI: wind fire attacking internally.

fKNN: k-nearest neighbor.

gAUC: area under receiver operating characteristic curve.


DiscussionPrincipal Findings

The experimental results show that the accuracy, sensitivity, specificity, precision, F1-score, and AUC of our propose

留言 (0)

沒有登入
gif