The Organisation for Economic Co-operation and Development (OECD) reported that mild-to-moderate mental disorders (e.g. anxiety, depression) affect around 20% of the working-age population [1]. Mental disorders raise significant concerns for economic development and social welfare [2]. For example, U.S. workers suffering from depression cost employers an additional 31 billion dollars each year in lost productive time [3]. The Japanese Government estimated that the economic and social loss from suicides and mental disorders was at least 2.7 trillion yen (about 25 billion dollars), which is equivalent to 0.7 per cent of the GDP in 2009 [4]. Furthermore, depression is strongly associated with presenteeism [5], which causes health-related productivity loss during paid hours [6-8]. Companies need systems for maintaining their employees' mental well-being to avoid both absenteeism and presenteeism, and workplaces demand a low-cost and easy solution. Here we propose a new concept, ‘layered mental healthcare’, based on biometric measurements.
2 LAYERED MENTAL HEALTHCARE
‘Layered mental healthcare’ is a concept for screening and managing mental well-being in workplaces, alongside medical diagnosis, treatment, and care (Figure 1). A mental disorder originates in the brain, affects physiology, such as heart rate and blood pressure, and alters behaviour. Changes in the three layers of behaviour, physiology, and brain should therefore be measured to monitor the mental/distress conditions and estimate the risk. In this concept, the measurement devices are designed to be wearable, and non-clinical care can also be expected for each layer. For example, leisure activities (e.g. travel, exercise, entertainment), supplement administration, and neurofeedback are suggested for disrupted behaviour, physiology, and brain layers, respectively. Condition monitoring also helps evaluate the effectiveness of care at low cost. Furthermore, the management of working hours, conditions, and environment can be recommended to employers, depending on the employees' conditions.
FIGURE 1. Concept of ‘Layered mental healthcare’. OT denotes optical topography, i.e. a functional near-infrared spectroscopy device
Many biometric measurements have been used for monitoring distress, and both biofeedback and neurofeedback have been used for stress coping [9]. The concept of layers is introduced here to cover a variety of preceding symptoms. The measurements and cares are selected depending on the required accuracy/efficacy and the acceptable cost. In a previous study, physiological markers and behaviours were collected using wearable sensors and mobile phones to classify high or low stress [10].
A multimodal system using electroencephalography (EEG), hemoencephalography (HEG), and heart rate variability (HRV) has also been reported [11]. The novel point of this letter is a multimodal system spanning the three layers. However, it is not clear whether measurements in all three layers are actually needed for monitoring mental/stress conditions. The purpose of this letter is to investigate the contribution of the three-layer measurements to predicting mental/distress conditions. The concept of ‘Layered mental healthcare’ was validated using a data-driven technique.
3 DEVICES, TRIAL IN OFFICE AND ANALYSIS
The devices for validating the concept were selected according to previous studies, as follows. For the behaviour layer, a home-made wrist-band type activity tracker (Life logger) and a PC logger recording event logs of keyboard and mouse operations were used for monitoring the lifestyle [12] and the workstyle [13], respectively. The wrist-band type activity tracker had a triaxial accelerometer to provide the number of steps, the index of exercise intensity in metabolic equivalents, etc. From the key/mouse data recorded over the course of a day, a fractal dimension was obtained as the slope of a line fitted to the graph of the cumulative distribution versus the time interval of key/mouse events [13]. The fractal dimension represents a kind of rhythm of PC operations, which depends on the mood state. For the physiology layer, the HRV, a well-known biomarker for stress, was measured [10, 11]. Even though a camera can accommodate the HRV measurement (Figure 1), the wearable optical topography device (OT, model HOT-1000, NeU) was used in this trial to measure the HRV derived from forehead pulses as well as the brain activity. The ‘LF/HF’ ratio, which reflects the sympatho-vagal balance, was calculated. For the brain layer, the hemodynamic change in the dorsolateral prefrontal cortex during the spatial and verbal delayed matching tasks (working memory function) was observed using OT [14, 15]. The difference in oxygenated hemoglobin changes between the verbal and spatial delayed matching tasks is well correlated with scores of the profile of mood state.
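The fractal dimension described above can be illustrated with a minimal sketch. The source states only that it is the slope of a line fitted to the cumulative distribution versus the time interval of key/mouse events; the log-log axes, the use of the complementary cumulative distribution, the least-squares fit, and the function name `fractal_index` are assumptions made here for illustration, and the timestamps are hypothetical.

```python
import math


def fractal_index(event_times):
    """Slope of a line fitted to log10(CCDF) versus log10(inter-event interval).

    Assumptions (not stated in the source): log-log axes, the complementary
    cumulative distribution P(interval >= x), and an ordinary least-squares fit.
    """
    intervals = sorted(
        later - earlier
        for earlier, later in zip(event_times, event_times[1:])
        if later > earlier
    )
    n = len(intervals)
    if n < 3:
        raise ValueError("need at least three positive inter-event intervals")

    # After sorting, the interval at 0-based rank i is paired with the
    # fraction (n - i) / n of intervals at that rank or above.
    xs = [math.log10(x) for x in intervals]
    ys = [math.log10((n - i) / n) for i in range(n)]

    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0.0:
        raise ValueError("all intervals are identical; the slope is undefined")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


# Hypothetical key-press timestamps in seconds (illustrative only).
timestamps = [0.0, 0.2, 0.5, 0.6, 1.4, 1.5, 3.9, 4.0, 4.3, 9.0]
print(f"fractal index: {fractal_index(timestamps):.2f}")
```

For intervals that follow a power law, the fitted slope equals the negative of the power-law exponent, which is why a single number can summarise the rhythm of PC operations over a day.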
Figure 2 shows the data accumulation system used in the trial for validating the ‘Layered mental healthcare’ concept. All data measured using the devices were accumulated in the database servers. Thirty-nine healthy volunteers (32 males, 7 females, 43.7 ± 8.9 years old) with no history of mental disorders participated in this study; the measurement lasted about four months [16]. The data were obtained from the volunteers, following the receipt of their written informed consent, according to the regulations set forth by the internal review board at the Central Research Laboratory, Hitachi, Ltd. The PC logger was installed on each PC used by each participant in the office, and the log data were automatically recorded during working time. The fractal indexes of key and mouse operations in a day were calculated from the logs. The life logger was worn all day, including holidays, and provided the steps, exercise intensity in metabolic equivalents, and sleeping time in a day. Once a week, at roughly the same time, the OT and questionnaire measurements were performed; the questionnaires were the Kessler 6 (K6) and twenty-nine questions of the Brief Job Stress (BJS) Questionnaire [17-19]. The participants wore the OT headset and performed the delayed matching tasks presented on a tablet PC. The cerebral blood changes associated with brain activity during the tasks and the HRV data were recorded on the tablet PC. Three cardiac features and eleven features of brain activity during the tasks were obtained from the OT measurement. Each participant answered the questionnaires shown in a web browser. BJS provides scores of anxiety, depression, fatigue, irritation, physical stress, and vigor, as well as their total. Lower scores indicate worse conditions. K6 was not used in this analysis.
FIGURE 2. Configuration diagram of the data accumulation system for the trial in office. Recorded data of the PC log, OT including HRV, answers to questionnaires, and life log were accumulated in the databases on the servers. The recording frequency of each is shown in parentheses
Data preprocessing was performed on all of the accumulated data before the machine learning process. Each feature obtained from the PC logger and the life logger was averaged over a week. Outliers in the feature data (< –3σ or > 3σ) were removed. To avoid multicollinearity, mutually independent features, with cross-correlations of less than 0.6, were selected. Weekly data with records missing because of forgetting, absence, or business trips were removed. No standardisation of the data was performed. Finally, 96 weekly records of feature data were obtained for each target variable (i.e. BJS scores). Furthermore, records pairing the targets with features obtained some weeks before or after the week in which the targets were measured were prepared by temporally shifting the feature data.
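The outlier removal and the correlation-based feature selection described above can be sketched as follows. This is an illustration under stated assumptions, not the authors' pipeline: the source does not say how features were chosen among correlated pairs, so a greedy, order-dependent selection is assumed, the population standard deviation is assumed for the 3σ rule, and the function names are hypothetical.

```python
import math
import statistics


def remove_outliers(values, n_sigma=3.0):
    """Replace values further than n_sigma standard deviations from the mean with None."""
    present = [v for v in values if v is not None]
    mean = statistics.fmean(present)
    sd = statistics.pstdev(present)  # assumption: population standard deviation
    return [
        v if v is not None and abs(v - mean) <= n_sigma * sd else None
        for v in values
    ]


def pearson(xs, ys):
    """Pearson correlation over the records where both values are present."""
    pairs = [(x, y) for x, y in zip(xs, ys) if x is not None and y is not None]
    mean_x = statistics.fmean(x for x, _ in pairs)
    mean_y = statistics.fmean(y for _, y in pairs)
    sxx = sum((x - mean_x) ** 2 for x, _ in pairs)
    syy = sum((y - mean_y) ** 2 for _, y in pairs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in pairs)
    if sxx == 0.0 or syy == 0.0:
        return 0.0  # a constant column carries no correlation information
    return sxy / math.sqrt(sxx * syy)


def select_independent(features, threshold=0.6):
    """Greedily keep features whose |r| with every kept feature is below threshold.

    Assumption: features are visited in dictionary order, so an earlier
    feature wins over a later one that is strongly correlated with it.
    """
    kept = []
    for name, column in features.items():
        if all(abs(pearson(column, features[k])) < threshold for k in kept):
            kept.append(name)
    return kept
```

A caller would apply `remove_outliers` to each weekly feature column first and then pass the cleaned columns, keyed by feature name, to `select_independent`.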
Light Gradient Boosting Machine (LGBM) [20, 21] was used to obtain the gain-based feature importance for each target variable. Nested cross-validation (cv) was performed: a 10-fold outer cv was used to obtain the best models, and a 5-fold inner cv was used to tune the hyper-parameters of LGBM with Optuna [22]. The objective function was the L2 loss (least-squares error), as the task was regression. The best models were selected according to the adjusted R² averaged across targets, because the valid sample size depended on the set of records. Then, the importance of each feature for each target was obtained using the set of records that provided the best models.
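The structure of the nested cross-validation and the adjusted R² criterion can be sketched without any model code. The source does not give the adjusted R² formula or say whether folds were shuffled, so the standard definition 1 − (1 − R²)(n − 1)/(n − p − 1) and contiguous, unshuffled folds are assumed here; the function names are hypothetical, and the LGBM and Optuna calls themselves are deliberately omitted.

```python
def r_squared(y_true, y_pred):
    """Coefficient of determination."""
    mean_y = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def adjusted_r_squared(y_true, y_pred, n_features):
    """Adjusted R2 (standard definition, assumed): penalises features relative to sample size."""
    n = len(y_true)
    if n - n_features - 1 <= 0:
        raise ValueError("too few samples for this number of features")
    r2 = r_squared(y_true, y_pred)
    return 1.0 - (1.0 - r2) * (n - 1) / (n - n_features - 1)


def k_fold_indices(n_samples, k):
    """Yield (train, test) index lists for k contiguous folds (no shuffling)."""
    sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in sizes:
        test = list(range(start, start + size))
        train = list(range(0, start)) + list(range(start + size, n_samples))
        yield train, test
        start += size


def nested_splits(n_samples, outer_k=10, inner_k=5):
    """Yield (outer_train, outer_test, inner_folds) for nested cross-validation.

    Each inner fold is a (train, validation) pair drawn only from outer_train,
    so hyper-parameter tuning never sees the outer test records.
    """
    for outer_train, outer_test in k_fold_indices(n_samples, outer_k):
        inner = [
            ([outer_train[i] for i in tr], [outer_train[i] for i in va])
            for tr, va in k_fold_indices(len(outer_train), inner_k)
        ]
        yield outer_train, outer_test, inner
```

In such a scheme, the hyper-parameters would be tuned on the inner folds, the model refitted on `outer_train`, and the adjusted R² evaluated on `outer_test`.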
4 RESULTS AND DISCUSSION
The selected features, together with the results of the Shapiro–Wilk normality test, are shown in Table 1. The simplest features, ‘Ped’ and ‘Keylog’, were selected among the behaviour features. The physiological features ‘HRstdev’ and ‘LF/HF’ were favoured over the heart rate itself. For the brain layer, ‘OT_sv_L’ and ‘OT_sv_rt’, which are correlated with the mood state and the ability of brain function, were selected. The selected features were then compared between layers. The target variables were not normally distributed because they are discrete scores. Some features were also not normally distributed; the small sample size might cause these non-normal distributions. Therefore, regression trees, which do not require the assumption of normally distributed variables, were used for building the models.
TABLE 1. Features selected as explanatory variables. ‘Non’ in the Normality column means that the normal distribution of the feature variable was rejected at a significance level of 5% by the Shapiro–Wilk test

| Feature | Device | Description | Normality (p-value) |
| --- | --- | --- | --- |
| Ped | Life logger | Steps in a day | – (0.094) |
| Keylog | PC logger | Fractal index of key operation in a day | – (0.37) |
| HRstdev | OT | Standard deviation of heart rate during the task | Non (8.4E-6) |
| LF/HF | OT | Power ratio of low (0.04–0.15 Hz) to high (0.15–0.4 Hz) frequency of pulse cycle | Non (1.5E-7) |
| OT_sv_L | OT | Difference in left prefrontal cortex activity between verbal and spatial working memory tasks | Non (0.0046) |
| OT_sv_rt | OT | Difference in response time between verbal and spatial working memory tasks | Non (0.0030) |

Because the features objectively measured using the devices may change earlier or later than the subjectively reported mental/distress scores, combinations of features and target values obtained in different weeks were also tested. Figure 3 compares the best adjusted R², averaged across target variables, between the sets of records with shifted weeks. The sets S1b, S2b, S1a, and S2a combine the target variables with the feature variables obtained 1 and 2 weeks before and 1 and 2 weeks after the measurement of the target variables, respectively. The feature variables in set S0 were obtained in the week of the target variable measurement, while S0+1b had the feature variables of both S0 and S1b. The adjusted R² values of S1b and S0 ranked first and second, respectively. These results suggest that the mental modulation of the three-layer features was objectively observed prior to the target variables. They are reasonable because the questionnaires asked about mental conditions within the last week. Therefore, the three-layer measurement can indicate the mental condition early, and this early detection is one of the advantages of the current concept.
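The construction of the shifted record sets can be sketched for a single participant as below. The data layout (dictionaries keyed by week number), the function names, and all values are hypothetical and chosen only to show how a shift of −1 week yields an S1b-style set, 0 yields S0, and +1 yields S1a.

```python
def shifted_records(features_by_week, targets_by_week, shift):
    """Pair each week's target with the features measured `shift` weeks away.

    shift = -1 uses features from 1 week before the target (as in S1b),
    shift = 0 uses the same week (S0), and shift = +1 uses 1 week after (S1a).
    Weeks without matching features are dropped, as with missing records.
    """
    records = []
    for week, target in sorted(targets_by_week.items()):
        features = features_by_week.get(week + shift)
        if features is not None:
            records.append((features, target))
    return records


def combined_records(features_by_week, targets_by_week, shifts):
    """Merge features from several shifts into one record (as in S0+1b)."""
    records = []
    for week, target in sorted(targets_by_week.items()):
        merged = {}
        for shift in shifts:
            features = features_by_week.get(week + shift)
            if features is None:
                break  # skip weeks where any required feature set is missing
            merged.update({f"{name}@{shift:+d}w": v for name, v in features.items()})
        else:
            records.append((merged, target))
    return records


# Hypothetical weekly data for one participant (illustrative values only).
features = {1: {"Ped": 10}, 2: {"Ped": 20}, 3: {"Ped": 30}}
targets = {1: 5, 2: 6, 3: 7}
s1b = shifted_records(features, targets, shift=-1)
s0_plus_1b = combined_records(features, targets, shifts=(0, -1))
```

Each shifted set loses the weeks at the edge of the trial, which is one reason the valid sample size depends on the set of records.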
FIGURE 3. Adjusted R² averaged across the mental scores of BJS for each feature set
Figure 4 shows the ratios of feature importance (‘gain’) for each target variable using the best models with S1b. The feature importance measures the degree to which branching on a feature improves the predictive accuracy of the model, and it is used to assess the relative contribution of the feature to the model. For example, the 1-week shifted keylog feature contributed highly to the prediction of depression. The importance of each layer was calculated by summing the feature ratios in that layer for each target variable. The ratio of layer importance averaged across the seven target variables was calculated as
Behaviour:Physiology:Brain = 4:3:3.
The feature importances are comparable across layers. Therefore, the three-layer measurements were important for estimating the mental/distress conditions.
FIGURE 4. Percentage of feature importance in the behaviour, physiology, and brain layers for predicting the mental scores of the BJS questionnaire using Light Gradient Boosting Machine with explanatory variables obtained 1 week prior to the measurement of the mental scores
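The layer importance calculation described above (normalise the gains per target, sum them within each layer, then average across targets) can be sketched as follows. The assignment of features to layers follows Table 1 and the text; the function name is hypothetical and the gain values in the usage example are invented for illustration, not results from the trial.

```python
# Layer assignment of the selected features, following the text and Table 1.
LAYER_OF = {
    "Ped": "behaviour",
    "Keylog": "behaviour",
    "HRstdev": "physiology",
    "LF/HF": "physiology",
    "OT_sv_L": "brain",
    "OT_sv_rt": "brain",
}


def layer_importance(gains_by_target, layer_of=LAYER_OF):
    """Average, across targets, of the per-layer share of total gain."""
    totals = {layer: 0.0 for layer in set(layer_of.values())}
    for gains in gains_by_target.values():
        total_gain = sum(gains.values())
        for feature, gain in gains.items():
            # Ratio of this feature's gain, credited to the feature's layer.
            totals[layer_of[feature]] += gain / total_gain
    n_targets = len(gains_by_target)
    return {layer: value / n_targets for layer, value in totals.items()}


# Invented gain values for two targets (illustrative only, not trial data).
example_gains = {
    "depression": {"Ped": 2, "Keylog": 2, "HRstdev": 3, "LF/HF": 0, "OT_sv_L": 2, "OT_sv_rt": 1},
    "fatigue": {"Ped": 1, "Keylog": 1, "HRstdev": 1, "LF/HF": 1, "OT_sv_L": 1, "OT_sv_rt": 1},
}
shares = layer_importance(example_gains)
```

Because the gains are normalised within each target before averaging, every target contributes equally to the layer ratio regardless of the absolute scale of its gains.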
The relationship between the total BJS scores predicted by the best models and the corresponding test data is shown in Figure 5 as a reference example. The Pearson correlation coefficients for the mental/distress scores of BJS, calculated under the approximation of normally distributed data, are listed in Table 2. These values are reasonably acceptable, except for irritation, considering the small variance of the data from healthy participants. Developing the best models for prediction is a separate topic. Because the regression tree was used for quantitatively investigating the contribution of features in the three layers, it did not necessarily provide the best model for prediction. The prediction accuracy can potentially be improved by optimising the explanatory variables, introducing interaction effects, and using suitable regressors, such as the support vector machine and the neural network. The features in the physiology layer, obtained during the task once a week using OT, differed from the features obtained using a heart rate monitor during the resting state. Daily and continuous measurement using a mobile pulse meter may provide better prediction results. Models that depend on the individual or state will be derived through further trials. To show the efficiency of the ‘Layered mental healthcare’ concept, the measurement protocol should consider the trade-off between accuracy and cost; the care recommendation methods are expected to be developed in the next step.
FIGURE 5. Relationship between the predicted and true values of the total BJS questionnaire scores. The fitted line was obtained by linear regression using both predicted and true values
TABLE 2. Pearson correlation coefficients between the predicted and true values for each mental score of BJS

| Mental/distress score of BJS | Pearson correlation coefficient |
| --- | --- |
| Total | 0.55 |
| Irritation | 0.32 |
| Fatigue | 0.48 |
| Depression | 0.57 |
| Anxiety | 0.55 |
| Physical stress | 0.55 |

5 CONCLUSION
Using the data obtained during the 14-week trial in office, the best models for predicting the mental scores of BJS were obtained with LGBM. The biometric features obtained 1 week prior to the measurement of the mental scores provided the best models. The ratio of feature importance for the layers of behaviour:physiology:brain was 4:3:3. These results suggest that the contribution of each layer to the prediction was comparable. Therefore, the three-layer measurement was necessary for monitoring the mental/distress conditions. Because the mental/distress conditions are risk factors for mood disorders, the three-layer measurement is potentially helpful for early detection before the onset of clinical symptoms of mood disorders. The purposes of the measurement can be extended to managing mental conditions and preventing mood disorders by recommending suitable cares. The current results also confirmed and validated the benefits of the ‘Layered mental healthcare’ concept in realising mental well-being in workplaces.
REFERENCES
1. Hewlett, E., Moran, V.: Making mental health count: The socio economic costs of neglecting mental health care. OECD Publishing, Paris (2014)
2. Kigozi, J., et al.: The estimation and inclusion of presenteeism costs in applied economic evaluation: A systematic review. Value Health 20(3), 496–506 (2017)
3. Stewart, W., et al.: Cost of lost productive work time among US workers with depression. J. Am. Med. Assoc. 289, 3135–3145 (2003)
4. Kaneko, Y., Sato, I.: A calculation of economic benefits from introducing measures to cope with suicide and depression. National Institute of Population and Social Security Research (2010) (in Japanese)
5. Burton, W.N., et al.: The association of medical conditions and presenteeism. J. Occup. Environ. Med. 46(Suppl), S38–S45 (2004)
6. Yoshimoto, T., et al.: The economic burden of lost productivity due to presenteeism caused by health conditions among workers in Japan. J. Occup. Environ. Med. 62, 883–888 (2020)
7. Loeppke, R., et al.: Health-related workplace productivity measurement: General and migraine-specific recommendations from the ACOEM Expert Panel. J. Occup. Environ. Med. 45, 349–359 (2003)
8. Schmidt, B., et al.: A comparison of job stress models: Associations with employee well-being, absenteeism, presenteeism, and resulting costs. J. Occup. Environ. Med. 61, 535–544 (2019)
9. Pardamean, B., et al.: Quantified self-using consumer wearable device: Predicting physical and mental health. Healthc. Inform. Res. 26, 83–92 (2020)
10. Sano, A., et al.: Identifying objective physiological markers and modifiable behaviors for self-reported stress and mental health status using wearable sensors and mobile phones: Observational study. J. Med. Internet Res. 20, e210 (2018)
11. Ha, U., et al.: A wearable EEG-HEG-HRV multimodal system with simultaneous monitoring of tES for mental health management. IEEE Trans. Biomed. Circuits Syst. 9, 758–766 (2015)
12. Hayashi, T.: A study on estimation method of business performance using wristband type life monitor. The 70th National Industrial Safety and Health Convention, Tokyo (2011)
13. Egi, M., Ishikawa, H.: Scale-free dynamics of human behavior in personal computer operations. The 10th International Conference on Management of Digital EcoSystems, Tokyo (2018)
14. Sato, H., et al.: Correlation of within-individual fluctuation of depressed mood with prefrontal cortex activity during verbal working memory task: Optical topography study. J. Biomed. Optics 16, 126007 (2011)
15. Atsumori, H., et al.: Prefrontal cortex activation of return-to-work trainees in remission of mental disorders with depressive symptoms compared to that of healthy controls. J. Biomed. Optics 24, 056008 (2019)
16. Kiguchi, M., et al.: Mental condition monitoring based on multimodality biometry. Front. Public Health 8, 479431 (2020)
17. Inoue, A., et al.: Development of the new brief job stress questionnaire. Springer, Dordrecht (2014)
18. The Brief Job Stress Questionnaire, https://www.mhlw.go.jp/bunya/roudoukijun/anzeneisei12/dl/160621-1.pdf
19. Furuichi, W., et al.: Effects of job stressors, stress response, and sleep disturbance on presenteeism in office workers. Neuropsychiatr. Dis. Treat. 16, 1827–1833 (2020)
20. Ke, G., et al.: LightGBM: A highly efficient gradient boosting decision tree. Adv. Neural Inf. Process Syst. 30, 3149–3157 (2017)
21. LightGBM: https://lightgbm.readthedocs.io/en/latest/, accessed Sep 2020
22. Akiba, T., et al.: Optuna: A next-generation hyperparameter optimization framework. KDD ’19 (Anchorage, August, 2019). https://dl.acm.org/doi/10.1145/3292500.3330701, accessed Sep 2020