Next Article in Journal
A Review of Natural Gas Hydrate Formation with Amino Acids
Previous Article in Journal
Numerical Study of Scale Effects on Open Water Propeller Performance
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Develop a Comprehensive Method to Evaluate the Mental Workload of Ship Operators

1
College of Mechanical and Electrical Engineering, Harbin Engineering University, Harbin 150001, China
2
Management Department, East University of Heilongjiang, Harbin 150086, China
3
Faculty of Electromechanical and Civil Engineering, Vietnam National University of Forestry, Hanoi 10000, Vietnam
*
Author to whom correspondence should be addressed.
J. Mar. Sci. Eng. 2022, 10(8), 1133; https://doi.org/10.3390/jmse10081133
Submission received: 6 July 2022 / Revised: 10 August 2022 / Accepted: 16 August 2022 / Published: 17 August 2022
(This article belongs to the Section Ocean Engineering)

Abstract

:
Mental workload has become an important factor affecting the human error of ship operators. Controlling the mental workload of operators within a reasonable range can reduce human errors. The purpose of this study is to develop an evaluation method to evaluate the ship operator’s mental workload. First, the evaluation indices system was constructed according to three types of mental workload measurements. Second, the criteria importance though intercrieria correlation (CRITIC) method and analytic hierarchy process (AHP) method were used to determine the relative weight of each index. Finally, the fuzzy theory was used to calculate ship operator’s mental workload. The experiment results indicated that subjective workload assessment technique, physiological measurement (eye response), and error rate can be integrated into the comprehensive evaluation method to assess the mental workload of ship operators. Thus, this method can be used to comprehensively evaluate the mental workload level and improve the reliability of the assessment results.

1. Introduction

The marine transportation industry plays an increasingly important role in global transportation. Although a lot of work has been done to enhance the safety of maritime transportation, human error remains one of the leading causes of marine accidents. Research reports on maritime accidents show that about 80–85% of accidents involve human error [1]. Ship operation is a heavy mental workload task, particularly in the complex operation procedures. The mental workload has become an important factor affecting human errors. Low or high mental workload will have a negative impact on human performance [2]. Keeping mental workload within a reasonable range can decrease human error and improve system security [3]. As the improvement of automation level of ship control room, the operation tasks are primarily cognitive tasks, which will cause the change of the operator’s mental workload level. Consequently, it is necessary to assess the operators’ mental workload to improve the operators’ performance.
Mental workload is defined as the amount of mental work required by person to complete a specific task [4]. When the operator is engaged in cognitive tasks, information processing needs to call on the operator’s cognitive resources. As the task load increases, more and more resources are needed. When the mental workload exceeds a reasonable range, performance suffers and errors begin to occur. At present, the evaluation methods of mental workload are generally divided into three categories: subjective measurement, physiological measurement, and performance measurement. Subjective measure, such as Positive Affect and Negative Affect Schedule [5], Subjective Workload Assessment Technique (SWAT) [6], Activation-Deactivation Adiective Check List [7], National Aeronautics and Space Administration’s Task Load Index (NASA-TLX) [8], and Modified Cooper-Harper Ratings [9], have been widely used to assess the operator’s mental workload. Most subjective measurement results generally refer to the measurement of mental workload without considering the impact of physical labor [10]. However, subjective measurement is easily influenced by personal factors, such as preferences, prejudices, and protect attitudes. Thus, it is not recommended to use subjective measurement when the evaluation results are easily affected by personal factors [11]. Physiological measurement is to measure mental workload by collecting physiological parameters. This method requires the instrument to be connected with the person during the measurement process, which may restrict the movement of the person. However, this method requires only a few samples to provide accurate results.
Eye response is a physiological measurement method which can measure the operators’ mental workload in real time [12]. Eye response measurement is a technique to analyze the eye response to visual stimuli. Generally, the indexes of pupil diameter, fixation rate, saccadic rate, and blink rate are combined to evaluate mental workload [13,14]. Pupil dilation has a correlation with the amount of cognitive control, attention, and cognitive requirement; it can reflect the mental workload level [15,16]. Nevertheless, the pupil diameter is easily affected by changes of light environment, so the experimental results cannot accurately reflect the level of mental workload. Therefore, pupil dilation is usually combined with blink indices, saccadic rate, and fixation to assess cognitive requirement in different tasks. Blink rate increased with increased load resulting from memory tasks, but it declined with increased workload resulting from processing visual stimuli [17]. Eye fixation refers to the stay of the human eyes in a specific position. This may be because the operator is interested in specific information, or it may be because the information is difficult to understand. It is found that this index has a correlation with task difficulty and can be used to measure mental workload [18]. In addition, the saccadic rate has a negative correlation with mental workload [19].
Through the above analysis, subjective measurement is easily affected by personal factors. Physiological measurement can be affected by external environmental factors and emotions. In addition, performance measurement requires many experiments to obtain reliable results. A single measurement method has certain limitations. The comprehensive measurement method of multiple indices can make up for the shortcomings of various methods. Therefore, the purpose of this research is to develop a method to measure the mental workload of ship operators based on subjective measurement, physiological measurement, and performance measurement. According to the measurement method, an evaluation index system is established, and the CRITIC method and AHP method are used to calculate the relative weight of each index. Finally, the mental workload of the ship operator is evaluated according to the fuzzy theory. The developed method has an important impact on improving human performance and reducing the number of accidents.

2. Comprehensive Evaluation Method

2.1. Comprehensive Evaluation Index System

The first step of the comprehensive evaluation method is to construct an evaluation index system. This study selects workload evaluation indices from three aspects: physiological measurement, subjective measurement, and performance measurement. Physiological measurement methods are easily affected by emotional and physical factors because the device is in contact with the user’s body, which limits the real state of the user. Nevertheless, physiological measurement requires only a few samples to obtain accurate evaluation results. Therefore, this paper selects the eye response measurement method to measure the operator’s mental workload level in real time.
During the experiment, the iView X head mounted eye-tracking device (SensoMotoric Instruments, Teltow, Germany) with pupil/corneal reflection <0.1 and a rate of 50 Hz was used to record the eye response. The experimental data were analyzed and processed using the BeGaze software (version 3.0). Raw eye tracking data were recorded in the form of XY coordinates according to sampling rate.
Subjective workload assessment is becoming more and more widely used. This method is easy to implement and low cost. However, subjective assessment results are easily influenced by personal characteristics, such as bias, errors, and preferences [20] Therefore, only relying on the subjective evaluation results cannot accurately reflect the operator’s workload level. This research used the SWAT method to assess the subjective workload. SWAT evaluates the workload from three dimensions: mental load, time load, and stress load. In this study, an improved version that is more sensitive to the medium-level workload was used to evaluate the workload [21]. Participants marked a visual analog scale with a horizontal length line of 16.5 cm, and made a mark at the beginning, middle, and end of the horizontal line to represent the workload from high to low. The level of each dimension of SWAT method is measured by the marked horizontal line (lowest score 0 cm, highest score 16.5 cm), and the total workload is the sum score of the three dimensions (the highest score is 49.5 cm).
Performance measurement methods can be classified into many categories, such as task time, accuracy, domain-specific measures, worst-case performance, etc. [22]. The error rate is the key index to measure human performance. This index has been applied in many fields to measure workload [23,24]. The operator’s mental workload is not within the reasonable range, which is the cause of the errors, and the errors affect the system operation safety. Thus, this study selected the error rate as the performance measurement index.

2.2. Normalization of Evaluation Matrix

With m evaluating indices and n participants count form an original index value matrix:
X = [ x 11 x 12 x 1 m x 21 x 22 x 2 m x n 1 x n 2 x n m ]
where, xij is the evaluation data of the ith participant on the jth index.
This matrix is normalized to get Equation (2):
H = [ h i j ] n × m
where, hij is the normalized value, and h i j [ 0 , 1 ] .

2.3. Determining the Indices Weight

As an important part of fuzzy assessment method, calculating the index weight has an important impact on the evaluation results. The CRITIC method is an objective weighting method based on data volatility. The objective weighting method is based on the influence of quantitative indices of data information on the evaluation results. Because the objective weighting method needs to depend on sample data and actual problem domain, its universality and participation are poor, and it cannot reflect the importance of the judges on different index. Sometimes the determined weight will greatly differ from the actual importance of attributes. AHP method is a weighting method based on subjective evaluation results. In the subjective weighting method, the index weight is easily affected by subjective preference of the evaluator, and it is difficult to measure the importance of the index scientifically. This study combined the CRITIC method and the AHP method to calculate the index weight. The calculation procedure of CRITIC method is as follows [25]:
Calculate the mean square deviation of the ith influencing factor:
h j ¯ = 1 n i = 1 n h i j j = 1 , 2 , , m
s j = 1 n 1 i = 1 n ( h i j h j ¯ ) 2 j = 1 , 2 , , m
where, h ¯ j stands for the average value of the jth influencing factor, and s j is the mean square deviation of the jth influencing factor.
Calculate the correlation coefficient of jth index and kth index:
r j k = i = 1 n ( x i j x j ¯ ) ( x i k x k ¯ ) i = 1 n ( x i j x j ¯ ) 2 i = 1 n ( x i k x k ¯ ) 2 i = 1 , 2 , , n ; j = 1 , 2 , , m
Calculate the conflict of the jth index:
R j = i = 1 n ( 1 r i j )
Calculate the information content of the jth index
T j = s j R j = s j i = 1 n ( 1 r i j ) i = 1 , 2 , , n ;   j = 1 , 2 , , m
The greater T j , the greater the amount of information contained in the jth evaluation index, and the greater the relative importance of the index.
The calculation result of index weight by CRITIC method is:
δ j = T j j = 1 m T j j = 1 , 2 , , m
The procedure of calculating the index weight by AHP method is described as follows [26]:
The first step of AHP is to construct a judgment matrix. n participants judged the relative importance of the pairwise comparison of m indices through collective discussion. The judgment matrix is constructed according to the results of the comparison of m indices:
A = [ a 11 a 1 m a m 1 a m m ]
where aij represents the importance of ith index to jth index. The importance of the pairwise comparison of two indices is expressed on an integer scale from 1 to 9, the meaning of numbers is shown in Table 1. The element aij satisfies aij > 0, aij = 1/aji, and aii = 1.
Column standardization:
a i j ¯ = a i j / k = 1 m a k j , ( i , j = 1 , 2 , , m )
Summation of the standardized column:
ω i ¯ = j = 1 m a i j ¯
Calculation of the eigenvector:
ω i = ω i ¯ / i = 1 m ω i ¯
Find the largest eigenvalue:
λ m a x = i = 1 m ( A ω ) i / ( m ω i )
λmax is the largest eigenvalue of A. ω = (ω1, ω2, …, ω8), the value of ωi is the weight of the ith factor.
The maximum eigenvalue and eigenvector of the judgment matrix was calculated, and then the consistency index, random consistency ratio, and mean random consistency index were used to test the consistency. The calculation process of consistency index is as follows:
C I = ( λ max m ) / ( m 1 )
The parameter of CR is used to measure the consistency satisfaction for the judgement matrix. The calculation process is as follows:
C R = C I / R I
If CR < 0.1, the judgment matrix consistency is acceptable. Otherwise, the judgment matrix needs to be adjusted. The value of the average random consistency index RI is shown in Table 2.
The result of the combination of the CRITIC method and the AHP as shown in Equation (16):
η j = λ δ j + ( 1 λ ) ω j
where, λ is the preference index, and λ ( 0 , 1 ) , η = [ η j ] = [ η 1 ,   η 2 η m ] .

2.4. Determine Evaluation Criteria

The m indices U = { u 1 u 2 u m } were used to evaluate the operator’s mental workload. For each index, evaluation criteria and ranks are carefully determined. V = {v1 v2 v3 v4 v5} = {Very low Low Normal High Very high} is used to describe the five evaluation ranks of each index, the score of each evaluation criteria is H = {1 2 3 4 5}.

2.5. Determining Fuzzy Relation Matrix

According to the evaluation index system and membership function, the fuzzy relation matrix is determined as:
F = [ f i j ] m × r = [ f 11 f 1 r f m 1 f m r ]
where, f i j represents the fuzzy membership of the ith index belonging to the jth rank.
When the indices, such as pupil dilation, have positive correlation with mental workload level, then the fuzzy function is as follows:
f i k ( x ) = { 0 x λ i ( k + 1 ) λ i ( k + 1 ) x λ i ( k + 1 ) λ i k λ i k < x < λ i ( k + 1 ) k = 1 1 x λ i k
f i k ( x ) = { 0 x λ i ( k 1 ) ,   x λ i ( k + 1 ) x λ i ( k 1 ) λ i k λ i ( k 1 ) λ i ( k 1 ) < x < λ i k k = 2 , 3 , 4 λ i ( k + 1 ) x λ i ( k + 1 ) λ i k λ i k x < λ i ( k + 1 )
f i k ( x ) = { 0 x λ i ( k 1 ) x λ i ( k 1 ) λ i k λ i ( k 1 ) λ i ( k 1 ) < x < λ i k k = 5 1 x λ i k
where, x is the assess value of the ith index; λ i k is the kth rank threshold of the ith index; and f i k is the fuzzy membership of the ith index belonging to the kth rank.
When the indices, such as saccadic rate, have negative correlation with mental workload level, then the membership f i k can be given as:
f i k ( x ) = { 1 x λ i k x λ i ( k + 1 ) λ i k λ i ( k + 1 ) λ i ( k + 1 ) < x < λ i k k = 1 0 x λ i ( k + 1 )
f i k ( x ) = { 0 x λ i ( k + 1 ) ,   x λ i ( k 1 ) λ i ( k 1 ) x λ i ( k 1 ) λ i k λ i k < x < λ i ( k 1 ) k = 2 , 3 , 4 x λ i ( k + 1 ) λ i k λ i ( k + 1 ) λ i ( k + 1 ) x λ i k
f i k ( x ) = { 1 x λ i k λ i ( k 1 ) x λ i ( k 1 ) λ i k λ i k < x < λ i ( k 1 ) k = 5 0 λ i ( k 1 ) x

2.6. Calculating Evaluation Result

The membership degree of the evaluation results at different ranks can be calculated through the fuzzy relationship matrix and the index weight vector:
M = η × F = [ η 1 ,   η 2 η m ] × [ f 11 f 1 r f m 1 f m r ]
Fuzzy comprehensive evaluation scores are:
C = H × M T

3. Case Study

3.1. Participants

In this study, 22 students with the mean age and standard deviation of 25 ± 1.9 years were invited to participate in the experiment. All participants have excellent engineering education background and are familiar with the computer operations. Moreover, all of them have good vision ability, are right-handed, and had no physical discomfort on the process of the experiment. All participants were recruited in an open manner. Before the experiment, all participants were told the specific content of the experiment in detail. During the experiment, operators’ eye responses were recorded by the eye-tracking device. After the experiment, the participants were invited to fill in the questionnaire. All participants volunteered to participate in the experimental study, and the collected data were also allowed by them. Participants could stop the experiment and withdraw the experiment data without any reason. All experimental methods were implemented in accordance with the relevant regulations, and the experimental scheme was approved by the institutional review committee of Harbin Engineering University.

3.2. Operation Task

The function of the stern tube system is to support the stern shaft or propeller shaft, and make it reliably pass out of the ship, so as not to make a large amount of outboard water leak into the ship, and to prevent the lubricating oil from leaking out. Therefore, the stern tube system has an important impact on the safe operation of ships. In this study, the stern tube system operation procedure was carried out in the simulator MC90-V developed by Kongsberg maritime. The specific operation procedures are shown in Table 3. Figure 1 is the operation interface of simulator MC90-V. The software was developed based on the real diesel engine data [27].

3.3. Acquisition of Experimental Data

The calculation process of SWAT score includes two steps. The first step is to score each dimension to determine the impact of each dimension on mental workload. In this step, the improved version of visual analysis scales is used to score each dimension. The second step is to sum the scores of the three dimensions as SWAT scores.
Error rate refers to the ratio of the number of incorrect operations to the number of actual operations. The eye response data of all participants were processed, and the average value of the whole task was taken as the final eye response parameter of participants. The area of interest is defined as the human–computer interface. During the experiment, only the data in the area of interest were collected, and all the data outside the area of interest were excluded. The results of the participants’ experimental data are summarized in Table 4.

3.4. Evaluation Index Weight

Using the data shown in Table 4 and based on Equations (1)–(4), the mean square deviation of the influencing factor is:
S j = [   0.3262   0.3050   0.3470   0.2799   0.2869   0.4289 ]
The conflict of index is:
R j = [   3.503   3.671   3.395   5.058   3.282   3.729 ]
The information content of the index is:
T j = [   1.143   1.120   1.178   1.416   0.942   1.599 ]
The calculation result of index weight by CRITIC method is:
δ j = [   0.1545   0.1514   0.1592   0.1914   0.1273   0.2162 ]
The judgment matrix of AHP is:
A = [ 1 1 2 1 1 1 5 1 4 2 1 2 2 1 4 1 3 1 1 2 1 1 1 5 1 4 1 1 2 2 2 1 5 1 4 5 4 5 5 1 2 4 3 4 4 1 2 1 ]
According to Equations (10)–(12), the calculation result of index weight by AHP method is as follows:
ω = [   0.0682   0.1189   0.0682   0.0682   0.4041   0.2724 ]
CI = 0.011, CR = 0.0088, the consistency of the judgment matrix is considered acceptable.
According to Equation (16), the calculation result of index weight using CRITIC method and AHP method is as follows:
η = [   0.11   0.14   0.11   0.13   0.27   0.24 ]
According to the calculated result of CRITIC and AHP methods, the relative weights of pupil dilation, blink rate, fixation rate, saccadic rate, SWAT, and error rate were 0.11, 0.14, 0.11, 0.13, 0.27, and 0.24, respectively.

3.5. Comprehensive Evaluation Result

Before constructing the fuzzy membership degree matrix, the threshold of each index under different evaluation ranks needs to be determined. In this study, the indices were divided into five evaluation ranks: very low, low, normal, high, and very high. The thresholds of each index under different evaluation ranks are shown in Table 5.
According to Equations (18)–(23), the fuzzy membership matrix F was obtained.
F = [ 0 0 0.94 0.06 0 1 0 0 0 0 0 0.6 0.4 0 0 0 0 0.94 0.06 0 0 0.24 0.76 0 0 0.33 0.67 0 0 0 ]
Using the Equation (24), the membership degree of the comprehensive evaluation results at different ranks is:
M = η × F = [ 0.2192   0.2916   0.4748   0.0144   0 ]
According to the maximum membership principle, the operator’s mental workload is “Normal” level. Using Equation (25), the fuzzy comprehensive evaluation score is 2.844.

3.6. Validation of Evaluation Results

The developed assessment method was verified by the correlation analysis between the comprehensive evaluation scores and the other indices. The fuzzy comprehensive evaluation score of all participants were shown in Figure 2. According to the maximum membership principle, the number of participants with workload levels in “very low”, “low”, “normal”, and “high” were 3, 9, 8, and 2, respectively.
Table 6 shows the correlation analysis results between the comprehensive evaluation scores and other indices. The result showed that the comprehensive evaluation result has a positive correlation with pupil dilation (r = 0.75, p < 0.01), fixation rate (r = 0.84, p < 0.01), and SWAT (r = 0.85, p < 0.01).

4. Discussion

The purpose of this study is to develop a method to assess the mental workload of ship operators. Eye response, subjective score, and error rate were used to assess the mental workload. Due to the improvement of automation technology and the application of computer technology, the operators realize the monitoring of the system through the digital human–computer interface. The main tasks of the operator are cognitive tasks. Eye response index can reflect the change of mental workload level. SWAT is a subjective assessment method. To obtain the accuracy of the evaluation results, it needs to be combined with the objective evaluation method. Increasing operators’ mental workload, or overload, may cause delayed processing of information since the amount of information exceeds their ability to process information. In contrast, when their mental workload decreases from a suitable level, they feel bored and tend to make errors. Therefore, the human performance is affected by too high or too low mental workload. Therefore, correctly evaluating the workload of operators can optimize mental workload allocation, reduce human errors, improve system safety, and improve operator satisfaction. In the long-term, accurately evaluating the mental workload to realize reasonable mental workload allocation can maintain the mental workload of operators within a reasonable range and reduce the impact of long-term unreasonable mental workload on the health of operators.
The weight of the overall eye response indices is up to 0.49, which indicated that the measurement method is particularly important for evaluating operators’ mental workload. One reason is that the four eye response indices are directly related to cognitive workload, which can usually reflect the level of cognitive workload. In the digital human–computer interface operation, the operator’s task is mainly cognitive task, which determines that the workload is mainly cognitive workload. Therefore, the weight value of the overall eye response indices is the highest among all indices. The weight of blink rate is the highest in all eye response indices, because the blink rate increases with the increase of workload caused by memory task. Pupil dilation is related to cognitive control and attention. It reflects the difficulty of getting information from digital human–computer interface. The pupil diameter in highly complex tasks is larger than that in low complexity tasks. The fixation rate can also be used to reflect the mental workload status, which is positively correlated with task difficulty. Task complexity is an important factor affecting workload. When the operator’s fixation is higher on a specific area, it may indicate greater interest in the target, or the target is more complex. However, eye response methods may be affected by environmental and emotional factors. Therefore, this method is usually combined with subjective workload method to measure the mental workload. The SWAT weight value is 0.27, indicating that this method has a significant impact on the comprehensive evaluation results.
The comprehensive evaluation result indicated that the mental workload is in “normal” level, and the fuzzy evaluation score is 2.844. The effectiveness of the developed method was tested by analyzing the correlation between the comprehensive scores of all participants and other indices. The analysis result indicated that the comprehensive evaluation score is correlated with pupil dilation, fixation rate, and SWAT, demonstrating that the developed method can be used to evaluate the mental workload level. However, there are still some indices that do not show correlation with the comprehensive evaluation results, which may be due to the fact that these indices are not sensitive to mental workload.
However, the developed method has some limitations. First, the experience of participants is different from that of operators with many years of operation experience. Second, this experience used a small number of samples, which reduces the statistical ability. Third, the experiment was implemented on the simulation platform. Therefore, the experiment result needs to be validated in a field study.

5. Conclusions

This research constructed a method to evaluate mental workload. The evaluation index system is established by integrating various workload evaluation methods to improve the rationality of the indices selection. The CRITIC and AHP methods were used to determine the index weight to avoid deviation caused by subjective factors. Taking the average measurement results as the indices level, the comprehensive evaluation results indicated that the mental workload of the operator is at the “normal” level. In addition, the correlation analysis indicated that the scores of the comprehensive evaluation results are related to some evaluation indices. This means that the comprehensive evaluation method can effectively evaluate the mental workload. Therefore, this method can be used as a tool to evaluate the mental workload. Further, future study will investigate other samples which are above 30 years old (not student) to investigate the evaluation results of different ages.

Author Contributions

Conceptualization, S.Y. and Y.W.; methodology, Y.W. and F.L.; software, C.C.T.; validation, Y.W., C.C.T. and F.L.; writing—original draft preparation, Y.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki, and approved by the Institutional Review Board of Harbin engineering university.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study. Written informed consent has been obtained from the patient(s) to publish this paper.

Data Availability Statement

Not Applicable.

Acknowledgments

Thanks to the reviewers for their comments and suggestions. The authors would also like to thank the participants who helped the research.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Baker, C.C.; Seah, A.K. Maritime accidents and human performance: The statistical trail. In Proceedings of the MarTech Conference, Singapore, 22–24 September 2004. [Google Scholar]
  2. Nachreiner, F. Standards for ergonomics principles relating to the design of work systems and to mental workload. Appl. Ergon. 1995, 26, 259–263. [Google Scholar] [CrossRef]
  3. Moray, N. Mental workload since 1979. Int. Rev. Ergon. 1988, 2, 123–150. [Google Scholar]
  4. Gao, Q.; Wang, Y.; Song, F.; Li, Z.; Dong, X. Mental workload measurement for emergency operating procedures in digital nuclear power plants. Ergonomics 2013, 56, 1070–1085. [Google Scholar] [CrossRef] [PubMed]
  5. Watson, D.; Clark, L.A.; Tellegen, A. Development and validation of brief measures of positive and negative affect: The PANAS scales. J. Pers. Soc. Psychol. 1988, 54, 1063–1070. [Google Scholar] [CrossRef] [PubMed]
  6. Reid, G.B. The Subjective Workload Assessment Technique: A Scaling Procedure for Measuring Mental Workload; Elsevier Science Publishers: Amsterdam, The Netherlands, 1988. [Google Scholar]
  7. Thayer, R.E. Activation-deactivation adjective check list: Current overview and structural analysis. Psychol. Rep. 1986, 58, 607–614. [Google Scholar] [CrossRef]
  8. Hart, S.G.; Staveland, L.E. Development of NASA-TLX (Task Load Index): Results of empirical and theoretical research. Adv. Psychol. 1988, 52, 139–183. [Google Scholar]
  9. Wierwille, W.W.; Casali, J.G. A validated rating scale for global mental workload measurement applications. Proc. Hum. Factors Soc. Ann. Meet. 1983, 27, 129–133. [Google Scholar] [CrossRef]
  10. Cain, B. A review of the mental workload literature. In Defence Research and Development Toronto (Canada); Report No. RTO-TR-HFM-121-Part-II; NATO: Brussels, Belgium, 2007; pp. 1–34. [Google Scholar]
  11. Dyer, R.F.; Matthews, J.J.; Wright, C.E.; Yudowitch, K.L. Questionnaire construction manual annex literature survey and bibliography. In P-77-1, Operations Research Associates, Fort Hood (TX); Report: DTIC Document; Army Research Institute for the Behavioral and Social Sciences: Palo Alto, CA, USA, 1976. [Google Scholar]
  12. Rosch, J.L.; Vogel-Walcutt, J.J. A review of eye-tracking applications as tools for training. Cognit. Technol. Work. 2013, 15, 313–327. [Google Scholar] [CrossRef]
  13. Brookings, J.B.; Wilson, G.F.; Swain, C.R. Psychophysiological responses to changes in workload during simulated air traffic control. Biol. Psychol. 1996, 42, 361–377. [Google Scholar] [CrossRef]
  14. Van Orden, K.F.; Jung, T.-P.; Makeig, S. Combined eye activity measures accurately estimate changes in sustained visual task performance. Biol. Psychol. 2000, 52, 221–240. [Google Scholar] [CrossRef]
  15. Batmaz, I.; Ozturk, M. Using pupil diameter changes for measuring mental workload under mental processing. J. Appl. Sci. 2008, 8, 68–76. [Google Scholar] [CrossRef]
  16. Wierda, S.M.; van Rijn, H.; Taatgen, N.A.; Martens, S. Pupil dilation deconvolution reveals the dynamics of attention at high temporal resolution. Proc. Natl. Acad. Sci. USA 2012, 109, 8456–8460. [Google Scholar] [CrossRef] [PubMed]
  17. Wilson, G.; Schlegel, R.; Veltman, J. Operator functional state assessment. In Report No. RTO-TR-HFM-104 AC/323 (HFM-104) TP/48 RTO, RTO-TR-HFM-104 AC/323 (HFM-104) TP/48 RTO Technical Report HFM; NATO Research and Technology Organization: Washington, DC, USA, 2004. [Google Scholar]
  18. Van Orden, K.F.; Limbert, W.; Makeig, S.; Jung, T.-P. Eye activity correlates of workload during a visuospatial memory task. Hum. Factors 2001, 43, 111–121. [Google Scholar] [CrossRef] [PubMed]
  19. Nakayama, M.; Takahashi, K.; Shimizu, Y. The act of task difficulty and eyemovement frequency for the ‘Oculo-motor indices’. In Proceedings of the 2002 Symposium on Eye Tracking Research & Applications, New Orleans, LA, USA, 25–27 March 2002; pp. 37–42. [Google Scholar]
  20. Chen, Y.; Yan, S.; Tran, C.C. Comprehensive evaluation method for user interface design in nuclear power plant based on mental workload. Nucl. Eng. Technol. 2019, 51, 453–462. [Google Scholar] [CrossRef]
  21. Yan, S.; Wei, Y.; Tran, C.C. Evaluation and prediction mental workload in user interface of maritime operations using eye response. Int. J. Ind. Ergon. 2019, 71, 117–127. [Google Scholar] [CrossRef]
  22. Gawron, V.J. Human Performance, Workload, and Situational Awareness Measures Handbook; CRC Press: Boca Raton, FL, USA, 2008. [Google Scholar]
  23. Hwang, S.-L.; Yau, Y.-J.; Lin, Y.-T.; Chen, J.-H.; Huang, T.-H.; Yenn, T.-C.; Hsu, C.-C. Predicting work performance in nuclear power plants. Saf. Sci. 2008, 46, 1115–1124. [Google Scholar] [CrossRef]
  24. Yan, S.; Tran, C.C.; Wei, Y.; Habiyaremye, L.J. Driver’s mental workload prediction model based on physiological indices. Int. J. Occup. Saf. Ergon. 2017, 25, 476–484. [Google Scholar] [CrossRef] [PubMed]
  25. Milic, M.; Zupac, G. An objective approach to determining criteria weights. Vojnoteh. Glas. 2012, 60, 39–56. [Google Scholar] [CrossRef]
  26. Yan, S.; Yao, K.; Wei, Y.; Li, F.; Tran, C.C. Develop a framework of human reliability analysis to evaluate the human error probability in DMCR of NPPs. J. Nucl. Sci. Technol. 2022, 1–12. [Google Scholar] [CrossRef]
  27. Kongsberg. Operator’s manual Part 3. Machinery & operation, Kongsberg maritime. In Engine Room Simulator L11 5L90MC–VLCC Machinery & Operation MC90-IV; Report: SO-1136-D/11-Oct-05; Kongsberg Maritime: Kongsberg, Norway, 2005. [Google Scholar]
Figure 1. Operation interface of stern tube system.
Figure 1. Operation interface of stern tube system.
Jmse 10 01133 g001
Figure 2. The evaluation score of all participants.
Figure 2. The evaluation score of all participants.
Jmse 10 01133 g002
Table 1. Importance scale values.
Table 1. Importance scale values.
Importance Scale ValueDescription
1The elements i and j are equally important.
3The element i is moderately more important than the element j.
5The element i is strongly more important than the element j.
7The element i is very strong important than the element j.
9The element i is extremely more important than the element j.
2,4,6,8Intermediate value between two adjacent judgments.
ReciprocalIf the importance ratio of indexes i to factor j was aij, then the importance ratio of indexes j to indexes i was aji = 1/aij.
Table 2. The value of mean random consistency index RI.
Table 2. The value of mean random consistency index RI.
m12345678910
RI000.520.891.121.261.361.411.461.49
Table 3. Stern tube system operation procedure.
Table 3. Stern tube system operation procedure.
StepsOperation Procedure
1Ensure cooling water to stern tube cooler.
2Refill lubricating oil sump tank if necessary.
3Select required gravity tank using 3-way valve in filling line.
4Select correct gravity feed to stern tube.
5Ensure stern seal isolating valve is open.
6Start the lubricating oil pump in manual.
7When one pump is started, set the other pump in Auto.
8If the running pump is unable to maintain the level in the gravity tank, the stand-by pump starts automatically.
9Check level of oil in sealing tank, fill from make-up valve. Drain water if required.
10Stop of pumps to be carried out manually
Table 4. The evaluation results of all indices.
Table 4. The evaluation results of all indices.
Pupil DilationBlink RateFixation RateSaccadic RateSWATError Rate
Mean50.30.431.061.2823.80.02
Standard deviation6.20.090.430.253.190.04
Table 5. Parameter values in membership function.
Table 5. Parameter values in membership function.
Pupil DilationBlink RateFixation RateSaccadic RateSWATError Rate
K = 1 λ i k 400.20.51.65150.01
λ i ( k + 1 ) 450.40.92200.025
K = 2 λ i ( k 1 ) 400.20.51.3150.01
λ i k 450.40.91.65200.025
λ i ( k + 1 ) 500.61.32250.04
K = 3 λ i ( k 1 ) 450.40.90.95200.025
λ i k 500.61.31.3250.04
λ i ( k + 1 ) 550.81.71.65300.055
K = 4 λ i ( k 1 ) 500.61.30.6250.04
λ i k 550.81.70.95300.055
λ i ( k + 1 ) 6012.11.3350.07
K = 5 λ i ( k 1 ) 550.81.70.6300.055
λ i k 6012.10.95350.07
Table 6. Correlation between the evaluation score and other indices data.
Table 6. Correlation between the evaluation score and other indices data.
Pupil DilationBlink RateFixation RateSaccadic RateSWATError Rate
Pupil dilationCorrelation1
Sig.(2-tailed)
Blink rateCorrelation0.211
Sig.(2-tailed)0.34
Fixation rateCorrelation0.59 **0.391
Sig.(2-tailed)0.000.07
Saccadic rateCorrelation−0.050.29−0.181
Sig.(2-tailed)0.820.180.43
SWATCorrelation0.45 *0.330.58 **−0.211
Sig.(2-tailed)0.040.130.000.35
Error rateCorrelation0.290.100.220.090.57 **1
Sig.(2-tailed)0.180.670.320.700.00
Evaluation scoreCorrelation0.75 **0.420.84 **−0.340.85 **0.40
Sig.(2-tailed)0.000.060.000.120.000.06
**: Correlation is significant at 0.01 level. *: Correlation is significant at 0.05 level.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Yan, S.; Wei, Y.; Li, F.; Tran, C.C. Develop a Comprehensive Method to Evaluate the Mental Workload of Ship Operators. J. Mar. Sci. Eng. 2022, 10, 1133. https://doi.org/10.3390/jmse10081133

AMA Style

Yan S, Wei Y, Li F, Tran CC. Develop a Comprehensive Method to Evaluate the Mental Workload of Ship Operators. Journal of Marine Science and Engineering. 2022; 10(8):1133. https://doi.org/10.3390/jmse10081133

Chicago/Turabian Style

Yan, Shengyuan, Yingying Wei, Fengjiao Li, and Cong Chi Tran. 2022. "Develop a Comprehensive Method to Evaluate the Mental Workload of Ship Operators" Journal of Marine Science and Engineering 10, no. 8: 1133. https://doi.org/10.3390/jmse10081133

APA Style

Yan, S., Wei, Y., Li, F., & Tran, C. C. (2022). Develop a Comprehensive Method to Evaluate the Mental Workload of Ship Operators. Journal of Marine Science and Engineering, 10(8), 1133. https://doi.org/10.3390/jmse10081133

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop