Towards Sustainable Use of Machine Translation: Usability and Perceived Quality from the End-User Perspective

Kasperė, Ramunė; Horbačauskienė, Jolita; Motiejūnienė, Jurgita; Liubinienė, Vilmantė; Patašienė, Irena; Patašius, Martynas

doi:10.3390/su132313430

Open AccessArticle

Towards Sustainable Use of Machine Translation: Usability and Perceived Quality from the End-User Perspective

¹

Faculty of Social Sciences, Arts and Humanities, Kaunas University of Technology, 44249 Kaunas, Lithuania

²

Faculty of Informatics, Kaunas University of Technology, 44249 Kaunas, Lithuania

³

Institute of Biomedical Engineering, Kaunas University of Technology, 44249 Kaunas, Lithuania

^*

Author to whom correspondence should be addressed.

Sustainability 2021, 13(23), 13430; https://doi.org/10.3390/su132313430

Submission received: 2 November 2021 / Revised: 26 November 2021 / Accepted: 29 November 2021 / Published: 4 December 2021

Download

Browse Figures

Versions Notes

Abstract

:

Artificial intelligence-grounded machine translation has fundamentally changed public awareness and attitudes towards multilingual communication. In some language pairs, the accuracy, quality and efficiency of machine-translated texts of certain types can be quite high. Hence, the end-user acceptability and reliance on machine-translated content could be justified. However, machine translation in small and/or low-resource languages might yield significantly lower quality, which in turn may lead to potentially negative consequences and risks if machine translation is used in high-risk contexts without awareness of the drawbacks, critical assessment and modifications to the raw output. The current study, which is part of a more extensive project focusing on the societal impact of machine translation, is aimed at revealing the attitudes towards usability and quality as perceived from the end-user perspective. The research questions addressed revolve around the machine translation types used, purposes of using machine translation, perceived quality of the generated output, and actions taken to improve the quality by users with various backgrounds. The research findings rely on a survey of the population (N = 402) conducted in 2021 in Lithuania. The study reveals the frequent use of machine translation for a diversity of purposes. The most common uses include work, research and studies, and household environments. A higher level of education correlates with user dissatisfaction with the generated quality and actions taken to improve it. The findings also reveal that age correlates with the use of machine translation. Sustainable measures to reduce machine translation related risks have to be established based on the perceptions of different social groups in different societies and cultures.

Keywords:

machine translation; usability; quality; end-user perspective; Lithuania

1. Introduction

Machine translation and language technologies based on the development of artificial intelligence have altered societal perceptions and attitudes about multilingual communication. Such technologies are now daily used in personal circumstances and in high-risk settings in order to immediately overcome linguistic gaps at basically no cost [1]. However, today, little is known about end users’ attitudes towards machine translation despite its enormous impact on cross-cultural communication, where members of society find themselves in various situations with machine translation as the main means of communication [2]. In general, machine translation technology is expected to closer approximate human capabilities as artificial intelligence and machine learning advance [1,3]. The accuracy, quality, and efficiency of machine-translated texts of specific sorts can already be fairly good in particular language combinations.

Recently, more attention is being devoted to fair machine translation, which is understood as ethical and sustainable machine translation [4]. The authors bring into focus the fact that for much of the history of machine translation, questions of ethics and sustainability were largely ignored [4]. Sustainability, as seen by the authors, is addressed in translation studies in two broad ways: the first approach considers current (unsustainable) practices in the translation industry and the second investigates the future of the translation profession and, thus, of translator education, especially in the light of technological change. The combination of different perspectives in machine translation shows how complex and dynamic the object under investigation is. Recent developments of machine translation technological solutions encompass social, economic, technological, organizational and operational factors, but none of these perspectives can alone provide an adequate description of the transformations that translation is currently facing (see [4]). Thus, our study, which is part of a more extensive project focusing on the societal impacts of machine translation, concentrates on the current status of machine translation uses among different society members in Lithuania and may provide implications as to the machine translation acceptability in light of technological development. The current study is aimed at revealing the attitudes towards the usability and quality as perceived from the end-user perspective. The research questions revolve around the types of machine translation, purposes of machine translation usage, perceived quality of the generated output, and actions taken by users of different backgrounds to improve the quality. The paper consists of the following sections: introduction, literature overview which concentrates on the latest research in the field of machine translation, its acceptability, assessment of quality and efficiency as well as the impact on society. Next, a section on methods and materials seeks to clarify the research instruments and describe the research participants and data analysis. The research results are presented followed by the discussion and conclusions.

2. Literature Overview

The use of machine translation in everyday communication and issues related thereof [5,6,7,8] constitute an emerging research area in the contemporary academic field of translation. Machine translation in everyday personal communication is usually applied while communicating with overseas relatives or traveling. However, the acceptability of such experiences can be mixed [8]. Success was reported when using machine translation apps to communicate with relatives overseas; however, participants struggled between the spoken and written forms of languages, and issues appeared relating to grammar and syntax, making communication difficult [8].

Machine translation is a widely used tool for multilingual groups because it allows all members to speak (write) and listen (read) in almost any language [5,9]. When machine translation is used for multilingual communication purposes, it is critical that the quality of machine translation is excellent. However, not every user is aware of the errors and risks related to this technology when it comes to everyday usage; according to research, the quality of today’s machine translation systems is insufficient for supporting intercultural conversation [10]. Many authors pay attention to the serious risks that machine translation errors pose in the field of health care, legal services, culture and media as well as many areas of social life [1,11,12]. The misuse of machine translation can have particularly serious consequences in high-stakes settings [1]. Rossetti et al. [13] also focused on risk involving contexts and performed a study to learn more about the role and impact of machine translation and post-editing knowledge on public understanding and confidence in communications messages broadcast for alerting the public about a certain emergency. They [13] asked 61 people to read crisis communication messages and rate comprehensibility and trust in the messages using ratings and open-ended questions. The most common explanation of why the participants used machine translation was assimilation, which demonstrated that the end users were potentially using machine translation to translate crisis messages provided in a language they were unfamiliar with [13]. Overall, machine translation is not only a means for personal or everyday communication but is also being relied on in scenarios of utmost significance.

Despite the fact that machine translation is improving in terms of speed, language compatibility and mobility, it also has some drawbacks in terms of accuracy, and cannot yet totally replace humans unless they develop their language abilities and knowledge and are able to grasp and appropriately use words without the aid of machine translation or without the assistance of a translator [3]. Therefore, it is necessary to research how people perceive and accept machine translation. According to [14], the acceptability of machine translation is measured via usability, satisfaction and quality, where usability may be measured via tasks recorded using an eye tracker, quality may be evaluated via a translation quality assessment questionnaire answered by professional translators, and satisfaction may be measured via surveys, post-task questionnaires and translators’ rankings. The study accomplished by [15] reveals that different languages have different thresholds of machine translation quality [15]. For example, machine translation will have problems in many languages that contain a lot of homonyms. Vieira et al. [1] claimed that the results of quality assessments of machine translation for health care tend to be more favorable for language pairs involving English and other Western European languages [16,17]. On the other hand, when it comes to minor language use, e.g., Lithuanian–English pair, it is assumed that machine translation systems still need more extensive training to provide high quality of translation [18].

It is also important to compare not only different language combinations, but also different machine translation systems, which was attempted by [19], who reviewed two machine translation systems and six types of machine-generated texts in order to see which system is more acceptable and useful from the perspective of the end-user, and found out that Google Translate is more acceptable. Vanjani and Aiken [20] compared eight translation systems and seven languages in all combinations in order to determine which machine translation system was the most useful to users and found out that Google Translate produced better results overall, although it was more accurate when the source and target languages were similar.

One of the wide areas of application of automated translation is user-generated content. Saadany et al. [21] tested the ability of a machine translation online system to translate user-generated content. The researchers uncovered a variety of linguistic obstacles in translation and concluded that using neural machine translation technologies to translate raw texts could be harmful, as it could send users a message that is distinct from or even contradicts the intended meaning [21]. Poncelas et al. [22] also investigated the use of a machine translation system to translate user-generated content in order to see if machine-translated sentences have the same meaning and are as fluent as the original ones. Despite the fact that the machine translation system produced comprehensible translations, the meaning, connotations and associations expressed by the source sentence were not always the same [22].

Though research into perception towards machine translation is very recent, there have been several attempts to conduct surveys of users’ views on machine translation and their strategies in assessing such translations. Pérez Macías et al. [23] with the help of a preliminary survey, selected 10 translators with experience in the migratory context and conducted a qualitative study about translators’ perceptions towards machine translation and post-editing as well as the usefulness of machine translation in the context of migration. The results of their research showed that the translators were aware of the increasing popularity of machine translation and post-editing in the translation market but they had generally negative attitudes towards both machine translation and post-editing [23]. A theoretical description of the translators’ resistance to post-editing work was attempted by Sakamoto [24] who relied on Bourdieu’s sociological framework of analysis in order to examine the positions of translators and post-editors in the field of translation and its mechanism of emotional impact. Sakamoto [24] distinguished the post-editor’s position as a new category of workers in the social system of translation and emphasized the fact that this position in the overall system of post-edited machine translation is yet to be determined.

Translation quality was also at the focus of the study carried out by Gaspari et al. [25] who surveyed a group of 438 translators, teachers and language service providers. According to the study, the majority of respondents (68%) used free online machine translation systems and acknowledged that these were not adapted to their needs. Garcia [26] suggested that rather than assessing the quality of translation using criteria for human translation assessment, a more pragmatic approach should be taken so that the translation is understood “by the educated bilingual rather than by the professional translator” [26] (p. 10). The assessment of quality, therefore, needs to not only reflect the context of the source document, but also the simplicity of the language [12]. Rossi and Chevrot [27] researched the current usage, perceptions and acceptance of machine translation and post-editing in 15 language departments at the European Commission by carrying out a survey analysis and conducting interviews in addition to other methods. The researchers have found out that technology adoption was high but the perceptions of machine translation had a significant impact on the perceived usefulness of machine translation [27]. Other researchers conducted studies on how end-users evaluate machine translation [28,29] and concluded that machine translation technology has made people’s daily work easier by offering quick access to information collection, processing and communication, but it is not free of biases affecting users and society as a whole. Therefore, bearing in mind that machine translation is designed for people and by people, it is critical to consider the backgrounds of those working in machine translation development at all levels and those using machine translation for various purposes and contexts.

There is also a growing body of research on how machine translation can be implemented for various purposes in multilingual settings, e.g., in healthcare systems [12,30], legal spheres [31], migratory contexts [23,32] and other areas, which have prompted investigations on how machine translation is used in a certain context or how it is perceived or assessed [1]. Thus, in a broad sense, the usability, quality and user satisfaction with end machine translation and the impact it has on society belong to a research field that comes into focus and deserves a deeper analysis. Therefore, it is essential to create sustainable measures in order to reduce risks associated with the misuse of machine translation [33,34] and ensure that machine translation is developed and used in a sustainable manner.

3. Materials and Methods

3.1. Research Instrument

For the purposes of this research, in July through October 2021, a survey was conducted in order to determine the perceptions of use, quality, acceptability overall, and impact on the society of machine translation in Lithuania. The survey was designed and conducted using LimeSurvey. An emphasis in the survey was put on the end-user experience while using machine translation tools and the perceived satisfaction with the generated quality of the machine-translated text. In total, 402 respondents filled in the questionnaire online. The survey was multifaceted, covering different questions, namely those related to demographic data of the respondents (age, gender, place of residence, education, occupational status, and proficiency in languages), reasons for not using machine translation, questions related to the particular machine translation systems (e.g., Google Translate, DeepL, Microsoft Bing, and Apple Translate), digital devices (computer, tablet PC and smart phone) and machine translation types (text-to-text, speech-to-text, text-to-speech, speech-to-speech, and image-to-text) used by the respondents, the purposes of machine translation usage (work, studies, household environments, entertainment, traveling, public and private sector services, medical and legal environments, personal communication, news, etc.), as well as a satisfaction evaluation with the generated quality, and additional actions taken to improve the text. The analysis of the results reported in this paper are based on the variables of age and education.

The questionnaire for the survey was designed based on the previous research on the components of acceptability of machine translation [14,19], societal impacts of machine translation [1,3], machine translation in small and low-resource languages [35], the attitudes towards machine translation [23], etc. A pilot survey was tested on 30 volunteering respondents. Based on their remarks and suggestions, the final version of the survey was designed. The survey questionnaire was freely available online and was open for 5 months. Social media channels were used to attract respondents representing the entire population.

3.2. Research Participants

The survey was targeted at all age groups, except for those aged 17 years old and under. The majority of the respondents represented the age group of 18–29 years (38%). The second largest group was that of respondents aged 40–49 years (24%), followed by those aged 30–39 years (16%) and 50–59 years (14%). The smallest age groups represented in the survey were 70 years and more (1%) and 60–69 years (4%). Although respondents who were 17 years old and under were not targeted in the survey, there were 2% of the total number of responses within the age group. The answers of such respondents were excluded from the questionnaire. The question about the occupation was optional in the survey. Of those who indicated their occupation in the group of the respondents aged 18–29 years, the majority was students, but other occupations indicated by the respondents were administrators, project managers, transport managers, librarians, etc. In the groups of the respondents aged 30–39, 40–49 and 50–59 years, the occupations mentioned were social workers, researchers, teachers, IT specialists, lawyers, communications specialists, engineers, doctors, among others. In many cases, these professions require a higher education in Lithuania. Among the respondents aged 60–69 and over 70 years, the majority were retired, but some also mentioned working as librarians, teachers, marketing managers, administrators, drivers. There were 81% of women and 17% of men, while 2% chose not to indicate their gender. The absolute majority of the respondents (96.8%) indicated Lithuanian as their mother tongue. Russian speakers represented 2.6%, while 1% of the respondents indicated other languages as their mother tongues, namely Polish, Latvian, Italian, etc. All respondents whose native language was something other than Lithuanian indicated knowing it to the level of a proficient user or a native speaker. A total of 60.5% of all respondents indicated knowing English and 36.9% indicated knowing Russian to the level of a proficient user or a native speaker.

In terms of education, there were 70% of the respondents with higher education, 11% had incomplete higher education, 10% had high school (secondary) education, 4% had vocational training, 3% indicated having not finished their secondary school, and 1% did not indicate their education level. The questionnaire also asked questions about the occupational (employment) status of the respondents. There were 51% of fully employed respondents, 23% of students, 6% of freelancers, 4% had their own business, and 4% were secondary school students. There were 3% of part-time employed and retired each. A total of 2% of the respondents were not working, but were looking for a job, and 1% of the respondents were unemployed and not looking for job. There were 3% of the respondents who did not indicate their occupational status and chose the option “other”.

The residence place of the majority of respondents was indicated as large cities and suburbs of the country (71%), followed by smaller cities and towns (18%), and villages (6%). A small number of the respondents (2%) indicated their place of residence to be abroad, i.e., a foreign country, and 2% did not indicate their place of residence. Approximately 11% (N = 46) of the respondents indicated that they did not use machine translation. Their answers are not included into data analysis. These respondents provided reasons for not using machine translation. The majority did not use machine translation, as they were unfamiliar with the technology (37%), were not satisfied with the output quality (20%), or had no needs to use it (15%). Other indicated reasons were reliance on professional translators, lack of trust in the service, inconvenient machine translation tools, etc. The total number of the responses included into the data analysis was 347.

3.3. Data Analysis

IBM SPSS Statistics 27 was used for descriptive and relationship analysis. Descriptive statistics were calculated for quantitative nominal and ordinal data. The relationships between ordinal data were evaluated by Spearman correlation coefficients. The relationship was considered statistically significant when p < 0.05. MS Excel was used for plotting and data analysis.

4. Results

The existing scientific literature measures acceptability from three different perspectives, namely usability, quality and satisfaction. In this study, the responses relevant for each component were analyzed employing frequency calculations and the Spearman correlation coefficient.

Among the users of machine translation, the age group of 18–29 years was dominant (see Figure 1).

In terms of different machine translation systems, the vast majority of the respondents (96.6%) indicated using Google Translate as the main machine translation tool, of whom 56.2% indicated using Google Translate very frequently and 19.4% frequently. There were 8% who used other machine translation systems (Apple Translate, Tilde, Microsoft Bing, Yandex, or eTranslation). The respondents indicated using them rarely, except for DeepL, which was indicated by 16.4% of the respondents, of whom 6.5% used it very often, and 4.5% used often.

The responses on the usage of machine translation in different digital devices were varied. However, the majority of the respondents (95%) indicated using machine translation on a computer and approximately three-fourths of them used it on a smart phone (78%). Meanwhile, only 15% indicated using tablet PCs for machine translation purposes.

Figure 2 presents the results of the usage of machine translation types, namely text-to-text, speech-to-text, text-to-speech, speech-to-speech, and image-to-text. As seen, the vast majority of the respondents (97%) noted using text-to-text translation. Other types were used by a small number of the respondents: image-to-text—14%; text-to-speech—12%; speech-to-text—8%; and speech-to-speech—7%.

Figure 3 presents the overview how the respondents of different age use different machine translation types. As seen, text-to-text machine translation type was prevalent in all age groups. The respondents aged 70 years and over used text-to-text machine translation type exclusively. The younger the age, the greater the variety of machine translation types used was reported.

The answers of the respondents statistically significantly differed in terms of usage of machine translation types. A statistically significant negative Spearman correlation coefficient (−0.263;

p < 0.001

) was found between the age group and the number of used machine translation types, which implies that the younger respondents were more willing to use multiple types of machine translation. A statistically significant positive correlation (0.183;

p < 0.001

) was found between the respondents’ age group and the text-to-text type, which implies that older age groups tend to use text-to-text more often. Although the number of the respondents using speech-to-speech type was low, a statistically significant positive correlation (0.200;

p < 0.001

) was determined between the age group and use of the speech-to-speech machine translation type.

Figure 4 presents the purposes of machine translation use. The majority of the respondents indicated using machine translation for work (N = 283) and for research or studies (N = 174). Other purposes were indicated as follows: household tasks (N = 145), entertainment (N = 119), personal communication (N = 116), news (N = 81), travel (N = 82), internet shopping (N = 77), legal services (N = 2), health care (N = 8), and public and private sector services (N = 17).

Differences between the age groups were determined in terms of purposes for using machine translation (see Table 1). As presented in Table 1, the respondents of the age group of 18–29 years used machine translation mostly for the purposes of research and studies (77.6%), and less often for work, entertainment, personal communication, news, traveling, household, internet shopping, public and private sector services, health care, legal situations, etc. The respondents of the age group of 30–39 years used machine translation mostly for the purposes of work (78.2%), and less often for research and studies, household, entertainment, personal communication, etc. The respondents of the age group of 40–49 years used machine translation mostly for the purposes of work (85.9%), and less often for household, research and studies, traveling, etc. The respondents of the age group of 50–59 years used machine translation mostly for the purposes of work (76.6%), and less often for household, personal communication, entertainment, etc. The respondents of the age group of 60–69 years used machine translation mostly for the purposes of personal communication (50%), and less often for work, household, entertainment, and other purposes. The respondents of the age group of 70 and more years indicated using machine translation exclusively for household and health care purposes.

There were statistically significant differences between different age groups in terms of the purposes of machine translation usage. A statistically significant negative Spearman correlation coefficient (−0.210;

p < 0.001

) was found between the age group and use of machine translation for research and study purposes. The older the age of the respondents, the less usage of machine translation for research and study purposes. There was a statistically significant correlation determined between the age group and use of machine translation for work (0.345;

p < 0.001

). Statistically significant correlations were found between the age group and the use of machine translation for traveling purposes (0.173;

p < 0.001

); use of machine translation for services in private and public sectors (0.173;

p = 0.001

); use of machine translation legal situations (0.171,

p < 0.001

); use of machine translation for health care purposes (0.206;

p < 0.001

); and use of machine translation for internet shopping (0.172;

p < 0.001

).

Figure 5 provides the results of the satisfaction of the respondents with machine translation. Almost half of the respondents (47%) were neither satisfied nor dissatisfied with the quality of machine translation. Only 4% of the respondents were very satisfied, 40% were satisfied, 7% were dissatisfied, and 1% of the respondents were very dissatisfied. A statistically significant negative correlation (−0.171;

p = 0.001

) was determined between the age group and satisfaction with the machine translation quality level.

Additionally, we also asked the respondents to indicate the source and the target languages when they used machine translation. Of all respondents, 40.6% indicated using machine translation when translating from a native language to a foreign language very often or often; 48.1% indicated using such systems when translating from a foreign language to a native language; and 28.8% indicated using it when translating from a foreign to another foreign language. Since the machine translation quality is somewhat lower with Lithuanian as a low-resource language, the findings on satisfaction of users with machine translated may be considered predictable.

Education is another factor that may affect the machine translation practices of end-users. According to the education level, the respondents were mostly representatives of the higher education (71.7%, N = 281), followed by those with incomplete higher education (11.5%, N = 45), secondary education (9.7%, N = 38), vocational training (4.1%, N = 16), and incomplete secondary education (1.8%, N = 7), NA (1.3%, N = 5). This finding is not at all surprising and relates to the statistical data that the greater part of the Lithuanian population (approximately 68% of women and approximately 47% of men) have a higher education of at least the bachelor’s level [36].

In the study, we analyzed the interrelation between the level of education and the purposes for which machine translation was used (see Table 2). The respondents with the higher education indicated using machine translation mostly for work (79.0%), and less often for household tasks, research and studies, entertainment, personal communication, traveling, news, internet shopping, etc. The respondents with the incomplete higher education indicated using machine translation mostly for research and studies (88.6%), and less often for household tasks, entertainment, personal communication, and less often for work, traveling, news, internet shopping, and public and private sector services. The respondents with the secondary education indicated using machine translation mostly for research and studies (97.1%), household tasks (63%), personal communication (51%), and less often for entertainment (49%), news, work, traveling, internet shopping, health care, and public and private sector services. The respondents with the vocational training indicated using machine translation mostly for work (55%), and less often for research and studies, personal communication, entertainment, household tasks, news, traveling, and health care. The respondents with the incomplete secondary indicated using machine translation mostly for research and studies (100%), entertainment (54%), and news (54%), and less often for household tasks, personal communication, traveling, work, public and private sector services, and internet shopping. These findings reveal that the respondents with a higher level of education tend to use machine translation more for work purposes. However, generally, the most common purposes of machine translation use indicated by all respondents are work, research and studies, and household tasks. Health care, legal situations, public and private sector services, and internet shopping were the less frequently mentioned purposes of machine translation use by all respondents.

Figure 6 presents the results of the usage of machine translation types, namely text-to-text, speech-to-text, text-to-speech, speech-to-speech, and image-to-text among the respondents with a different education level. As seen, the text-to-text machine translation type was used by the majority of the respondents in all groups (ranging from 91% in the group of the respondents with vocational training to 100% in the group of the respondents with incomplete secondary education). The speech-to-speech machine translation type was the least frequently chosen option in all groups (ranging from 6% in the group of respondents with higher education to 18% in the group of the respondents with vocational training).

Figure 7 presents the results of satisfaction with machine translation from the perspective of the education level. The mean satisfaction level is 3.4, the median is 3, the mode is 3, and the standard deviation is 0.7. It may be inferred from the data that there were only 5% of respondents with a higher education who were very satisfied with machine translation quality, followed by 40% of those who were satisfied. Half of the respondents (48%) were neither satisfied nor dissatisfied. Only a small number of the respondents with the higher education were dissatisfied (5%) and very dissatisfied (1%) with the quality of machine translation. The distribution of the respondents who had incomplete higher education was as follows: 5% very satisfied, 43% satisfied, 41% neither satisfied nor dissatisfied, and 11% dissatisfied. The distribution of the respondents who had secondary education was as follows: 46% satisfied, 46% neither satisfied nor dissatisfied, and 9% dissatisfied. The distribution of the respondents who had vocational training was as follows: 9% very satisfied, 27% satisfied, 55% neither satisfied nor dissatisfied, and 9% dissatisfied. The distribution of the respondents who had incomplete secondary education was as follows: 8% very satisfied, 54% satisfied, and 38% neither satisfied nor dissatisfied. All these findings seem to reveal that the respondents with a lower level of education were more satisfied with machine translation than dissatisfied in comparison with those who have a higher level of education. Yet the correlation proved to be statistically insignificant (Spearman correlation coefficient −0.014,

p = 0.789

).

The respondents also provided answers as to the additional actions taken to change and modify the raw machine translated text in order to improve it (see Figure 8). The respondents mostly indicated that they made modifications but did not change the text substantially (37%), followed by 12% of the respondents who made minimal changes, and 7% of the respondents who made substantial changes. Only 5% of the respondents indicated making no changes at all and using the raw text as generated by a machine translation system.

The data on additional actions taken to change and modify the raw machine translated text for its improvement were analyzed for groups of the respondents with a different level of education (see Figure 9).

It was determined that in all groups, the respondents tended to translate texts themselves using machine translation suggestions (38% in the group of the respondents with higher education; 43% in the group of the respondents with incomplete higher education; 49% in the group of the respondents with secondary education; and 31% in the group of the respondents with incomplete secondary education), except for the group of the respondents with vocational training (9%). The respondents who indicated making substantial changes to the text were those with vocational training—27%, higher education—8%, and incomplete higher education—7%. The option of making some changes without substantial modification to the text was chosen by 37% of the respondents with higher education and secondary education each, 38% of the respondents with incomplete secondary education, 32% of the respondents with incomplete higher education, and 18% of the respondents with vocational training. Minimal changes were made by the respondents as follows: incomplete secondary—31%, vocational training—27%, higher education and secondary education—11% each, and incomplete higher education—9%. At the same time, the respondents with vocational training mostly indicated making no changes and using the raw text as generated by a machine translation system (18%). There were only 4% of the respondents with higher education, 7% with incomplete higher education, and 3% of the respondents with secondary education who did not make any changes and used the raw machine translated text. The correlation between the education level and the reported level of text changes was found to be statistically insignificant (Spearman correlation coefficient 0.024,

p = 0.637

).

5. Discussion

There is scarce research in relation to machine translation usage among end-users as noted by other researchers conducting studies in this field [1,37]. The research on machine translation conducted by researchers in various countries mainly focuses on professional translation practices and impacts of machine translation in the translator’s profession.

Our study focuses around the purposes of using machine translation, perceived quality of the generated output, and actions taken to improve the quality by users with various backgrounds. The results of this study reveal the uses of machine translation by different groups of population in terms of age and education. Although previous studies have highlighted the risks in relation to machine translation used for legal and healthcare purposes [1], our study results indicate that ordinary end-users very rarely use machine translation for these purposes. This fact might be interpreted as the Lithuanian population and context being relatively monolingual and monocultural, requiring no multilingual communication in legal and healthcare or public and private sector services. Nevertheless, the legal and medical sectors have to deal with immigrants and foreign residents, who may encounter difficulties by not being able to speak the local language. In such settings, they are left with very few options, and in emergency situations, they might rely on machine translation. In further studies, it might be relevant to target such respondents in order to find out what difficulties they face in multilingual communication. On the other hand, the purposes indicated for the use of machine translation might also be the result of the sample with students representing approximately one-fourth of the respondents who might need to use machine translation for research and studies purposes more often than for other reasons.

The quality of machine translation was researched in a plethora of research studies, yet mostly from the professional users’ perspectives. Perceived end-user quality was studied in various experimental research designs. In our study, the results indicate that end-users acknowledge the drawbacks of machine translation and are not fully satisfied with the generated output. The higher the level of education, the less satisfied the respondents are with the machine translation quality. This finding may relate to other numerous research studies conducted with professional translators and language experts ([38,39,40], among others). From the perspective of the education level, it might be argued that our results are, to some extent, in line with those by other authors, showing dissatisfaction with the quality of machine translation increasing with the higher level of education. However, all respondents seem to be unsure when asked about their satisfaction with the quality of machine translation output, as approximately half of them do not have a clear opinion about it. They are neither satisfied nor dissatisfied, which may also be related to the fact that to a greater or lesser extent, they make additional actions to the raw machine translated text in order to improve it. Only a small part of the respondents did not take any action and used the raw machine-translated text. This finding relates to the observations that it is important to improve the quality of the translations, even in such non-formal situations as personal communication [10].

Machine translation is becoming more and more popular not only because of its technical advancement, but also due to its accessibility to end-users who may choose different modes of translation, i.e., translation not only that is text-to-text, but also translation from/to speech and images. The results of the current study show that the majority of the respondents use translation from text to text but they are also aware of and use translation for image to text, text to speech, speech to text and even speech to speech. What is more, the end-users who indicated that they relied on translation from speech to speech belonged to a younger age group and those who mainly used translation from text to text belonged to older age groups.

The findings of our study are hardly comparable with other research conducted on machine translation in other languages. As Lithuanian may be considered a low-resource language, the machine translation output is consequently of lower quality in comparison with that obtained for such big languages as Chinese, English, Russian, French, or Spanish, which receive far greater investment into data acquisition and parallel corpora development. Since the absolute majority of the respondents in the survey were Lithuanians who mostly indicated using machine translation when translating either from or to the Lithuanian language, their lower satisfaction with, perceived quality of machine translated text, and consequently acceptability might have been predicted to some degree. Users of machine translation output in Lithuanian might be dissatisfied with the machine translation, as, generally, the raw output demonstrates issues of fluency and accuracy, resulting in mistranslation, ambiguous translation, untranslated text, omission, and addition, as well as grammatical errors, such as agreement, incorrect part of speech, etc. [18,41]. This might imply that users of machine translation output in Lithuanian are generally more dissatisfied and, therefore, should be cautious about using raw machine translated text and taking them for granted. Therefore, sustainable measures, including different means to increase awareness of end-users as to the quality generated by machine translation system, should be established based on the perceptions of different social groups in different environments and settings.

6. Conclusions

Machine translation is becoming a common everyday practice for a variety of reasons and purposes. The study reveals that Google Translate is the best known and most commonly used system. Age and education play a role in how end-users use machine translation, perceive its usability and quality. In our study, the most common uses include work, research and studies, and household environments. Although younger end-users use machine translation more frequently compared with other age groups, the type of the most commonly employed machine translation remains text-to-text in all age groups. The study results also show a correlation between the level of education and the purposes of machine translation usage. The higher the education level, the more the respondents use machine translation for research and studies as well as work. A higher level of education also correlates with user dissatisfaction with the generated quality and actions taken to improve it. The findings demonstrate a tendency for the respondents with a lower level of education to be more satisfied with machine translation than dissatisfied in comparison with those with a higher level of education. In all groups of education levels, using raw machine translated text without any modifications is not an option, except for respondents with vocational training. All other groups of respondents tend to translate texts themselves while only relying on machine translation suggestions. Overall, the study is important since it provides an overview of how machine translation is perceived and employed by a wide variety of end-users. The limitation of this study lies in its sample, which includes a relatively high number of representatives of a younger age (18–29 years). Therefore, further research might be focused on a better representation of the entire population in order to confirm the findings.

Author Contributions

Conceptualization, R.K., J.M., J.H.; methodology, R.K., J.H., J.M., V.L., I.P., M.P.; investigation, R.K., J.H., J.M., V.L., I.P., M.P.; software, I.P., M.P.; data curation, I.P., M.P.; writing—original draft preparation, R.K., J.M., J.H., V.L.; writing—review and editing, R.K., J.H., J.M., V.L., I.P., M.P; visualization, I.P., M.P. All authors have read and agreed to the published version of the manuscript.

Funding

This research has received funding from the Research Council of Lithuania (LMTLT, agreement No S-MOD-21-2).

Institutional Review Board Statement

Ethical review and approval were waived for this study, due to the absence of the collection and/or processing of personal or any sensitive data at any stage of the research.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

All data are available from the corresponding author upon reasonable request.

Conflicts of Interest

The authors declare no conflict of interest.

References

Vieira, L.N. Machine translation in the news: A framing analysis of the written press. Transl. Spaces 2020, 9, 98–122. [Google Scholar] [CrossRef]
Asscher, O.; Glikson, E. Human evaluations of machine translation in an ethically charged situation. New Media Soc. 2021, OnlineFirst, 14614448211018833. [Google Scholar] [CrossRef]
Hoi, H.T. Machine Translation And Its Impact In Our Modern Society. Int. J. Sci. Technol. Res. 2020, 9, 1918–1921. [Google Scholar]
Kenny, D.; Moorkens, J.; do Carmo, F. Fair MT: Towards ethical, sustainable machine translation. Transl. Spaces 2020, 9, 1–11. [Google Scholar] [CrossRef]
Yamashita, N.; Ishida, T. Effects of Machine Translation on Collaborative Work. In Proceedings of the 2006 20th Anniversary Conference on Computer Supported Cooperative Work (CSCW ’06), Banff, AB, Canada, 4–8 November 2006; Association for Computing Machinery: New York, NY, USA, 2006; pp. 515–524. [Google Scholar] [CrossRef]
Doherty, S. Translations| The Impact of Translation Technologies on the Process and Product of Translation. Int. J. Commun. 2016, 10, 23. [Google Scholar]
Pituxcoosuvarn, M.; Ishida, T. Multilingual communication via best-balanced machine translation. New Gener. Comput. 2018, 36, 349–364. [Google Scholar] [CrossRef]
Panayiotou, A.; Hwang, K.; Williams, S.; Chong, T.W.H.; LoGiudice, D.; Haralambous, B.; Lin, X.; Zucchi, E.; Mascitti-Meuter, M.; Goh, A.M.Y.; et al. The perceptions of translation apps for everyday health care in healthcare workers and older people: A multi-method study. J. Clin. Nurs. 2020, 29, 3516–3526. [Google Scholar] [CrossRef] [PubMed]
Yasuoka, M.; Bjorn, P. Machine Translation Effect on Communication: What Makes It Difficult to Communicate through Machine Translation? In Proceedings of the 2011 Second International Conference on Culture and Computing, Kyoto, Japan, 20–22 October 2011; pp. 110–115. [Google Scholar] [CrossRef]
Shigenobu, T. Evaluation and Usability of Back Translation for Intercultural Communication. In Usability and Internationalization. Global and Local User Interfaces; Aykin, N., Ed.; Springer: Berlin/Heidelberg, Germany, 2007; pp. 259–265. [Google Scholar]
Nurminen, M.; Koponen, M. Machine translation and fair access to information. Transl. Spaces 2020, 9, 150–169. [Google Scholar] [CrossRef]
Taylor, R.M.; Crichton, N.; Moult, B.; Gibson, F. A prospective observational study of machine translation software to overcome the challenge of including ethnic diversity in healthcare research. Nurs. Open 2015, 2, 14–23. [Google Scholar] [CrossRef] [Green Version]
Rossetti, A.; O’Brien, S.; Cadwell, P. Comprehension and Trust in Crises: Investigating the Impact of Machine Translation and Post-Editing. In Proceedings of the 22nd Annual Conference of the European Association for Machine Translation, Lisbon, Portugal, 3–5 November 2020; European Association for Machine Translation: Lisboa, Portugal, 2020; pp. 9–18. [Google Scholar]
Castilho, S. Measuring Acceptability of Machine Translated Enterprise Content. Ph.D. Thesis, Dublin City University, Dublin, Ireland, 2016. [Google Scholar]
Castilho, S.; O’Brien, S. Acceptability of machine-translated content: A multi-language evaluation by translators and end-users. Linguist. Antverp. New Ser. Themes Transl. Stud. 2018, 16, 120–136. [Google Scholar] [CrossRef]
Chen, X.; Acosta, S.; Barry, A.E. Machine or Human? Evaluating the Quality of a Language Translation Mobile App for Diabetes Education Material. JMIR Diabetes 2017, 2, e13. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Dew, K.N.; Turner, A.M.; Choi, Y.K.; Bosold, A.; Kirchhoff, K. Development of machine translation technology for assisting health communication: A systematic review. J. Biomed. Inform. 2018, 85, 56–67. [Google Scholar] [CrossRef] [PubMed]
Kasperavičienė, R.; Motiejūnienė, J.; Patašienė, I. Avaliação da qualidade da produção de tradução automática: Abordagem de avaliação cognitiva em um experimento com rastreamento ocular. Texto Livre Ling. E Tecnol. 2020, 13, 271–285. [Google Scholar] [CrossRef]
Azer, H.S.; Aghayi, M.B. An Evaluation of Output Quality of Machine Translation (Padideh Software vs. Google Translate). Adv. Lang. Lit. Stud. 2015, 6, 226–237. [Google Scholar]
Vanjani, M.; Aiken, M. A Comparison of Free Online Machine Language Translators. J. Manag. Sci. Bus. Intell. 2020, 5, 26–31. [Google Scholar] [CrossRef]
Saadany, H.; Orasan, C.; Quintana, R.C.; do Carmo, F.; Zilio, L. Challenges in Translation of Emotions in Multilingual User-Generated Content: Twitter as a Case Study. arXiv 2021, arXiv:2106.10719. [Google Scholar]
Poncelas, A.; Lohar, P.; Hadley, J.; Way, A. The Impact of Indirect Machine Translation on Sentiment Classification. In Proceedings of the 14th Conference of the Association for Machine Translation in the Americas (Volume 1: Research Track), Virtual, 6–9 October 2020; pp. 78–88. [Google Scholar]
Macías, L.P.; del Mar Sánchez Ramos, M.; Rico, C. Study on the Usefulness of Machine Translation in the Migratory Context: Analysis of Translators’ Perceptions. Open Linguist. 2020, 6, 68–76. [Google Scholar] [CrossRef]
Sakamoto, A. Why do many translators resist post-editing? A sociological analysis using Bourdieu’s concepts. J. Spec. Transl. 2019, 31, 201–216. [Google Scholar]
Gaspari, F.; Almaghout, H.; Doherty, S. A survey of machine translation competences: Insights for translation technology educators and practitioners. Perspectives 2015, 23, 333–358. [Google Scholar] [CrossRef]
Garcia, I. Is machine translation ready yet? Target. Int. J. Transl. Stud. 2010, 22, 7–21. [Google Scholar] [CrossRef] [Green Version]
Rossi, C.; Chevrot, J.P. Uses and perceptions of Machine Translation at the European Commission. J. Spec. Transl. (JoSTrans) 2019, 31, 117–200. [Google Scholar]
Three Directions for the Design of Human-Centered Machine Translation. Available online: https://srobert4.github.io/publication/hcmt/hcmt.pdf (accessed on 29 November 2021).
Savoldi, B.; Gaido, M.; Bentivogli, L.; Negri, M.; Turchi, M. Gender Bias in Machine Translation. Trans. Assoc. Comput. Linguist. 2021, 9, 845–874. [Google Scholar] [CrossRef]
Das, P.; Kuznetsova, A.; Zhu, M.; Milanaik, R. Dangers of Machine Translation: The Need for Professionally Translated Anticipatory Guidance Resources for Limited English Proficiency Caregivers. Clin. Pediatr. 2019, 58, 247–249. [Google Scholar] [CrossRef] [PubMed]
Yates, S. Scaling the tower of babel fish: An analysis of the machine translation of legal information. Law Libr. J. 2006, 98, 481–500. [Google Scholar]
Rico, C.; Ramos, M.D.M.S.; Oliver, A. INMIGRA3: Building a case for NGOs and NMT. In Proceedings of the 22nd Annual Conference of the European Association for Machine Translation, Lisboa, Portugal, 3–5 November 2020; European Association for Machine Translation: Lisboa, Portugal, 2020; pp. 469–470. [Google Scholar]
Canfora, C.; Ottmann, A. Risks in neural machine translation. Transl. Spaces 2020, 9, 58–77. [Google Scholar] [CrossRef]
Sakamoto, A.; Yamada, M. Social groups in machine translationpost-editing: A SCOT analysis. Transl. Spaces 2020, 9, 78–97. [Google Scholar] [CrossRef]
Sefara, T.J.; Zwane, S.G.; Gama, N.; Sibisi, H.; Senoamadi, P.N.; Marivate, V. Transformer-based Machine Translation for Low-resourced Languages embedded with Language Identification. In Proceedings of the 2021 Conference on Information Communications Technology and Society (ICTAS), Durban, South Africa, 10–11 March 2021; pp. 127–132. [Google Scholar] [CrossRef]
Statistics Lithuania. Education and Educational Attainment; Official Statistics Portal: Vilnius, Lithuania, 2021; Gedimino pr. 29. [Google Scholar]
Nurminen, M. Machine Translation in Everyday Life: What Makes FAUT MT Workable? TAUS eLearning Blogs. 2018. Available online: https://blog.taus.net/elearning/machine-translation-in-everyday-life-what-makes-faut-mt-workable (accessed on 29 November 2021).
Castilho, S.; Doherty, S.; Gaspari, F.; Moorkens, J. Approaches to Human and Machine Translation Quality Assessment. In Translation Quality Assessment: From Principles to Practice; Moorkens, J., Castilho, S., Gaspari, F., Doherty, S., Eds.; Springer International Publishing: Cham, Switzerland, 2018; pp. 9–38. [Google Scholar] [CrossRef]
Qin, Y.; Zhang, J.; Lu, X. The Gap between NMT and Professional Translation from the Perspective of Discourse. In Proceedings of the 2019 3rd International Conference on Innovation in Artificial Intelligence (ICIAI 2019), Suzhou, China, 15–18 March 2019; Association for Computing Machinery: New York, NY, USA, 2019; pp. 50–54. [Google Scholar] [CrossRef]
Zouhar, V.; Tamchyna, A.; Popel, M.; Bojar, O. Neural Machine Translation Quality and Post-Editing Performance. arXiv 2021, arXiv:2109.05016. [Google Scholar]
Petkevičiūtė, I.; Tamulynas, B. Computer-based translation into Lithuanian: Alternatives and their linguistic evaluation. Stud. Lang. 2011, 18, 38–45. [Google Scholar] [CrossRef]

Figure 1. Usage of machine translation among different age groups.

Figure 2. Usage of machine translation types.

Figure 3. Distribution of machine translation types among different age groups.

Figure 4. Distribution of purposes of machine translation usage.

Figure 5. Satisfaction of all respondents with machine translation.

Figure 6. Interrelation between the education level and the use of different machine translation types.

Figure 7. Satisfaction with machine translation in groups of the respondents with a different level of education.

Figure 8. Additional actions taken to change and modify the raw machine translated text in order to improve it.

Figure 9. Additional actions taken to change and modify the raw machine translated text for its improvement in the groups of the respondents with a different level of education.

Table 1. Purposes for using machine translation in different age groups (in percentage).

Purpose for Machine Translation	Age Groups
Purpose for Machine Translation	18–29 y.	30–39 y.	40–49 y.	50–59 y.	60–69 y.	70+ y.	NA
Studies	77.6%	40.0%	25.9%	27.7%	10.0%	–	100%
Work	45.6%	78.2%	85.9%	76.6%	40.0%	–	–
Entertainment	40.8%	38.2%	23.5%	34.0%	20.0%	–	–
News	29.3%	18.2%	15.3%	27.7%	30.0%	–	–
Travel	20.4%	23.6%	24.7%	25.5%	10.0%	–	–
Household	49.0%	40.0%	31.8%	40.4%	40.0%	100%	–
Public services	6.1%	5.5%	4.7%	2.1%	–	–	–
Legal services	0.7%	1.8%	–	–	–	–	–
Medical services	2.7%	1.8%	2.4%	–	–	100%	–
Internet shopping	12.2%	7.3%	14.1%	14.9%	–	–	–
Personal communication	39.5%	29.1%	21.2%	40.4%	50.0%	–	–

Table 2. Purposes for using machine translation in different educational level groups (in percentage).

Purpose for Which Machine Translation Was Used	Higher Education	Incomplete Higher Education	Vocational Training	Secondary Education	Incomplete Secondary Education
Studies	36.7%	88.6%	36.4%	97.1%	100%
Work	79.0%	25.0%	54.5%	20.0%	7.7%
Entertainment	31.5%	40.9%	27.3%	49.0%	53.8%
News	21.4%	20.5%	18.2%	37.1%	53.8%
Travel	21.8%	22.7%	9.1%	25.7%	23.1%
Household	39.1%	47.7%	18.2%	62.9%	38.5%
Public services	5.2%	2.3%	–	5.7%	7.7%
Legal services	0.8%	–	–	–	–
Medical services	1.6%	–	9.1%	8.6%	–
Internet shopping	11.7%	9.1%	–	20.0%	7.7%
Personal communication	30.2%	36.4%	36.4%	51.4%	38.5%

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Kasperė, R.; Horbačauskienė, J.; Motiejūnienė, J.; Liubinienė, V.; Patašienė, I.; Patašius, M. Towards Sustainable Use of Machine Translation: Usability and Perceived Quality from the End-User Perspective. Sustainability 2021, 13, 13430. https://doi.org/10.3390/su132313430

AMA Style

Kasperė R, Horbačauskienė J, Motiejūnienė J, Liubinienė V, Patašienė I, Patašius M. Towards Sustainable Use of Machine Translation: Usability and Perceived Quality from the End-User Perspective. Sustainability. 2021; 13(23):13430. https://doi.org/10.3390/su132313430

Chicago/Turabian Style

Kasperė, Ramunė, Jolita Horbačauskienė, Jurgita Motiejūnienė, Vilmantė Liubinienė, Irena Patašienė, and Martynas Patašius. 2021. "Towards Sustainable Use of Machine Translation: Usability and Perceived Quality from the End-User Perspective" Sustainability 13, no. 23: 13430. https://doi.org/10.3390/su132313430

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Towards Sustainable Use of Machine Translation: Usability and Perceived Quality from the End-User Perspective

Abstract

1. Introduction

2. Literature Overview

3. Materials and Methods

3.1. Research Instrument

3.2. Research Participants

3.3. Data Analysis

4. Results

5. Discussion

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI