Generative Artificial Intelligence and Machine Translators in Spanish Translation of Early Vulnerability Cybersecurity Alerts
Abstract
:1. Introduction
2. Background
2.1. Early Warning of Vulnerabilities
2.2. Machine Translaters
2.3. Quality Assessment Indicators
- nkt: Kendall’s coefficient;
- p1: unigram precision;
- bp: length penalty;
- , : hyperparameters that control the weight of unigram precision and length penalty, respectively.
- Element-wise product with the source text ().
- Element-wise product with the reference ().
- Element-wise absolute difference with the source text ().
- Element-wise absolute difference with the reference ().
3. Materials and Methods
3.1. Research Questions
- RQ1: Which translator provides the highest quality translations?
- RQ2: Is the reference translation reliable?
- RQ3: Which translator produces vulnerability description translations that most closely align with those reported by INCIBE?
- RQ4: Are there differences between the performance of the translators depending on the size of the translated description?
- RQ5: Are the translators efficient in terms of time and cost?
- RQ6: Are GenAIs or specialized machine translators more efficient?
3.2. Datasets
3.3. Machine Translators and GenAI
3.4. Data Collection Methodology
- ChatGPT: The GPT-4o-mini model was used due to the limitations of the free version of GPT-4. The script incorporated strategies to handle temporary message blocks, conversation length limits, and auto-scroll issues. Additionally, automatic pauses and session restarts were implemented to ensure process continuity. To optimize performance, a second device was employed, and the processing order of the original file was modified.
- Bing Translate and Google Translate: Scripts were developed to simulate mouse movements, avoiding direct web scraping.
- DeepL: Translation automation was achieved through copy-pasting into the DeepL interface. To prevent service blocks, periodic tab resets were implemented every 10 translations.
- Copilot: A script was developed to automate the insertion of translations into Copilot. The script simulated mouse movements and used copy-paste commands to manage and transfer translations automatically into the system, incorporating strategies similar to those used for ChatGPT to handle temporary message blocks. However, for this tool, due to the inability to detect when a prompt had been answered, fixed waiting times were established for each interaction.
3.5. Response Variables
- The vulnerability description translated into Spanish by each machine translator and GenAI, and the reference translation provided by INCIBE.
- Evaluation of the translation using machine translation evaluation metrics.
- Time: measurement of the time taken to complete the translations. In this context, the time measurement includes both the system’s active periods and the fixed waiting times, which are an integral part of the interaction process with the tool.
3.6. Evaluation Metrics
3.7. Analysis
- Translations that exhibit statistically significant differences in any of the comparisons are awarded one point, signifying that these translations demonstrate distinct performance according to the evaluated metrics. This suggests that one translation is significantly superior or inferior to another.
- Translations that do not show significant differences are not awarded points, as there is insufficient evidence to suggest that their quality differs significantly based on the metrics.
- Translations with significant and high correlations in the range [0.5, 1] do not receive any additional points, as such correlations imply that the translations do not differ substantially in terms of quality. The interval [−1, −0.5] is excluded, as only TER operates inversely, and its final evaluation has been adjusted accordingly.
- Translations exhibiting correlations in the range (−0.5, 0.5) or non-significant correlations () are assigned one point. These points are not awarded more than once.
3.8. Technical Details
- Number of devices: 5 PCs;
- CPU: Intel Core i5-8400 (Intel, Santa Clara, CA, USA) (6 cores, 6 threads, base clock 2.8 GHz, turbo up to 4.0 GHz);
- Memory: 8 GB RAM per device.
- Operating Systems: Windows and Linux;
- Programming Language: Python 3.
4. Results and Discussion
4.1. RQ1 and RQ2: Quality of the Translations
4.2. RQ3: Accuracy of the Translation Style
4.3. RQ4: Effect of the Size of the Description in the Performance of the Translation
4.4. RQ5: Time Complexity
4.5. RQ6: GenAI or Machine Translators?
4.6. Discussion and Interpretation
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
GenAI | Generative Artificial Intelligence |
MT | Machine Translations |
LLM | Large Language Model |
References
- NIST. Generative Artificial Intelligence. Available online: https://csrc.nist.gov/glossary/term/generative_artificial_intelligence (accessed on 11 November 2024).
- Sengar, S.S.; Hasan, A.B.; Kumar, S.; Carroll, F. Generative artificial intelligence: A systematic review and applications. Multimed. Tools Appl. 2024. [Google Scholar] [CrossRef]
- Cohen, N.D.; Ho, M.; McIntire, D.; Smith, K.; Kho, K.A. A Comparative Analysis of Generative Artificial Intelligence Responses from Leading Chatbots to Questions About Endometriosis. AJOG Glob. Rep. 2025, 5, 100405. [Google Scholar] [CrossRef] [PubMed]
- NIST. National Institute of Standards and Technology. 2025. Available online: https://www.nist.gov/ (accessed on 15 January 2025).
- INCIBE. INCIBE Vulnerabilities. Available online: https://www.incibe.es/incibe-cert/alerta-temprana/vulnerabilidades (accessed on 28 February 2025).
- Lee, T.K. Artificial intelligence and posthumanist translation: ChatGPT versus the translator. Appl. Linguist. Rev. 2024, 15, 2351–2372. [Google Scholar] [CrossRef]
- Iorliam, A.; Ingio, J.A. A Comparative Analysis of Generative Artificial Intelligence Tools for Natural Language Processing. J. Comput. Theor. Appl. 2024, 1, 311–325. [Google Scholar] [CrossRef]
- INCIBE. Vulnerability Database. Available online: https://drive.google.com/file/d/1tJOP8R98DcEMCNh6whXkqTsXGFpoTYU2/view?usp=sharing (accessed on 28 February 2025).
- National Institute of Standards and Technology. Risk Management Guide for Information Technology Systems; Technical Report NIST SP 800-30 Rev. 1; National Institute of Standards and Technology: Gaithersburg, MD, USA, 2012. [Google Scholar]
- IBM. Cost of a Data Breach Report 2024. 2024. Available online: https://www.ibm.com/reports/data-breach (accessed on 28 February 2025).
- MITRE Corporation. Common Vulnerabilities and Exposures (CVE)—Introduction. 2021. Available online: https://cve.mitre.org/docs/cve-intro-handout.pdf (accessed on 5 February 2025).
- Farrell, M. Survey on the Use of Generative Artificial Intelligence by Professional Translators. In Proceedings of the 46th Conference Translating and the Computer, Luxembourg, 18–20 November 2024. [Google Scholar] [CrossRef]
- Fu, L.; Liu, L. What are the differences? A comparative study of generative artificial intelligence translation and human translation of scientific texts. Humanit. Soc. Sci. Commun. 2024, 11, 1236. [Google Scholar] [CrossRef]
- EAMT. European Association for Machine Translation. 2025. Available online: https://eamt.org (accessed on 16 January 2025).
- Hutchins, W.J. Machine Translation: Past, Present, Future; Academic Press: Chichester, UK, 1986. [Google Scholar]
- Koehn, P. Statistical Machine Translation; Cambridge University Press: Cambridge, UK, 2003. [Google Scholar]
- Bahdanau, D.; Cho, K.; Bengio, Y. Neural Machine Translation by Jointly Learning to Align and Translate. In Proceedings of the International Conference on Learning Representations (ICLR), San Diego, CA, USA, 7–9 May 2015. [Google Scholar]
- Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, Ł.; Polosukhin, I. Attention is All You Need. In Proceedings of the Advances in Neural Information Processing Systems, Long Beach, CA, USA, 4–9 December 2017. [Google Scholar]
- Devlin, J.; Chang, M.W.; Lee, K.; Toutanova, K. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. arXiv 2018, arXiv:1810.04805. [Google Scholar]
- DeepL SE. DeepL. Available online: https://www.deepl.com/es/translator (accessed on 28 February 2025).
- Google. Google Translate. Available online: https://translate.google.es/?hl=es&sl=fr&tl=es&op=translate (accessed on 28 February 2025).
- Microsoft. Bing Translator. Available online: https://www.bing.com/translator?to=es&setlang=es (accessed on 28 February 2025).
- Chapsvision. Systran. Available online: https://www.systransoft.com/es/ (accessed on 17 February 2025).
- Softissimo. Reverso. Available online: https://www.reverso.net/traducci%C3%B3n-texto (accessed on 17 February 2025).
- OpenAI. ChatGPT. Available online: https://openai.com/chatgpt (accessed on 28 February 2025).
- Google. Google Bard (Gemini). Available online: https://gemini.google.com/?hl=es-ES (accessed on 28 February 2025).
- Microsoft. Microsoft Copilot. 2023. Available online: https://copilot.microsoft.com/ (accessed on 5 January 2025).
- Google. Google Trends. 2006. Available online: https://trends.google.es/trends/ (accessed on 5 January 2025).
- Comisión Europea, ETranslation Info. Available online: https://language-tools.ec.europa.eu/documentation (accessed on 28 February 2025).
- Comisión Europea, ETranslation API. Available online: https://webgate.ec.europa.eu/etranslation/public/requestApiKey.html (accessed on 28 February 2025).
- Pangeanic. Pangeanic. Available online: https://pangeanic.com/es/ (accessed on 17 February 2025).
- Chapsvision. Systran API. Available online: https://pangeanic.com/translation-technology/translate-easy (accessed on 28 February 2025).
- Softissimo. Reverso API. Available online: https://www.npmjs.com/package/reverso-api (accessed on 28 February 2025).
- Ministerio para la Transformación digital y de la función pública, Gobierno de España, ALIA—Artificial Inteligence. Available online: https://www.alia.gob.es/ (accessed on 17 February 2025).
- Nidhaloff. DeepTranslator GitHub. Available online: https://github.com/nidhaloff/deep-translator (accessed on 28 February 2025).
- Klein, G.; Kim, Y.; Deng, Y.; Senellart, J.; Rush, A. OpenNMT. Available online: https://opennmt.net/ (accessed on 28 February 2025).
- vince62s. Eole-nlp. Available online: https://github.com/eole-nlp/eole?tab=readme-ov-file (accessed on 17 February 2025).
- Natural Language Processing Group T.U. EOLE. Available online: https://github.com/eole-nlp/eole (accessed on 3 April 2025).
- DeepSeek. DeepSeek. 2025. Available online: https://www.deepseek.com/ (accessed on 31 January 2025).
- X. Grok. 2023. Available online: https://x.ai/ (accessed on 8 January 2025).
- Amazon. Amazon Nova. 2025. Available online: https://aws.amazon.com/es/ai/generative-ai/nova/ (accessed on 8 January 2025).
- Gough, D.; Oliver, S.; Thomas, J. An Introduction to Systematic Reviews; SAGE: London, UK, 2012. [Google Scholar]
- Papineni, K.; Roukos, S.; Ward, T.; Zhu, W.J. BLEU: A Method for Automatic Evaluation of Machine Translation. In Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, Philadelphia, PA, USA, 7–12 July 2002. [Google Scholar] [CrossRef]
- Wu, Y.; Schuster, M.; Chen, Z.; Le, Q.V.; Norouzi, M.; Macherey, W.; Krikun, M.; Cao, Y.; Gao, Q.; Macherey, K.; et al. Google’s Neural Machine Translation System: Bridging the Gap between Human and Machine Translation. arXiv 2016, arXiv:1609.08144. [Google Scholar]
- Popovic, M. chrF: Character n-gram F-score for automatic MT evaluation. In Proceedings of the Tenth Workshop on Statistical Machine Translation, Lisbon, Portugal, 17–18 September 2015. [Google Scholar]
- Banerjee, S.; Lavie, A. METEOR: An Automatic Metric for MT Evaluation with Improved Correlation with Human Judgments. In Proceedings of the ACL Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and/or Summarization, Ann Arbor, MI, USA, 29 June 2005; pp. 65–72. [Google Scholar]
- Isozaki, H.; Hirao, T.; Duh, K.; Sudoh, K.; Tsukada, H. Automatic Evaluation of Translation Quality for Distant Language Pairs. In Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, Cambridge, MA, USA, 9–11 October 2010; pp. 944–952. [Google Scholar]
- Snover, M.; Dorr, B.; Schwartz, R.; Micciulla, L.; Makhoul, J. A Study of Translation Edit Rate with Targeted Human Annotation. In Proceedings of the 7th Conference of the Association for Machine Translation in the Americas: Technical Papers, Cambridge, MA, USA, 8–12 August 2006; pp. 223–231. [Google Scholar]
- Lo, C.-k. YiSi—A Unified Semantic MT Quality Evaluation and Estimation Metric for Languages with Different Levels of Available Resources. In Proceedings of the Fourth Conference on Machine Translation (Volume 2: Shared Task Papers, Day 1), Florence, Italy, 1–2 August 2019; pp. 507–513. [Google Scholar] [CrossRef]
- Rei, R.; Stewart, C.; Farinha, A.C.; Lavie, A. COMET: A Neural Framework for MT Evaluation. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), Online, 16–20 November 2020; pp. 2685–2702. [Google Scholar] [CrossRef]
- Zhang, T.; Kishore, V.; Wu, F.; Weinberger, K.Q.; Artzi, Y. BERTScore: Evaluating Text Generation with BERT. arXiv 2019, arXiv:1904.09675. [Google Scholar]
- Gupta, P.; Shekhawat, S.; Kumar, K. Unsupervised Quality Estimation Without Reference Corpus for Subtitle Machine Translation Using Word Embeddings. In Proceedings of the 2019 IEEE 13th International Conference on Semantic Computing (ICSC), Newport Beach, CA, USA, 30 January–1 February 2019; pp. 32–38. [Google Scholar] [CrossRef]
- Mikolov, T.; Chen, K.; Corrado, G.; Dean, J. Efficient Estimation of Word Representations in Vector Space. In Proceedings of the Workshop at ICLR, Scottsdale, AZ, USA, 2–4 May 2013. [Google Scholar]
- Pennington, J.; Socher, R.; Manning, C.D. GloVe: Global Vectors for Word Representation. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), Doha, Qatar, 25–29 October 2014; Association for Computational Linguistics: Stroudsburg, PA, USA, 2014; pp. 1532–1543. [Google Scholar] [CrossRef]
- Bojanowski, P.; Grave, E.; Joulin, A.; Mikolov, T. Enriching Word Vectors with Subword Information. Trans. Assoc. Comput. Linguist. 2017, 5, 135–146. [Google Scholar] [CrossRef]
- Schönemann, P. A generalized solution of the orthogonal procrustes problem. Psychometrika 1966, 31, 1–10. [Google Scholar] [CrossRef]
- INCIBE. Descriptions of Vulnerabilities in the NVD Database. 2019. Available online: https://elrc-share.eu/repository/browse/descripciones-de-vulnerabilidades-de-la-bbdd-nvd/9dd01c02cfcc11e9913100155d02670670070560807f458f8e50bd5273778d7a/ (accessed on 26 November 2024).
- Conneau, A.; Lample, G.; Ranzato, M.A.; Denoyer, L.; Jégou, H. Word Translation Without Parallel Data. arXiv 2017, arXiv:1710.04087. Available online: https://github.com/facebookresearch/MUSE (accessed on 28 February 2025).
- Developers, S. SciPy: Open Source Scientific Tools for Python. Available online: https://www.scipy.org/ (accessed on 26 November 2024).
- Bird, S.; Loper, E. Natural Language Toolkit (NLTK). 2024. Available online: https://github.com/nltk/nltk (accessed on 11 November 2024).
- chikiulo. YiSi. Available online: https://github.com/chikiulo/yisi (accessed on 26 September 2024).
- Lightning AI. Torchmetrics: Machine Learning Metrics for PyTorch. 2024. Available online: https://lightning.ai/docs/torchmetrics/stable/ (accessed on 11 November 2024).
- nttcslab nlp. RIBES. Available online: https://github.com/nttcslab-nlp/RIBES (accessed on 18 November 2024).
- Unbabel. COMET. Available online: https://huggingface.co/Unbabel/wmt20-comet-da (accessed on 26 September 2024).
No. | Characteristic | Description |
---|---|---|
1 | Multitasking | Performs multiple tasks (translation, summarization, etc.). |
2 | API | Does it have an API? What features does it offer? |
3 | Automation | Does it allow automation of input and output? |
4 | Input Formats | Formats accepted as input. |
5 | Output Formats | Formats generated as output. |
6 | Scoring | Does it calculate scores? How does it compute them? |
7 | Customization | Can the tool be modified or programmed? |
8 | Data Sources | What sources does it use for translation? |
9 | Local/Online | Does it function locally or only online? |
10 | Proprietor | Owning company or developer. |
11 | Technology | Translation technologies used. |
12 | Generalist/Specialized | Is it a generalist or specialized tool? |
13 | Update Frequency | How frequently is it updated? |
Machine Translation Systems (NMT/SMT) | |
---|---|
Google Translate | Neural network-based translator |
Bing Translator | Neural network-based translator |
DeepL | Proprietary neural translator |
Reverso | Translator based on hybrid models |
Systran | Translator based on hybrid models |
E-Translation | EU machine translation platform |
Pangeanic | AI-based specialized translator |
THUMT | Open-source neural translation system |
OpenNMT | Open-source neural translation framework |
Deep-translator | Library for multiple translators |
Public Machine Translation Platforms | |
ALIA | Public machine translation service by the Spanish government |
Generative AI Models | |
ChatGPT-3.5 | Generative language model |
Gemini | Google’s generative AI model |
Copilot | AI assistant based on generative models |
DeepSeek R1 | Large language model by DeepSeek |
Amazon Nova | Amazon’s generative AI model family |
Grok-3 | Generative AI model by xAI (Elon Musk) |
Specialized AI-Powered Tools | |
Copilot for Security | AI-based security assistant |
Metric | Type | Evaluation | Evaluated Aspects |
---|---|---|---|
BLEU | Statistical | Lexical | Based on n-gram precision, penalizes short translations. |
GLEU | Statistical | Lexical | Similar to BLEU, but balances precision and recall. |
CHRF | Statistical | Lexical | Uses characters instead of words, useful for morphologically rich languages. |
METEOR | Statistical | Lexical and Semantic | Uses flexible alignment with synonyms and stemming, balancing precision and recall |
RIBES | Statistical | Word Order | Based on word order, useful for languages with different syntactic structures. |
TER | Statistical | Edit Distance | Calculates the number of edits required to transform the translation into the reference. |
YiSi | Quantitative | Semantic | Uses semantic representations to measure similarity between translation and reference. |
COMET | AI-Based Model | Semantic and Contextual | Neural network-based model that evaluates quality using contextualized embeddings. |
BERT | AI-Based Model | Semantic | Measures similarity comparing translation and reference embeddings. |
Word Embeddings | AI-Based Model | Semantic | Measures semantic similarity between sentences using vector representations. |
Characteristics | Content |
---|---|
Id | Identifier |
CVEId | Common Vulnerabilities and Exposures |
Title | Alert title |
Description | Description in Spanish |
Description (English) | Description in English |
Dates | April 2006–6 June 2024 |
Instances | 205,534 |
Characters in English | 355.5990 per description (average) |
Characters in Spanish | 302.9201 per description (average) |
Dataset | D |
---|---|
Sample size (Instances) | 7514 |
Total characters in English descriptions | 1,958,403 |
Average characters per English description | 260.6339 |
Translator | Type of GenAI | Free | Number of Languages |
---|---|---|---|
ChatGPT (GPT-4o mini) | Conversational | Yes, with premium options | 85 |
Bing Translator | Specific for MT | Yes, with premium options | 179 |
DeepL | Specific for MT | Yes, with premium options | 33 |
Google Translate | Specific for MT | Yes, with premium options | 243 |
Microsoft Copilot | Conversational | Yes, with premium options | 42 |
Translator | Limitations |
---|---|
ChatGPT | Blocked due to high message volume |
Limit on conversation length | |
Issues with automatic scrolling (auto-scroll) | |
Random tool blocks | |
MAC address blocking | |
Power supply interruptions | |
Google Translate and Bing | Character limitation (1000 Bing|5000 Google) |
Performance degradation with consecutive requests | |
Incorrect text capture of the translated content | |
DeepL | Maximum daily translation limit |
Limit on translation length | |
Persistent service blocks | |
Appearance of advertising pop-ups | |
Disconnections due to power outages | |
Slow processing due to excessive waiting times | |
Copilot | Maximum characters per prompt: 2000 |
Maximum 10 prompts per conversation | |
Limit on prompts per hour |
GenAI | Prompt |
---|---|
Copilot | Translate the lines of this text into Spanish independently. Respond only with the translations. |
ChatGPT | Translate the lines of this text into Spanish independently. Respond only with the translations. |
Metric | Implementation |
---|---|
BLEU [43], GLEU [44], METEOR [46], CHRF [45] | NLTK library [60] available in Python |
YiSi [49] | GitHub repository [61] in C++ and Python subprocess library |
TER [48] | Torchmetrics library [62], in Python |
RIBES [47] | GitHub repository [63] and Python subprocess library |
COMET [50] | COMET model in Hugging Face [64] |
Model | Value | INCIBE | ChatGPT | Bing | DeepL | Goog.Trans. | Copilot |
---|---|---|---|---|---|---|---|
DB (INCIBE) | Maximum | 0.9748 | 0.9748 | 0.9752 | 0.9739 | 0.9753 | 0.9748 |
Minimum | 0.7315 | 0.7125 | 0.6992 | 0.7231 | 0.7010 | 0.6957 | |
Victories | 1058 | 1497 | 480 | 1528 | 1208 | 1743 | |
Corpus (INCIBE) | Maximum | 0.9729 | 0.9735 | 0.9737 | 0.9702 | 0.9727 | 0.9732 |
Minimum | 0.5891 | 0.5768 | 0.5869 | 0.5635 | 0.5768 | 0.5628 | |
Victories | 1821 | 1297 | 558 | 1031 | 916 | 1891 | |
META | Maximum | 0.9607 | 0.9600 | 0.9588 | 0.9597 | 0.9602 | 0.9626 |
Minimum | 0.7653 | 0.7797 | 0.4900 | 0.4862 | 0.7837 | 0.4111 | |
Victories | 1909 | 2064 | 834 | 525 | 913 | 1269 |
ChatGPT vs. Bing | ChatGPT vs. DeepL | ChatGPT vs. Google Translate | ChatGPT vs. Copilot | Bing vs. DeepL | Bing vs. Google Translate | Bing vs. Copilot | DeepL vs. Google Translate | DeepL vs. Copilot | Google Translate vs. Copilot | |
---|---|---|---|---|---|---|---|---|---|---|
BLEU | ||||||||||
COMET | ||||||||||
GLEU | ||||||||||
RIBES | ||||||||||
TER | ||||||||||
YiSi | ||||||||||
METEOR | ||||||||||
CHRF |
ChatGPT vs. Bing | ChatGPT vs. DeepL | ChatGPT vs. Google Translate | ChatGPT vs. Copilot | Bing vs. DeepL | Bing vs. Google Translate | Bing vs. Copilot | DeepL vs. Google Translate | DeepL vs. Copilot | Google Translate vs. Copilot | |
---|---|---|---|---|---|---|---|---|---|---|
BLEU | 0.730 | 0.657 | 0.743 | 0.584 | 0.681 | 0.762 | 0.503 | 0.714 | 0.437 | 0.525 |
COMET | 0.831 | 0.772 | 0.854 | 0.549 | 0.811 | 0.898 | 0.505 | 0.840 | 0.471 | 0.515 |
GLEU | 0.737 | 0.663 | 0.754 | 0.561 | 0.688 | 0.771 | 0.482 | 0.718 | 0.417 | 0.504 |
RIBES | 0.773 | 0.676 | 0.777 | 0.369 | 0.699 | 0.790 | 0.329 | 0.681 | 0.280 | 0.331 |
TER | 0.777 | 0.720 | 0.792 | 0.522 | 0.743 | 0.807 | 0.453 | 0.762 | 0.407 | 0.466 |
YiSi | 0.760 | 0.653 | 0.762 | 0.427 | 0.695 | 0.799 | 0.378 | 0.724 | 0.316 | 0.390 |
METEOR | 0.792 | 0.689 | 0.791 | 0.450 | 0.714 | 0.818 | 0.388 | 0.719 | 0.332 | 0.396 |
CHRF | 0.754 | 0.640 | 0.774 | 0.527 | 0.724 | 0.825 | 0.431 | 0.745 | 0.353 | 0.453 |
Translator | Significative Differences | Total |
---|---|---|
ChatGPT | 30 | 33 |
Bing | 31 | 37 |
DeepL | 31 | 39 |
Google Translate | 29 | 34 |
Copilot | 31 | 54 |
Translater | BLEU | COMET | GLEU | RIBES | TER | YiSi | METEOR | CHRF | Mean |
---|---|---|---|---|---|---|---|---|---|
ChatGPT | 0.500 | 0.681 | 0.532 | 0.921 | 0.695 | 0.716 | 0.751 | 0.775 | 0.696 |
Bing | 0.442 | 0.672 | 0.480 | 0.907 | 0.650 | 0.693 | 0.740 | 0.763 | 0.668 |
DeepL | 0.511 | 0.691 | 0.539 | 0.919 | 0.693 | 0.725 | 0.757 | 0.785 | 0.703 |
Google Translate | 0.503 | 0.700 | 0.532 | 0.919 | 0.689 | 0.725 | 0.756 | 0.782 | 0.701 |
Copilot | 0.459 | 0.608 | 0.489 | 0.894 | 0.638 | 0.670 | 0.697 | 0.739 | 0.649 |
BLEU | COMET | GLEU | RIBES | TER | YiSi | METEOR | CHRF | |
---|---|---|---|---|---|---|---|---|
ChatGPT | 3° | 3° | 3° | 1° | 1° | 3° | 3° | 3° |
Bing | 5° | 4° | 5° | 4° | 4° | 4° | 4° | 4° |
DeepL | 1° | 2° | 1° | 2° | 2° | 2° | 1° | 1° |
Google Translate | 2° | 1° | 2° | 3° | 3° | 1° | 2° | 2° |
Copilot | 4° | 5° | 4° | 5° | 5° | 5° | 5° | 5° |
Description | BLEU | COMET | GLEU | RIBES | TER | YiSi | METEOR | CHRF | |
---|---|---|---|---|---|---|---|---|---|
ChatGPT | Short Long | 0.511 0.489 | 0.716 0.645 | 0.544 0.520 | 0.926 0.916 | 0.705 0.684 | 0.722 0.710 | 0.762 0.740 | 0.780 0.770 |
Bing | Short Long | 0.450 0.435 | 0.713 0.630 | 0.489 0.471 | 0.912 0.902 | 0.659 0.640 | 0.702 0.684 | 0.758 0.722 | 0.771 0.756 |
DeepL | Short Long | 0.527 0.495 | 0.733 0.649 | 0.554 0.523 | 0.924 0.915 | 0.705 0.681 | 0.735 0.714 | 0.770 0.743 | 0.793 0.778 |
Google Translate | Short Long | 0.524 0.483 | 0.739 0.661 | 0.552 0.512 | 0.926 0.912 | 0.706 0.672 | 0.739 0.711 | 0.774 0.738 | 0.792 0.772 |
Copilot | Short Long | 0.470 0.448 | 0.644 0.573 | 0.501 0.478 | 0.898 0.889 | 0.647 0.629 | 0.676 0.663 | 0.707 0.686 | 0.742 0.735 |
Average | Short Long | 0.496 0.470 | 0.709 0.632 | 0.528 0.501 | 0.917 0.907 | 0.684 0.661 | 0.715 0.696 | 0.754 0.726 | 0.776 0.762 |
ChatGPT | Bing | DeepL | Google Translate | Copilot | |
---|---|---|---|---|---|
Time/Translation (s) | 5.4055 | 1.4 | 7.0328 | 3.2 | 4.9948 |
Model | Value | INCIBE | GenAI | Machine Translators |
---|---|---|---|---|
DB (INCIBE) | 0.8390 | 0.8484 | 0.8440 | |
S | 0.0346 | 0.0364 | 0.0340 | |
Maximum | 0.9748 | 0.9748 | 0.9753 | |
Minimum | 0.7315 | 0.6957 | 0.6991 | |
Victories | 1058 | 3240 | 3216 | |
Corpus (INCIBE) | 0.7746 | 0.7846 | 0.7757 | |
S | 0.0520 | 0.0556 | 0.0519 | |
Maximum | 0.9729 | 0.9735 | 0.9736 | |
Minimum | 0.5891 | 0.5628 | 0.5635 | |
Victories | 1821 | 3188 | 2505 | |
META | 0.9122 | 0.8952 | 0.9104 | |
S | 0.0203 | 0.0824 | 0.0255 | |
Maximum | 0.9607 | 0.9625 | 0.9601 | |
Minimum | 0.7653 | 0.4111 | 0.4861 | |
Victories | 1909 | 3333 | 2272 |
BLEU | COMET | GLEU | RIBES | TER | YiSi | METEOR | CHRF | Mean | |
---|---|---|---|---|---|---|---|---|---|
GenAI | 0.479 | 0.644 | 0.510 | 0.907 | 0.666 | 0.693 | 0.724 | 0.757 | 0.672 |
Machine translators | 0.485 | 0.687 | 0.517 | 0.915 | 0.677 | 0.714 | 0.751 | 0.776 | 0.690 |
BLEU | COMET | GLEU | RIBES | TER | YiSi | METEOR | CHRF | |
---|---|---|---|---|---|---|---|---|
GenAI | 2° | 2° | 2° | 2° | 2° | 2° | 2° | 2° |
Machine translators | 1° | 1° | 1° | 1° | 1° | 1° | 1° | 1° |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Román Martínez, J.; Triana Robles, D.; El Oualidi Charchmi, M.; Salamanca Estévez, I.; DeCastro-García, N. Generative Artificial Intelligence and Machine Translators in Spanish Translation of Early Vulnerability Cybersecurity Alerts. Appl. Sci. 2025, 15, 4090. https://doi.org/10.3390/app15084090
Román Martínez J, Triana Robles D, El Oualidi Charchmi M, Salamanca Estévez I, DeCastro-García N. Generative Artificial Intelligence and Machine Translators in Spanish Translation of Early Vulnerability Cybersecurity Alerts. Applied Sciences. 2025; 15(8):4090. https://doi.org/10.3390/app15084090
Chicago/Turabian StyleRomán Martínez, Javier, David Triana Robles, Mouhcine El Oualidi Charchmi, Ines Salamanca Estévez, and Noemí DeCastro-García. 2025. "Generative Artificial Intelligence and Machine Translators in Spanish Translation of Early Vulnerability Cybersecurity Alerts" Applied Sciences 15, no. 8: 4090. https://doi.org/10.3390/app15084090
APA StyleRomán Martínez, J., Triana Robles, D., El Oualidi Charchmi, M., Salamanca Estévez, I., & DeCastro-García, N. (2025). Generative Artificial Intelligence and Machine Translators in Spanish Translation of Early Vulnerability Cybersecurity Alerts. Applied Sciences, 15(8), 4090. https://doi.org/10.3390/app15084090