Instruction-tuned Large Language Models for Machine Translation in the Medical Domain

📅 2024-08-29

🏛️ arXiv.org

📈 Citations: 3

✨ Influential: 0

career value

190K/year

🤖 AI Summary

To address inconsistent and inaccurate terminology translation by large language models (LLMs) in medical machine translation, this work proposes explicitly injecting structured medical terminologies—sourced from authoritative ontologies such as UMLS and SNOMED CT—into instruction-tuning datasets, enabling the first instance of controllable, terminology-guided instruction fine-tuning. Methodologically, we employ efficient LoRA-based fine-tuning on open-source LLMs to construct high-quality medical instruction data. Experiments on multiple Chinese–English and German–English medical terminology translation benchmarks demonstrate substantial improvements: +9.2 BLEU points and +37.5% absolute gain in terminology accuracy (TERMScore), significantly outperforming both baseline LLMs and conventional neural machine translation models. This work overcomes the critical limitation of generic LLMs—terminological inaccuracy in low-resource, domain-specific settings—and establishes a novel paradigm for domain-adaptive instruction tuning grounded in structured knowledge integration.

Technology Category

Application Category

📝 Abstract

Large Language Models (LLMs) have shown promising results on machine translation for high resource language pairs and domains. However, in specialised domains (e.g. medical) LLMs have shown lower performance compared to standard neural machine translation models. The consistency in the machine translation of terminology is crucial for users, researchers, and translators in specialised domains. In this study, we compare the performance between baseline LLMs and instruction-tuned LLMs in the medical domain. In addition, we introduce terminology from specialised medical dictionaries into the instruction formatted datasets for fine-tuning LLMs. The instruction-tuned LLMs significantly outperform the baseline models with automatic metrics.

Problem

Research questions and friction points this paper is trying to address.

Improving medical domain machine translation with LLMs

Enhancing terminology consistency in medical translations

Instruction-tuning LLMs for specialized medical dictionaries

Innovation

Methods, ideas, or system contributions that make the work stand out.

Instruction-tuned LLMs enhance medical translation

Incorporates medical dictionaries for terminology accuracy

Outperforms baseline models in automatic metrics

🔎 Similar Papers

No similar papers found.