Mitigating Translationese Bias in Multilingual LLM-as-a-Judge via Disentangled Information Bottleneck
#translationese bias #multilingual LLM #information bottleneck #machine translation evaluation #disentangled representation
📌 Key Takeaways
- Researchers propose a method to reduce translationese bias in multilingual LLM-as-a-Judge evaluations.
- The approach uses a disentangled information bottleneck to separate language-specific and content-specific features.
- This improves fairness and accuracy in assessing machine translation outputs across languages.
- The technique aims to prevent biased judgments due to unnatural translation artifacts.
🏷️ Themes
Machine Translation, AI Fairness
Deep Analysis
Why It Matters
This research addresses a critical fairness issue in AI evaluation systems, particularly affecting non-English languages and their speakers. It matters because translationese bias can systematically disadvantage content originally written in languages other than English during AI assessment, potentially reinforcing linguistic hierarchies in global AI development. The findings affect AI researchers, multilingual content creators, and organizations deploying AI systems across different language communities, as they highlight how evaluation methods themselves can introduce bias. By improving evaluation fairness, this work supports more equitable development of multilingual AI technologies that better serve diverse global populations.
Context & Background
- LLM-as-a-Judge refers to using large language models to evaluate text quality, translations, or other AI outputs, becoming increasingly common in AI research and development
- Translationese describes the distinctive linguistic patterns that appear in translated text, often containing unnatural constructions or interference from the source language
- Multilingual AI evaluation has historically struggled with bias, where content originally written in English often receives higher scores than equivalent content translated from other languages
- The Information Bottleneck method is a machine learning technique that aims to extract minimal sufficient statistics from data, previously used for representation learning and fairness applications
- Previous research has shown that translationese can affect various NLP tasks including sentiment analysis, text classification, and machine translation evaluation
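For orientation, the classical Information Bottleneck objective seeks a representation Z of input X that is maximally compressed while staying predictive of a target Y. A disentangled variant, in the spirit of this paper's approach (the formulation below is a schematic reading, not the paper's exact objective), splits the representation into a content code and a translationese code:

```latex
% Classical Information Bottleneck: compress X into Z while preserving
% information about the prediction target Y (beta trades the two off)
\min_{p(z \mid x)} \; I(X; Z) - \beta \, I(Z; Y)

% Schematic disentangled variant: Z = (Z_c, Z_t), where Z_c carries
% content and Z_t absorbs translationese artifacts; an extra penalty
% discourages information leaking between the two codes
\min \; \underbrace{I(X; Z_c) - \beta \, I(Z_c; Y)}_{\text{content bottleneck}}
      \; + \; \lambda \, \underbrace{I(Z_c; Z_t)}_{\text{disentanglement penalty}}
```

The evaluation model then scores text from the content code alone, so translation artifacts captured in the other code exert less influence on the judgment.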
What Happens Next
Researchers will likely implement and test the proposed method across more language pairs and evaluation tasks in the coming months. We can expect to see follow-up studies examining how this approach affects different types of bias beyond translationese, and whether similar techniques can address other forms of evaluation bias. Within the next year, we may see this methodology incorporated into standard evaluation pipelines for multilingual AI systems, potentially influencing how major AI companies and research institutions assess their models' performance across languages.
Frequently Asked Questions
What is translationese bias?
Translationese bias occurs when AI evaluation systems systematically give different scores to text based on whether it was originally written in a language or translated into it. Typically, content originally written in English receives higher scores than equivalent content translated from other languages, even when quality is comparable.
How does the proposed method reduce this bias?
The method separates linguistic information into content-related and translationese-related components, allowing the evaluation system to focus on content quality while minimizing the influence of translation artifacts. This helps the AI judge make fairer comparisons between originally written and translated text across different languages.
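The weighting of such a separation can be sketched numerically. The snippet below assumes a variational-IB-style setup with diagonal-Gaussian encoders, where KL terms act as compression penalties on each code; the function names and hyperparameters here are illustrative, not taken from the paper:

```python
import numpy as np

def kl_diag_gaussian(mu, logvar):
    """Per-sample KL( N(mu, diag(exp(logvar))) || N(0, I) )."""
    return 0.5 * np.sum(np.exp(logvar) + mu**2 - 1.0 - logvar, axis=-1)

def disentangled_ib_loss(task_nll, mu_c, logvar_c, mu_t, logvar_t,
                         beta_c=1e-3, beta_t=1.0):
    """Toy disentangled-IB objective.

    A small beta_c keeps the content code z_c informative for the
    evaluation task, while a large beta_t aggressively compresses the
    translationese code z_t so style artifacts carry little information
    into the final judgment.
    """
    compress_c = kl_diag_gaussian(mu_c, logvar_c).mean()
    compress_t = kl_diag_gaussian(mu_t, logvar_t).mean()
    return task_nll + beta_c * compress_c + beta_t * compress_t

# Toy usage: batch of 4 samples, 8-dimensional codes.
z = np.zeros((4, 8))          # standard-normal posteriors: zero KL penalty
loss = disentangled_ib_loss(task_nll=1.0, mu_c=z, logvar_c=z,
                            mu_t=z, logvar_t=z)
```

The asymmetric betas encode the design intuition from the summary above: content information is preserved cheaply, while translationese information is made expensive to retain.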
Who benefits from this research?
All non-English language communities benefit, particularly those whose languages have significant translation activity with English. The research helps ensure that content originally written in languages like Spanish, Chinese, Arabic, or Hindi isn't systematically disadvantaged when evaluated by AI systems compared to English-original content.
Why does LLM-as-a-Judge evaluation matter?
LLM-as-a-Judge provides scalable, automated evaluation of AI outputs when human evaluation is expensive or time-consuming. As AI systems generate more content across languages, fair automated evaluation methods become crucial for the development, benchmarking, and deployment of multilingual AI technologies.
What are the practical applications?
Practical applications include fairer evaluation of machine translation systems, multilingual content generation models, and cross-lingual information retrieval systems. It also improves assessment of student writing in language learning applications and evaluation of international content in global business communications.
What are the limitations or risks?
Like any bias mitigation technique, this one carries the potential for unintended consequences, which is why the researchers emphasize careful validation across diverse language pairs. The method's effectiveness depends on properly disentangling translation artifacts from content quality, which may be harder for some language pairs than others.