Translation Bias and Accuracy in Multilingual LLMs for Cross-Language Claim Verification

December 1, 2024

Accepted to Attribution @ NeurIPS 2024

Authors: Aryan Singhal, Veronica Shao, Gary Sun, Ryan Ding

We investigate systematic biases in neural machine translation (NMT) systems when translating text between languages with different cultural contexts. Our analysis reveals that NMT systems often produce translations that reflect the dominant cultural perspectives present in their training data, leading to subtle but significant meaning shifts. We propose a framework for measuring and mitigating these translation biases, introducing metrics that capture semantic drift across cultural dimensions. Experiments on 15 language pairs demonstrate the prevalence of these biases and the effectiveness of our debiasing approaches.
