Flutterby™! : AI translation drives contributors away from Mozilla 2025-11-06 23:04:17.436374+01

AI translation drives contributors away from Mozilla

2025-11-06 23:04:17.436374+01 by Dan Lyke 0 comments

Mozilla's SUMO Japanese translation community ends their support over botched machine translation:

They are all happened on the product server, not on staging server. I understand that this is mass destruction of our work and explicit violation to the Mozilla mission, allowed officially.

Via nixCraft 🐧 @nixCraft@mastodon.social, in the replies David Chisnall (*Now with 50% more sarcasm!*) @david_chisnall@infosec.exchange has some notes on how, yes, this is the result of a bug, but...

That bit bothers me the least. Lots of systems have bugs. The issue here for me is that they have a load of experts who understand the problem, and someone who does not understand the problem has mandated a tool that does not solve the problem and entirely disregarded the value of the experts.

Machine-assisted translation tooling primarily focuses on building, maintaining, and using a term dictionary: a set of prior translations that ensure that you consistently translate terms of art in the same way. If you don't do this, you get something that is technically a valid translation, but which is completely useless because the same term is translated in different ways throughout the document (based on surrounding context and translator preferences) and so it's impossible for a reader to tell that they're the same term.

It sounds like the Japanese translators have put a lot of effort into solving this problem. LLM-based translation is infamous for not doing this. It will translate terms based on how, across the training corpus, that term was translated when adjacent to other words. This is completely fine for short, low-stakes translation. If I want to translate a menu while travelling, for example, an LLM will typically give a good output (maybe don't trust it if you have serious allergies, but for the rest of us it's fine). But for something where you want to communicate technical content (in any domain), they're (at best) a good first approximation. And translators have repeatedly reported that cleaning up LLM translations is more work than doing the translation well in the first place.

Also Via.

Flutterby™! : AI translation drives contributors away from Mozilla

AI translation drives contributors away from Mozilla

Comment policy