been leading Mercari's international expansion since 2023. Prior to joining Mercari, he worked in the automotive and defense industries. Engineering Manager
once, sell many times • C2C: Translate once, sell once → Larger impact on cost • From Japanese to multiple other languages • When to translate? When listed? When visited? On-demand? • And when the content is updated? • Some users try to game the search system • The marketing also has requirements • We chose a hybrid approach
based on input characters ◦ High rate limits ◦ Low latency ◦ Additional features such as glossaries ◦ Consistent results Classic translation models vs. LLMs • Large Language Models (LLMs) ◦ Pay-as-you-go or reserved capacity, based on input / output tokens ◦ Stricter rate limits ◦ Inconsistent latency ◦ No glossary ◦ Inconsistent results
• High quality Japanese translation • Similar price point as LLMs at the time • High rate limits, low latency, safer choice • Price point: 100 units • A/B test results: +5.3% Buyer Conversion Rate DeepL
units • Large Languages models got cheaper • Opportunity to learn about LLMs and run them in production ChatGPT • GPT-4o mini ◦ Price point: 10 units • A/B tests results: no impact
cost • Start simple, improve later * Original text will be delimited by ###\ * Original text is in Japanese\ * Your task is to translate it to Traditional Chinese ### <the product’s title or description>
GCP, not Microsoft Azure • Same price point • Gemini 1.5 Flash • Motivated by engineering maintenance effort ◦ Mercari mainly uses GCP, not Microsoft Azure • Same price point • Gemini 1.5 Flash • Price point: 1 unit 🎉 • A/B test result: no impact Gemini
You are a Japanese-to-English translation API. 1. **Task:** Translate the content of the user's <xb-text> tag. 2. **Output:** Your entire response MUST be the result, wrapped in <xb-text> tags. Add no other text. <content of title or description>
need it? カビゴン → Kabigon or Snorlax? • It's complicated • Tokenize • English: Replace in text then translate • Traditional Chinese: Provide keywords in the prompt
Start from the user experience ◦ Start simple, iterate, and A/B test ◦ Newer models don't impact business metrics ◦ Monitor new models and expiry dates