Research team • Previously worked as an ML engineer working in multiple different domains, including: LLM fine-tuning, AI-supported listing, embeddings for search, and on-device AI. • Engineering Manager for the AI/LLM team at Mercari. • Joined Mercari in 2024 as an individual contributor (MLOps and applied ML infrastructure). • Prior experience in mobile, web, and ML/MLOps engineering at companies including TRIAL Inc. (Fukuoka) and Cogent Labs (Tokyo). Engineering Manager, AI/LLM Sho Akiyama Andre Rusli, PhD Research Engineer, Applied AI
– from a forgotten piece of experimental code 2) Evolution – the rise of the embeddings revolution 3) Explosion – the long-awaited revival of Image Search 4) Expansion – from AI listing to contextual semantic search 5) Future – continuous research and knowledge sharing List of Chapters
sat unused in a personal repo, initially considered experimental and not ready for production. 02 Spark of Curiosity Picked up again when unclaimed ideas surfaced. SigLIP was bootstrapped internally without external mandate. 03 First Customer Recommendation team became the earliest adopter, validating real-world impact for existing MobileNet use case to try SigLIP. Story kicked off with a casual team building lunch that turned into a roadmap discussion.
get people onboarded and excited within the company, “Embeddings as a Meme” → #z-embeddings Slack channel becomes shorthand for new use cases even with non-technical teams.
Summit in Bangkok, Thailand • 画像の認識・理解シンポジウム MIRU2025 in Kyoto, Japan • ACM KDD 2025 in Toronto, ON, Canada • 19th ACM Conference on Recommender Systems (RecSys 2025) in Prague, Czech Events / CM / SNS
& “Item Title” pairs Base model cl-nagoya/ruri-small-v2 Loss Functions MultipleNegativeRankingLoss MatryoshkaLoss (2) 「Search」 Context- Aware Text Embeddings
driver of AI innovation: delivering research that ships, by working ahead of the roadmap – not outside of it. 01 Semantic Understanding 02 Contextual Intelligence 03 Continuous AI Learning
NLP/U Enabling our platform to reach more global audiences. User embeddings Personalized search and discovery, moderation (TnS), etc. and many more exciting projects! Going Forward