Computer Vision: Current Conditions and Possibilities for Service Handling

Yamato Okamoto (LINE / Computer Vision Lab Team / AI Researcher)
Kenji Doi (Yahoo! JAPAN / Science Group, Technology Group / Machine Learning Engineer)
Tomoya Kose (ZOZO / ML / Data Department, Data Science Section 2 / Machine Learning Engineer)

https://tech-verse.me/ja/sessions/305
https://tech-verse.me/en/sessions/305
https://tech-verse.me/ko/sessions/305

Tech-Verse2022

November 17, 2022

Transcript

  1. 岡本大和 Yamato Okamoto LINE Computer Vision Lab Team AI Researcher

    Yamato Okamoto joined LINE Corporation in 2021 as a founding member of the newly established Computer Vision Lab. Yamato studied image recognition at Kyoto University and worked in new business creation before taking on dual roles in technology and business. Yamato is in charge of R&D for CLOVA OCR, which recognizes text in document images. Yamato's motto is "If you want a different result, it's crazy not to change your ways." Yamato works hard every day to make the research center a place where people aspire to work. Yamato also enjoys playing rugby.
    Role: Moderator/Panelist
  2. CLOVA OCR converted over 200 million book images into text data.

    CLOVA OCR was adopted to convert the National Diet Library's digitized holdings, 2.47 million items totaling more than 223 million images, into full-text data. https://linecorp.com/ja/pr/news/ja/2021/3825
  3. 土井賢治 Kenji Doi Yahoo! JAPAN Science Group, Technology Group Machine Learning Engineer

    Kenji Doi develops image recognition technology in collaboration with various in-house services, such as similar image search and OCR, and applies the new methods and technologies that are released daily. As a personal project, Kenji studies discriminative and generative modeling of Ramen Jiro.
    Role: Panelist
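  4. Category Estimation of ad image

    Image feature: CNN, e.g. ResNet
    Text feature: OCR text data [ "Instant assessment in one click", "Can even I get a loan?", "XX Bank card loan", … ] passed to a language model, e.g. BERT
    FC layer on the image and text features; output category, e.g. "Consumer loan?"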
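
The flow on slide 4 (image feature plus OCR text feature, then an FC layer, then a category) can be sketched as a late-fusion classifier. This is only a minimal illustration in plain Python, not the speakers' implementation: it assumes the two features are concatenated before the FC layer, which the slide does not state, and it uses random toy vectors in place of real ResNet and BERT outputs. The names `classify_ad`, `CATEGORIES`, and the feature sizes are hypothetical.

```python
import math
import random


def linear(x, weights, bias):
    """Fully connected layer: one output per row of `weights`."""
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, bias)]


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    peak = max(logits)
    exps = [math.exp(z - peak) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def classify_ad(image_feature, text_feature, weights, bias, categories):
    """Fuse both features (assumption: simple concatenation) and pick a category."""
    fused = list(image_feature) + list(text_feature)
    probs = softmax(linear(fused, weights, bias))
    best = max(range(len(categories)), key=probs.__getitem__)
    return categories[best], probs


# Toy stand-ins: real features would come from a CNN (e.g. ResNet) and a
# language model (e.g. BERT) applied to the OCR text. All sizes are hypothetical.
rng = random.Random(0)
IMAGE_DIM, TEXT_DIM = 4, 3
CATEGORIES = ["consumer_loan", "other"]

image_feature = [rng.random() for _ in range(IMAGE_DIM)]
text_feature = [rng.random() for _ in range(TEXT_DIM)]
weights = [[rng.uniform(-1, 1) for _ in range(IMAGE_DIM + TEXT_DIM)] for _ in CATEGORIES]
bias = [0.0] * len(CATEGORIES)

label, probs = classify_ad(image_feature, text_feature, weights, bias, CATEGORIES)
print(label, probs)
```

In a trained system the weights would be learned from labeled ads; here they are random, so the printed label carries no meaning beyond showing the data flow.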
  5. 光瀬智哉 Tomoya Kose ZOZO ML / Data Department, Data Science Section 2 Machine Learning Engineer

    Tomoya Kose completed a master's course at the Nara Institute of Science and Technology in 2014, studying natural language processing. Tomoya joined ZOZO in 2018 as a result of a corporate merger and has since worked in the computer vision field. Tomoya is involved in developing the models used for similar image search on ZOZOTOWN, maintaining development flows related to machine learning, and managing the machine learning engineer team.
    Role: Panelist
  6. Similar Item Retrieval (Image Retrieval)

    Available data from WEAR: positive pairs.
    Extract the item area with object detection, then get the item image from ZOZOTOWN corresponding to the item worn.
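
The positive-pair construction on slide 6 can be sketched as follows. This is a rough illustration under stated assumptions, not ZOZO's pipeline: it assumes each detected item area already carries an identifier linking it to a ZOZOTOWN catalog image (the slide does not say how that link is obtained), and it represents images as nested lists so the example runs without any imaging library. `Detection`, `crop`, and `build_positive_pairs` are hypothetical names.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Detection:
    """One detected item area in a WEAR snap (hypothetical structure)."""
    box: tuple    # (left, top, right, bottom) in pixels
    item_id: str  # assumed link to the corresponding ZOZOTOWN item


def crop(image, box):
    """Crop a 2D image stored as a list of pixel rows."""
    left, top, right, bottom = box
    return [row[left:right] for row in image[top:bottom]]


def build_positive_pairs(snap_image, detections, catalog):
    """Pair each cropped item area with its catalog image, skipping unknown items."""
    pairs = []
    for det in detections:
        catalog_image = catalog.get(det.item_id)
        if catalog_image is None:
            continue  # no catalog image for this detection
        pairs.append((crop(snap_image, det.box), catalog_image))
    return pairs


# Toy data: a 4x4 "snap" whose pixel values are 0..15, and a one-item catalog.
snap_image = [[r * 4 + c for c in range(4)] for r in range(4)]
detections = [
    Detection(box=(0, 0, 2, 2), item_id="shirt-1"),
    Detection(box=(2, 2, 4, 4), item_id="unlisted"),
]
catalog = {"shirt-1": [[9, 9], [9, 9]]}

pairs = build_positive_pairs(snap_image, detections, catalog)
print(len(pairs), pairs[0][0])
```

Pairs like these would typically serve as positives when training an embedding model for retrieval; how negatives are chosen is not covered by the slide and is left out here.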
  7. Isn't it difficult to adapt to what the users need?

    Topic-2
  8. How do you collaborate with internal stakeholders in developing your services?

    Topic-3
  9. Q&A