Semantic Machine Intelligence Lab., Keio Univ.
PRO
keio_smilab
207 Decks
1 Collection
0 Following
70 Followers
0 Stars
Decks
[RSJ25] Enhancing VLA Performance in Understanding and Executing Free-form Instructions via Visual Prompt-based Paraphrasing
keio_smilab
PRO
0
220
[MIRU25] NaiLIA: Multimodal Retrieval of Nail Designs Based on Dense Intent Descriptions
keio_smilab
PRO
1
330
[Journal club] AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark
keio_smilab
PRO
0
94
[Journal club] Learning to Rematch Mismatched Pairs for Robust Cross-Modal Retrieval
keio_smilab
PRO
0
78
[Journal club] Influence-Balanced Loss for Imbalanced Visual Classification
keio_smilab
PRO
0
72
[Journal club] Steering Your Generalists: Improving Robotic Foundation Models via Value Guidance
keio_smilab
PRO
0
95
[Journal club] Generalized Contrastive Learning for Multi-Modal Retrieval and Ranking
keio_smilab
PRO
0
120
[MIRU2025] Preference Optimization for Multimodal Large Language Models for Image Captioning Tasks
keio_smilab
PRO
0
250
[MIRU25] An LLM-Hybrid-as-a-Judge Approach for Evaluating Long Image Captions
keio_smilab
PRO
1
310
Semantic Machine Intelligence for Vision, Language, and Actions
keio_smilab
PRO
3
650
Machine Intelligence for Vision, Language, and Actions
keio_smilab
PRO
0
800
[Journal club] V-DPO: Mitigating Hallucination in Large Vision Language Models via Vision-Guided Direct Preference Optimization
keio_smilab
PRO
0
180
[Journal club] Model Alignment as Prospect Theoretic Optimization
keio_smilab
PRO
0
250
[Journal club] LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders
keio_smilab
PRO
2
150
[Journal club] DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models
keio_smilab
PRO
0
140
Will multimodal language processing change the world?
keio_smilab
PRO
4
740
[Journal club] RAM: Retrieval-Based Affordance Transfer for Generalizable Zero-Shot Robotic Manipulation
keio_smilab
PRO
1
390
[Journal club] Language-Embedded Gaussian Splats (LEGS): Incrementally Building Room-Scale Representations with a Mobile Robot
keio_smilab
PRO
0
340