Featured

Business

Design

Education

How-to & DIY

Marketing & SEO

Programming

Research

Science

Storyboards

Technology

Semantic Machine Intelligence Lab., Keio Univ.

PRO keio_smilab

0 Stars

Decks

[Journal club] GraphEQA: Using 3D Semantic Scene Graphs for Real-time Embodied Question Answering

PRO

0

130

[RSJ25] LILAC: Language‑Conditioned Object‑Centric Optical Flow for Open‑Loop Trajectory Generation

PRO

0

190

[RSJ25] Feasible RAG: Hierarchical Multimodal Retrieval with Feasibility-Aware Embodied Memory for Mobile Manipulation

PRO

0

260

[RSJ25] Multilingual Scene Text-Aware Multimodal Retrieval for Everyday Objects Based on Deep State Space Models

PRO

0

140

[RSJ25] Everyday Object Manipulation Based on Scene Text-Aware Multimodal Retrieval

PRO

1

140

[RSJ25] Enhancing VLA Performance in Understanding and Executing Free-form Instructions via Visual Prompt-based Paraphrasing

PRO

0

240

[MIRU25] NaiLIA: Multimodal Retrieval of Nail Designs Based on Dense Intent Descriptions

PRO

1

380

[Journal club] AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark

PRO

0

130

[Journal club] Learning to Rematch Mismatched Pairs for Robust Cross-Modal Retrieval

PRO

0

91

[Journal club] Influence-Balanced Loss for Imbalanced Visual Classification

PRO

0

97

[Journal club] Steering Your Generalists: Improving Robotic Foundation Models via Value Guidance

PRO

0

130

[Journal club] Generalized Contrastive Learning for Multi-Modal Retrieval and Ranking

PRO

0

400

[MIRU2025]Preference Optimization for Multimodal Large Language Models for Image Captioning Tasks

PRO

0

300

[MIRU25] An LLM-Hybrid-as-a-Judge Approach for Evaluating Long Image Captions

PRO

1

340

Semantic Machine Intelligence for Vision, Language, and Actions

PRO

3

710

Machine Intelligence for Vision, Language, and Actions

PRO

0

830

[Journal club] V-DPO: Mitigating Hallucination in Large Vision Language Models via Vision-Guided Direct Preference Optimization

PRO

0

210

[Journal club] Model Alignment as Prospect Theoretic Optimization

PRO

0

280

Speaker Deck Pro: Add privacy options and schedule the publishing of your decks Upgrade