Slide 3
Papers: April
Agent Capabilities
Reasoning
• Beyond Chain-of-Thought: A Survey of Chain-of-X Paradigms for LLMs
• ChatShop: Interactive Information Seeking with Language Agents
• Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models
• Can Small Language Models Help Large Language Models Reason Better?: LM-Guided Chain-of-Thought
• Graph of Thoughts: Solving Elaborate Problems with Large Language Models
Memory
• Memory Sharing for Large Language Model based Agents
• A Survey on the Memory Mechanism of Large Language Model based Agents
Agent Evaluation
• Foundational Challenges in Assuring Alignment and Safety of Large Language Models
• GPT in Sheep's Clothing: The Risk of Customized GPTs
Planning
• Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing
Agent Framework
• The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions
• Aligning LLM Agents by Learning Latent Preference from User Edits
• AgentKit: Flow Engineering with Graphs, not Coding
• The Landscape of Emerging AI Agent Architectures for Reasoning, Planning, and Tool Calling: A Survey
• GoEX: Perspectives and Designs Towards a Runtime for Autonomous LLM Applications
• AI2Apps: A Visual IDE for Building LLM-based AI Agent Applications