AI Safety
- CEE: An Inference-Time Jailbreak Defense for Embodied Intelligence via Subspace Concept Rotation
 - Persona Vectors: Monitoring and Controlling Character Traits in Language Models
 
Agriculture
Augmented Reality
Computer Vision
- ViPE: Video Pose Engine for 3D Geometric Perception
 - ConceptGraphs: Open-Vocabulary 3D Scene Graphs for Perception and Planning
 
Dialog Navigation
Dialog System
- Seamlessly Integrating Factual Information and Social Content with Persuasive Dialogue
 - How to Build User Simulators to Train RL-based Dialog Systems
 - A Network-based End-to-End Trainable Task-oriented Dialogue System
 
Dialog Systems
- Persona Vectors: Monitoring and Controlling Character Traits in Language Models
 - Mixed-Initiative Dialog for Human-Robot Collaboration
 - Improving Language Model Negotiation with Self-Play and In-Context Learning from AI Feedback
 - ACE: A LLM-based Negotiation Coaching System
 
Guide Dog Robot
HRI
Human Robot Interaction
- Towards Robotic Companions: Understanding Handler-Guide Dog Interactions for Informed Guide Dog Robot Design
 - Reimagining RViz: Multidimensional Augmented Reality Robot Signal Design
 - DialFRED: Dialogue-Enabled Agents for Embodied Instruction Following
 - Descriptive and Prescriptive Visual Guidance to Improve Shared Situational Awareness in Human-Robot Teaming
 - Seamlessly Integrating Factual Information and Social Content with Persuasive Dialogue
 - Unwinding Rotations Improves User Comfort with Immersive Telepresence Robots
 - Outracing champion Gran Turismo drivers with deep reinforcement learning
 - Flight, Camera, Action! Using Natural Language and Mixed Reality to Control a Drone
 - Explanation Augmented Feedback in Human-in-the-Loop Reinforcement Learning
 - Virtual Reality for Robots
 - RMM: A Recursive Mental Model for Dialogue Navigation
 - Improving Grounded Natural Language Understanding through Human-Robot Dialog
 - RoomShift: Room-scale Dynamic Haptics for VR with Furniture-moving Swarm Robots
 - That and There: Judging the Intent of Pointing Actions with Robotic Arms
 - Communicating Robot Motion Intent with Augmented Reality
 
Human-Robot Interaction
- TeleMoMa: A Modular and Versatile Teleoperation System for Mobile Manipulation
 - Advancing Humanoid Locomotion: Mastering Challenging Terrains with Denoising World Model Learning
 
Humanoid Robots
Imitation Learning
Knowledge-based Sequential Decision Making
- Visual Semantic Navigation Using Scene Priors
 - Continual Learning of Knowledge Graph Embeddings
 - Ethically Compliant Sequential Decision Making
 - Semantic Linking Maps for Active Visual Object Search
 - Commonsense Reasoning and Knowledge Acquisition to Guide Deep Learning on Robots
 - Learning Pipelines with Limited Data and Domain Knowledge
 
LLM
- Mixed-Initiative Dialog for Human-Robot Collaboration
 - CEE: An Inference-Time Jailbreak Defense for Embodied Intelligence via Subspace Concept Rotation
 - Persona Vectors: Monitoring and Controlling Character Traits in Language Models
 - FEAST: A Flexible Mealtime Assistance System Towards In-the-Wild Personalization
 - BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models
 - LINC: A Neurosymbolic Approach for Logical Reasoning by Combining Language Models with First-Order Logic Provers
 - True Knowledge Comes from Practice: Aligning LLMs with Embodied Environments via Reinforcement Learning
 - Universal and Transferable Adversarial Attacks on Aligned Language Models
 - An LLM can Fool Itself: A Prompt-Based Adversarial Attack
 - VIMA: General Robot Manipulation with Multimodal Prompts
 
Learning
- Learned Visual Navigation for Under-Canopy Agricultural Robots
 - Scaling Cross-Embodied Learning: One Policy for Manipulation, Navigation, Locomotion and Aviation
 - Practice Makes Perfect: Planning to Learn Skill Parameter Policies
 - SayCanPay: Heuristic Planning with Large Language Models using Learnable Domain Knowledge
 - VIMA: General Robot Manipulation with Multimodal Prompts
 - NOIR: Neural Signal Operated Intelligent Robots for Everyday Activities
 - Eureka: Human-Level Reward Design via Coding Large Language Models
 - Video Language Planning
 - Learning to Navigate Sidewalks in Outdoor Environments
 - Open X-Embodiment: Robotic Learning Datasets and RT-X Models
 - Robot Parkour Learning
 - LEAGUE: Guided Skill Learning and Abstraction for Long-Horizon Manipulation
 - Language Reward Modulation for Pretraining Reinforcement Learning
 - Transforming a Quadruped into a Guide Robot for the Visually Impaired: Formalizing Wayfinding, Interaction Modeling, and Safety Mechanism
 - Neural Volumetric Memory for Visual Locomotion Control
 - Legs as Manipulator: Pushing Quadrupedal Agility Beyond Locomotion
 - Embodied Amodal Recognition: Learning to Move to Perceive Objects
 - MimicPlay: Long-Horizon Imitation Learning by Watching Human Play
 - Guiding Pretraining in Reinforcement Learning with Large Language Models
 - System Configuration and Navigation of a Guide Dog Robot: Toward Animal Guide Dog-Level Guiding Work
 - DM2: Decentralized Multi-Agent Reinforcement Learning for Distribution Matching
 - Robotic Guide Dog: Leading a Human with Leash-Guided Hybrid Physical Interaction
 - Deep Variational Reinforcement Learning for POMDPs
 - Do As I Can, Not As I Say: Grounding Language in Robotic Affordances
 - Discovering Generalizable Skills via Automated Generation of Diverse Tasks
 - A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning
 - Continual Learning of Knowledge Graph Embeddings
 - Learning When to Quit: Meta-Reasoning for Motion Planning
 - Temporal-Logic-Based Reward Shaping for Continuing Reinforcement Learning Tasks
 - Joint Inference of Reward Machines and Policies for Reinforcement Learning
 - Human-like Planning for Reaching in Cluttered Environments
 - Simultaneously Learning Transferable Symbols and Language Groundings from Perceptual Data for Instruction Following
 - SAIL: Simulation-Informed Active In-the-Wild Learning
 - Improving Grounded Natural Language Understanding through Human-Robot Dialog
 - Proximal Policy Optimization Algorithms
 - Imagination-Augmented Agents for Deep Reinforcement Learning
 - Learning from Interventions using Hierarchical Policies for Safety Learning
 - Deep Imitation Learning for Autonomous Driving in Generic Urban Scenarios with Enhanced Safety
 - Learning to Teach in Cooperative Multiagent Reinforcement Learning
 - Using Natural Language for Reward Shaping in Reinforcement Learning
 - Agile Autonomous Driving using End-to-End Deep Imitation Learning
 - Adversarial Actor-Critic Method for Task and Motion Planning Problems Using Planning Experience
 - Learning Pipelines with Limited Data and Domain Knowledge
 - Behavioral Cloning from Observation
 
Learning and Planning
- Using Commonsense Knowledge to Answer Why-Questions
 - Learning Multi-Object Dynamics with Compositional Neural Radiance Fields
 - Learning and Deploying Robust Locomotion Policies with Minimal Dynamics Randomization
 - Deep Whole-Body Control: Learning a Unified Policy for Manipulation and Locomotion
 - Asynchronous Actor-Critic for Multi-Agent Reinforcement Learning
 - Detect, Understand, Act: A Neuro-Symbolic Hierarchical Reinforcement Learning Framework (Extended Abstract)
 - Florence: A New Foundation Model for Computer Vision
 - Object Goal Navigation using Goal-Oriented Semantic Exploration
 - Learning Feasibility to Imitate Demonstrators with Different Dynamics
 - Text-based RL Agents with Commonsense Knowledge: New Challenges, Environments and Baselines
 - Reward Machines for Vision-Based Robotic Manipulation
 - Decision Transformer: Reinforcement Learning via Sequence Modeling
 - Symbolic Knowledge Distillation: from General Language Models to Commonsense Models
 - Legged Robots that Keep on Learning: Fine-Tuning Locomotion Policies in the Real World
 - ObjectFolder: A Dataset of Objects with Implicit Visual, Auditory, and Tactile Representations
 - Galileo: Perceiving Physical Object Properties by Integrating a Physics Engine with Deep Learning
 - Advice-Guided Reinforcement Learning in a non-Markovian Environment
 - Spatial Intention Maps for Multi-Agent Mobile Manipulation
 - What Does BERT with Vision Look At?
 - A formal methods approach to interpretable reinforcement learning for robotic planning
 
Logical Reasoning
Manipulation
- ParticleFormer: A 3D Point Cloud World Model for Multi-Object, Multi-Material Robotic Manipulation
 - RoboCook: Long-Horizon Elasto-Plastic Object Manipulation with Diverse Tools
 
Mobile Robots
NLP
- Improving Language Model Negotiation with Self-Play and In-Context Learning from AI Feedback
 - ACE: A LLM-based Negotiation Coaching System
 
Neurosymbolic
Open-World Generalization
Persona Modeling
Planning
- Plug in the Safety Chip: Enforcing Constraints for LLM-driven Robot Agents
 - Learning to Bridge the Gap: Efficient Novelty Recovery with Planning and Reinforcement Learning
 - Practice Makes Perfect: Planning to Learn Skill Parameter Policies
 - SayCanPay: Heuristic Planning with Large Language Models using Learnable Domain Knowledge
 - Video Language Planning
 - Human-like Planning for Reaching in Cluttered Environments
 - Making Sense of Vision and Touch: Self-Supervised Learning of Multimodal Representations for Contact-Rich Tasks
 - Reasoning About Physical Interactions with Object-Oriented Prediction and Planning
 - SAIL: Simulation-Informed Active In-the-Wild Learning
 - Adversarial Actor-Critic Method for Task and Motion Planning Problems Using Planning Experience
 - Behavioral Cloning from Observation
 
Quadruped Robot
- Understanding Expectations for a Robotic Guide Dog for Visually Impaired People
 - Towards Robotic Companions: Understanding Handler-Guide Dog Interactions for Informed Guide Dog Robot Design
 - Practice Makes Perfect: Planning to Learn Skill Parameter Policies
 - Learning to See Physical Properties with Active Sensing Motor Policies
 
RL
Reinforcement Learning
- Leveraging Constraint Violation Signals For Action-Constrained Reinforcement Learning
 - FlowPG: Action-constrained Policy Gradient with Normalizing Flows
 - True Knowledge Comes from Practice: Aligning LLMs with Embodied Environments via Reinforcement Learning
 - Learning to See Physical Properties with Active Sensing Motor Policies
 
Robotic Manipulation
Robotics
- Learned Visual Navigation for Under-Canopy Agricultural Robots
 - Scaling Cross-Embodied Learning: One Policy for Manipulation, Navigation, Locomotion and Aviation
 
Safety
Security
- POEX: Towards Policy Executable Jailbreak Attacks Against the LLM-based Robots
 - Can We Trust Embodied Agents? Exploring Backdoor Attacks against Embodied LLM-based Decision-Making Systems
 - Characterizing Physical Adversarial Attacks on Robot Motion Planners
 - BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models
 - Universal and Transferable Adversarial Attacks on Aligned Language Models
 - An LLM can Fool Itself: A Prompt-Based Adversarial Attack
 
Skill Discovery
State Estimation
Task Planning
Task and Motion Planning
- ParticleFormer: A 3D Point Cloud World Model for Multi-Object, Multi-Material Robotic Manipulation
 - RoboCook: Long-Horizon Elasto-Plastic Object Manipulation with Diverse Tools
 - GraphEQA: Using 3D Semantic Scene Graphs for Real-time Embodied Question Answering
 - Open-World Task and Motion Planning via Vision-Language Model Inferred Constraints
 
Task-Motion Planning
- LEAGUE: Guided Skill Learning and Abstraction for Long-Horizon Manipulation
 - Code as Policies: Language Model Programs for Embodied Control
 - Using Deep Learning to Bootstrap Abstractions for Hierarchical Robot Planning
 - Do As I Can, Not As I Say: Grounding Language in Robotic Affordances
 - Pre-Trained Language Models for Interactive Decision-Making
 - Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents
 - Online Replanning in Belief Space for Partially Observable Task and Motion Problems
 - Elephants Don't Pack Groceries: Robot Task Planning for Low Entropy Belief States
 - Planning with Learned Object Importance in Large Problem Instances using Graph Neural Networks
 - Learning When to Quit: Meta-Reasoning for Motion Planning
 - Hierarchical Planning for Long-Horizon Manipulation with Geometric and Symbolic Scene Graphs
 - Making Sense of Vision and Touch: Self-Supervised Learning of Multimodal Representations for Contact-Rich Tasks
 
VLA
VLM
- SAM2Act: Integrating Visual Foundation Model with A Memory Architecture for Robotic Manipulation
 - GraphEQA: Using 3D Semantic Scene Graphs for Real-time Embodied Question Answering
 - Open-World Task and Motion Planning via Vision-Language Model Inferred Constraints