Hanyang University KAUST

Simulation-to-Reality Transfer for
Advanced Robotic Control

Hanyang University & KAUST Robotics Lab Collaboration

August 23 – September 6, 2024  ·  King Abdullah University of Science and Technology

Abstract

This project explores the challenge of transferring learned robotic control policies from simulated environments to real-world applications. Through a 15-day intensive research collaboration at KAUST, we developed and validated novel techniques for domain adaptation using reinforcement learning algorithms. Our approach demonstrates successful deployment across six different trajectory-following tasks with a 98% success rate, bridging the sim-to-real gap through systematic domain randomization and robust policy training.

15 Research Days
6 Task Scenarios
98% Success Rate
10k+ Training Episodes

Methods

Training Environment

  • Physics Engine: IsaacSim 4.5
  • Algorithm: PPO with domain randomization
  • Observation Space: 24-dimensional state vector
  • Action Space: Continuous control signals
  • Training Episodes: 10,000+

Real-World Deployment

  • Computing: On-board Computing
  • Framework: ROS2 Humble
  • Control Frequency: 100 Hz
  • Sensors: MoCap (Motion Capture System)
  • Communication: WiFi

Robot Platforms

Simulation Platform

Environment IsaacSim 4.5
Control Rate 100 Hz
Sensors Absolute Position, etc.

Physical Platform

Hardware CrazyFlie
Control Rate 100 Hz
Sensors MoCap (Motion Capture System)

Experimental Results

Performance comparison between simulation training and real-world deployment across all tasks.

Task Performance Summary

Task Success Rate Convergence Time Tracking Error
AR Action Stabilize 98% 2.3s
AR Action Stabilize 2 96% 2.5s
AR Observation Stabilize 97% 2.1s
Circle Following 0.03 m
Spiral Following 0.04 m
Figure-8 Following 0.05 m

Video Results

Simulation training, real-world deployment, and trajectory visualizations across six tasks.

Disturbance rejection demonstration
Disturbance Rejection

The deployed policy recovers from external perturbations in real time, demonstrating robustness beyond the training distribution.