Learning to Assemble the Soma Cube with Legal-Action Masked DQN and Safe ZYZ Regrasp on a Doosan M0609

Jaehong Oh; Seungjun Jung; Sawoong Kim

Overview

The paper presents a complete, production-grade collaborative-robotics system built around the Soma-cube assembly task. It demonstrates that disciplined action masking, singularity-safe regrasp planning, and multimodal HRI can be composed into a deployable platform. First published on arXiv as 2508.21272.

What the paper shows

Legal-action masking reduces the action space from 4,536 to 2,484 feasible actions, yielding a 26% sample-efficiency improvement with no loss of solution completeness.
ZYZ regrasp with proximity-based singularity detection prevents gimbal lock, raising motion success from 54% to 96%.
The sim-to-real bridge achieves a 75% assembly success rate with ±1.8 mm positioning accuracy in manufacturing-relevant conditions.
Curriculum learning achieves 100%, 92.9%, and 39.9% success at the 2-piece, 3-piece, and 7-piece levels, respectively.
Korean-language HRI uses Whisper-based speech recognition at 94% accuracy.
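
To make the legal-action masking idea concrete, here is a minimal sketch of masked action selection for a DQN-style agent. It is not the authors' implementation: the function names, the flat list of Q-values, and the boolean legality mask are assumptions made for illustration, and this summary does not describe the paper's network, action encoding, or legality test. The sketch shows only the general technique, in which both the greedy and the exploratory choice are restricted to actions flagged as legal.

```python
import random


def legal_indices(legal_mask):
    """Return the indices of actions flagged as legal (hypothetical helper)."""
    legal = [i for i, ok in enumerate(legal_mask) if ok]
    if not legal:
        raise ValueError("no legal action available in this state")
    return legal


def masked_greedy_action(q_values, legal_mask):
    """Pick the highest-valued action among the legal ones only."""
    if len(q_values) != len(legal_mask):
        raise ValueError("q_values and legal_mask must have the same length")
    legal = legal_indices(legal_mask)
    # Illegal actions are never candidates, whatever their Q-value.
    return max(legal, key=lambda i: q_values[i])


def masked_epsilon_greedy(q_values, legal_mask, epsilon, rng=random):
    """Epsilon-greedy selection where exploration also stays within legal actions."""
    if rng.random() < epsilon:
        # Explore: sample uniformly from the legal subset, not the full action space.
        return rng.choice(legal_indices(legal_mask))
    # Exploit: take the best legal action.
    return masked_greedy_action(q_values, legal_mask)


if __name__ == "__main__":
    q = [0.1, 0.9, 0.5]          # toy Q-values for three actions
    mask = [True, False, True]   # action 1 is illegal in this state
    print(masked_greedy_action(q, mask))
```

In this toy state the highest Q-value belongs to an illegal action, so the greedy choice falls back to the best legal one. Restricting exploration to the legal subset is what keeps the agent from spending samples on infeasible placements.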

Authors & acknowledgements

Jaehong Oh, Seungjun Jung, and Sawoong Kim (Doosan Robotics Rokey Bootcamp, Seoul). The work was supported by the K-Digital Training Program and mentored by Chunghyeon Lee.

Where it sits

This is the most hands-on paper in the collection. It exercises the cognitive-robotics stack end-to-end on real hardware and links tightly to the SEGO architecture.
