Dexterous manipulation: Eureka, DexMimicGen, π0.7 dexterous

This is a valid v1.0 placeholder page for the later curriculum arc. Full interactive lab treatment ships after Week 1 dogfooding.

LECTURE & READING

Glossary primer (12 min)

— Multi-finger / multi-contact , , . The hardest robotics problem. 16-25 DoFs in the hand alone.
Eureka — NVIDIA 2023. Uses LLMs (GPT-4) to write functions for dexterous tasks; iterates with . Solved pen spinning in sim (a long-standing open problem).
DexMimicGen — NVIDIA 2024. Generates synthetic dexterous demonstrations by "mimicking" base skills with augmentation. Avoids the 1000-human-hours-of-teleop bottleneck.
Allegro Hand / Shadow Hand / LEAP Hand — Common dexterous platforms. Allegro: 16 DoFs / 4 fingers; Shadow: 24 DoFs / 5 fingers; LEAP: low-cost 16 DoFs / 4 fingers.
In-hand — Hold an object, rotate it without dropping. Classic .
Object-centric vs hand-centric obs — Object pose in world frame vs object pose relative to hand. Choice affects .
— Force-sensing fingertips (e.g. DIGIT, GelSight). Crucial for fine ; many policies don't use it.
π0.7-dexterous — Apr 2026 variant of π0.7 fine-tuned for bimanual dexterous tasks (towel folding, container opening). Within the same blog post / release.

Hour 1 — Reading

Eureka paper, sections 1–4 (~30 min): https://eureka-research.github.io/
DexMimicGen paper, abstract + Section 3 (~25 min): https://dexmimicgen.github.io/
π0.7 dexterous results section (~10 min, from Day 30 reading): https://www.physicalintelligence.company/blog/pi07

Hour 2 — Run Eureka on a toy task (45 min)

Eureka requires GPT-4 API access. Skip if you don't have it; otherwise:

git clone https://github.com/eureka-research/Eureka
cd Eureka
export OPENAI_API_KEY=<key>
python eureka.py --env=shadow_hand_pen_spin --iterations=3

After 3 iterations (~30 min, mostly waiting on GPT-4), the LLM should have proposed a that yields a working pen-spinning in Isaac Gym. Expected: pen orientation tracks target with average angular error < 0.5 rad after 100 sim hours.

If no API access, read the paper's appendix carefully — the evolved is the key result.

LAB

Hour 3 — Lab: minimal in-hand reorientation in MuJoCo (75 min)

What you're building. A simplified in-hand cube on the Allegro Hand in MuJoCo. Train PPO for 30 min. Don't expect full — just confirm the cube doesn't drop.

Step 1 — Get the Allegro Hand model (10 min)

ls ~/robo47/mujoco_menagerie/wonik_allegro

Expected: Allegro hand assets including right_hand.xml.

Full source continues in the committed curriculum files. The v1.0 page exposes the day flow and lab surface without inventing content.

Completion controls unlock when this day graduates from placeholder to full lab.

Papers you will re-read after this

Dex-Net 2.0 — robust grasp planning