ARX Robot Control System
This skill provides expert guidance for working with the ARX robotic manipulation system, built on the mobile-aloha and act-plus-plus frameworks. The system implements Action Chunking with Transformers (ACT) for imitation learning on a dual-arm robotic platform.
What This Skill Does
Assists developers working with the ARX dual-arm robot platform by:
- Understanding the three-phase workflow (data collection → training → inference)
- Navigating the codebase structure and key components
- Running data collection, training, and inference pipelines
- Configuring cameras, CAN bus communication, and ROS2 integration
- Debugging hardware setup and multi-process coordination
- Managing conda environments and dependencies

Instructions
1. Repository Structure Understanding
When users ask about code organization:
**act/**: Core ACT implementation
- `collect.py`: Data collection from robot sensors/cameras
- `train.py`: Neural network training for imitation learning
- `inference.py`: Real-time robot control with trained models
- `robomimic/`: Robotics learning framework integration
- `detr/`: DETR transformer architecture for vision
- `utils/`: Policy utilities, ROS operations, data handling
**realsense/**: Intel RealSense camera integration (3 cameras: head, left, right)
- Configured for 640x480@90fps color and depth streams
**tools/**: Automation scripts
- `01_collect.sh`: Automated data collection
- `02_train.sh`: Training pipeline
- `03_inference.sh`: Inference deployment
**ARX_CAN/**: CAN bus communication
**ROS2/**: ROS2 workspace for robot control
**arx_joy/**: Joystick controller integration

2. Environment Setup
When users need to set up the development environment:
```bash
# Activate the conda environment
conda activate act

# Install dependencies
pip install -r tools/IL/requirements.txt
```
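After installation, a quick sanity check can confirm that the key packages resolve. This is a sketch using only the standard library; the package list mirrors the dependencies listed below and should be adjusted to match your `requirements.txt`:

```python
"""Sanity-check that key dependencies for the act environment import."""
import importlib.util


def missing_packages(names):
    """Return the subset of module names that cannot be imported."""
    return [n for n in names if importlib.util.find_spec(n) is None]


if __name__ == "__main__":
    required = ["torch", "torchvision", "cv2", "rclpy", "h5py", "numpy"]
    missing = missing_packages(required)
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All key dependencies found.")
```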
Key dependencies:
- PyTorch with CUDA support
- ROS2 (rclpy)
- OpenCV, torchvision
- mujoco, dm_control
- h5py, numpy==1.26

3. Data Collection Workflow
When users want to collect training data:
**Automated approach:**
```bash
./tools/01_collect.sh
```
**Manual approach:**
```bash
cd act
python collect.py --episode_idx -1 --num_episodes 20
```
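The `--episode_idx -1` flag suggests auto-indexing of episodes. Assuming episodes are saved as `episode_<n>.hdf5` (the mobile-aloha naming convention — verify against your `collect.py`), the next free index can be computed like this:

```python
"""Sketch: pick the next free episode index, assuming mobile-aloha-style
file names episode_<n>.hdf5 in the data directory."""
import re
from pathlib import Path


def next_episode_idx(data_dir):
    """Return max existing episode index + 1, or 0 if the directory is empty."""
    pattern = re.compile(r"episode_(\d+)\.hdf5$")
    indices = [
        int(m.group(1))
        for p in Path(data_dir).glob("episode_*.hdf5")
        if (m := pattern.search(p.name))
    ]
    return max(indices) + 1 if indices else 0
```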
Explain that collection requires:
- CAN bus communication running
- ROS2 controllers active
- All 3 RealSense cameras streaming
- Joystick controller connected
- Camera topics: `/camera/camera_{h,l,r}/color/image_rect_raw/compressed`
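Before launching collection, a minimal preflight check can verify the hardware is visible. The device paths below are typical Linux locations (CAN interfaces under `/sys/class/net/`, joysticks at `/dev/input/js0`) and are assumptions — adjust them for your setup:

```python
"""Sketch: preflight check for CAN interface and joystick visibility.
Paths are typical Linux locations; adjust for your hardware."""
from pathlib import Path


def preflight(can_iface="can0", joystick="/dev/input/js0"):
    """Return a dict mapping each prerequisite to whether it is present."""
    return {
        "can_interface": Path(f"/sys/class/net/{can_iface}").exists(),
        "joystick": Path(joystick).exists(),
    }


if __name__ == "__main__":
    for name, ok in preflight().items():
        print(f"{name}: {'OK' if ok else 'MISSING'}")
```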
4. Training Workflow
When users want to train models:
**Automated approach:**
```bash
./tools/02_train.sh
```
**Manual approach:**
```bash
cd act
python train.py --num_episodes -1
```
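Before training, ACT-style pipelines compute per-dimension mean/std over the demonstration data to normalize joint positions and actions (act-plus-plus does this in its dataset utilities). A minimal sketch, assuming episodes are already loaded as NumPy arrays:

```python
"""Sketch: per-dimension normalization statistics over demonstration episodes,
in the style of act-plus-plus dataset utilities."""
import numpy as np


def norm_stats(actions, eps=1e-2):
    """Mean/std across all timesteps of all episodes.

    `actions` is a list of (T_i, action_dim) arrays; std is clipped away
    from zero so constant dimensions don't blow up normalization.
    """
    flat = np.concatenate(actions, axis=0)
    mean = flat.mean(axis=0)
    std = np.clip(flat.std(axis=0), eps, np.inf)
    return mean, std

# Usage: normalized_episode = (episode - mean) / std
```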
Training requirements:
- GPU with CUDA support
- Collected demonstration data in HDF5 format
- Sufficient disk space for checkpoints

5. Inference/Deployment
When users want to run inference:
**Automated approach:**
```bash
./tools/03_inference.sh
```
**Manual approach:**
```bash
cd act
python inference.py
```
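Under the hood, ACT predicts chunks of future actions at each step, and act-plus-plus optionally blends overlapping predictions with temporal ensembling, weighting each prediction by `exp(-m * i)` where `i` is its age. A sketch of that aggregation step (function name and list layout are illustrative):

```python
"""Sketch: temporal ensembling of overlapping action-chunk predictions,
as described for ACT in act-plus-plus."""
import numpy as np


def ensemble_action(predictions, m=0.01):
    """Blend all chunk predictions covering the current timestep.

    `predictions` is a list of action vectors for the current step,
    ordered oldest first; weight exp(-m*i) gives the oldest prediction
    the largest weight.
    """
    preds = np.stack(predictions)
    weights = np.exp(-m * np.arange(len(preds)))
    weights /= weights.sum()
    return (weights[:, None] * preds).sum(axis=0)
```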
Inference requires:
- Trained model checkpoint
- Camera streams active
- Robot controllers running
- Real-time performance (GPU recommended)

6. Camera Management
When users need to manage RealSense cameras:
```bash
cd realsense
./realsense.sh
```
Cameras are hardcoded with specific serial numbers for:
- Head camera
- Left camera
- Right camera

All cameras stream at 640x480@90fps.
7. Configuration Files
Point users to key configuration locations:
**Main config**: `act/data/config.yaml`
- Camera topics
- Arm controller topics (`/arm_master_{l,r}_status`, `/arm_slave_{l,r}_status`)
- Robot base configuration
**Code style**: `.flake8`
- Max line length: 120 characters
- Run checks: `flake8 <file>`
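After editing `config.yaml`, a quick pattern check can catch typos in the camera topic names. This assumes the config yields a flat list of topic strings (a hypothetical structure — check the actual keys in your `config.yaml`):

```python
"""Sketch: validate camera topic names against the expected pattern."""
import re

TOPIC_RE = re.compile(r"^/camera/camera_[hlr]/color/image_rect_raw/compressed$")


def invalid_camera_topics(topics):
    """Return the topics that do not match the expected naming pattern."""
    return [t for t in topics if not TOPIC_RE.match(t)]
```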
8. Multi-Process Coordination
Explain that the system uses gnome-terminal for coordinating multiple processes:
- CAN bus communication
- ROS2 controllers
- Camera nodes
- Data collection/inference processes
- Joystick input handling

9. Hardware Requirements
When discussing system requirements:
- ARX dual-arm robot platform
- 3x Intel RealSense cameras (specific serial numbers)
- CAN bus interface
- GPU with CUDA support for training/inference
- Joystick controller for teleoperation

10. Debugging Guidance
Common issues to check:
- Camera serial numbers match hardware in `realsense.sh`
- All processes launched successfully in the multi-terminal setup
- CUDA/GPU availability for training and inference
- ROS2 topics publishing correctly
- CAN bus communication established

11. Code Conventions
- Codebase primarily in Chinese with some English comments
- Follow flake8 style guidelines (max line length 120)
- Neural network architecture based on DETR transformers
- Data format: HDF5 for episode storage

Examples
**Example 1: Starting data collection**
User asks: "How do I collect training data?"

Response:
```bash
./tools/01_collect.sh
```
This automated script will:
1. Start CAN bus communication
2. Launch ROS2 controllers
3. Start RealSense cameras
4. Begin data collection with teleoperation
**Example 2: Training a model**
User asks: "I have 50 episodes collected, how do I train?"

Response:
```bash
cd act
python train.py --num_episodes 50
```
Or use the automated script:
```bash
./tools/02_train.sh
```
**Example 3: Configuration check**
User asks: "Where do I configure camera topics?"

Response: Edit `act/data/config.yaml`. Camera topics follow the pattern:
```
/camera/camera_h/color/image_rect_raw/compressed  (head)
/camera/camera_l/color/image_rect_raw/compressed  (left)
/camera/camera_r/color/image_rect_raw/compressed  (right)
```
Important Constraints
- **Hardware-specific**: System expects specific ARX robot hardware and RealSense camera serial numbers
- **Multi-terminal requirement**: Scripts use gnome-terminal for process coordination
- **GPU required**: Training and inference need CUDA support
- **ROS2 dependency**: All robot control goes through ROS2 topics
- **Real-time constraints**: Inference must run at a sufficient frame rate for robot control

When to Use This Skill
Use this skill when:
- Working with ARX dual-arm robot platforms
- Implementing imitation learning for robotics
- Setting up data collection pipelines for robot manipulation
- Training ACT models for robot control
- Deploying learned policies for real-time inference
- Debugging ROS2/camera/CAN bus integration issues
- Understanding mobile-aloha/act-plus-plus architectures