Module 04: Sensors and Perception
Introduction
Perception is how robots understand their environment. This module covers the sensors used in humanoid robots and the algorithms that process sensor data into actionable information.
Section 1: Proprioceptive Sensors
1.1 Joint Encoders
Encoder: A sensor that measures angular position or velocity of a motor shaft or joint.
Types:
- Incremental encoders (relative position)
- Absolute encoders (absolute position)
- Resolution: typically 12-18 bits
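The resolution figure sets the smallest detectable step: an n-bit absolute encoder divides one revolution into 2^n counts. A minimal conversion sketch (the function name and 14-bit default are illustrative, not from a particular driver API):

```python
import math

def counts_to_angle(counts, resolution_bits=14):
    """Convert a raw absolute-encoder reading to an angle in radians."""
    counts_per_rev = 2 ** resolution_bits
    # Wrap the count into one revolution, then scale to [0, 2*pi)
    return (counts % counts_per_rev) * 2 * math.pi / counts_per_rev
```

At 14 bits the angular step is 2*pi / 16384, about 0.022 degrees, which is why 12-18 bit encoders are sufficient for most joint-level control.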
1.2 Inertial Measurement Units (IMU)
IMUs combine:
- Accelerometers (linear acceleration)
- Gyroscopes (angular velocity)
- Magnetometers (heading, optional)
import numpy as np

class IMU:
    def __init__(self):
        self.accel = np.zeros(3)  # linear acceleration, m/s^2
        self.gyro = np.zeros(3)   # angular velocity, rad/s
        self.orientation = 0.0    # roll angle estimate, rad

    def estimate_orientation(self, dt):
        # Complementary filter: integrate the gyro for high-frequency
        # detail, correct its drift with the accelerometer's gravity direction
        accel_angle = np.arctan2(self.accel[1], self.accel[2])
        gyro_angle = self.orientation + self.gyro[0] * dt
        self.orientation = 0.98 * gyro_angle + 0.02 * accel_angle
        return self.orientation
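Written as a standalone function, the same complementary filter is easy to exercise; this sketch keeps the 0.98/0.02 gain split from the code above, with an illustrative gravity-only accelerometer reading:

```python
import numpy as np

def complementary_filter(orientation, accel, gyro_x, dt, alpha=0.98):
    # Low-frequency reference: gravity direction from the accelerometer
    accel_angle = np.arctan2(accel[1], accel[2])
    # High-frequency update: integrate the gyro rate
    gyro_angle = orientation + gyro_x * dt
    return alpha * gyro_angle + (1 - alpha) * accel_angle

# With zero gyro rate and gravity along +z (true roll = 0), an initially
# wrong estimate decays toward the accelerometer angle
angle = 1.0                           # start with a 1 rad error
accel = np.array([0.0, 0.0, 9.81])    # gravity-only reading, m/s^2
for _ in range(200):
    angle = complementary_filter(angle, accel, gyro_x=0.0, dt=0.01)
```

The decay rate is governed by alpha: each step keeps 98% of the integrated estimate, so accelerometer corrections act slowly, filtering out transient accelerations while still removing gyro drift.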
Section 2: Exteroceptive Sensors
2.1 Cameras
RGB cameras provide:
- Object detection and recognition
- Visual servoing
- SLAM features
2.2 Depth Sensors
RGB-D Camera: A sensor combining color imagery with per-pixel depth measurement, using structured light or time-of-flight.
2.3 LIDAR
LIDAR provides precise distance measurements:
- 2D: Single scan plane
- 3D: Full point cloud
- Range: 10-200m typical
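A 2D scan typically arrives as an array of ranges at known angular increments; converting it to Cartesian points in the sensor frame is a common first processing step. A minimal sketch (the parameter names echo common scan-message conventions but are assumptions here, not a specific API):

```python
import numpy as np

def scan_to_points(ranges, angle_min, angle_increment):
    """Convert a 2D LIDAR scan (polar form) to (x, y) points in the sensor frame."""
    ranges = np.asarray(ranges, dtype=float)
    # Beam angles grow linearly from the first beam's angle
    angles = angle_min + angle_increment * np.arange(len(ranges))
    return np.column_stack((ranges * np.cos(angles), ranges * np.sin(angles)))
```

A 3D LIDAR is the same idea with an extra elevation angle per beam, producing the full point cloud.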
Section 3: Sensor Fusion
3.1 Kalman Filter
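For a linear system with Gaussian noise, the Kalman filter alternates a predict step (propagate the state, growing its uncertainty) and an update step (blend in a measurement, weighted by the Kalman gain). A minimal 1D sketch with a constant-state model; the noise variances Q and R are illustrative:

```python
def kalman_1d(x, P, z, Q=0.01, R=0.5):
    """One predict/update cycle of a 1D Kalman filter.

    x, P : state estimate and its variance
    z    : new measurement
    Q, R : process and measurement noise variances (illustrative values)
    """
    # Predict: constant-state model, so only the uncertainty grows
    P = P + Q
    # Update: the gain K weights the measurement by relative confidence
    K = P / (P + R)
    x = x + K * (z - x)
    P = (1 - K) * P
    return x, P

# Fusing repeated noisy measurements of a constant pulls the estimate
# toward the true value while shrinking its variance
x, P = 0.0, 1.0
for z in [0.9, 1.1, 1.0, 0.95, 1.05]:
    x, P = kalman_1d(x, P, z)
```

The same predict/update structure carries over to vector states, where x becomes a state vector, P a covariance matrix, and K a gain matrix.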
3.2 Extended Kalman Filter
For nonlinear systems, the extended Kalman filter (EKF) linearizes the process and measurement models about the current state estimate using their Jacobians, then applies the standard predict/update equations.
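A sketch of that linearization for a range-only measurement h(x) = sqrt(px^2 + py^2) of a 2D position state: the Jacobian H is evaluated at the current estimate and then used in the usual update equations. The noise values here are illustrative:

```python
import numpy as np

def ekf_range_update(x, P, z, R=0.1):
    """EKF measurement update for a 2D position state with a range-only sensor."""
    px, py = x
    r = np.hypot(px, py)              # predicted measurement h(x)
    H = np.array([[px / r, py / r]])  # Jacobian dh/dx, evaluated at the estimate
    S = H @ P @ H.T + R               # innovation covariance (1x1)
    K = P @ H.T / S                   # Kalman gain (2x1)
    x = x + (K * (z - r)).ravel()     # correct the state along the gain direction
    P = (np.eye(2) - K @ H) @ P       # shrink the covariance
    return x, P

x = np.array([3.0, 4.0])              # current estimate: range 5.0
P = np.eye(2) * 0.5
x, P = ekf_range_update(x, P, z=5.5)  # measured range slightly longer
```

Because the Jacobian is only valid near the linearization point, EKF accuracy degrades when the estimate is far from the true state, which is one reason good initialization and calibration matter.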
Sensor fusion requires careful calibration. Misaligned sensors will produce inconsistent estimates regardless of filter quality.
Section 4: Perception Pipelines
4.1 Object Detection
Modern approaches use deep learning:
- YOLO, SSD for real-time detection
- Point cloud processing for 3D objects
4.2 Pose Estimation
Estimating 6-DOF object pose enables manipulation:
def estimate_object_pose(rgb_image, depth_image, object_model):
    # Detect the object in the RGB image (2D bounding box)
    bbox = object_detector(rgb_image)
    # Back-project depth pixels inside the box into a 3D point cloud
    points = depth_to_points(depth_image, bbox)
    # Refine the 6-DOF pose by aligning the cloud to the model with ICP
    pose = icp(points, object_model)
    return pose
Summary
Key takeaways:
- Proprioceptive sensors measure internal state (joints, orientation)
- Exteroceptive sensors measure the external world (cameras, LIDAR)
- Sensor fusion combines noisy measurements from multiple sensors into a single, more reliable state estimate
- Modern perception uses deep learning for robust detection
Key Concepts
- Proprioception: Internal state sensing
- Exteroception: External world sensing
- Sensor Fusion: Combining multiple sensor readings
- Kalman Filter: Optimal state estimation for linear systems with Gaussian noise
Further Reading
- Thrun, S., Burgard, W., & Fox, D. (2005). "Probabilistic Robotics"
- Szeliski, R. (2022). "Computer Vision: Algorithms and Applications"