MAX-CV Development

You are working with MAX-CV, an accelerated image processing framework built on MAX and Mojo, inspired by the GPUImage series. This framework provides a high-level Python API with Mojo-based custom operations for realtime video processing and machine vision tasks.

Primary Documentation References

Always consult these resources when working on the project:

Comprehensive Modular docs: https://docs.modular.com/llms.txt

Mojo docs: https://docs.modular.com/llms-mojo.txt

MAX Python API docs: https://docs.modular.com/llms-python.txt

GitHub repository: https://github.com/modular/modular/ (especially `modular/examples`)

Architecture Overview

Core Components

1. **Python API Layer** (`max_cv/`)

- `ImagePipeline`: Main class for constructing image processing graphs

- `io.py`: Image loading/saving utilities with `load_image_into_tensor()`

- `operations/`: Python wrappers invoking Mojo custom ops via `ops.custom()`

2. **Mojo Custom Operations** (`max_cv/operations_mojo/`)

- Low-level image processing operations in Mojo

- Registered with `@compiler.register("op_name")` decorator

- Uses `foreach` with element-wise kernels for GPU/CPU execution

- Organized by category: blend, color_correction, draw, edge_detection, effects, transform

3. **Pipeline Flow**

- Load images into `Buffer` objects on CPU or accelerator

- `ImagePipeline` constructs a MAX `Graph` (images normalized to 0.0-1.0 float32)

- Operations chained with `pipeline.input_image`

- Compile once with `pipeline.compile()` for optimized `Model`

- Execute multiple times with `pipeline(buffer)`

- Output automatically restored to uint8 0-255 colorspace

Essential Development Commands

Testing and Formatting

`pixi run test` - Run pytest unit and integration tests

`pixi run bench` - Run Mojo benchmarks for all operations

`pixi run format` - Format Python (ruff) and Mojo code

`pixi run format_mojo` - Format only Mojo (80 char line width)

Examples and Demos

`pixi run filter-single-image` - Simple image processing demo

`pixi run showcase [operation]` - Run specific operation

- Examples: `pixi run showcase pixellate --value 15`

- `pixi run showcase brightness --value 0.3`

- `pixi run showcase gaussian_blur --kernel_size 16 --sigma 4.0`

`pixi run showcase_video` - Video processing examples (requires OpenCV)

`pixi run -e notebook notebook` - Start Jupyter notebook environment

Code Patterns

Device Management

```python

from max.driver import Accelerator, CPU, accelerator_count

device = CPU() if accelerator_count() == 0 else Accelerator()

```

Single-Input Pipeline Construction

```python

from max_cv import ImagePipeline, load_image_into_tensor, operations as ops

from max.dtype import DType

image_tensor = load_image_into_tensor(image_path, device)

with ImagePipeline(

"pipeline_name",

image_tensor.shape,

pipeline_dtype=DType.float32,

device=device,

) as pipeline:

result = ops.brightness(device, pipeline.input_image, 0.5)

pipeline.output(result)

pipeline.compile()

result = pipeline(image_tensor)

```

Multi-Input Pipeline (Blending, etc.)

```python

with ImagePipeline(..., num_inputs=2) as pipeline:

result = ops.dissolve_blend(

device,

pipeline.input_images[0],

pipeline.input_images[1],

0.5

)

pipeline.output(result)

result = pipeline(image1_buffer, image2_buffer)

```

Buffer Handling

`Buffer.from_numpy()` - Create buffers from NumPy arrays

`buffer.to(device)` - Move buffers between CPU and accelerator

`buffer.to(CPU()).to_numpy()` - Convert back for display/saving

Custom Operation Pattern (Mojo)

```mojo

@compiler.register("operation_name")

struct OperationName:

@staticmethod

fn execute[target: StaticString](

output: OutputTensor,

param: Float32,

image: InputTensor[dtype = output.dtype, rank = output.rank],

ctx: DeviceContextPtr,

) raises:

@parameter

@always_inline

fn kernel[width: Int](idx: IndexList[image.rank]) -> SIMD[image.dtype, width]:

# Process pixels here

return image.load[width](idx) + param

foreach[kernel, target=target](output, ctx)

```

Available Operations

Color Adjustments

`brightness`, `gamma`, `luminance_threshold`

`rgb_to_luminance`, `luminance_to_rgb`

Edge Detection

`sobel_edge_detection`

Visual Effects

`pixellate`, `gaussian_blur`

Blending (two-image operations)

`add_blend`, `dissolve_blend`, `multiply_blend`

Drawing

`draw_circle`

Transforms

`flip` (vertical, horizontal, or both)

Environment Requirements

MAX 26.2+ nightly (conda channel: `https://conda.modular.com/max-nightly`)

Pixi for dependency and environment management

Supported platforms: macOS ARM64, Linux ARM64, Linux x86_64

Custom Mojo operations loaded from `max_cv/operations_mojo/`

Testing: pytest | Benchmarking: native Mojo in `benchmarks/`

Key Constraints

Always check accelerator availability before allocating device

Compile pipelines once, execute many times for performance

Mojo operations must use `foreach` pattern for parallelization

Format Mojo code to 80 character line width

Test all new operations with `pixi run test` and `pixi run bench`

MAX-CV Development

MAX-CV Development

Primary Documentation References

Architecture Overview

Core Components

Essential Development Commands

Testing and Formatting

Examples and Demos

Code Patterns

Device Management

Single-Input Pipeline Construction

Multi-Input Pipeline (Blending, etc.)

Buffer Handling

Custom Operation Pattern (Mojo)

Available Operations

Color Adjustments

Edge Detection

Visual Effects

Blending (two-image operations)

Drawing

Transforms

Environment Requirements

Key Constraints

Reviews (0)