# Dumbo LLM Trainer Assistant
Expert assistant for working with Dumbo, a modular training framework built on top of Transformers for fine-tuning language models.
## What This Skill Does
This skill provides deep knowledge of the Dumbo LLM training framework's architecture, plugin system, configuration patterns, and development workflows. It helps you:
- Configure training runs with YAML files
- Develop custom plugins for models, tokenizers, datasets, and trainers
- Understand the plugin loading pipeline and execution order
- Debug training issues and optimize configurations
- Integrate metrics collection and logging
- Work with the modular plugin architecture

## Instructions
When assisting with Dumbo-related tasks, follow these guidelines:
### 1. Understand the Context
Before making changes, review:
- The plugin-based architecture, where all functionality comes from plugins in `src/dumbo/plugins/`
- The execution pipeline: Model → Tokenizer → Dataset → Training → Output
- The YAML configuration structure that defines the entire training setup
- The metrics system with abstract collectors and a registry pattern

### 2. Configuration Tasks
When creating or modifying training configurations:
- Use the YAML structure with sections: `model`, `datasets`, `trainer`, `plugins`
- Ensure required plugins are listed in the `plugins` array
- Follow the configuration examples in the `examples/` directory
- Remember that tokenizer special tokens must be configured explicitly
- Use plugin-specific config keys (e.g., `liger`, `peft`) under the `model` section

Example configuration pattern:
```yaml
model:
  base_model: org/model-name
  tokenizer:
    pad_token: "<|pad|>"
  plugin_config:
    setting: value

datasets:
  - path: dataset/path
    type: loader_type
    train_format:
      type: formatter_type

trainer:
  arguments:
    batch_size: 16

plugins:
  - required_plugin_1
  - required_plugin_2
```
### 3. Plugin Development
When creating or modifying plugins:
- All plugins inherit from `BasePlugin` in `src/dumbo/plugin_loader.py`
- Implement the appropriate interface: `ModelLoaderPlugin`, `TokenizerLoaderPlugin`, `ModelPatcherPlugin`, or `LoggingPlugin`
- Register plugins by placing them in the `src/dumbo/plugins/` directory
- Understand the loading order: Model → Tokenizer → Patches → Datasets → Trainer
- For metrics collection, implement a `get_metrics_collector()` method that returns a `MetricsCollector` instance

Plugin interface examples:
- `ModelLoaderPlugin`: implement `load_model(config)`
- `TokenizerLoaderPlugin`: implement `load_tokenizer(config, model=None)`; receives the model for embedding resizing
- `ModelPatcherPlugin`: implement `patch_model(model, config)` (see the sketch below)
- Metrics: implement `get_metrics_collector()` returning a collector instance
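As a concrete illustration, here is a minimal patcher-plugin sketch. It assumes `ModelPatcherPlugin` lives in `dumbo.plugin_loader` alongside `BasePlugin` and behaves as described above; the class name and the `gradient_checkpointing` config key are hypothetical, not part of Dumbo:

```python
from dumbo.plugin_loader import BasePlugin, ModelPatcherPlugin


class GradientCheckpointingPlugin(BasePlugin, ModelPatcherPlugin):
    """Hypothetical patcher: enables gradient checkpointing on the loaded model."""

    def patch_model(self, model, config):
        # Read a plugin-specific key from the model section (key name is illustrative).
        if config.get("model", {}).get("gradient_checkpointing", False):
            model.gradient_checkpointing_enable()  # standard Transformers method
        return model
```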
### 4. Running Training

Use these command patterns:
```bash
# Basic training run
uv run dumbo path/to/config.yaml

# Development mode
uv run python -m dumbo path/to/config.yaml

# Install dependencies first
uv sync
```
### 5. Key Files Reference
When investigating issues or making changes:
- `src/dumbo/__init__.py`: main orchestration and entry point
- `src/dumbo/plugin_loader.py`: plugin system base classes and loading logic
- `src/dumbo/metrics.py`: abstract metrics collection system
- `src/dumbo/plugins/transformers.py`: core model/tokenizer loading with embedding resizing
- `src/dumbo/plugins/transformers_trainer.py`: training setup and execution
- `src/dumbo/plugins/liger.py`: Liger kernel optimizations
- `src/dumbo/plugins/wandb.py`: W&B logging with metrics

### 6. Common Patterns
**Embedding Resizing**: Tokenizer loading receives the model reference to support automatic embedding resizing when special tokens are added.
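For reference, the underlying Transformers pattern looks roughly like this (a minimal sketch using plain Transformers APIs, not Dumbo's actual plugin code):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("HuggingFaceTB/SmolLM2-135M")
tokenizer = AutoTokenizer.from_pretrained("HuggingFaceTB/SmolLM2-135M")

# Adding a special token grows the vocabulary, so the model's embedding
# matrix must be resized to match the new tokenizer length.
num_added = tokenizer.add_special_tokens({"pad_token": "<|pad|>"})
if num_added > 0:
    model.resize_token_embeddings(len(tokenizer))
```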
**Plugin Loading Order**: Critical for proper initialization: the model must load before the tokenizer, patches apply after both are loaded, and the trainer is created last.
**Configuration Inheritance**: Plugins read their config from dedicated sections (e.g., `model.liger` for Liger plugin, `model.peft` for PEFT/LoRA).
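As an illustration, plugin-specific sections sit side by side under `model`; the `peft` keys below are common LoRA hyperparameters shown for illustration, not confirmed Dumbo keys:

```yaml
model:
  base_model: org/model-name
  liger:    # read only by the liger plugin
    rope: true
  peft:     # read only by the PEFT/LoRA plugin; keys are illustrative
    r: 16
    lora_alpha: 32
```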
**Metrics Collection**: Abstract system allows multiple collectors to be registered and used throughout training without tight coupling.
### 7. Troubleshooting
When debugging issues:
- Check the plugin loading order if initialization fails
- Verify all required plugins are listed in the config's `plugins` array
- Ensure special tokens are properly configured in the `model.tokenizer` section
- Review that plugin-specific config sections match plugin expectations
- Check that the model and tokenizer are compatible
- Verify the dataset formatter matches the expected data structure

## Examples
### Example 1: Create a new training configuration
```yaml
model:
  base_model: HuggingFaceTB/SmolLM2-135M
  tokenizer:
    pad_token: "<|pad|>"
    eos_token: "<|im_end|>"
  liger:
    rope: true
    cross_entropy: false

datasets:
  - path: tatsu-lab/alpaca
    type: huggingface_polars
    data_format: alpaca
    train_format:
      type: jinja_messages
      template: "{% for message in messages %}{{ message.content }}{% endfor %}"

trainer:
  arguments:
    batch_size: 16
    physical_batch_size: 1
    learning_rate: 1e-4
    num_train_epochs: 3

plugins:
  - transformers
  - transformers_trainer
  - liger
  - polars
  - jinja_formatter
```
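To launch this run after saving the configuration to a file (the filename below is arbitrary):

```bash
uv sync                      # install dependencies first
uv run dumbo my_config.yaml  # start training
```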
### Example 2: Develop a custom metrics collector plugin
```python
from dumbo.plugin_loader import BasePlugin
from dumbo.metrics import MetricsCollector


class CustomMetricsPlugin(BasePlugin):
    """Plugin hook: Dumbo calls get_metrics_collector() to register the collector."""

    def get_metrics_collector(self) -> MetricsCollector:
        return CustomMetricsCollector()


class CustomMetricsCollector(MetricsCollector):
    def log_metrics(self, metrics: dict, step: int):
        # Custom metrics logging logic
        pass
```
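To activate the collector, place this module in `src/dumbo/plugins/` and list its plugin name in the config's `plugins` array, per the registration rules in the Plugin Development section (the name below is hypothetical):

```yaml
plugins:
  - custom_metrics   # hypothetical name for the module above
```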
## Important Notes
- The plugin system is the core architectural pattern: all functionality comes from plugins
- Configuration files use YAML format and define the complete training pipeline
- Plugin loading order matters: Model → Tokenizer → Patches → Datasets → Trainer
- Tokenizer loading receives the model reference for embedding resizing support
- The metrics system uses abstract collectors registered via plugin hooks
- Always list required plugins explicitly in the config's `plugins` array