AI watermark detection system development guide. Implements Echo Rule watermark detection through a five-component pipeline (preprocessor, clause identifier, echo engine, scoring, validator) with strict tier-based progression.
AI watermark detection system for identifying "Echo Rule" patterns in AI-generated text. This skill guides development through a strict three-tier progression (MVP → Production → Research) with 32 tasks across five pipeline components.
**Project**: SpecHO - Echo Rule watermark detector
**Architecture**: Five-component sequential pipeline
**Language**: Python 3.11+
**Total Tasks**: 32 (12-week Tier 1 timeline)
**Approach**: Tier-based implementation with mandatory validation gates
1. **Verify working directory**: Must be in project root
2. **Read current state**: Check `docs/STATUS.md` first
3. **Create session file**: `working/session-YYYY-MM-DD.md`
4. **Review active task**: Reference `docs/TASKS.md` and `docs/SPECS.md`
| Document | Purpose | Update Trigger |
|----------|---------|----------------|
| `docs/TASKS.md` | Task specifications (32 tasks) | Tasks added/changed |
| `docs/SPECS.md` | Tier 1/2/3 specifications | Specs refined |
| `docs/IMPLEMENTATION.md` | Learnings and gotchas | After each session |
| `docs/DEPLOYMENT.md` | Operations and deployment | Infrastructure changes |
| `docs/STATUS.md` | Current state and next steps | After each session |
| `architecture.md` | Original Echo Rule design | Reference only |
1. **Extract insights** from session file → append to `docs/IMPLEMENTATION.md`
2. **Update current state** → `docs/STATUS.md`
3. **Archive session file** → move to `docs/archive/sessions/`
4. **DO NOT** leave files in `working/` directory
**Constraints**:
**Deliverable**: Working CLI-based detector
**Trigger Requirements**:
**Deliverable**: Production-ready system with "robust" config profile
**Trigger Requirements**:
**Deliverable**: Optimized research-grade system
```
Input: str (raw text)
↓
Preprocessor → List[Token] + spacy.Doc
↓
Clause Identifier → List[ClausePair]
↓
Echo Engine → List[EchoScore]
↓
Scoring → float (document_score)
↓
Validator → (z_score, confidence)
↓
Output: DocumentAnalysis dataclass
```
1. **Read specifications**: Check `docs/TASKS.md` for task API
2. **Check tier details**: Reference `docs/SPECS.md` for Tier 1 implementation
3. **Implement Tier 1 only**: No features from Tier 2/3
4. **Create tests**: Write corresponding test file in `tests/`
5. **Validate**: Run tests before marking complete
1. Check `docs/SPECS.md` for detailed specification
2. Check `architecture.md` for algorithm context
3. Provide clear explanation with document reference
4. Offer code example if helpful
**DO**:
**DO NOT**:
```
SpecHO/
├── SpecHO/ # Implementation
│ ├── models.py # START: Task 1.1
│ ├── config.py
│ ├── utils.py
│ ├── detector.py
│ ├── preprocessor/
│ ├── clause_identifier/
│ ├── echo_engine/
│ ├── scoring/
│ └── validator/
├── tests/ # Mirror structure
├── scripts/
│ ├── cli.py
│ └── build_baseline.py
├── data/
│ ├── baseline/
│ ├── models/
│ └── corpus/
├── docs/
│ ├── TASKS.md # READ FIRST for task specs
│ ├── SPECS.md # READ SECOND for tier details
│ ├── IMPLEMENTATION.md
│ ├── DEPLOYMENT.md
│ ├── STATUS.md
│ └── archive/
└── working/ # Temporary session files
```
1. Acknowledge task number (e.g., "Task 2.1: Tokenizer")
2. Reference relevant documentation section
3. State tier level being implemented
4. Implement complete file with Tier 1 specs
5. Suggest test creation or next task
1. Reference specific documentation section
2. Provide concrete code example
3. Explain in context of overall pipeline
1. Acknowledge user request
2. Note if it conflicts with tier system
3. Suggest tier-appropriate alternative
4. Implement only if reasonable for current tier
```
User: "Read STATUS.md and help me continue"
Assistant:
1. Reads docs/STATUS.md
2. Identifies current task
3. References docs/TASKS.md for specification
4. Proposes next action based on current tier
```
```
User: "Implement Task 2.1"
Assistant:
"Task 2.1: Tokenizer (Tier 1)
Per TASKS.md, this creates SpecHO/preprocessor/tokenizer.py
Tier 1 spec (SPECS.md): Use spaCy's default tokenizer, no customization.
[implements tokenizer.py with Tier 1 specifications]
Next: Create tests/test_preprocessor.py or proceed to Task 2.2?"
```
```
User: "Let's add caching to the phonetic analyzer"
Assistant:
"Caching is a Tier 2 feature (per SPECS.md). Current tier: 1.
We're implementing simple algorithms first. I'll note this
in IMPLEMENTATION.md as a potential Tier 2 enhancement.
Continue with current Tier 1 task?"
```
Leave a review
No reviews yet. Be the first to review this skill!
# Download SKILL.md from killerskills.ai/api/skills/specho-development-guide/raw