Cookbook: Research Agent
This example combines:
- loop reasoning
- built-in tools
- semantic memory
- eval-based quality checks
1. Goal
Build a research assistant that can:
- use tools for retrieval and processing
- store distilled findings as semantic memory
- evaluate output quality before release
2. Skeleton
ts
import { HarnessBuilder } from 'colony-harness'
import { createBuiltinTools } from '@colony-harness/tools-builtin'
import { runEvalSuite, containsScorer } from '@colony-harness/evals'
// Build harness, run task, then evaluate output quality.3. Design notes
- restrict tool permissions (
run_commandallowlist) - store reusable conclusions, not noisy raw logs
- run evals in CI before release
4. Minimum pre-release checks
pnpm buildpnpm typecheckpnpm testpnpm docs:build- pass-rate threshold for key eval cases