Cookbook: Research Agent

This example combines:

loop reasoning
built-in tools
semantic memory
eval-based quality checks

1. Goal

Build a research assistant that can:

use tools for retrieval and processing
store distilled findings as semantic memory
evaluate output quality before release

2. Skeleton

import { HarnessBuilder } from 'colony-harness'
import { createBuiltinTools } from '@colony-harness/tools-builtin'
import { runEvalSuite, containsScorer } from '@colony-harness/evals'

// Build harness, run task, then evaluate output quality.

3. Design notes

restrict tool permissions (run_command allowlist)
store reusable conclusions, not noisy raw logs
run evals in CI before release

4. Minimum pre-release checks

pnpm build
pnpm typecheck
pnpm test
pnpm docs:build
pass-rate threshold for key eval cases

Cookbook: Research Agent ​

1. Goal ​

2. Skeleton ​

3. Design notes ​

4. Minimum pre-release checks ​

Cookbook: Research Agent

1. Goal

2. Skeleton

3. Design notes

4. Minimum pre-release checks