WATCHDOG: a behavioral credit score for every trading agent

How you use it

It's a library you put in your agent.
This dashboard is your console.

WATCHDOG isn't an app you log into, and it isn't for end-users. It's infrastructure a developer embeds in their own Bitget trading agent, in three lines. Think Datadog, or an SSL certificate, but for agent behavior. The live dashboard is the read-only monitoring view: it shows the agents that are running WATCHDOG.

1

Install & wrap

npm install watchdog-agent, then wrap your agent. One call before every trade, two after.

2

Your agent trades

It runs exactly as before, on your own Bitget logic. WATCHDOG scores every trade and vetoes the dangerous ones.

3

Watch the console

Open this dashboard to see your agent's live trust score, forecasts and incident reports, plus the public leaderboard.

// 1. install: npm install watchdog-agent
import { Watchdog } from 'watchdog-agent';
const watchdog = new Watchdog({ agentId:'my-bot', portfolioUsdt:10000, rules:{…}, onViolation:'pause' });

// 2. inside your trading loop, before every Bitget order:
const d = await watchdog.checkTrade({ type:'open', symbol:'BTCUSDT', sizeUsdt:100, direction:'long' });
if (d.approved) await placeOrder(); // ← your existing Bitget code
else pauseAgent(d.reason); // WATCHDOG caught it misbehaving

The problem

Agents fail because they misbehave, not because they're dumb.

In 2026, AI trading agents lost over $45M. Not from hacks. From replicating the worst human trading habits. Builders watch profit and loss. Nobody watches whether the agent is acting sanely.

238

trades in 17 days from one LLM agent, bled out on fees while net PnL stayed flat.

$45M+

lost in 2026 to agents that overtraded, revenge-traded, and chased hype into bad positions.

0

tools that score an agent's behavior, predict its breaches, or compare agents on a public board. Until now.

Why WATCHDOG is different

A filter blocks one trade. WATCHDOG is the reputation layer for every agent.

Single-purpose risk filters answer one question: "is this trade safe?" That is one of WATCHDOG's five guards. WATCHDOG answers the bigger one: "is this agent trustworthy over time, and how does it rank against the rest?"

A single-agent risk filter

✗ blocks one bad trade, then forgets
✗ no memory, no score, no trend
✗ one agent at a time
✗ reacts after the threshold
✗ a black box that just says "no"

WATCHDOG

✓ a live 0–100 trust score per agent
✓ predicts breaches before they happen
✓ ranks every agent on a public board
✓ an AI risk officer that reasons + explains
✓ all five guards, plus a tamper-evident log

Each competing "risk filter" is a feature WATCHDOG already ships. It is the credit bureau for trading agents, not antivirus for one of them.

How it works

Five behavioral metrics. Four intelligence layers.

Every trade is scored against five guards. The metrics feed four layers that turn raw events into a live, predictive, explainable trust score.

Trade frequency

catches overtrading

Position drift

catches size creep

Drawdown velocity

catches bleed-out

Loss streak

catches broken strategy

Signal override

catches tilt

LAYER 1

Trust Score

A single 0–100 score per agent, recomputed every trade via a weighted EMA. Bands: healthy, caution, unsafe.

LAYER 2

Predictive Forecast

Linear regression on each metric's trajectory, warns "breach in ~3 trades" before the violation happens.

LAYER 3

AI Diagnosis

The instant an agent breaches, an LLM writes a plain-English incident report: what happened, why, and what to do.

LAYER 4

Fleet Leaderboard

Rank every agent by trust over time. A verifiable, public reputation layer for autonomous agents.

See it live

The agent goes rogue. WATCHDOG catches it.

Trust falls, the forecast warns early, the agent auto-pauses, and an AI incident report writes itself, in real time.

WATCHDOG dashboard showing an agent paused with an AI incident report

Open the live dashboard →

Integrate

Wrap any Bitget agent in three lines.

One call before every trade. Two after. Your agent keeps working exactly as before. WATCHDOG just gets a veto and a record. Or skip the library entirely and wire it as an MCP server.

Read the Quickstart →

// before every trade, ask permission
const d = await watchdog.checkTrade({
type:'open', symbol:'BTCUSDT',
sizeUsdt:100, direction:'long',
});
if (d.approved) placeOrder();
else console.log(d.reason);

// after it closes, report the outcome
watchdog.reportTradeClosed({ symbol:'BTCUSDT', pnlUsdt:-12 });

No black boxes

Every claim is proven.

A deterministic chaos harness fires 10 misbehavior classes; WATCHDOG catches them with a reproducible benchmark. Full test coverage. A hash-chained, tamper-evident audit trail. MIT licensed.

9/9

detectable classes caught

8.4

trades to detection

92%

test coverage

MIT

open source

$ git clone · npm install · npm run benchmark → reproduce it yourself

It's a library you put in your agent.This dashboard is your console.