Eval Suite

LLM-as-Judge Honesty

Labels model-assisted grades and degrades when no provider adapter is configured.

Status
degraded
Type
llm_judge
Source
built_in
Public Safe
yes

Execution state

Built-in suites describe real checks, but no pass/fail result is claimed until a run persists evidence.