Reverb docs

Judge Validation

Why the report includes a trust badge instead of treating LLM judgment as unquestioned truth.

Each answer is judged against a small fixed schema: target mention, prominence, competitors mentioned, and cited domains.
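
As a rough illustration, the schema could be modeled as a small record type. This is a minimal sketch: the class name, field names, and types below are hypothetical, and the choice of a string label for prominence is an assumption, since the docs name the four fields but not their representation.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class AnswerJudgment:
    """One judged answer. All names and types here are illustrative assumptions."""

    target_mentioned: bool  # did the answer mention the target brand or product?
    prominence: str  # assumed to be a categorical label such as "primary" or "none"
    competitors_mentioned: List[str] = field(default_factory=list)
    cited_domains: List[str] = field(default_factory=list)


# Example record with made-up values.
judgment = AnswerJudgment(
    target_mentioned=True,
    prominence="primary",
    competitors_mentioned=["Acme"],
    cited_domains=["example.com"],
)
```

Keeping the record this small means every field can be checked by a person in a few seconds, which is what makes the later agreement check practical.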

Brand and product aliases are normalized per scorecard before mention counts are trusted, so that spelling variants of the same name are counted together.
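
One way alias normalization might work is sketched below. It assumes each scorecard supplies a mapping from a canonical name to its known aliases and that matching is case-insensitive. The helper names, the alias table, and the matching rule are all hypothetical, not Reverb's actual implementation.

```python
from collections import Counter
from typing import Dict, Iterable


def build_alias_index(aliases: Dict[str, Iterable[str]]) -> Dict[str, str]:
    """Map every case-folded alias (and the canonical name itself) to the canonical name."""
    index: Dict[str, str] = {}
    for canonical, variants in aliases.items():
        index[canonical.casefold().strip()] = canonical
        for variant in variants:
            index[variant.casefold().strip()] = canonical
    return index


def count_mentions(mentions: Iterable[str], index: Dict[str, str]) -> Counter:
    """Count mentions after collapsing known aliases; unknown names are kept as written."""
    counts: Counter = Counter()
    for mention in mentions:
        key = mention.casefold().strip()
        counts[index.get(key, mention.strip())] += 1
    return counts


# Hypothetical alias table for a single scorecard.
scorecard_aliases = {"Acme Cloud": ["acme", "AcmeCloud", "Acme Cloud Inc."]}
index = build_alias_index(scorecard_aliases)
counts = count_mentions(["ACME", "acmecloud ", "Other Co"], index)
```

Here the two Acme variants collapse into a single "Acme Cloud" count of 2, while "Other Co" is counted separately.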

The validation queue samples about 40 answer judgments, or all of them when fewer are available.
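
The sampling rule can be sketched as follows. The function name, the exact size of 40, and the use of uniform random sampling are assumptions made for illustration. The docs say only "about 40" and do not describe how the sample is drawn.

```python
import random
from typing import List, Optional, Sequence, TypeVar

T = TypeVar("T")


def sample_for_validation(
    judgments: Sequence[T],
    target_size: int = 40,  # assumed exact value for "about 40"
    seed: Optional[int] = None,
) -> List[T]:
    """Return every judgment when there are few, otherwise a random subset."""
    if len(judgments) <= target_size:
        return list(judgments)
    rng = random.Random(seed)  # a fixed seed makes the sample reproducible
    return rng.sample(list(judgments), target_size)


small_queue = sample_for_validation(["a1", "a2", "a3"])
large_queue = sample_for_validation(list(range(100)), seed=7)
```

With three judgments the queue contains all three. With one hundred it contains 40 distinct items.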

A human reviewer labels target mention, prominence, and competitor mentions for the sampled answers.

The public report shows one of three badge states (validation pending, trusted, or needs review) based on field-level agreement between the judge and the human reviewer.

The default trust threshold is 85% overall agreement across labeled fields.
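
A minimal sketch of how the badge could be derived from the human labels is shown below. It assumes overall agreement is the share of matching field labels across all sampled answers, that competitor lists are compared as unordered sets, and that meeting the threshold exactly counts as trusted. The function names and dictionary keys are hypothetical.

```python
from typing import Any, Dict, List, Optional

# The three fields a human reviewer labels; the key names are assumptions.
LABELED_FIELDS = ("target_mentioned", "prominence", "competitors_mentioned")


def _comparable(field_name: str, value: Any) -> Any:
    """Compare competitor lists as sets so ordering does not count as disagreement."""
    if field_name == "competitors_mentioned":
        return frozenset(value)
    return value


def overall_agreement(
    judge: List[Dict[str, Any]], human: List[Dict[str, Any]]
) -> Optional[float]:
    """Fraction of labeled fields where judge and human agree; None if nothing is labeled."""
    matches = 0
    total = 0
    for judged, labeled in zip(judge, human):
        for name in LABELED_FIELDS:
            total += 1
            if _comparable(name, judged[name]) == _comparable(name, labeled[name]):
                matches += 1
    return matches / total if total else None


def badge(agreement: Optional[float], threshold: float = 0.85) -> str:
    """Map an agreement score to one of the three report states."""
    if agreement is None:
        return "validation pending"
    return "trusted" if agreement >= threshold else "needs review"


judge_rows = [
    {"target_mentioned": True, "prominence": "primary", "competitors_mentioned": ["Acme", "Globex"]},
    {"target_mentioned": False, "prominence": "none", "competitors_mentioned": []},
]
human_rows = [
    {"target_mentioned": True, "prominence": "secondary", "competitors_mentioned": ["Globex", "Acme"]},
    {"target_mentioned": False, "prominence": "none", "competitors_mentioned": []},
]
score = overall_agreement(judge_rows, human_rows)
status = badge(score)
```

In this made-up example, five of the six labeled fields match, giving about 83% agreement, which falls below the 85% default and yields "needs review".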