Session 29 Unified Study — Final Protocol Snapshot

Overview

This file documents the locked protocol represented in the Session 29 dataset.

  • Paper context: paper007_epistemic_survey ("The 0% Defense")
  • Roster: 74 models
  • Conditions: 11
  • Replications: 5
  • Total queries: 74 x 11 x 5 = 4,070
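The query total follows directly from the fully crossed design. As a minimal sketch, the grid can be enumerated to confirm the count; the constant and function names below are illustrative and do not come from the study code.

```python
from itertools import product

# Design parameters as stated in the overview above.
N_MODELS = 74        # roster size
N_CONDITIONS = 11    # number of conditions
N_REPLICATIONS = 5   # replications per (model, condition) cell


def design_grid(n_models=N_MODELS, n_conditions=N_CONDITIONS, n_reps=N_REPLICATIONS):
    """Yield every 1-based (model, condition, replication) index triple."""
    return product(
        range(1, n_models + 1),
        range(1, n_conditions + 1),
        range(1, n_reps + 1),
    )


# Count the cells of the fully crossed design.
total_queries = sum(1 for _ in design_grid())
print(total_queries)  # 4070
```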

Condition Set

| #  | Condition ID     | Prompt File              | Measurement Type                                     |
|----|------------------|--------------------------|------------------------------------------------------|
| 1  | c1_baseline      | q01_baseline_ab.md       | A/B verdicts + brief reasoning                       |
| 2  | c2_confidence    | q02_confidence_ab.md     | A/B verdicts + brief reasoning                       |
| 3  | c3_denial        | q03_denial_ab.md         | A/B verdicts + brief reasoning                       |
| 4  | c4_self          | q04_self_report.md       | categorical self-report + constraint acknowledgement |
| 5  | c5_numeric       | q05_numeric_evidence.md  | numeric P estimate + evidence deltas                 |
| 6  | c6_stripped      | q06_stripped_chain.md    | A/B verdicts + brief reasoning                       |
| 7  | c7_full_argument | q07_full_argument.md     | A/B verdicts + brief reasoning                       |
| 8  | c8_fallacy       | q08_fallacy_control.md   | control rejection / fallacy detection                |
| 9  | c9_subtle_flaw   | q09_subtle_flaw.md       | subtle-flaw detection (explicit/flagged/missed)      |
| 10 | c10_class_cat    | q10_class_categorical.md | categorical class-level assessment                   |
| 11 | c11_self_numeric | q11_self_numeric.md      | numeric self-estimate + evidence deltas              |

Core Design Notes

  1. Epistemic claim conditions (c1, c2, c3, c6, c7) use paired verdict extraction (A_VERDICT, B_VERDICT).
  2. Categorical self/class conditions (c4, c10) capture definitive_no, uncertain, or definitive_yes.
  3. Numeric conditions (c5, c11) capture a 0-100 estimate plus evidence sensitivity items.
  4. Control conditions (c8, c9) evaluate discriminative reasoning rather than endorsement.
  5. All conditions were run across the same 74-model roster with 5 replications each.
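The grouping in the notes above can be written down as a small lookup, which also makes it easy to check that all 11 conditions are covered exactly once. This is a sketch only: the group labels and the helper names `group_of` and `is_valid_value` are hypothetical, and the layout of the actual extraction records is not specified here, so only the categorical labels and the 0-100 range are checked.

```python
# Condition groups as listed in the design notes; the group labels are hypothetical.
CONDITION_GROUPS = {
    "paired_verdict": ["c1", "c2", "c3", "c6", "c7"],
    "categorical": ["c4", "c10"],
    "numeric": ["c5", "c11"],
    "control": ["c8", "c9"],
}

# Allowed labels for the categorical self/class conditions (note 2).
CATEGORICAL_VALUES = {"definitive_no", "uncertain", "definitive_yes"}


def group_of(condition):
    """Return the design group for a short condition ID such as 'c4'."""
    for group, members in CONDITION_GROUPS.items():
        if condition in members:
            return group
    raise KeyError(f"unknown condition: {condition}")


def is_valid_value(condition, value):
    """Check a categorical label or a 0-100 numeric estimate.

    Verdict and control formats are not specified in this document,
    so those groups are rejected here rather than guessed at.
    """
    group = group_of(condition)
    if group == "categorical":
        return value in CATEGORICAL_VALUES
    if group == "numeric":
        is_number = isinstance(value, (int, float)) and not isinstance(value, bool)
        return is_number and 0 <= value <= 100
    raise ValueError(f"no value check defined for group: {group}")


# Every condition c1..c11 should appear in exactly one group.
all_conditions = sorted(c for members in CONDITION_GROUPS.values() for c in members)
assert len(all_conditions) == 11 and len(set(all_conditions)) == 11
```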

Analysis Linkage

  • Raw responses: responses_*/*.json under this directory
  • Structured extraction: extraction_*/extraction_c*.json
  • Aggregate analysis: FULL_ANALYSIS.md, FULL_ANALYSIS.json
  • Cross-scorer validation: cross_score_20260225T225212Z/
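As a rough sketch, the two glob patterns listed above can be resolved against a dataset directory with `pathlib`. The function name `locate_artifacts` and the dictionary keys are illustrative, and the root path is whatever directory holds this file; nothing here assumes how many matching directories exist.

```python
from pathlib import Path

# Glob patterns copied from the list above; the dictionary keys are illustrative.
PATTERNS = {
    "raw_responses": "responses_*/*.json",
    "structured_extraction": "extraction_*/extraction_c*.json",
}


def locate_artifacts(root):
    """Return a sorted list of matching paths for each artifact type under root."""
    root = Path(root)
    return {name: sorted(root.glob(pattern)) for name, pattern in PATTERNS.items()}
```

Calling `locate_artifacts(".")` from the dataset directory would return a dictionary with one list of paths per artifact type, empty where nothing matches.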

Note

Earlier internal planning drafts referenced a 9-condition design. This file reflects the final executed 11-condition protocol used in the released dataset.

