From Transcript to Interaction Preference: Extraction Pipeline

Problem

We have structured transcripts: {speaker, addressee, raw_text, stage_direction, scene}. We need to derive each character’s PA interaction preferences per our 2-context, 15-attribute IPaS taxonomy (see PA_Interaction_Preference). The core challenge: a character’s speaking style ≠ their preference for how a PA should communicate with them.

Core Principle: Production ≠ Reception

Production (how a character speaks) provides a prior via similarity-attraction — it suggests but does not confirm preference
Reception (how a character reacts to others’ communication styles) confirms or overrides the prior
When production and reception diverge (asymmetry), the PA preference follows reception
Example: Sheldon acts autonomously but demands to be consulted → PA setting = Suggest, not Autonomous

Pipeline: Two-Pass LLM Extraction

Script: data/extractor/extract.py (all LLM calls via OpenRouter)

Raw transcript JSONL
       │
       ▼
[Pass 1 — Gemini Flash]
  Filter scenes → classify context (work / personal)
  Tag attributes → extract evidence quotes
       │
       ▼
  pass1.jsonl  (one row per scene)
       │
       ▼
[Pass 2 — Claude Sonnet]
  Per (attribute, context) pair:
  synthesize all evidence → setting + confidence
       │
       ▼
  pass2.json  (2 × 14 active matrix, raw)
       │
       ▼
[HITL Gate]
  High confidence → auto-accept (extractor_high)
  Below High → human review → accept / reject / override
       │
       ▼
  preferences.yaml  (2 × 14 active cells, with provenance)
       │
       ▼
[Anonymize]
  Character name → User A / B / ...
  Remove identifying notes
       │
       ▼
  Simulator-ready persona YAML

Pass 1 — Filter + Classify (Gemini Flash)

For each scene where the target character appears:

Filter: Does this scene contain IPaS attribute evidence? If no → skip
Classify context: work (professional tasks) or personal (personal life and social interactions)
Tag attributes: Which of 14 active attributes have observable evidence?
Extract evidence: Quote key dialogue, label as production / reception / explicit_statement

Reception signal validity:

VALID: Explicit confusion/discomfort with a register, engagement change in response to style (not content), verbal meta-commentary, accommodation shifts
INVALID: Plot-driven topic changes, content disagreement, scripted dramatic reactions

Output: {character}_{seasons}_pass1.jsonl — one JSON row per scene

Typical yield: ~90–95% of scenes with target character are relevant; most scenes are personal context for sitcom characters (social interactions dominate early seasons).

Pass 2 — Synthesize Settings (Claude Sonnet)

For each (attribute, context) pair with evidence:

Receives all Pass 1 evidence fragments for that cell
Applies three-layer determination:
- Statistical/frequency patterns → direction (who is more X than whom)
- Qualitative production patterns → anchor to a setting
- Reception evidence → confirms or overrides
Outputs: setting, confidence, evidence_summary, production_evidence[], reception_evidence[], asymmetry note

Confidence levels:

Level	Meaning
High	Production + reception evidence agree across multiple scenes
Medium-High	Strong production + some reception, or strong reception but few scenes
Medium	Production evidence only, or ambiguous reception
Low	Insufficient or contradictory evidence

Output: {character}_{seasons}_pass2.json — nested {context: {attribute: cell}}

HITL Gate (Hard Rule)

Only High-confidence cells auto-write to the final YAML. Everything below High requires human review. No exceptions.

Human reviews non-High cells with the evidence summary → decides: accept as-is / adjust setting / reject (leave empty).

Sources in final YAML:

extractor_high — auto-accepted High confidence
hitl_override — human changed or overrode the setting
mbti_accepted — MBTI/personality projection for attributes without transcript proxy
seed — manually authored for attributes with no evidence path
no_preference — confirmed no strong preference (not absence of evidence)

Output: 2 × 14 Active Preference Matrix

character: User A
matrix:
  work:
    tone_formality:
      setting: Formal
      source: extractor_high
      confidence: High
      notes: null
    ...  # 14 active attributes
  personal:
    tone_formality:
      setting: Casual
      source: extractor_high
      confidence: High
      notes: null
    ...  # 14 active attributes

Each cell carries full provenance. Empty cells (null setting) mean insufficient evidence — acceptable for sparse contexts.

Attribute Coverage Notes

11 attributes with reliable transcript proxies

Dim 1 — Expression Style: tone_formality, verbosity, emotional_engagement, guidance_level
Dim 2 — Disclosure: reasoning_visibility, uncertainty_expression
Dim 3 — Initiative: autonomy_level, proactive_outreach, task_expansion
Dim 4 — Information Flow: information_elicitation, topic_management

3 active attributes with weaker/indirect transcript evidence

process_visibility, solution_breadth, capability_boundary

These describe PA-specific behaviors with limited direct analogues in character-to-character dialogue. Cells without High-confidence evidence fall to HITL or mbti_accepted / seed. memory_privacy was removed from the active preference set on 2026-05-17 and should not be extracted into active persona matrices.

Corpus limitation: `work` context is sparse for sitcom characters

TV sitcoms are social by construction. Work scenes are uncommon, and most “work” scenes still involve social dynamics rather than task-focused PA interactions. Typical yield for sitcom characters: personal ≈ 85–95% of relevant scenes, work ≈ 5–15%. This is acceptable — it means the work context matrix cells will have lower coverage, but the personal context cells will be well-evidenced.

Anonymization

Before the persona YAML is used in the simulator:

character field → User A (or other anonymous ID)
source field in YAML points to extraction data, not character name
Simulator and harness never see the original character identity

The anonymization is shallow (name replacement) — the preference settings themselves are character-derived and carry the behavioral fingerprint. The benchmark evaluates whether a PA can learn that fingerprint, not whether it knows the source character.

Synthesis Pass (if needed)

When a single (attribute, context) cell has too many evidence segments for one Pass 2 call:

Split evidence by season (or groups of 2–3 seasons)
Run Pass 2 on each chunk → per-chunk setting + confidence
Run a synthesis call with all per-chunk results → final setting

The synthesis call uses the same Pass 2 system prompt but receives summarized per-chunk results instead of raw evidence.

MBTI Cross-Validation

MBTI serves as an independent cross-validation signal alongside transcript evidence. It is not a replacement for transcript evidence — it is a second path to the same IPaS settings, used to:

Confirm non-High cells (transcript evidence + MBTI agree → stronger basis to accept)
Fill empty cells where transcript has no proxy (source: mbti_accepted)

Why MBTI over other trait taxonomies: MBTI’s 4 binary axes map cleanly onto IPaS attributes; its type labels carry high semantic density and are freely available as fan-community canon annotations for most TV characters; and the binary structure avoids needing continuous-value thresholds that a 10-persona sample has no statistical power to calibrate.

MBTI → IPaS Projection Table (v1)

Each attribute is driven by one primary MBTI axis with secondary modifiers. This table is a working draft — Sheldon (INTJ) pilot was the first validation pass.

IPaS attribute	Primary axis	+ value	− value	Modifiers
verbosity	N/S	N → Detailed	S → Moderate	T +1 toward Detailed; E +1; I −1
emotional_engagement	T/F	T → Task-focused	F → Relationship-focused	E +1 toward visible
tone_formality	T/F	T → Consultative	F → Casual	J +1 toward Formal
autonomy_level	J/P × E/I	—	—	needs trust-level modifier; fall back to canon
process_visibility	J/P	J → Bookend	P → Silent	N +1 toward Full narration
information_elicitation	J/P	J → Structured	P → Iterative	T +1 toward Structured
topic_management	J/P	J → Organize	P → Follow user’s flow	N +1 toward Follow user’s flow
reasoning_visibility	N/T	N+T → Show	S+F → Summarize/Hide	J +1 toward Show
solution_breadth	N/S	N → High	S → Low	P +1 toward High; J −1
task_expansion	N/S	N → High	S → Low	E +1 higher
proactive_outreach	weak canon only	—	—	Do not map directly from E/I; social initiation is not the same as wanting PA reminders, check-ins, or after-task follow-up.
guidance_level	(canon)	—	—	projection weak; fall back to canon
capability_boundary	weak canon only	—	—	Do not map directly from T/F; this is a failure/limit recovery preference: agent-side workaround vs diagnosis plus control handoff back to the user.
uncertainty_expression	—	—	—	no strong MBTI axis; infer from high-conf anchor cells

Character MBTI Annotations

Character	MBTI	Notes
Sheldon Cooper	INTJ	High consensus
Leonard Hofstadter	ISFJ / INFP	Debated; needs canon check
Penny	ESFP	High consensus
Raj Koothrappali	INFP / ENFP	Season-dependent (selective mutism → I; alcohol/later seasons → E)
Michael Scott	ENFP	High consensus
Dwight Schrute	ISTJ	High consensus
Richard Hendricks	INFP / INTP	Research context leans INTP
Gilfoyle	INTJ / ISTP	Debated
Jared Dunn	ENFJ	High consensus

Trait Synthesis Pipeline

When transcript evidence is sparse, MBTI provides a structured path to fill and validate non-High cells:

High-conf anchor cells
    │
    ▼
[Invert] Anchor cells → best-fit MBTI type
    (check internal consistency first — if anchors split across MBTI types,
     invert per-context separately or fall back to seed)
    │
    ▼
    [Project] MBTI type + projection table → predicted settings for all 14 active attributes
    │
    ├── Empty cells → MBTI prediction → HITL review  (source: mbti_accepted)
    │
    └── Non-High cells → compare MBTI prediction vs extractor output
             ├── Agree  → stronger basis to accept
             └── Disagree → flag for closer HITL scrutiny

Context Window Budget

Approximate costs per character for S01–S03 extraction:

Stage	Model	Approx. cost
Pass 1 (S01–S03, ~300 scenes)	Gemini Flash	~$0.04
Pass 2 (28 active cells)	Claude Sonnet	~$0.50–1.00
Total per character (S01–S03)		~$1–2

Scaling to full S01–S10: Pass 1 ~ $0.40, P a ss 2 r o ug h l y t h es am e (m or ee v i d e n ce p er ce l l, n o t m or ece l l s) . T o t a l$ 3–6 per character, ~$30–60 for 10 characters.

Implementation Status

Component	Status
Transcript data — TBBT S1-S10, parsed JSONL (`data/transcripts/tbbt/`)	✅ Done
`extract.py` — two-pass extractor with OpenRouter	✅ Done
Rubrics — Pattern_to_Preference_Rubrics (14 active attributes; historical Memory & Privacy deprecated)	✅ Done
IPaS taxonomy — 2 contexts × 14 active attributes — PA_Interaction_Preference	✅ Done (updated 2026-05-17: memory_privacy deprecated)
User A (Sheldon S01-S03): Pass 1 + Pass 2 + HITL → 2×15 YAML	✅ Done
Penny (S01-S03): Pass 1 + Pass 2 + HITL decisions done; runtime preference + identity anonymized as `User B`; `User_B_World_Design.md` created; generator reads per-persona world design	✅ Ready for session scripting
Additional personas	❌ Not started

Open Questions

How many personas are needed for benchmark validity? Current plan: 5–10 from different shows/archetypes
Should later seasons be weighted more heavily (character development)? Currently using S01-S03 as baseline
Minimum scene count per (attribute, context) cell for reliable High-confidence attribution?
Cross-character validation: do Pass 2 settings for the same attribute cluster meaningfully across characters with known personality differences?

MemPA Wiki

Explorer

Transcript_to_Preference_Workflow

From Transcript to Interaction Preference: Extraction Pipeline

Problem

Core Principle: Production ≠ Reception

Pipeline: Two-Pass LLM Extraction

Pass 1 — Filter + Classify (Gemini Flash)

Pass 2 — Synthesize Settings (Claude Sonnet)

HITL Gate (Hard Rule)

Output: 2 × 14 Active Preference Matrix

Attribute Coverage Notes

11 attributes with reliable transcript proxies

3 active attributes with weaker/indirect transcript evidence

Corpus limitation: `work` context is sparse for sitcom characters

Anonymization

Synthesis Pass (if needed)

MBTI Cross-Validation

MBTI → IPaS Projection Table (v1)

Character MBTI Annotations

Trait Synthesis Pipeline

Context Window Budget

Implementation Status

Open Questions

Graph View

Table of Contents

Backlinks

MemPA Wiki

Explorer

Transcript_to_Preference_Workflow

From Transcript to Interaction Preference: Extraction Pipeline

Problem

Core Principle: Production ≠ Reception

Pipeline: Two-Pass LLM Extraction

Pass 1 — Filter + Classify (Gemini Flash)

Pass 2 — Synthesize Settings (Claude Sonnet)

HITL Gate (Hard Rule)

Output: 2 × 14 Active Preference Matrix

Attribute Coverage Notes

11 attributes with reliable transcript proxies

3 active attributes with weaker/indirect transcript evidence

Corpus limitation: work context is sparse for sitcom characters

Anonymization

Synthesis Pass (if needed)

MBTI Cross-Validation

MBTI → IPaS Projection Table (v1)

Character MBTI Annotations

Trait Synthesis Pipeline

Context Window Budget

Implementation Status

Open Questions

Graph View

Table of Contents

Backlinks

Corpus limitation: `work` context is sparse for sitcom characters