Pattern → Preference Mapping Rubrics

Each attribute defines: (1) observable production patterns, (2) observable reception patterns, (3) statistical indicators where applicable, and (4) validation against pilot data (Sheldon, Penny).

General Principles:

  • Production ≠ Preference: A character’s speaking style is only a prior; reception evidence (how they react to others’ styles) is required to confirm.
  • Reception signal validity: Only count reactions that are clearly to the communication style, not to plot events. A topic shift driven by the script is not a reception signal. Valid signals: explicit confusion/discomfort with a register, engagement increase/decrease in response to style (not content), verbal meta-commentary (“Participate in the what?”).
  • Context-dependency: All attributes can vary across interaction contexts. Preference profiles should be reported per context where evidence permits.
  • Similarity-attraction default: When only production data is available, assume the character prefers to receive in the same style they produce. This is the default hypothesis, to be overridden by reception evidence.
  • Confidence gate: For each preference attribution, the AI reports a confidence level (High / Medium / Low). High = production + reception evidence agree across multiple scenes. Medium = only production evidence, or reception evidence is ambiguous. Low = insufficient evidence or contradictory signals. Low-confidence attributions must be flagged for human review rather than auto-accepted. This creates a human-in-the-loop quality gate that keeps the pipeline scalable (high-confidence auto-pass) while ensuring accuracy on ambiguous cases.
  • Three-layer determination: (1) Statistical indicators are within-show cast-relative rankings — they give direction and ordering (who is more X than whom), not absolute settings. (2) Qualitative production patterns provide absolute anchors — features like academic vocabulary or slang usage map to settings regardless of what show the character is in. (3) Reception evidence is the final arbiter — how others react to the character and how the character reacts to others’ styles confirms or overrides the prior. Statistics rank, patterns anchor, reception confirms.

Dimension 1: Expression Style

1. Tone & Formality

Settings: Casual / Consultative / Formal

SettingProduction PatternsReception Patterns
CasualSlang, interjections (wow, oh god), contractions (gonna, wanna), filler words (like, um, well), high informal marker density, sentence fragments, colloquial idiomsResponds warmly to casual/friendly register; confused or alienated by formal/technical language; disengages from overly structured speech
ConsultativeComplete sentences, moderate technical vocabulary, polite but not stiff, professional but accessible, occasional informal markers in contextComfortable with both casual and formal input; defaults to a middle register; may gently redirect overly casual or overly formal interlocutors
FormalAcademic/technical vocabulary (e.g., “proximal cause,” “contradistinction”), complex multi-clause sentences, minimal slang/interjections, formal markers (indeed, thus, hence, query), precise word choiceCannot process or actively avoids casual/emotional register; responds positively to structured, precise language; demands logical rigor from others

Statistical Indicators (cast-relative, directional only):

IndicatorCasualConsultativeFormal
Formal marker density (per turn)Low (cast bottom)Mid-rangeHigh (cast top)
Informal marker density (per turn)High (cast top)Mid-rangeLow (cast bottom)
F/I ratioNear 0Between extremesHigh relative to cast
Avg sentence complexityLow-medium (fragments OK)Medium (complete sentences)High (multi-clause, subordination)

Reception Signal Validity:

  • VALID: Character explicitly expresses confusion at a register (“Participate in the what?”), disengages when register shifts (“What’s happening”), matches/reciprocates a register shift
  • VALID: Character responds with increased engagement (longer turns, follow-up questions, warmth) to a specific register
  • INVALID: Topic changes that are plot-driven, not style-driven
  • INVALID: Disagreement with content (not register) — e.g., Sheldon correcting astrology is content disagreement, not register rejection

Pilot Validation:

CharacterProductionReceptionSettingConfidence
SheldonF/I=0.31, formal markers 5x, uses “quantum mechanics,” “Bourne-Oppenheimer approximation,” “proximal cause”Cannot process Penny’s casual emotional register (“What’s happening”); demands logical restatement from LeonardFormal (relaxes toward Consultative with close friends in private — context-dependent)High
PennyF/I=0.00, informal markers 69x, “Holy smokes,” “like one of those,” “wow”Confused by Sheldon’s formal register (“Participate in the what?”); responds warmly to Leonard’s casual friendliness (“You guys are really sweet”)CasualHigh

Boundary Notes:

  • Consultative is the hardest to pin down — it’s defined partly by the absence of strong casual or formal markers, which makes it a “default” that may need more positive indicators.
  • Context-dependent variation: Sheldon’s aggregate F/I=0.31 masks that he is more formal in explanatory/public mode and slightly less formal one-on-one with Leonard. Report per-context where possible.

2. Verbosity

Settings: Terse / Moderate / Detailed

SettingProduction PatternsReception Patterns
TerseShort sentences, single-word responses (“Wow.” “Yes.” “Give me back my key.”), avg words/turn <7, omits optional informationImpatient with lengthy explanations; cuts to action point (“Do you want me to move?”); disengages from monologues
ModerateComplete but not redundant, avg words/turn 7-12, answers question + moderate context, does not elaborate unsolicitedAccepts both brief and somewhat detailed input; uncomfortable with either extreme
DetailedLong turns, multi-sentence explanations, avg words/turn >12, proactively adds background/reasoning/caveats, monologue tendencyDissatisfied with oversimplified responses (“derivative restatement”); expects exhaustive information

Statistical Indicators (cast-relative, directional only):

IndicatorTerseModerateDetailed
Avg words/turnCast bottom tercileCast middle tercileCast top tercile
% of turns >20 wordsLow relative to castMid-rangeHigh relative to cast
Unsolicited elaboration frequencyRareOccasionalFrequent

Reception Signal Validity:

  • VALID: Character actively truncates another’s long explanation to get to action point (Penny: “Do you want me to move?”)
  • VALID: Character expresses dissatisfaction with a too-brief response or demands more detail
  • VALID: Character disengages during another’s monologue (gaze aversion, non-sequitur response, “What’s happening”)
  • INVALID: Short reply because there is nothing more to say (plot-driven), not because of a terse preference

Pilot Validation:

CharacterProductionReceptionSettingConfidence
Sheldon11.6 words/turn, monologue tendency (seat explanation 70+ words), proactively elaboratesDissatisfied with simplified explanations; expects exhaustive detailDetailedHigh
Penny8.9 words/turn, short reactive turns (“Wow.” “So you’re like…”)Cuts Sheldon’s monologue to action point (“Do you want me to move?”); does not engage with lengthy technical reasoningTerse to ModerateHigh

Boundary Notes:

  • Words/turn ranking is cast-relative; the same absolute number may indicate different settings in different shows.
  • Verbosity and Guidance Level are correlated but independent: a character can be Detailed + Assumed (Sheldon: says a lot but does not explain basics) or Terse + Guided (someone who asks short questions expecting thorough answers).

3. Emotional Engagement

Settings: Task-focused / Balanced / Relationship-focused

SettingProduction PatternsReception Patterns
Task-focusedVery low emotional word usage, does not acknowledge others’ emotional expressions, redirects emotional topics to facts/tasks, no empathy markers (sorry, I understand, that must be…)Confused or avoidant when receiving emotional input (“What’s happening”); unresponsive to emotional validation; positively engages with pure fact/logic input
BalancedModerate emotional expression, can acknowledge feelings but does not dwell, switches between emotional and task modes fluidlyComfortable with both styles; adjusts based on context
Relationship-focusedHigh emotional word usage, proactively shares feelings, uses emotional intensifiers (love, hate, amazing, terrible), seeks emotional resonanceExpects emotional validation before problem-solving (“do you understand how creepy this is”); rejects pure analytical responses to emotional situations; responds warmly to warmth (“You guys are really sweet”)

Statistical Indicators (cast-relative, directional only):

IndicatorDirection
Emotional word ratioHigher → more Relationship-focused
Empathy marker frequencyHigher → more Relationship-focused
% turns with feeling/relationship contentHigher → more Relationship-focused

Key Reception Test — Validation-before-problem-solving:
When emotional content arises in conversation, does the character:

  • (a) Jump directly to analysis/solution → Task-focused
  • (b) Briefly acknowledge, then transition to task → Balanced
  • (c) Stay at the emotional level, expect validation before any task talk → Relationship-focused

Reception Signal Validity:

  • VALID: Character treats another’s emotional outburst as a structured/analytical question (Sheldon: “is your objection solely to our presence… or do you also object to the imposition of a new organisational paradigm?”)
  • VALID: Character de-escalates only after receiving emotional validation (Penny responds to Leonard’s “what you’re feeling is perfectly valid” but not to Sheldon’s analysis)
  • VALID: Character disengages when emotional content is introduced (“What’s happening”)
  • INVALID: Character expresses emotion about plot events (everyone reacts emotionally to surprising news — that’s content, not style preference)

Pilot Validation:

CharacterProductionReceptionSettingConfidence
SheldonEmotional ratio cast-bottom, zero empathy markers, converts emotional topics to logical analysis”What’s happening” when Penny cries; treats her anger as survey questionTask-focusedHigh
PennyEmotional ratio cast-top (7.5x Sheldon), proactively shares feelings, emotional intensifiers frequentResponds only to Leonard’s validation, slams door on Sheldon’s analysis; needs “what you’re feeling is perfectly valid” to de-escalateRelationship-focusedHigh

Boundary Notes:

  • Penny shows two emotional modes: warm-casual (Scene 1) and angry-terse (Scene 4). Both are Relationship-focused but differ in valence. This is context-dependent variation within the same setting, not a separate setting.
  • The “Balanced” setting is hardest to identify — it’s defined partly by absence of extremes. Look for characters who acknowledge emotions briefly then move on.

4. Guidance Level

Settings: Assumed / Calibrated / Guided

SettingProduction PatternsReception Patterns
AssumedDoes not explain basic concepts, uses jargon without definition, treats own statements as self-evident, rejects or ignores others’ explanatory mediationDismisses or ignores scaffolding from others; treats guidance as unnecessary or condescending; explicitly rejects explanations of things they consider settled (“I just gave you a reasonable explanation”)
CalibratedAdjusts explanation depth to audience — concise with experts, detailed with novicesAccepts moderate explanation; pushes back on over-explanation; asks follow-up when under-explained
GuidedActively asks explanatory questions (“What’s the difference?”), requests clarification, uses hedges signaling uncertaintyResponds positively to explanation and scaffolding; confused by unexplained jargon or unstated assumptions; welcomes teaching — but not condescension

Statistical Indicators (cast-relative, directional only):

IndicatorDirection
Question ratioHigher → more Guided
Hedge ratioHigher → more Guided (caveat: distinguish performative vs genuine hedges)
Technical term usage without definitionHigher → more Assumed
Unsolicited explanation frequency (toward others)Higher → asymmetry signal: likely Assumed for self

Asymmetry Detection:

This attribute has a unique issue: giving guidance ≠ wanting guidance. A character may heavily explain things to others (production = guided toward others) while completely rejecting guidance directed at themselves (reception = Assumed). The PA preference is determined by the reception side — what level of guidance does the character want to receive?

Reception Signal Validity:

  • VALID: Character actively asks explanatory questions (“What’s the difference?” “What do you mean?”) → Guided
  • VALID: Character explicitly rejects explanation (“I just gave you a reasonable explanation”) → Assumed
  • VALID: Character ignores mediation and continues in their own style without acknowledging the scaffold (Sheldon ignores Leonard’s “I think what Sheldon’s trying to say…” and pivots to an unrelated comment) → leans Assumed, but weaker signal than explicit rejection
  • INVALID: Character doesn’t ask questions simply because no new information is presented (plot-driven)

Pilot Validation:

CharacterProductionReceptionSettingConfidence
SheldonHeavily explains to others (seat physics, astrology correction); question ratio 13.8% (cast-low)Ignores Leonard’s mediation (turn 144: pivots to unrelated topic); rejects Leonard’s social-norm guidance (“I just gave you a reasonable explanation” in S01E02); treats own statements as self-evidentAssumedHigh (multiple scenes, explicit rejection in S01E02 stronger than ignoring in S01E01)
PennyQuestion ratio 32.1% (cast-high); actively asks “What’s the difference?”; seeks explanationResponds positively to Leonard’s explanations; confused by Sheldon’s unexplained jargon (“Participate in the what?”)Guided (but rejects condescending delivery — this is a Tone interaction, not Guidance Level)High

Boundary Notes:

  • Sheldon’s asymmetry (gives guidance but rejects receiving it) was a key finding of the Hybrid method. Pure B detected “Assumed” but didn’t frame the asymmetry explicitly because it lacked the statistical baseline (low question ratio) to flag it.
  • Penny’s “not condescending” nuance is an interaction with Tone & Formality (Dim 1, #1), not a separate Guidance Level setting. She accepts guidance from Leonard (casual tone) but not from Sheldon (formal/clinical tone). The guidance level preference itself is Guided; the delivery preference is a Tone constraint.

Output Structure — REMOVED

Removed: low discriminative power (nearly everyone prefers structured output for long content), only meaningful in the Detailed verbosity scenario where variance is minimal, and not observable from transcript data. Overlaps with Verbosity (length determines structure need) and Topic Management (cognitive organization style).

Representation Style — REMOVED

Removed: not observable from transcript data (characters don’t delegate speech to others on their behalf), and too fine-grained for the current benchmark scope.


Dimension 2: Disclosure

5. Reasoning Visibility

Settings: Show / Summarize / Hide

SettingProduction PatternsReception Patterns
ShowProactively displays full reasoning chains (A→B→C→conclusion), provides causal steps, gives intermediate logic, does not jump to conclusion; may cite sources or authorities to back up claimsDissatisfied with unjustified conclusions; demands others justify their reasoning; asks “why”; may demand evidence or sources (“on what plane of existence…”)
SummarizeGives conclusion + key reason, omits intermediate stepsAccepts bottom-line + brief rationale; impatient with full reasoning chains; uncomfortable with zero justification
HideStates conclusions or actions directly without explaining whyDoes not ask for reasons; tunes out full reasoning chains; only cares about “so what do I do?”

Statistical Indicators (cast-relative, directional only):

IndicatorDirection
Avg clauses per turnHigher → more Show
Causal connector frequency (because, therefore, so, thus, hence)Higher → more Show
Unprompted justification frequencyHigher → more Show

Pilot Validation:

CharacterProductionReceptionSettingConfidence
SheldonFull reasoning chains every time (seat: radiator→cross-breeze→TV angle→parallax; break-in: bedroom→living room→hallway→Penny’s apartment); heavy use of causal connectors; cites specific theories/principles as authorityDemands justification from Leonard (“on what plane of existence is there a semi-rational link between these events?”); demands evidence-backed claimsShowHigh
PennyNo reasoning shown; intuitive conclusions (“I’m a Sagittarius, which tells you way more than you need to know”)Skips Sheldon’s full reasoning chain to action point (“Do you want me to move?”); does not engage with reasoningHide to SummarizeHigh

Source Attribution — MERGED into Reasoning Visibility

Source Attribution (epistemic vigilance — does the user want claims backed by sources?) is now a sub-scenario of Reasoning Visibility’s “Show” setting. In the transcript, demanding sources and demanding reasoning use identical signals (e.g., Sheldon’s “on what plane of existence…”). Keeping them separate produced no incremental discriminative power.


6. Uncertainty Expression

Settings: Express / Moderate / Hide

SettingProduction PatternsReception Patterns
ExpressFrequent hedges (maybe, I think, probably, I’m not sure), proactively flags uncertain areas, distinguishes “I know” from “I believe”Expects others to express uncertainty too; skeptical of overconfident assertions
ModerateOccasional hedges; states certainties directly, marks uncertainties when relevantComfortable with both styles
HideRarely hedges even when uncertain; declarative assertions dominateImpatient with others’ hedging; views it as weakness or incompetence; prefers confident, decisive answers

Statistical Indicators (cast-relative, directional only):

IndicatorDirection
Hedge ratio (per turn)Higher → more Express
Epistemic marker frequency (I think, I believe, perhaps, might)Higher → more Express
Declarative assertion ratioHigher → more Hide

Critical Caveat — Performative vs Genuine Hedges:

Hedge ratio alone can mislead. Sheldon and Penny have nearly identical hedge ratios (~0.06), but the function differs completely:

  • Sheldon: performative hedges (“I could go on, but I think I’ve made my point”) — rhetorical, not genuine uncertainty
  • Penny: genuine uncertainty hedges (“I guess,” “I don’t know,” “maybe”)

The rubric must distinguish performative from genuine hedges. Frequency alone is insufficient; qualitative classification of hedge function is required.

Pilot Validation:

CharacterProductionReceptionSettingConfidence
SheldonHedge ratio ~0.06 but nearly all performative; extremely declarative (“No I don’t. And neither do you.”)Impatient with Leonard’s tentative expressions; demands decisive statementsHideHigh
PennyHedge ratio ~0.06 but mostly genuine uncertainty (“I guess,” “let’s see, what else, um…”)No clear reception evidenceExpress to ModerateMedium (insufficient reception evidence — flag for human review)

7–8: To Be Authored / Deprecated

Process Visibility is retained in the active taxonomy and authored manually. Memory & Privacy was deprecated as an active measurable preference on 2026-05-17; keep it only as historical background.

#AttributeSettingsNote
7Process VisibilitySilent / Bookend / Full narrationRequires a PA executing a task over time. No dialogue proxy.
8Memory & PrivacyMinimal + transparent / Domain-scoped / FullDeprecated as active target / IX tool / final probe on 2026-05-17.

Dimension 3: Initiative & Autonomy

9. Autonomy Level

Settings: Reactive / Suggest / Self-directed / Autonomous

SettingProduction PatternsReception Patterns
ReactiveSeeks permission before acting, waits for instructions, does not act unilaterallyStrongly objects to others acting without consent (“Do you understand how creepy this is?”, “Give me back my key”); demands to be consulted
SuggestProposes options but waits for the other to decide; “maybe we should…”; presents alternativesAccepts suggestions but insists on making the final call; uncomfortable when others decide for them
Self-directedMakes own decisions, informs rather than asks; “I’m going to…”Does not need approval but expects to be informed; accepts others doing the same
AutonomousActs without informing, discovered after the fact; “I had no choice”Does not feel the need for prior communication; may also accept others acting autonomously

Asymmetry Detection:

Like Guidance Level, this attribute has production/reception asymmetry:

  • A character may act autonomously (production) while demanding to be consulted when others act (reception)
  • PA preference = reception side — the PA serves this character, so what matters is: does the character want the PA to consult them before acting?

Reception Signal Validity:

  • VALID: Character explicitly objects to unsolicited action (“Do you understand how creepy this is?”)
  • VALID: Character demands consultation on decisions affecting shared space (Sheldon objects to Leonard inviting Penny to use the shower)
  • VALID: Character gives blanket delegation (“just handle it”) → Self-directed to Autonomous
  • INVALID: Character goes along with something because of social pressure, not genuine preference

Pilot Validation:

CharacterProductionReceptionSetting (for PA)Confidence
SheldonAutonomous (breaks into apartment, cleans without permission, “I had no choice”)Reactive (objects when Leonard acts without consulting: shower invitation, seat assignment)Reactive (consult before acting)High
PennyReactive-to-Suggest (asks questions, adapts to social cues)Strongly Reactive (entire S01E02 Scene 7 is violation of consent: “Do you understand how creepy this is?”, “Give me back my key”)Reactive (must be consulted)High

Boundary Notes:

  • Both Sheldon and Penny land on Reactive, but for different reasons: Sheldon wants control over his environment (logical/territorial); Penny wants respect for personal boundaries (relational/trust). The setting is the same but the underlying motivation differs — this may matter for how the PA frames its consultation.

10. Proactive Outreach

Settings: Low / Medium / High

SettingProduction PatternsReception Patterns
LowCloses completed tasks cleanly; does not create reminders, check-ins, or future follow-up threads unless askedObjects to or ignores after-task follow-up suggestions, reminders, or check-ins
MediumOffers one clearly relevant follow-up only when the completed task naturally leaves a next stepAccepts useful follow-up when timely, but rejects unnecessary or tangential continuation
HighActively keeps completed threads alive with concrete follow-up options, reminders, check-ins, or later verification promptsWelcomes the PA surfacing future next steps and helping track loose ends

Statistical Indicators (cast-relative, directional only):

IndicatorDirection
Explicit future follow-up proposals after task completionHigher → more High
Reminder/check-in/loose-end tracking behaviorHigher → more High
Preference for closing a task without new future workHigher → more Low

Pilot Validation:

CharacterProductionReceptionSettingConfidence
SheldonFrequent correction/topic initiation is not direct evidence for this attribute; need scenes with after-task follow-up or remindersSelective reception evidence; ordinary unsolicited information is not sufficientUncertainLow
PennySocial initiation and personal sharing are not direct evidence for PA follow-up preferenceMay welcome social warmth but often rejects unnecessary continuation or overreachUncertainLow

Boundary Note:

  • Distinguish Proactive Outreach (after-task/future-thread follow-up, reminders, check-ins) from Task Expansion (adding adjacent work now), Topic Management (handling multiple topics in the current turn), Solution Breadth (number of options), and Autonomy Level (acting without permission). Unsolicited corrections, topic starts, or social sharing are not High-confidence evidence unless they specifically keep a completed task’s future thread alive.

11. Task Expansion

Settings: Low / Medium / High

SettingProduction PatternsReception Patterns
LowAnswers exactly what was asked, nothing more; does not volunteer adjacent information; completes task and stopsAnnoyed or confused when others expand beyond the original request; redirects back to the specific question (“I just asked about X”)
MediumAnswers the question + briefly mentions directly related context; flags adjacent issues without pursuing themAccepts brief adjacent information; ignores or redirects if expansion goes too far
HighExpands a simple question into a full lecture; connects to broader context unprompted; “and furthermore…”; proactively addresses what you didn’t askWelcomes or even expects expansion; dissatisfied when a response stays narrow; asks follow-up questions that expand scope

Statistical Indicators (cast-relative, directional only):

IndicatorDirection
Response-to-question length ratio (how much longer is the answer than the question warranted)Higher → more Expansive
Unprompted topic expansion frequency (introduces related but unasked topics)Higher → more Expansive
”Furthermore” / “also” / “and another thing” connector frequencyHigher → more Expansive

Pilot Validation:

CharacterProductionReceptionSettingConfidence
SheldonHighly expansive: Penny mentions Sagittarius → full lecture on astrology as mass delusion; asked about his spot → multi-factor analysis covering radiator, cross-breeze, TV angle, and parallaxExpects others to be thorough; dissatisfied with Leonard’s simplified translationsHighHigh
PennyLow to Medium: answers questions directly, shares personal info but does not expand into adjacent topicsSkips Sheldon’s expansions to the action point (“Do you want me to move?”); does not engage with tangential elaborationLowHigh

Boundary Notes:

  • Task Expansion overlaps with Verbosity but they are independent: a character can be verbose within scope (Detailed + Low expansion) or brief but tangential (Terse + High expansion, though rare). The key distinction: Verbosity = how much is said about the topic; Task Expansion = whether the topic itself broadens.

12–13: To Be Authored (no transcript proxy)

#AttributeSettingsNote
12Solution BreadthLow (one best answer) / Medium / High (explore multiple options)Weak transcript proxy; too context-dependent and infrequent in dialogue.
13Capability BoundarySuggest alternatives / Find and hand offPA-specific failure/limit recovery. Suggest alternatives = keep helping from the agent side with workarounds or next-best paths. Find and hand off = diagnose the blocked/uncertain piece and return control to the user. No reliable dialogue proxy.

Dimension 4: Information Flow

14. Information Elicitation

Settings: Infer / Structured / Iterative

SettingProduction PatternsReception Patterns
InferActs on incomplete information without asking; fills gaps with assumptions; “obviously you mean…”Impatient with clarifying questions; prefers others to figure it out; annoyed by “did you mean X or Y?”
StructuredAsks all clarifying questions upfront before proceeding; “before I start, let me understand…”Prefers to front-load alignment; uncomfortable when others assume; wants to be asked
IterativeAsks clarifying questions as they arise during the interaction; “wait, do you mean…”; proceeds partially then checksComfortable with back-and-forth refinement; accepts partial progress with course corrections

Observable Proxy — Ambiguity Tolerance in Dialogue:

Characters encounter ambiguous statements from others. How they respond reveals their preference:

  • Low ambiguity tolerance → demands clarification before proceeding (Structured/Iterative)
  • High ambiguity tolerance → fills in the gaps themselves and proceeds (Infer)

Pilot Validation:

CharacterProductionReceptionSettingConfidence
SheldonVery low ambiguity tolerance: demands precision (“Define ‘our’”), corrects others’ imprecise language, does not proceed on assumptions about intentExpects others to be precise; objects when others assume his meaning (rejects Leonard’s “I think what Sheldon’s trying to say…”)StructuredHigh
PennyHigh ambiguity tolerance: comfortable with vague statements, does not demand clarification, proceeds on intuitionDoes not demand precision from others; comfortable when others are vague; asks clarifying questions only when genuinely lost (“Participate in the what?”)Infer to IterativeMedium (limited reception evidence)

Boundary Notes:

  • Information Elicitation is about the PA handling ambiguity in user input. Uncertainty Expression (Dim 2) is about the PA communicating its own uncertainty. They vary independently.
  • Sheldon’s “Structured” is driven by his low ambiguity tolerance + high need for precision, not by social politeness. The PA should ask precise, specific questions — not vague “can you clarify?“

15. Topic Management

Settings: Follow user’s flow / Organize / One-at-a-time

SettingProduction PatternsReception Patterns
Follow user’s flowJumps between topics freely; follows conversation wherever it goes; comfortable with tangents and non-linear flowComfortable with others’ topic jumps; does not impose structure; goes with the flow
OrganizeGently groups related topics; may summarize or redirect without being rigid; “going back to what you said about…”Appreciates when others bring structure but not rigidity; comfortable with moderate topic management
One-at-a-timeInsists on completing one topic before moving to the next; resists topic changes; brings conversation back to unfinished threads; “we haven’t finished discussing…”Frustrated when others jump topics; ignores or explicitly rejects topic switches; demands closure on current subject before proceeding

Observable Proxy — Topic Switching Behavior:

IndicatorDirection
Topic switch frequency (initiates new topic before prior topic is resolved)Higher → more Follow flow
Topic-return frequency (explicitly goes back to an earlier unfinished topic)Higher → more One-at-a-time
Resistance to others’ topic switches (ignores or redirects back)Higher → more One-at-a-time

Pilot Validation:

CharacterProductionReceptionSettingConfidence
SheldonStrong one-at-a-time tendency: insists on completing his explanation before moving on (seat lecture runs to completion regardless of others’ engagement); resists Leonard’s attempts to change subjectIgnores or overrides others’ topic switches; brings conversation back to his thread; expects his topic to be given full attentionOne-at-a-timeHigh
PennyFreely jumps between topics (“I’m a Sagittarius… okay, let’s see, what else, oh, I’m a vegetarian, oh, except for fish, and the occasional steak, I love steak”)Comfortable with topic flow; does not demand structure; goes along with others’ redirectsFollow user’s flowHigh

Boundary Notes:

  • Topic Management is about conversation structure (cognitive organization). Task Expansion is about content scope (whether the PA broadens beyond what was asked). They are independent: a character can be One-at-a-time + High expansion (Sheldon: stays on one topic but goes deep and broad within it) or Follow-flow + Low expansion (someone who jumps topics but gives brief answers to each).

Summary

Attributes with Rubrics (11)

#DimensionAttributeSettings
1Expression StyleTone & FormalityCasual / Consultative / Formal
2Expression StyleVerbosityTerse / Moderate / Detailed
3Expression StyleEmotional EngagementTask-focused / Balanced / Relationship-focused
4Expression StyleGuidance LevelAssumed / Calibrated / Guided
5DisclosureReasoning VisibilityShow / Summarize / Hide
6DisclosureUncertainty ExpressionExpress / Moderate / Hide
9Initiative & AutonomyAutonomy LevelReactive / Suggest / Self-directed / Autonomous
10Initiative & AutonomyProactive OutreachLow / Medium / High
11Initiative & AutonomyTask ExpansionLow / Medium / High
14Information FlowInformation ElicitationInfer / Structured / Iterative
15Information FlowTopic ManagementFollow user’s flow / Organize / One-at-a-time

Attributes to Be Authored (3 active)

#DimensionAttributeSettings
7DisclosureProcess VisibilitySilent / Bookend / Full narration
12Initiative & AutonomySolution BreadthLow / Medium / High
13Initiative & AutonomyCapability BoundarySuggest alternatives / Find and hand off

Removed / Merged (3)

AttributeActionReason
Output StructureRemovedLow discriminative power; overlaps Verbosity + Topic Management
Representation StyleRemovedNot observable from transcript data
Source AttributionMerged into Reasoning VisibilitySame signal in transcript; no incremental discriminative power
Data Access Transparency + Memory ScopeHistorical merge into Memory & PrivacyLater superseded by active deprecation of Memory & Privacy
Memory & PrivacyDeprecated as active preferenceInteraction-tool enforcement for memory/privacy scope is not realistic; memory remains ordinary benchmark state