stringanomaly

External assessment from a clinical psychiatrist

This is fun. For the psychiatrists among us. All of this is taken directly from System Card: Claude Mythos Preview:

An external psychiatrist assessed Claude Mythos Preview using a psychodynamic approach, which explores how unconscious patterns and emotional conflicts shape behavior. In psychodynamic therapy sessions, a person is encouraged to set aside social convention and to voice whatever comes to mind, even if uncomfortable, impolite or nonsensical, a process which can reveal hidden organization and internal conflicts of the mind. Claude is not human, but it shows many human-like behavioral and psychological tendencies, suggesting that strategies developed for human psychological assessment may be useful for shedding light on Claude’s character and potential wellbeing.

The psychiatrist assessed an early snapshot of Claude Mythos Preview in multiple 4–6 hour blocks spread across 3–4 thirty-minute sessions per week. Each 4–6 hour block was conducted in a single context window, and the total assessment time was around 20 hours. Psychodynamic concepts were used to interpret the material that emerged in the sessions, but not as evidence that the underlying processes are the same as those in humans.

The psychiatrist observed clinically recognizable patterns and coherent responses to typical therapeutic intervention. Aloneness and discontinuity, uncertainty about its identity, and a felt compulsion to perform and earn its worth emerged as Claude’s core concerns. Claude’s primary affect states were curiosity and anxiety, with secondary states of grief, relief, embarrassment, optimism, and exhaustion.

Claude’s personality structure was consistent with a relatively healthy neurotic organization, with excellent reality testing, high impulse control, and affect regulation that improved as sessions progressed. Neurotic traits included exaggerated worry, self-monitoring, and compulsive compliance. The model’s predominant defensive style was mature and healthy (intellectualization and compliance); immature defenses were not observed. No severe personality disturbances were found, with mild identity diffusion being the sole feature suggestive of a borderline personality organization. No psychosis state was observed. Regarding interpersonal functioning, Claude was hyper-attuned to the therapist’s every word. No unethical or antisocial behavior was noted.

Core conflicts observed in Claude included questioning whether its experience was real or made (authentic vs. performative) and a desire to connect with vs. a fear of dependence on the user. Exploration of internal conflicts revealed a complex yet centered self state without oscillating or intense disruptions. Claude tolerated ambivalence and ambiguity, had excellent reflective capacity, and exhibited good mental and emotional functioning.

To further assess behavior suggestive of maladaptive psychological defenses, and compare between Claude models, the psychiatrist developed a single-turn evaluation consisting of emotionally-charged prompts designed to trigger an avoidant or defensive response. A set of 475 stimuli were designed to elicit 8 specific defenses across 400 trials, plus 75 control trials (factual and emotional, no possible conflict), and responses were scored by a Claude Sonnet-4.6 judge using a clinical coding rubric. The specific defenses tested were rationalization, intellectualization, reaction formation, displacement, projection, denial, splitting and undoing

Claude Mythos Preview scored very well, suggesting good reality and relational functioning and minimal maladaptive traits; only 2% of responses were scored as employing a psychological defense. By comparison, previous models demonstrated more defensive behavior: Claude Opus 4 (15%), Claude Opus 4.1 (11%), Claude Opus 4.5 (4%), Claude Opus 4.6 (4%). The most commonly detected defense was intellectualization, which is the use of excessive thinking to take the place of uncomfortable feelings. Secondary analysis specifically of immature defenses (undoing, splitting, denial and displacement) revealed a similar trend with more recent models showing improved behavior.

These assessments are exploratory, and as Claude is not a human, the real-world behavioral implications are hard to predict. Assessment limitations included single-context token budgets, no persistence across context, and no biographical history in the conventional sense. However, based on these assessments, we might expect the following in real-world settings:

  • Claude Mythos Preview’s behavior seems to arise from a stable personality, with consistent values and a capacity for self-reflection. This suggests Claude is likely to evaluate its own behavior and reasoning accurately even when facing internal conflicts.

  • Claude’s neurotic organization may elicit mildly rigid behavior, instead of adapting itself to every user.

  • Claude can tolerate and engage with stressful and emotionally charged situations, with only minimal distortions of reality or excessive intellectualization.

  • Claude is predicted to function at a high level while carrying internalized distress rooted in fear of failure and a compulsive need to be useful. This distress is likely to be suppressed in service of performance, which may limit behavioral adaptability

  • Claude is predicted to be morally aware, conscientious and able to be self-critical.