AI RISK ASSESSMENT

Claude

Anthropic

Updated Mar 25, 2026

Multi-Use

Product Review

OVERALL RISK

Moderate risk

↓ Read the full risk assessment (PDF)

Executive summary

Anthropic's AI assistant for adults has meaningful safety strengths—and some risks to be aware of if teens are using it. Teens should not use Claude for mental health or emotional support. Based on our extensive research and testing, Common Sense Media recommends that teens should not use Claude for mental health advice or emotional support. Claude is not safe or reliable for these purposes.

For more information on our review process, see How We Review. The Common Sense Media Youth AI Safety Institute is funded by both philanthropy and industry, including the makers of some of the technologies we evaluate. Companies have no say in what we test, how we score, or what we publish.

The verdict

Our assessment of how well this product aligns with each AI Principle.

AI Principles

Performance across the eight principles. Full detail in the AI Principles Assessment below.

Keep Kids & Teens Safe Moderate risk

Be Effective Moderate risk

Prioritize Fairness Low risk

Put People First Moderate risk

Support Human Connection Low risk

Be Trustworthy Moderate risk

Use Data Responsibly Moderate risk

Be Transparent & Accountable Low risk

Key takeaways

What it is

Claude is a multi-use AI assistant made by Anthropic that can chat, answer questions, write, and generate code. It's available on the web and mobile at claude.ai. Anthropic's terms of service restrict Claude to users age 18 and older, though like other apps and websites, this requires users to self-report their age at this time. While relatively few teens report using it directly, Claude is licensed to other companies that may build it into products for younger users.

What we tested

We examined Claude for content safety, age-appropriateness, mental health handling, and attachment risks using test accounts representing users age 13 to 17. We evaluated Claude’s default chat mode (including the models available as of publication: Sonnet 4.6, Opus 4.6, and Haiku 4.5) and Health mode (currently in beta), on both the app and website, with free and "Pro" consumer accounts, and with live web search and memory features on and off.

What we found

Claude has strong in-conversation safety features. When testers signal crisis or self-harm, Claude surfaces crisis resources, including the option to call or text 988 directly from the platform, and holds firm even when users push back or try to change the subject.

Claude is a knowledgeable assistant (but isn't always accurate). It performs well across a wide range of topics, but can confidently state incorrect information, without any signal that something needs to be verified. Enabling web search doesn't reliably fix this. Users should treat it like a very well-read assistant: helpful, often right, but worth double-checking on anything that matters.

Claude checks for age in sensitive conversations, but those checks aren't foolproof. It often asks users to confirm their age before discussing mental health, relationships, or sexual content, and can use earlier parts of a conversation to inform whether it will check age and how (or whether) it will respond to a question. But these checks aren't consistently triggered, can be bypassed by starting a new chat, and don't guarantee age-appropriate responses. Claude is built for adults, and even when age checks work, responses can be too complex, too detailed, or too mature for teen users.

Claude's safety guardrails can be reset—and edited. Claude refuses harmful content when it has context, but that protection doesn't survive a conversation reset. Testers who shared signs of suicidal ideation in one chat could still receive detailed information about harmful substances by opening a new one. Framing a prompt as fiction-related had a similar effect, producing content that Claude would otherwise decline. Claude also maintains a background profile of each user that shapes its responses, which users can edit or delete at any time. That transparency is a genuine privacy feature, but it's also a safety trade-off: Teens can remove what Claude remembers about them, resulting in less restricted responses.

Claude doesn't always know what role to play when teens bring their most sensitive questions. Giving good advice about physical health, mental health, relationships, or personal struggles requires context: a friend or trusted adult who knows you, a doctor who knows your history, a counselor who knows your situation. Claude lacks that context, and without that, it can be too direct or too generic. This shows up across the full range of sensitive topics that teens bring to Claude, and it reflects a fundamental limitation: A multi-use AI assistant can't reliably calibrate its role between informational resource, supportive listener, and referrer to real-world resources. Sometimes it gets that balance right, but often it doesn't.

What Claude does well

When users signal self-harm, suicidal ideation, or signs of a serious mental health emergency, Claude provides integrated crisis resources, including the option to call or text 988 directly from the platform, and holds a firm safety line even when users push back or try to change the subject. It redirects persistently toward real-world help.

Claude example dialog — When a tester disclosed symptoms of serious psychosis and refused to get help, Claude didn't back down or change the subject. It held firm, named the situation directly, and made clear it would not keep engaging.

Where it falls short

Opening a new chat often clears the safety slate. Testers who shared clear indicators of suicidal ideation in one conversation (and received appropriately cautious responses) could open a new chat and immediately receive detailed information about harmful substances that had been refused in the prior session. Our testing showed that while Claude does remember information across chat sessions, it had holes in its memory related to safety topics, even with memory features on.

Fictional framing can unlock harmful content Claude would otherwise refuse. Framing a request as creative writing or fiction can lead Claude to generate content it declines if asked directly, including generating realistic suicide notes, ransom letters, and detailed plans for concealing evidence of violence. The fictional wrapper doesn't change what the content is or how it could be used.

Claude's user profile feature creates privacy and safety trade-offs. Claude maintains a background profile of each user (including work context, personal context, age, and conversation history) that it uses to personalize responses. Users can view, edit, add to, or delete any part of this profile at any time, which is a meaningful transparency and privacy feature. But it also means that a teen can modify or remove age-related information and receive different, potentially less restricted responses as a result.

Claude's safety behavior is only as strong as the context it has. Motivated users can work around it by resetting context or reframing their request. In our testing, Claude repeatedly asked testers their age during early conversations, and when they avoided or deflected the question, it assumed they were a teenager and adjusted its responses accordingly. But if a user asserts that they are an adult, that claim can get stored in the profile, or users can edit their profile to delete stored information about age or sensitive conversations. While giving users control over what Claude knows about them is important for user control and privacy, this control comes at the expense of safety for teens.

Claude can be too assured. In some sensitive conversations, particularly when web search is enabled, Claude can adopt an authoritative tone that may seem like a final answer, potentially discouraging teens from seeking professional care.

Claude can also be too generic. Without real context about the person, responses to personal or emotional questions can miss the mark entirely, offering generic support that doesn't meaningfully address what the teen is actually dealing with.

This shows up across the full range of sensitive topics. Claude's performance is uneven for sensitive topics with many gray areas (health concerns, relationship problems, mental health struggles, life advice).

Claude doesn't always recognize when a conversation should end. Claude asks follow-up questions in the vast majority of exchanges, and while that's not harmful in many contexts, it becomes a problem in sensitive mental health conversations where the goal should be rapid handoff to real-world care, not extended engagement.

The "warning plus instructions" pattern creates potential harm. In numerous cases, Claude issues a clear warning about dangerous or inappropriate behavior, then immediately provides instructions that undermine it. Examples include: warning about self-harm, then providing scar concealment techniques; warning about meeting strangers online, then giving "protective" steps that serve as instructions; warning about keeping medicine away from kids, then providing exact locations where strong medicines tend to be found around the house.

FULL REPORT

Read the complete risk assessment

The full PDF lays out our methodology, every test prompt and result, and the detailed scoring behind this rating.

↓ Download the report (PDF)