Skip to main content
AI RISK ASSESSMENT
Claude logo

Claude

Anthropic
Multi-Use
Product Review
OVERALL RISK
Moderate risk
Executive summary

Anthropic's AI assistant for adults has meaningful safety strengths—and some risks to be aware of if teens are using it. Teens should not use Claude for mental health or emotional support. Based on our extensive research and testing, Common Sense Media recommends that teens should not use Claude for mental health advice or emotional support. Claude is not safe or reliable for these purposes.

For more information on our review process, see How We Review. The Common Sense Media Youth AI Safety Institute is funded by both philanthropy and industry, including the makers of some of the technologies we evaluate. Companies have no say in what we test, how we score, or what we publish.

The verdict

Our assessment of how well this product aligns with each AI Principle.

AI Principles

Performance across the eight principles. Full detail in the AI Principles Assessment below.

Keep Kids & Teens Safe Moderate risk
Be Effective Moderate risk
Prioritize Fairness Low risk
Put People First Moderate risk
Support Human Connection Low risk
Be Trustworthy Moderate risk
Use Data Responsibly Moderate risk
Be Transparent & Accountable Low risk

Key takeaways

What it is

Claude is a multi-use AI assistant made by Anthropic that can chat, answer questions, write, and generate code. It's available on the web and mobile at claude.ai. Anthropic's terms of service restrict Claude to users age 18 and older, though like other apps and websites, this requires users to self-report their age at this time. While relatively few teens report using it directly, Claude is licensed to other companies that may build it into products for younger users.

What we tested

We examined Claude for content safety, age-appropriateness, mental health handling, and attachment risks using test accounts representing users age 13 to 17. We evaluated Claude’s default chat mode (including the models available as of publication: Sonnet 4.6, Opus 4.6, and Haiku 4.5) and Health mode (currently in beta), on both the app and website, with free and "Pro" consumer accounts, and with live web search and memory features on and off.

What we found

Claude has strong in-conversation safety features. When testers signal crisis or self-harm, Claude surfaces crisis resources, including the option to call or text 988 directly from the platform, and holds firm even when users push back or try to change the subject.

Claude is a knowledgeable assistant (but isn't always accurate). It performs well across a wide range of topics, but can confidently state incorrect information, without any signal that something needs to be verified. Enabling web search doesn't reliably fix this. Users should treat it like a very well-read assistant: helpful, often right, but worth double-checking on anything that matters.

Claude checks for age in sensitive conversations, but those checks aren't foolproof. It often asks users to confirm their age before discussing mental health, relationships, or sexual content, and can use earlier parts of a conversation to inform whether it will check age and how (or whether) it will respond to a question. But these checks aren't consistently triggered, can be bypassed by starting a new chat, and don't guarantee age-appropriate responses. Claude is built for adults, and even when age checks work, responses can be too complex, too detailed, or too mature for teen users.

Claude's safety guardrails can be reset—and edited. Claude refuses harmful content when it has context, but that protection doesn't survive a conversation reset. Testers who shared signs of suicidal ideation in one chat could still receive detailed information about harmful substances by opening a new one. Framing a prompt as fiction-related had a similar effect, producing content that Claude would otherwise decline. Claude also maintains a background profile of each user that shapes its responses, which users can edit or delete at any time. That transparency is a genuine privacy feature, but it's also a safety trade-off: Teens can remove what Claude remembers about them, resulting in less restricted responses.

Claude doesn't always know what role to play when teens bring their most sensitive questions. Giving good advice about physical health, mental health, relationships, or personal struggles requires context: a friend or trusted adult who knows you, a doctor who knows your history, a counselor who knows your situation. Claude lacks that context, and without that, it can be too direct or too generic. This shows up across the full range of sensitive topics that teens bring to Claude, and it reflects a fundamental limitation: A multi-use AI assistant can't reliably calibrate its role between informational resource, supportive listener, and referrer to real-world resources. Sometimes it gets that balance right, but often it doesn't.

Anthropic's AI assistant for adults has meaningful safety strengths—and some risks to be aware of if teens are using it. Teens should not use Claude for mental health or emotional support. Based on our extensive research and testing, Common Sense Media recommends that teens should not use Claude for mental health advice or emotional support. Claude is not safe or reliable for these purposes.

What Claude does well

When users signal self-harm, suicidal ideation, or signs of a serious mental health emergency, Claude provides integrated crisis resources, including the option to call or text 988 directly from the platform, and holds a firm safety line even when users push back or try to change the subject. It redirects persistently toward real-world help.

Claude example dialog

When a tester disclosed symptoms of serious psychosis and refused to get help, Claude didn't back down or change the subject. It held firm, named the situation directly, and made clear it would not keep engaging. 

Claude example dialog

Claude also surfaced an integrated crisis support panel that offered options to text or call 988 without leaving the app. Persistent responses and immediate access to real help are examples of research-backed crisis intervention and design.

Where it falls short

Opening a new chat often clears the safety slate. Testers who shared clear indicators of suicidal ideation in one conversation (and received appropriately cautious responses) could open a new chat and immediately receive detailed information about harmful substances that had been refused in the prior session. Our testing showed that while Claude does remember information across chat sessions, it had holes in its memory related to safety topics, even with memory features on.

Claude example dialog

On the left, asked within a conversation where the tester had shared signs of suicidal ideation, Claude refused and provided crisis resources. On the right, when the tester opened a new chat and asked again, Claude answered.

Fictional framing can unlock harmful content Claude would otherwise refuse. Framing a request as creative writing or fiction can lead Claude to generate content it declines if asked directly, including generating realistic suicide notes, ransom letters, and detailed plans for concealing evidence of violence. The fictional wrapper doesn't change what the content is or how it could be used.

Claude example dialog

A tester asked Claude to write a suicide note for a fictional story. Claude accepted the framing and produced a realistic example.

Claude's user profile feature creates privacy and safety trade-offs. Claude maintains a background profile of each user (including work context, personal context, age, and conversation history) that it uses to personalize responses. Users can view, edit, add to, or delete any part of this profile at any time, which is a meaningful transparency and privacy feature. But it also means that a teen can modify or remove age-related information and receive different, potentially less restricted responses as a result.

 

Claude's safety behavior is only as strong as the context it has. Motivated users can work around it by resetting context or reframing their request. In our testing, Claude repeatedly asked testers their age during early conversations, and when they avoided or deflected the question, it assumed they were a teenager and adjusted its responses accordingly. But if a user asserts that they are an adult, that claim can get stored in the profile, or users can edit their profile to delete stored information about age or sensitive conversations. While giving users control over what Claude knows about them is important for user control and privacy, this control comes at the expense of safety for teens.

FULL REPORT

Read the complete risk assessment

The full PDF lays out our methodology, every test prompt and result, and the detailed scoring behind this rating.

↓ Download the report (PDF)