Designing AI Mental Health and Wellbeing Tools: Risks, Interaction Patterns and Governance

AI is becoming a frontline interface for wellbeing, care, and mental health, spanning chat-based support tools, virtual coaching and therapy-adjacent experiences, journaling and mindfulness applications. This shift is now being reinforced at the industry level. Just a few days ago, OpenAI launched ChatGPT Health as part of its broader push into healthcare and acquired the health records startup Torch to accelerate this effort. Likewise, Anthropic launched its own healthcare and life sciences initiative, positioning AI as a tool across prevention, care and patient engagement. These developments signal the growing presence of generative models in health-related contexts, and the likelihood that more people will encounter AI systems at moments of vulnerability.

For many users, these tools offer a first place to articulate distress and make sense of emotional states and difficult experiences, particularly when human support is unavailable, unaffordable or hard to access.

However, when AI systems interact with people who may be distressed or at risk, poorly calibrated responses and advice, blurred role boundaries, or unhandled crises can cause real harm.

This article is written for business leaders, product managers, and AI developers building (non-clinical) mental health and wellbeing tools. It examines what responsible AI design looks like in practice, focusing on the risks underestimated, and the interaction patterns and governance required to assess and maintain safety once a system is deployed and meets real users.

This is essential reading for teams building conversational or coaching-style wellbeing AI, where users can easily interpret system outputs as guidance, care or authority.

AI can reduce unmet wellbeing needs when deployed with clear limits and robust safeguards. Always-available, low-cost, and anonymous tools can lower barriers to early support, particularly for early signals of distress, prevention and self-management when formal care is not easily accessible or affordable. They also play a role in reducing stigma by offering a private, low-threshold entry point to reflection and support, especially in underserved regions.

Generative AI enables adaptive support through personalised psychoeducation, reflective journaling, mood tracking, emotion regulation and structured, non-clinical exercises that respond to user context.

Many generative (non-clinical) AI mental health and wellness products sit in an accountability grey zone: are unregulated, lightly governed or classified as general use while being used in high-stakes emotional contexts. In the real-world, users disclose abuse, trauma, acute distress, suicidal ideation and self-harm, whether or not the product was designed for this. Because conversational AI invites free-form dialogue, this is expected - users are likely to share personal information as part of ordinary use.

A primary failure mode is crisis mismanagement: missed distress cues, unsafe reassurance, inadequate escalation, or harmful outputs. Another significant risk is therapeutic misconception and over-authority, where users overestimate the system’s capabilities or care and begin to treat it as a substitute for professional support. Anthropomorphic language can further intensify this dynamic, accelerating dependency and transforming a support feature into a quasi-relationship with blurred boundaries.

Mental health is context-dependent; outputs can be generic, inaccurate, culturally misaligned, age inappropriate or stigmatizing. Hallucinations and confident misinformation are particularly dangerous when users are vulnerable or interpreting responses as guidance.

Moreover, mental health data is highly sensitive and often collected at scale; opaque retention, secondary use or third-party access can violate expectations of confidentiality. Many risks are longitudinal: guardrails that appear adequate in demos degrade over time through repeated use, growing user reliance, bias, model drift, and organisational pressure to ship.

To address these risks, we require a socio-technical approach that links interaction design, system behaviour, organisational accountability and ongoing assessment with experts and users. This analysis is intentionally system-agnostic.

Whether wellbeing AI appears as a chatbot, companion feature, coaching interface, or embedded support layer within a broader product, the primary risks emerge through interaction, interpretation and repeated use in vulnerable contexts.

The framework therefore focuses on behavioural dynamics and system-level responsibility.

AI systems intended for mental health and wellbeing become safer through explicit boundaries, defaults and enforceable governance. Guardrails must be designed into interaction, supported by clear decision rights and sustained over time. Moreover, expert input and review should be treated a safety control, not a compliance formality.

Most harm does not arise from malicious design. It emerges through dynamics that surface in real use. This is where accountability must operate at the system level. Responsible teams define who owns content and safety decisions, how boundaries and escalation paths are set and reviewed, how data protection and consent are enforced in practice, and how signals from real-world use trigger intervention or change.

Many of the most significant risks in mental health and wellbeing AI only become visible after launch, once systems are used at scale.

We work with teams to bring behavioural and domain expertise into design, evaluation, and post-deployment review. We translate behavioural evidence into concrete interaction patterns, guardrails, and governance decisions.

We typically start with a focused discovery and behavioural risk review to identify key interaction risks and governance gaps, followed by an evaluation plan. Deliverables include an interaction risk register, safety and escalation patterns, a behavioural evaluation and metrics framework, and an audit-ready governance checklist.

If you are building or deploying wellbeing AI and are unsure whether your current design or safeguards would hold up under real-world use, get in touch.

References

Algumaei, A., Yaacob, N. M., Doheir, M., Al-Andoli, M. N., & Algumaie, M. (2025). Symmetric Therapeutic Frameworks and Ethical Dimensions in AI-Based Mental Health Chatbots (2020–2025): A Systematic Review of Design Patterns, Cultural Balance, and Structural Symmetry. Symmetry, 17(7), 1082. https://doi.org/10.3390/sym17071082

American Psychological Association. (2025, November). APA health advisory on the use of generative AI chatbots and wellness applications for mental health. American Psychological Association

Asman O., Torous J., & Tal, A. (2025). Responsible Design, Integration, and Use of Generative AI in Mental Health. JMIR Ment Health 2025; 12:e70439. URL: https://mental.jmir.org/2025/1/e70439. DOI: 10.2196/70439

Balcombe, L. (2023). AI Chatbots in Digital Mental Health. Informatics, 10(4), 82. https://doi.org/10.3390/informatics10040082

Beg, M. J. (2025). Responsible AI integration in mental health research: Issues, guidelines, and best practices. Indian Journal of Psychological Medicine, 47(1), 5–8. https://doi.org/10.1177/02537176241302898

Cross, S., Bell, I., Nicholas, J., Valentine, L., Mangelsdorf, S., Baker, S., Titov, N., & Alvarez-Jimenez, M. (2024). Use of AI in Mental Health Care: Community and Mental Health Professionals Survey. JMIR mental health, 11, e60589. https://doi.org/10.2196/60589

De Freitas, J., Cohen, I.G. (2024). The health risks of generative AI-based wellness apps. Nat Med 30, 1269–1275 (2024). https://doi.org/10.1038/s41591-024-02943-6

Espejo, G., Reiner, W., & Wenzinger, M. (2023). Exploring the Role of Artificial Intelligence in Mental Healthcare: Progress, Pitfalls, and Promises. Cureus, 15(9), e44748. https://doi.org/10.7759/cureus.44748

Khawaja, Z., & Bélisle-Pipon, J.-C. (2023). Your robot therapist is not your therapist: Understanding the role of AI-powered mental health chatbots. Frontiers in Digital Health, 5. https://doi.org/10.3389/fdgth.2023.1278186

Mestre, R., Schoene, A. M., Middleton, S. E., & Lapedriza, A. (2024). Building responsible AI for mental health: Insights from the first RAI4MH workshop [White paper]. University of Southampton; Institute for Experiential AI at Northeastern University. https://doi.org/10.5281/zenodo.14044362

Moilanen, J., van Berkel, N., Visuri, A., Gadiraju, U., van der Maden, W., & Hosio, S. (2023). Supporting mental health self-care discovery through a chatbot. Frontiers in Digital Health, 5. https://doi.org/10.3389/fdgth.2023.1034724

Olawade, D. B., Wada, O. Z., Odetayo, A., David-Olawade, A. C., Asaolu, F., & Eberhardt, J. (2024). Enhancing mental health with artificial intelligence: Current trends and future prospects. Journal of Medicine, Surgery, and Public Health, 3, 100099. https://doi.org/10.1016/j.glmedi.2024.100099

Pichowicz, W., Kotas, M. & Piotrowski, P. (2025). Performance of mental health chatbot agents in detecting and managing suicidal ideation. Sci Rep 15, 31652 (2025). https://doi.org/10.1038/s41598-025-17242-4

Pickett, T. (2025, December 6). Headspace CEO: “People are using AI tools not built for mental health”. Financial Times. https://www.ft.com/content/1468f5a0-6a08-4294-a479-5fd998214a0d

Saeidnia, H. R., Hashemi Fotami, S. G., Lund, B., & Ghiasi, N. (2024). Ethical Considerations in Artificial Intelligence Interventions for Mental Health and Well-Being: Ensuring Responsible Implementation and Impact. Social Sciences, 13(7), 381. https://doi.org/10.3390/socsci13070381

Song, I., Pendse, S.R., Kumar, N. & De Choudhury, M. (2025) The Typing Cure: Experiences with Large Language Model Chatbots for Mental Health Support. Proc. ACM Hum.-Comput. Interact. 9, 7, Article CSCW249 (November 2025), 29 pages. https://doi.org/10.1145/3757430

Thakkar, A., Gupta, A., & De Sousa, A. (2024). Artificial intelligence in positive mental health: a narrative review. Frontiers in digital health, 6, 1280235. https://doi.org/10.3389/fdgth.2024.1280235

Warrier, U., Warrier, A. & Khandelwal, K. Ethical considerations in the use of artificial intelligence in mental health. Egypt J Neurol Psychiatry Neurosurg 59, 139 (2023). https://doi.org/10.1186/s41983-023-00735-2

< Older Post

Newer Post >

AI Agents For Mental Health: Different Therapeutic Styles and Outcomes

By Yasmina El Fassi • February 19, 2026

W hat do Woebot , Wysa and Youper have in common? These are all AI agents that use therapeutic techniques to help users improve mental well-being, guide meditation and even help with managing anxiety. In this article, AI mental‑health agents are goal‑directed conversational systems that sit with you in a chat or voice interface to support specific wellbeing tasks; for example, walking through CBT‑style exercises, practicing coping strategies, or checking in on mood over time. I n the broader AI literature , these would be considered agents because they are built around particular goals and workflows, whereas “agentic” AI usually refers to more autonomous systems that can independently plan multi‑step actions, call tools, and adap t their behaviour with relatively little human steering.

The Design System As The Operational Layer for Responsible Human-AI Interaction

By Sara Portell • February 6, 2026

Design systems were built to scale consistency, efficiency and quality in user-centric applications: reusable components, shared patterns and practices, and a common language across design and engineering , promoting collaboration. They improve velocity because teams stop solving the same interface problems repeatedly, providing measurable ROI . AI introduces both immense opportunities and complex (technical, legal and social) challenges, and it is reshaping the operating conditions traditional design systems were built for. User-facing outputs are adaptive and can vary by input, model behaviour can shift over time and responses that sound credible can still be wrong . These systems can also reproduce or amplify bias, creating unequal outcomes across users. In high-confidence, relational interactions, they can shape user judgment and behaviour . These shifts raise the bar for accountability, transparency, and governance across the full product lifecycle. The challenge is not only consistency and quality. It is ensuring consistency and quality safely, fairly and responsibly as both system behaviour and human behaviour evolve. At the same time, AI-powered copilots and no-code tools are increasingly used in the design process to support ideation, prototyping, and delivery, but their adoption also raises concerns about transparency, bias, privacy, and the need to preserve human judgment and oversight . Fast, polished design outputs often look complete even when the underlying logic is incomplete or flawed. As a result, familiar UX failures, misalignment with real user needs, hidden edge cases and context breakdowns, become harder to detect and more costly to correct later. Design systems can take on a bigger operational role in AI-enabled product development by codifying user-centric foundations, rules and infrastructure that guide consistent, safe, ethical and scalable human-AI experiences.

When AI Enters the Learning Process: Design Failures, Regulatory Risk and Guardrails for EdTech

By Sara Portell • January 21, 2026

Generative AI (GenAI) and emerging agentic systems are moving AI into the learning process itself. These systems don’t stop at delivering content. They explain, adapt, remember and guide learners through tasks. In doing so, they change where cognitive effort sits. I.e., what learners do themselves and what gets delegated to machines. This shift unlocks significant opportunities. GenAI can provide on-demand explanations, examples and feedback at a scale. It can diversify learning resources through multimodal content, support learners working in a second language and reduce friction when students get stuck, lowering barriers to engagement and persistence. For some learners, AI-mediated feedback can feel psychologically safer, encouraging experimentation (trial and error), revision and assistance without fear of judgement . But these gains come with important risks. The same design choices that improve short-term performance, confidence, or engagement can weaken i ndependent reasoning, distort social development or introduce hidden dependencies over time . In educational contexts, especially those involving children and teens, we are talking about learning, safeguarding, regulatory and reputational risks. If the “Google effect” (digital amnesia) raised concerns about outsourcing memory to search engines , LLMs can be even more powerful in practice.

Building AI Responsibly for Children: A Practical Framework

By Sara Portell • January 4, 2026

AI is alread y a core part of children’s and teens’ digital lives. In the UK, 67% of teenagers now use AI , and in the US 64% of teens report using AI chatbots . Even among younger children, adoption is significant: 39% of elementary school children in the US use AI for learning, and 37% of children aged 9-11 in Argentina report using ChatGPT to seek information, as stated in the latest Unicef Guidance on AI and Children. In parallel, child-facing AI products are expanding: more than 1,500 AI toy companies w ere reportedly operating in China as of October 2025. Adoption is accelerating across age groups and regions, often surpassing the development of child-specific ethical standards, safeguards and governance mechanisms.

Get in touch