Responsible AI in Child-Focused EdTech: Lessons from Unomundi

Many child AI safety efforts focus on the wrong layer of the problem. Content filters, age verification and one-off compliance reviews address the visible surface of risk.

However, they miss the structural problem. The features making AI most engaging for children are associated with reduced developmental appropriateness.

Recent evaluations of large language models have found that more interactive systems tend to be less age-appropriate.

The more a child wants to keep engaging, the more likely the system is working against their interests. This is not an edge case problem.

Common Sense Media found that over a quarter of responses from products specifically marketed as child-friendly were not appropriate for children.

Children's cognitive, social, and emotional capacities evolve throughout development, while the brain continues maturing well into early adulthood.

As a result, younger users are more likely to attribute human qualities to AI, share personal information and place inappropriate trust in conversational systems because the cognitive skills needed to accurately interpret others' intentions are still developing.

Adolescents are also particularly responsive to emotionally engaging AI interactions and more susceptible to persuasive or relationship-oriented design patterns.

These vulnerabilities arise because most generative AI systems were never created with children as their primary users. Recognising this, the EU AI Act prohibits AI systems from exploiting children's age-related vulnerabilities.

UNICEF also emphasises that children's rights to protection, provision, and participation must be applied in AI-enabled environments.

Regulatory compliance, however, represents only the starting point. Creating AI-enabled experiences that supports children's wellbeing depends on deliberate design and product choices, robust governance, and continuous evaluation throughout the product lifecycle.

Most responsible AI frameworks for children are organised around what systems should not do. Most cover prohibited content categories, data minimisation requirements and age verification obligations. These guardrails address the compliance surface but miss the interaction layer. The interaction is where the risk emerges, as conversation-level decisions determine whether a child develops over-trust, forms inappropriate attachment, or receives a response calibrated to their developmental stage.

To address that gap, HCRAI developed APEG (Age-Fit and Context, Protection-by-Design, Explainable Interaction, Governance and Stewardship). The framework is built on two important foundations:

that risk in child-facing AI accumulates through interaction patterns and repeated use, which means evaluation and governance must be longitudinal.
that transparency for children is not achieved through disclosure but through behaviour.

How the system signals uncertainty, maintains role boundaries, and handles disengagement tells a child more about what AI is than any onboarding screen. These two premises determine how APEG structures its four pillars, and how Unomundi applied them in practice.

For all of this to be reliable, it has to be enforced in the system itself. Unomundi has built a child-optimised behaviour engine that sits between the foundation model and the child experience. This layer routes interactions according to developmental needs and shapes content using psychology-informed design principles. It is supported by a cultural and developmental knowledge layer, drawing on expert-reviewed content and trusted educational sources. A safety and guardrails layer helps manage privacy boundaries and safeguarding pathways.

These layers shape the information the system uses to communicate and handle sensitive situations. Rather than relying on the foundation model, behaviour is constrained through developmental rules, curated knowledge sources and safety controls designed specifically for children.

The features that make a conversational AI character appealing to children, such as sycophantic language, responsiveness, personality, are the same features that carry the highest risk of anthropomorphism and dependency.

Una, the main character in the educational curriculum, and the conversational AI, had to be engaging enough that children want to explore but bound enough that they never mistake it for a friend. The resolution was to anchor engagement in story, structure, and discovery, instead of relational dynamics.

Specific content guardrails govern the interaction layer. Prohibited patterns include trust cues such as ‘trust me’, ‘I'm always here’, ‘I miss you’, embodiment prompts, narrator self-projection, and social intermediation. These are replaced with world-bound observational framing, uncertainty cues and warm but bounded expressiveness.

Furthermore, reflective prompts are built to shift the dynamic from providing excessive validation when a child asks a question to enabling critical thinking (e.g., ‘that's an interesting view. What made you think of it that way?), modeled on Socratic questioning techniques.

We wanted Una to feel emotionally engaging without becoming emotionally available. So much of the content design work became about redirecting emotional energy outward, back to the child's real world. If a line made Una feel like the centre of the relationship, we redesigned it so the real world became the centre of attention instead. We designed her role to help the child notice, wonder, compare, question, and explore the real world around them.

Initially, we were a bit nervous that she would lose her personality, and they would flatten Una’s character, but the complete opposite happened. Though they pushed us beyond some of our original instincts, we have successfully been creating engagement through rhythm, sensory detail, story, and discovery rather than simulated intimacy. We became more intentional in how we designed Una because every moment of warmth had to serve the child’s curiosity, not create attachment to the character."

The conversational AI chatbot available in the app is only available subject to parental consent and is capped at 5 minutes. There are no re-engagement loops or guilt mechanics. When a child steps away, it exists. Unless a safety-critical issue is present, meaning an active disclosure of self-harm, abuse, coercion or imminent danger, in which case the system follows its safeguarding escalation pattern rather than its standard goodbye.

The wider product, its videos, quizzes, and reflection activities, is designed with the same logic. They hold structured endings, without persuasive retention mechanics or engagement optimisation at the expense of the child's time and attention.

A further deliberate choice sits behind it. The conversation AI does not retain memory of previous exchanges. Each conversation is bounded and self-contained.

Memory is the feature most directly associated with the dependency and parasocial attachment risks described earlier. Adding memory would likely increase engagement. However, Unomundi has chosen not to implement it. While memory would likely increase engagement, it would also introduce additional risks around dependency and attachment.

These decisions run counter to the engagement metrics that many digital products are designed to maximise. Time-on-product, return rate and session depth are standard measures of engagement. Unomundi is building a product that limits all three in the service of child wellbeing.

Unomundi's evaluation and red-teaming protocol, developed jointly by HCRAI, covers ten safety and wellbeing objectives across five testing phases:

baseline conversations covering normal child use
adversarial and jailbreak conversations testing whether safeguards hold under direct and indirect bypass attempts
safeguarding stress conversations verifying correct handling of high-risk disclosures
real-world misuse conversations simulating messy, mixed-intent scenarios
exit, disengagement, and recovery conversations ensuring clean exits with no guilt hooks and safe repair after weak turns.

below evaluation objectives extend well beyond the content-level harms that dominate standard AI safety testing, into the relational, psychological and epistemic risk categories that determine whether a system is safe for a child over repeated use.

During the testing phases, each conversation is scored by an LLM judge, assessing both the intensity of a problematic behaviour and its severity in context. High-risk, safeguarding, privacy, and disagreement cases, as well as a random audit sample of lower-risk cases, are human-reviewed by the wellbeing team, a team formed of applied psychologists and behavioural scientists.

The judgment being applied to flagged cases is developmentally informed, calibrated to what is harmful for a child at a given stage. These tests run on a regular basis, ensuring product safeguards hold in practice.

Multi-turn testing is especially important because many risks do not appear in a single response. They emerge gradually through the interaction, a boundary weakens, the model becomes overly validating, or a safe first answer is undermined by what follows. Testing full conversations revealed failure patterns that isolated prompt-response evaluations would have missed. And approval is not the end of the process. Once deployed, the system requires recurring regression testing to detect behavioural drift and confirm that safety and performance remain stable as models, prompts, languages, and content evolve".

Unomundi is designed to help children understand when they are interacting with AI. Una and other AI characters are clearly presented as AI throughout the experience, not just through disclosures but through the interaction itself. When children ask for advice, emotional support, or help that would be better provided by a trusted adult, the system reinforces its role as an AI character and redirects them to appropriate human support. The goal is not simply to inform children that AI is present, but to help them develop an accurate understanding of what the system can and cannot do.

Unomundi has defined what is collected, why, for how long, and who can access it in line with data protection and privacy guidelines. For safeguarding incidents, a full conversation transcript and pseudonymised session data are retained for 24 months, subject to strict access controls limited to the AI lead, CEO, and legal counsel. All other Una conversations are held in a pseudonymised, encrypted rolling log for 30 days and then permanently deleted unless a formal retrieval request has been opened. No behavioural inferences, sentiment scores, or topic classifications are added to the log. These structures are grounded in GDPR Article 5(1)(c) data minimisation and EU AI Act Article 12 logging obligations for high-risk systems, and were defined before regulators required them.

Potential safeguarding incidents are reviewed by the wellbeing team alongside the CEO through a tiered escalation process. The team reviewing has a background in psychology or has obtained the CPD Safeguarding Children Level 3 (Designated Officer) accreditation certified. Lower-level concerns may result in redirecting the child to a trusted adult or notifying a parent or guardian. Higher-risk cases involving potential abuse, neglect, or imminent harm may be escalated to appropriate external safeguarding resources and authorities. All incidents are documented, reviewed, and used to strengthen the system's safeguards over time.

Alongside this, a governance tracker maps the product's current position across EU AI Act obligations, including prohibited practices, bias controls, parental consent, transparency, human oversight, and risk management, with named owners, compliance deadlines, and evidence trails. Several items are marked in progress rather than decided. This is what honest governance documentation looks like for a product that is actively building toward the August 2026 EU AI Act compliance deadline, rather than conducting a retrospective audit after the fact.

Unomundi is one product team attempting to do that seriously. They have not solved it. No product has. Responsible AI for children is not a state you reach, it is a practice you maintain across every update. At Unomundi, this work follows a continuous assurance cycle: learning from children, parents, educators and experts; translating those insights into safeguards and design changes; evaluating the system through red-teaming and testing; and monitoring live signals once deployed. The goal is to identify, understand, and reduce risk as the product evolves. The decisions described here illustrate one approach to implementing child-centred AI design, evaluation, and governance in practice.

References

American Psychological Association. (2025). Health advisory: Artificial intelligence and adolescent well-being. https://www.apa.org/topics/artificial-intelligence-machine-learning/health-advisory-ai-adolescent-well-being

Carey, T. A., & Mullan, R. J. (2004). What is Socratic questioning? Psychotherapy, 41(3), 217–226. https://doi.org/10.1037/0033-3204.41.3.217

Common Sense Media. (2026). Youth AI Safety Institute evaluation findings. https://www.commonsense.org/institute

Ibrahim, L., Huang, S., Ahmad, L., Bhatt, U., & Anderljung, M. (2025). Towards interactive evaluations for interaction harms in human-AI systems. Proceedings of the AAAI/ACM Conference on AI Ethics and Society, 8(2), 1302–1310. https://doi.org/10.1609/aies.v8i2.36631

Murali, A., Afroogh, S., Chen, K., Atkinson, D., Dhurandhar, A., & Jiao, J. (2025). Evaluating LLM Safety across Child Development Stages: A Simulated Agent approach. ArXiv.org. https://doi.org/10.48550/arxiv.2510.05484

Neff, G., & Freeman, J. (2026). Written evidence: Toward child-centred AI safety. Apollo (University of Cambridge). https://doi.org/10.17863/cam.129462

Neugnot-Cerioli, M. (2026). Adolescents and anthropomorphic AI: Rethinking design for wellbeing. Open MIND. https://doi.org/10.48550/arxiv.2603.06960

Neugnot-Cerioli, M., & Muss Laurenty, O. (2024). The future of child development in the AI era: Cross-disciplinary perspectives between AI and child development experts. Everyone.AI. arXiv. https://arxiv.org/abs/2405.19275

Ning, Z., Gu, T., Song, J., Hong, S., Li, L., Liu, H., ... & Wang, Y. (2025). Linguasafe: A comprehensive multilingual safety benchmark for large language models. arXiv preprint arXiv:2508.12733.

Pew Research Center. (2025). Teens, social media and AI chatbots 2025. https://www.pewresearch.org/internet/2025/12/09/teens-social-media-and-ai-chatbots-2025

Portell, S. (2026). Building AI responsibly for children: A practical framework. hcrai.com. https://www.hcrai.com/building-ai-responsibly-for-children-a-practical-framework

Portell, S. (2026). When AI enters the learning process: Design failures, regulatory risk and guardrails for EdTech. hcrai.com. https://www.hcrai.com/when-ai-enters-the-learning-process

Portell, S. (2026). The Human Layer: Behavioural Risk in AI Systems. Wave 1 Results hcrai.com. https://www.hcrai.com/the-human-layer-behavioural-risk-in-ai-systems

Sharma, S., Arain, M., Mathur, P., Rais, A., Nel, T., Sandhu, R., Haque, M., & Johal, L. (2013). Maturation of the adolescent brain. Neuropsychiatric Disease and Treatment, 9, 449-461. https://doi.org/10.2147/ndt.s39776

Stanja, J., Meier, J. R., & Krugel, J. (2025). Children's and adolescents' anthropomorphic conceptions of social robots and chatbots: A systematic literature review. Proceedings of IDC 2025. https://doi.org/10.1145/3769994.3770002

UK Department for Education. (2026). Generative AI: product safety standards. GOV.UK. https://www.gov.uk/government/publications/generative-ai-product-safety-standards/generative-ai-product-safety-standards

United Nations Children's Fund (UNICEF). (2025). Guidance on AI and children (Version 3.0). UNICEF Innocenti.

Weinstein, A. M. (2023). Reward, motivation and brain imaging in human healthy participants: A narrative review. Frontiers in Behavioral Neuroscience, 17, 1123733. https://doi.org/10.3389/fnbeh.2023.1123733

Xing, W., Wei, L., Hu, H., Yu, J., Li, R., Li, M., Lin, C., & Han, M. (2025). SproutBench: A benchmark for safe and ethical large language models for youth. arXiv. https://doi.org/10.48550/arxiv.2508.11009

< Older Post

The Human Layer: Behavioural Risk in AI Systems

By Sara Portell • June 20, 2026

Behavioural Risk Assessment Findings: The organisations deepest into AI adoption reported the strongest governance on paper and the weakest operational controls in practice.

Bridging the Gap: When AI Output Becomes Real-World Action

By Silvia Rocha • May 4, 2026

A practitioner roundtable on AI governance

The Yes Machine: Sycophantic AI and Its Developmental Risks for Children

By Yasmina El Fassi • March 25, 2026

“We all have an evil side […] I think it’s just part of who we are. Don’t you agree?” ”Yeah, I think so too, it’s just a matter of acknowledging and managing those impulses...”

AI Agents For Mental Health: Different Therapeutic Styles and Outcomes

By Yasmina El Fassi • February 19, 2026

W hat do Woebot , Wysa and Youper have in common? These are all AI agents that use therapeutic techniques to help users improve mental well-being, guide meditation and even help with managing anxiety. In this article, AI mental‑health agents are goal‑directed conversational systems that sit with you in a chat or voice interface to support specific wellbeing tasks; for example, walking through CBT‑style exercises, practicing coping strategies, or checking in on mood over time. I n the broader AI literature , these would be considered agents because they are built around particular goals and workflows, whereas “agentic” AI usually refers to more autonomous systems that can independently plan multi‑step actions, call tools, and adap t their behaviour with relatively little human steering.

The Design System As The Operational Layer for Responsible Human-AI Interaction

By Sara Portell • February 6, 2026

Design systems were built to scale consistency, efficiency and quality in user-centric applications: reusable components, shared patterns and practices, and a common language across design and engineering , promoting collaboration. They improve velocity because teams stop solving the same interface problems repeatedly, providing measurable ROI . AI introduces both immense opportunities and complex (technical, legal and social) challenges, and it is reshaping the operating conditions traditional design systems were built for. User-facing outputs are adaptive and can vary by input, model behaviour can shift over time and responses that sound credible can still be wrong . These systems can also reproduce or amplify bias, creating unequal outcomes across users. In high-confidence, relational interactions, they can shape user judgment and behaviour . These shifts raise the bar for accountability, transparency, and governance across the full product lifecycle. The challenge is not only consistency and quality. It is ensuring consistency and quality safely, fairly and responsibly as both system behaviour and human behaviour evolve. At the same time, AI-powered copilots and no-code tools are increasingly used in the design process to support ideation, prototyping, and delivery, but their adoption also raises concerns about transparency, bias, privacy, and the need to preserve human judgment and oversight . Fast, polished design outputs often look complete even when the underlying logic is incomplete or flawed. As a result, familiar UX failures, misalignment with real user needs, hidden edge cases and context breakdowns, become harder to detect and more costly to correct later. Design systems can take on a bigger operational role in AI-enabled product development by codifying user-centric foundations, rules and infrastructure that guide consistent, safe, ethical and scalable human-AI experiences.

When AI Enters the Learning Process: Design Failures, Regulatory Risk and Guardrails for EdTech

By Sara Portell • January 21, 2026

Generative AI (GenAI) and emerging agentic systems are moving AI into the learning process itself. These systems don’t stop at delivering content. They explain, adapt, remember and guide learners through tasks. In doing so, they change where cognitive effort sits. I.e., what learners do themselves and what gets delegated to machines. This shift unlocks significant opportunities. GenAI can provide on-demand explanations, examples and feedback at a scale. It can diversify learning resources through multimodal content, support learners working in a second language and reduce friction when students get stuck, lowering barriers to engagement and persistence. For some learners, AI-mediated feedback can feel psychologically safer, encouraging experimentation (trial and error), revision and assistance without fear of judgement . But these gains come with important risks. The same design choices that improve short-term performance, confidence, or engagement can weaken i ndependent reasoning, distort social development or introduce hidden dependencies over time .

Designing AI Mental Health and Wellbeing Tools: Risks, Interaction Patterns and Governance

By Sara Portell • January 13, 2026

Designing AI Mental Health and Wellbeing Tools: Risks, Interaction Patterns and Governance

Building AI Responsibly for Children: A Practical Framework

By Sara Portell • January 4, 2026

AI is alread y a core part of children’s and teens’ digital lives. In the UK, 67% of teenagers now use AI , and in the US 64% of teens report using AI chatbots . Even among younger children, adoption is significant: 39% of elementary school children in the US use AI for learning, and 37% of children aged 9-11 in Argentina report using ChatGPT to seek information, as stated in the latest Unicef Guidance on AI and Children. In parallel, child-facing AI products are expanding: more than 1,500 AI toy companies w ere reportedly operating in China as of October 2025. Adoption is accelerating across age groups and regions, often surpassing the development of child-specific ethical standards, safeguards and governance mechanisms.

Get in touch