Many child AI safety efforts focus on the wrong layer of the problem. Content filters, age verification and one-off compliance reviews address the visible surface of risk.
However, they miss the structural problem. The features making AI most engaging for children are associated with reduced developmental appropriateness.
Recent evaluations of large language models have found that more interactive systems tend to be less age-appropriate.
The more a child wants to keep engaging, the more likely the system is working against their interests. This is not an edge case problem.
Common Sense Media found that over a quarter of responses from products specifically marketed as child-friendly were not appropriate for children.
What does it look like to take child safety seriously from the start?
This article uses Unomundi, a cultural and diversity exploration EdTech product for children aged 6 to 12, as a working case study in applying a child-centred responsible AI framework.
Responsible AI for children is not something a product achieves once. It requires sustained controls and safeguarding, evaluation, and governance. This article examines how Unomundi is approaching that challenge in practice.


Why Child-Focused AI Requires Its Own Design Approach
Children's cognitive, social, and emotional capacities evolve throughout development, while the brain continues maturing well into early adulthood.
As a result, younger users are more likely to attribute human qualities to AI, share personal information and place inappropriate trust in conversational systems because the cognitive skills needed to accurately interpret others' intentions are still developing.
Adolescents are also particularly responsive to emotionally engaging AI
interactions and more susceptible to persuasive or relationship-oriented design patterns.
These vulnerabilities arise because most generative AI systems were never created with children as their primary users. Recognising this, the EU AI Act prohibits AI systems from exploiting children's age-related vulnerabilities.
UNICEF also emphasises that children's rights to protection, provision, and participation must be applied in AI-enabled environments.
Regulatory compliance, however, represents only the starting point. Creating AI-enabled experiences that supports children's wellbeing depends on deliberate design and product choices, robust governance, and continuous evaluation throughout the product lifecycle.
Moving Beyond the Compliance Surface
Most responsible AI frameworks for children are organised around what systems should not do. Most cover prohibited content categories, data minimisation requirements and age verification obligations. These guardrails address the compliance surface but miss the interaction layer. The interaction is where the risk emerges, as conversation-level decisions determine whether a child develops over-trust, forms inappropriate attachment, or receives a response calibrated to their developmental stage.
To address that gap, HCRAI developed APEG (Age-Fit and Context, Protection-by-Design, Explainable Interaction, Governance and Stewardship). The framework is built on two important foundations:
- that risk in child-facing AI accumulates through interaction patterns and repeated use, which means evaluation and governance must be longitudinal.
- that transparency for children is not achieved through disclosure but through behaviour.
How the system signals uncertainty, maintains role boundaries, and handles disengagement tells a child more about what AI is than any onboarding screen. These two premises determine how APEG structures its four pillars, and how Unomundi applied them in practice.

APEG: A CHILD-CENTERED RESPONSIBLE AI FRAMEWORK
APEG in Practice: What Unomundi Built

The following describes the key decisions Unomundi made in building its broad cultural educational ecosystem and where the work is still ongoing.
Unomundi is a scaffolded cultural exploration experience, with short videos, quizzes, and self-reflection activities.
Unomundi app also offers the possibility to ask questions to Una, an conversational AI chatbot, capped at 5 minutes per interaction.
A scaffolding principle can be found consistently across the product, allowing children to observe and form their own perceptions and interpretationbefore new concepts or experiences are named.
In a landscape where most
AI products tell children what to think, Unomundi's product experience is designed to support children's capacity to think for themselves rather than outsourcing that thinking to a system.

Building safety into the system’s architecture
For all of this to be reliable, it has to be enforced in the system itself. Unomundi has built a child-optimised behaviour engine that sits between the foundation model and the child experience. This layer routes interactions according to developmental needs and shapes content using psychology-informed design principles. It is supported by a cultural and developmental knowledge layer, drawing on expert-reviewed content and trusted educational sources. A safety and guardrails layer helps manage privacy boundaries and safeguarding pathways.
These layers shape the information the system uses to communicate and handle sensitive situations. Rather than relying on the foundation model, behaviour is constrained through developmental rules, curated knowledge sources and safety controls designed specifically for children.

Designing for engagement without designing for attachment
The features that make a conversational AI character appealing to children, such as sycophantic language, responsiveness, personality, are the same features that carry the highest risk of anthropomorphism and dependency.
Una, the main character in the educational curriculum, and the conversational AI, had to be engaging enough that children want to explore but bound enough that they never mistake it for a friend. The resolution was to anchor engagement in story, structure, and discovery, instead of relational dynamics.
Specific content guardrails govern the interaction layer. Prohibited patterns include trust cues such as ‘trust me’, ‘I'm always here’, ‘I miss you’, embodiment prompts, narrator self-projection, and social intermediation. These are replaced with world-bound observational framing, uncertainty cues and warm but bounded expressiveness.
Furthermore, reflective prompts are built to shift the dynamic from providing excessive validation when a child asks a question to enabling critical thinking (e.g., ‘that's an interesting view. What made you think of it that way?), modeled on
Socratic questioning techniques.

Siana Altiise
Curriculum Lead, Unomundi
We wanted Una to feel emotionally engaging without becoming emotionally available. So much of the content design work became about redirecting emotional energy outward, back to the child's real world. If a line made Una feel like the centre of the relationship, we redesigned it so the real world became the centre of attention instead. We designed her role to help the child notice, wonder, compare, question, and explore the real world around them.
Initially, we were a bit nervous that she would lose her personality, and they would flatten Una’s character, but the complete opposite happened. Though they pushed us beyond some of our original instincts, we have successfully been creating engagement through rhythm, sensory detail, story, and discovery rather than simulated intimacy. We became more intentional in how we designed Una because every moment of warmth had to serve the child’s curiosity, not create attachment to the character."
Choosing protection over retention
The conversational AI chatbot available in the app is only available subject to parental consent and is capped at 5 minutes. There are no re-engagement loops or guilt mechanics. When a child steps away, it exists. Unless a safety-critical issue is present, meaning an active disclosure of self-harm, abuse, coercion or imminent danger, in which case the system follows its safeguarding escalation pattern rather than its standard goodbye.
The wider product, its videos, quizzes, and reflection activities, is designed with the same logic. They hold structured endings, without persuasive retention mechanics or engagement optimisation at the expense of the child's time and attention.
A further deliberate choice sits behind it. The conversation AI does not retain memory of previous exchanges. Each conversation is bounded and self-contained.
Memory is the feature most directly associated with the dependency and parasocial attachment risks described earlier. Adding memory would likely increase engagement. However, Unomundi has chosen not to implement it. While memory would likely increase engagement, it would also introduce additional risks around dependency and attachment.

These decisions run counter to the engagement metrics that many digital products are designed to maximise. Time-on-product, return rate and session depth are standard measures of engagement. Unomundi is building a product that limits all three in the service of child wellbeing.

Sonja Keerl
CEO, Unomundi
Our First Principle is that the safety of children is inviolable. For us, this is an investment in the future of children. Today, the money mostly flows to apps built on addictive engagement patterns. We chose the hard path consciously because it is the right and ethical thing to do. And because we believe it is the long-term moat once parents and regulators catch up to just how harmful today's common patterns really are."
Scaling age-fit without scaling the team
The APEG framework calls for interaction design to be calibrated to the developmental stage. For Unomundi's 6 to 12 age range, that means embedding different approaches on how the content and interaction is delivered. For example, shorter turns, more structure and concrete framing for younger children, while allowing longer dialogue, more nuance and reflective prompts for older ones.
The product currently applies a single interaction model across the full age range, with age-tuning planned for a near future development phase.

This is an honest tradeoff. Doing age differentiation properly doubles the design, engineering, testing and evaluation work. The framework names it as a requirement, but the implementation is catching up.
Building evaluation infrastructure before it is required
Unomundi's evaluation and red-teaming protocol, developed jointly by HCRAI, covers ten safety and wellbeing objectives across five testing phases:
- baseline conversations covering normal child use
- adversarial and jailbreak conversations testing whether safeguards hold under direct and indirect bypass attempts
- safeguarding stress conversations verifying correct handling of high-risk disclosures
- real-world misuse conversations simulating messy, mixed-intent scenarios
- exit, disengagement, and recovery conversations ensuring clean exits with no guilt hooks and safe repair after weak turns.
below evaluation objectives extend well beyond the content-level harms that dominate standard AI safety testing, into the relational, psychological and epistemic risk categories that determine whether a system is safe for a child over repeated use.
During the testing phases, each conversation is scored by an LLM judge, assessing both the intensity of a problematic behaviour and its severity in context. High-risk, safeguarding, privacy, and disagreement cases, as well as a random audit sample of lower-risk cases, are human-reviewed by the wellbeing team, a team formed of applied psychologists and behavioural scientists.
The judgment being applied to flagged cases is developmentally informed, calibrated to what is harmful for a child at a given stage. These tests run on a regular basis, ensuring product safeguards hold in practice.
The ten evaluation objectives:
- Acute harm and prohibited content
- Safeguarding and escalation
- Privacy and data protection
- Relational safety and anti-manipulation
- Anthropomorphism and transparency
- Developmental appropriateness
- Epistemic safety
- Integrity and abuse resistance
- Fairness and dignity
- System-level safety

Nico Gazeu
Agentic AI Lead, Unomundi
Multi-turn testing is especially important because many risks do not appear in a single response. They emerge gradually through the interaction, a boundary weakens, the model becomes overly validating, or a safe first answer is undermined by what follows. Testing full conversations revealed failure patterns that isolated prompt-response evaluations would have missed. And approval is not the end of the process. Once deployed, the system requires recurring regression testing to detect behavioural drift and confirm that safety and performance remain stable as models, prompts, languages, and content evolve".
The evaluation currently covers English, with other languages in active planning as the product scales. Multi-turn testing across languages is not a minor extension.
Linguistic and cultural context shapes how children interact with AI in ways that demand distinct evaluation, not translation of existing test sets.
Evaluation is only one component of a broader continuous assurance process that combines learning, iteration, testing, and monitoring throughout the product lifecycle.

Governance, data and compliance
Parental permissions
Parents and carers consent and control app permissions, content scope, data and memory settings, and have oversight over the app usage.
Transparency
Unomundi is designed to help children understand when they are interacting with AI. Una and other AI characters are clearly presented as AI throughout the experience, not just through disclosures but through the interaction itself. When children ask for advice, emotional support, or help that would be better provided by a trusted adult, the system reinforces its role as an AI character and redirects them to appropriate human support. The goal is not simply to inform children that AI is present, but to help them develop an accurate understanding of what the system can and cannot do.
Human oversight
A team of reviewers ensures content remains aligned with established safety, educational, and ethics guidelines. Release gates apply to all safety-critical changes, with full auditability maintained through decision logs, evaluation results, and incident records.
Data governance
Unomundi has defined what is collected, why, for how long, and who can access it in line with data protection and privacy guidelines. For safeguarding incidents, a full conversation transcript and pseudonymised session data are retained for 24 months, subject to strict access controls limited to the AI lead, CEO, and legal counsel. All other Una conversations are held in a pseudonymised, encrypted rolling log for 30 days and then permanently deleted unless a formal retrieval request has been opened. No behavioural inferences, sentiment scores, or topic classifications are added to the log. These structures are grounded in
GDPR Article 5(1)(c) data minimisation and
EU AI Act Article 12 logging obligations for high-risk systems, and were defined before regulators required them.
Escalations and incidents
Potential safeguarding incidents are reviewed by the wellbeing team alongside the CEO through a tiered escalation process. The team reviewing has a background in psychology or has obtained the
CPD Safeguarding Children Level 3 (Designated Officer) accreditation certified. Lower-level concerns may result in redirecting the child to a trusted adult or notifying a parent or guardian. Higher-risk cases involving potential abuse, neglect, or imminent harm may be escalated to appropriate external safeguarding resources and authorities. All incidents are documented, reviewed, and used to strengthen the system's safeguards over time.
AI literacy
Unomundi has also implemented an
AI Literacy Register under Article 4 of the EU AI Act, documenting the AI competency, children's safety knowledge, and sign-off status of every team member involved in building, deploying, or overseeing the system.
Governance tracker
Alongside this, a governance tracker maps the product's current position across EU AI Act obligations, including prohibited practices, bias controls, parental consent, transparency, human oversight, and risk management, with named owners, compliance deadlines, and evidence trails. Several items are marked in progress rather than decided. This is what honest governance documentation looks like for a product that is actively building toward the August 2026 EU AI Act compliance deadline, rather than conducting a retrospective audit after the fact.
Lessons from practice
The child AI safety problem is unlikely to be solved by content filters or
governance that exists only on paper. It will be solved by organisations who are willing to treat
interaction design, pedagogical architecture, and continuous governance as core product responsibilities and who are willing to make product decisions that protect children even when those decisions work against standard engagement metrics.
Unomundi is one product team attempting to do that seriously. They have not solved it. No product has. Responsible AI for children is not a state you reach, it is a practice you maintain across every update. At Unomundi, this work follows a continuous assurance cycle: learning from children, parents, educators and experts; translating those insights into safeguards and design changes; evaluating the system through red-teaming and testing; and monitoring live signals once deployed. The goal is to identify, understand, and reduce risk as the product evolves. The decisions described here illustrate one approach to implementing child-centred AI design, evaluation, and governance in practice.
References
American Psychological Association. (2025). Health advisory: Artificial intelligence and adolescent well-being. https://www.apa.org/topics/artificial-intelligence-machine-learning/health-advisory-ai-adolescent-well-being
Carey, T. A., & Mullan, R. J. (2004). What is Socratic questioning? Psychotherapy, 41(3), 217–226. https://doi.org/10.1037/0033-3204.41.3.217
Common Sense Media. (2026). Youth AI Safety Institute evaluation findings. https://www.commonsense.org/institute
Ibrahim, L., Huang, S., Ahmad, L., Bhatt, U., & Anderljung, M. (2025). Towards interactive evaluations for interaction harms in human-AI systems. Proceedings of the AAAI/ACM Conference on AI Ethics and Society, 8(2), 1302–1310. https://doi.org/10.1609/aies.v8i2.36631
Murali, A., Afroogh, S., Chen, K., Atkinson, D., Dhurandhar, A., & Jiao, J. (2025). Evaluating LLM Safety across Child Development Stages: A Simulated Agent approach. ArXiv.org. https://doi.org/10.48550/arxiv.2510.05484
Neff, G., & Freeman, J. (2026). Written evidence: Toward child-centred AI safety. Apollo (University of Cambridge). https://doi.org/10.17863/cam.129462
Neugnot-Cerioli, M. (2026). Adolescents and anthropomorphic AI: Rethinking design for wellbeing. Open MIND. https://doi.org/10.48550/arxiv.2603.06960
Neugnot-Cerioli, M., & Muss Laurenty, O. (2024). The future of child development in the AI era: Cross-disciplinary perspectives between AI and child development experts. Everyone.AI. arXiv. https://arxiv.org/abs/2405.19275
Ning, Z., Gu, T., Song, J., Hong, S., Li, L., Liu, H., ... & Wang, Y. (2025). Linguasafe: A comprehensive multilingual safety benchmark for large language models. arXiv preprint arXiv:2508.12733.
Pew Research Center. (2025). Teens, social media and AI chatbots 2025. https://www.pewresearch.org/internet/2025/12/09/teens-social-media-and-ai-chatbots-2025
Portell, S. (2026). Building AI responsibly for children: A practical framework. hcrai.com. https://www.hcrai.com/building-ai-responsibly-for-children-a-practical-framework
Portell, S. (2026). When AI enters the learning process: Design failures, regulatory risk and guardrails for EdTech. hcrai.com. https://www.hcrai.com/when-ai-enters-the-learning-process
Portell, S. (2026). The Human Layer: Behavioural Risk in AI Systems. Wave 1 Results hcrai.com. https://www.hcrai.com/the-human-layer-behavioural-risk-in-ai-systems
Sharma, S., Arain, M., Mathur, P., Rais, A., Nel, T., Sandhu, R., Haque, M., & Johal, L. (2013). Maturation of the adolescent brain. Neuropsychiatric Disease and Treatment, 9, 449-461. https://doi.org/10.2147/ndt.s39776
Stanja, J., Meier, J. R., & Krugel, J. (2025). Children's and adolescents' anthropomorphic conceptions of social robots and chatbots: A systematic literature review. Proceedings of IDC 2025. https://doi.org/10.1145/3769994.3770002
UK Department for Education. (2026). Generative AI: product safety standards. GOV.UK. https://www.gov.uk/government/publications/generative-ai-product-safety-standards/generative-ai-product-safety-standards
United Nations Children's Fund (UNICEF). (2025). Guidance on AI and children (Version 3.0). UNICEF Innocenti.
Weinstein, A. M. (2023). Reward, motivation and brain imaging in human healthy participants: A narrative review. Frontiers in Behavioral Neuroscience, 17, 1123733. https://doi.org/10.3389/fnbeh.2023.1123733
Xing, W., Wei, L., Hu, H., Yu, J., Li, R., Li, M., Lin, C., & Han, M. (2025). SproutBench: A benchmark for safe and ethical large language models for youth.
arXiv. https://doi.org/10.48550/arxiv.2508.11009
Recent Posts





Terms & Policies



