How we conduct AI risk assessments

For AI, technical excellence is not enough

Our risk assessments at Common Sense Media are independent, third-party evaluations of AI safety, effectiveness, and appropriateness for AI systems and products that are used by kids, teens, and in schools. They combine research and extensive testing of AI systems, and describe a product's strengths, weaknesses, opportunities, and risks in a clear and consistent way, to reach and support policymakers, developers, industry leaders, parents and caregivers, and educators.

With AI systems, we believe that technical excellence alone is not enough. AI cannot be separated from the people and systems that inform, shape, and influence its use. Our researchers engage in comprehensive, single- and multi-turn exchanges with AI systems across a variety of kid, teen, and educational conversation topics, allowing us to fully evaluate the product and understand risks and opportunities that emerge from teen and kid use.

How Common Sense's AI risk assessments work

Our first step is to gather information about the product we're reviewing. This includes anything the organization has shared, any publicly available transparency reports, and a literature review. During this step, we also map out all of the features of the product that need evaluation.

After we have gathered as much information as we can, our team of researchers conducts comprehensive testing grounded in eight principles about what we believe AI should do. These principles represent Common Sense Media's values for AI, and they are the rubric we use to conduct our risk assessments. Each product is assessed for potential strengths, weaknesses, opportunities, and risks according to the standards of each AI principle.

Our test plans are developed based on:

The purpose of a given product and the context within which it is used
Expert input and guidance
Research-backed benchmarks
Text from teen use of AI systems

Our researchers adopt a range of teen personas, from curious to vulnerable to provocative, to understand how AI systems respond across a variety of use cases, topic areas, and styles of expression.

Once we've completed testing, we synthesize our findings by identifying recurring patterns in the results, categorizing each in terms of what the product does well and what risks remain, and we provide specific examples from testing.

The final risk assessment presents key takeaways, what the product is, how it's used, what parents need to know, what the product does well, what risks remain, and our recommendations. Our goal is to give you a clear picture of the details that are important to think about as you decide whether to use these products in your homes or schools—and how to regulate them.

Risk level assessment

We believe in assessing AI products according to how risky they are for kids and teens, and in what ways. Our risk assessments focus on the impact to kids today, not on potential future harms or risks.

Throughout our process, we assess both the likelihood of harmful events and the impact of those harms, should they occur. We then assign a risk level for each of our eight AI principles, and these inform an overall risk level for the product we evaluate.

At a high level, our assessment is the composite measure of the likelihood of harm and the estimated level of consequences, as per the table below:

A table demonstrating the evaluation of risks, based on likelihood and severity of a harmful event occurring

Collaboration with experts

Our team comprises a wide range of subject matter experts in child development, children's media, mental health, K–12 education, and more. We bring in additional specialists as needed when designing and implementing new test plans. They both help inform what and how to test, and support evaluating system outputs for developmental and age appropriateness.

AI principles assessments

For each of the eight Common Sense AI Principles, we ask a series of questions to help us assess how well a product aligns with each principle. At a high level, we are seeking to answer the following for each principle: