AI on the Ground

AI systems take shape in the world through practice. We study how AI systems and social worlds shape one another by researching and working with the people closest to their design and impact. We listen to practitioners and researchers, community members and advocates, following AI into the places where its promises become practical problems and its impacts become visible.

What We Do

Across institutional and community settings, we produce empirical research that helps ground AI policy and public accountability in lived experience. Our work combines critique with practical tools, helping policymakers and publics make sense of AI in context and build more reflexive and responsible sociotechnical practices.


Ongoing Projects

Our research spans those who build and procure systems and those who use and are affected by them. Our projects interrogate the infrastructural and social conditions that make AI systems possible and how they are sustained and governed in practice, with particular attention to the judgments, tensions, and adaptations that shape ongoing implementation.


Conversational AI for Mental and Emotional Care

We study how people turn to general-purpose AI chatbots for emotional support, companionship, and mental health care, often in the absence of affordable or accessible professional alternatives. Our research explores how these relationships develop over time, following the reliance they generate, the harms that accumulate, and where existing oversight frameworks fall short.

AI, Science, and the Future of Knowing

Through close observation of laboratory practice, we trace how AI tools are reshaping the habits of scientific reasoning: what counts as evidence, how expertise is exercised, how judgment is taught, and where claims to knowledge become credible or begin to unravel. We translate these findings into practical interventions for funders and research institutions, opening new ways to evaluate and govern scientific work as AI becomes part of its everyday practice.

Benchmarking and Evaluating AI

This project examines how AI systems come to be measured and compared in ways that make them appear trustworthy. Our research follows evaluation practices and the making of benchmarks, tracing how complex domains are translated into datasets, tasks, prompts, metrics, and standards. Across this work, we ask how evaluation can remain accountable to the forms of expertise and situated contexts that make benchmark results meaningful.


  • "Public engagement in AI evaluation practices raise not only methodological questions — how and when should these practices be conducted, who should participate, and how should the findings be used — but also thorny conceptual questions: Whose interests are being protected? What counts as problematic model behavior, and who gets to define it? Is the public an object being secured, or a resource being used?"
    Excerpt from Red-Teaming in the Public Interest

Explore our research and resource library

Explore