In the first half of this talk I will introduce a few of my recent research projects spanning benchmark development for agentic AI systems, novel policy impact analysis frameworks, and technical literacy development. In the second half of the talk I will introduce our exciting new Human Rights Benchmark project which aims to assess language model performance through the lens of internationally recognized human rights. I will outline our methodology, including prompt development and model evaluation strategies, and discuss how this work contributes to more accountable and socially aligned AI systems.
Dr. Savannah Thais is an Associate Research Scientist in the Columbia Data Science Institute with a focus on machine learning (ML). She is interested in complex system modeling and in understanding what types of information is measurable or modelable and what impacts designing and performing measurements have on systems and societies. This work is informed by her background in high energy particle physics and incorporates traditional scientific experiment design components such as uncertainty quantification, experimental blinding, and decorrelation/de-biasing methods. Her recent work has focused on geometric deep learning, methods to incorporate physics-based inductive biases into ML models, regulation of emerging technology, social determinants of health, and community education.