AF - Evaluating AI Systems for Moral Status Using Self-Reports by Ethan Perez
The Nonlinear Library: Alignment Forum

AF - Evaluating AI Systems for Moral Status Using Self-Reports by Ethan Perez

2023-11-16
Link to original articleWelcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: Evaluating AI Systems for Moral Status Using Self-Reports, published by Ethan Perez on November 16, 2023 on The AI Alignment Forum. TLDR: In a new paper, we explore whether we could train future LLMs to accurately answer questions about themselves. If this works, LLM self-reports may help us test them for...
View more
Comments (3)

More Episodes

All Episodes>>

Get this podcast on your phone, Free

Creat Yourt Podcast In Minutes

  • Full-featured podcast site
  • Unlimited storage and bandwidth
  • Comprehensive podcast stats
  • Distribute to Apple Podcasts, Spotify, and more
  • Make money with your podcast
Get Started
It is Free