Cognitive Biases and AI Value Alignment: An Interview with Owain Evans
Click here to see this page in other languages: Russian At the core of AI safety, lies the value alignment problem: how can we teach artificial intelligence systems to act in accordance with human goals and values? Many researchers interact with AI systems to teach them human values, using techniques like inverse reinforcement learning (IRL). In […]