MIRI’s February 2018 Newsletter

Published

27 February, 2018

Author

Rob Bensinger

Updates

News and links

In “Adversarial Spheres,” Gilmer et al. investigate the tradeoff between test error and vulnerability to adversarial perturbations in many-dimensional spaces.
Recent posts on Less Wrong: Critch on “Taking AI Risk Seriously” and Ben Pace’s background model for assessing AI x-risk plans.
“Solving the AI Race“: GoodAI is offering prizes for proposed responses to the problem that “key stakeholders, including developers, may ignore or underestimate safety procedures, or agreements, in favor of faster utilization”.
The Open Philanthropy Project is hiring research analysts in AI alignment, forecasting, and strategy, along with generalist researchers and operations staff.

This newsletter was originally posted on MIRI’s website.

Our newsletter

Subscribe to our newsletter and join over 20,000+ people who believe in our mission to preserve the future of life.

Including: Anthropic's new Claude Mythos model; Trump endorses an AI kill switch; Florida opens the first criminal probe of an AI company; and more.

Maggie Munro

1 May, 2026

Including: AI vs. Cancer; proposed data center moratorium; military AI news; and more.

Maggie Munro

1 April, 2026

Including: Anthropic drama; our new Protect What's Human campaign; war game simulations show AI defaults to terrifying outcomes; and more.

Maggie Munro

1 March, 2026