Skip to content

Natalie Shapira

Organisation
Independent Researcher
Biography

Why do you care about AI Existential Safety?

We have lost control of the ability to predict all usable technology behavior. Artificial intelligence continues to develop at a dizzying pace and enter more and more into our lives. This process is inevitable. At some point in the future, artificial intelligence may aspire in directions that do not align with humanity’s desires. We need to prepare for this moment and prepare tools that will help us monitor such deviations.

If you have got here, please read Isaac Asimov’s wonderful short story: All the Troubles of the World:
https://schools.ednet.ns.ca/avrsb/070/rsbennett/HORTON/shortstories/All%20the%20troubles%20of%20the%20world.pdf

Please give at least one example of your research interests related to AI existential safety:

My latest published research: Clever Hans or Neural Theory of Mind? Stress Testing Social Reasoning in Large Language Models
Following the escalating debate on AI’s theory of mind capabilities, I suppose you know the wolf story, I found it important to provide an objective picture of what we currently have.
https://aclanthology.org/2024.eacl-long.138.pdf

I am now continuing my research in developing novel methods for identifying theory of mind abilities and also hope to find ways to control those skills.

Sign up for the Future of Life Institute newsletter

Join 40,000+ others receiving periodic updates on our work and cause areas.
cloudmagnifiercrossarrow-up
linkedin facebook pinterest youtube rss twitter instagram facebook-blank rss-blank linkedin-blank pinterest youtube twitter instagram