Google AI Achieves Medal-Level Performance at International Mathematical Olympiad, Transforming Future of Math Competitions
July 26, 2024Google DeepMind trained specialized AI systems, AlphaProof and AlphaGeometry 2, to solve complex math problems and achieved silver-medal standards in the International Mathematical Olympiad (IMO).
Google's AI programs translated IMO problems into formal mathematical language, differing from human contestants who work directly with problem statements during the competition.
AlphaProof solved two algebra and one number theory problem, including the hardest question that only five contestants managed to solve.
AlphaGeometry 2 successfully handled a complex geometry problem involving triangle ABC and tangents to the incircle in just 19 seconds.
The AI systems excelled in solving algebra and number theory problems but struggled with combinatorics.
This achievement marks the first time any AI system has reached a medal-level performance on IMO questions.
The success of these AI systems could lead to collaborations between humans and AI in solving and creating new mathematical problems.
The development of these systems has implications for AI advancement and the path to AGI (Artificial General Intelligence).
OpenAI is working on a new reasoning technology project called 'Strawberry' to enhance AI reasoning capabilities, raising concerns among researchers about its impact on humanity.
Meta CEO Mark Zuckerberg announced steps towards open-source AI becoming the industry standard and the release of Llama 3.1 as their latest model.
XTX Markets offered a $5 million prize for an AI capable of winning a gold medal at the IMO, encouraging more teams to participate, including Google DeepMind with future public entries.
Summary based on 19 sources
Get a daily email with more World News stories
Sources
The New York Times • Jul 25, 2024
AlphaProof, a New A.I. from Google DeepMind, Scores Big at the International Math OlympiadThe Guardian • Jul 25, 2024
Google DeepMind takes step closer to cracking top-level mathsWIRED • Jul 25, 2024
Google DeepMind's Game-Playing AI Tackles a Chatbot BlindspotNature • Jul 25, 2024
DeepMind hits milestone in solving maths problems — AI’s next grand challenge