Edge 422: How NuminaMath Won the AI Math Olympiad
The model combines a novel neurosymbolic architecture with a unique training mechanism.
The AI Mathematical Olympiad (AIMO) has been one of the most interesting initiatives to evaluate sophisticated math reasoning in AI models. Launched a few months ago, AIMO set up a $10 million prize for models that can reason at the level of a gold medalist in the International Math Olympiad (IMO) competitions for high school students. To perform at that level, AI models need to exhibit sophisticated capabilities in areas such as multi-step reasoning, math, and deep language understanding. I was fascinated by the AIMO challenge and have been tracking the progress of the different models quite closely over the last few months, trying to understand the techniques they were using to solve such complex challenges.
After months of competition, NuminaMath 7B TIR emerged as the winner. The model was a collaboration between HuggingFace and Numina, a lab focused on advancing math capabilities in foundation models. You probably know a lot about HuggingFace but very little about Numina, so let's fix that.