Apple Researchers Unveil Limitations of Large Language Models In Mathematical Reasoning
GSM-Symbolic enables more controllable evaluations, providing key insights and more reliable metrics for measuring the reasoning capabilities of models.
Oct 12, 2024, 10:51 AM IST