Moreover, they show a counter-intuitive scaling Restrict: their reasoning hard work will increase with problem complexity as many as a point, then declines Even with having an enough token spending plan. By comparing LRMs with their normal LLM counterparts below equal inference compute, we establish three general performance regimes: https://www.youtube.com/watch?v=snr3is5MTiU