Modern LLMs, like OpenAI’s o1 or DeepSeek’s R1, improve their reasoning by generating longer chains of thought. However, this ...