2 articles

New methods internalize outcome feedback into step-level guidance, expanding reasoning beyond chain-of-thought limitations.

New arXiv batch addresses loss landscape myths, multi-LLM coordination, sparse caching, and adaptive computation depth.