Solving the confusion o1, are inference models like DeepSeek-R1 thinking or not?
Found a fun paper "Thoughts Are All Over the Place: on the Underthinking of o1-Like LLMs" on the topic of analyzing o1-like inference models Thinking about Path Frequency...





























































































![[转]用 2000 美元 EPYC 服务器本地跑起 Deepseek R1 671b 大模型](https://aisharenet.com/wp-content/uploads/2025/02/78984d5c0694467.png)


