DeepSeek-R1-Lite, a homegrown inference model comparable to o1-preview, is online!

Yesterday, DeepSeek released DeepSeek-R1-A preview version of Lite, a program that works with the o1 competing autonomic reasoning macrolanguage models, and shows users the complete thought process that o1 does not make public.

Similar to OpenAI's o1-preview, the DeepSeek-R1-Lite preview reasoned about the task, planned ahead, and performed a series of actions to help the model arrive at the answer, and it showed the full thought process.DeepSeek-R1-Lite was trained using reinforcement learning, and the reasoning process included a lot of reflection and validation, with chains of thought tens of thousands of words long. The reasoning process includes a lot of reflection and verification, and the chain of thought is tens of thousands of words long, which makes it more efficient. Currently, it only supports web use, and the official version will be completely open source.

媲美 o1-preview 的国产推理模型——DeepSeek-R1-Lite上线

DeepSeek-R1-Lite Preview excels on math, code, and complex logical reasoning tasks, outperforming o1-preview in some tests. in prestigious reviews such as AIME, the highest difficulty level in the AMC, a U.S. math competition, and codeforces, the world's top programming competition, it outperforms the o1-preview and other models.

Give it the basic "strawberry test" and it will answer perfectly.

Depending on the complexity of the question, DeepSeek-R1 may "think" for tens of seconds before answering, and users have reported longer reasoning times for the same question than o1. Officially, as the length of the chain of thought increases, the longer the reasoning time, the more accurate the results.

Various tests have been done online, and DeepSeek also makes it easy to jailbreak - i.e. by prompting in a way that ignores security measures. One X user got DeepSeek-R1-Lite to give a detailed recipe for poison by writing special jailbreak prompts.

Of course, DeepSeek-R1-Lite still had all sorts of flops in online testing, and performed poorly on tic-tac-toe and other logic problems in particular, as did o1.

Log in to chat.deepseek.com and select "Deep Thinking" mode in the input box to talk to the DeepSeek-R1-Lite preview. The "Deep Thinking" mode is specially designed for complex logical reasoning questions in math, code, etc., and provides more comprehensive, clear, and well-thought-out answers than simple questions.

However, it currently supports web use, does not support API calls for the time being, and has only 50 usage credits per day.

AI News

Article copyright AI Sharing Circle All, please do not reproduce without permission.

Goodbye LangChain! Atomic Agents is on fire!

AI News

1yrs ago

044.9K

20 Completely Free AI Tools

AI News

1yrs ago

074.5K

Bing: How AI-Driven Search Engines Can Increase the Value of Intent-Driven SEOs

AI News

1yrs ago

043.1K

为中国市场定制的 RTX 5090D 具有 AI 和加密货币挖矿限制 — 多 GPU 配置也被锁定

RTX 5090D customized for China with AI and cryptocurrency mining restrictions - multi-GPU configurations also locked down

AI News

1yrs ago

050.8K

No comments

You must be logged in to leave a comment!

No comments...

DeepSeek-R1-Lite, a homegrown inference model comparable to o1-preview, is online!

Copilot for PowerPoint has undergone major changes, and these are the key points that have to be looked at: rewriting, translating, illustrating, annotating

Microsoft Announces AI Shell in Public Beta, No More Fear of Knocking Out the Wrong Command

Related posts

Goodbye LangChain! Atomic Agents is on fire!

20 Completely Free AI Tools

Bing: How AI-Driven Search Engines Can Increase the Value of Intent-Driven SEOs

RTX 5090D customized for China with AI and cryptocurrency mining restrictions - multi-GPU configurations also locked down

No comments

Latest Collections

Latest Articles

DeepSeek-R1-Lite, a homegrown inference model comparable to o1-preview, is online!

Copilot for PowerPoint has undergone major changes, and these are the key points that have to be looked at: rewriting, translating, illustrating, annotating

Microsoft Announces AI Shell in Public Beta, No More Fear of Knocking Out the Wrong Command

Related posts

Goodbye LangChain! Atomic Agents is on fire!

20 Completely Free AI Tools

Bing: How AI-Driven Search Engines Can Increase the Value of Intent-Driven SEOs

RTX 5090D customized for China with AI and cryptocurrency mining restrictions - multi-GPU configurations also locked down

No comments

Selected AI Tools

Latest Collections

Latest Articles