I found an interesting paper, "Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs", which analyzes the frequent switching of reasoning paths and the lack of focused thinking in o1-like reasoning models, a phenomenon the authors call "underthinking", and at the same time proposes a method to alleviate ...
Introduction Deep learning models have driven innovation across many fields thanks to their strong performance. However, the continual growth of model scale is a double-edged sword: it improves performance while sharply increasing compute demands and storage pressure. This is especially true in resource-constrained applications ...
Abstract Although Large Language Models (LLMs) perform well, they are prone to hallucination and to generating factually inaccurate information. This challenge has motivated work on attributed text generation, which prompts LLMs to produce content backed by supporting evidence. In this paper, we present a new approach called Think&Cite ...
Introduction The purpose of this document is to help readers quickly understand and grasp the core concepts and applications of prompt engineering through a series of example prompts (a selection). These examples are all drawn from an academic paper that systematically reviews prompt engineering techniques ("The Prompt Report: A Systematic Survey of Pr...
Titans: Learning to Memorize at Test Time Original text: https://arxiv.org/pdf/2501.00663v1 Titans architecture Unofficial implementation: https://github.com/lucidrains/titans-pytorch I. Research Background and Motivation: Transformer of ...
For any application that requires a Retrieval-Augmented Generation (RAG) system, converting massive PDF documents into machine-readable blocks of text (also known as "PDF chunking") is a major headache. There are both open-source projects and commercial products on the market, but honestly, there is no program that can really...
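As a rough illustration of what "chunking" means in practice, here is a minimal sketch of fixed-size chunking with character overlap; the function name and parameters are illustrative, not taken from any tool mentioned above:

```python
def chunk_text(text: str, chunk_size: int = 500, overlap: int = 50) -> list[str]:
    """Split text into fixed-size chunks, each overlapping the previous one.

    Illustrative only: real PDF chunkers also respect page, paragraph,
    and table boundaries rather than cutting at arbitrary offsets.
    """
    if chunk_size <= overlap:
        raise ValueError("chunk_size must be larger than overlap")
    chunks = []
    step = chunk_size - overlap
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        if start + chunk_size >= len(text):
            break
    return chunks

chunks = chunk_text("A" * 1200, chunk_size=500, overlap=50)
print(len(chunks))  # → 3 (spans 0:500, 450:950, 900:1200)
```

The overlap exists so that a sentence split across a chunk boundary still appears whole in at least one chunk, which matters for retrieval quality.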
Jailbreaks of the official DeepSeek R1 are a great experimental environment for triggering essentially every type of censorship mechanism, and you can learn many defense techniques from them. This article on large-model censorship mechanisms walks you through examples of large-model jailbreaks over the years. Large-model censorship mechanisms are usually used...
Original: https://cdn.openai.com/o3-mini-system-card.pdf 1 Introduction The OpenAI o model family is trained using large-scale reinforcement learning to reason using chains of thought. These advanced reasoning capabilities provide new ways to improve the security and robustness of our models. In particular, ...
Quick Reads A comprehensive and in-depth look at the past and present of the Scaling Law of Large Language Models (LLMs) and the future direction of AI research. With clear logic and rich examples, author Cameron R. Wolfe takes the reader from the basic concepts to the...
Abstract Large language models (LLMs), such as OpenAI's GPT-4, Google's PaLM, and Meta's LLaMA, have dramatically transformed Artificial Intelligence (AI) by enabling human-like text generation and natural language understanding. However, their reliance on static training data limits their ability to respond to dynamic, real-time queries...
Artificial Intelligence (AI) is a rapidly growing field. Language models have evolved to the point where AI Agents can perform complex tasks and make complex decisions. However, as the skills of these Agents continue to grow, the infrastructure to support them struggles to keep up. LangGraph, a library designed to revolutionize...
Introduction Like many others, over the past few days my news feeds have been filled with news, praise, complaints, and speculation about the Chinese-made DeepSeek-R1 large language model, which was released last week. The model itself is being compared against some of the best reasoning models from OpenAI, Meta, and other...
Summary of Key Contributions of CORAG CORAG (Cost-Constrained Retrieval Optimization for Retrieval-Augmented Generation) is an innovative retrieval-augmented generation (RAG) system designed to address key challenges in existing RAG approaches. The following CORAG ...
Knowledge distillation is a machine learning technique that transfers knowledge from a large pre-trained model (the "teacher model") to a smaller "student model". Distillation techniques can help us develop lighter-weight generative models for intelligent conversation, content creation, and other areas. Recently Distil...
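To make the teacher/student relationship concrete, here is a minimal sketch of the classic distillation loss: the KL divergence between temperature-softened teacher and student output distributions, with the T² scaling from Hinton et al.'s original formulation. The function names and logit values are illustrative:

```python
import numpy as np

def softmax(logits: np.ndarray, T: float = 1.0) -> np.ndarray:
    """Temperature-softened softmax; higher T flattens the distribution."""
    z = logits / T
    z = z - z.max()            # subtract max for numerical stability
    e = np.exp(z)
    return e / e.sum()

def distillation_loss(student_logits, teacher_logits, T: float = 2.0) -> float:
    """KL(teacher || student) on temperature-softened distributions,
    scaled by T^2 so gradients keep a comparable magnitude across T."""
    p_t = softmax(np.asarray(teacher_logits, dtype=float), T)
    p_s = softmax(np.asarray(student_logits, dtype=float), T)
    return float(T * T * np.sum(p_t * (np.log(p_t) - np.log(p_s))))

teacher = [2.0, 1.0, 0.1]   # teacher's logits over 3 classes (made up)
student = [1.5, 1.2, 0.3]   # student's logits for the same input
loss = distillation_loss(student, teacher)
print(loss)  # small positive number; 0 only if distributions match
```

In practice this soft-target loss is combined with the ordinary cross-entropy on the hard labels, weighted by a mixing coefficient.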
Recently, many people working on large-model training and inference have been discussing the relationship between a model's parameter count and its size. For example, the famous alpaca-series LLaMA models come in four versions with different parameter counts: LLaMA-7B, LLaMA-13B, LLaMA-33B, and LLaMA-65B. Here "...
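The core of the relationship is simple arithmetic: parameter count times bytes per parameter (2 for fp16/bf16, 4 for fp32), ignoring a small amount of metadata overhead. A minimal sketch, using LLaMA-7B's roughly 7 billion parameters as the example:

```python
def model_size_gb(num_params: float, bytes_per_param: int) -> float:
    """Approximate size of a model's weights in GiB (weights only,
    ignoring file-format metadata and optimizer state)."""
    return num_params * bytes_per_param / (1024 ** 3)

# LLaMA-7B: ~7 billion parameters
print(round(model_size_gb(7e9, 2), 1))  # fp16/bf16 → ~13.0 GiB
print(round(model_size_gb(7e9, 4), 1))  # fp32      → ~26.1 GiB
```

This is why a "7B" checkpoint in half precision occupies roughly 13 GiB on disk; training needs several times more memory because gradients and optimizer state are stored alongside the weights.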
Original article: https://arxiv.org/pdf/2412.15479 INTERPRETATION: This article itself is not very innovative and has limited practical application. However, it reminds me of three highly informative articles I read a long time ago. Reading it alongside those three earlier articles will hopefully bring you more inspiration. Recommended reading: the...
In the field of artificial intelligence and machine learning, especially when building applications such as RAG (Retrieval-Augmented Generation) systems and semantic search, efficiently processing and retrieving massive amounts of unstructured data becomes crucial. Vector databases have emerged as a core technology for addressing this challenge. They do more than store high-dimensional ...
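To illustrate the core operation a vector database accelerates, here is a brute-force nearest-neighbor search by cosine similarity; real systems replace this O(n) scan with approximate indexes such as HNSW. The vectors and names are illustrative:

```python
import numpy as np

def cosine_top_k(query: np.ndarray, vectors: np.ndarray, k: int = 2):
    """Return the indices and scores of the k vectors most similar
    to the query under cosine similarity (exact, brute-force)."""
    q = query / np.linalg.norm(query)
    v = vectors / np.linalg.norm(vectors, axis=1, keepdims=True)
    sims = v @ q                        # cosine similarity to each row
    idx = np.argsort(-sims)[:k]         # highest similarity first
    return idx.tolist(), sims[idx].tolist()

# Three toy 2-d "document embeddings"
docs = np.array([[1.0, 0.0], [0.0, 1.0], [0.7, 0.7]])
idx, sims = cosine_top_k(np.array([1.0, 0.1]), docs, k=2)
print(idx)  # → [0, 2]: doc 0 is closest, doc 2 next
```

Real embeddings have hundreds or thousands of dimensions and millions of rows, which is exactly why the index structures inside vector databases matter.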
Xiaohongshu, a hugely popular social e-commerce platform in China and across Asia, has long outgrown being a simple shopping app to become a barometer of young people's lifestyles and a new arena for brand marketing. For overseas brands and individuals hoping to enter the Chinese market or reach young consumers, mastering Xiaohongshu...
Unexpectedly, AI has turned the programming field upside down. From v0 and bolt.new to agent programming tools like Cursor and Windsurf, AI coding shows huge potential for turning an idea into an MVP. From traditional AI-assisted coding to today's direct generation of entire projects, the question in the end is...