🚀 Invitation to Experience: China's First AI IDE Intelligent Programming Software Trae Chinese version downloadThe DeepSeek-R1 and Doubao-pro are available for unlimited use!

AI knowledge Page 5

CAG: A cache-enhanced generation method that is 40 times faster than RAG

CAG (Cache Augmented Generation) that is 40 times faster than RAG (Retrieval Augmented Generation).CAG revolutionizes knowledge acquisition: instead of retrieving external data in real time, all knowledge is pre-loaded into the model context. It's like condensing a huge library into an on-the-go toolkit that can be used when needed...

2025-01-07

Google Agents and Basic Applications White Paper (Chinese version)

By Julia Wiesinger, Patrick Marlow and Vladimir Vuskovic Originally published at https://www.kaggle.com/whitepaper-agents Table of Contents Introduction What is an Intelligent Body? Models Tools Orchestration Layers Intelligent Bodies and Models Cognitive Architecture: How Intelligent Bodies Work Tools ...

2025-01-04

Trae Chinese Version First Invitation to Download: Unlimited use of DeepSeek-R1 after registration!

Enable Builder Smart Programming Mode, unlimited use of DeepSeek-R1 and DeepSeek-V3, smoother experience than the overseas version. Just enter the Chinese commands, even a novice programmer can write his own apps with zero threshold.

2025-05-08

2023 Old article review: a guide to the RAG system build process and evaluation

Retrieval Augmented Generation (RAG) is becoming one of the most popular applications for Large Language Models (LLMs) and vector databases.RAG is the process of augmenting the input to a LLM with context retrieved from vector databases (e.g., Weaviate).The RAG application passes...

2025-01-04

Approaching Multi-Agent Systems (MAS): a Collaborative AI World

A Multi-Agent System (MAS) is a computing system consisting of multiple interacting Intelligent Agents. Multi-Agent Systems can be used to solve problems that are difficult or impossible to solve by a single Intelligent Agent or a single system. Intelligent agents can be robots, humans, or soft...

2025-01-03

An article to take you to understand RAG (Retrieval Augmented Generation), the concept of theoretical introduction + code practice

First, LLMs already have strong capabilities, why do we still need RAG (Retrieval Augmented Generation)? Although LLMs have demonstrated significant capabilities, the following challenges still warrant attention: Illusion problem: LLMs use a statistically based probabilistic approach to generate text word by word, a mechanism that inherently leads to the possibility of...

2025-01-02

OpenAI-o3 and Monte-Carlo Ideas

o3 is here to share some personal insights. Progress on Test-time Scaling Law has been much faster than we thought. But I'd like to say that the path is actually a bit convoluted - it's OpenAI's way of saving the country from the curve in its pursuit of AGI. Reinforcement Learning and Shortcut Thinking For ...

2025-01-01

How to choose the best Embedding model for a RAG application

Vector Embedding is the core of current Retrieval Augmented Generation (RAG) applications. They capture semantic information of data objects (e.g., text, images, etc.) and represent them as arrays of numbers. In current generative AI applications, these vector Embedding are usually generated by Embedding models. How to apply for RAG ...

2025-01-01

A 10,000-word article on RAG optimization in DB-GPT real-world scenarios.

PREFACE Over the past two years, Retrieval-Augmented Generation (RAG, Retrieval-Augmented Generation) technology has gradually become a core component for enhancing intelligences. By combining the dual capabilities of retrieval and generation, RAG is able to introduce external knowledge, thus providing more applications of large models in complex scenarios...

2024-12-31

Top 5 AI Agent Frameworks Worth Getting Into in 2025

Agent The most common translation I've seen so far is "intelligent body", but the direct translation is "agent". What does Agentic translate to? I feel that a word like "agentic" is more appropriate. So in order not to confuse the readers, I use English directly in this article. With the development of LLM, the ability of AI...

2024-12-28

朴素、有效的RAG检索策略：稀疏+密集混合检索并重排，并利用“提示缓存”为文本块生成整体文档相关的上下文-首席AI分享圈

Simple, effective RAG retrieval strategy: sparse + dense hybrid search and rearrangement, and use "cue caching" to generate overall document-relevant context for text chunks.

In order for an AI model to be useful in a particular scenario, it usually needs access to background knowledge. For example, a customer support chatbot needs to understand the specific business it serves, while a legal analysis bot needs to have access to a large number of past cases. Developers often use Retrieval-Augmente...

2024-12-27Knowledge Retrieval and the RAG Framework

Large model fine-tuning knowledge points that even a novice can understand

Full Process of Fine-tuning Large Models It is recommended to strictly follow the above process during fine-tuning and avoid skipping steps, which may lead to ineffective labor. For example, if the dataset is not fully constructed, and it is eventually found that the poor effect of the fine-tuned model is a problem of the quality of the dataset, then the preliminary efforts will be wasted, and the matter...

2024-12-24

A 10,000-word article comprehending the development process of LLM-based Text-to-SQL

OlaChat AI Digital Intelligence Assistant 10,000-word in-depth analysis to bring you to the past and present of Text-to-SQL technology. Thesis: Next-Generation Database Interfaces: a Survey of LLM-based Text-to-SQL Generating accurate SQL from natural language problems (text-to-SQL) is a long...

2024-12-24

Late Chunking x Milvus: How to Improve RAG Accuracy

01.Background In RAG application development, the first step is to chunk the document, efficient document chunking can effectively improve the accuracy of the subsequent recall content. Efficient document chunking can effectively improve the accuracy of the subsequent recalled content. How to efficiently chunk is a hot topic of discussion, there are such as fixed-size chunking, random-size chunking, sliding window...

2024-12-23

Anthropic summarizes simple and effective ways to build efficient intelligences

Over the past year, we've worked with teams building Large Language Model (LLM) agents across multiple industries. Consistently, we have found that the most successful implementations did not use complex frameworks or specialized libraries, but rather were built with simple, composable patterns. In this post, we'll share our experience working with customers and since...

2024-12-21

多为来自Anthropic的专家关于Prompt Engineering的讨论-首席AI分享圈

Mostly experts from Anthropic discuss Prompt Engineering

AI Summary Overview An in-depth look at AI cue engineering, with a roundtable format in which several experts from Anthropic share their understanding and practical experience of cue engineering from a variety of perspectives, including research, consumer, and enterprise. The article details the definition of cue engineering, its importance, and how...

2024-12-19

Scaling Test-Time Compute：向量模型上的思维链-首席AI分享圈

Scaling Test-Time Compute: Chain of Thought on Vector Models

Scaling Test-Time Compute has become one of the hottest topics in AI circles since OpenAI released the o1 model. Simply put, instead of piling up computational power in the pre-training or post-training phases, it is better to do it in the inference phase (i.e., when the large language model generates the output...

2024-12-17

2024 RAG Inventory, RAG Application Strategy 100+

Looking back to 2024, the big models are changing day by day, and hundreds of intelligent bodies are competing. As an important part of AI applications, RAG is also a "swarm of heroes and lords". At the beginning of the year ModularRAG continued to heat up, GraphRAG shine, open source tools in full swing in the middle of the year, the knowledge graph re-innovation opportunity, the end of the year graphical reasoning ...

2024-12-14

Best-of-N 越狱法：对输入内容进行简单的随机变形并反复尝试，就能让主流 AI 系统突破安全限制产生有害回应-首席AI分享圈

Best-of-N Jailbreak: a simple random morphing of input and repeated attempts to get mainstream AI systems to break through security restrictions to produce harmful responses

In recent years, with the rapid development of Generative AI (GAI) and Large Language Model (LLM), their security and reliability issues have attracted much attention. A recent study has discovered a simple but efficient attack method called Best-of-N jailbreak (BoN for short). By inputting ...