Before we start, let's clarify a few key terms. Workflow: simply put, the complete set of steps needed to accomplish something. It is like an instruction manual that tells you what needs to be done, in what order, and by whom, in order to reach your goal. Input: before the workflow begins, you need to...
This article is part of the series "Understanding and Deploying Agent AI": Agent AI Series 1: Comparison between Devin and Cursor Agent; Agent AI Series 2: From Thinker to Doer: The Paradigm Revolution and Technical Architecture of Agent AI; Agent AI Series 3: Turning $20 into $50...
When building large language model (LLM) applications, a memory system is one of the key technologies for managing conversation context, storing information long term, and supporting semantic understanding. An efficient memory system helps the model stay consistent across long conversations, extract key information, and even retrieve historical conversations...
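As a rough illustration of what such a memory system does, here is a minimal sketch. The class name `ConversationMemory` and the word-overlap scoring are assumptions for the example; a production system would typically use an embedding model and a vector index instead.

```python
# Minimal sketch of a conversation memory store (hypothetical class name).
# It keeps the full message history and retrieves the turns most relevant
# to a new query by simple word-overlap scoring, standing in for semantic search.
from dataclasses import dataclass, field


@dataclass
class ConversationMemory:
    messages: list[dict] = field(default_factory=list)

    def add(self, role: str, content: str) -> None:
        # Store every turn so long conversations stay recoverable.
        self.messages.append({"role": role, "content": content})

    def recall(self, query: str, k: int = 3) -> list[dict]:
        # Score past turns by word overlap with the query and return the top-k.
        q = set(query.lower().split())
        scored = sorted(
            self.messages,
            key=lambda m: len(q & set(m["content"].lower().split())),
            reverse=True,
        )
        return scored[:k]


memory = ConversationMemory()
memory.add("user", "My order number is 12345 and it arrived damaged.")
memory.add("assistant", "Sorry to hear that, I have noted order 12345.")
print(memory.recall("what was my order number?"))
```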
OpenAI Function Calling V2 features: the core goal of Function Calling V2 is to give OpenAI models the ability to interact with the outside world, which comes down to two core capabilities. Fetching data, a function-calling take on RAG: essentially RAG (Retrieval-Augmented...
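To make the "fetching data" pattern concrete, here is a minimal sketch using the openai Python SDK's chat-completions tools interface (it assumes an API key is configured). The function name `search_knowledge_base`, its schema, and the model name are illustrative choices, not details from the excerpt.

```python
# Minimal sketch: declare a retrieval tool, let the model decide to call it,
# then (in a real app) run the retrieval and send the result back.
import json
from openai import OpenAI

client = OpenAI()

tools = [{
    "type": "function",
    "function": {
        "name": "search_knowledge_base",  # hypothetical tool name
        "description": "Retrieve documents relevant to the user's question.",
        "parameters": {
            "type": "object",
            "properties": {"query": {"type": "string"}},
            "required": ["query"],
        },
    },
}]

response = client.chat.completions.create(
    model="gpt-4o-mini",  # illustrative model choice
    messages=[{"role": "user", "content": "What is our refund policy?"}],
    tools=tools,
)

# If the model decides to call the tool, inspect the call; feeding the retrieval
# result back in a follow-up request is omitted here for brevity.
for call in response.choices[0].message.tool_calls or []:
    print(call.function.name, json.loads(call.function.arguments))
```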
Basic Concepts: In the field of information technology, retrieval refers to the process of efficiently locating and extracting relevant information from a large dataset (usually documents, Web pages, images, audio, video, or other forms of information) in response to a user's query or need. Its core goal is to find information that is relevant to the use...
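A minimal sketch of this idea of retrieval, using TF-IDF and cosine similarity from scikit-learn; the toy corpus is invented for the example, and real systems add indexing, ranking models, and semantic embeddings on top of this basic pattern.

```python
# Minimal sketch: score documents against a query and return the best match.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

documents = [
    "RAG augments a language model with retrieved context.",
    "Vector databases store embeddings for similarity search.",
    "Fine-tuning adapts a pretrained model to a narrow task.",
]

vectorizer = TfidfVectorizer()
doc_matrix = vectorizer.fit_transform(documents)

query_vec = vectorizer.transform(["how do vector databases search?"])
scores = cosine_similarity(query_vec, doc_matrix)[0]

# Rank documents by similarity to the query and print the most relevant one.
best = scores.argmax()
print(documents[best], scores[best])
```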
Agent AI: Surveying the Horizons of Multimodal Interaction Originally published at https://ar5iv.labs.arxiv.org/html/2401.03568 Abstract Multimodal AI systems are likely to be ubiquitous in our daily lives. Making these systems more interactive a...
GraphReader: a graph-based agent that enhances long-text processing for large language models. Like a tutor who is good at drawing mind maps, it transforms lengthy text into a clear knowledge network, so the AI can easily find each key point needed for an answer, as if exploring along a map, and effectively gr...
CAG (Cache-Augmented Generation), up to 40 times faster than RAG (Retrieval-Augmented Generation). CAG revolutionizes knowledge acquisition: instead of retrieving external data in real time, all knowledge is preloaded into the model's context. It is like condensing a huge library into an on-the-go toolkit that can be used whenever needed...
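A minimal sketch of the preloading idea described here: the whole (small) knowledge base is placed in the context once and reused for every question, with no per-query retrieval. The facts and prompt layout are invented for the example; real CAG implementations also precompute and reuse the model's KV cache for the fixed prefix, which is only indicated by a comment below.

```python
# Minimal sketch of Cache-Augmented Generation: one fixed, preloaded context
# shared by all queries instead of a retrieval step per query.
KNOWLEDGE_BASE = [
    "The warranty period for all products is 24 months.",
    "Returns are accepted within 30 days of delivery.",
]

# Built once and reused; with a local model, the KV cache for this prefix
# could be precomputed so each query only pays for its own tokens.
PRELOADED_CONTEXT = "Known facts:\n" + "\n".join(f"- {fact}" for fact in KNOWLEDGE_BASE)


def build_prompt(question: str) -> str:
    # Every query shares the same preloaded context; no retrieval runs here.
    return f"{PRELOADED_CONTEXT}\n\nQuestion: {question}\nAnswer:"


print(build_prompt("How long is the warranty?"))
```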
By Julia Wiesinger, Patrick Marlow and Vladimir Vuskovic. Originally published at https://www.kaggle.com/whitepaper-agents Table of Contents: Introduction; What is an Agent?; Models; Tools; Orchestration Layers; Agents and Models; Cognitive Architecture: How Agents Work; Tools ...
Retrieval-Augmented Generation (RAG) is becoming one of the most popular applications of Large Language Models (LLMs) and vector databases. RAG is the process of augmenting the input to an LLM with context retrieved from a vector database (e.g., Weaviate). The RAG application passes...
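Here is a minimal sketch of the retrieve-then-augment loop this excerpt describes. The `embed` function is a placeholder for any embedding model, and the in-memory search stands in for a vector database such as Weaviate; the corpus and prompt format are invented for the example.

```python
# Minimal sketch of RAG: retrieve the most similar document, then prepend it
# to the prompt that goes to the LLM.
import numpy as np


def embed(text: str) -> np.ndarray:
    # Placeholder embedding: hash words into a small dense vector.
    vec = np.zeros(64)
    for word in text.lower().split():
        vec[hash(word) % 64] += 1.0
    return vec / (np.linalg.norm(vec) + 1e-9)


corpus = [
    "Weaviate stores objects together with their vector embeddings.",
    "RAG passes retrieved context to the LLM alongside the user question.",
]
corpus_vectors = np.stack([embed(doc) for doc in corpus])

question = "How does RAG use retrieved context?"
scores = corpus_vectors @ embed(question)
context = corpus[int(scores.argmax())]

# The retrieved passage is prepended to the prompt before calling the LLM.
prompt = f"Context: {context}\n\nQuestion: {question}\nAnswer:"
print(prompt)
```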
A Multi-Agent System (MAS) is a computing system consisting of multiple interacting Intelligent Agents. Multi-Agent Systems can be used to solve problems that are difficult or impossible to solve by a single Intelligent Agent or a single system. Intelligent agents can be robots, humans, or soft...
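A minimal sketch of the interaction idea behind a multi-agent system: two simple agents with different roles pass messages through a small orchestration loop. The roles (planner, worker) and the hand-off protocol are illustrative assumptions, not details from the excerpt.

```python
# Minimal sketch of a multi-agent system: agents exchange messages, and the
# combination solves a task neither handles alone.
class Agent:
    def __init__(self, name: str, behaviour):
        self.name = name
        self.behaviour = behaviour  # function: incoming message -> reply

    def act(self, message: str) -> str:
        return self.behaviour(message)


planner = Agent("planner", lambda msg: f"Plan for '{msg}': split into 2 subtasks")
worker = Agent("worker", lambda msg: f"Executed: {msg}")

# A tiny orchestration loop: the planner's output becomes the worker's input.
task = "summarise the quarterly report"
plan = planner.act(task)
result = worker.act(plan)
print(plan)
print(result)
```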
First, LLMs already have strong capabilities, so why do we still need RAG (Retrieval-Augmented Generation)? Although LLMs have demonstrated significant capabilities, the following challenges still deserve attention. Hallucination problem: LLMs generate text word by word using a statistically based probabilistic approach, a mechanism that inherently leads to the possibility of...
o3 is here, and I'd like to share some personal thoughts. Progress on the test-time scaling law has been much faster than we expected. But I would say the path is actually a bit convoluted: it is OpenAI taking an indirect route in its pursuit of AGI. Reinforcement Learning and Shortcut Thinking: For ...
Vector embeddings are at the core of current Retrieval-Augmented Generation (RAG) applications. They capture the semantic information of data objects (e.g., text, images) and represent them as arrays of numbers. In current generative AI applications, these vector embeddings are usually produced by embedding models. How to apply for RAG ...
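A minimal sketch of what "arrays of numbers that capture semantics" means in practice: texts are encoded into vectors whose cosine similarity reflects semantic closeness. The sentence-transformers model name and the example sentences are illustrative choices, not ones named in the excerpt.

```python
# Minimal sketch of vector embeddings and semantic similarity.
import numpy as np
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("all-MiniLM-L6-v2")  # illustrative model choice

texts = [
    "a cat sleeping on the sofa",
    "a kitten napping on the couch",
    "quarterly revenue report",
]
vectors = model.encode(texts)  # array of shape (3, embedding_dim)


def cosine(a: np.ndarray, b: np.ndarray) -> float:
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))


# Semantically similar sentences score higher than unrelated ones.
print(cosine(vectors[0], vectors[1]))  # high
print(cosine(vectors[0], vectors[2]))  # low
```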
PREFACE: Over the past two years, Retrieval-Augmented Generation (RAG) has gradually become a core component for enhancing agents. By combining the dual capabilities of retrieval and generation, RAG can bring in external knowledge, enabling more applications of large models in complex scenarios...
Agent: the most common translation I have seen so far is "intelligent body", although the literal translation is "agent". And what should "agentic" be translated as? I feel that simply keeping the word "agentic" is more appropriate. So, to avoid confusing readers, I use the English terms directly in this article. With the development of LLMs, the ability of AI...
In order for an AI model to be useful in a particular scenario, it usually needs access to background knowledge. For example, a customer support chatbot needs to understand the specific business it serves, while a legal analysis bot needs to have access to a large number of past cases. Developers often use Retrieval-Augmente...
Full Process of Fine-tuning Large Models: it is recommended to follow the above process strictly during fine-tuning and avoid skipping steps, which can lead to wasted effort. For example, if the dataset is not fully constructed and you eventually discover that the fine-tuned model performs poorly because of dataset quality, then the earlier effort is wasted, and the matter...
OlaChat AI Digital Intelligence Assistant: a 10,000-word in-depth analysis of the past and present of Text-to-SQL technology. Paper: Next-Generation Database Interfaces: A Survey of LLM-based Text-to-SQL. Generating accurate SQL from natural-language questions (text-to-SQL) is a long...
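A minimal sketch of the text-to-SQL setup the survey covers: give a model the schema and the natural-language question, get SQL back, and validate it by executing it. `generate_sql` is a placeholder for any LLM call; the schema, data, and returned query are invented for the example.

```python
# Minimal sketch of text-to-SQL: schema + question in, SQL out, then execution
# against the database to check the query is at least valid.
import sqlite3

SCHEMA = "CREATE TABLE orders (id INTEGER, customer TEXT, total REAL);"


def generate_sql(question: str, schema: str) -> str:
    # In a real system this prompt goes to an LLM; the returned SQL is a stand-in.
    _prompt = f"Schema:\n{schema}\n\nQuestion: {question}\nSQL:"
    return "SELECT customer, SUM(total) FROM orders GROUP BY customer;"


conn = sqlite3.connect(":memory:")
conn.execute(SCHEMA)
conn.execute("INSERT INTO orders VALUES (1, 'Ada', 19.9), (2, 'Ada', 5.0)")

sql = generate_sql("How much has each customer spent?", SCHEMA)
print(conn.execute(sql).fetchall())  # executing the query verifies it runs
```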