General Introduction
Deep Searcher combines powerful large language models (such as DeepSeek and OpenAI models) with vector databases (e.g., Milvus) to search, evaluate, and reason over private data, producing highly accurate answers and comprehensive reports. The project is applicable to enterprise knowledge management, intelligent Q&A systems, and information retrieval scenarios. Deep Searcher supports a wide range of embedding models and large language models, and manages vector databases to ensure efficient retrieval and secure use of data.
Function List
- Private Data Search: Maximize the use of internal enterprise data and ensure data security.
- Vector Database Management: Supports vector databases such as Milvus, which allows data partitioning for more efficient retrieval.
- Flexible embedding options: Compatible with multiple embedding models for easy selection of the best solution.
- Multiple large language model support: Supports large models such as DeepSeek and OpenAI for intelligent Q&A and content generation.
- Document Loader: Local file loading is supported and web crawling will be added in the future.
Usage Guide
Installation process
- Clone the repository:
git clone https://github.com/zilliztech/deep-searcher.git
- Create a Python virtual environment (recommended):
python3 -m venv .venv
source .venv/bin/activate
- Install the dependencies:
cd deep-searcher
pip install -e .
- Configure LLM or Milvus: edit the examples/example1.py file to configure the LLM or Milvus as needed (a minimal sketch of such a configuration follows these steps).
- Prepare the data and run the example:
python examples/example1.py
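The exact contents of examples/example1.py depend on the repository version; the following is a minimal sketch of the kind of configuration it performs, assuming the Configuration/init_config pattern used in the project's examples. The model name, local Milvus Lite URI, and empty token are illustrative assumptions.
from deepsearcher.configuration import Configuration, init_config

config = Configuration()

# Choose the LLM provider and model (the model name is an assumption; adjust to your setup).
config.set_provider_config("llm", "OpenAI", {"model": "gpt-4o"})

# Point the vector database at a local Milvus Lite file; a remote Milvus server
# would use its own uri/token instead (both values here are illustrative).
config.set_provider_config("vector_db", "Milvus", {"uri": "./milvus.db", "token": ""})

# Apply the configuration before loading or querying data.
init_config(config=config)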
Instructions for use
- Configure the LLM: in the deepsearcher.configuration module, use the set_provider_config method to configure the LLM. For example, to configure the OpenAI model:
config.set_provider_config("llm", "OpenAI", {"model": "gpt-4o"})
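In the project's examples this call is typically made on a Configuration instance and then applied with init_config. The sketch below follows that pattern; the DeepSeek and embedding model names are illustrative assumptions, and other supported providers are selected the same way.
from deepsearcher.configuration import Configuration, init_config

config = Configuration()

# Select a different LLM provider with the same method (model name is an assumption).
config.set_provider_config("llm", "DeepSeek", {"model": "deepseek-reasoner"})

# Embedding models are configured the same way (provider and model names are assumptions).
config.set_provider_config("embedding", "OpenAIEmbedding", {"model": "text-embedding-ada-002"})

# Apply the configuration so subsequent loading and querying use these providers.
init_config(config=config)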
- Load local data: use the load_from_local_files method in the deepsearcher.offline_loading module to load local data:
load_from_local_files(paths_or_directory="your_local_path")
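For example, a whole folder of documents can be indexed in one call. Whether the function also accepts an optional collection name is version-dependent, so treat the extra keyword below as an assumption and check your installed version's signature; the path is illustrative.
from deepsearcher.offline_loading import load_from_local_files

# Index every supported document under a local folder into the vector database.
load_from_local_files(
    paths_or_directory="./company_docs",   # illustrative path
    collection_name="company_docs",        # assumed optional argument; may not exist in all versions
)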
- Query data: use the query method in the deepsearcher.online_query module to run a query (a combined end-to-end sketch follows this list):
result = query("Write a report about xxx.")
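Putting the three steps together, a complete run looks roughly like the sketch below. The model name and data path are assumptions, and what query returns (a plain answer string or a structure that also includes retrieval details) varies by version, so inspect the result in your environment.
from deepsearcher.configuration import Configuration, init_config
from deepsearcher.offline_loading import load_from_local_files
from deepsearcher.online_query import query

# 1. Configure providers and apply the configuration.
config = Configuration()
config.set_provider_config("llm", "OpenAI", {"model": "gpt-4o"})  # model name is an assumption
init_config(config=config)

# 2. Index the private documents (path is illustrative).
load_from_local_files(paths_or_directory="./company_docs")

# 3. Ask a question against the indexed data.
result = query("Write a report about xxx.")
print(result)  # may be a string or include retrieval details, depending on the version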
Detailed feature workflow
- Private Data Search:
- Maximize the use of data within the enterprise while ensuring data security.
- Online content can be integrated when more accurate answers are needed.
- Vector Database Management:
- Supports vector databases such as Milvus, which allows data partitioning for more efficient retrieval.
- Support for more vector databases (e.g. FAISS) is planned for the future.
- Flexible embedding options:
- Compatible with multiple embedding models for easy selection of the best solution.
- Multiple large language model support:
- Supports large models such as DeepSeek and OpenAI for intelligent Q&A and content generation.
- Document Loader:
- Local file loading is supported and web crawling will be added in the future.