Overview
DeepSeek is a groundbreaking open-source large language model that delivers advanced conversational AI through its algorithmic architecture and chain-of-thought reasoning capabilities. With private deployment, you retain full control over data security and usage, and you can flexibly adjust the deployment scheme and customize the system to your needs.
Dify, likewise an open-source AI application development platform, offers a complete private deployment solution. By seamlessly integrating a locally deployed DeepSeek service into the Dify platform, organizations can build powerful AI applications within their own server environment while ensuring data privacy.
The advantages of the private deployment option:
- Superior performance: a dialog interaction experience comparable to commercial models
- Environment isolation: fully offline operation, eliminating the risk of data leakage
- Data controllability: full control of data assets to meet compliance requirements
Prerequisites
Hardware Environment:
- CPU ≥ 2 cores
- GPU memory / RAM ≥ 16 GiB (recommended)
Software Environment:
- Docker
- Docker Compose
- Ollama
- Dify Community Edition
Starting deployment
1. Install Ollama
Ollama is a cross-platform large model management client (macOS, Windows, Linux) designed for seamlessly deploying large language models (LLMs) such as DeepSeek, Llama, and Mistral. Ollama provides one-click model deployment, and all usage data is stored locally on your machine, giving full data privacy and security.
Visit the Ollama website and follow the prompts to download and install the Ollama client. After installation, run the ollama -v command to check the version number:
➜ ~ ollama -v
ollama version is 0.5.5
Select an appropriately sized DeepSeek model based on your actual environment configuration. The 7B model is recommended for an initial installation.
Run the command ollama run deepseek-r1:7b to install the DeepSeek R1 model.
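Once the download finishes, you can optionally confirm that Ollama is serving the model before moving on. A minimal check, assuming Ollama is listening on its default local address and port:
curl http://localhost:11434/api/generate -d '{"model": "deepseek-r1:7b", "prompt": "Say hello in one sentence.", "stream": false}'   # should return a JSON response generated by the model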
2. Install Dify Community Edition
Visit the Dify GitHub repository and run the following commands to clone the code repository and complete the installation.
git clone https://github.com/langgenius/dify.git
cd dify/docker
cp .env.example .env
docker compose up -d   # If the version is Docker Compose V1, use: docker-compose up -d
After running the command, you should see the status and port mapping of all containers. For detailed instructions, please refer to the Docker Compose Deployment documentation.
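To confirm the containers are up, you can list them with Docker Compose (container names may differ slightly between Dify versions):
docker compose ps   # all Dify containers should be listed as running / healthy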
Dify Community Edition uses port 80 by default. Visit http://your_server_ip to access your private Dify platform.
3. Connecting DeepSeek to Dify
Click your avatar in the top right corner of the Dify platform → Settings → Model Provider, select Ollama, and tap Add Model.
The DeepSeek entry under Model Provider corresponds to the online API service; a locally deployed DeepSeek model is connected through the Ollama provider. Please make sure the DeepSeek model has been successfully deployed with Ollama, as described in the deployment instructions above.
Select the LLM model type.
- Model Name: enter the name of the deployed model. The model deployed above is deepseek-r1:7b, so enter deepseek-r1:7b.
- Base URL: the address where the Ollama client is running, usually http://your_server_ip:11434. You can verify that this address is reachable as shown below; if you run into connection problems, please refer to the Common Problems section.
- The other options can keep their default values. According to the DeepSeek model description, the maximum generation length is 32,768 tokens.
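Before saving, you can quickly check that the Base URL is reachable from the machine where Dify is running; a healthy Ollama instance answers with a short status message:
curl http://your_server_ip:11434   # expected output: "Ollama is running"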
Building AI Applications
DeepSeek AI Chatbot (simple application)
- Tap "Create a Blank App" on the left side of the Dify platform homepage, select the "Chat Assistant" type of app and name it simply.
- In the upper right corner of the application, select the deepseek-r1:7b model under the Ollama provider.
- Verify that the AI application works by entering content in the preview dialog box. Once a response is generated, the AI application build is complete.
- Tap the Publish button at the top right of the application to get a link to the AI app that you can share with others or embed in another website.
DeepSeek AI Chatflow / Workflow (advanced application)
Chatflow / Workflow apps help you build AI applications with more complex functionality, such as document recognition, image recognition, and speech recognition. For a detailed description, please refer to the Workflow Documentation.
- Tap "Create a Blank App" on the left side of the Dify platform homepage, select a "Chatflow" or "Workflow" type app and name it simply.
- Add an LLM node, select the deepseek-r1:7b model under the Ollama provider, and insert the {{#sys.query#}} variable into the system prompt to connect it to the Start node (see the example prompt after this list).
- Add an End node to complete the configuration. You can enter content in the preview box for testing; once a response is generated, the AI application build is complete.
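As an illustration, a minimal system prompt for the LLM node might look like the following; the wording is only an example, and the {{#sys.query#}} variable is what passes the user's input from the Start node to the model:
You are a helpful assistant. Answer the following question concisely:
{{#sys.query#}}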
Common Problems
1. Connection errors during Docker deployment
When deploying Dify and Ollama with Docker, the following errors may be encountered:
HTTPConnectionPool(host='127.0.0.1', port=11434): Max retries exceeded with url: /api/chat (Caused by NewConnectionError('<urllib3.connection.HTTPConnection object>: Failed to establish a new connection: [Errno 111] Connection refused'))
HTTPConnectionPool(host='localhost', port=11434): Max retries exceeded with url: /api/chat (Caused by NewConnectionError('<urllib3.connection.HTTPConnection object>: Failed to establish a new connection: [Errno 111] Connection refused'))
Cause of error: this error occurs because the Ollama service is not accessible from inside the Docker container. localhost usually points to the container itself, not the host machine or another container. To resolve this issue, you need to expose the Ollama service to the network.
macOS environment configuration method:
If Ollama is running as a macOS application, you need to set the environment variable using launchctl:
- Set the environment variable by calling launchctl setenv:
launchctl setenv OLLAMA_HOST "0.0.0.0"
- Restart the Ollama application.
- If the above steps do not work, use the following method instead: inside Docker you should connect to host.docker.internal to reach the Docker host, so replace localhost with host.docker.internal in the service address for it to take effect:
http://host.docker.internal:11434
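You can check whether containers can now reach Ollama through host.docker.internal by running a quick test from a throwaway container; this sketch assumes Docker Desktop on macOS, where host.docker.internal resolves inside containers, and uses the public curlimages/curl image:
docker run --rm curlimages/curl http://host.docker.internal:11434   # expect "Ollama is running" if the host binding works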
Linux environment configuration method:
If Ollama is running as a systemd service, you should use systemctl to set the environment variable:
- Run systemctl edit ollama.service to edit the systemd service. This will open an editor.
- For each environment variable, add an Environment line under the [Service] section:
[Service]
Environment="OLLAMA_HOST=0.0.0.0"
- Save and exit.
- Reload systemd and restart Ollama:
systemctl daemon-reload
systemctl restart ollama
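To confirm the new binding took effect, you can check which address the Ollama port is listening on (ss is part of iproute2 and available on most distributions):
ss -tlnp | grep 11434   # should now show 0.0.0.0:11434 or *:11434 instead of 127.0.0.1:11434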
Windows environment configuration method:
On Windows, Ollama inherits your user and system environment variables.
- First quit Ollama by clicking its icon in the taskbar.
- Edit the system environment variables from the Control Panel.
- Edit or create new variables for your user account, such as OLLAMA_HOST, OLLAMA_MODELS, etc.
- Click OK / Apply to save.
- Run ollama in a new terminal window.
2. How do I change the Ollama service address and port number?
Ollama binds to 127.0.0.1 on port 11434 by default. You can change the binding address via the OLLAMA_HOST environment variable.
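OLLAMA_HOST also accepts a host:port value, so you can move the service to a different port as well. A minimal sketch (port 8080 is just an example; remember to update the Base URL in Dify's model settings accordingly):
OLLAMA_HOST=0.0.0.0:8080 ollama serve   # bind to all interfaces on port 8080 instead of the default 127.0.0.1:11434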