General Introduction
Exo is an open source project for running your own AI cluster on everyday devices (e.g. iPhone, iPad, Android, Mac, Linux). Through dynamic model partitioning and automatic device discovery, exo unifies multiple devices into a single powerful GPU that supports models such as LLaMA, Mistral, LLaVA, Qwen, and DeepSeek. Exo also provides a ChatGPT-compatible API that lets users easily run models on their own hardware.
Feature List
- Broad model support: runs models such as LLaMA, Mistral, LLaVA, Qwen, and DeepSeek.
- Dynamic model partitioning: optimizes how a model is split based on the current network topology and available device resources.
- Automatic device discovery: discovers other devices without any manual configuration.
- ChatGPT-compatible API: provides a ChatGPT-compatible API for running models on your own hardware.
- Device equality: devices connect to each other peer-to-peer rather than through a master-slave architecture.
- Multiple partitioning strategies: supports several strategies, such as ring memory-weighted partitioning.
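The feature list mentions ring memory-weighted partitioning. As a rough illustration of the idea only (a sketch, not exo's actual implementation), the following splits a model's contiguous layers across devices in proportion to each device's available memory:

```python
def ring_memory_weighted_partition(num_layers: int, memories: list[int]) -> list[range]:
    """Assign contiguous layer ranges to devices in proportion to memory.

    Sketch of the memory-weighted partitioning idea: a device with twice
    the memory receives roughly twice the layers. Not exo's actual code.
    """
    total = sum(memories)
    bounds = [0]
    acc = 0
    for mem in memories:
        acc += mem
        # Cumulative share of memory determines the cumulative layer boundary.
        bounds.append(round(num_layers * acc / total))
    return [range(bounds[i], bounds[i + 1]) for i in range(len(memories))]


# Example: 32 layers across devices with 16 GB, 8 GB, and 8 GB of memory.
parts = ring_memory_weighted_partition(32, [16, 8, 8])
```

In a ring topology, each device would then run its layer range and pass activations to the next device in the ring.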
Usage Guide
Installation process
- Prerequisites:
  - Make sure your Python version is >= 3.12.0.
  - On Linux with an NVIDIA GPU, install the NVIDIA driver, CUDA toolkit, and cuDNN library.
- Install from source:
  - Clone the project:
    git clone https://github.com/exo-explore/exo.git
  - Enter the project directory:
    cd exo
  - Install the dependencies:
    pip install -e .
  - Or install in a virtual environment:
    source install.sh
Operation flow
- Running a model:
  - Run on multiple macOS devices:
    - Device 1:
      exo
    - Device 2:
      exo
    Exo automatically discovers the other devices and starts a ChatGPT-like WebUI (powered by tinygrad tinychat) at http://localhost:52415.
  - Run on a single device:
    - Use the command:
      exo run llama-3.2-3b
    - Use a custom prompt:
      exo run llama-3.2-3b --prompt "What is the meaning of exo?"
- Model storage:
  - By default, models are stored in ~/.cache/huggingface/hub.
  - Set the HF_HOME environment variable to change the model storage location.
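As an illustration of how the storage location is resolved, the sketch below mimics the HF_HOME lookup (the exact precedence used by Hugging Face tooling may differ; `model_cache_dir` is a hypothetical helper, not part of exo):

```python
import os
from pathlib import Path


def model_cache_dir(env=os.environ) -> Path:
    """Resolve the model cache directory, honoring HF_HOME if set.

    Sketch of the lookup: falls back to ~/.cache/huggingface when
    HF_HOME is absent, then appends the "hub" subdirectory.
    """
    hf_home = env.get("HF_HOME", str(Path.home() / ".cache" / "huggingface"))
    return Path(hf_home) / "hub"
```

For example, launching with `HF_HOME=/mnt/models exo` would place downloaded models under /mnt/models/hub.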
- Debugging:
  - Use the DEBUG environment variable (0-9) to enable debug logging:
    DEBUG=9 exo
  - For the tinygrad inference engine, use the separate TINYGRAD_DEBUG flag (1-6):
    TINYGRAD_DEBUG=2 exo
- Formatting code:
  - exo uses yapf for code formatting.
  - Install the formatting requirements:
    pip3 install -e '.[formatting]'
  - Run the formatting script:
    python3 format.py ./exo
Usage
- Start exo:
  exo
  Exo automatically discovers and connects to other devices without additional configuration.
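Exo's actual discovery protocol is not described here; as a generic illustration of configuration-free discovery on a local network, the sketch below sends and receives a small UDP presence message (`announce`, `listen_once`, and the port number are all hypothetical, not exo's API):

```python
import json
import socket

DISCOVERY_PORT = 54545  # arbitrary example port, not exo's actual port


def announce(name: str, host: str = "255.255.255.255", port: int = DISCOVERY_PORT) -> None:
    """Broadcast a small JSON presence message so peers can find this node."""
    msg = json.dumps({"node": name}).encode()
    with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as s:
        s.setsockopt(socket.SOL_SOCKET, socket.SO_BROADCAST, 1)
        s.sendto(msg, (host, port))


def listen_once(port: int = DISCOVERY_PORT, timeout: float = 5.0) -> dict:
    """Wait for one presence announcement and return the decoded payload."""
    with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as s:
        s.setsockopt(socket.SOL_SOCKET, socket.SO_REUSEADDR, 1)
        s.bind(("", port))
        s.settimeout(timeout)
        data, _addr = s.recvfrom(1024)
    return json.loads(data)
```

A real discovery loop would announce periodically and maintain a table of peers seen recently, dropping nodes whose announcements stop arriving.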
- Run a model:
  - Use the default model:
    exo run llama-3.2-3b
  - Use a custom prompt:
    exo run llama-3.2-3b --prompt "What is the meaning of EXO?"
- API usage example:
  - Send a request:
    curl http://localhost:52415/v1/chat/completions \
      -H "Content-Type: application/json" \
      -d '{
        "model": "llama-3.2-3b",
        "messages": [{"role": "user", "content": "What is the meaning of EXO?"}],
        "temperature": 0.7
      }'
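The same request can be sent from Python using only the standard library. This is a minimal sketch that assumes an exo node is already serving on localhost:52415; `build_chat_request` and `chat` are illustrative helpers, not part of exo:

```python
import json
import urllib.request

API_URL = "http://localhost:52415/v1/chat/completions"


def build_chat_request(model: str, prompt: str, temperature: float = 0.7) -> dict:
    """Build a ChatGPT-style request body for the compatible endpoint."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "temperature": temperature,
    }


def chat(model: str, prompt: str) -> str:
    """POST a chat request to a running exo node and return the reply text."""
    body = json.dumps(build_chat_request(model, prompt)).encode()
    req = urllib.request.Request(
        API_URL, data=body, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(req) as resp:
        data = json.load(resp)
    # ChatGPT-compatible responses carry the reply under choices[0].message.
    return data["choices"][0]["message"]["content"]
```

Because the API follows the ChatGPT wire format, existing OpenAI-compatible client libraries should also work by pointing their base URL at the local node.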
Performance optimization
- macOS users:
  - Upgrade to the latest version of macOS.
  - Run ./configure_mlx.sh to optimize GPU memory allocation.
Common problems
- SSL errors: on some macOS/Python versions, certificates are not installed correctly. Run the following command to fix it:
  /Applications/Python 3.x/Install Certificates.command
- Debug logging: enable debug logging with:
  DEBUG=9 exo