AI Engineering Academy: 2.18Vision RAG Visual Capabilities
Notes: https://colab.research.google.com/github/run-llama/llama_index/blob/main/docs/docs/examples/multi...
NuxtHub: deploy 5 Nuxt apps for free based on Cloudflare
General Introduction NuxtHub is a cloud platform designed to simplify and optimize the deployment and scaling of Nuxt applications. By leveraging Cloudflare's global network, NuxtHub provides a high-performance, low-cost, full-stack solution that supports developers...
Petals: distributed shared GPU running and fine-tuning of large language models, sharing GPU resources like a BitTorrent network
General Introduction Petals is an open source project developed by the BigScience Workshop to run Large Language Models (LLMs) through a distributed computing approach. Users can run LLMs at home using consumer-grade GPUs or Google Co...
Aphrodite Engine: an efficient LLM inference engine that supports multiple quantization formats and distributed inference.
General Introduction The Aphrodite Engine is the official backend engine for PygmalionAI, designed to provide an inference endpoint for PygmalionAI sites and to support rapid deployment of Hugging Face-compatible models. The engine utilizes vLLM's p...
Text generation web UI: Gradio-based large language modeling chat interface with support for multiple back-end services
General Introduction Text generation web UI is a Gradio-based web UI designed for the Large Language Model (LLM). It supports a variety of text generation backends, including Transformers, llama.cp...
llama.cpp: efficient inference tool, supports multiple hardware, easy to implement LLM inference
General Introduction llama.cpp is a library implemented in pure C/C++ designed to simplify the inference process for Large Language Models (LLMs). It supports a wide range of hardware platforms, including Apple Silicon, NVIDIA GPUs, and AMD GPUs, and provides a variety of quant...
Jan: Open Source Offline AI Assistant, ChatGPT Replacement, Run Local AI Models or Connect to Cloud AI
General Description Jan is an open source ChatGPT replacement that runs 100% offline on the user's device. It is driven by a Cortex engine and supports a wide range of hardware platforms, including NVIDIA GPUs and Apple M-series chips...
AyeSoul: minimalist AI real-time search engine that thinks and calls 50+ tools to assist with various tasks
General Introduction AyeSoul is a unified AI search, answer and task engine designed to help users accomplish a variety of everyday tasks through a simple interface. Whether it's web search, deep research, brainstorming, creative writing, content creation, or programming, AyeSoul can...
Komo: quickly search for information to generate structured answers, explore more search results
General Introduction Komo is an AI-powered search engine designed to provide a fast, private and ad-free search experience. Users can use Komo to explore in depth, get instant answers, and discuss a variety of topics. Its main features include search, explore and chat to help users efficiently...
Morphic: AI-powered open-source search engine that offers smart Q&A, video search, and generates UI code
General Introduction Morphic is a search engine based on AI technology with a generative user interface designed to provide intelligent Q&A and an efficient search experience. Users can perform a variety of searches with Morphic, including text, video, etc., and can save search history and share search results.Mo...









