LangBot: open source large model instant messaging robot, support for multiple WeChat, QQ, Flybook and other multi-platform deployment of AI robots
LangBot is a large model-based instant messaging bot platform that supports multiple messaging platforms and large models. The platform adapts to QQ, WeChat (enterprise WeChat, personal WeChat), Flybook, Discord, OneBot and other messaging platforms, and supports Open...
zChunk: a generic semantic chunking strategy based on Llama-70B
Comprehensive Introduction zChunk is a novel chunking strategy developed by ZeroEntropy that aims to provide a solution for generic semantic chunking. The strategy is based on the Llama-70B model, which optimizes the chunking process of documents by prompting for chunks to be generated, ensuring that information retrieval is maintained at a high...
Hibiki: a real-time speech translation model, streaming translation that preserves the characteristics of the original voice
General Introduction Hibiki is a high-fidelity real-time speech translation model developed by Kyutai Labs. Unlike traditional offline translation, Hibiki is able to generate natural speech translation in the target language and provide text translation in real time while the user is speaking. The model...
Qwen4Mac: Use Qwen's big models in the Mac menu bar to have conversations on the go!
General Introduction Qwen4Mac is an open source project designed to integrate the Qwen Large Language Model (LLM) into the Mac's menu bar, making it easy for users to call and use at any time. The project is developed and maintained by andreaturchet and provides an easy way for users to...
Pocket AI: offline AI assistant running in your phone, adapted for DeepSeek-R1 (5.37GB)
General Introduction Pocket AI (PocketPal AI Chinese version) is a powerful offline AI assistant designed to allow users to talk to AI anytime, anywhere. The project is based on Small Language Models (SLMs) and runs on cell phones without internet connection, especially adapted to Chinese user experience. Mouth...
Kokoro WebGPU: A Text-to-Speech Service for Offline Operation in Browsers
General Introduction Kokoro WebGPU is a WebGPU version of the Kokoro text-to-speech (TTS) model, provided by WebML Community on the Hugging Face platform. The project utilizes WebGPU technology to enable users to...
JustCMS: AI-powered headless content management system that uses AI to create content quickly (paid)
General Introduction JustCMS is an innovative content management system designed for busy content creators. It utilizes Artificial Intelligence technology to support every step of the process from content ideation to publishing.JustCMS utilizes a headless architecture to ensure speed and flexibility in content delivery. Users can...
Windsurf Next is released, get a sneak peek at Windsurf's latest features!
Windsurf is releasing a preview version called Windsurf Next, which is intended for users who want to get a taste of the latest features, even if they are not quite perfect and may still have some minor issues that need to be addressed in the official W...
DeepSeek R1 vs o3-mini: who is the most cost-effective inference model for 2025?
OpenAI o3-mini vs DeepSeek R1: An in-depth comparison of advanced AI inference models to understand the key differences between the two inference models. With the Artificial Intelligence (AI) tech landscape changing rapidly, inference models have become a focal point of technological innovation...
An in-depth look at Titans: the path to convergence of long-time memory and efficient sequence modeling
Titans: Learning to Memorize at Test Time Original article: https://arxiv.org/pdf/2501.00663v1 Titans Architecture Unofficial implementation: htt...









