AI Sharing Circle

AI is changing the world!
Youtu-Embedding - 腾讯优图开源的通用文本表示模型

Youtu-Embedding - Tencent Youtu open source generalized text representation model

Youtu-Embedding is a generalized text representation model open-sourced by Tencent's Youtu Lab, designed for enterprise-level applications. Through deep neural networks to map the text to a high-dimensional vector space, so that semantically similar sentences are closer in that space, to achieve accurate semantic retrieval.
6mos ago
033.9K
SAIL-VL2 - 字节跳动开源的多模态视觉语言模型

SAIL-VL2 - ByteHop's open source multimodal visual language model

SAIL-VL2 is an open source multimodal visual language model by the Byte Jump team, focusing on joint modeling of multimodal inputs such as images and text. Using the sparse mixture of experts (MoE) architecture and progressive training strategy, it achieves high performance at parameter scales from 2B to 8B, especially in the areas of graphic comprehension, math...
6mos ago
027.1K
MineContext - 字节开源的主动式上下文感知AI伙伴

MineContext - Bytes Open Source Active Context-Aware AI Partner

MineContext is an active context-aware AI partner open-sourced by the ByteDance Viking team to help users efficiently manage massive amounts of information and improve the efficiency of knowledge work. Over the screenshot and content understanding technology, automatically record the user's daily operations (such as browsing the web, editing documents, etc.), support...
6mos ago
048K
nanochat - Karpathy免费开源的低成本模型训练项目

nanochat - Karpathy's free and open source low-cost model training program

nanochat is an open source project released by AI legend and former Tesla AI Director Andrej Karpathy that allows individuals to quickly train a small ChatGPT-like language model at a very low cost and simplicity. The entire project uses only about 800...
6mos ago
033.6K
LLaVA-OneVision-1.5 - 免费开源的多模态模型,高性能多模态理解

LLaVA-OneVision-1.5 - Free and open source multimodal modeling, high performance multimodal understanding

LLaVA-OneVision-1.5 is an open-source multimodal model by the EvolvingLMMS-Lab team, using 8B parameter scale, through a compact three-phase training process (language-image alignment, conceptual equalization and knowledge injection, and instruction fine-tuning) on 128 A800...
6mos ago
032K
Paper2Video - 新加坡国立开源的学术论文自动生成演示视频项目

Paper2Video - NUS open source project to automatically generate demo videos for academic papers

Paper2Video is an open-source presentation video project for automatic generation of academic papers by Show Lab at National University of Singapore. Using the PaperTalker multi-intelligence framework, papers are transformed into full presentation videos containing slides, subtitles, voiceover and speaker avatar...
6mos ago
034.3K
NeuTTS Air - 支持离线CPU运行的免费轻量级语音合成模型

NeuTTS Air - Free and Lightweight Speech Synthesis Model with Offline CPU Running Support

NeuTTS Air is open source lightweight speech synthesis model, developed by Neuphonic team, which can run in real time on local devices (e.g. cell phones, laptops, Raspberry Pi) without relying on the cloud. Using 0.5B parameter Qwen architecture and self-developed NeuCodec codec...
6mos ago
040.1K
KAT-Dev-72B-Exp - 快手开源的免费编程专用模型

KAT-Dev-72B-Exp - Racer open source free programming-specific models

KAT-Dev-72B-Exp is an open-source programming-specific large language model launched by the Racer team, optimized based on reinforcement learning technology, which achieved an accuracy rate of 74.6% in the SWE-Bench Verified benchmark test, the best performance of any open-source model at present. The model uses innovative...
6mos ago
031.4K
Jamba Reasoning 3B - 以色列AI21 Labs开源的轻量级推理模型

Jamba Reasoning 3B - Israel AI21 Labs open source lightweight reasoning model

Jamba Reasoning 3B is a lightweight inference model open-sourced by Israeli AI startup AI21 Labs with strong performance and potential for a wide range of applications. It utilizes a hybrid SSM-Transformer architecture that combines Trans...
6mos ago
028.8K
吴恩达的《Agentic AI》最新智能体免费课程

Free Course on the Latest Intelligentsia from Agentic AI by Ernest Ng

Agentic AI is the newest course on intelligent bodies launched by Ernest Ng.The course focuses on the design and construction of intelligent bodies, covering the four major design patterns of reflection, tool use, planning, and multi-intelligent body collaboration. Learners will master how to make intelligent bodies check outputs, autonomously adjust through theoretical explanations and code practice...
6mos ago
053.8K