AI Sharing Circle

Day arching a pawn and sharing for the king!
Cosmos:世界基础模型,构建物理世界的人工智能基础模型平台

Cosmos: World Base Model, a platform for building AI base models of the physical world

Comprehensive Introduction NVIDIA Cosmos is a world-based modeling platform for developers specifically designed to help physics AI developers build their physics AI systems better and faster. The platform offers a range of pre-trained models, including diffusion and autoregressive world-based...
1yrs ago
060.4K
将文字描述或文档转换为美观的概念示意图,快速为PPT或文章配图

Convert text descriptions or documents into beautiful conceptual diagrams and quickly illustrate PPTs or articles.

I think the probability is that the pictures, tables, flowcharts. Today we recommend a free AI text to visual tool, the texture is a bit cow, it feels like Figma such a big factory texture, simple and powerful. Of course, it can also be a very beautiful and practical notes, documents tool...
1yrs ago
056.2K
Mini LLM Flow:使用100行代码构建“有向图结构”的LLM微型智能体

Mini LLM Flow: Building LLM Mini-Intelligents with "Directed Graph Structure" in 100 Lines of Code

General Introduction miniLLMFlow is a minimalist Large Language Model (LLM) development framework that contains only 100 lines of core code, demonstrating the design philosophy of "keeping it simple". The framework is specifically designed to enable AI assistants (e.g. ChatGPT, Claude, etc.) to...
1yrs ago
057.8K
GraphReader:基于图的智能体,增强大型语言模型的长文本处理能力

GraphReader: Graph-based Intelligents to Enhance Long Text Processing for Large Language Models

GraphReader: a graph-based intelligence that enhances long text processing for large language models Graphic Expert: like a tutor who is good at making mind maps, it transforms lengthy text into a clear knowledge network, so that the AI can easily find each level needed for an answer as if it were exploring along a map...
1yrs ago
046.8K
微信语音消息还能这样玩?小白也能用 Devbox 轻松实现公众号语音转文字!

WeChat voice messages can be played like this? Even a beginner can use Devbox to easily realize public number voice to text!

Many people would like to use WeChat's voice input directly, it's always faster to speak than to type. Unlike the common .mp3 and .wav formats, WeChat's voice input uses the .amr format by default. The image below shows the webhook received by the developer server from WeChat, indicating that the public...
1yrs ago
044.4K
小智 AI 聊天机器人:打造你的AI聊天伴侣,轻松实现语音对话和智能互动

Xiaozhi AI Chatbot: Build your AI chatting companion, easily realize voice conversation and intelligent interaction

Comprehensive Introduction Xiaozhi AI Chatbot is an open source project based on the ESP32 development board, designed to help users build their own AI chat companion. The project was developed by Shrimp and is mainly used for teaching purposes to help more people get started with AI hardware development and to understand how to apply large language models to real...
1yrs ago
0218.1K
DashInfer-VLM,多模态SOTA推理性能,超vLLM!

DashInfer-VLM, multimodal SOTA inference performance over vLLM!

Introduction DashInfer-VLM is an inference architecture for visual multimodal large model VLMs, especially optimized for inference acceleration of the Qwen VL model. The biggest difference between DashInfer-VLM and other inference acceleration frameworks for VLMs is that it puts the VIT part of...
1yrs ago
055.4K
将描述业务流程的文档转换为业务流程图:以辅导公司上市文档为例

Converting a document describing a business process into a business process diagram: an example of a document for coaching a company to go public

Someone in the group asked: which senior knows which ai can draw the flowchart of listed company information? Guess it is listing counseling documents related to the process, in fact, do not need any tools, as long as you can draw a sample flowchart, so that the big model to generate SVG code can be, of course, Mermaid syntax can be...
1yrs ago
051.5K
OpenAI Realtime API Next.js:构建实时语音对话AI应用的Next.js模板

OpenAI Realtime API Next.js: a Next.js template for building real-time voice conversation AI applications

Comprehensive introduction OpenAI Realtime API Next.js is an open source project based on the Next.js framework , designed to help developers quickly build real-time voice AI applications . The project integrates OpenAI's real-time API and WebRTC technology...
1yrs ago
049.7K
Film-Scan-Converter:扫描RAW图像格式的胶片转换为成品图像

Film-Scan-Converter: Scanning of RAW image format film for conversion to finished images

General Description Film-Scan-Converter is an open source Python script specialized in processing RAW film scans taken by digital cameras. The script is capable of converting film scans in RAW format into final usable images for photography enthusiasts and...
1yrs ago
055.9K