AI Sharing Circle

Day arching a pawn and sharing for the king!

Cosmos: World Base Model, a platform for building AI base models of the physical world

Comprehensive Introduction NVIDIA Cosmos is a world-based modeling platform for developers specifically designed to help physics AI developers build their physics AI systems better and faster. The platform offers a range of pre-trained models, including diffusion and autoregressive world-based...

Latest AI Resources # AI Open Services # AI Java Open Source Projecct

2yrs ago

089.9K

Convert text descriptions or documents into beautiful conceptual diagrams and quickly illustrate PPTs or articles.

I think the probability is that the pictures, tables, flowcharts. Today we recommend a free AI text to visual tool, the texture is a bit cow, it feels like Figma such a big factory texture, simple and powerful. Of course, it can also be a very beautiful and practical notes, documents tool...

AI hands-on tutorials

2yrs ago

081.5K

Mini LLM Flow：使用100行代码构建“有向图结构”的LLM微型智能体

Mini LLM Flow: Building LLM Mini-Intelligents with "Directed Graph Structure" in 100 Lines of Code

General Introduction miniLLMFlow is a minimalist Large Language Model (LLM) development framework that contains only 100 lines of core code, demonstrating the design philosophy of "keeping it simple". The framework is specifically designed to enable AI assistants (e.g. ChatGPT, Claude, etc.) to...

Latest AI Resources # AI Java Open Source Projecct # Intelligent Body Development Framework

2yrs ago

088.7K

GraphReader: Graph-based Intelligents to Enhance Long Text Processing for Large Language Models

GraphReader: a graph-based intelligence that enhances long text processing for large language models Graphic Expert: like a tutor who is good at making mind maps, it transforms lengthy text into a clear knowledge network, so that the AI can easily find each level needed for an answer as if it were exploring along a map...

AI Knowledge Base

2yrs ago

066.5K

WeChat voice messages can be played like this? Even a beginner can use Devbox to easily realize public number voice to text!

Many people would like to use WeChat's voice input directly, it's always faster to speak than to type. Unlike the common .mp3 and .wav formats, WeChat's voice input uses the .amr format by default. The image below shows the webhook received by the developer server from WeChat, indicating that the public...

AI hands-on tutorials

2yrs ago

065.5K

Xiaozhi AI Chatbot: Build your AI chatting companion, easily realize voice conversation and intelligent interaction

Comprehensive Introduction Xiaozhi AI Chatbot is an open source project based on the ESP32 development board, designed to help users build their own AI chat companion. The project was developed by Shrimp and is mainly used for teaching purposes to help more people get started with AI hardware development and to understand how to apply large language models to real...

Latest AI Resources # AI Java Open Source Projecct # Multimodal Real-Time Interactive Products

1yrs ago

0265K

DashInfer-VLM, multimodal SOTA inference performance over vLLM!

Introduction DashInfer-VLM is an inference architecture for visual multimodal large model VLMs, especially optimized for inference acceleration of the Qwen VL model. The biggest difference between DashInfer-VLM and other inference acceleration frameworks for VLMs is that it puts the VIT part of...

AI News

2yrs ago

076.8K

Converting a document describing a business process into a business process diagram: an example of a document for coaching a company to go public

Someone in the group asked: which senior knows which ai can draw the flowchart of listed company information? Guess it is listing counseling documents related to the process, in fact, do not need any tools, as long as you can draw a sample flowchart, so that the big model to generate SVG code can be, of course, Mermaid syntax can be...

AI hands-on tutorials

2yrs ago

070.5K

OpenAI Realtime API Next.js：构建实时语音对话AI应用的Next.js模板

OpenAI Realtime API Next.js: a Next.js template for building real-time voice conversation AI applications

Comprehensive introduction OpenAI Realtime API Next.js is an open source project based on the Next.js framework , designed to help developers quickly build real-time voice AI applications . The project integrates OpenAI's real-time API and WebRTC technology...

Latest AI Resources # AI Java Open Source Projecct # Multimodal Real-Time Interactive Products

2yrs ago

078.1K

Film-Scan-Converter: Scanning of RAW image format film for conversion to finished images

General Description Film-Scan-Converter is an open source Python script specialized in processing RAW film scans taken by digital cameras. The script is capable of converting film scans in RAW format into final usable images for photography enthusiasts and...

Latest AI Resources # AI Image Style Control

2yrs ago

083.1K

Loading more