Latest AI Resources

Total: 2832 articles
Fay Digital Human Framework: integrates language models and 3D digital characters for multiple application scenarios

Fay is an open source 3D virtual digital human framework that integrates language models and digital characters for a variety of application scenarios, such as virtual shopping guides, virtual anchors, assistants, waiters, teachers, and voice- or text-based mobile assistants. The Fay framework supports full offline use, providing m...
11mos ago
035.9K
5ire: a cross-platform desktop client for large models with local vector knowledge base support

5ire is an open source, cross-platform desktop client for large models, designed to give users convenient local vector knowledge base management and model interaction. The software supports parsing and vectorized storage of multiple document formats, with powerful retrieval-augmented generation (RAG) capabilities. In addition, 5i...
1yr ago
035.9K
Lingyu Intelligence: commercial digital human livestreaming service provider | Feiying (飞影) Digital Human | digital human livestream selling

Lingyu Intelligence is a Chinese company specializing in AI products and technologies. It creates intelligent digital humans with "souls" and uses AI to analyze data, helping individuals, entrepreneurs, and businesses grow their revenue. The company has launched a new product for virtual live streaming and interactive sales promotion...
1yr ago
035.8K
PicWish (佐糖): online image processing tools for one-click background removal, watermark removal, photo restoration, and portrait editing

PicWish (佐糖) is an AI image processing platform that provides a wealth of online photo editing tools and works on all platforms. Users can easily perform one-click background removal, watermark removal, blurry photo sharpening, lossless enlargement, image cropping, image compression and black and white photo...
12mos ago
035.8K
HealthGPT: a large medical model supporting medical image analysis and diagnostic Q&A

HealthGPT is a state-of-the-art medical large vision-language model designed to unify medical visual understanding and generation through heterogeneous knowledge adaptation. The goal of the project is to integrate medical visual understanding and generation capabilities into a unified autoregressive framework that significantly improves the medical graph...
9mos ago
035.8K
AI Podcast Generator: automatically scrapes news to generate audio podcasts

AI Podcast Generator is a podcast generation tool that uses AI to automatically create engaging audio content from web sources. The system captures news content and converts it into audio podcasts with naturally flowing narration. The project is based on Next...
1yr ago
035.7K
HivisionIDPhotos: an open source AI ID photo creation tool

HivisionIDPhotos is an open source, lightweight AI ID photo tool that intelligently recognizes the scene in a user's photo, cuts out the subject, and generates standard ID photos that meet a variety of specifications. The tool supports custom background colors and sizes, and future versions will also introduce beauty and...
1yr ago
035.7K
Pen Grid Design (笔格设计): an online image editor with free image generation tools for easily creating beautiful pictures

Pen Grid Design is a website that provides online image editing and design services. Users can easily create and edit all kinds of visuals on the platform, including posters, PPT slides, and GIFs. Pen Grid Design provides a wealth of design materials and templates, and supports AI smart tools, such as AI image generation, A...
11mos ago
035.7K
Newsful: an AI-based financial news summary site

Newsful is an online platform that uses artificial intelligence to provide financial news services, focusing on real-time aggregation of corporate news and market developments from around the world. The site uses natural language processing (NLP) and machine learning technologies to extract information from multiple media sources for the use...
9mos ago
035.7K
TicNote - AI recording device from Mobvoi

TicNote is an AI voice recorder launched by Mobvoi (出门问问). It combines Agentic AI hardware and software and is positioned as a "portable AI thinking partner". It has a thin, lightweight card-style design and comes with a magnetic protective case, so it can be easily carried or attached to the back of a phone.
5mos ago
035.6K
Knowledge Table: an open source tool for efficient extraction and exploration of structured data

Knowledge Table is an open source project designed to simplify the process of extracting and exploring structured data from unstructured documents. Users can create structured knowledge representations such as tables and graphs through a natural language query interface. The tool supports customizing the extraction ...
1yr ago
035.6K
Leffa: high-fidelity virtual try-on and pose adjustment, Meta's open source controllable person image generation model

Leffa is a unified framework for controllable person image generation, enabling precise manipulation of a person's appearance (e.g., virtual try-on) and pose (e.g., pose transfer). The framework significantly reduces distortion of fine-grained details by guiding the target query to attend to the correct reference key in the attention layer, with ...
12mos ago
035.6K
Genspark AI - AI browser from Genspark

Genspark AI is an AI browser from Genspark, Inc. It comes with a built-in intelligent assistant that helps users find better deals, compare product prices, and analyze user reviews while shopping, so they can make smarter purchasing decisions....
6mos ago
035.6K
TankWork: an agent that operates computers via voice and text and provides real-time voice feedback

TankWork is an open source desktop agent framework designed to enable AI to perceive and control your computer through computer vision and system-level interaction. The framework allows agents to directly control computers through voice and text commands, process real-time screen content, and provide continuous audio visual...
10mos ago
035.6K
MindSearch: an open source AI search engine framework for deploying your own Perplexity-style search engine

MindSearch is an open source AI search engine framework launched by Shanghai Artificial Intelligence Laboratory (SAL), aiming to simulate the human thought process for complex information gathering and integration. The tool combines large language models (LLMs) with search engines through multi-agent...
11mos ago
035.6K
Fish Audio - AI speech synthesis and voice cloning tool

Fish Audio is a generative AI speech synthesis tool that supports text-to-speech (TTS) and voice cloning. Users simply enter text and the tool converts it into natural, smooth speech. The platform offers multiple languages and voice styles to choose from, to meet different scenarios and user...
6mos ago
035.5K
PromptWizard: an open source framework for optimizing prompt engineering and improving task performance

PromptWizard is an open source framework developed by Microsoft. It uses a self-evolving mechanism in which the model generates, evaluates, and improves prompts and examples on its own, raising output quality through continuous feedback. It can autonomously optimize prompts, generate and select appropriate examples, and...
11mos ago
035.5K
Maxun: an open source no-code platform that automatically scrapes web data and converts it to APIs or spreadsheets

Maxun is an open source no-code web data extraction platform that allows users to train robots in minutes to automatically scrape web data and convert it into APIs or spreadsheets. The platform supports pagination and scrolling, can adapt to changes in website layout, provides powerful data crawling...
11mos ago
035.5K
Moondream: an open source lightweight vision-language model for batch reverse-engineering of image prompts

Moondream is an open source lightweight vision-language model designed to provide image description capabilities through deep learning and computer vision techniques. The model runs efficiently on a variety of platforms and is particularly suitable for edge devices. Moondream uses advanced techniques and...
11mos ago
035.5K
BagelBell: AI text adventure game

BagelBell is an AI character creation and interaction platform owned by ByteDance; BagelBell is its English name overseas. It provides users with a vibrant and creative virtual world in which they can explore stories, create characters, and interact with AI...
1yr ago
035.5K
VideoLingo: an open source tool for word-level timestamped subtitles, subtitle translation, and localized dubbing

VideoLingo is a one-stop video translation and localization dubbing tool designed to generate Netflix-grade, high-quality subtitles, eliminating stiff machine translation and multi-line subtitles, and adding high-quality voiceovers so that knowledge can be shared globally across language barriers. By...
1yr ago
035.4K
Tough Tongue AI: practice interview and workplace communication skills by talking to an AI

Tough Tongue AI is an artificial intelligence platform designed for practicing tough conversations. Users can simulate a variety of complex conversational situations, such as job interviews, salary negotiations, and sales presentations, by selecting preset scenarios or creating custom ones. The platform provides video and...
11mos ago
035.4K
Mistral OCR: 94.89% overall accuracy, 1000 pages/30 seconds, only $1

In the long history of human civilization, every leap in the way information is acquired and parsed has profoundly driven social progress. From ancient hieroglyphics, to portable papyrus, to the later emergence of the printing press and today's wave of digitization, each technological innovation has greatly expanded the paradigm of human knowledge dissemination...
9mos ago
035.4K
shadcn/ui: a platform for building component libraries

shadcn/ui is an open source platform for building component libraries. It provides beautiful, customizable UI components that users can copy and paste into their applications. The platform supports a variety of front-end frameworks and provides detailed installation and usage guidelines to help developers quickly get started...
1yr ago
035.4K