Latest AI Resources

Total 2617 articles posts
合同嗖嗖:对话式生成AI智能合同,一键获取专业法律文书合同模板

Contract Whoosh: Conversationally generated AI smart contracts, one-click access to professional legal documents contract templates

Comprehensive Introduction Contract Whoosh is a revolutionary AI intelligent contract generation platform that adopts conversational interaction, allowing users to obtain professional contract documents through simple dialog. Relying on advanced artificial intelligence technology, the platform integrates a huge amount of contract template resources, and is able to intelligently...
8mos ago
03.3K
IMS Toucan:快速可控的多语言(支持7000+语言)文本转语音工具

IMS Toucan: Fast and Controllable Multilingual (7000+ languages supported) Text-to-Speech Tool

General Introduction IMS Toucan is a state-of-the-art text-to-speech (TTS) toolkit developed by the Institute for Natural Language Processing (IMS) at the University of Stuttgart, Germany. The toolkit supports more than 7000 languages and is characterized by fast, controllable and low computational resource requirements.IMS...
6mos ago
03.3K
文心智能体平台:建立在完整分发渠道和商业闭环的智能体应用

Wenxin Intelligent Body Platform: Intelligent Body Applications Built on Complete Distribution Channels and Commercial Closures

Introduction Wenxin Intelligent Body Platform AgentBuilder is a Baidu launched based on the Wenxin large model of the intelligent body (Agent) platform, to support the majority of developers in accordance with their own industry sectors, application scenarios, to select different types of development, to create a large model of the era of product capabilities. Developers can ...
5mos ago
03.3K
蝉镜:数字人视频创作平台,拥有数百款数字人模板以及克隆专属数字人形象(付费)

Cicada Mirror: digital human video creation platform with hundreds of digital human templates and cloning of exclusive digital human images (paid)

General Introduction Cicada is a platform focusing on digital human video creation, utilizing AI technology to simplify the video production process. Users can choose different digital human images, input copy and generate videos with multi-language voiceovers. The platform provides a rich library of templates and materials, which are suitable for a variety of fields such as advertising and marketing, education and training...
9mos ago
03.3K
Flair:AI生成专业摄影效果的商品展示图,产品商拍专用工具

Flair: AI generates professional photographic effect of the product display map, product commercial photography special tools

Comprehensive Introduction Flair is an AI-based online design tool focused on generating high-quality photographic images for e-commerce products. Users can quickly create realistic product scene images through drag-and-drop operations, which greatly improves design efficiency. The platform provides a wealth of templates and 3D elements to support real...
9mos ago
03.3K
紫东太初:多模态大模型平台,支持文本创作、图像生成、3D理解、信号分析等任务

Zidong Taichu: Multi-modal large model platform supporting text creation, image generation, 3D understanding, signal analysis and other tasks

Comprehensive Introduction Zidong Taichu is a new-generation multimodal big model platform launched by the Institute of Automation of the Chinese Academy of Sciences and the Wuhan Institute of Artificial Intelligence. The platform supports multiple tasks such as multi-round question and answer, text creation, image generation, 3D understanding and signal analysis, with powerful cognitive, understanding and creation capabilities. Zidong ...
10mos ago
03.3K
MegaParse:解析各类型文档为LLM可用数据,完整保留文档中的表格、图片等所有信息

MegaParse: parses all types of documents into LLM-available data, preserving all information in the document such as tables, pictures, etc. in its entirety

Comprehensive Introduction MegaParse is a powerful and versatile document parsing tool designed to optimize data processing for the Large Language Model (LLM). Whether you are working with text, PDF, PowerPoint presentations or Word documents, MegaParse...
8mos ago
03.3K
Kolors:生成高质量图像的文本到图像模型,支持生成中文海报

Kolors: text-to-image model for generating high-quality images, support for generating Chinese posters

Comprehensive Introduction Kolors is a large-scale text-to-image generation model developed by the Racer team, based on potential diffusion techniques. The model is trained on billions of text-image data pairs, and is capable of generating high-quality, complex semantically accurate images with support for both Chinese and English input.Kolors in visual quality...
8mos ago
03.3K
海绵音乐:智能AI音乐创作平台,文字和图片生成音乐

Sponge Music: Intelligent AI music creation platform, text and image generated music

General Introduction SpongeBob Music is a music creation platform based on artificial intelligence technology. Users only need to enter a sentence of inspiration or upload a picture to generate an exclusive piece of music. The platform provides a variety of music styles and creation tools to help users easily create high-quality music. Whether you are a professional musician or...
10mos ago
03.3K
HYPIR - 中国科学院团队推出的新型图像复原大模型

HYPIR - A new large model for image restoration introduced by a team from the Chinese Academy of Sciences

HYPIR is a large model for image restoration introduced by Dong Chao's team at Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences. The model combines the fractional prior of diffusion modeling with adversarial generative networks to achieve efficient, high-quality image restoration.HYPIR can quickly restore old photos and improve resolution while keeping text clear...
2wks ago
03.3K
PantoMatrix(EMAGE):全身手势生成框架,从音频生成全身手势的3D动画框架

PantoMatrix (EMAGE): full-body gesture generation framework, 3D animation framework for generating full-body gestures from audio

Comprehensive Introduction PantoMatrix is an advanced full-body gesture generation framework capable of generating complete human movements from audio and partial gestures, including face, partial body, hand and full-body movements. The framework utilizes the latest multimodal datasets and deep learning techniques to provide high-quality 3D...
9mos ago
03.3K
Dream API:oneapi/newapi中转API,针对个人用户提供免费公益API

Dream API: oneapi/newapi transit API, providing free public service API for individual users.

Introduction After recommending many free large model API services in the Chief AI Sharing Circle, I suddenly found an important issue: there are official free small size scales; there are reverse API models; but there has been no free "official conversion" API. The reason why it has not been recommended is that the free "official conversion" API is not available. The reason why I haven't recommended it is that free "official conversion" is not available for large models.
9mos ago
03.3K
WeClone:用微信聊天记录和语音训练数字分身

WeClone: training digital doppelgangers with WeChat chats and voices

Comprehensive introduction WeClone is an open source project that uses WeChat chat logs and voice messages, combined with large language models and speech synthesis technology, to allow users to create personalized digital doppelgangers. The project can analyze the user's chat habits to train the model , but also a small number of voice samples to generate realistic sound...
4mos ago
03.3K
SystoByte:编程系统设计练习平台,提供实时AI反馈,提升面试技能

SystoByte: a programming system design practice platform that provides real-time AI feedback to improve interview skills

General Introduction SystoByte is a platform built for system design practice, designed to help users improve their system design skills, especially in interview preparation. The platform provides a rich library of system design questions that users can design through an intuitive interface and get instant access to AI-generated...
8mos ago
03.3K
Sana:快速生成高分辨率图像,0.6B超小尺寸模型,低配笔记本GPU运行

Sana: fast generation of high-resolution images, 0.6B ultra-small size model, low-profile laptop GPU operation

General Introduction Sana is an efficient high-resolution image generation framework developed by NVIDIA Labs, capable of generating images up to 4096 × 4096 resolution in a matter of seconds.Sana utilizes a linear diffusion transformer and deep compression self-encoder technology to significantly...
9mos ago
03.3K
CR-Mentor:知识库+LLM 驱动的GitHub智能代码审查导师

CR-Mentor: Knowledge Base + LLM Driven Intelligent Code Review Mentor for GitHub

Comprehensive Introduction CR-Mentor is an intelligent code review tool that combines a specialized knowledge base with the power of Large Language Modeling (LLM). It not only supports code review for all programming languages, but also customizes exclusive review criteria and focus areas for teams based on best practices accumulated in the knowledge base. Through...
9mos ago
03.3K
DreamTalk:使用一张头像图片即可生成表情丰富的说话视频

DreamTalk: Generate expressive talking videos with a single avatar image!

DreamTalk Comprehensive Introduction DreamTalk is a diffusion model-driven expression talking head generation framework, jointly developed by Tsinghua University, Alibaba Group and Huazhong University of Science and Technology. It mainly consists of three parts: a noise reduction network, a style-aware lip expert and a style predictor, and can be based on...
8mos ago
03.3K
DCT-Net:照片和视频转绘为动漫风格化的开源工具

DCT-Net: An Open Source Tool for Transpainting Photos and Videos to Anime Stylization

Comprehensive Introduction DCT-Net is an open source project developed by DAMO Academy and Wang Xuan Institute of Computer Technology, Peking University, aiming at anime stylized transformation of images. The project utilizes deep learning techniques through Domain-Calibrated Translation (Domain-Calibrat...
7mos ago
03.2K
WriteWise:喜马拉雅推出的专业AI小说写作工具

WriteWise: a professional AI novel writing tool from Himalaya

Comprehensive Introduction WriteWise is an online service platform focused on novel creation launched by Himalaya. It provides professional AI writing assistance, covering such things as persona setting, dialogue design and martial arts fighting. In addition, it also provides a computer version for download, supports rich editor format configuration as well as stable...
11mos ago
03.2K
纳米AI搜索(360AI助手):一站体验国内主流对话大模型

Nano AI Search (360 AI Assistant): one-stop experience of the domestic mainstream dialog large model

Site Description 360 launched an AI assistant that integrates many domestic advanced large models, similar to overseas POE.AI assistant currently integrates 12 domestic filing compliance use of mainstream large models (lack of Tencent hybrid large models slightly regrettable). 360 AI assistant can be based on the user's problem, automatic...
6mos ago
03.2K
ChatTTS:模仿真人说话声音的语音生成模型(ChatTTS一键加速包)

ChatTTS: a speech generation model that mimics the voice of a real person speaking (ChatTTS one-click acceleration package)

General Introduction ChatTTS is a generative speech model designed for conversational scenarios. It generates natural and expressive speech, supports multiple languages and multiple speakers, and is suitable for interactive conversations. The model does this by predicting and controlling fine-grained prosodic features such as laughter, pauses and interjections, sup...
6mos ago
03.2K
Fay数字人框架:集成语言模型与3D数字角色,支持多种应用场景

Fay Digital Human Framework: Integrated language modeling and 3D digital characters to support multiple application scenarios

Comprehensive Introduction Fay is an open source 3D virtual digital human framework that integrates language models and digital characters for a variety of application scenarios, such as virtual shopping guides, virtual anchors, assistants, waiters, teachers, and voice- or text-based mobile assistants.The Fay framework supports full offline use, providing m...
7mos ago
03.2K
Akool:生成图像和视频营销素材|视频换脸|视频翻译|人像说话

Akool: Generate images and video marketing materials | Video Face Swap | Video Translation | Portrait Speak

General Introduction Akool is a focus on personalized visual marketing and advertising. Through advanced AI technology, AKOOL can help users easily create high-quality, personalized video content for a wide range of fields such as advertising, online education, art creation and e-commerce. It provides face transposition...
9mos ago
03.2K
BRIA:生成式AI图像开放平台|图像去背景|图像元素编辑|RMBG

BRIA: Open Platform for Generative AI Images|Image De-Backgrounding|Image Element Editing|RMBG

BRIA General Introduction BRIA provides a comprehensive visually generated AI business solution with a platform that uses 100% licensed datasets to ensure copyright protection and creator benefits. The platform supports base model access, APIs, SDKs, and web integrations, practicing Responsible AI, taking responsibility for all output...
8mos ago
03.2K
搜狐简单AI:简约易上手的商业化AI绘图工具

Sohu Simple AI: A simple and easy-to-use commercial AI drawing tool

Comprehensive introduction Sohu Simple AI is an all-in-one AI creation assistant, dedicated to providing users with comprehensive AI creation services. The platform covers AI painting, text-generated diagrams, diagram-generated diagrams, AI copywriting, AI avatars, AI materials and other functions to help users easily realize the generation of creative content. Whether it's a market push...
9mos ago
03.2K
Visprex:快速可视化CSV文件,自动将数据生成各类分析图表,数据完全在浏览器中处理

Visprex: fast visualization of CSV files, automatically generate all kinds of analytical charts from the data, and process the data completely in the browser.

General Introduction Visprex is a lightweight data visualization tool designed to help users analyze and present data quickly and intuitively. The tool runs entirely in the browser, ensuring data privacy and security, and does not send data to any backend servers.Visprex supports a wide range of...
9mos ago
03.2K
Chatbot Arena(LMSYS):大语言模型基准测试和多模型比较性能的在线竞技平台

Chatbot Arena (LMSYS): an online competitive platform for benchmarking large language models and comparing performance across multiple models

General Introduction The LMSYS Org, known as the Large Model Systems Organization, is an open-access co-founded by students and faculty at the University of California, Berkeley, in collaboration with the University of California, San Diego, and Carnegie Mellon University...
5mos ago
03.2K