Latest AI Resources

Total 2832 articles posts
Excel AI:AI智能函数插件,实现数据提取、批量转换、公式生成、数据分析

Excel AI: AI intelligent function plug-ins, to achieve data extraction, batch conversion, formula generation, data analysis

Comprehensive introduction Excel AI is an Excel plug-in based on artificial intelligence technology , the unique AI function can be automatically populated according to the user description of all kinds of functions . Designed to enhance the efficiency of data processing through intelligent functions and automation tools. Users can use the plug-in to achieve data extraction, transfer...
11mos ago
041.6K
Step-Audio:多模态语音交互框架,识别语音并使用克隆语音交流等功能

Step-Audio: a multimodal voice interaction framework that recognizes speech and communicates using cloned speech, among other features

Comprehensive Introduction Step-Audio is an open source intelligent speech interaction framework designed to provide out-of-the-box speech understanding and generation capabilities for production environments. The framework supports multi-language dialog (e.g., Chinese, English, Japanese), emotional speech (e.g., happy, sad), regional dialects (e.g., Cantonese, Szechuan ...
9mos ago
041.6K
FinRobot:提升金融数据分析效率和投资研究的的智能体

FinRobot: An Intelligent Body to Improve Financial Data Analysis Efficiency and Investment Research

Comprehensive Introduction FinRobot is an open source AI intelligence platform developed by AI4Finance Foundation and designed for financial analytics. It not only covers traditional language models, but also incorporates a variety of AI technologies, aiming to provide a comprehensive solution for the financial industry.F...
10mos ago
041.5K
Infinity:生成高分辨率图像的比特自回归建模,实现无限制高分辨率图像生成

Infinity: bitwise autoregressive modeling for generating high-resolution images for unlimited high-resolution image generation

General Introduction Infinity is a groundbreaking high-resolution image generation framework developed by the FoundationVision team. The project breaks through the limitations of traditional image generation models through an innovative bit-level visual autoregressive modeling approach.The core features of Infinity...
11mos ago
041.4K
Doc2X:文档图片公式识别与转换工具,支持多格式转换与高精度翻译

Doc2X: Document image formula recognition and conversion tools, support for multi-format conversion and high-precision translation

Comprehensive introduction Doc2X is a powerful document image formula recognition and conversion tools, is committed to providing efficient and intelligent document processing solutions. Whether it is an academic research paper, a textbook, a corporate document or a financial report, Doc2X can accurately recognize PDF tables and...
10mos ago
041.4K
算了么:共享你电脑闲置 GPU 显卡算力赚钱,支持科学研究

Forget it: Share your computer's unused GPUs and graphics cards to earn money and support scientific research!

Comprehensive Introduction Nevermind is a platform that utilizes the arithmetic power of idle graphics cards to perform scientific calculations and earn revenue. Users can share their computer's idle GPU resources to support scientific research and technological progress, while earning a certain financial return. The platform aims to promote scientific progress and solve important scientific research problems...
12mos ago
041.2K
Llasa 1~8B:高品质语音生成和克隆的开源文本转语音模型

Llasa 1~8B: an open source text-to-speech model for high quality speech generation and cloning

General Introduction Llasa-3B is an open source text-to-speech (TTS) model developed by the Audio Lab of the Hong Kong University of Science and Technology (HKUST Audio). The model is based on the Llama 3.2B architecture, which has been carefully tuned to provide high-quality speech generation that not only supports multiple...
10mos ago
041.2K
通义万相:AI创意作画|文生图|图生图|虚拟模特|个人写真|涂鸦作画

Tongyi Wanxiang: AI Creative Painting|Text-to-Picture|To-Picture|Virtual Modeling|Personal Portrait|Doodle Painting

Comprehensive Introduction Tongyi Wanxiang is an AI creative painting platform under Aliyun, providing a variety of AI art creation functions. Users can create in a variety of ways such as text to generate images, image to generate images, graffiti painting, virtual modeling and personal portraits. The platform is based on the self-developed Composer combination of generating...
1yrs ago
041.1K
沉浸式翻译插件:免费多语言实时网页翻译工具,PDF/EPUB/视频字幕全支持

Immersive Translation Plugin: Free multi-language real-time web page translation tool, PDF/EPUB/video subtitle full support

Comprehensive Introduction Immersive Translator is a free and powerful browser plug-in designed to break down language barriers and help you read global information easily. It provides multi-language real-time web page translation services, supports dozens of languages to translate each other, and breaks through the limitations of traditional web page translation to extend the function to PDF documents, E...
8mos ago
041.1K
DeOldify:使用AI技术为黑白照片和视频上色的经典开源工具

DeOldify: the classic open-source tool for colorizing black-and-white photos and videos using AI technology

Comprehensive Introduction DeOldify is an open source project based on deep learning technology, specifically designed for intelligent colorization and restoration of black and white photos and videos. The project uses an innovative NoGAN training method to successfully solve the common defects of traditional GAN networks in the image coloring process...
11mos ago
041K
InstantID:上传一张图片,迁移人像特征来生成不同风格图片

InstantID: upload an image and migrate the portrait features to generate different styles of images

Comprehensive Introduction InstantID is an advanced technology focused on generating images with personalized styles or poses in seconds while ensuring a high level of fidelity using a single reference ID picture. The technology employs a diffusion model-based solution by integrating facial images, landmark maps...
1yrs ago
040.9K
Sonic:音频驱动肖像图片生成面部表情生动的数字人口播视频

Sonic: Audio-driven portrait images generate digital demo videos with vivid facial expressions

General Introduction Sonic is an innovative platform focusing on global audio perception designed to generate vivid portrait animations driven by audio. Developed by a team of researchers from Tencent and Zhejiang University, the platform utilizes audio information to control facial expressions and head movements to generate natural and smooth animated videos.S...
8mos ago
040.7K
LazyLLM:商汤开源构建多智能体应用的低代码开发工具

LazyLLM: Shangtang's open source low-code development tool for building multi-intelligence body applications

Comprehensive Introduction LazyLLM is an open source tool developed by the LazyAGI team, focusing on simplifying the development process of multi-intelligence large model applications. It helps developers quickly build complex AI applications through one-click deployment and lightweight gateway mechanisms, saving tedious engineering configuration...
9mos ago
040.7K
Kolors:生成高质量图像的文本到图像模型,支持生成中文海报

Kolors: text-to-image model for generating high-quality images, support for generating Chinese posters

Comprehensive Introduction Kolors is a large-scale text-to-image generation model developed by the Racer team, based on potential diffusion techniques. The model is trained on billions of text-image data pairs, and is capable of generating high-quality, complex semantically accurate images with support for both Chinese and English input.Kolors in visual quality...
11mos ago
040.7K
MakeSense:免费使用的图像标注工具,提升计算机视觉项目效率

MakeSense: a free-to-use image annotation tool to improve computer vision project efficiency

General Introduction Make Sense is a free online image annotation tool designed to help users quickly prepare datasets for computer vision projects. It requires no complicated installation, just open a browser access to use it, supports multiple operating systems, and is perfect for small deep learning projects. Users can...
9mos ago
040.7K
Pinokio:一键本地部署各类AI开源项目,小白全自动部署

Pinokio: one-click local deployment of all kinds of AI open source projects, fully automated deployment of white people

Pinokio General Introduction Pinokio is an innovative AI open source project deployment tool that allows users to easily install, run, and programmatically control a wide range of big model related applications with a single click. It is supported across multiple platforms and provides a community scripting library that covers most popular A...
1yrs ago
040.7K
腾讯智影:智能视频创作工具|AI数字人、动漫生成套件

Tencent Smart Shadow: Intelligent Video Creation Tool | AI Digital Man, Anime Generation Kit

Comprehensive Introduction Tencent Smart Shadow is an online intelligent video creation platform launched by Tencent, which can support text dubbing, digital human broadcasting, automatic subtitle recognition and other functions through powerful AI tools provided by cloud services.It integrates material search, video editing, rendering export and publishing, bringing users a convenient visual...
1yrs ago
040.6K
匠邦AI:教师教学辅助AI助手,为老师提供备案教案/PPT课件/课题论文/出题组卷

Artisan AI: Teacher teaching aid AI assistant, providing teachers with filed lesson plans / PPT courseware / subject papers / questions and papers.

Comprehensive Introduction Artisan AI is an intelligent assistant focusing on the field of education, aiming to improve teachers' work efficiency and teaching quality through artificial intelligence technology. The site provides a variety of functions, including lesson plan design, subject report guidance, thesis checking and weight reduction, PPT courseware generation, etc., to help teachers in teaching, research...
11mos ago
040.5K
Notta:AI会议记录与音频转录工具,自动转录会议、采访或录音

Notta: AI meeting recording and audio transcription tool to automatically transcribe meetings, interviews or recordings

General Description Notta is a powerful AI meeting recording and audio transcription tool designed to help users automatically convert meetings, interviews or audio recordings into searchable text. With Notta, users can easily transcribe, edit, summarize and collaborate to boost productivity.Notta supports...
11mos ago
040.5K
Apify:全栈网页抓取与数据提取平台,自动化数据收集,构建自定义爬虫,集成多种API

Apify: full-stack web crawling and data extraction platform, automate data collection, build custom crawlers, integrate multiple APIs

General Introduction Apify is a full-stack web crawling and data extraction platform that provides a variety of tools and services to help users automate data extraction from any website. Users can use off-the-shelf crawling tools or build and distribute their own data extraction tools.Apify supports multiple programming languages and frameworks...
1yrs ago
040.4K
DreamTalk:使用一张头像图片即可生成表情丰富的说话视频

DreamTalk: Generate expressive talking videos with a single avatar image!

DreamTalk Comprehensive Introduction DreamTalk is a diffusion model-driven expression talking head generation framework, jointly developed by Tsinghua University, Alibaba Group and Huazhong University of Science and Technology. It mainly consists of three parts: a noise reduction network, a style-aware lip expert and a style predictor, and can be based on...
12mos ago
040.4K
WebShaper - 阿里通义开源的AI训练数据合成系统

WebShaper - Ali Tongyi's open source AI training data synthesis system

WebShaper is an AI training data synthesis system launched by Alibaba's Tongyi Lab, which is based on formal modeling and intelligence expansion mechanism to generate high-quality and scalable training data to help AI intelligences improve complex information retrieval capabilities. The system introduces the concept of "knowledge projection"...
4mos ago
040.3K
文心快码(Baidu Comate):你的AI编程助手,结合百度编程大数据,为你生成优质编程代码。

Wenxin Quick Code (Baidu Comate): your AI programming assistant, combined with Baidu programming big data, to generate quality programming code for you.

Comprehensive Introduction Baidu Comate is an advanced AI programming assistant developed by Baidu, based on Baidu's ERNIE Big Model, integrating proprietary and open source data to provide next-generation programming assistance. It features code completion, interpretation and debugging to help developers think, write and optimize...
9mos ago
040.2K
RD-Agent:自动化数据驱动研发工具,通过AI技术推动以数据为导向的研发过程

RD-Agent: an automated data-driven R&D tool to drive data-driven R&D processes through AI technology

Comprehensive Introduction RD-Agent is an open source tool from Microsoft designed to automate and optimize the research and development (R&D) process. The tool focuses on data-driven scenarios to improve the efficiency of model and data development through artificial intelligence techniques.RD-Agent integrates research...
9mos ago
040.1K
RealtimeVoiceChat:低延迟与AI进行自然口语对话

RealtimeVoiceChat: low-latency natural spoken conversation with AI

General Introduction RealtimeVoiceChat is an open source project focused on real-time, natural conversations with artificial intelligence via voice. Users use a microphone to input their voice, and the system captures the audio through a browser, quickly converts it to text, and a large-scale language model (LLM) generates back...
7mos ago
040.1K