Latest AI Resources

Total 2758 articles posts
MegaTTS3:合成中英文语音的轻量模型

MegaTTS3: A Lightweight Model for Synthesizing Chinese and English Speech

Comprehensive Introduction MegaTTS3 is an open source speech synthesis tool developed by ByteDance in cooperation with Zhejiang University, focusing on generating high-quality Chinese and English speech. Its core model is only 0.45B parameters , lightweight and efficient , support for mixed Chinese and English speech generation and speech cloning . The project is hosted on ...
7mos ago
027K
FinGPT:开源金融大语言模型平台,助力金融分析与预测

FinGPT: Open Source Financial Big Language Modeling Platform for Financial Analytics and Prediction

Comprehensive Introduction FinGPT is an open source financial big language modeling platform developed by the AI4Finance Foundation, designed for the financial sector to solve complex financial tasks and drive innovation in fintech.FinGPT utilizes lightweight adaptation techniques and reinforcement learning approaches...
9mos ago
027K
触手AI:简单易上手的AI绘图工具,支持训练自己的图像风格

Tentacle AI: simple and easy to use AI drawing tools, support training your own image style

Comprehensive Introduction Touch AI is a professional AI creation platform under Jellyfish Intelligence, providing AI painting, online drawing and massive models and other functions. The platform supports minimalist and professional modes with strong ease of use, provides a variety of drawing styles and design models, rich plug-in options, and allows users to experience AIGC creation capabilities online...
1yrs ago
027K
AI Test Kitchen:Google创意生成与AI技术实验平台

AI Test Kitchen: Google's Experimental Platform for Idea Generation and AI Technology

Comprehensive Introduction AI Test Kitchen is an experimentation platform launched by Google Labs to explore the combination of artificial intelligence and creativity. The platform allows users to experience and give feedback on emerging AI technologies such as LaMDA.The platform provides a variety of tools to help users transform ideas into real...
1yrs ago
027K
微信视频号下载器:快速下载微信视频号视频,支持多种格式和平台

WeChat Video No. Downloader: quickly download WeChat Video No. video, support multiple formats and platforms

Comprehensive Introduction WeChat Video No. Downloader is an open source project designed to help users quickly download video content from WeChat video numbers. The tool supports a variety of video formats and platforms, and users can easily use it on Windows and macOS systems. The project is developed by ltaoo and hosted on...
9mos ago
026.9K
Krita:开源数字绘画软件,集成ComfyUI免去繁琐配置(PS+AI)

Krita: open source digital painting software, integrated ComfyUI free of cumbersome configuration (PS + AI)

General Introduction Krita is a free open source and free professional painting software designed for illustrators, cartoonists, concept artists and animators. It provides a powerful brush engine, layer management, animation tools and a wealth of extended resources.Krita supports a variety of painting styles and work...
1yrs ago
026.9K
AI2SRT:利用 Gemini模型,一键为长视频创建解说短视频或视频总结

AI2SRT: Create short narrated videos or video summaries for long videos with one click using Gemini models

Comprehensive Introduction AI2SRT is an open source project that utilizes the GeminiAI Big Model to generate short narrated videos and video summaries for long videos with one click, while supporting audio and video transcription subtitles. The project aims to simplify the video content creation process and provide efficient subtitle generation and translation functions. Users can pass...
10mos ago
026.9K
文心快码(Baidu Comate):你的AI编程助手,结合百度编程大数据,为你生成优质编程代码。

Wenxin Quick Code (Baidu Comate): your AI programming assistant, combined with Baidu programming big data, to generate quality programming code for you.

Comprehensive Introduction Baidu Comate is an advanced AI programming assistant developed by Baidu, based on Baidu's ERNIE Big Model, integrating proprietary and open source data to provide next-generation programming assistance. It features code completion, interpretation and debugging to help developers think, write and optimize...
7mos ago
026.8K
Linly-Talker:数字人智能对话系统,结合大语言模型与视觉模型,实现互动新体验

Linly-Talker: An Intelligent Dialogue System for Digital People, Combining Big Language Modeling and Visual Modeling for a New Interactive Experience

Comprehensive Introduction Linly-Talker is an innovative digital human dialog system that combines Large Language Models (LLMs) with visual models to create a novel approach to human-computer interaction. The system integrates a variety of technologies such as Whisper, Linly, Micros...
8mos ago
026.8K
Goose:开源可扩展的编程智能体,自动化执行编程全流程任务

Goose: open source scalable programming intelligences that automate the full range of programming tasks

General Introduction Goose is an open source AI agent tool developed by Block, Inc. designed to help developers automate everyday development tasks. It supports a wide range of Large Language Models (LLMs) and interacts with users via the command line or desktop application interfaces.Goose can perform a wide range of tasks from agent...
9mos ago
026.8K
AI Hedge Fund:开源自动化交易系统,利用多智能体进行复杂对冲基金交易决策

AI Hedge Fund: open-source automated trading system utilizing multiple intelligences for complex hedge fund trading decisions

General Introduction AI Hedge Fund is an artificial intelligence hedge fund that utilizes a multi-agent system for trading decisions. The system works in concert with multiple specialized agents, including market data agents, quantitative agents, risk management agents, and portfolio management agents, to achieve complex trading...
9mos ago
026.8K
Agent TARS:使用视觉和命令操作电脑的开源智能体

Agent TARS: An Open Source Intelligence Using Vision and Commands to Operate Computers

Comprehensive Introduction Agent TARS is a multimodal AI intelligence open-sourced by ByteDance.The core feature is to visually understand web content and combine command line and file system operations to help users complete complex computer tasks. Instead of requiring manual operations like traditional tools, it can self...
7mos ago
026.7K
Fish Agent:端到端AI语音克隆助手,实时语音对话助理,Fish Speech衍生项目

Fish Agent: end-to-end AI voice cloning assistant, real-time voice conversation assistant, Fish Speech spin-off project

Comprehensive Introduction Fish Speech Derivative Project Fish Agent is a revolutionary end-to-end AI speech cloning system developed based on the V0.1 3B model architecture. As a fully end-to-end speech clone processing system, its most important feature is the use of innovative speechless...
9mos ago
026.7K
Browser-Use:构建智能网页自动化工具,让AI智能体轻松操作浏览器

Browser-Use: Building Intelligent Web Automation Tools for AI Intelligents to Easily Operate Browsers

Comprehensive Introduction Browser-Use is an innovative open source web automation tool specifically designed to enable Language Models (LLMs) to naturally interact with websites. It provides a powerful and flexible framework that supports a wide range of mainstream language models, including GPT-4, Claud...
10mos ago
026.7K
GeekAI:自部署商业化多功能AI助手,完整接入多模型API运营后台

GeekAI: Self-deployed commercialized multi-functional AI assistant with complete access to multi-model API operation backend

Comprehensive introduction GeekAI is a full set of open source solutions for AI assistants based on AI big language model API implementation. The project comes with an operations management backend , out of the box , integrated with ChatGPT, Azure, ChatGLM, Xunfei Starfire, Wenxin Yiyin and many other p...
1yrs ago
026.7K
sensitive-word:敏感词过滤工具,高效DFA算法实现

sensitive-word: sensitive word filtering tool, efficient DFA algorithm implementation

Comprehensive introduction Sensitive Word Filtering Tool (Sensitive Word) is a high-performance Java sensitive word filtering tool based on the implementation of the DFA algorithm framework . The tool is able to efficiently detect and filter sensitive words , supports a variety of format conversion and custom replacement strategies. Its design goal is to provide ...
1yrs ago
026.6K
99AI:集成多模态AI服务的商业化Web应用(免费开源)

99AI: A commercialized web application integrating multimodal AI services (free and open source)

Comprehensive Introduction 99AI is an open source AI web application project that aims to provide an easy-to-deploy, low-threshold integrated AI service platform. The project supports intelligent dialog, multimodal modeling, application plaza, networked search, and integrates AI painting, music and video...
11mos ago
026.6K
紫东太初:多模态大模型平台,支持文本创作、图像生成、3D理解、信号分析等任务

Zidong Taichu: Multi-modal large model platform supporting text creation, image generation, 3D understanding, signal analysis and other tasks

Comprehensive Introduction Zidong Taichu is a new-generation multimodal big model platform launched by the Institute of Automation of the Chinese Academy of Sciences and the Wuhan Institute of Artificial Intelligence. The platform supports multiple tasks such as multi-round question and answer, text creation, image generation, 3D understanding and signal analysis, with powerful cognitive, understanding and creation capabilities. Zidong ...
1yrs ago
026.6K
Artflow:创作人物一致性的动画故事和虚拟数字人口播视频

Artflow: Creating character-consistent animated stories and virtual digital pop-up videos

General Description Artflow is an online platform that enables users to upload photos, train exclusive AI characters, and create character-consistent videos and animated stories. Offering free training for the first time, users can customize their identity to create unique images and videos for a variety of scenarios. Monthly ...
1yrs ago
026.6K
Hibiki:实时语音翻译模型,保留原声特点的流式翻译

Hibiki: a real-time speech translation model, streaming translation that preserves the characteristics of the original voice

General Introduction Hibiki is a high-fidelity real-time speech translation model developed by Kyutai Labs. Unlike traditional offline translation, Hibiki is able to generate natural speech translation in the target language and provide text translation in real time while the user is speaking. The model...
8mos ago
026.6K
Edraw.AI(亿图):在线协作白板工具,AI生成流程图和多种图表

Edraw.AI: Online collaborative whiteboard tool, AI-generated flowcharts and multiple diagrams

Comprehensive Introduction Edraw.AI is a revolutionary AI-powered online visualization whiteboard collaboration platform that integrates more than 40 intelligent tools and a library of carefully designed templates. The platform uses advanced AI technology to quickly transform users' textual thoughts into professional visual diagrams. The platform supports...
10mos ago
026.6K
DreamTalk:使用一张头像图片即可生成表情丰富的说话视频

DreamTalk: Generate expressive talking videos with a single avatar image!

DreamTalk Comprehensive Introduction DreamTalk is a diffusion model-driven expression talking head generation framework, jointly developed by Tsinghua University, Alibaba Group and Huazhong University of Science and Technology. It mainly consists of three parts: a noise reduction network, a style-aware lip expert and a style predictor, and can be based on...
10mos ago
026.5K