Can't find AI tools? Try here!

Just type in the keyword Accessibility Bing SearchYou can quickly find all the AI tools on this site.

NeoCodeium怎么用?

How does NeoCodeium work?

NeoCodeium is a plugin that provides AI code completion functionality for Neovim, developed based on Codeium technology. The plugin aims to solve the flickering problem of the official plugin during multi-line virtual text processing and provide a smoother user experience.NeoC...
1yrs ago
057.7K
Waifu2x Extension GUI:深度学习技术放大、修复图像与视频插帧(Windows x64)

Waifu2x Extension GUI: Deep Learning Techniques to Enlarge, Repair Image and Video Interpolation (Windows x64)

Comprehensive Introduction Waifu2x-Extension-GUI is a powerful image and video processing tool that utilizes deep convolutional neural network techniques to achieve super-resolution zoom and video frame interpolation for images, GIFs and videos. The tool supports multiple algorithms and engines, including Wai...
1yrs ago
078.7K
OpenAI开始提供大模型(GPT系列模型)的提示缓存(Prompt Caching):GPT-4o系列模型输入价格下降一半,访问速度提升80%

OpenAI started to provide Prompt Caching for large models (GPT series models): the price of GPT-4o series model inputs dropped by half, and the access speed increased by 80%

In large model applications, processing complex requests is often accompanied by high latency and cost, especially when there is a lot of repetition in the request content. This "slow request" problem is especially prominent in scenarios with long prompts and high-frequency interactions. To address this challenge, OpenAI recently ...
1yrs ago
054.2K
R2R:多模态内容解析并结合知识图谱与混合搜索的先进AI检索(RAG)系统

R2R: An Advanced AI Retrieval (RAG) System for Multimodal Content Parsing and Combining Knowledge Graph with Hybrid Search

Comprehensive Introduction R2R (RAG to Riches) is an advanced AI retrieval system supporting Retrieval Augmented Generation (RAG) functionality with production-ready features. Built on a containerized RESTful API, the system provides multimodal content parsing, hybrid search functionality...
1yrs ago
095.1K
Megrez-3B-Omni:端侧多模态理解模型,支持文本、图像、音频多模态理解和分析

Megrez-3B-Omni: an end-side multimodal understanding model supporting text, image, and audio multimodal understanding and analysis

Comprehensive Introduction Infini-Megrez is an edge intelligence solution developed by the unquestioned core dome (Infinigence AI), aiming to achieve efficient multimodal understanding and analysis through hardware and software co-design. At the core of the project is the Megrez-3B model, which supports graph...
1yrs ago
047.4K
3B模型长思考后击败70B!HuggingFace逆向出o1背后技术细节并开源

HuggingFace reverses out the technical details behind o1 and open-sources it!

Small models can outperform larger models if they are given longer to think. In recent times, there has been an unprecedented amount of enthusiasm in the industry for small models, with a number of 'practical tricks' to allow them to outperform larger scale models in terms of performance. It can be argued that putting the spotlight on improving smaller...
1yrs ago
043.4K
RAGFlow:基于深度文档理解的开源RAG引擎,提供高效的检索增强生成工作流

RAGFlow: an open source RAG engine based on deep document understanding, providing efficient retrieval-enhanced generation workflows

Comprehensive Introduction RAGFlow is an open source Retrieval Augmented Generation (RAG) engine based on deep document understanding technology. It provides an efficient RAG workflow for organizations of all sizes, incorporating a large-scale language model (LLM) capable of delivering data in complex formats based on real...
1yrs ago
099.6K
再见 LangChain!Atomic Agents火了!

Goodbye LangChain! Atomic Agents is on fire!

Frameworks like LangChain, CrewAI, and AutoGen have become popular by providing high-level abstractions for building AI systems. However, many developers, including myself, have found that these tools do more harm than good, often adding unnecessary complexity and frustration to the development process...
1yrs ago
047.9K
Depth AI:构建全面的代码知识图谱,深度理解代码库的AI助手

Depth AI: An AI assistant for building a comprehensive code knowledge graph and deep understanding of the code base

Comprehensive Introduction Depth AI is an artificial intelligence assistant designed for developers to deeply understand and analyze code bases. By building a comprehensive code knowledge graph, Depth AI can answer complex technical questions and help developers manage and optimize their code more efficiently. Whether...
1yrs ago
077.2K
SystoByte:编程系统设计练习平台,提供实时AI反馈,提升面试技能

SystoByte: a programming system design practice platform that provides real-time AI feedback to improve interview skills

General Introduction SystoByte is a platform built for system design practice, designed to help users improve their system design skills, especially in interview preparation. The platform provides a rich library of system design questions that users can design through an intuitive interface and get instant access to AI-generated...
1yrs ago
050.8K
FindPicLocation:使用AI技术定位照片拍摄地点,快速获取片GPS定位

FindPicLocation: Use AI technology to locate the location where the photo was taken, and quickly get the GPS location of the photo.

Comprehensive Introduction FindPicLocation is a website that utilizes artificial intelligence technology to help users locate where their photos were taken. Users just need to upload photos, and the system will automatically analyze the EXIF data in the photos, extract the GPS coordinates, and display the exact location on the map. The site aims to...
1yrs ago
088.2K
CrewAI:多角色扮演协作智能框架,简化复杂任务

CrewAI: A Multi-Roleplay Collaborative Intelligence Framework to Simplify Complex Tasks

Comprehensive Introduction CrewAI is an advanced framework designed to orchestrate collaboration between role-playing and autonomous AI agents. By facilitating collaborative intelligence, CrewAI enables agents to work together seamlessly to solve complex tasks. Whether you're building an intelligent assistant platform, automating customer service teams, or multi-agent...
1yrs ago
078.4K
Cohere AI 推出 Rerank 3.5:相关知识排序技术的新时代

Cohere AI Launches Rerank 3.5: A New Era of Relevant Knowledge Sorting Technology

Overview In the age of the information explosion, organizations have come to rely on search technology not just to find content, but to improve efficiency and productivity. However, traditional search models often struggle to truly understand user intent, resulting in inaccurate, irrelevant or even incomplete search results. This experience not only frustrates users...
1yrs ago
050.1K
Google全新发布AI视频Veo2、AI绘图Imagen3

Google Newly Releases AI Video Veo2, AI Mapping Imagen3

Earlier this year, Google launched its video generation model Veo and its newest image generation model Imagen 3. Since then, it's been exciting to see people bring their ideas to life with these models: YouTube creators are exploring the possibilities for YouTub...
1yrs ago
046.3K
SiliconCloud上线加速版视频模型Mochi-1-Preview

SiliconCloud Goes Live with Accelerated Video Model Mochi-1-Preview

Recently, GenmoAI open-sourced the video generation model mochi 1 preview (10B) with high-fidelity actions and robust cue following capabilities, currently supporting 480p resolution video generation. Today, SiliconCloud, a silicon based flow, went live with an inference accelerated version of mo...
1yrs ago
043.7K
如何将copilot安装到国内电脑

How to install copilot to domestic computer

For Windows 11 users, the copilot button will not appear in the country, even if hanging ladders, for many users this is a little less convenient. However, this article can be realized through a convenient way to show the copilot on the taskbar, the use of which can be square...
1yrs ago
055.3K
这个AI设计软件厉害了,只要一张产品图就能生成专业的电商主图,爆款产品这不就来了嘛。

This AI design software is awesome, as long as a product image can generate a professional e-commerce main picture, pop-up products which do not come well.

In today's competitive e-commerce market, how to make your product stand out from the crowd of choices has become a challenge that every brand and business must face. The importance of visual marketing as one of the key factors for e-commerce success cannot be overstated. An attractive and professional product image display not only...
11mos ago
047.1K
Leffa:高保真模特虚拟试穿与人物姿势调整,Meta开源的可控人物图像生成模型

Leffa: High-fidelity model virtual fitting and character pose adjustment, Meta open source controllable character image generation model

Comprehensive Introduction Leffa is a unified framework for generating controllable character images, enabling precise manipulation of character appearance (e.g., virtual fitting) and pose (e.g., pose transfer). The framework significantly reduces distortion of fine-grained details by directing the target query to focus on the correct reference key in the attention layer, with ...
1yrs ago
065.6K
MMAudio:为视频画面生成同步音效与配乐,视频到音频的多模态联合训练工具

MMAudio: generating synchronized sound effects and soundtracks for video footage, video-to-audio multimodal co-training tool

General Introduction MMAudio is an open-source project aiming to generate high-quality synchronized audio through joint multimodal training. Developed by Ho Kei Cheng et al. at the Chinese University of Hong Kong, the project's main function is to generate synchronized audio based on video and/or text input.MM...
1yrs ago
068.7K
AutoGPT:工作流自动化与自主执行任务的智能体构建平台

AutoGPT: Intelligent Body Building Platform for Workflow Automation and Autonomous Task Execution

General Description AutoGPT is a powerful platform designed to help users create, deploy and manage continuously running AI agents and automate complex workflows. Developed by Significant Gravitas, the platform offers a wide range of tools and features that enable users to focus...
1yrs ago
061.3K
YOO简历:智能简历生成工具,在线制作大厂简历范文,提升求职成功率

YOO Resume: intelligent resume generation tool, online production of large factory resume sample, enhance the success rate of job hunting

Comprehensive Introduction YOO Resume is an intelligent resume generation tool launched by Zhuhai Biyou Technology Co. Ltd, aiming to help users create professional resumes quickly and efficiently through artificial intelligence technology. Whether you are a new student or an experienced job seeker, YOO Resume provides personalized resume templates and...
1yrs ago
054.3K
瑞达写作:一键生成论文,免费选题生成论文大纲, 论文润色,引用文献数据

Rida Writing: One-click essay generation, free topic selection to generate an essay outline, thesis touch-up, citation of literature data

Comprehensive Introduction Rida Writing is an AI platform that focuses on academic paper writing, aiming to help users efficiently complete their paper writing tasks. By entering a dissertation title, users can generate complete dissertation content with up to 50,000 words in one click. The platform offers a variety of features, including free topic selection, idea outline...
1yrs ago
059.3K
Qwen-Agent:基于Qwen的智能代理应用框架,包括工具调用、代码解释器、RAG和Chrome扩展。

Qwen-Agent: Qwen-based framework for intelligent agent applications, including tool calls, code interpreters, RAGs and Chrome extensions.

Comprehensive Introduction Qwen-Agent is an intelligent agent application framework developed based on Qwen 2.0 and above, with capabilities such as command following, tool usage, planning and memorization. The framework provides a variety of sample applications such as browser assistants, code interpreters and custom assistants...
1yrs ago
078.6K
Mini-Cover:在线封面制作,专为博客、短视频、社交媒体等生成个性化封面

Mini-Cover: online cover creation, designed to generate personalized covers for blogs, short videos, social media and more

General Introduction Mini-Cover is an open source online cover generation tool designed to generate personalized covers for platforms such as blogs, short videos and social media. Developed by JLinMr, the tool aims to provide a simple and efficient solution to help users quickly generate covers that meet their needs...
1yrs ago
061.4K
2024年度RAG清单,RAG应用策略100+

2024 RAG Inventory, RAG Application Strategy 100+

Looking back to 2024, the big models are changing day by day, and hundreds of intelligent bodies are competing. As an important part of AI applications, RAG is also a "group of heroes and lords". At the beginning of the year, ModularRAG continues to heat up, GraphRAG shines, and in the middle of the year, open source tools are in full swing, and knowledge graphs are...
1yrs ago
056.8K
Swarms:多智能体编排框架,企业级生产工具

Swarms: Multi-intelligent Orchestration Framework, Enterprise Production Tool

General Introduction Swarms is an enterprise-grade production-ready multi-agent orchestration framework designed to boost business productivity through efficient agent management and task processing. With support for multiple models, multiple memory systems and custom agent creation, the framework provides a modular design and comprehensive logging capabilities to ensure that the system...
1yrs ago
054.2K
Rexera 的 AI 智能体如何通过 LangGraph 驱动质量控制

How Rexera's AI Intelligence Drives Quality Control with LangGraph

Learn how Rexera migrated to LangGraph to create powerful quality control intelligences for real estate business processes and significantly improve the accuracy of their Large Language Model (LLM) responses. Rexera is revolutionizing manual processes by leveraging AI to automate...
1yrs ago
051.9K
算了么:共享你电脑闲置 GPU 显卡算力赚钱,支持科学研究

Forget it: Share your computer's unused GPUs and graphics cards to earn money and support scientific research!

Comprehensive Introduction Nevermind is a platform that utilizes the arithmetic power of idle graphics cards to perform scientific calculations and earn revenue. Users can share their computer's idle GPU resources to support scientific research and technological progress, while earning a certain financial return. The platform aims to promote scientific progress and solve important scientific research problems...
1yrs ago
0128.1K
Sonic:音频驱动肖像图片生成面部表情生动的数字人口播视频

Sonic: Audio-driven portrait images generate digital demo videos with vivid facial expressions

General Introduction Sonic is an innovative platform focusing on global audio perception designed to generate vivid portrait animations driven by audio. Developed by a team of researchers from Tencent and Zhejiang University, the platform utilizes audio information to control facial expressions and head movements to generate natural and smooth animated videos.S...
1yrs ago
077.5K
Ultravox:实时端到端语音对话的音频多模态大模型,GPT-4o语音交互的开源实现

Ultravox: an audio multimodal macromodel for real-time end-to-end voice dialog, an open source implementation of GPT-4o voice interaction

Comprehensive Introduction Ultravox is an innovative multimodal Large Language Model (LLM) designed for real-time speech processing. Unlike traditional speech recognition systems, Ultravox eliminates the need for a separate Audio Speech Recognition (ASR) stage, and is able to directly convert audio into high-dimensional space in...
1yrs ago
069K
卷起来了!长文本向量模型分块策略大比拼

Rolled Up! Long Text Vector Model Chunking Strategies Competition

Long Text Vector Modeling The ability to encode ten pages of text into a single vector sounds powerful, but is it really practical? Many people think... Not necessarily. Is it okay to use it directly? Should it be chunked? How to divide the most efficient? In this article, we will take you to an in-depth discussion of different chunking strategies for long text vector models, analyze the li...
1yrs ago
045.7K
Research Rabbit:使用本地LLM进行网页研究和报告撰写,自动深入用户指定主题并生成总结。

Research Rabbit: Web research and report writing using native LLM, automatically drilling down into user-specified topics and generating summaries.

General Introduction Research Rabbit is a native LLM (Large Language Model) based web research and summarization assistant. After the user provides a research topic, Research Rabbit generates a search query, obtains relevant web results, and summarizes those results...
1yrs ago
073K
AgentClientDemo:演示智能体运行过程的Python客户端,提供直观的图形用户界面

AgentClientDemo: a Python client that demonstrates the process of running an intelligent body, providing an intuitive graphical user interface

Comprehensive Introduction AgentClientDemo is a comprehensive Python project that integrates intelligent (Agent) and client (Client) functionality. The project is based on the PyQt framework and provides an intuitive and easy-to-use graphical user interface (G...
1yrs ago
057.1K
OpenAI-o1有多厉害?深度优化论文,提升论文写作质量!30个极品提示词分享

How powerful is OpenAI-o1? Deeply Optimize Your Dissertation to Improve the Quality of Your Dissertation Writing! 30 Extreme Prompt Words to Share

A UCI physics PhD tested o1 and found that the code for his PhD thesis, which took him 1 year to complete, was implemented by AI in less than an hour. o1 models are already strong enough to straighten out PhD thesis code! This also means revolutionizing the writing of academic papers. By carefully constructing prompt words...
1yrs ago
055K
3小时完成论文初稿! ChatGPT全流程覆盖论文写作每个阶段(附提示词模板)

Finish the first draft of your dissertation in 3 hours! ChatGPT Full Process Coverage of Every Stage of Dissertation Writing (with Prompt Word Templates)

Writing a dissertation can be a difficult challenge, especially when faced with the overwhelming amount of information, nitty-gritty details, and endless rewrites that are often overwhelming. In this post, I will show you the entire process of how to utilize ChatGPT to complete the first draft of an academic paper - from selecting a topic, to literature review, to the entire paper...
1yrs ago
061K
斯坦福大学开源的ChatGPT论文写作提示词

Stanford University's open source ChatGPT essay writing prompts

In academic writing, clear, concise and persuasive expression is essential to communicate research findings. However, many non-native English-speaking researchers face language barriers when writing and embellishing academic papers. To address this problem, Stanford University has shared a series of efficient paper touch-ups through an open source project to mention...
1yrs ago
057.3K