meso- (chemistry)SmartResume - 阿里巴巴开源的AI简历解析与优化工具
SmartResume 是阿里巴巴开源的智能简历解析与优化工具,能高效地从 PDF、图片或 Office 文档中提取结构化信息,如基本资料、教育经历和工作经验等。通过融合 OCR 技术和 PDF 元数...
meso- (chemistry)Step-Audio-EditX - 阶跃星辰开源的首个LLM级音频编辑大模型
Step-Audio-EditX是开源的音频编辑大模型,由阶跃星辰团队研发,专注于通过人工智能技术实现音频内容的精细操控。模型能动态调整音频的情绪、说话风格(如撒娇、老人腔等)和副语言元素(如笑声、叹...
meso- (chemistry)Open-o3 Video - 北大联合字节开源的视频推理模型
Open-o3 Video 是北京大学和字节跳动联合开发的开源视频推理模型,专注于通过时间和空间证据增强视频推理能力。通过明确标注关键证据的时间戳和边界框,帮助模型更好地理解和解释视频内容。
Handy - Open Source Free Native AI Speech to Text Tool
Handy is open source and free local speech to text tool, supporting Windows, MacOS and Linux systems, developed by Rust and React. It is suitable for quick transcription and text input by processing voice data locally without uploading it to the cloud to ensure privacy and security.
FG-CLIP 2 - 360 Open Source Cross-Modal Visual Language Model for Graphic Texts
FG-CLIP 2 is the world's leading graphical cross-modal visual language model (VL-M) launched by 360 Artificial Intelligence Research Institute, which surpasses similar models from Google and Meta in 29 authoritative benchmark tests, making it the most powerful VL-M at present.It is able to accurately recognize the gross...
BettaFish - Open Source Multi-Intelligence Public Opinion Analyzing System
BettaFish is an open source multi-intelligence system for public opinion analysis. Using multi-intelligent body architecture, through Query, Media, Insight, Report and other Agents work together to achieve retrieval, extraction and reporting closed loop. The system supports AI-driven full ...
Ouro - A new cyclic language model open-sourced by the ByteHopper Seed team
Ouro is a new type of Looped Language Models (LLMs) developed by the ByteDance Seed team, with the core innovation of directly building inference capabilities in the pre-training phase through a parameter-sharing recurrent computation structure. The model uses 24 layers as the base block through...
ChronoEdit - AI image editing framework jointly open-sourced by NVIDIA and the University of Toronto
ChronoEdit, an open-source AI image editing framework developed by NVIDIA in conjunction with the University of Toronto, redefines the image editing task as a video generation task to ensure that the editing results are temporally and physically consistent. By distilling a pre-trained video generation model with 14B parameters from a...
LongCat-Flash-Omni - A Fully Modal Large Language Model for Meituan Open Source
LongCat-Flash-Omni is an open source fully modal big language model released by the LongCat team of Meituan. With a parameter scale of 560 billion (27 billion activated parameters), it realizes millisecond-level real-time audio and video interaction capabilities while maintaining a large number of parameters.
Petri - Anthropic's open source AI security auditing framework
Petri is an open source AI security auditing framework developed by Anthropic that systematically assesses the security and behavioral alignment of AI models. By simulating a real-world scenario where an automated auditor engages in multiple rounds of conversations with a target model, followed by a judge agent that acts on the model's...









