BAGEL - Open source multimodal base model launched by Wordpress
BAGEL is a multimodal base model open-sourced by ByteDance with 14 billion parameters, of which 7 billion are active. The model base with the Mixed Transformer Expert Architecture (MoT) captures pixel-level and semantic-level features of an image with two independent encoders, respectively, to support efficient processing of images, text, video...
DeepSeek-R1 - AI inference model from DeepSeek, performance aligned to OpenAI o1 release
DeepSeek-R1 is a high-performance AI inference model launched by Hangzhou-based DeepSeek, benchmarking against OpenAI's o1 official version. The model is post-trained based on large-scale reinforcement learning techniques and requires only a very small amount of labeled data to reason in math, code and natural language...
Phantom Boat AI - One-stop AI short film creation platform, batch generation of various types of video content
Phantom Boat AI is a powerful one-stop AI short film creation platform that supports efficient batch generation of various types of video content, including commercials, promos, animations and more. The platform is based on Midjourney, Runway and other world-leading AI models, and provides creators with a wide range of services from scriptwriting to...
Circuit Tracer - Anthropic open source tool for visualizing the inner workings of models
Circuit Tracer is an open source tool from Anthropic for studying the internal workings of large language models. Based on the generation of attribution graphs (attribution graphs) to reveal the internal steps that the model undergoes when generating a particular output ...
Google AI Edge Gallery - Google launches AI app that supports running AI models on your phone
Google AI Edge Gallery is an experimental AI app from Google that lets users experience and use Machine Learning (ML) and Generative Artificial Intelligence (GenAI) models on native devices. The app is supported on Android devices.
Data Agent - Volcano Engine's Next Generation Enterprise Data Intelligence Body
Data Agent is a new generation of enterprise-grade data intelligence launched by Volcano Engine, focusing on data analysis and intelligent marketing.Data Agent integrates structured and unstructured data within the enterprise, and generates comprehensive and in-depth research reports based on in-depth research and analysis.
Keling 2.1 - AI Video Generation Model Launched by Shutterstock
KeLing 2.1 is an AI video generation model launched by Racer, which is now available on the KeLing AI video platform. The model contains three versions: standard, high quality and master, providing 720P, 1080P and movie and TV level effects to meet different creative needs. The standard version of the generation speed, suitable for rapid production...
Little Lark - Smart Creation Agent by Shear Image
Little Lark is an intelligent creation Agent launched by Shear Image, based on AI technology to reshape the boundaries of content creation, making creation simpler, more efficient and more interesting. Little Lark supports zero-threshold creation of videos, digital pop-up videos, design drawings and pictures for backgrounds, users only need to enter a command, AI support efficiently complete...
Drafting AI Community - AI creative content design platform, a variety of design resources to meet different creative needs
Drafting AI Community is an online AI creative inspiration platform that provides users with a wealth of creative design resources and tools. The platform covers a variety of design fields, including image photos, e-commerce design, holiday themes, 3D illustrations, avatar design, Xiaohongshu materials, portrait design, etc., to meet the needs of different users.
Ming-lite-omni - ants Bering team open source unified multimodal big model
Ming-Lite-Omni is an open source unified multimodal big model from Ant Group's Bailing Big Model team, built on the efficient Mixture of Experts (MoE) architecture.Ming-Lite-Omni supports processing of text, images, audio and video...