LLaVA-OneVision-1.5 - Free and open source multimodal modeling, high performance multimodal understanding
LLaVA-OneVision-1.5 is an open-source multimodal model by the EvolvingLMMS-Lab team, using 8B parameter scale, through a compact three-phase training process (language-image alignment, conceptual equalization and knowledge injection, and instruction fine-tuning) on 128 A800...
Paper2Video - NUS open source project to automatically generate demo videos for academic papers
Paper2Video is an open-source presentation video project for automatic generation of academic papers by Show Lab at National University of Singapore. Using the PaperTalker multi-intelligence framework, papers are transformed into full presentation videos containing slides, subtitles, voiceover and speaker avatar...
NeuTTS Air - Free and Lightweight Speech Synthesis Model with Offline CPU Running Support
NeuTTS Air is open source lightweight speech synthesis model, developed by Neuphonic team, which can run in real time on local devices (e.g. cell phones, laptops, Raspberry Pi) without relying on the cloud. Using 0.5B parameter Qwen architecture and self-developed NeuCodec codec...
KAT-Dev-72B-Exp - Racer open source free programming-specific models
KAT-Dev-72B-Exp is an open-source programming-specific large language model launched by the Racer team, optimized based on reinforcement learning technology, which achieved an accuracy rate of 74.6% in the SWE-Bench Verified benchmark test, the best performance of any open-source model at present. The model uses innovative...
Jamba Reasoning 3B - Israel AI21 Labs open source lightweight reasoning model
Jamba Reasoning 3B is a lightweight inference model open-sourced by Israeli AI startup AI21 Labs with strong performance and potential for a wide range of applications. It utilizes a hybrid SSM-Transformer architecture that combines Trans...
Free Course on the Latest Intelligentsia from Agentic AI by Ernest Ng
Agentic AI is the newest course on intelligent bodies launched by Ernest Ng.The course focuses on the design and construction of intelligent bodies, covering the four major design patterns of reflection, tool use, planning, and multi-intelligent body collaboration. Learners will master how to make intelligent bodies check outputs, autonomously adjust through theoretical explanations and code practice...
OpenAgents - Open Source Free Open Collaboration Project for Building AI Agent Networks
OpenAgents is the open source project that creates a network of AI agents and facilitates open collaboration between agents. A basic network infrastructure is provided to enable AI agents to seamlessly connect and collaborate. Users can quickly start their own agent network, extend functionality through a modular architecture, support...
Androidify - Google open sources free resources on how to build AI apps on Android
Androidify is Google's open source project to help developers learn how to build AI-driven apps on Android. The project uses Google's latest technologies such as Jetpack Compose, Gemini API (via Fire...
Ling-1T - Ant Group's open source universal language model for trillions of parameters
Ling-1T is a trillion-parameter general-purpose language model open-sourced by Ant Group, which belongs to the flagship product of the Ling 2.0 series of Bering's large models. The model adopts a highly efficient MoE architecture, supports 128K context windows, and surpasses GPT in 7 benchmarks including code generation, mathematical reasoning, and logic test...
EchoCare - Hong Kong Academy of Sciences open source ultrasound base large model
EchoCare is a large model of ultrasound base developed by the Center for Artificial Intelligence and Robotics Innovation (CAIR) at the Hong Kong Institute of Innovation and Research of the Chinese Academy of Sciences (CAS), trained based on the world's largest ultrasound image dataset (more than 4.5 million images), covering multi-center, multi-region, multi-ethnicity, and more than 50 individuals...









