Banana Slides - Open-source AI PPT generation tool based on the Nano Banana Pro model
Banana Slides is an open-source intelligent PPT generator based on the Nano Banana Pro AI model that supports rapid creation of professional presentations from natural-language commands. Users can describe a topic (e.g. "Human impact on the ecosystem") in a single sentence, which can be self...
Kaleido - A multi-subject reference video generation model open-sourced by Zhipu AI in collaboration with Tsinghua University and others
Kaleido is an open-source multi-subject reference video generation model jointly developed by Hefei University of Technology, Tsinghua University, and Zhipu AI. It generates subject-consistent videos from multiple reference images, addressing the weaknesses of existing models in multi-subject consistency and background decoupling. Kaleido generates videos through specialized data...
Paper2Slides - HKU's open-source AI tool for converting academic papers into slides
Paper2Slides is an open-source AI tool from the Data Intelligence Laboratory of the University of Hong Kong that converts academic papers into professional slides or posters in one click. It uses RAG (Retrieval-Augmented Generation) to parse the document content directly rather than relying on information from the web, ensuring that the generated PPT is highly consistent with the original...
RealVideo - Zhipu AI's open-source real-time streaming video generation system
RealVideo is an open-source real-time streaming video generation system from Zhipu AI that can generate natural, smooth video responses in 2 to 3 seconds. Users only need to upload a photo and enter text, and the system generates the corresponding voice and video, enabling real-time conversations with AI characters...
OpenScreen - Free, open-source screen recording tool for Mac and Windows
OpenScreen is a free, open-source screen recording tool that gives users an easy-to-use, full-featured alternative to Screen Studio. It supports both Mac and Windows, is completely free under the MIT license, and can be used for individual...
SCAIL - Zhipu and Tsinghua's open-source film- and television-grade character animation generation framework
SCAIL (Studio-Grade Character Animation via In-Context Learning) is a film- and television-grade character animation generation framework proposed by Zhipu in collaboration with Prof. Liu Yongjin's group at Tsinghua University. Through...
DeepSearchQA - Google's open-source benchmark for AI research agents
DeepSearchQA is Google's open-source benchmark for AI research agents, designed specifically to evaluate agent performance on complex multi-step query tasks. It consists of 900 hand-designed "causal chain" tasks covering 17 domains, requiring the AI to act like a human researcher and push through multi-step...
Claude-Mem - Open-source Claude Code memory plugin with cross-session persistent memory
Claude-Mem is an open-source plugin for Claude Code that addresses the loss of AI memory across sessions. It helps Claude by automatically capturing observations from tool usage, generating semantic summaries, and injecting relevant context in subsequent sessions...
KoalaQA - Open-source AI after-sales service system that helps companies quickly build Q&A platforms
KoalaQA is an open-source intelligent after-sales service system developed by the Chaitin team. Built on AI models, it provides AI customer service, AI search, and knowledge base management to help enterprises quickly build an intelligent Q&A platform. The system supports 24/7 real-time response...
VoxCPM 1.5 - ModelBest's open-source end-to-end text-to-speech model
VoxCPM 1.5 is an open-source speech generation model released by ModelBest. It is a tokenizer-free text-to-speech (TTS) model featuring several innovations and improvements. Adopting an end-to-end diffusion autoregressive architecture, it generates continuous speech waveforms directly from text, avoiding the limitations of traditional tokenization methods...









