Github: https://github.com/hkust-nlp/simpleRL-reason This blog will show a replication of DeepSeek-R1-Zero and DeepSeek-R1 training using small models and limited data, with many of the experiments performed in our independent DeepSeek-R1 release of ...
General Description Fullmoon is an application designed for iOS devices and aims to provide the ability to chat privately with native large language models. The app is optimized for Apple Silicon and is supported on iPhone, iPad and Mac. Users' chats are saved locally and can be customized should...
Model Overview In recent years, large model training based on Mixture of Experts (MoE) architecture has become an important research direction in the field of artificial intelligence.The Qwen team recently released the Qwen2.5-Max model, which employs more than 20 trillion tokens of pre-training data and refined post-training scheme in M...
General Introduction Onlook is an open source design tool built for designers and developers that allows users to design directly in a running React application and convert design changes to code. The tool provides an intuitive visual editing experience, similar to Figma or Webflow, but with a focus on this...
General Introduction YuE is an open source full song generation base model that focuses on transforming lyrics into full songs. Unlike other models that only generate short snippets of non-vocal music, YuE is capable of generating full songs with lead and backing vocals up to several minutes in length. The model solves the music generation problem of long on...
General Introduction PocketPal AI is an open source mobile app designed to bring Small Language Models (SLMs) directly to your phone for both iOS and Android users. It provides a web-independent AI chat experience, ensuring that users are hidden...
General Introduction Cog-ComfyUI is an open source project designed to run ComfyUI workflows via an API. Created by GitHub user fofr, the project provides an efficient way to integrate and run ComfyUI workflows.ComfyUI is a user interface for image generation and manipulation that supports a variety of models...
General Introduction Supermemory is an open source project designed to help users build their "second brain". With a powerful Chrome extension and AI technology, it allows users to easily save, organize, and retrieve information from a variety of sources such as web pages, Twitter bookmarks, etc. Supermemory ...
General Introduction Open NotebookLM is an open source project designed to convert any PDF document into a podcast. The tool utilizes open source Large Language Model (LLM) and Text-to-Speech (TTS) models to process PDF content, generate natural dialog suitable for audio podcasts, and output to MP3 files. The project is supported by the N...
Comprehensive Introduction Deeptrain is a platform focusing on AI video processing, which can effectively integrate video content into various AI applications through its advanced technology that supports over 200 language models. Users can train models directly by providing video URLs without having to download the videos.Deeptrain provides...
Good New Year! Greetings to all of you! Recently, my circle of friends has been bombarded with news related to DeepSeek-R1, and I believe you have all heard about our domestic open source model DeepSeek! I'm sure you've all heard about DeepSeek, our homegrown open source model, and there have been a lot of tutorials on how to deploy DeepSeek-R1 locally, so let's do something different today...
General Introduction Open Intelligence is a company dedicated to providing open source AI solutions, and its main product, Apollo, allows users to interact directly with their private AI backends via their cell phones. The platform not only supports individual users to autonomously manage their AI backends, but also provides support for a variety of AI application scenarios, such as chatting...
General Introduction Llamao is a private and offline running Llama AI chatbot designed to provide users with an intelligent assistant service without internet connection. Unlike ChatGPT, Llamao runs entirely on the user's device, ensuring absolute privacy and security of user data. Whether it's writing, brainstorming or solving...
General Introduction Codev is an AI-driven platform designed to help users quickly generate full-stack web applications. Whether you are a developer or a non-developer, simply describe the application idea through natural language and Codev generates a complete Next.js application with all the necessary components, styles and features. The platform uses Next...
I. BACKGROUND AND CHALLENGES With the rapid development of AI technology, large-scale language models (LLMs) have become a core driver in the field of natural language processing. However, training these models requires huge computational resources and time costs, which has led to the rise of Knowledge Distillation (KD) techniques. Knowledge distillation works by combining large ...
General Introduction Lux is a fast and simple video download library and command line tool written in Go. It supports downloading videos from multiple websites, including YouTube, Bilibili, Youku, etc. Lux provides a variety of download options and features, such as multi-threaded downloads, breakpoints, automatic retries, etc., extremely...