Wenshin Big Model X1.1 - Baidu's Deep Thinking Model for Better Understanding
Wenxin Big Model X1.1 is a deep thinking model launched by Baidu, based on a hybrid reinforcement learning framework that focuses on improving language understanding and generation. The model excels in handling complex questions, following instructions and simulating the behavior of intelligences, and can accurately provide knowledgeable answers and high-quality text content.
Hybrid Image 2.1 - Tencent's Open Source Vendor Graph Model
HunyuanImage 2.1 is Tencent's open source graphic model, designed for high-quality image generation. The model supports native 2K resolution, can accurately render complex scenes and details, so that the character's expression and movement can be vividly reproduced.
Free LangChain for LLM Application Development Course by Ernest Ng
LangChain for LLM Application Development is an online course presented by DeepLearning.AI, featuring LangChain founder Harrison Chase and Andrew Ng.
Free course on how Transformer LLMs work by Enda Wu
Transformer LLMs work on the principle that DeepLearning.AI and Jay Alammar and Maarten Grootend, authors of Hands-On Large Language Models...
Seedream 4.0 - the latest generation of image creation models launched by Bytes
Seedream 4.0 is an advanced image generation and editing tool launched by ByteDance, centered on the integration of generation and editing, with powerful features such as precise command editing, high feature retention, and deep intent understanding.
rStar2-Agent - Microsoft's Open Source Efficient AI Reasoning Model
rStar2-Agent is an advanced AI mathematical reasoning model open-sourced by Microsoft that demonstrates strong mathematical problem solving capabilities by achieving an accuracy of 80.61 TP3T in the AIME24 test. The model is equipped with scientific reasoning capabilities, achieving in the GPQA-Diamond benchmark...
InfinityHuman - Long video digital human generation model launched by Bytes in collaboration with ZJU
InfinityHuman is a commercial-grade long time-series audio-driven character video generation model jointly launched by ByteDance and Zhejiang University. The model is audio-driven and can generate high-resolution, long duration and visually consistent character videos.
Kimi K2-0905 - The latest model release from Dark Side of the Moon!
Kimi K2-0905 is an advanced AI model from Dark Side of the Moon Technologies Ltd. that excels in programming assistance, generates code efficiently, and supports the generation of neat and standardized code in front-end development. The model context length is extended to 256K to handle complex tasks.
Meeseeks - Meeseeks open-source assessment set for evaluating the ability to follow model instructions
Meeseeks is an open source large model evaluation set used by the Meituan M17 team to evaluate the model's ability to follow instructions.Meeseeks uses a three-tiered evaluation framework to comprehensively measure whether the model is able to generate answers in strict accordance with the user's instructions from the macro to the micro level, without evaluating the knowledge of the content of the answers positively ...
gpt-realtime - OpenAI's newest AI speech model
gpt-realtime is an advanced speech model from OpenAI that supports direct audio processing to generate natural and smooth speech. The model supports multiple languages and styles, understands non-verbal cues such as laughter, and can switch between languages.