MiniCPM 4.1 - Ultra-efficient end-side grand model introduced by Facing Face Intelligence
MiniCPM 4.1 is an ultra-efficient end-side large language model introduced by Facade Intelligence. With InfLLM v2 sparse attention architecture, each lexeme only needs to calculate the relevance to less than 5% lexemes, which significantly reduces the processing overhead of long text. In a 128K long text scenario...






















































































![FLUX.1 Krea [dev] - 黑森林和Krea AI联合推出的文生图模型](https://aisharenet.com/wp-content/uploads/2025/08/1754032748-1754032748-FLUX.1-Krea-dev-website-2.png)











