Large model fine-tuning

28 articles in total

GraphGen: Fine-tuning Language Models Using Knowledge Graphs to Generate Synthetic Data

Comprehensive Introduction GraphGen is an open source framework developed by OpenScienceLab, an AI lab in Shanghai, hosted on GitHub, focused on optimizing supervised fine-tuning of Large Language Models (LLMs) by guiding synthetic data generation through knowledge graphs. It was developed from ...

3mos ago

MiniMind-V: 1 hour training of a 26M parameter visual language model

General Introduction MiniMind-V is an open source project, hosted on GitHub, designed to help users train a lightweight visual language model (VLM) with only 26 million parameters in less than an hour. It is based on the MiniMind language model, with new visual...

Latest AI Resources # AI Java Open Source Project # Large model fine-tuning

4mos ago

DeepCoder-14B-Preview: an open source model that specializes in code generation

General Introduction DeepCoder-14B-Preview is an open source code generation model developed by the Agentica team and released on the Hugging Face platform. It is based on the DeepSeek-R1-Distilled-Q...

Latest AI Resources # AI Java Open Source Project # Large model fine-tuning

4mos ago

WeClone: training digital doppelgangers with WeChat chats and voices

Comprehensive Introduction WeClone is an open source project that uses WeChat chat logs and voice messages, combined with large language models and speech synthesis technology, to allow users to create personalized digital doppelgangers. The project can analyze the user's chat habits to train the model, and can also use a small number of voice samples to generate realistic sound...

Latest AI Resources # AI Java Open Source Project # Large model fine-tuning

4mos ago

Search-R1: A Tool That Uses Reinforcement Learning to Train Large Models for Search and Reasoning

General Introduction Search-R1 is an open source project, developed by PeterGriffinJin on GitHub, built on the veRL framework. It trains Large Language Models (LLMs) through Reinforcement Learning (RL) techniques, allowing the models to autonomously learn...

Latest AI Resources # AI Java Open Source Project # Large model fine-tuning

4mos ago

Optexity: an open-source project to train AI to perform web actions with human demonstrations

General Introduction Optexity is an open source project on GitHub, developed by the Optexity team. Its core idea is to use human demonstration data to train AI to complete computer tasks, especially web page operations. The project contains three code libraries: Compute...

Latest AI Resources # AI Java Open Source Project # Large model fine-tuning # Desktop Automation Intelligence

4mos ago

Bonsai: A ternary-weight language model suitable for running on edge devices

Comprehensive Introduction Bonsai is an open source language model developed by deepgrove-ai with 500 million parameters and ternary weights. It is based on the Llama architecture and the Mistral tokenizer...

Latest AI Resources # AI Java Open Source Project # Large model fine-tuning

5mos ago

Second Me: Locally trained AI doppelgangers with personal memories and habits

Second Me is an open source project developed by the Mindverse team that allows you to create an AI on your computer that acts like a "digital doppelganger", learning your speech patterns and habits through your words and memories, and becoming an intelligent assistant that understands your...

Latest AI Resources # AI Java Open Source Project # AI Life Efficiency Assistant # Large model fine-tuning

5mos ago

Easy Dataset: an easy-to-use tool for creating fine-tuning datasets for large models

Comprehensive Introduction Easy Dataset is an open source tool designed specifically for fine-tuning large models (LLMs), hosted on GitHub. It provides an easy-to-use interface that allows users to upload files, automatically segment content, generate questions and answers, and ultimately output a suitable...

Latest AI Resources # AI Java Open Source Project # Large model fine-tuning

5mos ago

MM-EUREKA: A Multimodal Reinforcement Learning Tool for Exploring Visual Reasoning

Comprehensive Introduction MM-EUREKA is an open source project developed by Shanghai Artificial Intelligence Laboratory, Shanghai Jiao Tong University and other parties. It extends textual reasoning capabilities to multimodal scenarios through rule-based reinforcement learning techniques to help models process image and textual information. The core of this tool...

Latest AI Resources # AI Java Open Source Project # Large model fine-tuning

5mos ago

AI Toolkit by Ostris: Training Toolkit for Stable Diffusion and FLUX.1 Models

General Introduction AI Toolkit by Ostris is an open source AI toolset focused on supporting Stable Diffusion and FLUX.1 models for training and image generation tasks. The toolset is created and maintained by developer Ostris, tor...

Latest AI Resources # AI Image Generation Aids # AI Java Open Source Project # Large model fine-tuning

5mos ago

X-R1: Low-cost training of 0.5B models on commodity devices

General Introduction X-R1 is a reinforcement learning framework open-sourced on GitHub by the dhcode-cpp team, aiming to provide developers with a low-cost, efficient tool for training models based on end-to-end reinforcement learning. The project is supported by DeepSeek...

Latest AI Resources # AI Java Open Source Project # Large model fine-tuning

5mos ago

OpenManus-RL: Fine-tuning Large Models to Enhance Agent Reasoning and Decision Making

General Introduction OpenManus-RL is an open source project developed by UIUC-Ulab in conjunction with the OpenManus team of the MetaGPT community, hosted on GitHub. The project enhances large language models (LLMs) through reinforcement learning (RL) techniques...

Latest AI Resources # AI Java Open Source Project # Large model fine-tuning

5mos ago

TPO-LLM-WebUI: An AI framework where you can input questions to train a model to output results in real time

Comprehensive Introduction TPO-LLM-WebUI is an innovative project open-sourced by Airmomo on GitHub that enables real-time optimization of Large Language Models (LLMs) through an intuitive web interface. It uses TPO (Test-Time Pr...

Latest AI Resources # AI Java Open Source Project # Large model fine-tuning

6mos ago

Open-Reasoner-Zero: Open Source Large-Scale Reasoning Reinforcement Learning Training Platform

General Introduction Open-Reasoner-Zero is an open source project focused on reinforcement learning (RL) research, developed by the Open-Reasoner-Zero team on GitHub. It aims to provide efficient, scalable and easy-to-use training ...

Latest AI Resources # AI Java Open Source Project # Large model fine-tuning

6mos ago

Chinese distillation dataset based on the full-strength DeepSeek-R1, supporting Chinese R1-distillation SFT

Comprehensive Introduction The Chinese DeepSeek-R1 distillation dataset is an open source Chinese dataset containing 110K samples, designed to support machine learning and natural language processing research. The dataset is released by Cong Liu's NLP team. It contains not only mathematical data, but also a large number of general types...

Latest AI Resources # AI Java Open Source Project # Large model fine-tuning

6mos ago

ColossalAI: Providing Efficient Large-Scale AI Model Training Solutions

Comprehensive Introduction ColossalAI is an open source platform developed by HPC-AI Technologies to provide an efficient and cost-effective solution for training and inference of large-scale AI models. By supporting multiple parallelization strategies, heterogeneous memory management, and mixed-precision training, ColossalAI...

Latest AI Resources # AI Java Open Source Project # Large model fine-tuning

6mos ago

One Shot LoRA: The All-in-One Platform for Rapidly Generating Video LoRA Models

General Introduction One Shot LoRA is a platform focused on generating high-quality video LoRA models from videos. Users can quickly and easily train polished LoRA models from videos without logging in or storing private data. The platform supports Hunyua...

Latest AI Resources # AI Image Generation Aids # Large model fine-tuning

6mos ago

Kiln: A simple LLM fine-tuning and data synthesis tool for fine-tuning your own small models with zero code

Comprehensive Introduction Kiln is an open source tool focusing on fine-tuning, synthetic data generation and dataset collaboration for Large Language Models (LLMs). It provides an intuitive desktop application with support for Windows, macOS and Linux systems, allowing users to achieve zero-code implementation of Ll...

Latest AI Resources # Large model fine-tuning

6mos ago

Maestro: A tool to simplify the process of fine-tuning mainstream open source visual language models

Comprehensive Introduction Maestro is a tool developed by Roboflow to simplify and accelerate the process of fine-tuning multimodal models, so that everyone can train their own large vision models. It provides ready-made recipes for fine-tuning popular visual language models (VLMs) such as F...

Latest AI Resources # AI Java Open Source Project # Large model fine-tuning

6mos ago

LlamaEdge: the quickest way to run and fine-tune LLMs locally

General Introduction LlamaEdge is an open source project designed to simplify the process of running and fine-tuning large language models (LLMs) on local or edge devices. The project supports the Llama2 family of models and provides OpenAI-compatible API services that enable users to easily create and run...

Latest AI Resources # AI Java Open Source Project # Large model fine-tuning

7mos ago

Unsloth: an open source tool for efficiently fine-tuning and training large language models

Comprehensive Introduction Unsloth is an open source project designed to provide efficient tools for fine-tuning and training large language models (LLMs). The project supports a variety of well-known models, including Llama, Mistral, Phi, and Gemma. Unsloth's...

Latest AI Resources # AI Java Open Source Project # Large model fine-tuning

6mos ago

Bakery: easily fine-tune and monetize open source AI models

General Introduction Bakery is a platform designed for AI startups, machine learning engineers and researchers to provide simple and efficient AI model fine-tuning and monetization services. Users can access community-driven datasets through Bakery, create or upload their own datasets, fine-tune models...

Latest AI Resources # AI Side Hustle Money Making Programs # Large model fine-tuning

7mos ago

NVIDIA Garak: Open-source tool to detect LLM vulnerabilities and secure generative AI

Comprehensive Introduction NVIDIA Garak is an open source tool that specializes in detecting vulnerabilities in Large Language Models (LLMs). It checks the model for multiple weaknesses such as hallucinations, data leakage, prompt injection, misinformation generation, harmful content generation, etc. through static, dynamic and adaptive probing...

Latest AI Resources # AI Java Open Source Project # Large model fine-tuning

9mos ago

ModelScope Swift: a lightweight infrastructure for efficiently fine-tuning and deploying large models

Comprehensive Introduction ModelScope Swift (MS-Swift for short) is an efficient lightweight infrastructure designed for fine-tuning, inference, evaluation, and deployment of large models (LLMs) and multimodal large models (MLLMs). The framework supports over 400 LLM...

Latest AI Resources # Large model fine-tuning

9mos ago

LLaMA Factory: Efficient fine-tuning of more than a hundred open-source large models, easy model customization

Comprehensive Introduction LLaMA-Factory is a unified and efficient fine-tuning framework that supports flexible customization and efficient training of more than 100 large language models (LLMs). With the built-in LLaMA Board web interface, users do not need to write code to complete the model...

Latest AI Resources # Large model fine-tuning

9mos ago

Petals: run and fine-tune large language models on distributed shared GPUs, sharing GPU resources like a BitTorrent network

General Introduction Petals is an open source project developed by the BigScience Workshop to run Large Language Models (LLMs) through a distributed computing approach. Users can run LLMs at home using consumer-grade GPUs or Google Co...

Latest AI Resources # Large model fine-tuning # Locally Deployed Open Source Large Modeling Tool

9mos ago

Forefront AI: Machine Learning Model Tuning Platform | AI Chat Assistant

Comprehensive Introduction Forefront AI is an advanced AI platform that focuses on the customization and deployment of open source models. Users can select and fine-tune a variety of powerful AI models, such as GPT-4, GPT-3.5, etc., to meet different task requirements. The platform supports uploading PD...

Latest AI Resources # AI Open Services # AI Integrated Multi-Model Dialog Platform # Large model fine-tuning

9mos ago
