Foundation Text-Generation Models Below 360M Parameters
Great candidates for fine-tuning targeting Wllama and Transformers.js for mobile devices, ordered by number of parameters.
Text Generation • 0.4B • Updated • 59.9k • 99Note License: apache-2.0 Context Length: 8k
facebook/MobileLLM-R1-360M-base
Text Generation • 0.4B • Updated • 132 • 13Note License: fair-noncommercial-research Context Length: 4k
PleIAs/Pleias-350m-Preview
0.4B • Updated • 13.5k • 26Note License: apache-2.0 Context Length: 2k
OuteAI/Lite-Oute-1-300M
Text Generation • 0.3B • Updated • 10 • 9Note License: apache-2.0 Context Length: 4k
keeeeenw/MicroLlama
Text Generation • 0.3B • Updated • 184 • 53Note License: apache-2.0 Context Length: 2k
google/gemma-3-270m
Text Generation • 0.3B • Updated • 6.13M • 1.03kNote License: gemma Context Length: 32k
cerebras/Cerebras-GPT-256M
Text Generation • Updated • 1.43k • 25Note License: apache-2.0 Context Length: 2k
UUFO-Aigis/Pico-OpenLAiNN-250M
0.3B • Updated • 4 • 3Note License: apache-2.0 Context Length: 2k
upstage/TinySolar-248m-4k
Text Generation • 0.2B • Updated • 162 • 11Note License: apache-2.0 Context Length: 4k
M4-ai/TinyMistral-248M-v3
Text Generation • 0.2B • Updated • 21 • 8Note License: apache-2.0 Context Length: 2k
MiniLLM/MiniPLM-llama3.1-212M
Text Generation • 0.2B • Updated • 7 • 6Note License: apache-2.0 Context Length: 1k
MiniLLM/MiniPLM-Qwen-200M
Text Generation • 0.2B • Updated • 178 • 9Note License: apache-2.0 Context Length: 1k
xTimeCrystal/MiniModel-200M-Base
Text Generation • Updated • 13 • 30Note License: apache-2.0 Context Length: 2k
princeton-nlp/Sheared-Pythia-160m
Text Generation • Updated • 791 • 4Note License: apache-2.0 Context Length: 2k
JackFram/llama-160m
Text Generation • 0.2B • Updated • 145k • 37Note License: apache-2.0 Context Length: 2k
SmallDoge/Doge-160M
Text Generation • 0.2B • Updated • 32 • 6Note License: apache-2.0 Context Length: 2k
EleutherAI/pythia-160m
Text Generation • 0.2B • Updated • 3.02M • 42Note License: apache-2.0 Context Length: 2k
facebook/MobileLLM-R1-140M-base
Text Generation • 0.1B • Updated • 607 • 19Note License: fair-noncommercial-research Context Length: 4k
openai-community/gpt2
Text Generation • 0.1B • Updated • 16.3M • 3.27kNote License: mit Context Length: 1k
HuggingFaceTB/SmolLM2-135M
Text Generation • 0.1B • Updated • 1.41M • 196Note License: apache-2.0 Context Length: 8k
amd/AMD-Llama-135m
Text Generation • 0.1B • Updated • 2.87k • 120Note License: apache-2.0 Context Length: 2k
MiniLLM/MiniPLM-Mamba-130M
Text Generation • 0.1B • Updated • 9 • 3Note License: apache-2.0 Context Length: 1k
Nikity/lille-130m-base
Text Generation • 0.1B • Updated • 29 • 13Note License: apache-2.0 Context Length: 0.5k
EleutherAI/gpt-neo-125m
Text Generation • 0.2B • Updated • 441k • 228Note License: mit Context Length: 2k
cerebras/Cerebras-GPT-111M
Text Generation • Updated • 3.12k • 79Note License: apache-2.0 Context Length: 2k
BEE-spoke-data/smol_llama-101M-GQA
Text Generation • 0.1B • Updated • 780 • 33Note License: apache-2.0 Context Length: 1k
UUFO-Aigis/Pico-OpenLAiNN-100M
0.1B • Updated • 71 • 1Note License: apache-2.0 Context Length: 2k
raincandy-u/Rain-100M
Text Generation • 97.2M • Updated • 30 • 18Note License: apache-2.0 Context Length: 4k
Felladrin/Qwen2-96M
Text Generation • 96.2M • Updated • 4 • 3Note License: apache-2.0 Context Length: 8k
Felladrin/Minueza-2-96M
Text Generation • 96M • Updated • 7 • 6Note License: apache-2.0 Context Length: 4k
distilbert/distilgpt2
Text Generation • 88.2M • Updated • 4.4M • 631Note License: apache-2.0 Context Length: 1k
weiser/82M-0.4
Text Generation • 82.1M • Updated • 27Note License: apache-2.0 Context Length: 1k
BEE-spoke-data/smol_llama-81M-tied
Text Generation • 81.3M • Updated • 221 • 10Note License: apache-2.0 Context Length: 1k
EleutherAI/pythia-70m
95.6M • Updated • 570k • 81Note License: apache-2.0 Context Length: 2k
JackFram/llama-68m
Text Generation • Updated • 178k • 37Note License: apache-2.0 Context Length: 2k
OuteAI/Lite-Oute-1-65M
Text Generation • 65M • Updated • 15 • 12Note License: apache-2.0 Context Length: 2k
SmallDoge/Doge-60M
Text Generation • 54.6M • Updated • 49 • 4Note License: apache-2.0 Context Length: 2k
SupraLabs/Supra-50M-Base
Text Generation • 51.8M • Updated • 982 • 14Note License: apache-2.0 Context Length: 1k
Felladrin/Minueza-32M-Base
Text Generation • 32.8M • Updated • 662 • 19Note License: apache-2.0 Context Length: 2k
GerbilLab/Gerbil-A-32m
Text Generation • Updated • 8 • 2Note License: apache-2.0 Context Length: 2k
EleutherAI/pythia-31m-deduped
Text Generation • 55.7M • Updated • 1.45k • 5Note License: apache-2.0 Context Length: 2k
SmallDoge/Doge-20M
Text Generation • 13.1M • Updated • 813 • 9Note License: apache-2.0 Context Length: 2k
EleutherAI/pythia-14m-deduped
Text Generation • 39.2M • Updated • 13.6k • 29Note License: apache-2.0 Context Length: 2k