Train the smallest LM you can that fits in 16MB.
T

Train the smallest LM you can that fits in 16MB.

Train the smallest LM you can that fits in 16MB.

5,136 stars
N/A forks
N/A contributors

README

Project documentation from GitHub

Parameter Golf: The Art of the Tiny Language Model

Sometimes the most interesting challenges come with the tightest constraints. While the AI world chases ever-larger models with trillions of parameters, a project from OpenAI asks a different, almost minimalist question: how small can you go?

Parameter Golf is a fun, thought-provoking experiment that flips the usual script. Instead of scaling up, it's about compressing down. The goal is simple, in a hacker-y kind of way: train the smallest capable language model you can that fits into a 16MB file. It's a benchmark for efficiency, cleverness, and understanding what's truly essential in a model.

What It Does

This repository establishes a playful competition. The core challenge is to create a functional language model with a hard limit—its saved checkpoint file must be 16 megabytes or smaller. It's not just about pruning a giant model; it's about training a small one from the ground up to be both compact and competent.

Think of it as a leaderboard for model minimalism. Participants experiment with architectures, training techniques, and data to push performance within that strict size budget. The project provides the rules, a baseline model to beat, and a framework for submitting and evaluating your own tiny creation.

Why It's Cool

The brilliance of Parameter Golf is in its constraints. A 16MB limit is brutally small—for perspective, that's roughly the size of a medium-quality MP3 song. This forces you to think about every single parameter.

It pushes you to consider:

  • Architectural ingenuity: What novel, efficient model structures can you design?
  • Data selection & preprocessing: Is your training data optimally used?
  • Training tricks: How do you coax maximum capability from minimal resources?
  • The essence of "understanding": What's the absolute core of what a language model needs to know?

This isn't just an academic exercise. The principles explored here—extreme model efficiency, low memory footprint, and cost-effective training—have direct applications in on-device AI, edge computing, and making AI more accessible and deployable everywhere.

How to Try It

Ready to tee off? The game is hosted on GitHub.

  1. Head over to the Parameter Golf repository.
  2. Read the README.md thoroughly. It explains the rules, the evaluation method (perplexity on a specific dataset), and how submissions work.
  3. Check out the provided baseline model. Your first goal is to beat it.
  4. Start experimenting! The repo has t

Did you like this issue?

Join our weekly newsletter

Love discovering amazing projects?

Help us continue bringing you the best open-source discoveries every week.

Back to Projects
Last updated: Mar 20, 2026