The AI community increasingly distinguishes between open-weights and open-source models — two terms that are frequently conflated but describe meaningfully different levels of openness. The distinction matters for reproducibility, trust, competition, and the future of AI governance. 1)
An open-weights model releases the trained model parameters (weights and biases) that determine how the model processes inputs and generates outputs. Users can download, deploy, fine-tune, and build on these weights.
What open-weights typically provides:
What open-weights typically does not provide:
Examples: Meta Llama (3, 3.1, 4), Mistral, Google Gemma 2)
The Open Source Initiative (OSI) published its Open Source AI Definition (v1.0) in 2024, setting a rigorous standard. To qualify as open-source, an AI model must provide:
Examples that approach this standard: OLMo (Allen Institute for AI), Pythia (EleutherAI)
Meta's decision to label the Llama model family as “open source” sparked significant debate. Critics point out that Llama:
These restrictions violate the traditional open-source principle of unrestricted use. Meta argues that releasing weights provides meaningful transparency and community benefit, even without full reproducibility. The debate highlighted the need for clearer terminology — hence the adoption of “open-weights” as a distinct category. 4)
The difference between open-weights and open-source has practical consequences:
For reproducibility: Open-source models can be independently verified and reproduced. Open-weights models must be taken on trust — users cannot confirm how they were trained or what data they learned from.
For safety and auditing: Full open-source enables independent safety research, bias auditing, and vulnerability analysis. Open-weights provides limited insight into training-time decisions.
For competition: Open-weights enables downstream fine-tuning and deployment, fostering an ecosystem of adapted models. But without training data and code, competitors cannot truly replicate or build equivalent base models.
For regulatory compliance: Some regulatory frameworks may require transparency about training data — a requirement open-weights alone cannot satisfy. 5)
In practice, AI model openness exists on a spectrum:
| Level | Provides | Example |
| Closed / Proprietary | API access only | GPT-4, Claude |
| Open-weights | Weights + inference code | Llama, Mistral |
| Open-weights + data info | Weights + data documentation | Falcon 2 |
| Full open-source (OSI) | Weights + code + data + unrestricted license | OLMo |