| | GPT-OSS from the Ground Up (cameronrwolfe.substack.com) |
| 1 point by Brajeshwar 50 days ago | past |
|
| | AI Agents from First Principles (cameronrwolfe.substack.com) |
| 2 points by Brajeshwar 4 months ago | past |
|
| | A Guide for Debugging LLM Training Data (cameronrwolfe.substack.com) |
| 3 points by Brajeshwar 4 months ago | past |
|
| | NanoMoE: Mixture-of-Experts (MoE) LLMs from Scratch in PyTorch (cameronrwolfe.substack.com) |
| 4 points by danboarder 6 months ago | past |
|
| | Vision Large Language Models (VLLMs) (cameronrwolfe.substack.com) |
| 3 points by Brajeshwar 6 months ago | past |
|
| | Demystifying Reasoning Models (cameronrwolfe.substack.com) |
| 2 points by Brajeshwar 7 months ago | past |
|
| | Scaling Laws for LLMs: From GPT-3 to o3 (cameronrwolfe.substack.com) |
| 1 point by UCdallasGA 8 months ago | past |
|
| | Mixture-of-Experts (MoE) LLMs (cameronrwolfe.substack.com) |
| 2 points by Brajeshwar 8 months ago | past |
|
| | Mixture-of-Experts (MoE) LLMs (cameronrwolfe.substack.com) |
| 1 point by Philpax 8 months ago | past |
|
| | Scaling Laws for LLMs: From GPT-3 to o3 (cameronrwolfe.substack.com) |
| 1 point by Brajeshwar 9 months ago | past |
|
| | Model Merging: A Survey (cameronrwolfe.substack.com) |
| 2 points by Brajeshwar on Sept 18, 2024 | past |
|
| | Mixture-of-Experts (MoE): The Birth and Rise of Conditional Computation (cameronrwolfe.substack.com) |
| 1 point by Brajeshwar on March 21, 2024 | past |
|
| | Decoder-Only Transformers: The Workhorse of Generative LLMs (cameronrwolfe.substack.com) |
| 2 points by Brajeshwar on March 5, 2024 | past |
|
| | Dolma, OLMo, and the Future of Open-Source LLMs (cameronrwolfe.substack.com) |
| 1 point by Brajeshwar on Feb 20, 2024 | past |
|
| | The Basics of AI-Powered (Vector) Search (cameronrwolfe.substack.com) |
| 2 points by Brajeshwar on Jan 9, 2024 | past |
|
| | Understanding and Using Supervised Fine-Tuning (SFT) for Language Models (cameronrwolfe.substack.com) |
| 1 point by tosh on Dec 26, 2023 | past |
|
| | Explaining ChatGPT to Anyone in <20 Minutes (cameronrwolfe.substack.com) |
| 1 point by Brajeshwar on Dec 12, 2023 | past |
|
| | LLaMA-2 from the Ground Up (cameronrwolfe.substack.com) |
| 1 point by verdverm on Aug 28, 2023 | past |
|
| | Understanding LLaMA-2 from the Ground Up (cameronrwolfe.substack.com) |
| 2 points by cwolferesearch on Aug 14, 2023 | past |
|
| | Open-Source LLMs: Imitation and Alignment (Part Three) (cameronrwolfe.substack.com) |
| 2 points by cwolferesearch on Aug 7, 2023 | past |
|
| | For those who've been missing out, here's the history of open source LLMs (cameronrwolfe.substack.com) |
| 2 points by adr1an on Aug 7, 2023 | past |
|
| | Open-Source LLMs: Better Base Models (cameronrwolfe.substack.com) |
| 2 points by makaimc on Aug 1, 2023 | past |
|
| | Open-Source LLMs: Better Base Models (Part Two) (cameronrwolfe.substack.com) |
| 14 points by cwolferesearch on July 31, 2023 | past | 1 comment |
|
| | An Overview of Llama’s model architecture (cameronrwolfe.substack.com) |
| 2 points by tim_sw on April 10, 2023 | past |
|