Archives

All the articles I've archived.

2026 ¹⁸

June ⁹

Inside DSpark: DeepSeek's Confidence-Scheduled Speculative Decoding

28 Jun, 2026

A deep dive into DSpark, DeepSeek's new draft model for speculative decoding. We cover what it actually is — a semi-autoregressive drafter paired with a confidence-scheduled, load-aware verifier — how it differs from vanilla speculative decoding, Medusa, EAGLE-3 and parallel drafters like DFlash, and why it delivers 60–85% faster per-user generation inside the DeepSeek-V4 serving stack.
Inside GLM-5.2: IndexShare, KVShare, and the End-to-End TV Loss

21 Jun, 2026

A deep dive into GLM-5.2 — a 753B open-weight MoE that serves a 1M-token context. We walk the three innovations that make it cheap to run: IndexShare (cross-layer sparse-attention index reuse), KVShare + rejection sampling for speculative decoding, and a novel end-to-end TV loss that breaks the entropy bound on MTP acceptance. Plus the slime RL stack behind its long-horizon agentic skills.
Mastra vs Agno: two agent frameworks with very different centers of gravity

17 Jun, 2026

A practical comparison of Mastra and Agno, covering developer experience, agents, teams, workflows, memory, RAG, observability, deployment, and when to choose each framework.
Difference Between On-Policy Distillation and Reinforcement Learning

17 Jun, 2026

A in-depth analysis on comparing on-policy distillation with reinforcement learning.
HRM-Text & MagicNorm: Pretraining a 1B Language Model for ~$1,500

9 Jun, 2026

A walkthrough of HRM-Text: Efficient Pretraining Beyond Scaling — the biologically-inspired Hierarchical Recurrent Model that swaps the Transformer for a dual-timescale recurrent core, and MagicNorm, the normalization trick that makes that deep recurrence trainable by exploiting the forward/backward asymmetry of truncated backpropagation through time.
Building a Mastra Text-to-SQL Chat App with Observability, Auth, and Deployability

8 Jun, 2026

An end-to-end guide for building a natural-language PostgreSQL query assistant with Mastra: starting from the official text-to-SQL template and extending it for OpenRouter models, production-grade tracing/logging/metrics, role-based access control, editable agent configuration, and local/VPS deployment.
Top Interview 150 — Solutions in Python

2 Jun, 2026

Worked Python solutions to the LeetCode Top Interview 150, organized by topic with a short approach for each problem.
From AlexNet to World Models: The Evolution of Multi-Modal Neural Networks

2 Jun, 2026

A ground-up tour of how neural networks learned to see, then to see-and-read, and finally to imagine. From AlexNet and CNNs, through CLIP and the vision-language models behind GPT-4V, to world models like Dreamer, V-JEPA 2, and LeWorldModel — with architectures, math, and benchmark numbers along the way.
Attention Residuals: Softmax Attention Over Depth

1 Jun, 2026

A deep dive into the Kimi team's Attention Residuals (AttnRes) — replacing the fixed-weight residual connection with learned softmax attention over depth. Covers the time–depth duality, Full vs Block AttnRes, the structured-matrix view that unifies prior residual variants, the pipeline-parallel infra that makes it practical, and the scaling-law and 48B-MoE results.

May ⁹

GRPO and DAPO: A Deep Dive into RL for Reasoning LLMs

28 May, 2026

An end-to-end walkthrough of Group Relative Policy Optimization (GRPO) and Decoupled Clip and Dynamic sAmpling Policy Optimization (DAPO) — the two RL algorithms that drive open reasoning models in 2025–2026. Full math, every design choice motivated, and a head-to-head comparison.
From GRPO to GSPO: Group-Based Policy Optimization for LLMs

28 May, 2026

A complete walkthrough of Group Relative Policy Optimization (GRPO) and Group Sequence Policy Optimization (GSPO) — the policy-gradient algorithms behind DeepSeek-R1 and Qwen3. Full math, the failure mode that motivated GSPO, the MoE story, and a side-by-side comparison.
GRPO and Dr.GRPO: The Math, the Biases, and the Fix

28 May, 2026

An end-to-end derivation of Group Relative Policy Optimization (GRPO) from DeepSeekMath and the Dr.GRPO correction from Liu et al. Covers the full objective, the gradient, the two biases (length and question difficulty), the unbiased fix, and the practical recipe behind R1-Zero–style training.
Training Composer 2: How Cursor Builds a Coding Agent Model

27 May, 2026

A structured walkthrough of Sasha Rush's Training Composer 2 workshop: why Cursor chose Kimi K2.5, how continued pretraining and long-horizon RL fit together, what CursorBench measures, and where Composer is headed.
Leetcode Problems

26 May, 2026

Leetcode grinding.
Hybrid Attention and MLA: The Tradeoff

23 May, 2026

A side-by-side dive into Xiaomi MiMo's hybrid sliding-window/global attention and DeepSeek's Multi-head Latent Attention. The two answer the same question — how to make attention affordable at long context — with very different bets, and those bets shape everything from training infra to KV cache size.
Kimi K2.5: Joint Text–Vision Training and the Agent Swarm

19 May, 2026

A walkthrough of two ideas behind Kimi K2.5: how joint text–vision pre-training and RL make each modality help the other, and how Agent Swarm replaces sequential tool use with a learned parallel orchestrator.
Inside DeepSeek's Sparse Attention: From NSA to DSA

18 May, 2026

A deep dive into DeepSeek's two sparse attention designs — Native Sparse Attention (NSA) and DeepSeek Sparse Attention (DSA) — covering the math, the hardware story, and why DSA in V3.2 looks so different from NSA.
AstroPaper 6.0

17 May, 2026

AstroPaper v6: a from-scratch rewrite on Astro v6, Tailwind v4, and a new config system.

2025 ¹

March ¹

AstroPaper 5.0

8 Mar, 2025

AstroPaper v5: keep the clean look, updates under the hood.

2024 ⁴

September ¹

How to add LaTeX Equations in Astro blog posts

Updated: 22 Mar, 2025

Learn how to add LaTeX equations in Astro blog posts using Markdown, KaTeX, and remark/rehype plugins.

July ¹

How to integrate Giscus comments into AstroPaper

Updated: 12 Mar, 2025

Comment function on a static blog hosted on GitHub Pages with Giscus.

January ²

AstroPaper 4.0

4 Jan, 2024

AstroPaper v4: ensuring a smoother and more feature-rich blogging experience.
How to use Git Hooks to set Created and Modified Dates

Updated: 9 Jan, 2024

How to use Git Hooks to set your Created and Modified Dates on AstroPaper

2023 ³

September ¹

AstroPaper 3.0

25 Sep, 2023

AstroPaper Version 3: Elevating Your Web Experience with Astro v3 and Seamless View Transitions

July ¹

How to update dependencies of AstroPaper

20 Jul, 2023

How to update project dependencies and AstroPaper template.

January ¹

AstroPaper 2.0

30 Jan, 2023

AstroPaper with the enhancements of Astro v2. Type-safe markdown contents, bug fixes and better dev experience etc.

2022 ⁸

December ¹

Dynamic OG image generation in AstroPaper blog posts

Updated: 4 May, 2026

New feature in AstroPaper v1.4.0, introducing dynamic OG image generation for blog posts.

September ⁴

Predefined color schemes

Updated: 16 May, 2026

Some of the well-crafted, updated predefined color schemes for AstroPaper.
Customizing AstroPaper theme color schemes

Updated: 17 May, 2026

How you can enable/disable light & dark mode; and customize color schemes of AstroPaper theme.
Adding new posts in AstroPaper theme

Updated: 17 May, 2026

Some rules & recommendations for creating or adding new posts using AstroPaper theme.
How to configure AstroPaper theme

Updated: 17 May, 2026

How you can make AstroPaper theme absolutely yours.

July ¹

Tailwind Typography Plugin

5 Jul, 2022

EXAMPLE POST: About Tailwind Typography Plugin and how you can use it effectively.

June ¹

How Do I Develop My Terminal Portfolio Website with React

9 Jun, 2022

EXAMPLE POST: Developing a terminal-like website using ReactJS, TypeScript and Styled-Components. Includes features like autocomplete, multiple themes, command hints etc.

March ¹

How Do I Develop My Portfolio Website & Blog

25 Mar, 2022

EXAMPLE POST: My experience about developing my first portfolio website and a blog using NextJS and a headless CMS.

Archives

Inside DSpark: DeepSeek's Confidence-Scheduled Speculative Decoding

Inside GLM-5.2: IndexShare, KVShare, and the End-to-End TV Loss

Mastra vs Agno: two agent frameworks with very different centers of gravity

Difference Between On-Policy Distillation and Reinforcement Learning

HRM-Text & MagicNorm: Pretraining a 1B Language Model for ~$1,500

Building a Mastra Text-to-SQL Chat App with Observability, Auth, and Deployability

Top Interview 150 — Solutions in Python

From AlexNet to World Models: The Evolution of Multi-Modal Neural Networks

Attention Residuals: Softmax Attention Over Depth

GRPO and DAPO: A Deep Dive into RL for Reasoning LLMs

From GRPO to GSPO: Group-Based Policy Optimization for LLMs

GRPO and Dr.GRPO: The Math, the Biases, and the Fix

Training Composer 2: How Cursor Builds a Coding Agent Model

Leetcode Problems

Hybrid Attention and MLA: The Tradeoff

Kimi K2.5: Joint Text–Vision Training and the Agent Swarm

Inside DeepSeek's Sparse Attention: From NSA to DSA

AstroPaper 6.0

AstroPaper 5.0

How to add LaTeX Equations in Astro blog posts

How to integrate Giscus comments into AstroPaper

AstroPaper 4.0

How to use Git Hooks to set Created and Modified Dates

AstroPaper 3.0

How to update dependencies of AstroPaper

AstroPaper 2.0

Dynamic OG image generation in AstroPaper blog posts

Predefined color schemes

Customizing AstroPaper theme color schemes

Adding new posts in AstroPaper theme

How to configure AstroPaper theme

Tailwind Typography Plugin

How Do I Develop My Terminal Portfolio Website with React

How Do I Develop My Portfolio Website & Blog