Multi-Armed-Bandit on The Kiseki Log

Recent content in Multi-Armed-Bandit on The Kiseki Log
Link: /tags/multi-armed-bandit/
Generator: Hugo 0.147.7
Language: en
Copyright: 2023-2026 Shichao Song, CC BY-SA 4.0
Last updated: Fri, 15 May 2026 11:10:58 +0800

Fully Annotated Guide to "The Multi-Armed Bandit Problem and Its Solutions"
Link: /posts/260430-multi-armed-bandit/
Published: Thu, 30 Apr 2026 14:25:31 +0800

The multi-armed bandit problem is a classic exploration–exploitation dilemma in reinforcement learning. Lilian Weng’s post is an excellent introduction, but some of its mathematical details and motivations can be cryptic. This article annotates it with step-by-step explanations and supplementary notes.