Multi-Armed Bandit Problem Example

Version 1.0.1 (153.6 KB) by Toshiaki Takeuchi
Learn how to implement two basic but powerful strategies to solve multi-armed bandit problems with MATLAB.
667 downloads
Updated 10 Jan 2019

View License

Casino slot machines have a playful nickname, the "one-armed bandit", because of the single lever they have and our tendency to lose money when we play them.
An ordinary slot machine has only one lever. What if you had multiple levers to pull, each with a different payout? That is a multi-armed bandit. You don't know which lever has the highest payout; you have to try different levers to see which one works best. But for how long? If you keep pulling a low-payout lever, you forgo rewards, yet you won't know which lever is good until you have tried each one a sufficient number of times.
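The setting is easy to simulate. Below is a minimal MATLAB sketch (not part of the submission) of a five-armed bandit whose levers pay out 1 with hidden probabilities; the trueProbs values and the pull helper are made-up assumptions for illustration only.

% Minimal sketch of a multi-armed bandit environment (illustrative only).
% The payout probabilities below are made up; a real bandit hides them.
rng(1);                                     % reproducible randomness
trueProbs = [0.20 0.45 0.60 0.35 0.50];     % hidden payout probability per lever
nArms = numel(trueProbs);

pull = @(arm) double(rand() < trueProbs(arm));   % pulling a lever pays 0 or 1

% Pulling only one lever tells you nothing about the others:
rewards = arrayfun(@(k) pull(2), 1:100);    % pull lever 2 a hundred times
fprintf('Average reward from lever 2: %.2f\n', mean(rewards));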

Bandit algorithms belong to the field of machine learning called reinforcement learning. Rather than learning from explicit training data or discovering patterns in static data, reinforcement learning discovers the best option through trial and error on live examples. Multi-armed bandits focus on the exploration vs. exploitation trade-off: how much effort should be spent on trial and error versus maximizing the benefit of the best option found so far. There are many different formulations of bandit problems and strategies to solve them; a sketch of one simple strategy follows.
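As a concrete illustration of the trade-off, here is a minimal epsilon-greedy sketch in MATLAB. Epsilon-greedy is one common baseline strategy, not necessarily one of the two strategies implemented in this submission; it reuses the assumed pull helper and nArms variable from the sketch above.

% Minimal epsilon-greedy sketch (a common baseline; the submission's own
% strategies may differ). Assumes pull and nArms from the sketch above.
epsilon = 0.1;                  % fraction of pulls spent exploring at random
nSteps  = 1000;
Q = zeros(1, nArms);            % estimated payout of each lever
N = zeros(1, nArms);            % number of pulls of each lever

for t = 1:nSteps
    if rand() < epsilon
        arm = randi(nArms);     % explore: pick a random lever
    else
        [~, arm] = max(Q);      % exploit: pick the current best estimate
    end
    r = pull(arm);
    N(arm) = N(arm) + 1;
    Q(arm) = Q(arm) + (r - Q(arm)) / N(arm);   % incremental sample mean
end

[~, best] = max(Q);
fprintf('Estimated best lever: %d (estimated payout %.2f)\n', best, Q(best));

A larger epsilon spends more pulls exploring (learning the payouts faster) at the cost of exploiting the best-known lever less often; that is the trade-off described above.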

Cite As

Toshiaki Takeuchi (2024). Multi-Armed Bandit Problem Example (https://www.mathworks.com/matlabcentral/fileexchange/69598-multi-armed-bandit-problem-example), MATLAB Central File Exchange. Retrieved .

MATLAB Release Compatibility
Created with R2018b
Compatible with R2018b and later releases
Platform Compatibility
Windows macOS Linux
Categories
Find more on Filter Banks in Help Center and MATLAB Answers

Version History

1.0.1: Added an image
1.0.0