toIPA

Home
Blog
Category

Buy Me a Coffee at ko-fi.com

Multi-armed bandit

reinforcement learning problem exemplifying the exploration–exploitation tradeoff

Pronunciation

/ˈmʌlti - ɑrmd ˈbændɪt/

Categories

mathematical problem optimization problem

© Copyright toipa.org

Privacy Contact