Buy Me a Coffee at ko-fi.com

Multi-armed bandit

reinforcement learning problem exemplifying the exploration–exploitation tradeoff

Pronunciation
/ˈmʌlti - ɑrmd ˈbændɪt/