Approximation Algorithm for RMAB
The paper “Approximation Algorithms for Restless Bandit Problems” by Guha, Munagala, Shi (2009) designed a $2 + \epsilon$-approximation algorithm for a special class of RMAB (“Feedback MAB”, and generalized to “Monotone MAB”). The algorithm is fundamentally different to the classical Whittle Index. The paper’s analysis uses a duality-based algorithmic technique—it is vastly different compared with Weber (1988)’s proof for RMAB’s asymptotic optimality, hence the $2 + \epsilon$ approximation outcome doesn’t requires asymptotic....