ninaboord / modified-thompson-sampling
Thompson sampling is often regarded as the current "best" algorithm for the multi-armed bandit problem, though this remains an area of ongoing research. Through a lot of trial and error, I found a unique, simple modification, and in all of my simulations the modified algorithm outperformed standard Thompson sampling. I received a distinction for this project.
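For context, here is a minimal sketch of the baseline being compared against: standard Thompson sampling for Bernoulli rewards with a uniform Beta(1, 1) prior on each arm. This is not the modification described above, which is not spelled out here. The function name `thompson_sampling` and its parameters (`true_probs`, `n_rounds`, `seed`) are hypothetical and chosen only for illustration.

```python
import random


def thompson_sampling(true_probs, n_rounds, seed=0):
    """Baseline Beta-Bernoulli Thompson sampling (illustrative sketch).

    true_probs: hypothetical success probability of each arm (hidden from the agent).
    Returns the total reward and the number of pulls per arm.
    """
    rng = random.Random(seed)
    n_arms = len(true_probs)
    successes = [0] * n_arms  # observed rewards of 1 per arm
    failures = [0] * n_arms   # observed rewards of 0 per arm
    total_reward = 0

    for _ in range(n_rounds):
        # Draw one sample per arm from its Beta posterior (Beta(1, 1) prior).
        samples = [
            rng.betavariate(successes[i] + 1, failures[i] + 1)
            for i in range(n_arms)
        ]
        # Play the arm whose sampled success rate is highest.
        arm = max(range(n_arms), key=lambda i: samples[i])

        # Simulate a Bernoulli reward from the chosen arm.
        reward = 1 if rng.random() < true_probs[arm] else 0
        if reward:
            successes[arm] += 1
        else:
            failures[arm] += 1
        total_reward += reward

    pulls = [successes[i] + failures[i] for i in range(n_arms)]
    return total_reward, pulls


if __name__ == "__main__":
    reward, pulls = thompson_sampling([0.2, 0.5, 0.8], n_rounds=2000, seed=1)
    print("total reward:", reward)
    print("pulls per arm:", pulls)
```

Any variant can be compared against this baseline by running both on the same arm probabilities and seeds and comparing total reward (or cumulative regret) over many runs.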