Upper confidence bound (UCB)

  1. Multi-Armed Bandit: AI (Brace For These Hidden GPT Dangers)
  2. Thompson Sampling: AI (Brace For These Hidden GPT Dangers)