
UCB
UCB, or Upper Confidence Bound, is a method used in decision-making algorithms to balance exploring new options and exploiting known ones. It assigns a confidence score to each choice, which combines its average success rate with an uncertainty measure about its potential. This way, options that haven't been tried much are given a chance to be tested, while still favoring those with better historical performance. UCB helps algorithms make smarter decisions over time, especially in situations where information is limited or uncertain, by systematically exploring options that could turn out to be optimal.