PyXAB - A Python Library for X-Armed Bandit and Online Blackbox Optimization Algorithms
Updated Oct 24, 2024 - Python
Yahoo! news article recommendation system using LinUCB
Bandit algorithms
Python implementations of the UCB, EXP3, and epsilon-greedy algorithms
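Of these, epsilon-greedy is the simplest to state. A minimal sketch, assuming arms are passed as zero-argument reward callables (the function name and interface are illustrative, not taken from any listed repo):

```python
import random

def epsilon_greedy(rewards, n_rounds, epsilon=0.1, seed=0):
    """Sketch of epsilon-greedy: with probability epsilon pull a random
    arm (explore), otherwise pull the arm with the best empirical mean
    (exploit). `rewards[i]()` samples arm i's reward."""
    rng = random.Random(seed)
    k = len(rewards)
    counts = [0] * k
    means = [0.0] * k
    total = 0.0
    for _ in range(n_rounds):
        if rng.random() < epsilon:
            arm = rng.randrange(k)                        # explore
        else:
            arm = max(range(k), key=lambda i: means[i])   # exploit
        r = rewards[arm]()
        counts[arm] += 1
        means[arm] += (r - means[arm]) / counts[arm]      # running mean
        total += r
    return total, counts
```

With a clearly better arm, the exploit branch locks onto it after a handful of exploratory pulls, so the better arm accumulates the large majority of the pull budget.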
Personal reimplementation of some ML algorithms for learning purposes
A benchmark for testing decision-making algorithms on contextual bandits. The library implements a variety of algorithms (many based on approximate Bayesian neural networks and Thompson sampling) and a number of real and synthetic data problems exhibiting a diverse set of properties.
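The Thompson sampling baseline in such benchmarks is compact. A minimal Beta-Bernoulli sketch, assuming binary rewards (the names and reward interface are illustrative assumptions, not this benchmark's API):

```python
import random

def thompson_bernoulli(rewards, n_rounds, seed=0):
    """Sketch of Beta-Bernoulli Thompson sampling: keep a Beta(a, b)
    posterior per arm, sample one value from each posterior, pull the
    argmax, and update with the observed 0/1 reward."""
    rng = random.Random(seed)
    k = len(rewards)
    a = [1] * k          # prior successes + 1 (uniform Beta(1, 1) prior)
    b = [1] * k          # prior failures + 1
    counts = [0] * k
    for _ in range(n_rounds):
        arm = max(range(k), key=lambda i: rng.betavariate(a[i], b[i]))
        r = rewards[arm](rng)          # 0/1 reward sample for this arm
        a[arm] += r
        b[arm] += 1 - r
        counts[arm] += 1
    return counts
```

The posterior of a clearly better arm concentrates above the others, so its sampled values win the argmax almost every round once enough data accumulates.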
A repository for learning the most popular MAB and CMAB algorithms and watching how they run, aimed at those starting out with these topics.
🐯 Replica of "Auction-based combinatorial multi-armed bandit mechanisms with strategic arms"
Pricing and advertising strategy for the e-commerce of an airline company, based on Multi-Armed Bandits (MABs) algorithms and Gaussian Processes. Simulations include non-stationary environments.
Python library of bandits and RL agents in different real-world environments
A short implementation of bandit algorithms - ETC, UCB, MOSS and KL-UCB
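UCB illustrates the common structure shared by these index-based algorithms. A minimal UCB1 sketch, assuming arms are reward callables that take an RNG (the interface is an assumption, not taken from the repo):

```python
import math
import random

def ucb1(rewards, n_rounds, seed=0):
    """Sketch of UCB1: pull each arm once, then repeatedly pull the arm
    maximising empirical mean + sqrt(2 ln t / n_i) (optimism bonus)."""
    rng = random.Random(seed)
    k = len(rewards)
    counts = [0] * k
    means = [0.0] * k
    for t in range(1, n_rounds + 1):
        if t <= k:
            arm = t - 1                # initialisation: each arm once
        else:
            arm = max(range(k), key=lambda i:
                      means[i] + math.sqrt(2 * math.log(t) / counts[i]))
        r = rewards[arm](rng)
        counts[arm] += 1
        means[arm] += (r - means[arm]) / counts[arm]   # running mean
    return counts, means
```

ETC, MOSS, and KL-UCB all follow the same loop and differ only in how the index (or the explore/commit schedule) is computed.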
Non-stationary Bandits and Meta-Learning with a Small Set of Optimal Arms
Python implementations of reinforcement learning algorithms -- bandit algorithms, MDPs, dynamic programming (value/policy iteration), and model-free control (off-policy Monte Carlo, Q-learning)
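The Q-learning update at the heart of model-free control is a single line. A minimal tabular sketch, where the table layout and terminal-state convention are illustrative assumptions, not this repo's code:

```python
def q_learning_update(Q, s, a, r, s_next, alpha=0.1, gamma=0.9):
    """Sketch of the tabular Q-learning (off-policy TD control) update:
    Q(s,a) <- Q(s,a) + alpha * (r + gamma * max_a' Q(s',a') - Q(s,a)).
    Q maps each state to a list of action values; s_next=None marks a
    terminal transition, whose future value is 0."""
    best_next = max(Q[s_next]) if s_next is not None else 0.0
    Q[s][a] += alpha * (r + gamma * best_next - Q[s][a])
    return Q[s][a]
```

Because the target uses max over next-state actions rather than the action the behaviour policy actually took, the update is off-policy.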
Multi-Objective Multi-Armed Bandit
Implementation for NeurIPS 2020 paper "Locally Differentially Private (Contextual) Bandits Learning" (https://arxiv.org/abs/2006.00701)
A comprehensive Python library implementing a variety of contextual and non-contextual multi-armed bandit algorithms (LinUCB, Epsilon-Greedy, Upper Confidence Bound (UCB), Thompson Sampling, KernelUCB, NeuralLinearBandit, and DecisionTreeBandit), designed for reinforcement learning applications.
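The disjoint LinUCB model such libraries implement keeps a per-arm ridge regression and adds an exploration bonus to the predicted reward. A minimal sketch, where the class and function names are illustrative, not this library's API:

```python
import numpy as np

class LinUCBArm:
    """Sketch of one arm in disjoint LinUCB: ridge regression on context
    features plus an upper-confidence bonus alpha * sqrt(x^T A^-1 x)."""
    def __init__(self, dim, alpha=1.0):
        self.A = np.eye(dim)        # regularised Gram matrix (ridge prior)
        self.b = np.zeros(dim)      # reward-weighted sum of contexts
        self.alpha = alpha

    def ucb(self, x):
        A_inv = np.linalg.inv(self.A)
        theta = A_inv @ self.b      # ridge estimate of this arm's weights
        return theta @ x + self.alpha * np.sqrt(x @ A_inv @ x)

    def update(self, x, reward):
        self.A += np.outer(x, x)
        self.b += reward * x

def choose(arms, x):
    """Pull the arm with the highest upper confidence bound at context x."""
    return max(range(len(arms)), key=lambda i: arms[i].ucb(x))
```

Each arm's bonus shrinks as it accumulates observations, so exploration concentrates on arms whose contexts are still poorly estimated.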
The official code repo for HyperAgent for neural bandits and GPT-HyperAgent for content moderation.
DPE - code used in "Optimal Algorithms for Multiplayer Multi-Armed Bandits" (AISTATS 2020)
An illustrative project including some multi-armed bandit algorithms and contextual bandit algorithms