#

bandit-algorithms

Here are 86 public repositories matching this topic...

SMPyBandits

SMPyBandits / SMPyBandits

🔬 Research Framework for Single and Multi-Players 🎰 Multi-Arms Bandits (MAB) Algorithms, implementing all the state-of-the-art algorithms for single-player (UCB, KL-UCB, Thompson...) and multi-player (MusicalChair, MEGA, rhoRand, MCTop/RandTopM etc).. Available on PyPI: https://pypi.org/project/SMPyBandits/ and documentation on

python open-source research internet-of-things simulations multi-arm-bandits multi-armed-bandit learning-theory bandit-algorithms cognitive-radio

Updated Apr 30, 2024
Jupyter Notebook

c-bata / goptuna

A hyperparameter optimization framework, inspired by Optuna.

bayesian-optimization evolution-strategies blackbox-optimization bandit-algorithms

Updated Aug 30, 2024
Go

WilliamLwj / PyXAB

PyXAB - A Python Library for X-Armed Bandit and Online Blackbox Optimization Algorithms

data-science machine-learning algorithm reinforcement-learning optimization machine-learning-algorithms hyperparameter-optimization hyperparameter-tuning optimization-algorithms online-learning automl blackbox-optimization bandit-algorithms lipschitz-bandit x-armed-bandit continuous-armed-bandit

Updated Oct 24, 2024
Python

KKeishiro / Yahoo_recommendation

Yahoo! news article recommendation system by linUCB

recommendation-system contextual-bandit bandit-algorithms linucb

Updated Feb 1, 2018
Python

gdmarmerola / interactive-intro-rl

Big Data's open seminars: An Interactive Introduction to Reinforcement Learning

machine-learning reinforcement-learning bandit-algorithms

Updated Jun 7, 2021
Jupyter Notebook

sshkhr / Practical_RL

My solutions to Yandex Practical Reinforcement Learning course in PyTorch and Tensorflow

reinforcement-learning tensorflow deep-reinforcement-learning pytorch policy-gradient evolutionary-algorithms markov-decision-processes td-learning monte-carlo-sampling bandit-algorithms

Updated Dec 22, 2021
Jupyter Notebook

Alanthink / banditpylib

A lightweight python library for bandit algorithms

bandit-algorithms

Updated Jul 21, 2022
Python

niffler92 / Bandit

Bandit algorithms

simulation thompson-sampling multiarm-bandit contextual-bandit bandit-algorithms linucb

Updated Oct 12, 2017
Python

doerlbh / MiniVox

Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".

paper speaker-recognition online-learning speaker-diarization contextual-bandits bandit-algorithms interspeech self-supervised-learning acml interspeech2020 online-speaker-diarization

Updated Sep 20, 2021
Cuda

kulinshah98 / Multi-Armed-Bandit-Algorithms

Python implementation of UCB, EXP3 and Epsilon greedy algorithms

epsilon-greedy multi-armed-bandits upper-confidence-bounds bandit-algorithms stochastic-bandit-algorithms adversarial-bandit-algorithms exp3-algorithm

Updated Oct 4, 2018
Python

gdmarmerola / advanced-bandit-problems

More about the exploration-exploitation tradeoff with harder bandits

machine-learning multi-armed-bandit bandit-algorithms

Updated May 12, 2019
Jupyter Notebook

mmalekzadeh / privacy-preserving-bandits

Privacy-Preserving Bandits (MLSys'20)

machine-learning reinforcement-learning recommender-system recommendation bandit-learning differential-privacy contextual-bandits bandit-algorithm federated-learning bandit-algorithms privacy-preserving-machine-learning online-machine-learning criteo-dataset differentially-private privacy-preserving-bandits

Updated Dec 8, 2022
Jupyter Notebook

ZIYU-DEEP / Awesome-Papers-on-Combinatorial-Semi-Bandit-Problems

A curated list on papers about combinatorial multi-armed bandit problems.

thompson-sampling multi-armed-bandit combinatorial-optimization bandit-algorithms combinatorial-bandit

Updated May 10, 2021

rssalessio / reading-list

This is a collection of interesting papers that I have read so far or want to read. Note that the list is not up-to-date. Topics: reinforcement learning, deep learning, mathematics, statistics, bandit algorithms, optimization.

learning machine-learning statistics reinforcement-learning deep-learning optimization reading-list bandit-algorithms

Updated Sep 5, 2023

sparsh-ai / reco-bandit

Building recommender Systems using contextual bandit methods to address cold-start issue and online real-time learning

recommender-system contextual-bandits bandit-algorithms

Updated Jul 1, 2021
Jupyter Notebook

gokceuludogan / interactive-music-recommendation

Personalized and Interactive Music Recommendation with Bandit approach

music-recommendation bandit-algorithms exploration-exploitation bayes-ucb

Updated Sep 15, 2019
Jupyter Notebook

MaxenceGiraud / MachineLearningAlgos

Personal reimplementation of some ML algorithms for learning purposes

Updated Jul 13, 2021
Python

babaniyi / Deep-contextual-bandits

A benchmark to test decision-making algorithms for contextual-bandits. The library implements a variety of algorithms (many of them based on approximate Bayesian Neural Networks and Thompson sampling), and a number of real and syntethic data problems exhibiting a diverse set of properties.

bandits bandit-algorithms multiarmed-bandits

Updated Jan 26, 2022
Python

ngutowski / algossim

This repository aims at learning most popular MAB and CMAB algorithms and watch how they run. It is interesting for those wishing to start learning these topics.

recommendation-system artificial-intelligence-algorithms contextual-bandits bandit-algorithms

Updated Dec 7, 2021
Python

DURUII / Replica-AUCB

🐯REPLICA of "Auction-based combinatorial multi-armed bandit mechanisms with strategic arms"

multi-armed-bandit bandits mab cmab bandit-algorithms aution aucb

Updated Dec 17, 2023
Python

Improve this page

Add a description, image, and links to the bandit-algorithms topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the bandit-algorithms topic, visit your repo's landing page and select "manage topics."