Ph.D. student at Shanghai Jiao Tong University.
- Shanghai Jiao Tong University
- Shanghai
- ChangWinde.github.io
Pinned
- PiCor (public): [AAAI 2023] Official code for "PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction".
- RyanLiu112/MRN (public): [NeurIPS 2022] Official codebase for "Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learning".
- zowiezhang/Amulet (public): Official repository for the ICLR 2025 paper "Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs".
Python 6