Hey! I’m a Master’s student in Computer Science at Sorbonne Université. I’m interested in understanding how reinforcement learning algorithms really work and why they succeed or fail. Currently, I’m researching overestimation bias in RL and comparing optimistic versus pessimistic approaches, under the supervision of Prof. Olivier Sigaud at ISIR. When I’m not coding, you’ll find me maintaining my homelab with NixOS, contributing to ALIAS, or bouldering.
Estimation biases represent a persistent challenge in reinforcement learning, where errors in value estimation can accumulate through bootstrapping and compromise learning efficiency. Among these …