Hey! I’m a Master’s 2 student in Computer Science (AI2D track) at Sorbonne Université. I’m interested in reinforcement learning because it can achieve incredible performance in real-world tasks, yet we still don’t understand why many algorithms work or fail. I recently researched estimation bias in RL with Prof. Olivier Sigaud at ISIR. I believe in transparent, controllable AI rather than closed black boxes. Outside of my studies, I’m bouldering or developing open-source tools for ALIAS, including a hackathon organization platform for CS student associations.

Estimation biases represent a persistent challenge in reinforcement learning, where errors in value estimation can accumulate through bootstrapping and compromise learning efficiency. Among these …