Hey! I’m finishing my Master’s in Computer Science (AI2D track) at Sorbonne Université. I’m interested in reinforcement learning because it can achieve impressive performance on real-world tasks, yet we still don’t understand why many algorithms work or fail. I’m currently at ISIR with Prof. Olivier Sigaud, working on a new RL algorithm that we aim to submit to ICLR. I believe in transparent, controllable AI rather than closed black boxes. Outside of my studies, I enjoy reading, running, and writing open-source software, including tools for ALIAS, the CS student association at Sorbonne, and a self-hosted audiobook stack.

Estimation biases represent a persistent challenge in reinforcement learning, where errors in value estimation can accumulate through bootstrapping and compromise learning efficiency. Among these …