PhD Position Reinforcement Learning Theory

Research / Academic

We are looking for a motivated candidate to work on theoretical machine learning, specifically in the domain of sequential decision-making, which includes bandit problems and theoretical reinforcement learning. The primary objective of this project is to analyze and design learning algorithms and provide formal guarantees regarding their performance. To achieve this, you will apply advanced mathematical and statistical techniques. The project is situated in the context of efficient online learning, with a focus on scaling with model complexity and function approximation.
You will be welcomed into our Sequential Decision-Making group, where we focus on various aspects of reinforcement learning. During your PhD, you will have the opportunity to tackle challenging problems related to developing advanced function approximation methods and robust reinforcement learning techniques. You will delve deeply into the rapidly evolving field of reinforcement learning theory, while also exploring relevant areas of mathematics.


  • Hold a master's degree in mathematics, computer science, physics, or a related discipline.
  • Demonstrate eagerness to tackle complex mathematical challenges.
  • Have proficiency in both written and spoken English.
  • Have a good mathematical background, including knowledge of statistics and optimization. A background in machine learning is a plus.

Doing a PhD at TU Delft requires English proficiency at a certain level to ensure that the candidate is able to communicate and interact well, participate in English-taught Doctoral Education courses, and write scientific articles and a final thesis. For more details please check the Graduate Schools Admission Requirements.

Salary Benefits:

Doctoral candidates will be offered a 4-year period of employment in principle, in the form of 2 employment contracts: an initial 1.5-year contract with an official go/no-go progress assessment within 15 months, followed by an additional contract for the remaining 2.5 years, assuming everything goes well and performance requirements are met.
Salary and benefits are in accordance with the Collective Labour Agreement for Dutch Universities, increasing from € 2770 per month in the first year to € 3539 in the fourth year. As a PhD candidate you will be enrolled in the TU Delft Graduate School. The TU Delft Graduate School provides an inspiring research environment with an excellent team of supervisors, academic staff and a mentor. The Doctoral Education Programme is aimed at developing your transferable, discipline-related and research skills.
TU Delft offers a customisable compensation package, discounts on health insurance, and a monthly work costs contribution. Flexible work schedules can be arranged.
For international applicants, TU Delft has the Coming to Delft Service. This service provides information for new international employees to help them prepare for relocation and settle in the Netherlands. The Coming to Delft Service also offers a Dual Career Programme for partners and organises events to help you expand your (social) network.

Work Hours:

36 - 40 hours per week


Mekelweg 2