BBMDP accepted at NeurIPS 2025
Our paper A Markov Decision Process for Variable Selection in Branch & Bound has been accepted at NeurIPS 2025!
This work introduces BBMDP, a principled vanilla MDP formulation for the branching problem in branch-and-bound (B&B) solvers. Unlike prior RL approaches that relied on approximate TreeMDP formulations — which do not satisfy standard Markov properties — BBMDP models the full state of the search tree, enabling the use of standard RL algorithms with theoretical convergence guarantees.
Our agent, DQN-BBMDP, is evaluated on four standard MILP benchmarks and achieves:
- 27% reduction in node count vs. prior RL state-of-the-art (DQN-Retro)
- 38% reduction in solving time
- Strong transfer performance on larger, unseen instances
Authors: Paul Strang, Zacharie Alès, Côme Bissuel, Olivier Juan, Safia Kedad-Sidhoum, Emmanuel Rachelson
Links: Project page · arXiv · OpenReview · Code