An Efficient Node Selection Policy for Monte Carlo Tree Search with Neural Networks
Multi-Agent Deep Reinforcement Learning for Multi-Echelon Inventory Management
Simulation Optimization of Conditional Value-at-Risk
Efficient Learning for Selecting Top-m Context-Dependent Designs
A New Likelihood Ratio Method for Training Artificial Neural Networks