Comparison of DQN and PPO algorithms in a warehouse. Switch between algorithms, adjust parameters, and visualize training results in real-time.
Default: 146 packages (all the packages currently exist in this warehouse)
Deep Q-Network uses experience replay and target networks for stable learning.
Ready to Start Training
Click "Start Training" to begin the dqn simulation