Warehouse Agents Navigation

Compare the DQN and PPO algorithms in a warehouse environment. Switch between algorithms, adjust parameters, and visualize training results in real time.

TensorFlow
Deep Q-Network
PPO Algorithm
Training Controls
Configure and run your RL algorithms

Default: 146 packages (all the packages that currently exist in this warehouse)

Current: DQN - Warehouse

Deep Q-Network uses experience replay and target networks for stable learning.
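
The two mechanisms named above can be sketched in a few lines of plain Python. This is a minimal illustration, not the code behind this page: the names ReplayBuffer, td_target, and sync_target, the dict-based weights, and the gamma default are all assumptions made for the example.

```python
import random
from collections import deque


class ReplayBuffer:
    """Fixed-size store of past transitions (hypothetical helper)."""

    def __init__(self, capacity):
        # Oldest transitions are dropped automatically once capacity is reached.
        self.buffer = deque(maxlen=capacity)

    def add(self, state, action, reward, next_state, done):
        self.buffer.append((state, action, reward, next_state, done))

    def sample(self, batch_size):
        # Uniform random sampling breaks the correlation between consecutive steps.
        return random.sample(list(self.buffer), batch_size)

    def __len__(self):
        return len(self.buffer)


def td_target(reward, next_q_values, done, gamma=0.99):
    """Bootstrapped target; next_q_values should come from the target network."""
    if done:
        return reward
    return reward + gamma * max(next_q_values)


def sync_target(online_weights, target_weights):
    """Hard update: copy the online weights into the target network (dicts here)."""
    target_weights.clear()
    target_weights.update(online_weights)
```

In a typical setup the agent would add one transition per step, sample a batch once the buffer is large enough, and call sync_target only every few hundred or thousand steps so that the targets change slowly.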

Environment Visualization
Real-time view of warehouse robots managing packages

Ready to Start Training

Click "Start Training" to begin the DQN simulation

Training Metrics
Real-time performance metrics during training

Episode Reward

Current: 0.0

Episode Training Loss

Updated per episode
Current: 0.000

Epsilon (Exploration)

Current: 1.000
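
An epsilon of 1.000 means every action is chosen at random at the start of training. The sketch below shows one common way such a value is used and reduced over time. The page does not state its schedule, so the multiplicative decay, the decay rate, and the epsilon_min floor are assumptions, as are the function names.

```python
import random


def select_action(q_values, epsilon, rng=random):
    """Epsilon-greedy: explore with probability epsilon, otherwise act greedily."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))  # random action (exploration)
    # Index of the largest Q-value (exploitation).
    return max(range(len(q_values)), key=lambda a: q_values[a])


def decay_epsilon(epsilon, decay=0.995, epsilon_min=0.05):
    """Multiplicative decay with a lower bound (assumed schedule)."""
    return max(epsilon_min, epsilon * decay)
```

Calling decay_epsilon once per episode moves the agent gradually from exploring the warehouse to exploiting what it has learned, while the floor keeps a small amount of exploration.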