Multiple Agents • High-Speed Processing • Zero Libraries
Episodes
0
Total Steps
0
Unlike standard RL, this uses a "Swarm" of agents updating a shared Q-Table. This effectively implements a multi-threaded discovery process.
Higher "Steps/Frame" runs more logic per visual update, allowing the model to train thousands of times faster than real-time.