increasing the action gap - new operators for reinforcement learning
/ 23