el 00:00:00, ep 0000, ts 000015, ar 10 015.0±000.0, 100 015.0±000.0, ex 100 0.3±0.0, ev 009.0±000.0
el 00:01:00, ep 0899, ts 028293, ar 10 130.4±056.3, 100 056.1±040.1, ex 100 0.3±0.1, ev 080.0±050.5
el 00:02:00, ep 1342, ts 066205, ar 10 125.7±081.2, 100 132.6±079.2, ex 100 0.3±0.1, ev 233.2±066.6
el 00:03:00, ep 1631, ts 101018, ar 10 110.9±066.5, 100 126.3±082.1, ex 100 0.3±0.1, ev 312.1±098.1
el 00:04:00, ep 1871, ts 140058, ar 10 296.5±124.0, 100 165.3±122.1, ex 100 0.3±0.1, ev 337.5±118.2
el 00:05:00, ep 2087, ts 176833, ar 10 184.5±084.8, 100 162.1±114.2, ex 100 0.3±0.1, ev 428.0±072.0
el 00:06:00, ep 2299, ts 213879, ar 10 187.4±122.9, 100 181.3±125.7, ex 100 0.3±0.1, ev 456.5±074.0
el 00:06:34, ep 2419, ts 233086, ar 10 211.8±123.1, 100 160.8±117.1, ex 100 0.3±0.1, ev 475.9±059.9
--> reached_goal_mean_reward ✓
Training complete.
Final evaluation score 499.31±4.58 in 188.65s training time, 414.18s wall-clock time.
el 00:00:00, ep 0000, ts 000015, ar 10 015.0±000.0, 100 015.0±000.0, ex 100 0.3±0.0, ev 010.0±000.0
el 00:01:00, ep 0788, ts 028599, ar 10 013.3±004.7, 100 042.2±057.9, ex 100 0.2±0.1, ev 096.0±156.1
el 00:02:00, ep 1242, ts 062379, ar 10 098.8±047.5, 100 062.8±050.6, ex 100 0.3±0.1, ev 131.6±066.6
el 00:03:00, ep 1563, ts 102834, ar 10 201.5±156.3, 100 123.4±100.0, ex 100 0.3±0.1, ev 237.4±090.9
el 00:04:00, ep 1813, ts 139898, ar 10 119.9±058.8, 100 151.4±098.1, ex 100 0.3±0.1, ev 345.9±112.9
el 00:05:00, ep 2050, ts 177897, ar 10 138.1±088.2, 100 155.7±117.1, ex 100 0.3±0.1, ev 400.7±104.2
el 00:06:01, ep 2281, ts 214001, ar 10 189.4±139.4, 100 157.8±114.9, ex 100 0.3±0.1, ev 387.0±102.0
el 00:07:01, ep 2493, ts 250122, ar 10 145.6±124.2, 100 164.3±118.9, ex 100 0.3±0.1, ev 437.7±075.7
el 00:08:01, ep 2707, ts 287658, ar 10 176.0±128.6, 100 169.2±113.1, ex 100 0.3±0.1, ev 419.1±095.2
el 00:08:28, ep 2798, ts 304508, ar 10 180.6±141.6, 100 185.6±123.2, ex 100 0.3±0.1, ev 476.4±060.3
--> reached_goal_mean_reward ✓
Training complete.
Final evaluation score 499.47±2.70 in 233.22s training time, 528.88s wall-clock time.
el 00:00:00, ep 0000, ts 000013, ar 10 013.0±000.0, 100 013.0±000.0, ex 100 0.2±0.0, ev 009.0±000.0
el 00:01:00, ep 0885, ts 025973, ar 10 105.0±066.9, 100 094.7±067.9, ex 100 0.3±0.1, ev 239.7±134.9
el 00:02:00, ep 1229, ts 062492, ar 10 066.1±047.1, 100 108.0±066.9, ex 100 0.3±0.1, ev 217.8±075.3
el 00:03:00, ep 1494, ts 100171, ar 10 116.0±050.0, 100 140.5±102.0, ex 100 0.3±0.1, ev 334.3±091.1
el 00:04:00, ep 1735, ts 136782, ar 10 161.9±102.5, 100 157.2±101.1, ex 100 0.3±0.1, ev 353.4±106.1
el 00:05:00, ep 1963, ts 173753, ar 10 135.8±101.4, 100 166.6±124.8, ex 100 0.3±0.1, ev 416.5±100.4
el 00:06:00, ep 2165, ts 204790, ar 10 113.2±089.1, 100 169.1±105.5, ex 100 0.3±0.1, ev 400.0±100.5
el 00:07:00, ep 2370, ts 236920, ar 10 108.3±059.7, 100 166.8±104.8, ex 100 0.3±0.1, ev 444.6±074.7
el 00:08:00, ep 2562, ts 270419, ar 10 151.0±086.8, 100 175.4±120.2, ex 100 0.3±0.1, ev 442.4±087.7
el 00:09:00, ep 2745, ts 302905, ar 10 201.2±127.1, 100 168.0±112.9, ex 100 0.3±0.1, ev 442.0±089.3
el 00:09:07, ep 2764, ts 306173, ar 10 185.9±132.7, 100 176.3±118.2, ex 100 0.3±0.1, ev 475.6±058.1
--> reached_goal_mean_reward ✓
Training complete.
Final evaluation score 500.00±0.00 in 243.08s training time, 567.50s wall-clock time.
el 00:00:00, ep 0000, ts 000052, ar 10 052.0±000.0, 100 052.0±000.0, ex 100 0.2±0.0, ev 035.0±000.0
el 00:01:00, ep 0749, ts 025988, ar 10 085.8±026.6, 100 095.8±062.5, ex 100 0.3±0.1, ev 195.6±116.2
el 00:02:00, ep 1120, ts 058853, ar 10 095.9±066.0, 100 087.6±096.0, ex 100 0.3±0.1, ev 213.7±136.2
el 00:03:00, ep 1416, ts 092345, ar 10 148.5±134.1, 100 142.6±094.1, ex 100 0.3±0.1, ev 323.2±104.9
el 00:04:00, ep 1685, ts 131111, ar 10 116.1±078.6, 100 134.8±101.0, ex 100 0.3±0.1, ev 282.4±091.9
el 00:05:00, ep 1914, ts 164830, ar 10 133.1±085.1, 100 136.8±090.8, ex 100 0.3±0.1, ev 410.5±112.3
el 00:06:00, ep 2142, ts 198735, ar 10 219.3±129.6, 100 154.6±109.0, ex 100 0.3±0.1, ev 412.5±111.1
el 00:07:00, ep 2355, ts 232257, ar 10 161.4±089.0, 100 164.5±115.5, ex 100 0.3±0.1, ev 430.9±093.2
el 00:08:00, ep 2562, ts 268801, ar 10 208.2±119.9, 100 178.9±134.9, ex 100 0.3±0.1, ev 415.2±113.1
el 00:09:01, ep 2756, ts 301532, ar 10 167.6±135.6, 100 161.4±128.9, ex 100 0.3±0.1, ev 465.1±074.3
el 00:10:01, ep 2956, ts 334390, ar 10 176.4±097.3, 100 168.8±126.0, ex 100 0.3±0.1, ev 433.1±081.4
el 00:10:30, ep 3046, ts 349198, ar 10 134.2±103.3, 100 165.7±107.4, ex 100 0.3±0.1, ev 475.1±049.9
--> reached_goal_mean_reward ✓
Training complete.
Final evaluation score 446.17±49.13 in 277.89s training time, 650.60s wall-clock time.
el 00:00:00, ep 0000, ts 000013, ar 10 013.0±000.0, 100 013.0±000.0, ex 100 0.2±0.0, ev 009.0±000.0
el 00:01:00, ep 0650, ts 028596, ar 10 155.9±114.1, 100 106.3±081.6, ex 100 0.3±0.1, ev 271.9±141.3
el 00:02:00, ep 0901, ts 061422, ar 10 120.8±037.7, 100 144.1±093.5, ex 100 0.3±0.1, ev 409.3±096.6
el 00:03:00, ep 1109, ts 095096, ar 10 167.2±144.6, 100 175.2±112.3, ex 100 0.3±0.0, ev 432.9±085.5
el 00:04:00, ep 1320, ts 127564, ar 10 185.5±124.7, 100 154.4±102.4, ex 100 0.3±0.1, ev 381.7±122.8
el 00:05:00, ep 1532, ts 161720, ar 10 184.1±136.5, 100 164.9±107.0, ex 100 0.3±0.1, ev 450.7±076.5
el 00:05:22, ep 1598, ts 173778, ar 10 126.9±078.9, 100 179.2±127.3, ex 100 0.3±0.1, ev 475.8±043.2
--> reached_goal_mean_reward ✓
Training complete.
Final evaluation score 473.34±35.27 in 140.87s training time, 342.61s wall-clock time.