TACC:  Starting up job 3496458 
TACC:  Starting parallel tasks... 
warning: variables which starts with __, is a module or class declaration are omitted
process rank 0 is bound to device 0
distributed environment is initialzied
model is created
warning: variables which starts with __, is a module or class declaration are omitted
process rank 3 is bound to device 3
Files already downloaded and verified
Files already downloaded and verified
warning: variables which starts with __, is a module or class declaration are omitted
process rank 2 is bound to device 2
Files already downloaded and verified
Files already downloaded and verified
Files already downloaded and verified
Files already downloaded and verified
training and testing dataloaders are created
loss is created
warning: variables which starts with __, is a module or class declaration are omitted
process rank 7 is bound to device 3
Files already downloaded and verified
Files already downloaded and verified
warning: variables which starts with __, is a module or class declaration are omitted
process rank 6 is bound to device 2
Files already downloaded and verified
Files already downloaded and verified
optimizer is created
start training
warning: variables which starts with __, is a module or class declaration are omitted
process rank 4 is bound to device 0
Files already downloaded and verified
Files already downloaded and verified
warning: variables which starts with __, is a module or class declaration are omitted
process rank 5 is bound to device 1
Files already downloaded and verified
Files already downloaded and verified
warning: variables which starts with __, is a module or class declaration are omitted
process rank 1 is bound to device 1
Files already downloaded and verified
Files already downloaded and verified
epoch: 0, train loss: 1.936693473738067
epoch: 1, train loss: 1.627108974116189
epoch: 1, eval loss: 1.5279120564460755, correct: 4576, total: 10000, acc = 0.4575999975204468
epoch: 2, train loss: 1.438910031805233
epoch: 3, train loss: 1.3184991053172521
epoch: 3, eval loss: 1.3557079970836639, correct: 5129, total: 10000, acc = 0.5128999948501587
epoch: 4, train loss: 1.271946340191121
epoch: 5, train loss: 1.2340542175331894
epoch: 5, eval loss: 1.207822185754776, correct: 5703, total: 10000, acc = 0.5702999830245972
epoch: 6, train loss: 1.187913371592152
epoch: 7, train loss: 1.154962458172623
epoch: 7, eval loss: 1.0685692846775054, correct: 6100, total: 10000, acc = 0.6100000143051147
epoch: 8, train loss: 1.1158924905621275
epoch: 9, train loss: 1.0909727805731249
epoch: 9, eval loss: 1.0345157146453858, correct: 6328, total: 10000, acc = 0.6327999830245972
epoch: 10, train loss: 1.0725988399009316
epoch: 11, train loss: 1.0453423085261364
epoch: 11, eval loss: 0.9778846323490142, correct: 6543, total: 10000, acc = 0.6542999744415283
epoch: 12, train loss: 1.0397504823548454
epoch: 13, train loss: 1.011059400986652
epoch: 13, eval loss: 0.9668682873249054, correct: 6446, total: 10000, acc = 0.644599974155426
epoch: 14, train loss: 0.9938353963044225
epoch: 15, train loss: 0.9691349967401854
epoch: 15, eval loss: 0.9465512812137604, correct: 6657, total: 10000, acc = 0.6656999588012695
epoch: 16, train loss: 0.9470896617490419
epoch: 17, train loss: 0.927201622602891
epoch: 17, eval loss: 0.8875106543302536, correct: 6837, total: 10000, acc = 0.6836999654769897
epoch: 18, train loss: 0.8975223132542202
epoch: 19, train loss: 0.8810242603019792
epoch: 19, eval loss: 0.8688296616077423, correct: 6832, total: 10000, acc = 0.6832000017166138
epoch: 20, train loss: 0.8482622784011218
epoch: 21, train loss: 0.8266285700457436
epoch: 21, eval loss: 0.7801274597644806, correct: 7205, total: 10000, acc = 0.7204999923706055
epoch: 22, train loss: 0.8038581859092323
epoch: 23, train loss: 0.7879118153027126
epoch: 23, eval loss: 0.7779350578784943, correct: 7203, total: 10000, acc = 0.7202999591827393
epoch: 24, train loss: 0.7542270896386127
epoch: 25, train loss: 0.7369782894241567
epoch: 25, eval loss: 0.7534965008497239, correct: 7362, total: 10000, acc = 0.7361999750137329
epoch: 26, train loss: 0.7095995545387268
epoch: 27, train loss: 0.6873777825005201
epoch: 27, eval loss: 0.7344318777322769, correct: 7381, total: 10000, acc = 0.738099992275238
epoch: 28, train loss: 0.6713967414534822
epoch: 29, train loss: 0.650338428969286
epoch: 29, eval loss: 0.677948921918869, correct: 7653, total: 10000, acc = 0.7652999758720398
epoch: 30, train loss: 0.6301205882004329
epoch: 31, train loss: 0.5990057824825754
epoch: 31, eval loss: 0.6719370454549789, correct: 7643, total: 10000, acc = 0.7642999887466431
epoch: 32, train loss: 0.590088236696866
epoch: 33, train loss: 0.5689327443132595
epoch: 33, eval loss: 0.6191721886396409, correct: 7807, total: 10000, acc = 0.7806999683380127
epoch: 34, train loss: 0.5426055670392756
epoch: 35, train loss: 0.5270413601276825
epoch: 35, eval loss: 0.6150132775306701, correct: 7879, total: 10000, acc = 0.7878999710083008
epoch: 36, train loss: 0.5215025428606539
epoch: 37, train loss: 0.4952395400222467
epoch: 37, eval loss: 0.628344652056694, correct: 7868, total: 10000, acc = 0.786799967288971
epoch: 38, train loss: 0.47989121687655545
epoch: 39, train loss: 0.46510300618045186
epoch: 39, eval loss: 0.5977057978510857, correct: 7944, total: 10000, acc = 0.7943999767303467
epoch: 40, train loss: 0.4441945254802704
epoch: 41, train loss: 0.4285763985648447
epoch: 41, eval loss: 0.5695438250899315, correct: 8023, total: 10000, acc = 0.802299976348877
epoch: 42, train loss: 0.41337763776584546
epoch: 43, train loss: 0.3940146170100387
epoch: 43, eval loss: 0.5688270673155784, correct: 8091, total: 10000, acc = 0.8090999722480774
epoch: 44, train loss: 0.37741332303504554
epoch: 45, train loss: 0.36565779605690313
epoch: 45, eval loss: 0.5831407308578491, correct: 8104, total: 10000, acc = 0.8104000091552734
epoch: 46, train loss: 0.3468657017362361
epoch: 47, train loss: 0.32949359198005834
epoch: 47, eval loss: 0.5751512110233307, correct: 8097, total: 10000, acc = 0.8096999526023865
epoch: 48, train loss: 0.3140165246262842
epoch: 49, train loss: 0.29480520498995877
epoch: 49, eval loss: 0.5712087765336037, correct: 8184, total: 10000, acc = 0.818399965763092
epoch: 50, train loss: 0.2766021394303867
epoch: 51, train loss: 0.26527753776433516
epoch: 51, eval loss: 0.5643855139613152, correct: 8218, total: 10000, acc = 0.8217999935150146
epoch: 52, train loss: 0.2525861115784061
epoch: 53, train loss: 0.23714738658496312
epoch: 53, eval loss: 0.5732526823878288, correct: 8249, total: 10000, acc = 0.8248999714851379
epoch: 54, train loss: 0.2238179413335664
epoch: 55, train loss: 0.2119908875652722
epoch: 55, eval loss: 0.5957901775836945, correct: 8261, total: 10000, acc = 0.8260999917984009
epoch: 56, train loss: 0.19989302222217833
epoch: 57, train loss: 0.1875186789096618
epoch: 57, eval loss: 0.5905491337180138, correct: 8290, total: 10000, acc = 0.8289999961853027
epoch: 58, train loss: 0.18436841180129926
epoch: 59, train loss: 0.17459663231762088
epoch: 59, eval loss: 0.589044263958931, correct: 8313, total: 10000, acc = 0.8312999606132507
finish training
