TACC:  Starting up job 3498509 
TACC:  Starting parallel tasks... 
warning: variables which starts with __, is a module or class declaration are omitted
process rank 0 is bound to device 0
distributed environment is initialzied
model is created
Files already downloaded and verified
Files already downloaded and verified
training and testing dataloaders are created
loss is created
optimizer is created
start training
warning: variables which starts with __, is a module or class declaration are omitted
process rank 2 is bound to device 2
Files already downloaded and verified
Files already downloaded and verified
warning: variables which starts with __, is a module or class declaration are omitted
process rank 3 is bound to device 3
Files already downloaded and verified
Files already downloaded and verified
warning: variables which starts with __, is a module or class declaration are omitted
process rank 1 is bound to device 1
Files already downloaded and verified
Files already downloaded and verified
epoch: 0, train loss: 2.107759721425115
epoch: 1, train loss: 1.8388929500871776
epoch: 1, eval loss: 1.7622965753078461, correct: 3535, total: 10000, acc = 0.35349997878074646
epoch: 2, train loss: 1.7141443588295762
epoch: 3, train loss: 1.6003259931291853
epoch: 3, eval loss: 1.608506625890732, correct: 4263, total: 10000, acc = 0.4262999892234802
epoch: 4, train loss: 1.5016733225511045
epoch: 5, train loss: 1.4050611877927974
epoch: 5, eval loss: 1.386299443244934, correct: 4984, total: 10000, acc = 0.4983999729156494
epoch: 6, train loss: 1.3264902623332278
epoch: 7, train loss: 1.2681689250225923
epoch: 7, eval loss: 1.3251740992069245, correct: 5295, total: 10000, acc = 0.5295000076293945
epoch: 8, train loss: 1.2236176984650748
epoch: 9, train loss: 1.172800781775494
epoch: 9, eval loss: 1.1429427027702332, correct: 5966, total: 10000, acc = 0.5965999960899353
epoch: 10, train loss: 1.1335287532027887
epoch: 11, train loss: 1.0974334563527788
epoch: 11, eval loss: 1.1024536848068238, correct: 6107, total: 10000, acc = 0.6107000112533569
epoch: 12, train loss: 1.0638826300903244
epoch: 13, train loss: 1.0406859383291127
epoch: 13, eval loss: 1.0324654281139374, correct: 6282, total: 10000, acc = 0.6281999945640564
epoch: 14, train loss: 1.0157714376644211
epoch: 15, train loss: 0.990898135365272
epoch: 15, eval loss: 0.9790050059556961, correct: 6539, total: 10000, acc = 0.6538999676704407
epoch: 16, train loss: 0.963820260398242
epoch: 17, train loss: 0.9404383374720203
epoch: 17, eval loss: 0.9367435872554779, correct: 6691, total: 10000, acc = 0.6690999865531921
epoch: 18, train loss: 0.9299906589546982
epoch: 19, train loss: 0.9038882474510037
epoch: 19, eval loss: 0.9210823565721512, correct: 6709, total: 10000, acc = 0.6708999872207642
epoch: 20, train loss: 0.8825302799137271
epoch: 21, train loss: 0.8686576388320144
epoch: 21, eval loss: 0.8791542768478393, correct: 6913, total: 10000, acc = 0.6912999749183655
epoch: 22, train loss: 0.8509396040926174
epoch: 23, train loss: 0.8375457452268017
epoch: 23, eval loss: 0.8651147484779358, correct: 6948, total: 10000, acc = 0.6947999596595764
epoch: 24, train loss: 0.8163802222329744
epoch: 25, train loss: 0.8068491317787949
epoch: 25, eval loss: 0.8353333532810211, correct: 7089, total: 10000, acc = 0.708899974822998
epoch: 26, train loss: 0.7894753631280393
epoch: 27, train loss: 0.7779296344640304
epoch: 27, eval loss: 0.8161472469568253, correct: 7143, total: 10000, acc = 0.7142999768257141
epoch: 28, train loss: 0.763744876092794
epoch: 29, train loss: 0.7521962505214068
epoch: 29, eval loss: 0.7903082758188248, correct: 7219, total: 10000, acc = 0.7218999862670898
epoch: 30, train loss: 0.7443178624522929
epoch: 31, train loss: 0.7280340212948468
epoch: 31, eval loss: 0.7877005040645599, correct: 7233, total: 10000, acc = 0.7232999801635742
epoch: 32, train loss: 0.7196985489251663
epoch: 33, train loss: 0.7108793039711154
epoch: 33, eval loss: 0.7838329076766968, correct: 7292, total: 10000, acc = 0.729200005531311
epoch: 34, train loss: 0.6965019471791326
epoch: 35, train loss: 0.6875918537986522
epoch: 35, eval loss: 0.7513678789138794, correct: 7392, total: 10000, acc = 0.7391999959945679
epoch: 36, train loss: 0.6793362346230721
epoch: 37, train loss: 0.6741023343436572
epoch: 37, eval loss: 0.7752945452928544, correct: 7316, total: 10000, acc = 0.7315999865531921
epoch: 38, train loss: 0.6629589072295597
epoch: 39, train loss: 0.6507086388918818
epoch: 39, eval loss: 0.7758691757917404, correct: 7322, total: 10000, acc = 0.7321999669075012
epoch: 40, train loss: 0.6381483582817778
epoch: 41, train loss: 0.6374095179596726
epoch: 41, eval loss: 0.7589699536561966, correct: 7386, total: 10000, acc = 0.738599956035614
epoch: 42, train loss: 0.6251792050137812
epoch: 43, train loss: 0.6148473596086308
epoch: 43, eval loss: 0.7495014071464539, correct: 7478, total: 10000, acc = 0.7477999925613403
epoch: 44, train loss: 0.6119371378908351
epoch: 45, train loss: 0.6012086509441843
epoch: 45, eval loss: 0.725347763299942, correct: 7515, total: 10000, acc = 0.7515000104904175
epoch: 46, train loss: 0.597867566103838
epoch: 47, train loss: 0.5913592832429069
epoch: 47, eval loss: 0.7254288077354432, correct: 7529, total: 10000, acc = 0.7529000043869019
epoch: 48, train loss: 0.5801522807807339
epoch: 49, train loss: 0.575563525666996
epoch: 49, eval loss: 0.7291093468666077, correct: 7533, total: 10000, acc = 0.7532999515533447
epoch: 50, train loss: 0.573031121674849
epoch: 51, train loss: 0.5667383588698446
epoch: 51, eval loss: 0.7240727603435516, correct: 7570, total: 10000, acc = 0.7569999694824219
epoch: 52, train loss: 0.5578772419569443
epoch: 53, train loss: 0.5526659309255834
epoch: 53, eval loss: 0.7226850330829621, correct: 7576, total: 10000, acc = 0.7576000094413757
epoch: 54, train loss: 0.5473246245968099
epoch: 55, train loss: 0.5443006860358375
epoch: 55, eval loss: 0.720612645149231, correct: 7596, total: 10000, acc = 0.7595999836921692
epoch: 56, train loss: 0.5361242987671677
epoch: 57, train loss: 0.5323515981435776
epoch: 57, eval loss: 0.7203025311231613, correct: 7580, total: 10000, acc = 0.7579999566078186
epoch: 58, train loss: 0.5297852404871766
epoch: 59, train loss: 0.5288004583241989
epoch: 59, eval loss: 0.7189624041318894, correct: 7605, total: 10000, acc = 0.7604999542236328
finish training
