INFO - 06/21/21 23:13:46 - 0:00:00 - ============ Initialized logger ============ INFO - 06/21/21 23:13:46 - 0:00:00 - batch_size: 32 data_folder: /gpfs/fs1/projects/gpu_adlr/datasets/zihanl/conll2003 dropout: 0.1 dump_path: logs/conll2003/1 early_stop: 3 epoch: 300 exp_id: 1 exp_name: conll2003 hidden_dim: 1024 logger_filename: train.log lr: 5e-05 model_name: roberta-large num_tag: 3 saved_folder: /gpfs/fs1/projects/gpu_adlr/datasets/zihanl/checkpoints/ner_model seed: 555 INFO - 06/21/21 23:13:46 - 0:00:00 - The experiment will be stored in logs/conll2003/1 INFO - 06/21/21 23:25:29 - 0:00:00 - ============ Initialized logger ============ INFO - 06/21/21 23:25:29 - 0:00:00 - batch_size: 32 data_folder: /gpfs/fs1/projects/gpu_adlr/datasets/zihanl/conll2003 dropout: 0.1 dump_path: logs/conll2003/1 early_stop: 3 epoch: 300 exp_id: 1 exp_name: conll2003 hidden_dim: 1024 logger_filename: train.log lr: 5e-05 model_name: roberta-large num_tag: 3 saved_folder: /gpfs/fs1/projects/gpu_adlr/datasets/zihanl/checkpoints/ner_model seed: 555 INFO - 06/21/21 23:25:29 - 0:00:00 - The experiment will be stored in logs/conll2003/1 DEBUG - 06/21/21 23:25:29 - 0:00:00 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:25:29 - 0:00:01 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/config.json HTTP/1.1" 200 0 DEBUG - 06/21/21 23:25:29 - 0:00:01 - Attempting to acquire lock 22598820184656 on /root/.cache/huggingface/transformers/dea67b44b38d504f2523f3ddb6acb601b23d67bee52c942da336fa1283100990.94cae8b3a8dbab1d59b9d4827f7ce79e73124efa6bb970412cd503383a95f373.lock INFO - 06/21/21 23:25:29 - 0:00:01 - Lock 22598820184656 acquired on /root/.cache/huggingface/transformers/dea67b44b38d504f2523f3ddb6acb601b23d67bee52c942da336fa1283100990.94cae8b3a8dbab1d59b9d4827f7ce79e73124efa6bb970412cd503383a95f373.lock DEBUG - 06/21/21 23:25:29 - 0:00:01 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:25:30 - 0:00:01 - https://huggingface.co:443 "GET /roberta-large/resolve/main/config.json HTTP/1.1" 200 482 DEBUG - 06/21/21 23:25:30 - 0:00:01 - Attempting to release lock 22598820184656 on /root/.cache/huggingface/transformers/dea67b44b38d504f2523f3ddb6acb601b23d67bee52c942da336fa1283100990.94cae8b3a8dbab1d59b9d4827f7ce79e73124efa6bb970412cd503383a95f373.lock INFO - 06/21/21 23:25:30 - 0:00:01 - Lock 22598820184656 released on /root/.cache/huggingface/transformers/dea67b44b38d504f2523f3ddb6acb601b23d67bee52c942da336fa1283100990.94cae8b3a8dbab1d59b9d4827f7ce79e73124efa6bb970412cd503383a95f373.lock DEBUG - 06/21/21 23:25:30 - 0:00:01 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:25:30 - 0:00:01 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/vocab.json HTTP/1.1" 200 0 DEBUG - 06/21/21 23:25:30 - 0:00:01 - Attempting to acquire lock 22598820184656 on /root/.cache/huggingface/transformers/7c1ba2435b05451bc3b4da073c8dec9630b22024a65f6c41053caccf2880eb8f.d67d6b367eb24ab43b08ad55e014cf254076934f71d832bbab9ad35644a375ab.lock INFO - 06/21/21 23:25:30 - 0:00:01 - Lock 22598820184656 acquired on /root/.cache/huggingface/transformers/7c1ba2435b05451bc3b4da073c8dec9630b22024a65f6c41053caccf2880eb8f.d67d6b367eb24ab43b08ad55e014cf254076934f71d832bbab9ad35644a375ab.lock DEBUG - 06/21/21 23:25:30 - 0:00:01 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:25:30 - 0:00:01 - https://huggingface.co:443 "GET /roberta-large/resolve/main/vocab.json HTTP/1.1" 200 898823 DEBUG - 06/21/21 23:25:30 - 0:00:02 - Attempting to release lock 22598820184656 on /root/.cache/huggingface/transformers/7c1ba2435b05451bc3b4da073c8dec9630b22024a65f6c41053caccf2880eb8f.d67d6b367eb24ab43b08ad55e014cf254076934f71d832bbab9ad35644a375ab.lock INFO - 06/21/21 23:25:30 - 0:00:02 - Lock 22598820184656 released on /root/.cache/huggingface/transformers/7c1ba2435b05451bc3b4da073c8dec9630b22024a65f6c41053caccf2880eb8f.d67d6b367eb24ab43b08ad55e014cf254076934f71d832bbab9ad35644a375ab.lock DEBUG - 06/21/21 23:25:30 - 0:00:02 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:25:31 - 0:00:02 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/merges.txt HTTP/1.1" 200 0 DEBUG - 06/21/21 23:25:31 - 0:00:02 - Attempting to acquire lock 22597850387840 on /root/.cache/huggingface/transformers/20b5a00a80e27ae9accbe25672aba42ad2d4d4cb2c4b9359b50ca8e34e107d6d.5d12962c5ee615a4c803841266e9c3be9a691a924f72d395d3a6c6c81157788b.lock INFO - 06/21/21 23:25:31 - 0:00:02 - Lock 22597850387840 acquired on /root/.cache/huggingface/transformers/20b5a00a80e27ae9accbe25672aba42ad2d4d4cb2c4b9359b50ca8e34e107d6d.5d12962c5ee615a4c803841266e9c3be9a691a924f72d395d3a6c6c81157788b.lock DEBUG - 06/21/21 23:25:31 - 0:00:02 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:25:31 - 0:00:02 - https://huggingface.co:443 "GET /roberta-large/resolve/main/merges.txt HTTP/1.1" 200 456318 DEBUG - 06/21/21 23:25:31 - 0:00:02 - Attempting to release lock 22597850387840 on /root/.cache/huggingface/transformers/20b5a00a80e27ae9accbe25672aba42ad2d4d4cb2c4b9359b50ca8e34e107d6d.5d12962c5ee615a4c803841266e9c3be9a691a924f72d395d3a6c6c81157788b.lock INFO - 06/21/21 23:25:31 - 0:00:02 - Lock 22597850387840 released on /root/.cache/huggingface/transformers/20b5a00a80e27ae9accbe25672aba42ad2d4d4cb2c4b9359b50ca8e34e107d6d.5d12962c5ee615a4c803841266e9c3be9a691a924f72d395d3a6c6c81157788b.lock DEBUG - 06/21/21 23:25:31 - 0:00:02 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:25:31 - 0:00:03 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/tokenizer.json HTTP/1.1" 200 0 DEBUG - 06/21/21 23:25:31 - 0:00:03 - Attempting to acquire lock 22597850387840 on /root/.cache/huggingface/transformers/e16a2590deb9e6d73711d6e05bf27d832fa8c1162d807222e043ca650a556964.fc9576039592f026ad76a1c231b89aee8668488c671dfbe6616bab2ed298d730.lock INFO - 06/21/21 23:25:31 - 0:00:03 - Lock 22597850387840 acquired on /root/.cache/huggingface/transformers/e16a2590deb9e6d73711d6e05bf27d832fa8c1162d807222e043ca650a556964.fc9576039592f026ad76a1c231b89aee8668488c671dfbe6616bab2ed298d730.lock DEBUG - 06/21/21 23:25:31 - 0:00:03 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:25:32 - 0:00:03 - https://huggingface.co:443 "GET /roberta-large/resolve/main/tokenizer.json HTTP/1.1" 200 1355863 DEBUG - 06/21/21 23:25:32 - 0:00:03 - Attempting to release lock 22597850387840 on /root/.cache/huggingface/transformers/e16a2590deb9e6d73711d6e05bf27d832fa8c1162d807222e043ca650a556964.fc9576039592f026ad76a1c231b89aee8668488c671dfbe6616bab2ed298d730.lock INFO - 06/21/21 23:25:32 - 0:00:03 - Lock 22597850387840 released on /root/.cache/huggingface/transformers/e16a2590deb9e6d73711d6e05bf27d832fa8c1162d807222e043ca650a556964.fc9576039592f026ad76a1c231b89aee8668488c671dfbe6616bab2ed298d730.lock INFO - 06/21/21 23:26:26 - 0:00:00 - ============ Initialized logger ============ INFO - 06/21/21 23:26:26 - 0:00:00 - batch_size: 32 data_folder: /gpfs/fs1/projects/gpu_adlr/datasets/zihanl/conll2003 dropout: 0.1 dump_path: logs/conll2003/1 early_stop: 3 epoch: 300 exp_id: 1 exp_name: conll2003 hidden_dim: 1024 logger_filename: train.log lr: 5e-05 model_name: roberta-large num_tag: 3 saved_folder: /gpfs/fs1/projects/gpu_adlr/datasets/zihanl/checkpoints/ner_model seed: 555 INFO - 06/21/21 23:26:26 - 0:00:00 - The experiment will be stored in logs/conll2003/1 DEBUG - 06/21/21 23:26:26 - 0:00:00 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:26:26 - 0:00:00 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/config.json HTTP/1.1" 200 0 DEBUG - 06/21/21 23:26:26 - 0:00:00 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:26:27 - 0:00:01 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/vocab.json HTTP/1.1" 200 0 DEBUG - 06/21/21 23:26:27 - 0:00:01 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:26:27 - 0:00:01 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/merges.txt HTTP/1.1" 200 0 DEBUG - 06/21/21 23:26:27 - 0:00:01 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:26:27 - 0:00:01 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/tokenizer.json HTTP/1.1" 200 0 INFO - 06/21/21 23:26:39 - 0:00:13 - conll2003 dataset: train size: 14040; dev size 3249; test size: 3452 DEBUG - 06/21/21 23:26:39 - 0:00:13 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:26:39 - 0:00:13 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/config.json HTTP/1.1" 200 0 DEBUG - 06/21/21 23:26:39 - 0:00:13 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:26:39 - 0:00:13 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/pytorch_model.bin HTTP/1.1" 302 0 DEBUG - 06/21/21 23:26:39 - 0:00:13 - Attempting to acquire lock 23082502829920 on /root/.cache/huggingface/transformers/8e36ec2f5052bec1e79e139b84c2c3089cb647694ba0f4f634fec7b8258f7c89.c43841d8c5cd23c435408295164cda9525270aa42cd0cc9200911570c0342352.lock INFO - 06/21/21 23:26:39 - 0:00:13 - Lock 23082502829920 acquired on /root/.cache/huggingface/transformers/8e36ec2f5052bec1e79e139b84c2c3089cb647694ba0f4f634fec7b8258f7c89.c43841d8c5cd23c435408295164cda9525270aa42cd0cc9200911570c0342352.lock DEBUG - 06/21/21 23:26:39 - 0:00:13 - Starting new HTTPS connection (1): cdn-lfs.huggingface.co:443 DEBUG - 06/21/21 23:26:39 - 0:00:13 - https://cdn-lfs.huggingface.co:443 "GET /roberta-large/36a10a8b694fadf9bf4f9049d14e257e88be45313ae02d882af9e60f39b8b2e8 HTTP/1.1" 200 1425941629 DEBUG - 06/21/21 23:27:01 - 0:00:34 - Attempting to release lock 23082502829920 on /root/.cache/huggingface/transformers/8e36ec2f5052bec1e79e139b84c2c3089cb647694ba0f4f634fec7b8258f7c89.c43841d8c5cd23c435408295164cda9525270aa42cd0cc9200911570c0342352.lock INFO - 06/21/21 23:27:01 - 0:00:34 - Lock 23082502829920 released on /root/.cache/huggingface/transformers/8e36ec2f5052bec1e79e139b84c2c3089cb647694ba0f4f634fec7b8258f7c89.c43841d8c5cd23c435408295164cda9525270aa42cd0cc9200911570c0342352.lock INFO - 06/21/21 23:27:57 - 0:00:00 - ============ Initialized logger ============ INFO - 06/21/21 23:27:57 - 0:00:00 - batch_size: 32 data_folder: /gpfs/fs1/projects/gpu_adlr/datasets/zihanl/conll2003 dropout: 0.1 dump_path: logs/conll2003/1 early_stop: 3 epoch: 300 exp_id: 1 exp_name: conll2003 hidden_dim: 1024 logger_filename: train.log lr: 5e-05 model_name: roberta-large num_tag: 3 saved_folder: /gpfs/fs1/projects/gpu_adlr/datasets/zihanl/checkpoints/ner_model seed: 555 INFO - 06/21/21 23:27:57 - 0:00:00 - The experiment will be stored in logs/conll2003/1 DEBUG - 06/21/21 23:27:57 - 0:00:00 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:27:57 - 0:00:00 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/config.json HTTP/1.1" 200 0 DEBUG - 06/21/21 23:27:57 - 0:00:00 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:27:58 - 0:00:01 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/vocab.json HTTP/1.1" 200 0 DEBUG - 06/21/21 23:27:58 - 0:00:01 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:27:58 - 0:00:01 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/merges.txt HTTP/1.1" 200 0 DEBUG - 06/21/21 23:27:58 - 0:00:01 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:27:58 - 0:00:01 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/tokenizer.json HTTP/1.1" 200 0 INFO - 06/21/21 23:28:09 - 0:00:12 - conll2003 dataset: train size: 14040; dev size 3249; test size: 3452 DEBUG - 06/21/21 23:28:09 - 0:00:12 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:28:10 - 0:00:13 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/config.json HTTP/1.1" 200 0 DEBUG - 06/21/21 23:28:10 - 0:00:13 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:28:10 - 0:00:13 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/pytorch_model.bin HTTP/1.1" 302 0 INFO - 06/21/21 23:28:17 - 0:00:20 - Start NER training ... INFO - 06/21/21 23:28:17 - 0:00:20 - ============== epoch 0 ============== INFO - 06/21/21 23:29:45 - 0:00:00 - ============ Initialized logger ============ INFO - 06/21/21 23:29:45 - 0:00:00 - batch_size: 32 data_folder: /gpfs/fs1/projects/gpu_adlr/datasets/zihanl/conll2003 dropout: 0.1 dump_path: logs/conll2003/1 early_stop: 3 epoch: 300 exp_id: 1 exp_name: conll2003 hidden_dim: 1024 logger_filename: train.log lr: 5e-05 model_name: roberta-large num_tag: 3 saved_folder: /gpfs/fs1/projects/gpu_adlr/datasets/zihanl/checkpoints/ner_model seed: 555 INFO - 06/21/21 23:29:45 - 0:00:00 - The experiment will be stored in logs/conll2003/1 DEBUG - 06/21/21 23:29:45 - 0:00:00 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:29:45 - 0:00:00 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/config.json HTTP/1.1" 200 0 DEBUG - 06/21/21 23:29:45 - 0:00:00 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:29:45 - 0:00:01 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/vocab.json HTTP/1.1" 200 0 DEBUG - 06/21/21 23:29:45 - 0:00:01 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:29:46 - 0:00:01 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/merges.txt HTTP/1.1" 200 0 DEBUG - 06/21/21 23:29:46 - 0:00:01 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:29:46 - 0:00:01 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/tokenizer.json HTTP/1.1" 200 0 INFO - 06/21/21 23:29:57 - 0:00:12 - conll2003 dataset: train size: 14040; dev size 3249; test size: 3452 DEBUG - 06/21/21 23:29:57 - 0:00:12 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:29:57 - 0:00:12 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/config.json HTTP/1.1" 200 0 DEBUG - 06/21/21 23:29:57 - 0:00:12 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:29:57 - 0:00:12 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/pytorch_model.bin HTTP/1.1" 302 0 INFO - 06/21/21 23:30:04 - 0:00:19 - Start NER training ... INFO - 06/21/21 23:30:04 - 0:00:19 - ============== epoch 0 ============== INFO - 06/21/21 23:31:17 - 0:00:00 - ============ Initialized logger ============ INFO - 06/21/21 23:31:17 - 0:00:00 - batch_size: 32 data_folder: /gpfs/fs1/projects/gpu_adlr/datasets/zihanl/conll2003 dropout: 0.1 dump_path: logs/conll2003/1 early_stop: 3 epoch: 300 exp_id: 1 exp_name: conll2003 hidden_dim: 1024 logger_filename: train.log lr: 5e-05 model_name: roberta-large num_tag: 3 saved_folder: /gpfs/fs1/projects/gpu_adlr/datasets/zihanl/checkpoints/ner_model seed: 555 INFO - 06/21/21 23:31:17 - 0:00:00 - The experiment will be stored in logs/conll2003/1 DEBUG - 06/21/21 23:31:17 - 0:00:00 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:31:17 - 0:00:00 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/config.json HTTP/1.1" 200 0 DEBUG - 06/21/21 23:31:17 - 0:00:00 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:31:17 - 0:00:01 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/vocab.json HTTP/1.1" 200 0 DEBUG - 06/21/21 23:31:17 - 0:00:01 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:31:18 - 0:00:01 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/merges.txt HTTP/1.1" 200 0 DEBUG - 06/21/21 23:31:18 - 0:00:01 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:31:18 - 0:00:01 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/tokenizer.json HTTP/1.1" 200 0 INFO - 06/21/21 23:31:29 - 0:00:13 - conll2003 dataset: train size: 14040; dev size 3249; test size: 3452 DEBUG - 06/21/21 23:31:29 - 0:00:13 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:31:30 - 0:00:13 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/config.json HTTP/1.1" 200 0 DEBUG - 06/21/21 23:31:30 - 0:00:13 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:31:30 - 0:00:13 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/pytorch_model.bin HTTP/1.1" 302 0 INFO - 06/21/21 23:31:37 - 0:00:20 - Start NER training ... INFO - 06/21/21 23:31:37 - 0:00:20 - ============== epoch 0 ============== INFO - 06/21/21 23:33:58 - 0:02:42 - Finish training epoch 0. loss: 0.0696 INFO - 06/21/21 23:33:58 - 0:02:42 - ============== Evaluate epoch 0 on Dev Set ============== INFO - 06/21/21 23:34:08 - 0:02:51 - Evaluate on Dev Set. F1: 95.5005. INFO - 06/21/21 23:34:08 - 0:02:51 - Found better model!! INFO - 06/21/21 23:48:39 - 0:00:00 - ============ Initialized logger ============ INFO - 06/21/21 23:48:39 - 0:00:00 - batch_size: 32 data_folder: /gpfs/fs1/projects/gpu_adlr/datasets/zihanl/conll2003 dropout: 0.1 dump_path: logs/conll2003/1 early_stop: 3 epoch: 300 exp_id: 1 exp_name: conll2003 hidden_dim: 1024 logger_filename: train.log lr: 5e-05 model_name: roberta-large num_tag: 3 saved_folder: /gpfs/fs1/projects/gpu_adlr/datasets/zihanl/checkpoints/ner_model seed: 555 INFO - 06/21/21 23:48:39 - 0:00:00 - The experiment will be stored in logs/conll2003/1 DEBUG - 06/21/21 23:48:39 - 0:00:00 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:48:39 - 0:00:00 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/config.json HTTP/1.1" 200 0 DEBUG - 06/21/21 23:48:39 - 0:00:00 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:48:40 - 0:00:01 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/vocab.json HTTP/1.1" 200 0 DEBUG - 06/21/21 23:48:40 - 0:00:01 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:48:40 - 0:00:01 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/merges.txt HTTP/1.1" 200 0 DEBUG - 06/21/21 23:48:40 - 0:00:01 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:48:40 - 0:00:01 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/tokenizer.json HTTP/1.1" 200 0 INFO - 06/21/21 23:48:51 - 0:00:12 - conll2003 dataset: train size: 14040; dev size 3249; test size: 3452 DEBUG - 06/21/21 23:48:51 - 0:00:12 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:48:51 - 0:00:12 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/config.json HTTP/1.1" 200 0 DEBUG - 06/21/21 23:48:51 - 0:00:12 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:48:51 - 0:00:12 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/pytorch_model.bin HTTP/1.1" 302 0 INFO - 06/21/21 23:49:00 - 0:00:21 - Start NER training ... INFO - 06/21/21 23:49:00 - 0:00:21 - ============== epoch 0 ============== INFO - 06/21/21 23:51:22 - 0:02:43 - Finish training epoch 0. loss: 0.0696 INFO - 06/21/21 23:51:22 - 0:02:43 - ============== Evaluate epoch 0 on Dev Set ============== INFO - 06/21/21 23:51:31 - 0:02:52 - Evaluate on Dev Set. F1: 95.5005. INFO - 06/21/21 23:51:31 - 0:02:52 - Found better model!! INFO - 06/21/21 23:51:33 - 0:02:54 - Best model has been saved to /gpfs/fs1/projects/gpu_adlr/datasets/zihanl/checkpoints/ner_model/roberta-large.pt INFO - 06/21/21 23:51:33 - 0:02:54 - ============== epoch 1 ============== INFO - 06/21/21 23:53:55 - 0:05:16 - Finish training epoch 1. loss: 0.0234 INFO - 06/21/21 23:53:55 - 0:05:16 - ============== Evaluate epoch 1 on Dev Set ============== INFO - 06/21/21 23:54:03 - 0:00:00 - ============ Initialized logger ============ INFO - 06/21/21 23:54:03 - 0:00:00 - batch_size: 32 data_folder: /gpfs/fs1/projects/gpu_adlr/datasets/zihanl/conll2003 dropout: 0.1 dump_path: logs/conll2003/1 early_stop: 3 epoch: 300 exp_id: 1 exp_name: conll2003 hidden_dim: 1024 logger_filename: train.log lr: 5e-05 model_name: roberta-large num_tag: 3 saved_folder: /gpfs/fs1/projects/gpu_adlr/datasets/zihanl/checkpoints/ner_model seed: 111 INFO - 06/21/21 23:54:03 - 0:00:00 - The experiment will be stored in logs/conll2003/1 DEBUG - 06/21/21 23:54:03 - 0:00:00 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:54:04 - 0:00:00 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/config.json HTTP/1.1" 200 0 DEBUG - 06/21/21 23:54:04 - 0:00:00 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:54:04 - 0:00:01 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/vocab.json HTTP/1.1" 200 0 DEBUG - 06/21/21 23:54:04 - 0:00:01 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:54:04 - 0:00:01 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/merges.txt HTTP/1.1" 200 0 DEBUG - 06/21/21 23:54:04 - 0:00:01 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:54:05 - 0:00:01 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/tokenizer.json HTTP/1.1" 200 0 INFO - 06/21/21 23:54:05 - 0:05:25 - Evaluate on Dev Set. F1: 96.9048. INFO - 06/21/21 23:54:05 - 0:05:25 - Found better model!! INFO - 06/21/21 23:54:06 - 0:05:27 - Best model has been saved to /gpfs/fs1/projects/gpu_adlr/datasets/zihanl/checkpoints/ner_model/roberta-large.pt INFO - 06/21/21 23:54:06 - 0:05:27 - ============== epoch 2 ============== INFO - 06/21/21 23:54:16 - 0:00:12 - conll2003 dataset: train size: 14040; dev size 3249; test size: 3452 DEBUG - 06/21/21 23:54:16 - 0:00:12 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:54:16 - 0:00:12 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/config.json HTTP/1.1" 200 0 DEBUG - 06/21/21 23:54:16 - 0:00:12 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:54:16 - 0:00:13 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/pytorch_model.bin HTTP/1.1" 302 0 INFO - 06/21/21 23:54:24 - 0:00:20 - Start NER training ... INFO - 06/21/21 23:54:24 - 0:00:20 - ============== epoch 0 ============== INFO - 06/21/21 23:55:40 - 0:00:00 - ============ Initialized logger ============ INFO - 06/21/21 23:55:40 - 0:00:00 - batch_size: 32 data_folder: /gpfs/fs1/projects/gpu_adlr/datasets/zihanl/conll2003 dropout: 0.1 dump_path: logs/conll2003/1 early_stop: 3 epoch: 300 exp_id: 1 exp_name: conll2003 hidden_dim: 1024 logger_filename: train.log lr: 5e-05 model_name: roberta-large num_tag: 3 saved_folder: /gpfs/fs1/projects/gpu_adlr/datasets/zihanl/checkpoints/ner_model seed: 123456 INFO - 06/21/21 23:55:40 - 0:00:00 - The experiment will be stored in logs/conll2003/1 DEBUG - 06/21/21 23:55:40 - 0:00:00 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:55:40 - 0:00:00 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/config.json HTTP/1.1" 200 0 DEBUG - 06/21/21 23:55:40 - 0:00:00 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:55:41 - 0:00:01 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/vocab.json HTTP/1.1" 200 0 DEBUG - 06/21/21 23:55:41 - 0:00:01 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:55:41 - 0:00:01 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/merges.txt HTTP/1.1" 200 0 DEBUG - 06/21/21 23:55:41 - 0:00:01 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:55:41 - 0:00:01 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/tokenizer.json HTTP/1.1" 200 0 INFO - 06/21/21 23:55:53 - 0:00:13 - conll2003 dataset: train size: 14040; dev size 3249; test size: 3452 DEBUG - 06/21/21 23:55:53 - 0:00:13 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:55:53 - 0:00:13 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/config.json HTTP/1.1" 200 0 DEBUG - 06/21/21 23:55:53 - 0:00:13 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:55:53 - 0:00:13 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/pytorch_model.bin HTTP/1.1" 302 0 INFO - 06/21/21 23:56:01 - 0:00:21 - Start NER training ... INFO - 06/21/21 23:56:01 - 0:00:21 - ============== epoch 0 ============== INFO - 06/21/21 23:56:29 - 0:07:50 - Finish training epoch 2. loss: 0.0162 INFO - 06/21/21 23:56:29 - 0:07:50 - ============== Evaluate epoch 2 on Dev Set ============== INFO - 06/21/21 23:56:38 - 0:07:59 - Evaluate on Dev Set. F1: 97.3381. INFO - 06/21/21 23:56:38 - 0:07:59 - Found better model!! INFO - 06/21/21 23:56:40 - 0:08:01 - Best model has been saved to /gpfs/fs1/projects/gpu_adlr/datasets/zihanl/checkpoints/ner_model/roberta-large.pt INFO - 06/21/21 23:56:40 - 0:08:01 - ============== epoch 3 ============== INFO - 06/21/21 23:56:47 - 0:02:43 - Finish training epoch 0. loss: 0.0580 INFO - 06/21/21 23:56:47 - 0:02:43 - ============== Evaluate epoch 0 on Dev Set ============== INFO - 06/21/21 23:56:56 - 0:02:53 - Evaluate on Dev Set. F1: 96.7327. INFO - 06/21/21 23:56:56 - 0:02:53 - Found better model!! INFO - 06/21/21 23:56:58 - 0:02:54 - Best model has been saved to /gpfs/fs1/projects/gpu_adlr/datasets/zihanl/checkpoints/ner_model/roberta-large.pt INFO - 06/21/21 23:56:58 - 0:02:54 - ============== epoch 1 ============== INFO - 06/21/21 23:58:25 - 0:02:45 - Finish training epoch 0. loss: 0.0544 INFO - 06/21/21 23:58:25 - 0:02:45 - ============== Evaluate epoch 0 on Dev Set ============== INFO - 06/21/21 23:58:34 - 0:02:54 - Evaluate on Dev Set. F1: 96.8227. INFO - 06/21/21 23:58:34 - 0:02:54 - Found better model!! INFO - 06/21/21 23:58:36 - 0:02:56 - Best model has been saved to /gpfs/fs1/projects/gpu_adlr/datasets/zihanl/checkpoints/ner_model/roberta-large.pt INFO - 06/21/21 23:58:36 - 0:02:56 - ============== epoch 1 ============== INFO - 06/21/21 23:58:40 - 0:00:00 - ============ Initialized logger ============ INFO - 06/21/21 23:58:40 - 0:00:00 - batch_size: 32 data_folder: /gpfs/fs1/projects/gpu_adlr/datasets/zihanl/conll2003 dropout: 0.1 dump_path: logs/conll2003/1 early_stop: 3 epoch: 300 exp_id: 1 exp_name: conll2003 hidden_dim: 1024 logger_filename: train.log lr: 3e-05 model_name: roberta-large num_tag: 3 saved_folder: /gpfs/fs1/projects/gpu_adlr/datasets/zihanl/checkpoints/ner_model seed: 555 INFO - 06/21/21 23:58:40 - 0:00:00 - The experiment will be stored in logs/conll2003/1 DEBUG - 06/21/21 23:58:40 - 0:00:00 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:58:40 - 0:00:00 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/config.json HTTP/1.1" 200 0 DEBUG - 06/21/21 23:58:40 - 0:00:00 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:58:41 - 0:00:01 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/vocab.json HTTP/1.1" 200 0 DEBUG - 06/21/21 23:58:41 - 0:00:01 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:58:41 - 0:00:01 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/merges.txt HTTP/1.1" 200 0 DEBUG - 06/21/21 23:58:41 - 0:00:01 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:58:41 - 0:00:01 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/tokenizer.json HTTP/1.1" 200 0 INFO - 06/21/21 23:58:57 - 0:00:00 - ============ Initialized logger ============ INFO - 06/21/21 23:58:57 - 0:00:00 - batch_size: 32 data_folder: /gpfs/fs1/projects/gpu_adlr/datasets/zihanl/conll2003 dropout: 0.1 dump_path: logs/conll2003/1 early_stop: 3 epoch: 300 exp_id: 1 exp_name: conll2003 hidden_dim: 1024 logger_filename: train.log lr: 3e-05 model_name: roberta-large num_tag: 3 saved_folder: /gpfs/fs1/projects/gpu_adlr/datasets/zihanl/checkpoints/ner_model seed: 111 INFO - 06/21/21 23:58:57 - 0:00:00 - The experiment will be stored in logs/conll2003/1 DEBUG - 06/21/21 23:58:57 - 0:00:00 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:58:57 - 0:00:00 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/config.json HTTP/1.1" 200 0 DEBUG - 06/21/21 23:58:57 - 0:00:00 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:58:58 - 0:00:01 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/vocab.json HTTP/1.1" 200 0 DEBUG - 06/21/21 23:58:58 - 0:00:01 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:58:58 - 0:00:01 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/merges.txt HTTP/1.1" 200 0 DEBUG - 06/21/21 23:58:58 - 0:00:01 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:58:58 - 0:00:01 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/tokenizer.json HTTP/1.1" 200 0 INFO - 06/21/21 23:59:02 - 0:10:23 - Finish training epoch 3. loss: 0.0136 INFO - 06/21/21 23:59:02 - 0:10:23 - ============== Evaluate epoch 3 on Dev Set ============== INFO - 06/21/21 23:59:10 - 0:00:12 - conll2003 dataset: train size: 14040; dev size 3249; test size: 3452 DEBUG - 06/21/21 23:59:10 - 0:00:12 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:59:10 - 0:00:13 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/config.json HTTP/1.1" 200 0 DEBUG - 06/21/21 23:59:10 - 0:00:13 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:59:10 - 0:00:13 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/pytorch_model.bin HTTP/1.1" 302 0 INFO - 06/21/21 23:59:12 - 0:10:33 - Evaluate on Dev Set. F1: 96.0542. INFO - 06/21/21 23:59:12 - 0:10:33 - No better model found (1/3) INFO - 06/21/21 23:59:12 - 0:10:33 - ============== epoch 4 ============== INFO - 06/21/21 23:59:18 - 0:00:20 - Start NER training ... INFO - 06/21/21 23:59:18 - 0:00:20 - ============== epoch 0 ============== INFO - 06/21/21 23:59:21 - 0:05:18 - Finish training epoch 1. loss: 0.0190 INFO - 06/21/21 23:59:21 - 0:05:18 - ============== Evaluate epoch 1 on Dev Set ============== INFO - 06/21/21 23:59:30 - 0:00:00 - ============ Initialized logger ============ INFO - 06/21/21 23:59:30 - 0:00:00 - batch_size: 32 data_folder: /gpfs/fs1/projects/gpu_adlr/datasets/zihanl/conll2003 dropout: 0.1 dump_path: logs/conll2003/1 early_stop: 3 epoch: 300 exp_id: 1 exp_name: conll2003 hidden_dim: 1024 logger_filename: train.log lr: 2e-05 model_name: roberta-large num_tag: 3 saved_folder: /gpfs/fs1/projects/gpu_adlr/datasets/zihanl/checkpoints/ner_model seed: 111 INFO - 06/21/21 23:59:30 - 0:00:00 - The experiment will be stored in logs/conll2003/1 DEBUG - 06/21/21 23:59:30 - 0:00:00 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:59:30 - 0:00:00 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/config.json HTTP/1.1" 200 0 DEBUG - 06/21/21 23:59:30 - 0:00:00 - Starting new HTTPS connection (1): huggingface.co:443 INFO - 06/21/21 23:59:31 - 0:05:27 - Evaluate on Dev Set. F1: 97.1510. INFO - 06/21/21 23:59:31 - 0:05:27 - Found better model!! DEBUG - 06/21/21 23:59:31 - 0:00:01 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/vocab.json HTTP/1.1" 200 0 DEBUG - 06/21/21 23:59:31 - 0:00:01 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:59:31 - 0:00:01 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/merges.txt HTTP/1.1" 200 0 DEBUG - 06/21/21 23:59:31 - 0:00:01 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:59:31 - 0:00:01 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/tokenizer.json HTTP/1.1" 200 0 INFO - 06/21/21 23:59:32 - 0:05:29 - Best model has been saved to /gpfs/fs1/projects/gpu_adlr/datasets/zihanl/checkpoints/ner_model/roberta-large.pt INFO - 06/21/21 23:59:32 - 0:05:29 - ============== epoch 2 ============== INFO - 06/21/21 23:59:43 - 0:00:13 - conll2003 dataset: train size: 14040; dev size 3249; test size: 3452 DEBUG - 06/21/21 23:59:43 - 0:00:13 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:59:43 - 0:00:13 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/config.json HTTP/1.1" 200 0 DEBUG - 06/21/21 23:59:43 - 0:00:13 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/21/21 23:59:44 - 0:00:13 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/pytorch_model.bin HTTP/1.1" 302 0 INFO - 06/21/21 23:59:51 - 0:00:21 - Start NER training ... INFO - 06/21/21 23:59:51 - 0:00:21 - ============== epoch 0 ============== INFO - 06/22/21 00:01:00 - 0:05:20 - Finish training epoch 1. loss: 0.0229 INFO - 06/22/21 00:01:00 - 0:05:20 - ============== Evaluate epoch 1 on Dev Set ============== INFO - 06/22/21 00:01:10 - 0:05:30 - Evaluate on Dev Set. F1: 97.0174. INFO - 06/22/21 00:01:10 - 0:05:30 - Found better model!! INFO - 06/22/21 00:01:12 - 0:05:31 - Best model has been saved to /gpfs/fs1/projects/gpu_adlr/datasets/zihanl/checkpoints/ner_model/roberta-large.pt INFO - 06/22/21 00:01:12 - 0:05:31 - ============== epoch 2 ============== INFO - 06/22/21 00:01:35 - 0:12:56 - Finish training epoch 4. loss: 0.0170 INFO - 06/22/21 00:01:35 - 0:12:56 - ============== Evaluate epoch 4 on Dev Set ============== INFO - 06/22/21 00:01:40 - 0:02:43 - Finish training epoch 0. loss: 0.0544 INFO - 06/22/21 00:01:40 - 0:02:43 - ============== Evaluate epoch 0 on Dev Set ============== INFO - 06/22/21 00:01:45 - 0:13:05 - Evaluate on Dev Set. F1: 97.1884. INFO - 06/22/21 00:01:45 - 0:13:05 - No better model found (2/3) INFO - 06/22/21 00:01:45 - 0:13:05 - ============== epoch 5 ============== INFO - 06/22/21 00:01:50 - 0:02:53 - Evaluate on Dev Set. F1: 96.2938. INFO - 06/22/21 00:01:50 - 0:02:53 - Found better model!! INFO - 06/22/21 00:01:52 - 0:02:55 - Best model has been saved to /gpfs/fs1/projects/gpu_adlr/datasets/zihanl/checkpoints/ner_model/roberta-large.pt INFO - 06/22/21 00:01:52 - 0:02:55 - ============== epoch 1 ============== INFO - 06/22/21 00:01:55 - 0:07:51 - Finish training epoch 2. loss: 0.0200 INFO - 06/22/21 00:01:55 - 0:07:51 - ============== Evaluate epoch 2 on Dev Set ============== INFO - 06/22/21 00:02:04 - 0:08:01 - Evaluate on Dev Set. F1: 96.9804. INFO - 06/22/21 00:02:04 - 0:08:01 - No better model found (1/3) INFO - 06/22/21 00:02:04 - 0:08:01 - ============== epoch 3 ============== INFO - 06/22/21 00:02:13 - 0:02:42 - Finish training epoch 0. loss: 0.0547 INFO - 06/22/21 00:02:13 - 0:02:42 - ============== Evaluate epoch 0 on Dev Set ============== INFO - 06/22/21 00:02:22 - 0:02:52 - Evaluate on Dev Set. F1: 97.0400. INFO - 06/22/21 00:02:22 - 0:02:52 - Found better model!! INFO - 06/22/21 00:02:24 - 0:02:54 - Best model has been saved to /gpfs/fs1/projects/gpu_adlr/datasets/zihanl/checkpoints/ner_model/roberta-large.pt INFO - 06/22/21 00:02:24 - 0:02:54 - ============== epoch 1 ============== INFO - 06/22/21 00:03:35 - 0:07:55 - Finish training epoch 2. loss: 0.0173 INFO - 06/22/21 00:03:35 - 0:07:55 - ============== Evaluate epoch 2 on Dev Set ============== INFO - 06/22/21 00:03:45 - 0:08:04 - Evaluate on Dev Set. F1: 97.3191. INFO - 06/22/21 00:03:45 - 0:08:04 - Found better model!! INFO - 06/22/21 00:03:46 - 0:08:06 - Best model has been saved to /gpfs/fs1/projects/gpu_adlr/datasets/zihanl/checkpoints/ner_model/roberta-large.pt INFO - 06/22/21 00:03:46 - 0:08:06 - ============== epoch 3 ============== INFO - 06/22/21 00:04:07 - 0:15:28 - Finish training epoch 5. loss: 0.0083 INFO - 06/22/21 00:04:07 - 0:15:28 - ============== Evaluate epoch 5 on Dev Set ============== INFO - 06/22/21 00:04:14 - 0:05:17 - Finish training epoch 1. loss: 0.0182 INFO - 06/22/21 00:04:14 - 0:05:17 - ============== Evaluate epoch 1 on Dev Set ============== INFO - 06/22/21 00:04:17 - 0:15:37 - Evaluate on Dev Set. F1: 97.3169. INFO - 06/22/21 00:04:17 - 0:15:37 - No better model found (3/3) INFO - 06/22/21 00:04:17 - 0:15:37 - ============== Evaluate on Test Set ============== INFO - 06/22/21 00:04:24 - 0:05:27 - Evaluate on Dev Set. F1: 97.6314. INFO - 06/22/21 00:04:24 - 0:05:27 - Found better model!! INFO - 06/22/21 00:04:26 - 0:15:46 - Evaluate on Test Set. F1: 95.6012. INFO - 06/22/21 00:04:26 - 0:05:29 - Best model has been saved to /gpfs/fs1/projects/gpu_adlr/datasets/zihanl/checkpoints/ner_model/roberta-large.pt INFO - 06/22/21 00:04:26 - 0:05:29 - ============== epoch 2 ============== INFO - 06/22/21 00:04:27 - 0:10:24 - Finish training epoch 3. loss: 0.0157 INFO - 06/22/21 00:04:27 - 0:10:24 - ============== Evaluate epoch 3 on Dev Set ============== INFO - 06/22/21 00:04:37 - 0:10:33 - Evaluate on Dev Set. F1: 97.6654. INFO - 06/22/21 00:04:37 - 0:10:33 - Found better model!! INFO - 06/22/21 00:04:39 - 0:10:35 - Best model has been saved to /gpfs/fs1/projects/gpu_adlr/datasets/zihanl/checkpoints/ner_model/roberta-large.pt INFO - 06/22/21 00:04:39 - 0:10:35 - ============== epoch 4 ============== INFO - 06/22/21 00:04:45 - 0:05:15 - Finish training epoch 1. loss: 0.0177 INFO - 06/22/21 00:04:45 - 0:05:15 - ============== Evaluate epoch 1 on Dev Set ============== INFO - 06/22/21 00:04:55 - 0:05:25 - Evaluate on Dev Set. F1: 97.6093. INFO - 06/22/21 00:04:55 - 0:05:25 - Found better model!! INFO - 06/22/21 00:04:56 - 0:05:26 - Best model has been saved to /gpfs/fs1/projects/gpu_adlr/datasets/zihanl/checkpoints/ner_model/roberta-large.pt INFO - 06/22/21 00:04:56 - 0:05:26 - ============== epoch 2 ============== INFO - 06/22/21 00:06:10 - 0:10:30 - Finish training epoch 3. loss: 0.0439 INFO - 06/22/21 00:06:10 - 0:10:30 - ============== Evaluate epoch 3 on Dev Set ============== INFO - 06/22/21 00:06:20 - 0:10:40 - Evaluate on Dev Set. F1: 0.0000. INFO - 06/22/21 00:06:20 - 0:10:40 - No better model found (1/3) INFO - 06/22/21 00:06:20 - 0:10:40 - ============== epoch 4 ============== INFO - 06/22/21 00:06:47 - 0:07:50 - Finish training epoch 2. loss: 0.0156 INFO - 06/22/21 00:06:47 - 0:07:50 - ============== Evaluate epoch 2 on Dev Set ============== INFO - 06/22/21 00:06:57 - 0:07:59 - Evaluate on Dev Set. F1: 97.5384. INFO - 06/22/21 00:06:57 - 0:07:59 - No better model found (1/3) INFO - 06/22/21 00:06:57 - 0:07:59 - ============== epoch 3 ============== INFO - 06/22/21 00:07:02 - 0:12:59 - Finish training epoch 4. loss: 0.0127 INFO - 06/22/21 00:07:02 - 0:12:59 - ============== Evaluate epoch 4 on Dev Set ============== INFO - 06/22/21 00:07:12 - 0:13:08 - Evaluate on Dev Set. F1: 97.4583. INFO - 06/22/21 00:07:12 - 0:13:08 - No better model found (1/3) INFO - 06/22/21 00:07:12 - 0:13:08 - ============== epoch 5 ============== INFO - 06/22/21 00:07:17 - 0:07:47 - Finish training epoch 2. loss: 0.0115 INFO - 06/22/21 00:07:17 - 0:07:47 - ============== Evaluate epoch 2 on Dev Set ============== INFO - 06/22/21 00:07:26 - 0:07:56 - Evaluate on Dev Set. F1: 97.2615. INFO - 06/22/21 00:07:26 - 0:07:56 - No better model found (1/3) INFO - 06/22/21 00:07:26 - 0:07:56 - ============== epoch 3 ============== INFO - 06/22/21 00:08:43 - 0:13:03 - Finish training epoch 4. loss: 0.5637 INFO - 06/22/21 00:08:43 - 0:13:03 - ============== Evaluate epoch 4 on Dev Set ============== INFO - 06/22/21 00:08:53 - 0:13:12 - Evaluate on Dev Set. F1: 0.0000. INFO - 06/22/21 00:08:53 - 0:13:12 - No better model found (2/3) INFO - 06/22/21 00:08:53 - 0:13:12 - ============== epoch 5 ============== INFO - 06/22/21 00:09:18 - 0:10:21 - Finish training epoch 3. loss: 0.0110 INFO - 06/22/21 00:09:18 - 0:10:21 - ============== Evaluate epoch 3 on Dev Set ============== INFO - 06/22/21 00:09:28 - 0:10:31 - Evaluate on Dev Set. F1: 97.2738. INFO - 06/22/21 00:09:28 - 0:10:31 - No better model found (2/3) INFO - 06/22/21 00:09:28 - 0:10:31 - ============== epoch 4 ============== INFO - 06/22/21 00:09:35 - 0:15:31 - Finish training epoch 5. loss: 0.0132 INFO - 06/22/21 00:09:35 - 0:15:31 - ============== Evaluate epoch 5 on Dev Set ============== INFO - 06/22/21 00:09:45 - 0:15:41 - Evaluate on Dev Set. F1: 97.4630. INFO - 06/22/21 00:09:45 - 0:15:41 - No better model found (2/3) INFO - 06/22/21 00:09:45 - 0:15:41 - ============== epoch 6 ============== INFO - 06/22/21 00:09:47 - 0:10:17 - Finish training epoch 3. loss: 0.0101 INFO - 06/22/21 00:09:47 - 0:10:17 - ============== Evaluate epoch 3 on Dev Set ============== INFO - 06/22/21 00:09:57 - 0:10:27 - Evaluate on Dev Set. F1: 97.5034. INFO - 06/22/21 00:09:57 - 0:10:27 - No better model found (2/3) INFO - 06/22/21 00:09:57 - 0:10:27 - ============== epoch 4 ============== INFO - 06/22/21 00:11:16 - 0:15:36 - Finish training epoch 5. loss: 0.5620 INFO - 06/22/21 00:11:16 - 0:15:36 - ============== Evaluate epoch 5 on Dev Set ============== INFO - 06/22/21 00:11:26 - 0:15:45 - Evaluate on Dev Set. F1: 0.0000. INFO - 06/22/21 00:11:26 - 0:15:45 - No better model found (3/3) INFO - 06/22/21 00:11:26 - 0:15:45 - ============== Evaluate on Test Set ============== INFO - 06/22/21 00:11:35 - 0:15:54 - Evaluate on Test Set. F1: 0.0000. INFO - 06/22/21 00:11:50 - 0:12:53 - Finish training epoch 4. loss: 0.0137 INFO - 06/22/21 00:11:50 - 0:12:53 - ============== Evaluate epoch 4 on Dev Set ============== INFO - 06/22/21 00:12:00 - 0:13:02 - Evaluate on Dev Set. F1: 97.4501. INFO - 06/22/21 00:12:00 - 0:13:02 - No better model found (3/3) INFO - 06/22/21 00:12:00 - 0:13:02 - ============== Evaluate on Test Set ============== INFO - 06/22/21 00:12:08 - 0:18:04 - Finish training epoch 6. loss: 0.0129 INFO - 06/22/21 00:12:08 - 0:18:04 - ============== Evaluate epoch 6 on Dev Set ============== INFO - 06/22/21 00:12:09 - 0:13:11 - Evaluate on Test Set. F1: 95.4761. INFO - 06/22/21 00:12:17 - 0:18:14 - Evaluate on Dev Set. F1: 97.2311. INFO - 06/22/21 00:12:17 - 0:18:14 - No better model found (3/3) INFO - 06/22/21 00:12:17 - 0:18:14 - ============== Evaluate on Test Set ============== INFO - 06/22/21 00:12:19 - 0:12:48 - Finish training epoch 4. loss: 0.0074 INFO - 06/22/21 00:12:19 - 0:12:48 - ============== Evaluate epoch 4 on Dev Set ============== INFO - 06/22/21 00:12:26 - 0:18:23 - Evaluate on Test Set. F1: 95.2934. INFO - 06/22/21 00:12:28 - 0:12:58 - Evaluate on Dev Set. F1: 97.0406. INFO - 06/22/21 00:12:28 - 0:12:58 - No better model found (3/3) INFO - 06/22/21 00:12:28 - 0:12:58 - ============== Evaluate on Test Set ============== INFO - 06/22/21 00:12:37 - 0:13:07 - Evaluate on Test Set. F1: 95.3264. INFO - 06/22/21 00:16:11 - 0:00:00 - ============ Initialized logger ============ INFO - 06/22/21 00:16:11 - 0:00:00 - batch_size: 32 data_folder: /gpfs/fs1/projects/gpu_adlr/datasets/zihanl/conll2003 dropout: 0.1 dump_path: logs/conll2003/1 early_stop: 3 epoch: 300 exp_id: 1 exp_name: conll2003 hidden_dim: 1024 logger_filename: train.log lr: 3e-05 model_name: roberta-large num_tag: 3 saved_folder: /gpfs/fs1/projects/gpu_adlr/datasets/zihanl/checkpoints/ner_model seed: 111 INFO - 06/22/21 00:16:11 - 0:00:00 - The experiment will be stored in logs/conll2003/1 DEBUG - 06/22/21 00:16:11 - 0:00:00 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/22/21 00:16:12 - 0:00:00 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/config.json HTTP/1.1" 200 0 DEBUG - 06/22/21 00:16:12 - 0:00:00 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/22/21 00:16:12 - 0:00:01 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/vocab.json HTTP/1.1" 200 0 DEBUG - 06/22/21 00:16:12 - 0:00:01 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/22/21 00:16:12 - 0:00:01 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/merges.txt HTTP/1.1" 200 0 DEBUG - 06/22/21 00:16:12 - 0:00:01 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/22/21 00:16:13 - 0:00:01 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/tokenizer.json HTTP/1.1" 200 0 INFO - 06/22/21 00:16:24 - 0:00:12 - conll2003 dataset: train size: 14040; dev size 3249; test size: 3452 DEBUG - 06/22/21 00:16:24 - 0:00:12 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/22/21 00:16:24 - 0:00:13 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/config.json HTTP/1.1" 200 0 DEBUG - 06/22/21 00:16:24 - 0:00:13 - Starting new HTTPS connection (1): huggingface.co:443 DEBUG - 06/22/21 00:16:24 - 0:00:13 - https://huggingface.co:443 "HEAD /roberta-large/resolve/main/pytorch_model.bin HTTP/1.1" 302 0 INFO - 06/22/21 00:16:31 - 0:00:20 - Start NER training ... INFO - 06/22/21 00:16:31 - 0:00:20 - ============== epoch 0 ============== INFO - 06/22/21 00:18:53 - 0:02:42 - Finish training epoch 0. loss: 0.0544 INFO - 06/22/21 00:18:53 - 0:02:42 - ============== Evaluate epoch 0 on Dev Set ============== INFO - 06/22/21 00:19:03 - 0:02:51 - Evaluate on Dev Set. F1: 96.2938. INFO - 06/22/21 00:19:03 - 0:02:51 - Found better model!! INFO - 06/22/21 00:19:05 - 0:02:53 - Best model has been saved to /gpfs/fs1/projects/gpu_adlr/datasets/zihanl/checkpoints/ner_model/roberta-large.pt INFO - 06/22/21 00:19:05 - 0:02:53 - ============== epoch 1 ==============