wangkx1 / text-generation-inference-dcu
Commit 81a882ad, authored Nov 21, 2024 by jixx
add tgi2.4.0
Parent: 9822d7f6
Changes: 267
Showing 20 changed files with 2560 additions and 627 deletions
integration-tests/models/__snapshots__/test_flash_mixtral/test_flash_mixtral_load.json  +458 -0
integration-tests/models/__snapshots__/test_flash_mixtral_awq/test_flash_mixtral_awq.json  +104 -0
integration-tests/models/__snapshots__/test_flash_mixtral_awq/test_flash_mixtral_awq_all_params.json  +99 -0
integration-tests/models/__snapshots__/test_flash_mixtral_awq/test_flash_mixtral_awq_load.json  +418 -0
integration-tests/models/__snapshots__/test_flash_mixtral_gptq/test_flash_mixtral_gptq.json  +104 -0
integration-tests/models/__snapshots__/test_flash_mixtral_gptq/test_flash_mixtral_gptq_all_params.json  +99 -0
integration-tests/models/__snapshots__/test_flash_mixtral_gptq/test_flash_mixtral_gptq_load.json  +418 -0
integration-tests/models/__snapshots__/test_flash_pali_gemma/test_flash_pali_gemma.json  +2 -2
integration-tests/models/__snapshots__/test_flash_pali_gemma/test_flash_pali_gemma_two_images.json  +8 -8
integration-tests/models/__snapshots__/test_flash_phi/test_flash_phi_all_params.json  +5 -5
integration-tests/models/__snapshots__/test_flash_phi35_moe/test_flash_phi35_moe.json  +109 -0
integration-tests/models/__snapshots__/test_flash_phi35_moe/test_flash_phi35_moe_all_params.json  +99 -0
integration-tests/models/__snapshots__/test_flash_phi35_moe/test_flash_phi35_moe_load.json  +438 -0
integration-tests/models/__snapshots__/test_flash_starcoder/test_flash_starcoder_default_params.json  +9 -8
integration-tests/models/__snapshots__/test_flash_starcoder2/test_flash_starcoder2_default_params.json  +52 -52
integration-tests/models/__snapshots__/test_flash_starcoder_gptq/test_flash_starcoder_gptq.json  +19 -127
integration-tests/models/__snapshots__/test_flash_starcoder_gptq/test_flash_starcoder_gptq_default_params.json  +19 -127
integration-tests/models/__snapshots__/test_flash_starcoder_gptq/test_flash_starcoder_gptq_load.json  +76 -268
integration-tests/models/__snapshots__/test_idefics2/test_flash_idefics2_next_all_params.json  +3 -3
integration-tests/models/__snapshots__/test_idefics2/test_flash_idefics2_two_images.json  +21 -27
Too many changes to show. To preserve performance only 267 of 267+ files are displayed.
integration-tests/models/__snapshots__/test_flash_mixtral/test_flash_mixtral_load.json
0 → 100644
[
  {
    "details": {
      "best_of_sequences": null,
      "finish_reason": "length",
      "generated_tokens": 10,
      "prefill": [
        {"id": 1, "logprob": null, "text": "<s>"},
        {"id": 1824, "logprob": -6.1445312, "text": "What"},
        {"id": 349, "logprob": -1.4648438, "text": "is"},
        {"id": 21135, "logprob": -13.6875, "text": "gradient"},
        {"id": 24871, "logprob": -1.6005859, "text": "descent"},
        {"id": 28804, "logprob": -0.39526367, "text": "?"},
        {"id": 13, "logprob": -0.640625, "text": "\n"},
        {"id": 13, "logprob": -0.18774414, "text": "\n"}
      ],
      "seed": null,
      "tokens": [
        {"id": 20910, "logprob": -0.96484375, "special": false, "text": "Grad"},
        {"id": 722, "logprob": -0.003168106, "special": false, "text": "ient"},
        {"id": 24871, "logprob": -0.16369629, "special": false, "text": " descent"},
        {"id": 349, "logprob": -0.0881958, "special": false, "text": " is"},
        {"id": 396, "logprob": -0.76708984, "special": false, "text": " an"},
        {"id": 18586, "logprob": -0.57373047, "special": false, "text": " optimization"},
        {"id": 9464, "logprob": -0.11291504, "special": false, "text": " algorithm"},
        {"id": 1307, "logprob": -0.79589844, "special": false, "text": " used"},
        {"id": 298, "logprob": -0.1694336, "special": false, "text": " to"},
        {"id": 26518, "logprob": -0.34350586, "special": false, "text": " minimize"}
      ],
      "top_tokens": null
    },
    "generated_text": "Gradient descent is an optimization algorithm used to minimize"
  },
  {
    "details": {
      "best_of_sequences": null,
      "finish_reason": "length",
      "generated_tokens": 10,
      "prefill": [
        {"id": 1, "logprob": null, "text": "<s>"},
        {"id": 1824, "logprob": -6.1445312, "text": "What"},
        {"id": 349, "logprob": -1.4677734, "text": "is"},
        {"id": 21135, "logprob": -13.6875, "text": "gradient"},
        {"id": 24871, "logprob": -1.6015625, "text": "descent"},
        {"id": 28804, "logprob": -0.39453125, "text": "?"},
        {"id": 13, "logprob": -0.6435547, "text": "\n"},
        {"id": 13, "logprob": -0.18713379, "text": "\n"}
      ],
      "seed": null,
      "tokens": [
        {"id": 20910, "logprob": -0.9628906, "special": false, "text": "Grad"},
        {"id": 722, "logprob": -0.0032176971, "special": false, "text": "ient"},
        {"id": 24871, "logprob": -0.16540527, "special": false, "text": " descent"},
        {"id": 349, "logprob": -0.08898926, "special": false, "text": " is"},
        {"id": 396, "logprob": -0.765625, "special": false, "text": " an"},
        {"id": 18586, "logprob": -0.5708008, "special": false, "text": " optimization"},
        {"id": 9464, "logprob": -0.11401367, "special": false, "text": " algorithm"},
        {"id": 1307, "logprob": -0.7963867, "special": false, "text": " used"},
        {"id": 298, "logprob": -0.17028809, "special": false, "text": " to"},
        {"id": 26518, "logprob": -0.34326172, "special": false, "text": " minimize"}
      ],
      "top_tokens": null
    },
    "generated_text": "Gradient descent is an optimization algorithm used to minimize"
  },
  {
    "details": {
      "best_of_sequences": null,
      "finish_reason": "length",
      "generated_tokens": 10,
      "prefill": [
        {"id": 1, "logprob": null, "text": "<s>"},
        {"id": 1824, "logprob": -6.140625, "text": "What"},
        {"id": 349, "logprob": -1.4658203, "text": "is"},
        {"id": 21135, "logprob": -13.6796875, "text": "gradient"},
        {"id": 24871, "logprob": -1.5898438, "text": "descent"},
        {"id": 28804, "logprob": -0.3955078, "text": "?"},
        {"id": 13, "logprob": -0.64501953, "text": "\n"},
        {"id": 13, "logprob": -0.18493652, "text": "\n"}
      ],
      "seed": null,
      "tokens": [
        {"id": 20910, "logprob": -0.9580078, "special": false, "text": "Grad"},
        {"id": 722, "logprob": -0.0032176971, "special": false, "text": "ient"},
        {"id": 24871, "logprob": -0.16552734, "special": false, "text": " descent"},
        {"id": 349, "logprob": -0.08874512, "special": false, "text": " is"},
        {"id": 396, "logprob": -0.75878906, "special": false, "text": " an"},
        {"id": 18586, "logprob": -0.5703125, "special": false, "text": " optimization"},
        {"id": 9464, "logprob": -0.11236572, "special": false, "text": " algorithm"},
        {"id": 1307, "logprob": -0.79541016, "special": false, "text": " used"},
        {"id": 298, "logprob": -0.17102051, "special": false, "text": " to"},
        {"id": 26518, "logprob": -0.34326172, "special": false, "text": " minimize"}
      ],
      "top_tokens": null
    },
    "generated_text": "Gradient descent is an optimization algorithm used to minimize"
  },
  {
    "details": {
      "best_of_sequences": null,
      "finish_reason": "length",
      "generated_tokens": 10,
      "prefill": [
        {"id": 1, "logprob": null, "text": "<s>"},
        {"id": 1824, "logprob": -6.1328125, "text": "What"},
        {"id": 349, "logprob": -1.4658203, "text": "is"},
        {"id": 21135, "logprob": -13.6796875, "text": "gradient"},
        {"id": 24871, "logprob": -1.5947266, "text": "descent"},
        {"id": 28804, "logprob": -0.39648438, "text": "?"},
        {"id": 13, "logprob": -0.6464844, "text": "\n"},
        {"id": 13, "logprob": -0.18688965, "text": "\n"}
      ],
      "seed": null,
      "tokens": [
        {"id": 20910, "logprob": -0.9609375, "special": false, "text": "Grad"},
        {"id": 722, "logprob": -0.003168106, "special": false, "text": "ient"},
        {"id": 24871, "logprob": -0.16601562, "special": false, "text": " descent"},
        {"id": 349, "logprob": -0.088134766, "special": false, "text": " is"},
        {"id": 396, "logprob": -0.7597656, "special": false, "text": " an"},
        {"id": 18586, "logprob": -0.5708008, "special": false, "text": " optimization"},
        {"id": 9464, "logprob": -0.11291504, "special": false, "text": " algorithm"},
        {"id": 1307, "logprob": -0.7944336, "special": false, "text": " used"},
        {"id": 298, "logprob": -0.17102051, "special": false, "text": " to"},
        {"id": 26518, "logprob": -0.34399414, "special": false, "text": " minimize"}
      ],
      "top_tokens": null
    },
    "generated_text": "Gradient descent is an optimization algorithm used to minimize"
  }
]
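The four responses in this load snapshot share the same token ids and generated text, while their logprobs differ slightly in the low decimal places. A minimal sketch of how such responses could be compared is shown below. The function name `responses_agree` and the `abs_tol` value are assumptions made for illustration; they are not taken from this repository's test suite, which may compare snapshots differently.

```python
import math


def responses_agree(responses, abs_tol=0.05):
    """Check that every response matches the first one.

    Token ids and generated text must be identical; logprobs may differ
    by at most abs_tol (a hypothetical tolerance, not taken from the repo).
    """
    first = responses[0]
    ref_tokens = first["details"]["tokens"]
    for other in responses[1:]:
        if other["generated_text"] != first["generated_text"]:
            return False
        tokens = other["details"]["tokens"]
        if len(tokens) != len(ref_tokens):
            return False
        for ref, tok in zip(ref_tokens, tokens):
            if ref["id"] != tok["id"]:
                return False
            if not math.isclose(ref["logprob"], tok["logprob"],
                                rel_tol=0.0, abs_tol=abs_tol):
                return False
    return True


# Two responses trimmed to their first two generated tokens.
# Ids and logprobs are copied from the snapshot above; the
# generated_text is shortened to match the trimmed token list.
sample = [
    {
        "details": {"tokens": [
            {"id": 20910, "logprob": -0.96484375, "special": False, "text": "Grad"},
            {"id": 722, "logprob": -0.003168106, "special": False, "text": "ient"},
        ]},
        "generated_text": "Gradient",
    },
    {
        "details": {"tokens": [
            {"id": 20910, "logprob": -0.9628906, "special": False, "text": "Grad"},
            {"id": 722, "logprob": -0.0032176971, "special": False, "text": "ient"},
        ]},
        "generated_text": "Gradient",
    },
]

print(responses_agree(sample))
```

An absolute tolerance is used here rather than a relative one because some logprobs in these snapshots are very close to zero, where a relative comparison would be overly strict.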
integration-tests/models/__snapshots__/test_flash_mixtral_awq/test_flash_mixtral_awq.json
0 → 100644
{
  "details": {
    "best_of_sequences": null,
    "finish_reason": "length",
    "generated_tokens": 10,
    "prefill": [
      {"id": 1, "logprob": null, "text": "<s>"},
      {"id": 1824, "logprob": -12.296875, "text": "What"},
      {"id": 349, "logprob": -0.97216797, "text": "is"},
      {"id": 3534, "logprob": -10.1796875, "text": "deep"},
      {"id": 5168, "logprob": -0.9658203, "text": "learning"},
      {"id": 28804, "logprob": -0.44384766, "text": "?"}
    ],
    "seed": null,
    "tokens": [
      {"id": 13, "logprob": -0.50878906, "special": false, "text": "\n"},
      {"id": 13, "logprob": -0.8876953, "special": false, "text": "\n"},
      {"id": 23229, "logprob": -0.15124512, "special": false, "text": "Deep"},
      {"id": 5168, "logprob": -0.030288696, "special": false, "text": " learning"},
      {"id": 349, "logprob": -0.16687012, "special": false, "text": " is"},
      {"id": 264, "logprob": -0.17858887, "special": false, "text": " a"},
      {"id": 19804, "logprob": -0.8046875, "special": false, "text": " subset"},
      {"id": 302, "logprob": -0.007205963, "special": false, "text": " of"},
      {"id": 5599, "logprob": -0.090026855, "special": false, "text": " machine"},
      {"id": 5168, "logprob": -0.0030670166, "special": false, "text": " learning"}
    ],
    "top_tokens": null
  },
  "generated_text": "\n\nDeep learning is a subset of machine learning"
}
integration-tests/models/__snapshots__/test_flash_mixtral_awq/test_flash_mixtral_awq_all_params.json
0 → 100644
{
  "details": {
    "best_of_sequences": null,
    "finish_reason": "length",
    "generated_tokens": 10,
    "prefill": [
      {"id": 1, "logprob": null, "text": "<s>"},
      {"id": 349, "logprob": -13.921875, "text": "is"},
      {"id": 3534, "logprob": -11.2265625, "text": "deep"},
      {"id": 5168, "logprob": -2.3886719, "text": "learning"},
      {"id": 28804, "logprob": -4.7109375, "text": "?"}
    ],
    "seed": 0,
    "tokens": [
      {"id": 13, "logprob": 0.0, "special": false, "text": "\n"},
      {"id": 23229, "logprob": -0.5229492, "special": false, "text": "Deep"},
      {"id": 17504, "logprob": 0.0, "special": false, "text": " Learning"},
      {"id": 349, "logprob": -0.5151367, "special": false, "text": " is"},
      {"id": 264, "logprob": 0.0, "special": false, "text": " a"},
      {"id": 19804, "logprob": 0.0, "special": false, "text": " subset"},
      {"id": 302, "logprob": 0.0, "special": false, "text": " of"},
      {"id": 13253, "logprob": -1.3359375, "special": false, "text": " Machine"},
      {"id": 17504, "logprob": 0.0, "special": false, "text": " Learning"},
      {"id": 28725, "logprob": 0.0, "special": false, "text": ","}
    ],
    "top_tokens": null
  },
  "generated_text": "What is deep learning?\nDeep Learning is a subset of Machine Learning,"
}
integration-tests/models/__snapshots__/test_flash_mixtral_awq/test_flash_mixtral_awq_load.json
0 → 100644
[
  {
    "details": {
      "best_of_sequences": null,
      "finish_reason": "length",
      "generated_tokens": 10,
      "prefill": [
        {"id": 1, "logprob": null, "text": "<s>"},
        {"id": 1824, "logprob": -12.296875, "text": "What"},
        {"id": 349, "logprob": -0.97216797, "text": "is"},
        {"id": 3534, "logprob": -10.1796875, "text": "deep"},
        {"id": 5168, "logprob": -0.9658203, "text": "learning"},
        {"id": 28804, "logprob": -0.44384766, "text": "?"}
      ],
      "seed": null,
      "tokens": [
        {"id": 13, "logprob": -0.50878906, "special": false, "text": "\n"},
        {"id": 13, "logprob": -0.8876953, "special": false, "text": "\n"},
        {"id": 23229, "logprob": -0.15136719, "special": false, "text": "Deep"},
        {"id": 5168, "logprob": -0.030273438, "special": false, "text": " learning"},
        {"id": 349, "logprob": -0.1665039, "special": false, "text": " is"},
        {"id": 264, "logprob": -0.1776123, "special": false, "text": " a"},
        {"id": 19804, "logprob": -0.8076172, "special": false, "text": " subset"},
        {"id": 302, "logprob": -0.007183075, "special": false, "text": " of"},
        {"id": 5599, "logprob": -0.090148926, "special": false, "text": " machine"},
        {"id": 5168, "logprob": -0.0030670166, "special": false, "text": " learning"}
      ],
      "top_tokens": null
    },
    "generated_text": "\n\nDeep learning is a subset of machine learning"
  },
  {
    "details": {
      "best_of_sequences": null,
      "finish_reason": "length",
      "generated_tokens": 10,
      "prefill": [
        {"id": 1, "logprob": null, "text": "<s>"},
        {"id": 1824, "logprob": -12.34375, "text": "What"},
        {"id": 349, "logprob": -0.96728516, "text": "is"},
        {"id": 3534, "logprob": -10.1796875, "text": "deep"},
        {"id": 5168, "logprob": -0.97265625, "text": "learning"},
        {"id": 28804, "logprob": -0.44189453, "text": "?"}
      ],
      "seed": null,
      "tokens": [
        {"id": 13, "logprob": -0.51220703, "special": false, "text": "\n"},
        {"id": 13, "logprob": -0.87402344, "special": false, "text": "\n"},
        {"id": 23229, "logprob": -0.15039062, "special": false, "text": "Deep"},
        {"id": 5168, "logprob": -0.030288696, "special": false, "text": " learning"},
        {"id": 349, "logprob": -0.1652832, "special": false, "text": " is"},
        {"id": 264, "logprob": -0.17858887, "special": false, "text": " a"},
        {"id": 19804, "logprob": -0.81103516, "special": false, "text": " subset"},
        {"id": 302, "logprob": -0.007183075, "special": false, "text": " of"},
        {"id": 5599, "logprob": -0.08880615, "special": false, "text": " machine"},
        {"id": 5168, "logprob": -0.0030612946, "special": false, "text": " learning"}
      ],
      "top_tokens": null
    },
    "generated_text": "\n\nDeep learning is a subset of machine learning"
  },
  {
    "details": {
      "best_of_sequences": null,
      "finish_reason": "length",
      "generated_tokens": 10,
      "prefill": [
        {"id": 1, "logprob": null, "text": "<s>"},
        {"id": 1824, "logprob": -12.34375, "text": "What"},
        {"id": 349, "logprob": -0.96728516, "text": "is"},
        {"id": 3534, "logprob": -10.1796875, "text": "deep"},
        {"id": 5168, "logprob": -0.97265625, "text": "learning"},
        {"id": 28804, "logprob": -0.44189453, "text": "?"}
      ],
      "seed": null,
      "tokens": [
        {"id": 13, "logprob": -0.51220703, "special": false, "text": "\n"},
        {"id": 13, "logprob": -0.87402344, "special": false, "text": "\n"},
        {"id": 23229, "logprob": -0.15039062, "special": false, "text": "Deep"},
        {"id": 5168, "logprob": -0.030288696, "special": false, "text": " learning"},
        {"id": 349, "logprob": -0.1652832, "special": false, "text": " is"},
        {"id": 264, "logprob": -0.17858887, "special": false, "text": " a"},
        {"id": 19804, "logprob": -0.81103516, "special": false, "text": " subset"},
        {"id": 302, "logprob": -0.007183075, "special": false, "text": " of"},
        {"id": 5599, "logprob": -0.08880615, "special": false, "text": " machine"},
        {"id": 5168, "logprob": -0.0030612946, "special": false, "text": " learning"}
      ],
      "top_tokens": null
    },
    "generated_text": "\n\nDeep learning is a subset of machine learning"
  },
  {
    "details": {
      "best_of_sequences": null,
      "finish_reason": "length",
      "generated_tokens": 10,
      "prefill": [
        {"id": 1, "logprob": null, "text": "<s>"},
        {"id": 1824, "logprob": -12.34375, "text": "What"},
        {"id": 349, "logprob": -0.96728516, "text": "is"},
        {"id": 3534, "logprob": -10.1796875, "text": "deep"},
        {"id": 5168, "logprob": -0.97265625, "text": "learning"},
        {"id": 28804, "logprob": -0.44189453, "text": "?"}
      ],
      "seed": null,
      "tokens": [
        {"id": 13, "logprob": -0.51220703, "special": false, "text": "\n"},
        {"id": 13, "logprob": -0.87402344, "special": false, "text": "\n"},
        {"id": 23229, "logprob": -0.15039062, "special": false, "text": "Deep"},
        {"id": 5168, "logprob": -0.030288696, "special": false, "text": " learning"},
        {"id": 349, "logprob": -0.1652832, "special": false, "text": " is"},
        {"id": 264, "logprob": -0.17858887, "special": false, "text": " a"},
        {"id": 19804, "logprob": -0.81103516, "special": false, "text": " subset"},
        {"id": 302, "logprob": -0.007183075, "special": false, "text": " of"},
        {"id": 5599, "logprob": -0.08880615, "special": false, "text": " machine"},
        {"id": 5168, "logprob": -0.0030612946, "special": false, "text": " learning"}
      ],
      "top_tokens": null
    },
    "generated_text": "\n\nDeep learning is a subset of machine learning"
  }
]
integration-tests/models/__snapshots__/test_flash_mixtral_gptq/test_flash_mixtral_gptq.json
0 → 100644
{
  "details": {
    "best_of_sequences": null,
    "finish_reason": "length",
    "generated_tokens": 10,
    "prefill": [
      {"id": 1, "logprob": null, "text": "<s>"},
      {"id": 1824, "logprob": -9.2890625, "text": "What"},
      {"id": 349, "logprob": -1.1503906, "text": "is"},
      {"id": 3534, "logprob": -9.5859375, "text": "deep"},
      {"id": 5168, "logprob": -1.3945312, "text": "learning"},
      {"id": 28804, "logprob": -0.4555664, "text": "?"}
    ],
    "seed": null,
    "tokens": [
      {"id": 13, "logprob": -0.6953125, "special": false, "text": "\n"},
      {"id": 13, "logprob": -0.4777832, "special": false, "text": "\n"},
      {"id": 23229, "logprob": -0.13256836, "special": false, "text": "Deep"},
      {"id": 5168, "logprob": -0.023849487, "special": false, "text": " learning"},
      {"id": 349, "logprob": -0.13977051, "special": false, "text": " is"},
      {"id": 264, "logprob": -0.14489746, "special": false, "text": " a"},
      {"id": 19804, "logprob": -0.63183594, "special": false, "text": " subset"},
      {"id": 302, "logprob": -0.010314941, "special": false, "text": " of"},
      {"id": 5599, "logprob": -0.0635376, "special": false, "text": " machine"},
      {"id": 5168, "logprob": -0.0028572083, "special": false, "text": " learning"}
    ],
    "top_tokens": null
  },
  "generated_text": "\n\nDeep learning is a subset of machine learning"
}
integration-tests/models/__snapshots__/test_flash_mixtral_gptq/test_flash_mixtral_gptq_all_params.json
0 → 100644
{
  "details": {
    "best_of_sequences": null,
    "finish_reason": "length",
    "generated_tokens": 10,
    "prefill": [
      {"id": 1, "logprob": null, "text": "<s>"},
      {"id": 349, "logprob": -12.0546875, "text": "is"},
      {"id": 3534, "logprob": -10.53125, "text": "deep"},
      {"id": 5168, "logprob": -2.71875, "text": "learning"},
      {"id": 28804, "logprob": -5.0078125, "text": "?"}
    ],
    "seed": 0,
    "tokens": [
      {"id": 13, "logprob": 0.0, "special": false, "text": "\n"},
      {"id": 23229, "logprob": -0.18237305, "special": false, "text": "Deep"},
      {"id": 17504, "logprob": 0.0, "special": false, "text": " Learning"},
      {"id": 349, "logprob": 0.0, "special": false, "text": " is"},
      {"id": 264, "logprob": 0.0, "special": false, "text": " a"},
      {"id": 19804, "logprob": 0.0, "special": false, "text": " subset"},
      {"id": 302, "logprob": 0.0, "special": false, "text": " of"},
      {"id": 13253, "logprob": -0.6040039, "special": false, "text": " Machine"},
      {"id": 17504, "logprob": 0.0, "special": false, "text": " Learning"},
      {"id": 28725, "logprob": -0.11621094, "special": false, "text": ","}
    ],
    "top_tokens": null
  },
  "generated_text": "What is deep learning?\nDeep Learning is a subset of Machine Learning,"
}
integration-tests/models/__snapshots__/test_flash_mixtral_gptq/test_flash_mixtral_gptq_load.json
0 → 100644
[
  {
    "details": {
      "best_of_sequences": null,
      "finish_reason": "length",
      "generated_tokens": 10,
      "prefill": [
        {"id": 1, "logprob": null, "text": "<s>"},
        {"id": 1824, "logprob": -9.2890625, "text": "What"},
        {"id": 349, "logprob": -1.1503906, "text": "is"},
        {"id": 3534, "logprob": -9.5859375, "text": "deep"},
        {"id": 5168, "logprob": -1.3945312, "text": "learning"},
        {"id": 28804, "logprob": -0.4555664, "text": "?"}
      ],
      "seed": null,
      "tokens": [
        {"id": 13, "logprob": -0.6953125, "special": false, "text": "\n"},
        {"id": 13, "logprob": -0.4777832, "special": false, "text": "\n"},
        {"id": 23229, "logprob": -0.13232422, "special": false, "text": "Deep"},
        {"id": 5168, "logprob": -0.023834229, "special": false, "text": " learning"},
        {"id": 349, "logprob": -0.13977051, "special": false, "text": " is"},
        {"id": 264, "logprob": -0.14416504, "special": false, "text": " a"},
        {"id": 19804, "logprob": -0.63183594, "special": false, "text": " subset"},
        {"id": 302, "logprob": -0.010223389, "special": false, "text": " of"},
        {"id": 5599, "logprob": -0.064208984, "special": false, "text": " machine"},
        {"id": 5168, "logprob": -0.0028266907, "special": false, "text": " learning"}
      ],
      "top_tokens": null
    },
    "generated_text": "\n\nDeep learning is a subset of machine learning"
  },
  {
    "details": {
      "best_of_sequences": null,
      "finish_reason": "length",
      "generated_tokens": 10,
      "prefill": [
        {"id": 1, "logprob": null, "text": "<s>"},
        {"id": 1824, "logprob": -9.2890625, "text": "What"},
        {"id": 349, "logprob": -1.1425781, "text": "is"},
        {"id": 3534, "logprob": -9.59375, "text": "deep"},
        {"id": 5168, "logprob": -1.390625, "text": "learning"},
        {"id": 28804, "logprob": -0.45532227, "text": "?"}
      ],
      "seed": null,
      "tokens": [
        {"id": 13, "logprob": -0.6953125, "special": false, "text": "\n"},
        {"id": 13, "logprob": -0.48339844, "special": false, "text": "\n"},
        {"id": 23229, "logprob": -0.13256836, "special": false, "text": "Deep"},
        {"id": 5168, "logprob": -0.02420044, "special": false, "text": " learning"},
        {"id": 349, "logprob": -0.13977051, "special": false, "text": " is"},
        {"id": 264, "logprob": -0.14501953, "special": false, "text": " a"},
        {"id": 19804, "logprob": -0.63134766, "special": false, "text": " subset"},
        {"id": 302, "logprob": -0.010223389, "special": false, "text": " of"},
        {"id": 5599, "logprob": -0.06427002, "special": false, "text": " machine"},
        {"id": 5168, "logprob": -0.002817154, "special": false, "text": " learning"}
      ],
      "top_tokens": null
    },
    "generated_text": "\n\nDeep learning is a subset of machine learning"
  },
  {
    "details": {
      "best_of_sequences": null,
      "finish_reason": "length",
      "generated_tokens": 10,
      "prefill": [
        {"id": 1, "logprob": null, "text": "<s>"},
        {"id": 1824, "logprob": -9.2890625, "text": "What"},
        {"id": 349, "logprob": -1.1425781, "text": "is"},
        {"id": 3534, "logprob": -9.59375, "text": "deep"},
        {"id": 5168, "logprob": -1.390625, "text": "learning"},
        {"id": 28804, "logprob": -0.45532227, "text": "?"}
      ],
      "seed": null,
      "tokens": [
        {"id": 13, "logprob": -0.6953125, "special": false, "text": "\n"},
        {"id": 13, "logprob": -0.48339844, "special": false, "text": "\n"},
        {"id": 23229, "logprob": -0.13256836, "special": false, "text": "Deep"},
        {"id": 5168, "logprob": -0.02420044, "special": false, "text": " learning"},
        {"id": 349, "logprob": -0.13977051, "special": false, "text": " is"},
        {"id": 264, "logprob": -0.14501953, "special": false, "text": " a"},
        {"id": 19804, "logprob": -0.63134766, "special": false, "text": " subset"},
        {"id": 302, "logprob": -0.010223389, "special": false, "text": " of"},
        {"id": 5599, "logprob": -0.06427002, "special": false, "text": " machine"},
        {"id": 5168, "logprob": -0.002817154, "special": false, "text": " learning"}
      ],
      "top_tokens": null
    },
    "generated_text": "\n\nDeep learning is a subset of machine learning"
  },
  {
    "details": {
      "best_of_sequences": null,
      "finish_reason": "length",
      "generated_tokens": 10,
      "prefill": [
        {"id": 1, "logprob": null, "text": "<s>"},
        {"id": 1824, "logprob": -9.2890625, "text": "What"},
        {"id": 349, "logprob": -1.1425781, "text": "is"},
        {"id": 3534, "logprob": -9.59375, "text": "deep"},
        {"id": 5168, "logprob": -1.390625, "text": "learning"},
        {"id": 28804, "logprob": -0.45532227, "text": "?"}
      ],
      "seed": null,
      "tokens": [
        {"id": 13, "logprob": -0.6953125, "special": false, "text": "\n"},
        {"id": 13, "logprob": -0.48339844, "special": false, "text": "\n"},
        {"id": 23229, "logprob": -0.13256836, "special": false, "text": "Deep"},
        {"id": 5168, "logprob": -0.02420044, "special": false, "text": " learning"},
        {"id": 349, "logprob": -0.13977051, "special": false, "text": " is"},
        {"id": 264, "logprob": -0.14501953, "special": false, "text": " a"},
        {"id": 19804, "logprob": -0.63134766, "special": false, "text": " subset"},
        {"id": 302, "logprob": -0.010223389, "special": false, "text": " of"},
        {"id": 5599, "logprob": -0.06427002, "special": false, "text": " machine"},
        {"id": 5168, "logprob": -0.002817154, "special": false, "text": " learning"}
      ],
      "top_tokens": null
    },
    "generated_text": "\n\nDeep learning is a subset of machine learning"
  }
]
integration-tests/models/__snapshots__/test_flash_pali_gemma/test_flash_pali_gemma.json
@@ -8,13 +8,13 @@
     "tokens": [
       {
         "id": 54901,
-        "logprob": -0.72753906,
+        "logprob": -0.84765625,
         "special": false,
         "text": "beach"
       },
       {
         "id": 1,
-        "logprob": -0.011009216,
+        "logprob": -0.008666992,
         "special": true,
         "text": "<eos>"
       }
integration-tests/models/__snapshots__/test_flash_pali_gemma/test_flash_pali_gemma_two_images.json
@@ -8,49 +8,49 @@
     "tokens": [
       {
         "id": 2502,
-        "logprob": -1.734375,
+        "logprob": -1.7890625,
         "special": false,
         "text": "image"
       },
       {
         "id": 2196,
-        "logprob": -0.5756836,
+        "logprob": -0.53125,
         "special": false,
         "text": " result"
       },
       {
         "id": 604,
-        "logprob": -0.007843018,
+        "logprob": -0.0077209473,
         "special": false,
         "text": " for"
       },
       {
         "id": 12254,
-        "logprob": -1.7167969,
+        "logprob": -1.703125,
         "special": false,
         "text": " chicken"
       },
       {
         "id": 611,
-        "logprob": -0.17053223,
+        "logprob": -0.21582031,
         "special": false,
         "text": " on"
       },
       {
         "id": 573,
-        "logprob": -0.7626953,
+        "logprob": -0.734375,
         "special": false,
         "text": " the"
       },
       {
         "id": 8318,
-        "logprob": -0.02709961,
+        "logprob": -0.026000977,
         "special": false,
         "text": " beach"
       },
       {
         "id": 1,
-        "logprob": -0.20739746,
+        "logprob": -0.2109375,
         "special": true,
         "text": "<eos>"
       }
integration-tests/models/__snapshots__/test_flash_phi/test_flash_phi_all_params.json
@@ -19,25 +19,25 @@
     "tokens": [
       {
         "id": 284,
-        "logprob": -0.19421387,
+        "logprob": -0.28955078,
         "special": false,
         "text": " to"
       },
       {
         "id": 3758,
-        "logprob": -0.62597656,
+        "logprob": -0.7739258,
         "special": false,
         "text": " send"
       },
       {
         "id": 1366,
-        "logprob": -0.87060547,
+        "logprob": -0.85253906,
         "special": false,
         "text": " data"
       },
       {
         "id": 625,
-        "logprob": -0.88427734,
+        "logprob": -0.8984375,
         "special": false,
         "text": " over"
       },
@@ -49,7 +49,7 @@
       },
       {
         "id": 3127,
-        "logprob": -1.9462891,
+        "logprob": -1.9404297,
         "special": false,
         "text": " network"
       }
integration-tests/models/__snapshots__/test_flash_phi35_moe/test_flash_phi35_moe.json
0 → 100644
{
  "details": {
    "best_of_sequences": null,
    "finish_reason": "length",
    "generated_tokens": 10,
    "prefill": [
      {"id": 1724, "logprob": null, "text": "What"},
      {"id": 338, "logprob": -0.6201172, "text": "is"},
      {"id": 16030, "logprob": -13.6484375, "text": "gradient"},
      {"id": 26815, "logprob": -0.003894806, "text": "descent"},
      {"id": 29973, "logprob": -2.6386719, "text": "?"},
      {"id": 13, "logprob": -6.46875, "text": "\n"},
      {"id": 13, "logprob": -6.6875, "text": "\n"}
    ],
    "seed": null,
    "tokens": [
      {"id": 25584, "logprob": -0.008979797, "special": false, "text": "Grad"},
      {"id": 993, "logprob": -8.34465e-07, "special": false, "text": "ient"},
      {"id": 26815, "logprob": -0.0009407997, "special": false, "text": " descent"},
      {"id": 338, "logprob": -0.0003838539, "special": false, "text": " is"},
      {"id": 385, "logprob": -0.24499512, "special": false, "text": " an"},
      {"id": 13883, "logprob": -0.010406494, "special": false, "text": " optimization"},
      {"id": 5687, "logprob": -0.00024354458, "special": false, "text": " algorithm"},
      {"id": 15574, "logprob": -0.6582031, "special": false, "text": " commonly"},
      {"id": 1304, "logprob": -0.00092840195, "special": false, "text": " used"},
      {"id": 297, "logprob": -0.19470215, "special": false, "text": " in"}
    ],
    "top_tokens": null
  },
  "generated_text": "Gradient descent is an optimization algorithm commonly used in"
}
integration-tests/models/__snapshots__/test_flash_phi35_moe/test_flash_phi35_moe_all_params.json
0 → 100644
{
  "details": {
    "best_of_sequences": null,
    "finish_reason": "length",
    "generated_tokens": 10,
    "prefill": [
      {"id": 338, "logprob": null, "text": "is"},
      {"id": 16030, "logprob": -13.328125, "text": "gradient"},
      {"id": 26815, "logprob": -0.24023438, "text": "descent"},
      {"id": 29973, "logprob": -3.1386719, "text": "?"},
      {"id": 13, "logprob": -3.0878906, "text": "\n"}
    ],
    "seed": 0,
    "tokens": [
      {"id": 25584, "logprob": 0.0, "special": false, "text": "Grad"},
      {"id": 993, "logprob": 0.0, "special": false, "text": "ient"},
      {"id": 2726, "logprob": 0.0, "special": false, "text": " Des"},
      {"id": 1760, "logprob": 0.0, "special": false, "text": "cent"},
      {"id": 313, "logprob": -0.12322998, "special": false, "text": " ("},
      {"id": 29954, "logprob": 0.0, "special": false, "text": "G"},
      {"id": 29928, "logprob": 0.0, "special": false, "text": "D"},
      {"id": 29897, "logprob": 0.0, "special": false, "text": ")"},
      {"id": 338, "logprob": -0.6040039, "special": false, "text": " is"},
      {"id": 385, "logprob": -0.1796875, "special": false, "text": " an"}
    ],
    "top_tokens": null
  },
  "generated_text": "What is gradient descent?\nGradient Descent (GD) is an"
}
integration-tests/models/__snapshots__/test_flash_phi35_moe/test_flash_phi35_moe_load.json
0 → 100644
[
  {
    "details": {
      "best_of_sequences": null,
      "finish_reason": "length",
      "generated_tokens": 10,
      "prefill": [
        {"id": 1724, "logprob": null, "text": "What"},
        {"id": 338, "logprob": -0.6201172, "text": "is"},
        {"id": 16030, "logprob": -13.6484375, "text": "gradient"},
        {"id": 26815, "logprob": -0.003894806, "text": "descent"},
        {"id": 29973, "logprob": -2.6386719, "text": "?"},
        {"id": 13, "logprob": -6.46875, "text": "\n"},
        {"id": 13, "logprob": -6.6875, "text": "\n"}
      ],
      "seed": null,
      "tokens": [
        {"id": 25584, "logprob": -0.008979797, "special": false, "text": "Grad"},
        {"id": 993, "logprob": -8.34465e-07, "special": false, "text": "ient"},
        {"id": 26815, "logprob": -0.00097084045, "special": false, "text": " descent"},
        {"id": 338, "logprob": -0.0003838539, "special": false, "text": " is"},
        {"id": 385, "logprob": -0.23840332, "special": false, "text": " an"},
        {"id": 13883, "logprob": -0.010406494, "special": false, "text": " optimization"},
        {"id": 5687, "logprob": -0.0002501011, "special": false, "text": " algorithm"},
        {"id": 15574, "logprob": -0.6582031, "special": false, "text": " commonly"},
        {"id": 1304, "logprob": -0.00092840195, "special": false, "text": " used"},
        {"id": 297, "logprob": -0.18933105, "special": false, "text": " in"}
      ],
      "top_tokens": null
    },
    "generated_text": "Gradient descent is an optimization algorithm commonly used in"
  },
  {
    "details": {
      "best_of_sequences": null,
      "finish_reason": "length",
      "generated_tokens": 10,
      "prefill": [
        {"id": 1724, "logprob": null, "text": "What"},
        {"id": 338, "logprob": -0.6113281, "text": "is"},
        {"id": 16030, "logprob": -13.6640625, "text": "gradient"},
        {"id": 26815, "logprob": -0.003929138, "text": "descent"},
        {"id": 29973, "logprob": -2.625, "text": "?"},
        {"id": 13, "logprob": -6.484375, "text": "\n"},
        {"id": 13, "logprob": -6.6875, "text": "\n"}
      ],
      "seed": null,
      "tokens": [
        {"id": 25584, "logprob": -0.009017944, "special": false, "text": "Grad"},
        {"id": 993, "logprob": -9.536743e-07, "special": false, "text": "ient"},
        {"id": 26815, "logprob": -0.00097084045, "special": false, "text": " descent"},
        {"id": 338, "logprob": -0.0003838539, "special": false, "text": " is"},
        {"id": 385, "logprob": -0.24499512, "special": false, "text": " an"},
        {"id": 13883, "logprob": -0.010406494, "special": false, "text": " optimization"},
        {"id": 5687, "logprob": -0.0002501011, "special": false, "text": " algorithm"},
        {"id": 15574, "logprob": -0.6435547, "special": false, "text": " commonly"},
        {"id": 1304, "logprob": -0.0009279251, "special": false, "text": " used"},
        {"id": 297, "logprob": -0.18933105, "special": false, "text": " in"}
      ],
      "top_tokens": null
    },
    "generated_text": "Gradient descent is an optimization algorithm commonly used in"
  },
  {
    "details": {
      "best_of_sequences": null,
      "finish_reason": "length",
      "generated_tokens": 10,
      "prefill": [
        {"id": 1724, "logprob": null, "text": "What"},
        {"id": 338, "logprob": -0.609375, "text": "is"},
        {"id": 16030, "logprob": -13.671875, "text": "gradient"},
        {"id": 26815, "logprob": -0.0040016174, "text": "descent"},
        {"id": 29973, "logprob": -2.6230469, "text": "?"},
        {"id": 13, "logprob": -6.453125, "text": "\n"},
        {"id": 13, "logprob": -6.6875, "text": "\n"}
      ],
      "seed": null,
      "tokens": [
        {"id": 25584, "logprob": -0.008956909, "special": false, "text": "Grad"},
        {"id": 993, "logprob": -8.34465e-07, "special": false, "text": "ient"},
        {"id": 26815, "logprob": -0.0009407997, "special": false, "text": " descent"},
        {"id": 338, "logprob": -0.0003721714, "special": false, "text": " is"},
        {"id": 385, "logprob": -0.24499512, "special": false, "text": " an"},
        {"id": 13883, "logprob": -0.010406494, "special": false, "text": " optimization"},
        {"id": 5687, "logprob": -0.0002501011, "special": false, "text": " algorithm"},
        {"id": 15574, "logprob": -0.6435547, "special": false, "text": " commonly"},
        {"id": 1304, "logprob": -0.00092601776, "special": false, "text": " used"},
        {"id": 297, "logprob": -0.19177246, "special": false, "text": " in"}
      ],
      "top_tokens": null
    },
    "generated_text": "Gradient descent is an optimization algorithm commonly used in"
  },
  {
    "details": {
      "best_of_sequences": null,
      "finish_reason": "length",
      "generated_tokens": 10,
      "prefill": [
        {"id": 1724, "logprob": null, "text": "What"},
        {"id": 338, "logprob": -0.609375, "text": "is"},
        {"id": 16030, "logprob": -13.6640625, "text": "gradient"},
        {"id": 26815, "logprob": -0.0038967133, "text": "descent"},
        {"id": 29973, "logprob": -2.6347656, "text": "?"},
        {"id": 13, "logprob": -6.453125, "text": "\n"},
        {"id": 13, "logprob": -6.6875, "text": "\n"}
      ],
      "seed": null,
      "tokens": [
        {"id": 25584, "logprob": -0.008979797, "special": false, "text": "Grad"},
        {"id": 993, "logprob": -9.536743e-07, "special": false, "text": "ient"},
        {"id": 26815, "logprob": -0.0009407997, "special": false, "text": " descent"},
        {"id": 338, "logprob": -0.00038409233, "special": false, "text": " is"},
        {"id": 385, "logprob": -0.24499512, "special": false, "text": " an"},
        {"id": 13883, "logprob": -0.010414124, "special": false, "text": " optimization"},
        {"id": 5687, "logprob": -0.00024354458, "special": false, "text": " algorithm"},
        {"id": 15574, "logprob": -0.6435547, "special": false, "text": " commonly"},
        {"id": 1304, "logprob": -0.0009279251, "special": false, "text": " used"},
        {"id": 297, "logprob": -0.19470215, "special": false, "text": " in"}
      ],
      "top_tokens": null
    },
    "generated_text": "Gradient descent is an optimization algorithm commonly used in"
  }
]
integration-tests/models/__snapshots__/test_flash_starcoder/test_flash_starcoder_default_params.json
@@ -11,17 +11,17 @@
       },
       {
         "id": 1459,
-        "logprob": -5.6328125,
+        "logprob": -5.625,
         "text": " print"
       },
       {
         "id": 81,
-        "logprob": -1.6035156,
+        "logprob": -1.6064453,
         "text": "_"
       },
       {
         "id": 7656,
-        "logprob": -5.9882812,
+        "logprob": -5.9921875,
         "text": "hello"
       }
     ],
@@ -29,7 +29,7 @@
     "tokens": [
       {
         "id": 2262,
-        "logprob": -0.042999268,
+        "logprob": -0.045715332,
         "special": false,
         "text": "():"
       },
@@ -59,7 +59,7 @@
       },
       {
         "id": 10896,
-        "logprob": -0.38549805,
+        "logprob": -0.3659668,
         "special": false,
         "text": " World"
       },
@@ -113,7 +113,7 @@
       },
       {
         "id": 426,
-        "logprob": 0.0,
+        "logprob": -0.051635742,
         "special": false,
         "text": "name"
       },
@@ -323,7 +323,7 @@
       },
       {
         "id": 313,
-        "logprob": -0.6328125,
+        "logprob": -0.6933594,
         "special": false,
         "text": "\""
       },
@@ -387,7 +387,8 @@
         "special": false,
         "text": " print"
       }
-    ]
+    ],
+    "top_tokens": null
   },
"generated_text"
:
"():
\n
print(
\"
Hello World
\"
)
\n\n
def print_hello_name(name):
\n
print(
\"
Hello
\"
+ name)
\n\n
def print_hello_name_age(name, age):
\n
print(
\"
Hello
\"
+ name +
\"
\"
+ str(age))
\n\n
def print"
}
integration-tests/models/__snapshots__/test_flash_starcoder2/test_flash_starcoder2_default_params.json
@@ -11,12 +11,12 @@
       },
       {
         "id": 1489,
-        "logprob": -5.2617188,
+        "logprob": -5.265625,
         "text": " print"
       },
       {
         "id": 100,
-        "logprob": -0.38476562,
+        "logprob": -0.38305664,
         "text": "_"
       },
       {
@@ -53,13 +53,13 @@
       },
       {
         "id": 8302,
-        "logprob": -0.28125,
+        "logprob": -0.26611328,
         "special": false,
         "text": "Hello"
       },
       {
         "id": 10914,
-        "logprob": -0.79248047,
+        "logprob": -0.7734375,
         "special": false,
         "text": " World"
       },
@@ -71,7 +71,7 @@
       },
       {
         "id": 222,
-        "logprob": -0.0619812,
+        "logprob": -0.054870605,
         "special": false,
         "text": "\n"
       },
@@ -83,7 +83,7 @@
       },
       {
         "id": 610,
-        "logprob": -0.4091797,
+        "logprob": -0.4152832,
         "special": false,
         "text": "def"
       },
@@ -113,7 +113,7 @@
       },
       {
         "id": 444,
-        "logprob": -0.21655273,
+        "logprob": -0.21618652,
         "special": false,
         "text": "name"
       },
@@ -160,40 +160,40 @@
         "text": "Hello"
       },
       {
-        "id": 332,
-        "logprob": -0.034698486,
+        "id": 925,
+        "logprob": -3.3476562,
         "special": false,
-        "text": "\""
+        "text": "%"
       },
       {
-        "id": 494,
+        "id": 120,
         "logprob": 0.0,
         "special": false,
-        "text": "+"
+        "text": "s"
       },
       {
-        "id": 655,
-        "logprob": 0.0,
+        "id": 11571,
+        "logprob": -0.08892822,
         "special": false,
-        "text": "name"
+        "text": "!\""
       },
       {
-        "id": 494,
-        "logprob": -0.20141602,
+        "id": 925,
+        "logprob": 0.0,
         "special": false,
-        "text": "+"
+        "text": "%"
       },
       {
-        "id": 332,
+        "id": 655,
         "logprob": 0.0,
         "special": false,
-        "text": "\""
+        "text": "name"
       },
       {
-        "id": 16013,
+        "id": 46,
         "logprob": 0.0,
         "special": false,
-        "text": "!\")"
+        "text": ")"
       },
       {
         "id": 222,
@@ -251,7 +251,7 @@
       },
       {
         "id": 400,
-        "logprob": 0.0,
+        "logprob": -0.074279785,
         "special": false,
         "text": "age"
       },
@@ -310,85 +310,85 @@
         "text": "Hello"
       },
       {
-        "id": 332,
+        "id": 925,
         "logprob": 0.0,
         "special": false,
-        "text": "\""
+        "text": "%"
       },
       {
-        "id": 494,
+        "id": 120,
         "logprob": 0.0,
         "special": false,
-        "text": "+"
+        "text": "s"
       },
       {
-        "id": 655,
-        "logprob": 0.0,
+        "id": 49,
+        "logprob": -0.07891846,
         "special": false,
-        "text": "name"
+        "text": ","
       },
       {
-        "id": 494,
+        "id": 863,
         "logprob": 0.0,
         "special": false,
-        "text": "+"
+        "text": "you"
       },
       {
-        "id": 3021,
-        "logprob": -0.5761719,
+        "id": 904,
+        "logprob": 0.0,
         "special": false,
-        "text": "\","
+        "text": "are"
       },
       {
-        "id": 863,
+        "id": 925,
         "logprob": 0.0,
         "special": false,
-        "text": "you"
+        "text": "%"
       },
       {
-        "id": 904,
+        "id": 105,
         "logprob": 0.0,
         "special": false,
-        "text": "are"
+        "text": "d"
       },
       {
-        "id": 332,
+        "id": 11339,
         "logprob": 0.0,
         "special": false,
-        "text": "\""
+        "text": "years"
       },
       {
-        "id": 494,
+        "id": 3627,
         "logprob": 0.0,
         "special": false,
-        "text": "+"
+        "text": "old"
       },
       {
-        "id": 615,
+        "id": 11571,
         "logprob": 0.0,
         "special": false,
-        "text": "str"
+        "text": "!\""
       },
       {
-        "id": 45,
+        "id": 925,
         "logprob": 0.0,
         "special": false,
-        "text": "("
+        "text": "%"
       },
       {
-        "id": 400,
+        "id": 327,
         "logprob": 0.0,
         "special": false,
-        "text": "age"
+        "text": "("
       },
       {
-        "id": 46,
+        "id": 444,
         "logprob": 0.0,
         "special": false,
-        "text": ")"
+        "text": "name"
       }
     ],
     "top_tokens": null
   },
"generated_text"
:
"():
\n
print(
\"
Hello World!
\"
)
\n\n
def print_hello_name(name):
\n
print(
\"
Hello
\"
+
name
+
\"
!
\"
)
\n\n
def print_hello_name_age(name, age):
\n
print(
\"
Hello
\"
+ name +
\"
, you are
\"
+ str(age)
"
"generated_text"
:
"():
\n
print(
\"
Hello World!
\"
)
\n\n
def print_hello_name(name):
\n
print(
\"
Hello
%s!
\"
%
name)
\n\n
def print_hello_name_age(name, age):
\n
print(
\"
Hello
%s, you are %d years old!
\"
% (name
"
}
integration-tests/models/__snapshots__/test_flash_starcoder_gptq/test_flash_starcoder_gptq.json
 {
   "details": {
     "best_of_sequences": null,
-    "finish_reason": "length",
-    "generated_tokens": 20,
+    "finish_reason": "eos_token",
+    "generated_tokens": 2,
     "prefill": [
       {
         "id": 589,
...
...
@@ -11,57 +11,57 @@
},
{
"id"
:
3226
,
"logprob"
:
-
8.5859
375
,
"logprob"
:
-
9.0234
375
,
"text"
:
" ge"
},
{
"id"
:
21017
,
"logprob"
:
-
7.5
859375
,
"logprob"
:
-
9.0
859375
,
"text"
:
"ometric"
},
{
"id"
:
81
,
"logprob"
:
-0.2
668457
,
"logprob"
:
-0.2
5585938
,
"text"
:
"_"
},
{
"id"
:
6009
,
"logprob"
:
-
1.641601
6
,
"logprob"
:
-
2.197265
6
,
"text"
:
"mean"
},
{
"id"
:
26
,
"logprob"
:
-0.2
2705078
,
"logprob"
:
-0.2
998047
,
"text"
:
"("
},
{
"id"
:
62
,
"logprob"
:
-5.
2304688
,
"logprob"
:
-5.
6445312
,
"text"
:
"L"
},
{
"id"
:
44
,
"logprob"
:
-3.0
976562
,
"logprob"
:
-3.0
839844
,
"text"
:
":"
},
{
"id"
:
1682
,
"logprob"
:
-
1.1044922
,
"logprob"
:
-
0.6748047
,
"text"
:
" List"
},
{
"id"
:
77
,
"logprob"
:
-0.
14294434
,
"logprob"
:
-0.
3864746
,
"text"
:
"["
},
{
"id"
:
1808
,
"logprob"
:
-0.
32299805
,
"logprob"
:
-0.
9355469
,
"text"
:
"float"
},
{
"id"
:
10794
,
"logprob"
:
-2.
8164062
,
"logprob"
:
-2.
5371094
,
"text"
:
"]):"
}
],
...
...
@@ -69,126 +69,18 @@
"tokens"
:
[
{
"id"
:
284
,
"logprob"
:
-
0
.1
282959
,
"logprob"
:
-
1
.1
679688
,
"special"
:
false
,
"text"
:
"
\n
"
},
{
"id"
:
1524
,
"logprob"
:
-0.97998047
,
"special"
:
false
,
"text"
:
"
\"\"\"
"
},
{
"id"
:
284
,
"logprob"
:
-0.7006836
,
"special"
:
false
,
"text"
:
"
\n
"
},
{
"id"
:
14883
,
"logprob"
:
-2.1933594
,
"special"
:
false
,
"text"
:
" Calculate"
},
{
"id"
:
322
,
"logprob"
:
-0.2697754
,
"special"
:
false
,
"text"
:
" the"
},
{
"id"
:
3226
,
"logprob"
:
-0.0836792
,
"special"
:
false
,
"text"
:
" ge"
},
{
"id"
:
21017
,
"logprob"
:
-0.018737793
,
"special"
:
false
,
"text"
:
"ometric"
},
{
"id"
:
5651
,
"logprob"
:
-0.028640747
,
"special"
:
false
,
"text"
:
" mean"
},
{
"id"
:
432
,
"logprob"
:
-0.29467773
,
"special"
:
false
,
"text"
:
" of"
},
{
"id"
:
312
,
"logprob"
:
-0.31518555
,
"special"
:
false
,
"text"
:
" a"
},
{
"id"
:
1149
,
"logprob"
:
-0.20605469
,
"special"
:
false
,
"text"
:
" list"
},
{
"id"
:
432
,
"logprob"
:
-0.23254395
,
"special"
:
false
,
"text"
:
" of"
},
{
"id"
:
7515
,
"logprob"
:
-0.4489746
,
"special"
:
false
,
"text"
:
" numbers"
},
{
"id"
:
32
,
"logprob"
:
-0.6044922
,
"special"
:
false
,
"text"
:
"."
},
{
"id"
:
446
,
"logprob"
:
-0.63964844
,
"special"
:
false
,
"text"
:
"
\n\n
"
},
{
"id"
:
499
,
"logprob"
:
-1.1953125
,
"special"
:
false
,
"text"
:
" :"
},
{
"id"
:
753
,
"logprob"
:
-0.03515625
,
"special"
:
false
,
"text"
:
"param"
},
{
"id"
:
498
,
"logprob"
:
-0.06311035
,
"special"
:
false
,
"text"
:
" L"
},
{
"id"
:
44
,
"logprob"
:
-0.003414154
,
"special"
:
false
,
"text"
:
":"
},
{
"id"
:
1682
,
"logprob"
:
-1.3310547
,
"special"
:
false
,
"text"
:
" List"
"id"
:
0
,
"logprob"
:
null
,
"special"
:
true
,
"text"
:
"<|endoftext|>"
}
],
"top_tokens"
:
null
},
"generated_text"
:
"
\n
\"\"\"\n
Calculate the geometric mean of a list of numbers.
\n\n
:param L: List
"
"generated_text"
:
"
\n
"
}
integration-tests/models/__snapshots__/test_flash_starcoder_gptq/test_flash_starcoder_gptq_default_params.json
 {
   "details": {
     "best_of_sequences": null,
-    "finish_reason": "length",
-    "generated_tokens": 20,
+    "finish_reason": "eos_token",
+    "generated_tokens": 2,
     "prefill": [
       {
         "id": 589,
...
...
@@ -11,57 +11,57 @@
},
{
"id"
:
3226
,
"logprob"
:
-
8.585937
5
,
"logprob"
:
-
9.01562
5
,
"text"
:
" ge"
},
{
"id"
:
21017
,
"logprob"
:
-
7.5898438
,
"logprob"
:
-
9.0859375
,
"text"
:
"ometric"
},
{
"id"
:
81
,
"logprob"
:
-0.2
6
58
6914
,
"logprob"
:
-0.2
5
58
5938
,
"text"
:
"_"
},
{
"id"
:
6009
,
"logprob"
:
-
1.6347656
,
"logprob"
:
-
2.2304688
,
"text"
:
"mean"
},
{
"id"
:
26
,
"logprob"
:
-0.2
2705078
,
"logprob"
:
-0.2
9760742
,
"text"
:
"("
},
{
"id"
:
62
,
"logprob"
:
-5.
2382812
,
"logprob"
:
-5.
6796875
,
"text"
:
"L"
},
{
"id"
:
44
,
"logprob"
:
-3.0
996094
,
"logprob"
:
-3.0
742188
,
"text"
:
":"
},
{
"id"
:
1682
,
"logprob"
:
-
1.1025391
,
"logprob"
:
-
0.67626953
,
"text"
:
" List"
},
{
"id"
:
77
,
"logprob"
:
-0.
14294434
,
"logprob"
:
-0.
38842773
,
"text"
:
"["
},
{
"id"
:
1808
,
"logprob"
:
-0.
32226562
,
"logprob"
:
-0.
9165039
,
"text"
:
"float"
},
{
"id"
:
10794
,
"logprob"
:
-2.
8164062
,
"logprob"
:
-2.
5527344
,
"text"
:
"]):"
}
],
...
...
@@ -69,126 +69,18 @@
"tokens"
:
[
{
"id"
:
284
,
"logprob"
:
0.0
,
"logprob"
:
-
0.0
48583984
,
"special"
:
false
,
"text"
:
"
\n
"
},
{
"id"
:
442
,
"logprob"
:
-1.3134766
,
"special"
:
false
,
"text"
:
" return"
},
{
"id"
:
11665
,
"logprob"
:
-0.10021973
,
"special"
:
false
,
"text"
:
" reduce"
},
{
"id"
:
26
,
"logprob"
:
0.0
,
"special"
:
false
,
"text"
:
"("
},
{
"id"
:
5962
,
"logprob"
:
0.0
,
"special"
:
false
,
"text"
:
"lambda"
},
{
"id"
:
816
,
"logprob"
:
0.0
,
"special"
:
false
,
"text"
:
" x"
},
{
"id"
:
30
,
"logprob"
:
0.0
,
"special"
:
false
,
"text"
:
","
},
{
"id"
:
533
,
"logprob"
:
0.0
,
"special"
:
false
,
"text"
:
" y"
},
{
"id"
:
44
,
"logprob"
:
0.0
,
"special"
:
false
,
"text"
:
":"
},
{
"id"
:
816
,
"logprob"
:
0.0
,
"special"
:
false
,
"text"
:
" x"
},
{
"id"
:
319
,
"logprob"
:
-0.42871094
,
"special"
:
false
,
"text"
:
" *"
},
{
"id"
:
533
,
"logprob"
:
0.0
,
"special"
:
false
,
"text"
:
" y"
},
{
"id"
:
30
,
"logprob"
:
0.0
,
"special"
:
false
,
"text"
:
","
},
{
"id"
:
498
,
"logprob"
:
0.0
,
"special"
:
false
,
"text"
:
" L"
},
{
"id"
:
27
,
"logprob"
:
0.0
,
"special"
:
false
,
"text"
:
")"
},
{
"id"
:
1115
,
"logprob"
:
0.0
,
"special"
:
false
,
"text"
:
" **"
},
{
"id"
:
308
,
"logprob"
:
0.0
,
"special"
:
false
,
"text"
:
" ("
},
{
"id"
:
35
,
"logprob"
:
0.0
,
"special"
:
false
,
"text"
:
"1"
},
{
"id"
:
32
,
"logprob"
:
-0.31323242
,
"special"
:
false
,
"text"
:
"."
},
{
"id"
:
34
,
"logprob"
:
0.0
,
"special"
:
false
,
"text"
:
"0"
"id"
:
0
,
"logprob"
:
null
,
"special"
:
true
,
"text"
:
"<|endoftext|>"
}
],
"top_tokens"
:
null
},
"generated_text"
:
"
\n
return reduce(lambda x, y: x * y, L) ** (1.0
"
"generated_text"
:
"
\n
"
}
integration-tests/models/__snapshots__/test_flash_starcoder_gptq/test_flash_starcoder_gptq_load.json
@@ -2,8 +2,8 @@
   {
     "details": {
       "best_of_sequences": null,
-      "finish_reason": "length",
-      "generated_tokens": 10,
+      "finish_reason": "eos_token",
+      "generated_tokens": 2,
       "prefill": [
         {
           "id": 589,
...
...
@@ -12,57 +12,57 @@
},
{
"id"
:
3226
,
"logprob"
:
-8.
585937
5
,
"logprob"
:
-8.
945312
5
,
"text"
:
" ge"
},
{
"id"
:
21017
,
"logprob"
:
-
7.5820312
,
"logprob"
:
-
8.8515625
,
"text"
:
"ometric"
},
{
"id"
:
81
,
"logprob"
:
-0.2
6708984
,
"logprob"
:
-0.2
2033691
,
"text"
:
"_"
},
{
"id"
:
6009
,
"logprob"
:
-1.
6386719
,
"logprob"
:
-1.
2939453
,
"text"
:
"mean"
},
{
"id"
:
26
,
"logprob"
:
-0.2
271728
5
,
"logprob"
:
-0.2
526855
5
,
"text"
:
"("
},
{
"id"
:
62
,
"logprob"
:
-
5.2343
75
,
"logprob"
:
-
4.7968
75
,
"text"
:
"L"
},
{
"id"
:
44
,
"logprob"
:
-3.
101562
5
,
"logprob"
:
-3.
79687
5
,
"text"
:
":"
},
{
"id"
:
1682
,
"logprob"
:
-
1.1083984
,
"logprob"
:
-
0.8066406
,
"text"
:
" List"
},
{
"id"
:
77
,
"logprob"
:
-0.
14294434
,
"logprob"
:
-0.
22644043
,
"text"
:
"["
},
{
"id"
:
1808
,
"logprob"
:
-0.
32592773
,
"logprob"
:
-0.
46166992
,
"text"
:
"float"
},
{
"id"
:
10794
,
"logprob"
:
-
2.8164062
,
"logprob"
:
-
3.0253906
,
"text"
:
"]):"
}
],
...
...
@@ -70,74 +70,26 @@
"tokens"
:
[
{
"id"
:
284
,
"logprob"
:
-0.
12817383
,
"logprob"
:
-0.
046844482
,
"special"
:
false
,
"text"
:
"
\n
"
},
{
"id"
:
1524
,
"logprob"
:
-0.9863281
,
"special"
:
false
,
"text"
:
"
\"\"\"
"
},
{
"id"
:
284
,
"logprob"
:
-0.7011719
,
"special"
:
false
,
"text"
:
"
\n
"
},
{
"id"
:
14883
,
"logprob"
:
-2.2050781
,
"special"
:
false
,
"text"
:
" Calculate"
},
{
"id"
:
322
,
"logprob"
:
-0.2668457
,
"special"
:
false
,
"text"
:
" the"
},
{
"id"
:
3226
,
"logprob"
:
-0.08465576
,
"special"
:
false
,
"text"
:
" ge"
},
{
"id"
:
21017
,
"logprob"
:
-0.019012451
,
"special"
:
false
,
"text"
:
"ometric"
},
{
"id"
:
5651
,
"logprob"
:
-0.028625488
,
"special"
:
false
,
"text"
:
" mean"
},
{
"id"
:
432
,
"logprob"
:
-0.29418945
,
"special"
:
false
,
"text"
:
" of"
},
{
"id"
:
312
,
"logprob"
:
-0.3161621
,
"special"
:
false
,
"text"
:
" a"
"id"
:
0
,
"logprob"
:
null
,
"special"
:
true
,
"text"
:
"<|endoftext|>"
}
],
"top_tokens"
:
null
},
"generated_text"
:
"
\n
\"\"\"\n
Calculate the geometric mean of a
"
"generated_text"
:
"
\n
"
},
{
"details"
:
{
"best_of_sequences"
:
null
,
"finish_reason"
:
"
length
"
,
"generated_tokens"
:
10
,
"finish_reason"
:
"
eos_token
"
,
"generated_tokens"
:
2
,
"prefill"
:
[
{
"id"
:
589
,
...
...
@@ -146,57 +98,57 @@
},
{
"id"
:
3226
,
"logprob"
:
-8.
585
9375
,
"logprob"
:
-8.9375
,
"text"
:
" ge"
},
{
"id"
:
21017
,
"logprob"
:
-
7.5937
5
,
"logprob"
:
-
8.851562
5
,
"text"
:
"ometric"
},
{
"id"
:
81
,
"logprob"
:
-0.2
6953125
,
"logprob"
:
-0.2
1826172
,
"text"
:
"_"
},
{
"id"
:
6009
,
"logprob"
:
-1.
640625
,
"logprob"
:
-1.
2871094
,
"text"
:
"mean"
},
{
"id"
:
26
,
"logprob"
:
-0.2
2705078
,
"logprob"
:
-0.2
5390625
,
"text"
:
"("
},
{
"id"
:
62
,
"logprob"
:
-
5.234375
,
"logprob"
:
-
4.8085938
,
"text"
:
"L"
},
{
"id"
:
44
,
"logprob"
:
-3.
1132812
,
"logprob"
:
-3.
7890625
,
"text"
:
":"
},
{
"id"
:
1682
,
"logprob"
:
-
1.1123047
,
"logprob"
:
-
0.8076172
,
"text"
:
" List"
},
{
"id"
:
77
,
"logprob"
:
-0.
14294434
,
"logprob"
:
-0.
22302246
,
"text"
:
"["
},
{
"id"
:
1808
,
"logprob"
:
-0.
32299805
,
"logprob"
:
-0.
46435547
,
"text"
:
"float"
},
{
"id"
:
10794
,
"logprob"
:
-
2.8164062
,
"logprob"
:
-
3.0234375
,
"text"
:
"]):"
}
],
...
...
@@ -204,74 +156,26 @@
"tokens"
:
[
{
"id"
:
284
,
"logprob"
:
-0.
12854004
,
"logprob"
:
-0.
046722412
,
"special"
:
false
,
"text"
:
"
\n
"
},
{
"id"
:
1524
,
"logprob"
:
-0.9897461
,
"special"
:
false
,
"text"
:
"
\"\"\"
"
},
{
"id"
:
284
,
"logprob"
:
-0.69970703
,
"special"
:
false
,
"text"
:
"
\n
"
},
{
"id"
:
14883
,
"logprob"
:
-2.2050781
,
"special"
:
false
,
"text"
:
" Calculate"
},
{
"id"
:
322
,
"logprob"
:
-0.2668457
,
"special"
:
false
,
"text"
:
" the"
},
{
"id"
:
3226
,
"logprob"
:
-0.08496094
,
"special"
:
false
,
"text"
:
" ge"
},
{
"id"
:
21017
,
"logprob"
:
-0.019012451
,
"special"
:
false
,
"text"
:
"ometric"
},
{
"id"
:
5651
,
"logprob"
:
-0.029037476
,
"special"
:
false
,
"text"
:
" mean"
},
{
"id"
:
432
,
"logprob"
:
-0.2939453
,
"special"
:
false
,
"text"
:
" of"
},
{
"id"
:
312
,
"logprob"
:
-0.31591797
,
"special"
:
false
,
"text"
:
" a"
"id"
:
0
,
"logprob"
:
null
,
"special"
:
true
,
"text"
:
"<|endoftext|>"
}
],
"top_tokens"
:
null
},
"generated_text"
:
"
\n
\"\"\"\n
Calculate the geometric mean of a
"
"generated_text"
:
"
\n
"
},
{
"details"
:
{
"best_of_sequences"
:
null
,
"finish_reason"
:
"
length
"
,
"generated_tokens"
:
10
,
"finish_reason"
:
"
eos_token
"
,
"generated_tokens"
:
2
,
"prefill"
:
[
{
"id"
:
589
,
...
...
@@ -280,57 +184,57 @@
},
{
"id"
:
3226
,
"logprob"
:
-8.
585937
5
,
"logprob"
:
-8.
945312
5
,
"text"
:
" ge"
},
{
"id"
:
21017
,
"logprob"
:
-
7.585937
5
,
"logprob"
:
-
8.851562
5
,
"text"
:
"ometric"
},
{
"id"
:
81
,
"logprob"
:
-0.2
6586914
,
"logprob"
:
-0.2
1813965
,
"text"
:
"_"
},
{
"id"
:
6009
,
"logprob"
:
-1.
6347656
,
"logprob"
:
-1.
2744141
,
"text"
:
"mean"
},
{
"id"
:
26
,
"logprob"
:
-0.2
2766113
,
"logprob"
:
-0.2
512207
,
"text"
:
"("
},
{
"id"
:
62
,
"logprob"
:
-
5.226562
5
,
"logprob"
:
-
4.804687
5
,
"text"
:
"L"
},
{
"id"
:
44
,
"logprob"
:
-3.
0976
562
,
"logprob"
:
-3.
7851
562
,
"text"
:
":"
},
{
"id"
:
1682
,
"logprob"
:
-
1.1025391
,
"logprob"
:
-
0.81396484
,
"text"
:
" List"
},
{
"id"
:
77
,
"logprob"
:
-0.
1427002
,
"logprob"
:
-0.
22570801
,
"text"
:
"["
},
{
"id"
:
1808
,
"logprob"
:
-0.
32592773
,
"logprob"
:
-0.
46044922
,
"text"
:
"float"
},
{
"id"
:
10794
,
"logprob"
:
-
2.8164062
,
"logprob"
:
-
3.0234375
,
"text"
:
"]):"
}
],
...
...
@@ -338,74 +242,26 @@
"tokens"
:
[
{
"id"
:
284
,
"logprob"
:
-0.
13012695
,
"logprob"
:
-0.
04650879
,
"special"
:
false
,
"text"
:
"
\n
"
},
{
"id"
:
1524
,
"logprob"
:
-0.98046875
,
"special"
:
false
,
"text"
:
"
\"\"\"
"
},
{
"id"
:
284
,
"logprob"
:
-0.69921875
,
"special"
:
false
,
"text"
:
"
\n
"
},
{
"id"
:
14883
,
"logprob"
:
-2.1992188
,
"special"
:
false
,
"text"
:
" Calculate"
},
{
"id"
:
322
,
"logprob"
:
-0.2668457
,
"special"
:
false
,
"text"
:
" the"
},
{
"id"
:
3226
,
"logprob"
:
-0.083496094
,
"special"
:
false
,
"text"
:
" ge"
},
{
"id"
:
21017
,
"logprob"
:
-0.01902771
,
"special"
:
false
,
"text"
:
"ometric"
},
{
"id"
:
5651
,
"logprob"
:
-0.029006958
,
"special"
:
false
,
"text"
:
" mean"
},
{
"id"
:
432
,
"logprob"
:
-0.29248047
,
"special"
:
false
,
"text"
:
" of"
},
{
"id"
:
312
,
"logprob"
:
-0.3161621
,
"special"
:
false
,
"text"
:
" a"
"id"
:
0
,
"logprob"
:
null
,
"special"
:
true
,
"text"
:
"<|endoftext|>"
}
],
"top_tokens"
:
null
},
"generated_text"
:
"
\n
\"\"\"\n
Calculate the geometric mean of a
"
"generated_text"
:
"
\n
"
},
{
"details"
:
{
"best_of_sequences"
:
null
,
"finish_reason"
:
"
length
"
,
"generated_tokens"
:
10
,
"finish_reason"
:
"
eos_token
"
,
"generated_tokens"
:
2
,
"prefill"
:
[
{
"id"
:
589
,
...
...
@@ -414,57 +270,57 @@
},
{
"id"
:
3226
,
"logprob"
:
-8.
585937
5
,
"logprob"
:
-8.
945312
5
,
"text"
:
" ge"
},
{
"id"
:
21017
,
"logprob"
:
-
7.585937
5
,
"logprob"
:
-
8.851562
5
,
"text"
:
"ometric"
},
{
"id"
:
81
,
"logprob"
:
-0.2
6904297
,
"logprob"
:
-0.2
1960449
,
"text"
:
"_"
},
{
"id"
:
6009
,
"logprob"
:
-1.
6386719
,
"logprob"
:
-1.
2890625
,
"text"
:
"mean"
},
{
"id"
:
26
,
"logprob"
:
-0.2
2705078
,
"logprob"
:
-0.2
5073242
,
"text"
:
"("
},
{
"id"
:
62
,
"logprob"
:
-
5.234375
,
"logprob"
:
-
4.8085938
,
"text"
:
"L"
},
{
"id"
:
44
,
"logprob"
:
-3.
1132812
,
"logprob"
:
-3.
8046875
,
"text"
:
":"
},
{
"id"
:
1682
,
"logprob"
:
-
1.107421
9
,
"logprob"
:
-
0.807128
9
,
"text"
:
" List"
},
{
"id"
:
77
,
"logprob"
:
-0.
14477539
,
"logprob"
:
-0.
22570801
,
"text"
:
"["
},
{
"id"
:
1808
,
"logprob"
:
-0.
3256836
,
"logprob"
:
-0.
46118164
,
"text"
:
"float"
},
{
"id"
:
10794
,
"logprob"
:
-
2.8027344
,
"logprob"
:
-
3.0097656
,
"text"
:
"]):"
}
],
...
...
@@ -472,67 +328,19 @@
"tokens"
:
[
{
"id"
:
284
,
"logprob"
:
-0.
12915039
,
"logprob"
:
-0.
046539307
,
"special"
:
false
,
"text"
:
"
\n
"
},
{
"id"
:
1524
,
"logprob"
:
-0.98535156
,
"special"
:
false
,
"text"
:
"
\"\"\"
"
},
{
"id"
:
284
,
"logprob"
:
-0.69921875
,
"special"
:
false
,
"text"
:
"
\n
"
},
{
"id"
:
14883
,
"logprob"
:
-2.2011719
,
"special"
:
false
,
"text"
:
" Calculate"
},
{
"id"
:
322
,
"logprob"
:
-0.26708984
,
"special"
:
false
,
"text"
:
" the"
},
{
"id"
:
3226
,
"logprob"
:
-0.08502197
,
"special"
:
false
,
"text"
:
" ge"
},
{
"id"
:
21017
,
"logprob"
:
-0.019012451
,
"special"
:
false
,
"text"
:
"ometric"
},
{
"id"
:
5651
,
"logprob"
:
-0.028625488
,
"special"
:
false
,
"text"
:
" mean"
},
{
"id"
:
432
,
"logprob"
:
-0.29589844
,
"special"
:
false
,
"text"
:
" of"
},
{
"id"
:
312
,
"logprob"
:
-0.31591797
,
"special"
:
false
,
"text"
:
" a"
"id"
:
0
,
"logprob"
:
null
,
"special"
:
true
,
"text"
:
"<|endoftext|>"
}
],
"top_tokens"
:
null
},
"generated_text"
:
"
\n
\"\"\"\n
Calculate the geometric mean of a
"
"generated_text"
:
"
\n
"
}
]
integration-tests/models/__snapshots__/test_idefics2/test_flash_idefics2_next_all_params.json
@@ -30,7 +30,7 @@
       },
       {
         "id": 264,
-        "logprob": -0.37573242,
+        "logprob": -0.38061523,
         "special": false,
         "text": " a"
       },
@@ -42,7 +42,7 @@
       },
       {
         "id": 4480,
-        "logprob": -0.3322754,
+        "logprob": -0.26782227,
         "special": false,
         "text": " feature"
       },
@@ -78,7 +78,7 @@
       },
       {
         "id": 13,
-        "logprob": 0.0,
+        "logprob": -0.10632324,
         "special": false,
         "text": "\n"
       }
integration-tests/models/__snapshots__/test_idefics2/test_flash_idefics2_two_images.json
 {
   "details": {
     "best_of_sequences": null,
-    "finish_reason": "length",
-    "generated_tokens": 20,
+    "finish_reason": "eos_token",
+    "generated_tokens": 19,
     "prefill": [],
     "seed": null,
     "tokens": [
{
"id"
:
415
,
"logprob"
:
-0.03
9886475
,
"logprob"
:
-0.03
665161
,
"special"
:
false
,
"text"
:
" The"
},
{
"id"
:
12072
,
"logprob"
:
-0.1
430664
,
"logprob"
:
-0.1
3549805
,
"special"
:
false
,
"text"
:
" cow"
},
{
"id"
:
349
,
"logprob"
:
-0.05
6488037
,
"logprob"
:
-0.05
819702
,
"special"
:
false
,
"text"
:
" is"
},
{
"id"
:
6328
,
"logprob"
:
-0.68
55469
,
"logprob"
:
-0.68
26172
,
"special"
:
false
,
"text"
:
" standing"
},
{
"id"
:
356
,
"logprob"
:
-0.16
85791
,
"logprob"
:
-0.16
07666
,
"special"
:
false
,
"text"
:
" on"
},
{
"id"
:
272
,
"logprob"
:
-0.50
097656
,
"logprob"
:
-0.50
73242
,
"special"
:
false
,
"text"
:
" the"
},
{
"id"
:
10305
,
"logprob"
:
-0.01
730346
7
,
"logprob"
:
-0.01
641845
7
,
"special"
:
false
,
"text"
:
" beach"
},
{
"id"
:
304
,
"logprob"
:
-1.3
564453
,
"logprob"
:
-1.3
916016
,
"special"
:
false
,
"text"
:
" and"
},
{
"id"
:
272
,
"logprob"
:
-0.0
17868042
,
"logprob"
:
-0.0
20217896
,
"special"
:
false
,
"text"
:
" the"
},
{
"id"
:
13088
,
"logprob"
:
-0.002
7103424
,
"logprob"
:
-0.002
8133392
,
"special"
:
false
,
"text"
:
" chicken"
},
{
"id"
:
349
,
"logprob"
:
-0.0031
56662
,
"logprob"
:
-0.0031
45218
,
"special"
:
false
,
"text"
:
" is"
},
{
"id"
:
6398
,
"logprob"
:
-0.37
304688
,
"logprob"
:
-0.37
060547
,
"special"
:
false
,
"text"
:
" sitting"
},
{
"id"
:
356
,
"logprob"
:
-0.034
576416
,
"logprob"
:
-0.034
851074
,
"special"
:
false
,
"text"
:
" on"
},
{
"id"
:
264
,
"logprob"
:
-0.2
9418945
,
"logprob"
:
-0.2
878418
,
"special"
:
false
,
"text"
:
" a"
},
{
"id"
:
17972
,
"logprob"
:
-0.04
2877197
,
"logprob"
:
-0.04
6051025
,
"special"
:
false
,
"text"
:
" pile"
},
{
"id"
:
302
,
"logprob"
:
-0.00028
443336
,
"logprob"
:
-0.00028
848648
,
"special"
:
false
,
"text"
:
" of"
},
{
"id"
:
2445
,
"logprob"
:
-0.02
3223877
,
"logprob"
:
-0.02
5772095
,
"special"
:
false
,
"text"
:
" money"
},
{
"id"
:
28723
,
"logprob"
:
-0.0181
57959
,
"logprob"
:
-0.0181
27441
,
"special"
:
false
,
"text"
:
"."
},
{
"id"
:
32002
,
"logprob"
:
-0.0001
8393993
,
"logprob"
:
-0.0001
9824505
,
"special"
:
true
,
"text"
:
"<end_of_utterance>"
},
{
"id"
:
2
,
"logprob"
:
-1.1920929e-07
,
"special"
:
true
,
"text"
:
"</s>"
}
],
"top_tokens"
:
null
...
...