OpenDAS / text-generation-inference
Commit cbd36aa4 (Unverified)
Authored Mar 13, 2023 by OlivierDehaene; committed by GitHub on Mar 13, 2023
fix(server): revert gpt-neox optims (#123)
Parent: 6860ce9c
Changes: 2 changed files with 6 additions and 2 deletions (+6 -2)
router/src/infer.rs  +5 -1
server/Makefile  +1 -1
router/src/infer.rs @ cbd36aa4

@@ -229,7 +229,11 @@ async fn batching_task(
     shared: Arc<Shared>,
 ) {
     // Minimum batch size after which we try to add more requests
-    let limit_min_batch_size = (max_batch_size / 2) as u32;
+    let limit_min_batch_size = if max_batch_size > 1 {
+        (max_batch_size / 2) as u32
+    } else {
+        0
+    };
 
     // Infinite loop
     loop {
...
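To make the new threshold expression easier to check in isolation, here is a minimal sketch that lifts it into a standalone function. The function name `limit_min_batch_size` as a helper is hypothetical, and the `usize` parameter type is an assumption: the hunk does not show how `max_batch_size` is declared.

```rust
/// Hypothetical standalone helper mirroring the expression added in the hunk above.
/// Assumption: `max_batch_size` is a `usize`; the diff does not show its type.
fn limit_min_batch_size(max_batch_size: usize) -> u32 {
    if max_batch_size > 1 {
        // Integer division truncates toward zero, so 9 / 2 == 4.
        (max_batch_size / 2) as u32
    } else {
        // A maximum batch size of 0 or 1 gives a threshold of zero.
        0
    }
}
```

Under that assumption, inputs of 0, 1, 2, 8 and 9 map to thresholds of 0, 0, 1, 4 and 4. For an unsigned integer, truncating division alone also gives 0 for inputs 0 and 1, so the guard can be read as making the single-request case explicit.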
server/Makefile @ cbd36aa4

-transformers_commit := 2f87dca1ca3e5663d0637da9bb037a6956e57a5e
+transformers_commit := 517563354a3226ecfc3dca6e7a38012668d7156a
 gen-server:
 	# Compile protos
...