@@ -50,7 +50,7 @@ Generate a response for a given prompt with a provided model. This is a streamin
...
@@ -50,7 +50,7 @@ Generate a response for a given prompt with a provided model. This is a streamin
Advanced parameters (optional):
Advanced parameters (optional):
-`format`: the format to return a response in. Format can be `json` or a JSON schema
-`format`: the format to return a response in. Format can be `json` or a JSON schema
-`options`: additional model parameters listed in the documentation for the [Modelfile](./modelfile.md#valid-parameters-and-values) such as `temperature`
-`options`: additional model parameters listed in the documentation for the [Modelfile](./modelfile.mdx#valid-parameters-and-values) such as `temperature`
-`system`: system message to (overrides what is defined in the `Modelfile`)
-`system`: system message to (overrides what is defined in the `Modelfile`)
-`template`: the prompt template to use (overrides what is defined in the `Modelfile`)
-`template`: the prompt template to use (overrides what is defined in the `Modelfile`)
-`stream`: if `false` the response will be returned as a single response object, rather than a stream of objects
-`stream`: if `false` the response will be returned as a single response object, rather than a stream of objects
...
@@ -507,7 +507,7 @@ The `message` object has the following fields:
...
@@ -507,7 +507,7 @@ The `message` object has the following fields:
Advanced parameters (optional):
Advanced parameters (optional):
-`format`: the format to return a response in. Format can be `json` or a JSON schema.
-`format`: the format to return a response in. Format can be `json` or a JSON schema.
-`options`: additional model parameters listed in the documentation for the [Modelfile](./modelfile.md#valid-parameters-and-values) such as `temperature`
-`options`: additional model parameters listed in the documentation for the [Modelfile](./modelfile.mdx#valid-parameters-and-values) such as `temperature`
-`stream`: if `false` the response will be returned as a single response object, rather than a stream of objects
-`stream`: if `false` the response will be returned as a single response object, rather than a stream of objects
-`keep_alive`: controls how long the model will stay loaded into memory following the request (default: `5m`)
-`keep_alive`: controls how long the model will stay loaded into memory following the request (default: `5m`)
...
@@ -1189,7 +1189,7 @@ If you are creating a model from a safetensors directory or from a GGUF file, yo
...
@@ -1189,7 +1189,7 @@ If you are creating a model from a safetensors directory or from a GGUF file, yo
-`template`: (optional) the prompt template for the model
-`template`: (optional) the prompt template for the model
-`license`: (optional) a string or list of strings containing the license or licenses for the model
-`license`: (optional) a string or list of strings containing the license or licenses for the model
-`system`: (optional) a string containing the system prompt for the model
-`system`: (optional) a string containing the system prompt for the model
-`parameters`: (optional) a dictionary of parameters for the model (see [Modelfile](./modelfile.md#valid-parameters-and-values) for a list of parameters)
-`parameters`: (optional) a dictionary of parameters for the model (see [Modelfile](./modelfile.mdx#valid-parameters-and-values) for a list of parameters)
-`messages`: (optional) a list of message objects used to create a conversation
-`messages`: (optional) a list of message objects used to create a conversation
-`stream`: (optional) if `false` the response will be returned as a single response object, rather than a stream of objects
-`stream`: (optional) if `false` the response will be returned as a single response object, rather than a stream of objects
-`quantize` (optional): quantize a non-quantized (e.g. float16) model
-`quantize` (optional): quantize a non-quantized (e.g. float16) model
...
@@ -1698,7 +1698,7 @@ Generate embeddings from a model
...
@@ -1698,7 +1698,7 @@ Generate embeddings from a model
Advanced parameters:
Advanced parameters:
-`truncate`: truncates the end of each input to fit within context length. Returns error if `false` and context length is exceeded. Defaults to `true`
-`truncate`: truncates the end of each input to fit within context length. Returns error if `false` and context length is exceeded. Defaults to `true`
-`options`: additional model parameters listed in the documentation for the [Modelfile](./modelfile.md#valid-parameters-and-values) such as `temperature`
-`options`: additional model parameters listed in the documentation for the [Modelfile](./modelfile.mdx#valid-parameters-and-values) such as `temperature`
-`keep_alive`: controls how long the model will stay loaded into memory following the request (default: `5m`)
-`keep_alive`: controls how long the model will stay loaded into memory following the request (default: `5m`)
-`dimensions`: number of dimensions for the embedding
-`dimensions`: number of dimensions for the embedding
...
@@ -1817,7 +1817,7 @@ Generate embeddings from a model
...
@@ -1817,7 +1817,7 @@ Generate embeddings from a model
Advanced parameters:
Advanced parameters:
-`options`: additional model parameters listed in the documentation for the [Modelfile](./modelfile.md#valid-parameters-and-values) such as `temperature`
-`options`: additional model parameters listed in the documentation for the [Modelfile](./modelfile.mdx#valid-parameters-and-values) such as `temperature`
-`keep_alive`: controls how long the model will stay loaded into memory following the request (default: `5m`)
-`keep_alive`: controls how long the model will stay loaded into memory following the request (default: `5m`)