AI-Ollama-Client
view release on metacpan or search on metacpan
lib/AI/Ollama/RequestOptions.pm view on Meta::CPAN
Maximum number of tokens to predict when generating text. (Default: 128, -1 = infinite generation, -2 = fill context)
=cut
has 'num_predict' => (
is => 'ro',
isa => Int,
);
=head2 C<< num_thread >>
Sets the number of threads to use during computation. By default, Ollama will detect this for optimal performance. It is recommended to set this value to the number of physical CPU cores your system has (as opposed to the logical number of cores).
=cut
has 'num_thread' => (
is => 'ro',
isa => Int,
);
=head2 C<< numa >>
Enable NUMA support. (Default: false)
=cut
ollama/ollama-curated.json view on Meta::CPAN
{"openapi":"3.0.3","components":{"schemas":{"PushModelResponse":{"properties":{"total":{"type":"integer","description":"total size of the model","example":"2142590208"},"status":{"$ref":"#/components/schemas/PushModelStatus"},"digest":{"example":"sha...
ollama/ollama-curated.yaml view on Meta::CPAN
rope_frequency_base:
type: number
format: float
description: |
The base of the rope frequency scale. (Default: 1.0)
rope_frequency_scale:
type: number
format: float
description: |
The scale of the rope frequency. (Default: 1.0)
num_thread:
type: integer
description: |
Sets the number of threads to use during computation. By default, Ollama will detect this for optimal performance. It is recommended to set this value to the number of physical CPU cores your system has (as opposed to the logical number o...
ResponseFormat:
type: string
description: |
The format to return a response in. Currently the only accepted value is json.
Enable JSON mode by setting the format parameter to json. This will structure the response as valid JSON.
Note: it's important to instruct the model to use JSON in the prompt. Otherwise, the model may generate large amounts whitespace.
enum:
- json
t/generate.request view on Meta::CPAN
"main_gpu": 0,
"low_vram": true,
"f16_kv": true,
"logits_all": true,
"vocab_only": true,
"use_mmap": true,
"use_mlock": true,
"embedding_only": true,
"rope_frequency_base": 0,
"rope_frequency_scale": 0,
"num_thread": 0
},
"format": "json",
"raw": true,
"stream": false,
"keep_alive": 0
}
( run in 0.318 second using v1.01-cache-2.11-cpan-87723dcf8b7 )