REQUIRED | KEY | TYPE | DESCRIPTION |
No | prompt | str | Prompt to use for generation. |
No | systemMessage | str | System prompt for models that support it. |
No | llmName | LLMName | Name of the underlying LLM to use for generation. Defaults to automatic selection. |
No | maxTokens | int | Maximum number of tokens to generate. If set, the model stops generating once this limit is reached. |
No | temperature | float | Temperature to use for generation. Higher values produce less deterministic responses; a value of zero produces mostly deterministic responses. Default is 0.0. Allowed range is 0.0 to 2.0. |
No | messages | list | A list of messages to use as conversation history. Do not set this for completion models such as OPENAI_GPT3_5_TEXT and PALM_TEXT. Each message is a dict with the attributes: is_user (bool): whether the message is from the user; text (str): the message's text; attachments (list): the files attached to the message, represented as a list of dictionaries [{"doc_id": }, {"doc_id": }] |
No | responseType | str | Type of response to request from the LLM, either 'text' or 'json'. If set to 'json', the LLM responds with a JSON-formatted string whose schema can be specified with `json_response_schema`. Defaults to 'text'. |
No | jsonResponseSchema | dict | A dictionary specifying the keys (schema) that the LLM should adhere to in its response when `response_type` is 'json'. Each parameter maps to a dict with the following fields: type (str, required): data type of the parameter; description (str, required): description of the parameter; is_required (bool, optional): whether the parameter is required. Example: json_response_schema = {'title': {'type': 'string', 'description': 'Article title', 'is_required': True}, 'body': {'type': 'string', 'description': 'Article body'}} |
No | stopSequences | List[str] | Strings on which the LLM stops generation. |
No | topP | float | The nucleus sampling value used for this run. If set, the model samples from the smallest set of tokens whose cumulative probability exceeds `top_p`. Default is 1.0. Allowed range is 0.0 to 1.0. It is generally recommended to use either temperature sampling or nucleus sampling, but not both. |
Note: Arguments for the API methods use camelCase, whereas the Python SDK uses underscore_case (for example, `jsonResponseSchema` becomes `json_response_schema`).
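
The naming rule and the value ranges listed in the table can be illustrated with a minimal sketch. `to_camel_case` and `build_generation_args` are hypothetical helpers written for this example and are not part of any SDK; the checks only mirror the limits stated above (temperature 0.0 to 2.0, top_p 0.0 to 1.0, response type 'text' or 'json').

```python
def to_camel_case(name: str) -> str:
    """Convert an underscore_case name such as 'max_tokens' to camelCase."""
    head, *rest = name.split("_")
    return head + "".join(part.capitalize() for part in rest)


def build_generation_args(**kwargs) -> dict:
    """Validate SDK-style keyword arguments and return camelCase API arguments.

    Hypothetical helper: it does not call any service, it only prepares a dict.
    """
    temperature = kwargs.get("temperature")
    if temperature is not None and not 0.0 <= temperature <= 2.0:
        raise ValueError("temperature must be between 0.0 and 2.0")

    top_p = kwargs.get("top_p")
    if top_p is not None and not 0.0 <= top_p <= 1.0:
        raise ValueError("top_p must be between 0.0 and 1.0")

    response_type = kwargs.get("response_type")
    if response_type is not None and response_type not in ("text", "json"):
        raise ValueError("response_type must be 'text' or 'json'")

    # Drop unset values so the documented defaults apply on the server side.
    return {to_camel_case(k): v for k, v in kwargs.items() if v is not None}


args = build_generation_args(
    prompt="Write a short article about tides.",
    temperature=0.2,
    max_tokens=256,
    response_type="json",
    json_response_schema={
        "title": {"type": "string", "description": "Article title", "is_required": True},
        "body": {"type": "string", "description": "Article body"},
    },
)
print(sorted(args))
```

Keeping the conversion in one place means the same keyword names can be used with the Python SDK directly, while the camelCase dict is what a raw API request would carry.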