Arguments:

REQUIRED	KEY	TYPE	DESCRIPTION
No	prompt	str	Prompt to use for generation.
No	systemMessage	str	System prompt for models that support it.
No	llmName	LLMName	Name of the underlying LLM to be used for generation. Default is auto selection.
No	maxTokens	int	Maximum number of tokens to generate. If set, the model will just stop generating after this token limit is reached.
No	temperature	float	Temperature to use for generation. Higher temperature makes more non-deterministic responses, a value of zero makes mostly deterministic reponses. Default is 0.0. A range of 0.0 - 2.0 is allowed.
No	messages	list	A list of messages to use as conversation history. A message is a dict with attributes: is_user (bool): Whether the message is from the user. text (str): The message's text. attachments (list): The files attached to the message represented as a list of dictionaries [{"doc_id": }, {"doc_id": }]
No	responseType	str	Specifies the type of response to request from the LLM. One of 'text' and 'json'. If set to 'json', the LLM will respond with a json formatted string whose schema can be specified `json_response_schema`. Defaults to 'text'
No	jsonResponseSchema	dict	A dictionary specifying the keys/schema/parameters which LLM should adhere to in its response when `response_type` is 'json'. Each parameter is mapped to a dict with the following info - type (str) (required): Data type of the parameter. description (str) (required): Description of the parameter. is_required (bool) (optional): Whether the parameter is required or not. Example: json_response_schema = {'title': {'type': 'string', 'description': 'Article title', 'is_required': true}, 'body': {'type': 'string', 'description': 'Article body'}}
No	stopSequences	List[str]	Specifies the strings on which the LLM will stop generation.
No	topP	float	The nucleus sampling value used for this run. If set, the model will sample from the smallest set of tokens whose cumulative probability exceeds the probability `top_p`. Default is 1.0. A range of 0.0 - 1.0 is allowed. It is generally recommended to use either temperature sampling or nucleus sampling, but not both.

Note: The arguments for the API methods follow camelCase but for Python SDK underscore_case is followed.

Response:

KEY

TYPE

DESCRIPTION

success

Boolean

true if the call succeeded, false if there was an error

result

LlmResponse

KEY

TYPE

Description

content

str

Full response from LLM.

tokens

int

The number of tokens in the response.

stopReason

str

The reason due to which the response generation stopped.

llmName

str

The name of the LLM model used to generate the response.

inputTokens

int

The number of input tokens used in the LLM call.

outputTokens

int

The number of output tokens generated in the LLM response.

totalTokens

int

The total number of tokens (input + output) used in the LLM interaction.

codeBlocks

LlmCodeBlock

A list of parsed code blocks from raw LLM Response

KEY	TYPE	Description
language	str	The language of the code block. Eg - python/sql/etc.
code	str	source code string
start	int	index of the starting character of the code block in the original response
end	int	index of the last character of the code block in the original response
valid	bool	flag denoting whether the soruce code string is syntactically valid

Exceptions:

TYPE	WHEN
InvalidEnumParameterError	An invalid value is passed for `llmName`.

Language: