The configuration, including chunk_size and chunk_overlap_fraction, for document retrieval.
KEY
TYPE
Description
chunkOverlapFraction
float
The fraction of overlap between chunks.
scoreMultiplierColumn
str
If provided, will use the values in this metadata column to modify the relevance score of returned chunks for all queries.
textEncoder
VectorStoreTextEncoder
Encoder used to index texts from the documents.
indexMetadataColumns
bool
If True, metadata columns of the FG will also be used for indexing and querying.
pruneVectors
bool
Transform vectors using SVD so that the average component of vectors in the corpus are removed.
useDocumentSummary
bool
If True, uses the summary of the document in addition to chunks of the document for indexing and querying.
chunkSizeFactors
list
Chunking data with multiple sizes. The specified list of factors are used to calculate more sizes, in addition to `chunk_size`.
summaryInstructions
str
Instructions for the LLM to generate the document summary.
chunkSize
int
The size of text chunks in the vector store.
standaloneDeployment
bool
If True, the document retriever will be deployed as a standalone deployment.
Note: The arguments for the API methods follow camelCase but for Python SDK underscore_case is followed.
Response:
KEY
TYPE
DESCRIPTION
success
Boolean
true if the call succeeded, false if there was an error
result
DocumentRetrieverVersion
KEY
TYPE
Description
documentRetrieverId
str
The unique identifier of the Document Retriever.
documentRetrieverVersion
str
The unique identifier of the Document Retriever version.
createdAt
str
When the Document Retriever was created.
status
str
The status of Document Retriever version. It represents indexing status until indexing isn't complete, and deployment status after indexing is complete.
deploymentStatus
str
The status of deploying the Document Retriever version.
featureGroupId
str
The feature group id associated with the document retriever.
featureGroupVersion
str
The unique identifier of the feature group version at which the Document Retriever version is created.
error
str
The error message when it failed to create the document retriever version.
numberOfChunks
int
The number of chunks for the document retriever.
embeddingFileSize
int
The size of embedding file for the document retriever.
warnings
list
The warning messages when creating the document retriever.
resolvedConfig
VectorStoreConfig
The resolved configurations, such as default settings, for indexing documents.
KEY
TYPE
Description
chunkOverlapFraction
float
The fraction of overlap between chunks.
scoreMultiplierColumn
str
If provided, will use the values in this metadata column to modify the relevance score of returned chunks for all queries.
textEncoder
VectorStoreTextEncoder
Encoder used to index texts from the documents.
indexMetadataColumns
bool
If True, metadata columns of the FG will also be used for indexing and querying.
pruneVectors
bool
Transform vectors using SVD so that the average component of vectors in the corpus are removed.
useDocumentSummary
bool
If True, uses the summary of the document in addition to chunks of the document for indexing and querying.
chunkSizeFactors
list
Chunking data with multiple sizes. The specified list of factors are used to calculate more sizes, in addition to `chunk_size`.
summaryInstructions
str
Instructions for the LLM to generate the document summary.
chunkSize
int
The size of text chunks in the vector store.
standaloneDeployment
bool
If True, the document retriever will be deployed as a standalone deployment.
documentRetrieverConfig
VectorStoreConfig
The config used to create the document retriever version.
KEY
TYPE
Description
chunkOverlapFraction
float
The fraction of overlap between chunks.
scoreMultiplierColumn
str
If provided, will use the values in this metadata column to modify the relevance score of returned chunks for all queries.
textEncoder
VectorStoreTextEncoder
Encoder used to index texts from the documents.
indexMetadataColumns
bool
If True, metadata columns of the FG will also be used for indexing and querying.
pruneVectors
bool
Transform vectors using SVD so that the average component of vectors in the corpus are removed.
useDocumentSummary
bool
If True, uses the summary of the document in addition to chunks of the document for indexing and querying.
chunkSizeFactors
list
Chunking data with multiple sizes. The specified list of factors are used to calculate more sizes, in addition to `chunk_size`.
summaryInstructions
str
Instructions for the LLM to generate the document summary.
chunkSize
int
The size of text chunks in the vector store.
standaloneDeployment
bool
If True, the document retriever will be deployed as a standalone deployment.