DocumentRetrieverConfig

Configuration for the indexing options of a document retriever. Default values for optional arguments are selected heuristically by the Abacus.AI platform based on the underlying data.

KEY TYPE DESCRIPTION
scoreMultiplierColumn str If provided, will use the values in this metadata column to modify the relevance score of returned chunks for all queries.
chunkSize int The size of text chunks in the vector store.
summaryInstructions str Instructions for the LLM to generate the document summary.
useDocumentSummary bool If True, uses the summary of the document in addition to chunks of the document for indexing and querying.
standaloneDeployment bool If True, the document retriever will be deployed as a standalone deployment.
chunkSizeFactors list Chunks the data at multiple sizes. The listed factors are used to calculate further chunk sizes in addition to `chunkSize`.
chunkOverlapFraction float The fraction of overlap between chunks.
indexMetadataColumns bool If True, metadata columns of the feature group (FG) are also used for indexing and querying.
textEncoder VectorStoreTextEncoder Encoder used to index texts from the documents.
pruneVectors bool If True, transforms vectors using SVD so that the average component of the vectors in the corpus is removed.
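
The keys above can be assembled into a config payload roughly as follows. This is a minimal sketch, not the Abacus.AI SDK: the class `RetrieverConfigSketch` and the helpers `derived_chunk_sizes` and `overlap_length` are hypothetical names introduced for illustration. The sketch also assumes that each entry of `chunkSizeFactors` multiplies `chunkSize`, and that the overlap is `chunkSize` times `chunkOverlapFraction`; the table does not spell out either formula.

```python
from dataclasses import dataclass, asdict
from typing import List, Optional


@dataclass
class RetrieverConfigSketch:
    """Hypothetical container mirroring the keys in the table (not the SDK class)."""
    chunkSize: Optional[int] = None
    chunkOverlapFraction: Optional[float] = None
    chunkSizeFactors: Optional[List[float]] = None
    useDocumentSummary: Optional[bool] = None
    summaryInstructions: Optional[str] = None
    indexMetadataColumns: Optional[bool] = None
    scoreMultiplierColumn: Optional[str] = None
    standaloneDeployment: Optional[bool] = None
    pruneVectors: Optional[bool] = None
    # VectorStoreTextEncoder value passed as a string; valid values are not listed here.
    textEncoder: Optional[str] = None

    def to_payload(self) -> dict:
        # Drop unset keys so the platform can choose its own defaults for them.
        return {key: value for key, value in asdict(self).items() if value is not None}


def derived_chunk_sizes(chunk_size: int, factors: List[float]) -> List[int]:
    # Assumption: every factor scales the base chunk size multiplicatively.
    return [chunk_size] + [round(chunk_size * factor) for factor in factors]


def overlap_length(chunk_size: int, overlap_fraction: float) -> int:
    # Assumption: overlap is expressed in the same unit as the chunk size.
    return int(chunk_size * overlap_fraction)


config = RetrieverConfigSketch(
    chunkSize=512,
    chunkOverlapFraction=0.25,
    chunkSizeFactors=[0.5, 2.0],
    useDocumentSummary=True,
)
payload = config.to_payload()
print(payload)
print(derived_chunk_sizes(512, [0.5, 2.0]))  # [512, 256, 1024]
print(overlap_length(512, 0.25))             # 128
```

Leaving a key unset keeps it out of the payload, which matches the note that defaults for optional arguments are chosen by the platform.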