DocumentRetrieverConfig

Configuration for the indexing options of a document retriever. Default values for optional arguments are selected heuristically by the Abacus.AI platform based on the underlying data.

| Key | Type | Description |
| --- | --- | --- |
| textEncoder | VectorStoreTextEncoder | Encoder used to index text from the documents. |
| chunkOverlapFraction | float | The fraction of overlap between consecutive chunks. |
| chunkSizeFactors | list | Chunks the data at multiple sizes. The specified list of factors is used to calculate additional sizes, in addition to `chunk_size`. |
| summaryInstructions | str | Instructions for the LLM to generate the document summary. |
| scoreMultiplierColumn | str | If provided, the values in this metadata column are used to modify the relevance score of returned chunks for all queries. |
| pruneVectors | bool | Transform vectors using SVD so that the average component of the vectors in the corpus is removed. |
| useDocumentSummary | bool | If True, uses the summary of the document in addition to chunks of the document for indexing and querying. |
| indexMetadataColumns | bool | If True, metadata columns of the feature group (FG) are also used for indexing and querying. |
| chunkSize | int | The size of text chunks in the vector store. |
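
The chunking keys above interact, so a small illustration may help. The following is a minimal sketch, not the Abacus.AI SDK: `DocumentRetrieverConfigSketch`, `effective_chunk_sizes`, and `chunk_spans` are hypothetical names introduced here. It assumes that each entry in `chunkSizeFactors` scales the base `chunkSize`, that `chunkOverlapFraction` is measured relative to the chunk size, and it uses character offsets purely for illustration, since the unit of `chunkSize` is not stated above.

```python
from dataclasses import dataclass, field, asdict
from typing import List, Optional, Tuple


@dataclass
class DocumentRetrieverConfigSketch:
    """Hypothetical local mirror of the keys in the table (not the SDK class)."""
    textEncoder: Optional[str] = None
    chunkSize: Optional[int] = None
    chunkOverlapFraction: Optional[float] = None
    chunkSizeFactors: List[float] = field(default_factory=list)
    summaryInstructions: Optional[str] = None
    scoreMultiplierColumn: Optional[str] = None
    pruneVectors: Optional[bool] = None
    useDocumentSummary: Optional[bool] = None
    indexMetadataColumns: Optional[bool] = None

    def to_payload(self) -> dict:
        # Omit unset keys so the platform can pick its own defaults.
        return {k: v for k, v in asdict(self).items()
                if v is not None and v != []}


def effective_chunk_sizes(chunk_size: int, factors: List[float]) -> List[int]:
    # Assumption: each factor multiplies the base chunk size.
    sizes = {chunk_size} | {max(1, round(chunk_size * f)) for f in factors}
    return sorted(sizes)


def chunk_spans(text_len: int, chunk_size: int,
                overlap_fraction: float) -> List[Tuple[int, int]]:
    # Assumption: consecutive chunks share overlap_fraction * chunk_size units.
    if not 0 <= overlap_fraction < 1:
        raise ValueError("overlap_fraction must be in [0, 1)")
    step = max(1, int(chunk_size * (1 - overlap_fraction)))
    spans, start = [], 0
    while start < text_len:
        end = min(start + chunk_size, text_len)
        spans.append((start, end))
        if end == text_len:
            break
        start += step
    return spans


config = DocumentRetrieverConfigSketch(
    chunkSize=400,
    chunkOverlapFraction=0.25,
    chunkSizeFactors=[0.5, 2.0],
    indexMetadataColumns=True,
)
payload = config.to_payload()
sizes = effective_chunk_sizes(config.chunkSize, config.chunkSizeFactors)
spans = chunk_spans(1000, config.chunkSize, config.chunkOverlapFraction)
```

Under these assumptions, a 1000-unit document with `chunkSize=400` and `chunkOverlapFraction=0.25` is split into windows that advance by 300 units each, and the two factors add chunk sizes of 200 and 800 alongside the base size. Keys left unset are dropped from the payload, which matches the note above that the platform chooses defaults for optional arguments.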