Config for indexing options of a document retriever. Default values of optional arguments are heuristically selected by the Abacus.AI platform based on the underlying data.
KEY | TYPE | Description |
---|---|---|
textEncoder | VectorStoreTextEncoder | Encoder used to index texts from the documents. |
chunkOverlapFraction | float | The fraction of overlap between chunks. |
chunkSizeFactors | list | Chunking data with multiple sizes. The specified list of factors are used to calculate more sizes, in addition to `chunk_size`. |
summaryInstructions | str | Instructions for the LLM to generate the document summary. |
scoreMultiplierColumn | str | If provided, will use the values in this metadata column to modify the relevance score of returned chunks for all queries. |
pruneVectors | bool | Transform vectors using SVD so that the average component of vectors in the corpus are removed. |
useDocumentSummary | bool | If True, uses the summary of the document in addition to chunks of the document for indexing and querying. |
indexMetadataColumns | bool | If True, metadata columns of the FG will also be used for indexing and querying. |
chunkSize | int | The size of text chunks in the vector store. |