Required Feature Group Types

To train a model under this use case, you will need to create feature groups of the following type(s):

Feature Group Type API Configuration Name Required Description
List of documents DOCUMENTS True This dataset corresponds to the list of documents that you want to use as a knowledge base for your LLM.
Custom Table CUSTOM_TABLE False This dataset corresponds to structured data used for querying and analysis in DataLLM. Either a custom table OR a list of documents is required. Both may be used.
Evaluation EVALUATION False The Evaluation dataset is used to evaluate the model's performance. It contains a list of questions and their expected answers.

Note: Once you upload the datasets under each Feature Group Type that comply with their respective required schemas, you will need to create Machine learning (ML) features that would be used to train your ML model(s). We use the term, "Feature Group" for a group of ML features (dataset columns) under a specific Feature Group Type. Our system support extensible schemas that enables you to provide any number of additional columns/features that you think are relevant to that Feature Group Type.


Feature Group: List of documents

This dataset corresponds to the list of documents that you want to use as a knowledge base for your LLM.

Feature Mapping Feature Type Required Description
DOCUMENT Y The document text
DOCUMENT_ID Y The unique document identifier
DOCUMENT_SOURCE N The source URL of the document

Feature Group: Custom Table

This dataset corresponds to structured data used for querying and analysis in DataLLM. Either a custom table OR a list of documents is required. Both may be used.

Feature Mapping Feature Type Required Description
[COLUMN NAME] Y Any column in the table that can be used for querying or analysis. All columns are available for DataLLM queries.

Feature Group: Evaluation

The Evaluation dataset is used to evaluate the model's performance. It contains a list of questions and their expected answers.

Feature Mapping Feature Type Required Description
QUESTION Y Question used to evaluate the model
ANSWER N The question's expected answer