Overview Use Cases Customer Churn Prediction Time Series Anomaly Detection Event Anomaly Detection Cloud Spend Alerts Personalized Promotions Predictive Modeling Real-Time Forecasting Financial Metrics Forecasting Demand Forecasting Cumulative Forecasting Text extraction and classification NLP Powered Search Sentiment Analysis Finetuned LLM ChatLLM Feature Group Requirements Training Models Evaluating Predictions Predictions Language Detection Image Classification & Detection Object Detection Clustering Timeseries Clustering Sales and Revenue Forecasting Predictive Lead Scoring Personalized Search Personalized Recommendations Related Items Model Drift and Monitoring Tensorflow with Vector Matching Custom Python Model Data Ingestion Streaming Feature Store Vector Store AI Workflows Named Entity Recognition Guidelines Optimization Connectors Authentication Getting Started with the Python SDK API Documentation Chat Bot API Search How to

Required Feature Group Types

To train a model under this use case, you will need to create feature groups of the following type(s):

Feature Group Type	API Configuration Name	Required	Description
List of documents	DOCUMENTS	True	This dataset corresponds to the list of documents that you want to use as a knowledge base for your LLM.
Custom Table	CUSTOM_TABLE	False	This dataset corresponds to structured data used for querying and analysis in DataLLM. Either a custom table OR a list of documents is required. Both may be used.
Evaluation	EVALUATION	False	The Evaluation dataset is used to evaluate the model's performance. It contains a list of questions and their expected answers.

Note: Once you upload the datasets under each Feature Group Type that comply with their respective required schemas, you will need to create Machine learning (ML) features that would be used to train your ML model(s). We use the term, "Feature Group" for a group of ML features (dataset columns) under a specific Feature Group Type. Our system support extensible schemas that enables you to provide any number of additional columns/features that you think are relevant to that Feature Group Type.

Feature Group: List of documents

This dataset corresponds to the list of documents that you want to use as a knowledge base for your LLM.

Feature Mapping	Required	Description
DOCUMENT	Y	The document text
DOCUMENT_ID	Y	The unique document identifier
DOCUMENT_SOURCE	N	The source URL of the document

Feature Group: Custom Table

This dataset corresponds to structured data used for querying and analysis in DataLLM. Either a custom table OR a list of documents is required. Both may be used.

Feature Mapping	Feature Type	Required	Description
[COLUMN NAME]		Y	Any column in the table that can be used for querying or analysis. All columns are available for DataLLM queries.

Feature Group: Evaluation

The Evaluation dataset is used to evaluate the model's performance. It contains a list of questions and their expected answers.

Feature Mapping	Feature Type	Required	Description
QUESTION		Y	Question used to evaluate the model
ANSWER		N	The question's expected answer