Required Feature Group Types

To train a model under this use case, you will need to create feature groups of the following type(s):

Feature Group Type API Configuration Name Required Description
Forecasting Timeseries Data TIMESERIES True This dataset corresponds to the historical time series data for the items you are trying to forecast.
Item Attributes ITEM_ATTRIBUTES This dataset corresponds to all the attributes or meta-data that you have about the items you are forecasting

Note: Once you upload the datasets under each Feature Group Type that comply with their respective required schemas, you will need to create Machine learning (ML) features that would be used to train your ML model(s). We use the term, "Feature Group" for a group of ML features (dataset columns) under a specific Feature Group Type. Our system support extensible schemas that enables you to provide any number of additional columns/features that you think are relevant to that Feature Group Type.


Feature Group: Forecasting Timeseries Data

This dataset corresponds to the historical time series data for the items you are trying to forecast.

Feature Mapping Feature Type Required Description
ITEM_ID categorical Y The unique identifier of the item whose target value you are forecasting.
TARGET numerical Y The target value you are forecasting. (e.g. energy, electricity usage).
DATE timestamp Y Date (day, year or month) that corresponds to the target value.
FUTURE N Known values ahead of time (e.g., State Holidays, National Holidays etc.) that can be easily included in the training dataset.
[ATTRIBUTE RELEVANT TO FORECASTING] Y Any relevant attribute about the target item whose value needs to be forecasted. For e.g., power consumption, intensity, voltage, etc., for real-time electricty requirement forecasting.

Feature Group: Item Attributes

This dataset corresponds to all the attributes or meta-data that you have about the items you are forecasting

Feature Mapping Feature Type Required Description
ITEM_ID categorical Y The unique identifier of the item whose target value you are forecasting (e.g energy pod id or compute cluster id)
[ITEM ATTRIBUTE] Y Any relevant attribute about the item. E.g., for compute clusters, it may be memory and CPU of the cluster, and for energy load forecasting, it may be intensity, voltage, etc., attributes pertaining to the energy consumer. We suggest providing at least 5-6 attributes per lead and up to a maximum of 1000 attributes.