Skip to main content

Making Predictions with Machine Learning Models

Overview​

Once you've deployed a model in Abacus.AI, you're ready to start making predictions.

What You'll Learn​

In this guide, you will learn:

  1. What Batch Predictions are
  2. How to create a Batch Prediction and build a production pipeline

Batch Predictions​

There are two ways to use a deployed model:

  1. Call the API directly
  2. Use Batch Predictions

A Batch Prediction uses a deployed model (offline or online) to generate predictions on a batch of input data. This approach is ideal for use cases where real-time predictions aren't necessary—such as when you need predictions daily, weekly, or hourly.

Key principle: Your input data must match the structure of your training data.

Example Scenario​

Consider a model trained to predict house prices using this training data:

bedroomsbathroomssq_ftyear_builtlocationprice
3218502005Suburban425000
43.526502018Urban785000
219501978Rural215000
5434002021Suburban920000
32.521002012Urban615000

Your prediction input data should contain the same features (excluding the target variable):

bedroomsbathroomssq_ftyear_builtlocation
3316002005Suburban
26.526502018Urban
229001978Rural
3330002021Suburban
31.520002012Urban

Creating a Batch Prediction​

Batch Predictions are how you generate predictions from all Machine Learning Models. The process applies to all project types:

  1. Upload your data in the same format as your model training data
  2. Navigate to Batch Predictions → Create New Batch Prediction from the left panel
NER doc upload
  1. Follow the wizard:
    • Select the deployment (the model that will generate predictions)
    • Select the input Feature Group for the batch prediction
    • Select the output Feature Group where predictions will be stored
NER doc upload

Two prediction modes:

  • Predict on Test: For models trained with ground truth data, this generates out-of-sample predictions using the training dataset. No input Feature Group is needed—the training Feature Group is used automatically.

  • Predict on Future: Requires a new input Feature Group for making predictions on unseen data.

Important considerations:

  • Changing your deployment version and re-running the batch prediction will generate new predictions in a new Feature Group version
  • You can re-run batch predictions indefinitely to get fresh predictions
  • If you refresh the underlying data of the input feature group, you can get another set of predictions based on the updated data.

So in essence, a Batch Prediction can be re-used multiple times.

Creating a Production Pipeline with a Batch Prediction​

Batch Predictions automatically use the latest Feature Group version, enabling you to refresh input data on a schedule and automatically generate updated predictions.

Example Production Workflow:​

  1. Set up a Connector to your database
  2. Load data on a scheduled cron job (e.g., weekly)
  3. Batch Prediction re-runs with new data based on a schedule cron job
  4. Export predictions back to your database on a schedule cron job. See this guide for details

Instead of managing individual cron jobs for each component (Dataset, Batch Prediction, Feature Group export), you can use a pipeline to orchestrate the entire workflow.

Abacus.AI offers many built-in pipeline templates to get you started quickly.