View on GitHub

forecasting

Time Series Forecasting Best Practices & Examples

Forecasting library

Building forecasting models can involve tedious tasks ranging from data loading, dataset understanding, model development, model evaluation to deployment of trained models. To assist with these tasks, we developed a forecasting library - fclib. You’ll see this library used widely in sample notebooks in examples. The following provides a short description of the sub-modules. For more details about what functions/classes/utitilies are available and how to use them, please review the doc-strings provided with the code and see the sample notebooks in examples directory.

Submodules

AzureML

The AzureML submodule contains utilities to connect to an Azure Machine Learning workspace, train, tune and operationalize forecasting models at scale using AzureML.

Common

This submodule contains high-level utilities that are commonly used in multiple algorithms as well as helper functions for visualizing forecasting predictions.

Dataset

This submodule includes helper functions for interacting with datasets used in the example notebooks, utility functions to process datasets for different models tasks, as well as utilities for splitting data for training/testing. For example, the ojdata submodule will allow you to download and process Orange Juice data set, as well as split it into training and testing rounds.

from fclib.dataset.ojdata import download_ojdata, split_train_test

download_ojdata(DATA_DIR)
train_df_list, test_df_list, _ = split_train_test(
    DATA_DIR,
    n_splits=N_SPLITS,
    horizon=HORIZON,
    gap=GAP,
    first_week=FIRST_WEEK,
    last_week=LAST_WEEK
)

Evaluation

Evaluation module includes functionalities for computing common forecasting evaluation metrics, more specifically MAPE, sMAPE, and pinball loss.

Feature Engineering

Feature engineering module contains utilities to create various time series features, for example, week or day of month, lagged features, and moving average features. This module is used widely in machine-learning based approaches to forecasting, in which time series data is transformed into a tabular featurized dataset, that becomes input to a machine learning method.

Models

The models module contains implementations of various algorithms that can be used in addition to external packages to evaluate and develop new forecasting solutions. Some submodules found here are: lightgbm, dilated cnn, etc. A more detailed description of which algorithms are used in our examples can be found in this README.