Creates an Amazon Forecast predictor.
In the request, you provide a dataset group and either specify an algorithm or let Amazon Forecast choose the algorithm for you using AutoML. If you specify an algorithm, you also can override algorithm-specific hyperparameters.
Amazon Forecast uses the chosen algorithm to train a model using the latest version of the datasets in the specified dataset group. The result is called a predictor. You then generate a forecast using the CreateForecast operation.
After training a model, the CreatePredictor
operation also evaluates it. To see the evaluation metrics, use the GetAccuracyMetrics operation. Always review the evaluation metrics before deciding to use the predictor to generate a forecast.
Optionally, you can specify a featurization configuration to fill and aggregate the data fields in the TARGET_TIME_SERIES
dataset to improve model training. For more information, see FeaturizationConfig .
For RELATED_TIME_SERIES datasets, CreatePredictor
verifies that the DataFrequency
specified when the dataset was created matches the ForecastFrequency
. TARGET_TIME_SERIES datasets don’t have this restriction. Amazon Forecast also verifies the delimiter and timestamp format. For more information, see howitworks-datasets-groups .
AutoML
If you want Amazon Forecast to evaluate each algorithm and choose the one that minimizes the objective function
, set PerformAutoML
to true
. The objective function
is defined as the mean of the weighted p10, p50, and p90 quantile losses. For more information, see EvaluationResult .
When AutoML is enabled, the following properties are disallowed:
AlgorithmArn
HPOConfig
PerformHPO
TrainingParameters
To get a list of all of your predictors, use the ListPredictors operation.
Note
Before you can use the predictor to create a forecast, the Status
of the predictor must be ACTIVE
, signifying that training has completed. To get the status, use the DescribePredictor operation.
See also: AWS API Documentation
See ‘aws help’ for descriptions of global parameters.
create-predictor
--predictor-name <value>
[--algorithm-arn <value>]
--forecast-horizon <value>
[--perform-auto-ml | --no-perform-auto-ml]
[--perform-hpo | --no-perform-hpo]
[--training-parameters <value>]
[--evaluation-parameters <value>]
[--hpo-config <value>]
--input-data-config <value>
--featurization-config <value>
[--encryption-config <value>]
[--tags <value>]
[--cli-input-json | --cli-input-yaml]
[--generate-cli-skeleton <value>]
[--cli-auto-prompt <value>]
--predictor-name
(string)
A name for the predictor.
--algorithm-arn
(string)
The Amazon Resource Name (ARN) of the algorithm to use for model training. Required if
PerformAutoML
is not set totrue
.Supported algorithms:
arn:aws:forecast:::algorithm/ARIMA
arn:aws:forecast:::algorithm/Deep_AR_Plus
Supports hyperparameter optimization (HPO)
arn:aws:forecast:::algorithm/ETS
arn:aws:forecast:::algorithm/NPTS
arn:aws:forecast:::algorithm/Prophet
--forecast-horizon
(integer)
Specifies the number of time-steps that the model is trained to predict. The forecast horizon is also called the prediction length.
For example, if you configure a dataset for daily data collection (using the
DataFrequency
parameter of the CreateDataset operation) and set the forecast horizon to 10, the model returns predictions for 10 days.The maximum forecast horizon is the lesser of 500 time-steps or 1/3 of the TARGET_TIME_SERIES dataset length.
--perform-auto-ml
| --no-perform-auto-ml
(boolean)
Whether to perform AutoML. When Amazon Forecast performs AutoML, it evaluates the algorithms it provides and chooses the best algorithm and configuration for your training dataset.
The default value is
false
. In this case, you are required to specify an algorithm.Set
PerformAutoML
totrue
to have Amazon Forecast perform AutoML. This is a good option if you aren’t sure which algorithm is suitable for your training data. In this case,PerformHPO
must be false.
--perform-hpo
| --no-perform-hpo
(boolean)
Whether to perform hyperparameter optimization (HPO). HPO finds optimal hyperparameter values for your training data. The process of performing HPO is known as running a hyperparameter tuning job.
The default value is
false
. In this case, Amazon Forecast uses default hyperparameter values from the chosen algorithm.To override the default values, set
PerformHPO
totrue
and, optionally, supply the HyperParameterTuningJobConfig object. The tuning job specifies a metric to optimize, which hyperparameters participate in tuning, and the valid range for each tunable hyperparameter. In this case, you are required to specify an algorithm andPerformAutoML
must be false.The following algorithm supports HPO:
DeepAR+
--training-parameters
(map)
The hyperparameters to override for model training. The hyperparameters that you can override are listed in the individual algorithms. For the list of supported algorithms, see aws-forecast-choosing-recipes .
key -> (string)
value -> (string)
Shorthand Syntax:
KeyName1=string,KeyName2=string
JSON Syntax:
{"string": "string"
...}
--evaluation-parameters
(structure)
Used to override the default evaluation parameters of the specified algorithm. Amazon Forecast evaluates a predictor by splitting a dataset into training data and testing data. The evaluation parameters define how to perform the split and the number of iterations.
NumberOfBacktestWindows -> (integer)
The number of times to split the input data. The default is 1. Valid values are 1 through 5.
BackTestWindowOffset -> (integer)
The point from the end of the dataset where you want to split the data for model training and testing (evaluation). Specify the value as the number of data points. The default is the value of the forecast horizon.
BackTestWindowOffset
can be used to mimic a past virtual forecast start date. This value must be greater than or equal to the forecast horizon and less than half of the TARGET_TIME_SERIES dataset length.
ForecastHorizon
<=BackTestWindowOffset
< 1/2 * TARGET_TIME_SERIES dataset length
Shorthand Syntax:
NumberOfBacktestWindows=integer,BackTestWindowOffset=integer
JSON Syntax:
{
"NumberOfBacktestWindows": integer,
"BackTestWindowOffset": integer
}
--hpo-config
(structure)
Provides hyperparameter override values for the algorithm. If you don’t provide this parameter, Amazon Forecast uses default values. The individual algorithms specify which hyperparameters support hyperparameter optimization (HPO). For more information, see aws-forecast-choosing-recipes .
If you included the
HPOConfig
object, you must setPerformHPO
to true.ParameterRanges -> (structure)
Specifies the ranges of valid values for the hyperparameters.
CategoricalParameterRanges -> (list)
Specifies the tunable range for each categorical hyperparameter.
(structure)
Specifies a categorical hyperparameter and it’s range of tunable values. This object is part of the ParameterRanges object.
Name -> (string)
The name of the categorical hyperparameter to tune.
Values -> (list)
A list of the tunable categories for the hyperparameter.
(string)
ContinuousParameterRanges -> (list)
Specifies the tunable range for each continuous hyperparameter.
(structure)
Specifies a continuous hyperparameter and it’s range of tunable values. This object is part of the ParameterRanges object.
Name -> (string)
The name of the hyperparameter to tune.
MaxValue -> (double)
The maximum tunable value of the hyperparameter.
MinValue -> (double)
The minimum tunable value of the hyperparameter.
ScalingType -> (string)
The scale that hyperparameter tuning uses to search the hyperparameter range. Valid values:
Auto
Amazon Forecast hyperparameter tuning chooses the best scale for the hyperparameter.
Linear
Hyperparameter tuning searches the values in the hyperparameter range by using a linear scale.
Logarithmic
Hyperparameter tuning searches the values in the hyperparameter range by using a logarithmic scale.
Logarithmic scaling works only for ranges that have values greater than 0.
ReverseLogarithmic
hyperparameter tuning searches the values in the hyperparameter range by using a reverse logarithmic scale.
Reverse logarithmic scaling works only for ranges that are entirely within the range 0 <= x < 1.0.
For information about choosing a hyperparameter scale, see Hyperparameter Scaling . One of the following values:
IntegerParameterRanges -> (list)
Specifies the tunable range for each integer hyperparameter.
(structure)
Specifies an integer hyperparameter and it’s range of tunable values. This object is part of the ParameterRanges object.
Name -> (string)
The name of the hyperparameter to tune.
MaxValue -> (integer)
The maximum tunable value of the hyperparameter.
MinValue -> (integer)
The minimum tunable value of the hyperparameter.
ScalingType -> (string)
The scale that hyperparameter tuning uses to search the hyperparameter range. Valid values:
Auto
Amazon Forecast hyperparameter tuning chooses the best scale for the hyperparameter.
Linear
Hyperparameter tuning searches the values in the hyperparameter range by using a linear scale.
Logarithmic
Hyperparameter tuning searches the values in the hyperparameter range by using a logarithmic scale.
Logarithmic scaling works only for ranges that have values greater than 0.
ReverseLogarithmic
Not supported for
IntegerParameterRange
.Reverse logarithmic scaling works only for ranges that are entirely within the range 0 <= x < 1.0.
For information about choosing a hyperparameter scale, see Hyperparameter Scaling . One of the following values:
JSON Syntax:
{
"ParameterRanges": {
"CategoricalParameterRanges": [
{
"Name": "string",
"Values": ["string", ...]
}
...
],
"ContinuousParameterRanges": [
{
"Name": "string",
"MaxValue": double,
"MinValue": double,
"ScalingType": "Auto"|"Linear"|"Logarithmic"|"ReverseLogarithmic"
}
...
],
"IntegerParameterRanges": [
{
"Name": "string",
"MaxValue": integer,
"MinValue": integer,
"ScalingType": "Auto"|"Linear"|"Logarithmic"|"ReverseLogarithmic"
}
...
]
}
}
--input-data-config
(structure)
Describes the dataset group that contains the data to use to train the predictor.
DatasetGroupArn -> (string)
The Amazon Resource Name (ARN) of the dataset group.
SupplementaryFeatures -> (list)
An array of supplementary features. The only supported feature is a holiday calendar.
(structure)
Describes a supplementary feature of a dataset group. This object is part of the InputDataConfig object.
The only supported feature is a holiday calendar. If you use the calendar, all data in the datasets should belong to the same country as the calendar. For the holiday calendar data, see the Jollyday web site.
India and Korea’s holidays are not included in the Jollyday library, but both are supported by Amazon Forecast. Their holidays are:
“IN” - INDIA
JANUARY 26 - REPUBLIC DAY
AUGUST 15 - INDEPENDENCE DAY
OCTOBER 2 GANDHI'S BIRTHDAY
“KR” - KOREA
JANUARY 1 - NEW YEAR
MARCH 1 - INDEPENDENCE MOVEMENT DAY
MAY 5 - CHILDREN'S DAY
JUNE 6 - MEMORIAL DAY
AUGUST 15 - LIBERATION DAY
OCTOBER 3 - NATIONAL FOUNDATION DAY
OCTOBER 9 - HANGEUL DAY
DECEMBER 25 - CHRISTMAS DAY
Name -> (string)
The name of the feature. This must be “holiday”.
Value -> (string)
One of the following 2 letter country codes:
“AR” - ARGENTINA
“AT” - AUSTRIA
“AU” - AUSTRALIA
“BE” - BELGIUM
“BR” - BRAZIL
“CA” - CANADA
“CN” - CHINA
“CZ” - CZECH REPUBLIC
“DK” - DENMARK
“EC” - ECUADOR
“FI” - FINLAND
“FR” - FRANCE
“DE” - GERMANY
“HU” - HUNGARY
“IE” - IRELAND
“IN” - INDIA
“IT” - ITALY
“JP” - JAPAN
“KR” - KOREA
“LU” - LUXEMBOURG
“MX” - MEXICO
“NL” - NETHERLANDS
“NO” - NORWAY
“PL” - POLAND
“PT” - PORTUGAL
“RU” - RUSSIA
“ZA” - SOUTH AFRICA
“ES” - SPAIN
“SE” - SWEDEN
“CH” - SWITZERLAND
“US” - UNITED STATES
“UK” - UNITED KINGDOM
Shorthand Syntax:
DatasetGroupArn=string,SupplementaryFeatures=[{Name=string,Value=string},{Name=string,Value=string}]
JSON Syntax:
{
"DatasetGroupArn": "string",
"SupplementaryFeatures": [
{
"Name": "string",
"Value": "string"
}
...
]
}
--featurization-config
(structure)
The featurization configuration.
ForecastFrequency -> (string)
The frequency of predictions in a forecast.
Valid intervals are Y (Year), M (Month), W (Week), D (Day), H (Hour), 30min (30 minutes), 15min (15 minutes), 10min (10 minutes), 5min (5 minutes), and 1min (1 minute). For example, “Y” indicates every year and “5min” indicates every five minutes.
The frequency must be greater than or equal to the TARGET_TIME_SERIES dataset frequency.
When a RELATED_TIME_SERIES dataset is provided, the frequency must be equal to the RELATED_TIME_SERIES dataset frequency.
ForecastDimensions -> (list)
An array of dimension (field) names that specify how to group the generated forecast.
For example, suppose that you are generating a forecast for item sales across all of your stores, and your dataset contains a
store_id
field. If you want the sales forecast for each item by store, you would specifystore_id
as the dimension.All forecast dimensions specified in the
TARGET_TIME_SERIES
dataset don’t need to be specified in theCreatePredictor
request. All forecast dimensions specified in theRELATED_TIME_SERIES
dataset must be specified in theCreatePredictor
request.(string)
Featurizations -> (list)
An array of featurization (transformation) information for the fields of a dataset.
(structure)
Provides featurization (transformation) information for a dataset field. This object is part of the FeaturizationConfig object.
For example:
{
"AttributeName": "demand",
FeaturizationPipeline [ {
"FeaturizationMethodName": "filling",
"FeaturizationMethodParameters": {"aggregation": "avg", "backfill": "nan"}
} ]
}
AttributeName -> (string)
The name of the schema attribute that specifies the data field to be featurized. Amazon Forecast supports the target field of the
TARGET_TIME_SERIES
and theRELATED_TIME_SERIES
datasets. For example, for theRETAIL
domain, the target isdemand
, and for theCUSTOM
domain, the target istarget_value
. For more information, see howitworks-missing-values .FeaturizationPipeline -> (list)
An array of one
FeaturizationMethod
object that specifies the feature transformation method.(structure)
Provides information about the method that featurizes (transforms) a dataset field. The method is part of the
FeaturizationPipeline
of the Featurization object.The following is an example of how you specify a
FeaturizationMethod
object.
{
"FeaturizationMethodName": "filling",
"FeaturizationMethodParameters": {"aggregation": "sum", "middlefill": "zero", "backfill": "zero"}
}
FeaturizationMethodName -> (string)
The name of the method. The “filling” method is the only supported method.
FeaturizationMethodParameters -> (map)
The method parameters (key-value pairs), which are a map of override parameters. Specify these parameters to override the default values. Related Time Series attributes do not accept aggregation parameters.
The following list shows the parameters and their valid values for the “filling” featurization method for a Target Time Series dataset. Bold signifies the default value.
aggregation
: sum ,avg
,first
,min
,max
frontfill
: none
middlefill
: zero ,nan
(not a number),value
,median
,mean
,min
,max
backfill
: zero ,nan
,value
,median
,mean
,min
,max
The following list shows the parameters and their valid values for a Related Time Series featurization method (there are no defaults):
middlefill
:zero
,value
,median
,mean
,min
,max
backfill
:zero
,value
,median
,mean
,min
,max
futurefill
:zero
,value
,median
,mean
,min
,max
key -> (string)
value -> (string)
JSON Syntax:
{
"ForecastFrequency": "string",
"ForecastDimensions": ["string", ...],
"Featurizations": [
{
"AttributeName": "string",
"FeaturizationPipeline": [
{
"FeaturizationMethodName": "filling",
"FeaturizationMethodParameters": {"string": "string"
...}
}
...
]
}
...
]
}
--encryption-config
(structure)
An AWS Key Management Service (KMS) key and the AWS Identity and Access Management (IAM) role that Amazon Forecast can assume to access the key.
RoleArn -> (string)
The ARN of the IAM role that Amazon Forecast can assume to access the AWS KMS key.
Passing a role across AWS accounts is not allowed. If you pass a role that isn’t in your account, you get an
InvalidInputException
error.KMSKeyArn -> (string)
The Amazon Resource Name (ARN) of the KMS key.
Shorthand Syntax:
RoleArn=string,KMSKeyArn=string
JSON Syntax:
{
"RoleArn": "string",
"KMSKeyArn": "string"
}
--tags
(list)
The optional metadata that you apply to the predictor to help you categorize and organize them. Each tag consists of a key and an optional value, both of which you define.
The following basic restrictions apply to tags:
Maximum number of tags per resource - 50.
For each resource, each tag key must be unique, and each tag key can have only one value.
Maximum key length - 128 Unicode characters in UTF-8.
Maximum value length - 256 Unicode characters in UTF-8.
If your tagging schema is used across multiple services and resources, remember that other services may have restrictions on allowed characters. Generally allowed characters are: letters, numbers, and spaces representable in UTF-8, and the following characters: + - = . _ : / @.
Tag keys and values are case sensitive.
Do not use
aws:
,AWS:
, or any upper or lowercase combination of such as a prefix for keys as it is reserved for AWS use. You cannot edit or delete tag keys with this prefix. Values can have this prefix. If a tag value hasaws
as its prefix but the key does not, then Forecast considers it to be a user tag and will count against the limit of 50 tags. Tags with only the key prefix ofaws
do not count against your tags per resource limit.(structure)
The optional metadata that you apply to a resource to help you categorize and organize them. Each tag consists of a key and an optional value, both of which you define.
The following basic restrictions apply to tags:
Maximum number of tags per resource - 50.
For each resource, each tag key must be unique, and each tag key can have only one value.
Maximum key length - 128 Unicode characters in UTF-8.
Maximum value length - 256 Unicode characters in UTF-8.
If your tagging schema is used across multiple services and resources, remember that other services may have restrictions on allowed characters. Generally allowed characters are: letters, numbers, and spaces representable in UTF-8, and the following characters: + - = . _ : / @.
Tag keys and values are case sensitive.
Do not use
aws:
,AWS:
, or any upper or lowercase combination of such as a prefix for keys as it is reserved for AWS use. You cannot edit or delete tag keys with this prefix. Values can have this prefix. If a tag value hasaws
as its prefix but the key does not, then Forecast considers it to be a user tag and will count against the limit of 50 tags. Tags with only the key prefix ofaws
do not count against your tags per resource limit.Key -> (string)
One part of a key-value pair that makes up a tag. A
key
is a general label that acts like a category for more specific tag values.Value -> (string)
The optional part of a key-value pair that makes up a tag. A
value
acts as a descriptor within a tag category (key).
Shorthand Syntax:
Key=string,Value=string ...
JSON Syntax:
[
{
"Key": "string",
"Value": "string"
}
...
]
--cli-input-json
| --cli-input-yaml
(string)
Reads arguments from the JSON string provided. The JSON string follows the format provided by --generate-cli-skeleton
. If other arguments are provided on the command line, those values will override the JSON-provided values. It is not possible to pass arbitrary binary values using a JSON-provided value as the string will be taken literally. This may not be specified along with --cli-input-yaml
.
--generate-cli-skeleton
(string)
Prints a JSON skeleton to standard output without sending an API request. If provided with no value or the value input
, prints a sample input JSON that can be used as an argument for --cli-input-json
. Similarly, if provided yaml-input
it will print a sample input YAML that can be used with --cli-input-yaml
. If provided with the value output
, it validates the command inputs and returns a sample output JSON for that command.
--cli-auto-prompt
(boolean)
Automatically prompt for CLI input parameters.
See ‘aws help’ for descriptions of global parameters.