[ aws . forecast ]

describe-dataset-import-job

Description

Describes a dataset import job created using the CreateDatasetImportJob operation.

In addition to listing the parameters provided in the CreateDatasetImportJob request, this operation includes the following properties:

  • CreationTime

  • LastModificationTime

  • DataSize

  • FieldStatistics

  • Status

  • Message - If an error occurred, information about the error.

See also: AWS API Documentation

See ‘aws help’ for descriptions of global parameters.

Synopsis

  describe-dataset-import-job
--dataset-import-job-arn <value>
[--cli-input-json | --cli-input-yaml]
[--generate-cli-skeleton <value>]

Options

--dataset-import-job-arn (string)

The Amazon Resource Name (ARN) of the dataset import job.

--cli-input-json | --cli-input-yaml (string) Reads arguments from the JSON string provided. The JSON string follows the format provided by --generate-cli-skeleton. If other arguments are provided on the command line, those values will override the JSON-provided values. It is not possible to pass arbitrary binary values using a JSON-provided value as the string will be taken literally. This may not be specified along with --cli-input-yaml.

--generate-cli-skeleton (string) Prints a JSON skeleton to standard output without sending an API request. If provided with no value or the value input, prints a sample input JSON that can be used as an argument for --cli-input-json. Similarly, if provided yaml-input it will print a sample input YAML that can be used with --cli-input-yaml. If provided with the value output, it validates the command inputs and returns a sample output JSON for that command. The generated JSON skeleton is not stable between versions of the AWS CLI and there are no backwards compatibility guarantees in the JSON skeleton generated.

See ‘aws help’ for descriptions of global parameters.

Output

DatasetImportJobName -> (string)

The name of the dataset import job.

DatasetImportJobArn -> (string)

The ARN of the dataset import job.

DatasetArn -> (string)

The Amazon Resource Name (ARN) of the dataset that the training data was imported to.

TimestampFormat -> (string)

The format of timestamps in the dataset. The format that you specify depends on the DataFrequency specified when the dataset was created. The following formats are supported

  • “yyyy-MM-dd” For the following data frequencies: Y, M, W, and D

  • “yyyy-MM-dd HH:mm:ss” For the following data frequencies: H, 30min, 15min, and 1min; and optionally, for: Y, M, W, and D

TimeZone -> (string)

The single time zone applied to every item in the dataset

UseGeolocationForTimeZone -> (boolean)

Whether TimeZone is automatically derived from the geolocation attribute.

GeolocationFormat -> (string)

The format of the geolocation attribute. Valid Values:"LAT_LONG" and "CC_POSTALCODE" .

DataSource -> (structure)

The location of the training data to import and an AWS Identity and Access Management (IAM) role that Amazon Forecast can assume to access the data.

If encryption is used, DataSource includes an AWS Key Management Service (KMS) key.

S3Config -> (structure)

The path to the data stored in an Amazon Simple Storage Service (Amazon S3) bucket along with the credentials to access the data.

Path -> (string)

The path to an Amazon Simple Storage Service (Amazon S3) bucket or file(s) in an Amazon S3 bucket.

RoleArn -> (string)

The ARN of the AWS Identity and Access Management (IAM) role that Amazon Forecast can assume to access the Amazon S3 bucket or files. If you provide a value for the KMSKeyArn key, the role must allow access to the key.

Passing a role across AWS accounts is not allowed. If you pass a role that isn’t in your account, you get an InvalidInputException error.

KMSKeyArn -> (string)

The Amazon Resource Name (ARN) of an AWS Key Management Service (KMS) key.

EstimatedTimeRemainingInMinutes -> (long)

The estimated time remaining in minutes for the dataset import job to complete.

FieldStatistics -> (map)

Statistical information about each field in the input data.

key -> (string)

value -> (structure)

Provides statistics for each data field imported into to an Amazon Forecast dataset with the CreateDatasetImportJob operation.

Count -> (integer)

The number of values in the field. If the response value is -1, refer to CountLong .

CountDistinct -> (integer)

The number of distinct values in the field. If the response value is -1, refer to CountDistinctLong .

CountNull -> (integer)

The number of null values in the field. If the response value is -1, refer to CountNullLong .

CountNan -> (integer)

The number of NAN (not a number) values in the field. If the response value is -1, refer to CountNanLong .

Min -> (string)

For a numeric field, the minimum value in the field.

Max -> (string)

For a numeric field, the maximum value in the field.

Avg -> (double)

For a numeric field, the average value in the field.

Stddev -> (double)

For a numeric field, the standard deviation.

CountLong -> (long)

The number of values in the field. CountLong is used instead of Count if the value is greater than 2,147,483,647.

CountDistinctLong -> (long)

The number of distinct values in the field. CountDistinctLong is used instead of CountDistinct if the value is greater than 2,147,483,647.

CountNullLong -> (long)

The number of null values in the field. CountNullLong is used instead of CountNull if the value is greater than 2,147,483,647.

CountNanLong -> (long)

The number of NAN (not a number) values in the field. CountNanLong is used instead of CountNan if the value is greater than 2,147,483,647.

DataSize -> (double)

The size of the dataset in gigabytes (GB) after the import job has finished.

Status -> (string)

The status of the dataset import job. States include:

  • ACTIVE

  • CREATE_PENDING , CREATE_IN_PROGRESS , CREATE_FAILED

  • DELETE_PENDING , DELETE_IN_PROGRESS , DELETE_FAILED

  • CREATE_STOPPING , CREATE_STOPPED

Message -> (string)

If an error occurred, an informational message about the error.

CreationTime -> (timestamp)

When the dataset import job was created.

LastModificationTime -> (timestamp)

The last time the resource was modified. The timestamp depends on the status of the job:

  • CREATE_PENDING - The CreationTime .

  • CREATE_IN_PROGRESS - The current timestamp.

  • CREATE_STOPPING - The current timestamp.

  • CREATE_STOPPED - When the job stopped.

  • ACTIVE or CREATE_FAILED - When the job finished or failed.

Format -> (string)

The format of the imported data, CSV or PARQUET.