Creates a new job to profile an AWS Glue DataBrew dataset that exists in the current AWS account.
See also: AWS API Documentation
See ‘aws help’ for descriptions of global parameters.
create-profile-job
--dataset-name <value>
[--encryption-key-arn <value>]
[--encryption-mode <value>]
--name <value>
[--log-subscription <value>]
[--max-capacity <value>]
[--max-retries <value>]
--output-location <value>
--role-arn <value>
[--tags <value>]
[--timeout <value>]
[--cli-input-json | --cli-input-yaml]
[--generate-cli-skeleton <value>]
--dataset-name
(string)
The name of the dataset that this job is to act upon.
--encryption-key-arn
(string)
The Amazon Resource Name (ARN) of an encryption key that is used to protect the job.
--encryption-mode
(string)
The encryption mode for the job, which can be one of the following:
SSE-KMS
- para>``SSE-KMS`` - server-side encryption with AWS KMS-managed keys.
SSE-S3
- Server-side encryption with keys managed by Amazon S3.Possible values:
SSE-KMS
SSE-S3
--name
(string)
The name of the job to be created.
--log-subscription
(string)
A value that enables or disables Amazon CloudWatch logging for the current AWS account. If logging is enabled, CloudWatch writes one log stream for each job run.
Possible values:
ENABLE
DISABLE
--max-capacity
(integer)
The maximum number of nodes that DataBrew can use when the job processes data.
--max-retries
(integer)
The maximum number of times to retry the job after a job run fails.
--output-location
(structure)
An Amazon S3 location (bucket name an object key) where DataBrew can read input data, or write output from a job.
Bucket -> (string)
The S3 bucket name.
Key -> (string)
The unique name of the object in the bucket.
Shorthand Syntax:
Bucket=string,Key=string
JSON Syntax:
{
"Bucket": "string",
"Key": "string"
}
--role-arn
(string)
The Amazon Resource Name (ARN) of the AWS Identity and Access Management (IAM) role to be assumed for this request.
--tags
(map)
Metadata tags to apply to this job.
key -> (string)
value -> (string)
Shorthand Syntax:
KeyName1=string,KeyName2=string
JSON Syntax:
{"string": "string"
...}
--timeout
(integer)
The job’s timeout in minutes. A job that attempts to run longer than this timeout period ends with a status of
TIMEOUT
.
--cli-input-json
| --cli-input-yaml
(string)
Reads arguments from the JSON string provided. The JSON string follows the format provided by --generate-cli-skeleton
. If other arguments are provided on the command line, those values will override the JSON-provided values. It is not possible to pass arbitrary binary values using a JSON-provided value as the string will be taken literally. This may not be specified along with --cli-input-yaml
.
--generate-cli-skeleton
(string)
Prints a JSON skeleton to standard output without sending an API request. If provided with no value or the value input
, prints a sample input JSON that can be used as an argument for --cli-input-json
. Similarly, if provided yaml-input
it will print a sample input YAML that can be used with --cli-input-yaml
. If provided with the value output
, it validates the command inputs and returns a sample output JSON for that command.
See ‘aws help’ for descriptions of global parameters.