[ aws . glue ]

update-column-statistics-for-table

Description

Creates or updates column statistics for a table.

See also: AWS API Documentation

See ‘aws help’ for descriptions of global parameters.

Synopsis

  update-column-statistics-for-table
[--catalog-id <value>]
--database-name <value>
--table-name <value>
--column-statistics-list <value>
[--cli-input-json | --cli-input-yaml]
[--generate-cli-skeleton <value>]
[--cli-auto-prompt <value>]

Options

--catalog-id (string)

The ID of the Data Catalog where the table resides. If none is supplied, the AWS account ID is used by default.

--database-name (string)

The name of the catalog database where the table resides.

--table-name (string)

The name of the table whose column statistics are created or updated.

--column-statistics-list (list)

A list of the column statistics.

(structure)

Defines the statistics for one column.

ColumnName -> (string)

The name of the column.

ColumnType -> (string)

The type of the column.

AnalyzedTime -> (timestamp)

The analyzed time of the column statistics.

StatisticsData -> (structure)

The statistics of the column.

Type -> (string)

The type of the column statistics data. One of BOOLEAN, DATE, DECIMAL, DOUBLE, LONG, STRING, or BINARY.

BooleanColumnStatisticsData -> (structure)

Boolean Column Statistics Data.

NumberOfTrues -> (long)

Number of true values.

NumberOfFalses -> (long)

Number of false values.

NumberOfNulls -> (long)

Number of nulls.

DateColumnStatisticsData -> (structure)

Date Column Statistics Data.

MinimumValue -> (timestamp)

Minimum value of the column.

MaximumValue -> (timestamp)

Maximum value of the column.

NumberOfNulls -> (long)

Number of nulls.

NumberOfDistinctValues -> (long)

Number of distinct values.

DecimalColumnStatisticsData -> (structure)

Decimal Column Statistics Data.

MinimumValue -> (structure)

Minimum value of the column.

UnscaledValue -> (blob)

The unscaled numeric value.

Scale -> (integer)

The scale that determines where the decimal point falls in the unscaled value.

MaximumValue -> (structure)

Maximum value of the column.

UnscaledValue -> (blob)

The unscaled numeric value.

Scale -> (integer)

The scale that determines where the decimal point falls in the unscaled value.

NumberOfNulls -> (long)

Number of nulls.

NumberOfDistinctValues -> (long)

Number of distinct values.
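The UnscaledValue and Scale pair can be read as value = UnscaledValue x 10^(-Scale). The following is a minimal sketch of producing that pair from a Python Decimal. It assumes the blob holds the unscaled integer as big-endian two's-complement bytes and that the blob is written as base64 text in JSON input; neither detail is stated on this page, so check both against your CLI version before relying on them. The helper name to_decimal_number is hypothetical.

```python
import base64
from decimal import Decimal


def to_decimal_number(value: Decimal) -> dict:
    """Split a finite Decimal into an unscaled integer and a scale.

    Assumption (not stated in this reference): the blob is the unscaled
    integer in big-endian two's-complement form, base64-encoded for JSON.
    """
    if not value.is_finite():
        raise ValueError("NaN and infinity have no unscaled representation")

    sign, digits, exponent = value.as_tuple()
    unscaled = int("".join(str(d) for d in digits))
    if sign:
        unscaled = -unscaled

    scale = -exponent
    if scale < 0:
        # A positive exponent (for example Decimal("1E+2")) is folded into
        # the unscaled value so that the scale is never negative.
        unscaled *= 10 ** (-scale)
        scale = 0

    # One extra bit leaves room for the sign in two's complement.
    length = (unscaled.bit_length() + 8) // 8
    raw = unscaled.to_bytes(length, "big", signed=True)
    return {"UnscaledValue": base64.b64encode(raw).decode("ascii"), "Scale": scale}


def from_decimal_number(number: dict) -> Decimal:
    """Inverse of to_decimal_number, under the same assumptions."""
    raw = base64.b64decode(number["UnscaledValue"])
    unscaled = int.from_bytes(raw, "big", signed=True)
    return Decimal(unscaled).scaleb(-number["Scale"])


minimum = to_decimal_number(Decimal("123.45"))
print(minimum)
print(from_decimal_number(minimum))
```

Here 123.45 becomes the unscaled integer 12345 with a scale of 2, so the decimal point falls two digits from the right.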

DoubleColumnStatisticsData -> (structure)

Double Column Statistics Data.

MinimumValue -> (double)

Minimum value of the column.

MaximumValue -> (double)

Maximum value of the column.

NumberOfNulls -> (long)

Number of nulls.

NumberOfDistinctValues -> (long)

Number of distinct values.

LongColumnStatisticsData -> (structure)

Long Column Statistics Data.

MinimumValue -> (long)

Minimum value of the column.

MaximumValue -> (long)

Maximum value of the column.

NumberOfNulls -> (long)

Number of nulls.

NumberOfDistinctValues -> (long)

Number of distinct values.

StringColumnStatisticsData -> (structure)

String Column Statistics Data.

MaximumLength -> (long)

Maximum length of the column.

AverageLength -> (double)

Average length of the column.

NumberOfNulls -> (long)

Number of nulls.

NumberOfDistinctValues -> (long)

Number of distinct values.

BinaryColumnStatisticsData -> (structure)

Binary Column Statistics Data.

MaximumLength -> (long)

Maximum length of the column.

AverageLength -> (double)

Average length of the column.

NumberOfNulls -> (long)

Number of nulls.

JSON Syntax:

[
  {
    "ColumnName": "string",
    "ColumnType": "string",
    "AnalyzedTime": timestamp,
    "StatisticsData": {
      "Type": "BOOLEAN"|"DATE"|"DECIMAL"|"DOUBLE"|"LONG"|"STRING"|"BINARY",
      "BooleanColumnStatisticsData": {
        "NumberOfTrues": long,
        "NumberOfFalses": long,
        "NumberOfNulls": long
      },
      "DateColumnStatisticsData": {
        "MinimumValue": timestamp,
        "MaximumValue": timestamp,
        "NumberOfNulls": long,
        "NumberOfDistinctValues": long
      },
      "DecimalColumnStatisticsData": {
        "MinimumValue": {
          "UnscaledValue": blob,
          "Scale": integer
        },
        "MaximumValue": {
          "UnscaledValue": blob,
          "Scale": integer
        },
        "NumberOfNulls": long,
        "NumberOfDistinctValues": long
      },
      "DoubleColumnStatisticsData": {
        "MinimumValue": double,
        "MaximumValue": double,
        "NumberOfNulls": long,
        "NumberOfDistinctValues": long
      },
      "LongColumnStatisticsData": {
        "MinimumValue": long,
        "MaximumValue": long,
        "NumberOfNulls": long,
        "NumberOfDistinctValues": long
      },
      "StringColumnStatisticsData": {
        "MaximumLength": long,
        "AverageLength": double,
        "NumberOfNulls": long,
        "NumberOfDistinctValues": long
      },
      "BinaryColumnStatisticsData": {
        "MaximumLength": long,
        "AverageLength": double,
        "NumberOfNulls": long
      }
    }
  }
  ...
]
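As an illustration of the syntax above, the following sketch builds a two-entry list that could be serialized and passed to --column-statistics-list. It assumes that only the StatisticsData member matching Type is filled in, which this page does not state explicitly. The helper column_statistics, the column names, and all numbers are hypothetical.

```python
import json
from datetime import datetime, timezone

# Maps each Type value from the JSON syntax to the StatisticsData member
# that carries the numbers for that type.
TYPE_TO_MEMBER = {
    "BOOLEAN": "BooleanColumnStatisticsData",
    "DATE": "DateColumnStatisticsData",
    "DECIMAL": "DecimalColumnStatisticsData",
    "DOUBLE": "DoubleColumnStatisticsData",
    "LONG": "LongColumnStatisticsData",
    "STRING": "StringColumnStatisticsData",
    "BINARY": "BinaryColumnStatisticsData",
}


def column_statistics(name, column_type, stats_type, data, analyzed_time):
    """Hypothetical helper: build one entry of the column statistics list."""
    if stats_type not in TYPE_TO_MEMBER:
        raise ValueError(f"unknown statistics type: {stats_type}")
    return {
        "ColumnName": name,
        "ColumnType": column_type,
        # Timestamps are written as ISO 8601 text here.
        "AnalyzedTime": analyzed_time.isoformat(),
        "StatisticsData": {
            "Type": stats_type,
            # Assumption: only the member that matches Type is populated.
            TYPE_TO_MEMBER[stats_type]: data,
        },
    }


analyzed = datetime(2024, 1, 15, 12, 0, tzinfo=timezone.utc)
stats_list = [
    column_statistics(
        "order_id", "bigint", "LONG",
        {"MinimumValue": 1, "MaximumValue": 1000,
         "NumberOfNulls": 0, "NumberOfDistinctValues": 1000},
        analyzed,
    ),
    column_statistics(
        "customer_name", "string", "STRING",
        {"MaximumLength": 64, "AverageLength": 17.5,
         "NumberOfNulls": 3, "NumberOfDistinctValues": 812},
        analyzed,
    ),
]

print(json.dumps(stats_list, indent=2))
```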

--cli-input-json | --cli-input-yaml (string)

Reads arguments from the JSON string provided. The JSON string follows the format provided by --generate-cli-skeleton. If other arguments are provided on the command line, those values will override the JSON-provided values. It is not possible to pass arbitrary binary values using a JSON-provided value as the string will be taken literally. This may not be specified along with --cli-input-yaml.
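A request document for --cli-input-json mirrors the command options as top-level keys. The sketch below writes such a document to a temporary file and prints the command line that would read it. The database, table, and column names and all counts are hypothetical, and the script only prepares the file; it does not call AWS.

```python
import json
import tempfile
from pathlib import Path

# Top-level keys correspond to --database-name, --table-name and
# --column-statistics-list. All values below are made up for illustration.
request = {
    "DatabaseName": "sales_db",
    "TableName": "orders",
    "ColumnStatisticsList": [
        {
            "ColumnName": "is_gift",
            "ColumnType": "boolean",
            "AnalyzedTime": "2024-01-15T12:00:00Z",
            "StatisticsData": {
                "Type": "BOOLEAN",
                "BooleanColumnStatisticsData": {
                    "NumberOfTrues": 120,
                    "NumberOfFalses": 870,
                    "NumberOfNulls": 10,
                },
            },
        }
    ],
}

request_path = Path(tempfile.gettempdir()) / "update-column-statistics.json"
request_path.write_text(json.dumps(request, indent=2), encoding="utf-8")

# The file:// prefix tells the CLI to read the parameter value from a file.
print(
    "aws glue update-column-statistics-for-table "
    f"--cli-input-json file://{request_path}"
)
```

Any option also given on the command line overrides the matching key in the file, as described above.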

--generate-cli-skeleton (string)

Prints a JSON skeleton to standard output without sending an API request. If provided with no value or the value input, prints a sample input JSON that can be used as an argument for --cli-input-json. Similarly, if provided yaml-input it will print a sample input YAML that can be used with --cli-input-yaml. If provided with the value output, it validates the command inputs and returns a sample output JSON for that command.

--cli-auto-prompt (boolean)

Automatically prompt for CLI input parameters.

Output

Errors -> (list)

List of ColumnStatisticsErrors.

(structure)

Defines a column whose statistics could not be updated, together with the error.

ColumnStatistics -> (structure)

The ColumnStatistics of the column.

ColumnName -> (string)

The name of the column.

ColumnType -> (string)

The type of the column.

AnalyzedTime -> (timestamp)

The analyzed time of the column statistics.

StatisticsData -> (structure)

The statistics of the column.

Type -> (string)

The type of the column statistics data. One of BOOLEAN, DATE, DECIMAL, DOUBLE, LONG, STRING, or BINARY.

BooleanColumnStatisticsData -> (structure)

Boolean Column Statistics Data.

NumberOfTrues -> (long)

Number of true values.

NumberOfFalses -> (long)

Number of false values.

NumberOfNulls -> (long)

Number of nulls.

DateColumnStatisticsData -> (structure)

Date Column Statistics Data.

MinimumValue -> (timestamp)

Minimum value of the column.

MaximumValue -> (timestamp)

Maximum value of the column.

NumberOfNulls -> (long)

Number of nulls.

NumberOfDistinctValues -> (long)

Number of distinct values.

DecimalColumnStatisticsData -> (structure)

Decimal Column Statistics Data.

MinimumValue -> (structure)

Minimum value of the column.

UnscaledValue -> (blob)

The unscaled numeric value.

Scale -> (integer)

The scale that determines where the decimal point falls in the unscaled value.

MaximumValue -> (structure)

Maximum value of the column.

UnscaledValue -> (blob)

The unscaled numeric value.

Scale -> (integer)

The scale that determines where the decimal point falls in the unscaled value.

NumberOfNulls -> (long)

Number of nulls.

NumberOfDistinctValues -> (long)

Number of distinct values.

DoubleColumnStatisticsData -> (structure)

Double Column Statistics Data.

MinimumValue -> (double)

Minimum value of the column.

MaximumValue -> (double)

Maximum value of the column.

NumberOfNulls -> (long)

Number of nulls.

NumberOfDistinctValues -> (long)

Number of distinct values.

LongColumnStatisticsData -> (structure)

Long Column Statistics Data.

MinimumValue -> (long)

Minimum value of the column.

MaximumValue -> (long)

Maximum value of the column.

NumberOfNulls -> (long)

Number of nulls.

NumberOfDistinctValues -> (long)

Number of distinct values.

StringColumnStatisticsData -> (structure)

String Column Statistics Data.

MaximumLength -> (long)

Maximum length of the column.

AverageLength -> (double)

Average length of the column.

NumberOfNulls -> (long)

Number of nulls.

NumberOfDistinctValues -> (long)

Number of distinct values.

BinaryColumnStatisticsData -> (structure)

Binary Column Statistics Data.

MaximumLength -> (long)

Maximum length of the column.

AverageLength -> (double)

Average length of the column.

NumberOfNulls -> (long)

Number of nulls.

Error -> (structure)

The error that occurred during the operation.

ErrorCode -> (string)

The code associated with this error.

ErrorMessage -> (string)

A message describing the error.
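Each entry in Errors pairs the ColumnStatistics that was submitted with an Error structure, so a caller can list the columns that failed. The following is a minimal sketch of that step over an already-parsed response. The helper failed_columns is hypothetical, and the sample response, including its error code and message, is made up for illustration rather than captured from a real call.

```python
from typing import List, Tuple


def failed_columns(response: dict) -> List[Tuple[str, str, str]]:
    """Return (column name, error code, error message) for each failed column."""
    failures = []
    for entry in response.get("Errors", []):
        column = entry.get("ColumnStatistics", {}).get("ColumnName", "<unknown>")
        error = entry.get("Error", {})
        failures.append(
            (column, error.get("ErrorCode", ""), error.get("ErrorMessage", ""))
        )
    return failures


# Illustrative response shaped like the Output section; values are invented.
sample_response = {
    "Errors": [
        {
            "ColumnStatistics": {
                "ColumnName": "order_total",
                "ColumnType": "decimal(10,2)",
                "StatisticsData": {"Type": "DECIMAL"},
            },
            "Error": {
                "ErrorCode": "InvalidInputException",
                "ErrorMessage": "Example message for illustration only.",
            },
        }
    ]
}

for column, code, message in failed_columns(sample_response):
    print(f"{column}: {code}: {message}")
```

A response with no Errors key, or with an empty list, yields no failures.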