Restores an archived copy of an object back into Amazon S3
This operation performs the following types of requests:
select
- Perform a select query on an archived object
restore an archive
- Restore an archived object
To use this operation, you must have permissions to perform the s3:RestoreObject
action. The bucket owner has this permission by default and can grant this permission to others. For more information about permissions, see Permissions Related to Bucket Subresource Operations and Managing Access Permissions to Your Amazon S3 Resources in the Amazon Simple Storage Service Developer Guide .
Querying Archives with Select Requests
You use a select type of request to perform SQL queries on archived objects. The archived objects that are being queried by the select request must be formatted as uncompressed comma-separated values (CSV) files. You can run queries and custom analytics on your archived data without having to restore your data to a hotter Amazon S3 tier. For an overview about select requests, see Querying Archived Objects in the Amazon Simple Storage Service Developer Guide .
When making a select request, do the following:
Define an output location for the select query’s output. This must be an Amazon S3 bucket in the same AWS Region as the bucket that contains the archive object that is being queried. The AWS account that initiates the job must have permissions to write to the S3 bucket. You can specify the storage class and encryption for the output objects stored in the bucket. For more information about output, see Querying Archived Objects in the Amazon Simple Storage Service Developer Guide . For more information about the S3
structure in the request body, see the following:
PutObject
Managing Access with ACLs in the Amazon Simple Storage Service Developer Guide
Protecting Data Using Server-Side Encryption in the Amazon Simple Storage Service Developer Guide
Define the SQL expression for the SELECT
type of restoration for your query in the request body’s SelectParameters
structure. You can use expressions like the following examples.
The following expression returns all records from the specified object. SELECT * FROM Object
Assuming that you are not using any headers for data stored in the object, you can specify columns with positional headers. SELECT s._1, s._2 FROM Object s WHERE s._3 > 100
If you have headers and you set the fileHeaderInfo
in the CSV
structure in the request body to USE
, you can specify headers in the query. (If you set the fileHeaderInfo
field to IGNORE
, the first row is skipped for the query.) You cannot mix ordinal positions with header column names. SELECT s.Id, s.FirstName, s.SSN FROM S3Object s
For more information about using SQL with S3 Glacier Select restore, see SQL Reference for Amazon S3 Select and S3 Glacier Select in the Amazon Simple Storage Service Developer Guide .
When making a select request, you can also do the following:
To expedite your queries, specify the Expedited
tier. For more information about tiers, see “Restoring Archives,” later in this topic.
Specify details about the data serialization format of both the input object that is being queried and the serialization of the CSV-encoded query results.
The following are additional important facts about the select feature:
The output results are new Amazon S3 objects. Unlike archive retrievals, they are stored until explicitly deleted-manually or through a lifecycle policy.
You can issue more than one select request on the same Amazon S3 object. Amazon S3 doesn’t deduplicate requests, so avoid issuing duplicate requests.
Amazon S3 accepts a select request even if the object has already been restored. A select request doesn’t return error response 409
.
Restoring Archives
Objects in the GLACIER and DEEP_ARCHIVE storage classes are archived. To access an archived object, you must first initiate a restore request. This restores a temporary copy of the archived object. In a restore request, you specify the number of days that you want the restored copy to exist. After the specified period, Amazon S3 deletes the temporary copy but the object remains archived in the GLACIER or DEEP_ARCHIVE storage class that object was restored from.
To restore a specific object version, you can provide a version ID. If you don’t provide a version ID, Amazon S3 restores the current version.
The time it takes restore jobs to finish depends on which storage class the object is being restored from and which data access tier you specify.
When restoring an archived object (or using a select request), you can specify one of the following data access tier options in the Tier
element of the request body:
**Expedited
** - Expedited retrievals allow you to quickly access your data stored in the GLACIER storage class when occasional urgent requests for a subset of archives are required. For all but the largest archived objects (250 MB+), data accessed using Expedited retrievals are typically made available within 1–5 minutes. Provisioned capacity ensures that retrieval capacity for Expedited retrievals is available when you need it. Expedited retrievals and provisioned capacity are not available for the DEEP_ARCHIVE storage class.
**Standard
** - S3 Standard retrievals allow you to access any of your archived objects within several hours. This is the default option for the GLACIER and DEEP_ARCHIVE retrieval requests that do not specify the retrieval option. S3 Standard retrievals typically complete within 3-5 hours from the GLACIER storage class and typically complete within 12 hours from the DEEP_ARCHIVE storage class.
**Bulk
** - Bulk retrievals are Amazon S3 Glacier’s lowest-cost retrieval option, enabling you to retrieve large amounts, even petabytes, of data inexpensively in a day. Bulk retrievals typically complete within 5-12 hours from the GLACIER storage class and typically complete within 48 hours from the DEEP_ARCHIVE storage class.
For more information about archive retrieval options and provisioned capacity for Expedited
data access, see Restoring Archived Objects in the Amazon Simple Storage Service Developer Guide .
You can use Amazon S3 restore speed upgrade to change the restore speed to a faster speed while it is in progress. You upgrade the speed of an in-progress restoration by issuing another restore request to the same object, setting a new Tier
request element. When issuing a request to upgrade the restore tier, you must choose a tier that is faster than the tier that the in-progress restore is using. You must not change any other parameters, such as the Days
request element. For more information, see Upgrading the Speed of an In-Progress Restore in the Amazon Simple Storage Service Developer Guide .
To get the status of object restoration, you can send a HEAD
request. Operations return the x-amz-restore
header, which provides information about the restoration status, in the response. You can use Amazon S3 event notifications to notify you when a restore is initiated or completed. For more information, see Configuring Amazon S3 Event Notifications in the Amazon Simple Storage Service Developer Guide .
After restoring an archived object, you can update the restoration period by reissuing the request with a new period. Amazon S3 updates the restoration period relative to the current time and charges only for the request-there are no data transfer charges. You cannot update the restoration period when Amazon S3 is actively processing your current restore request for the object.
If your bucket has a lifecycle configuration with a rule that includes an expiration action, the object expiration overrides the life span that you specify in a restore request. For example, if you restore an object copy for 10 days, but the object is scheduled to expire in 3 days, Amazon S3 deletes the object in 3 days. For more information about lifecycle configuration, see PutBucketLifecycleConfiguration and Object Lifecycle Management in Amazon Simple Storage Service Developer Guide .
Responses
A successful operation returns either the 200 OK
or 202 Accepted
status code.
If the object copy is not previously restored, then Amazon S3 returns 202 Accepted
in the response.
If the object copy is previously restored, Amazon S3 returns 200 OK
in the response.
Special Errors
Code: RestoreAlreadyInProgress
Cause: Object restore is already in progress. (This error does not apply to SELECT type requests.)
HTTP Status Code: 409 Conflict
SOAP Fault Code Prefix: Client
Code: GlacierExpeditedRetrievalNotAvailable
Cause: S3 Glacier expedited retrievals are currently not available. Try again later. (Returned if there is insufficient capacity to process the Expedited request. This error applies only to Expedited retrievals and not to S3 Standard or Bulk retrievals.)
HTTP Status Code: 503
SOAP Fault Code Prefix: N/A
Related Resources
PutBucketLifecycleConfiguration
GetBucketNotificationConfiguration
SQL Reference for Amazon S3 Select and S3 Glacier Select in the Amazon Simple Storage Service Developer Guide
See also: AWS API Documentation
See ‘aws help’ for descriptions of global parameters.
restore-object
--bucket <value>
--key <value>
[--version-id <value>]
[--restore-request <value>]
[--request-payer <value>]
[--cli-input-json | --cli-input-yaml]
[--generate-cli-skeleton <value>]
[--cli-auto-prompt <value>]
--bucket
(string)
The bucket name or containing the object to restore.
When using this API with an access point, you must direct requests to the access point hostname. The access point hostname takes the form AccessPointName -AccountId .s3-accesspoint.*Region* .amazonaws.com. When using this operation using an access point through the AWS SDKs, you provide the access point ARN in place of the bucket name. For more information about access point ARNs, see Using Access Points in the Amazon Simple Storage Service Developer Guide .
--key
(string)
Object key for which the operation was initiated.
--version-id
(string)
VersionId used to reference a specific version of the object.
--restore-request
(structure)
Container for restore job parameters.
Days -> (integer)
Lifetime of the active copy in days. Do not use with restores that specify
OutputLocation
.GlacierJobParameters -> (structure)
S3 Glacier related parameters pertaining to this job. Do not use with restores that specify
OutputLocation
.Tier -> (string)
S3 Glacier retrieval tier at which the restore will be processed.
Type -> (string)
Type of restore request.
Tier -> (string)
S3 Glacier retrieval tier at which the restore will be processed.
Description -> (string)
The optional description for the job.
SelectParameters -> (structure)
Describes the parameters for Select job types.
InputSerialization -> (structure)
Describes the serialization format of the object.
CSV -> (structure)
Describes the serialization of a CSV-encoded object.
FileHeaderInfo -> (string)
Describes the first line of input. Valid values are:
NONE
: First line is not a header.
IGNORE
: First line is a header, but you can’t use the header values to indicate the column in an expression. You can use column position (such as _1, _2, …) to indicate the column (SELECT s._1 FROM OBJECT s
).
Use
: First line is a header, and you can use the header value to identify a column in an expression (SELECT "name" FROM OBJECT
).Comments -> (string)
A single character used to indicate that a row should be ignored when the character is present at the start of that row. You can specify any character to indicate a comment line.
QuoteEscapeCharacter -> (string)
A single character used for escaping the quotation mark character inside an already escaped value. For example, the value “”” a , b “”” is parsed as ” a , b “.
RecordDelimiter -> (string)
A single character used to separate individual records in the input. Instead of the default value, you can specify an arbitrary delimiter.
FieldDelimiter -> (string)
A single character used to separate individual fields in a record. You can specify an arbitrary delimiter.
QuoteCharacter -> (string)
A single character used for escaping when the field delimiter is part of the value. For example, if the value is
a, b
, Amazon S3 wraps this field value in quotation marks, as follows:" a , b "
.Type: String
Default:
"
Ancestors:
CSV
AllowQuotedRecordDelimiter -> (boolean)
Specifies that CSV field values may contain quoted record delimiters and such records should be allowed. Default value is FALSE. Setting this value to TRUE may lower performance.
CompressionType -> (string)
Specifies object’s compression format. Valid values: NONE, GZIP, BZIP2. Default Value: NONE.
JSON -> (structure)
Specifies JSON as object’s input serialization format.
Type -> (string)
The type of JSON. Valid values: Document, Lines.
Parquet -> (structure)
Specifies Parquet as object’s input serialization format.
ExpressionType -> (string)
The type of the provided expression (for example, SQL).
Expression -> (string)
The expression that is used to query the object.
OutputSerialization -> (structure)
Describes how the results of the Select job are serialized.
CSV -> (structure)
Describes the serialization of CSV-encoded Select results.
QuoteFields -> (string)
Indicates whether to use quotation marks around output fields.
ALWAYS
: Always use quotation marks for output fields.
ASNEEDED
: Use quotation marks for output fields when needed.QuoteEscapeCharacter -> (string)
The single character used for escaping the quote character inside an already escaped value.
RecordDelimiter -> (string)
A single character used to separate individual records in the output. Instead of the default value, you can specify an arbitrary delimiter.
FieldDelimiter -> (string)
The value used to separate individual fields in a record. You can specify an arbitrary delimiter.
QuoteCharacter -> (string)
A single character used for escaping when the field delimiter is part of the value. For example, if the value is
a, b
, Amazon S3 wraps this field value in quotation marks, as follows:" a , b "
.JSON -> (structure)
Specifies JSON as request’s output serialization format.
RecordDelimiter -> (string)
The value used to separate individual records in the output. If no value is specified, Amazon S3 uses a newline character (‘n’).
OutputLocation -> (structure)
Describes the location where the restore job’s output is stored.
S3 -> (structure)
Describes an S3 location that will receive the results of the restore request.
BucketName -> (string)
The name of the bucket where the restore results will be placed.
Prefix -> (string)
The prefix that is prepended to the restore results for this request.
Encryption -> (structure)
Contains the type of server-side encryption used.
EncryptionType -> (string)
The server-side encryption algorithm used when storing job results in Amazon S3 (for example, AES256, aws:kms).
KMSKeyId -> (string)
If the encryption type is
aws:kms
, this optional value specifies the ID of the symmetric customer managed AWS KMS CMK to use for encryption of job results. Amazon S3 only supports symmetric CMKs. For more information, see Using Symmetric and Asymmetric Keys in the AWS Key Management Service Developer Guide .KMSContext -> (string)
If the encryption type is
aws:kms
, this optional value can be used to specify the encryption context for the restore results.CannedACL -> (string)
The canned ACL to apply to the restore results.
AccessControlList -> (list)
A list of grants that control access to the staged results.
(structure)
Container for grant information.
Grantee -> (structure)
The person being granted permissions.
DisplayName -> (string)
Screen name of the grantee.
EmailAddress -> (string)
Email address of the grantee.
Note
Using email addresses to specify a grantee is only supported in the following AWS Regions:
US East (N. Virginia)
US West (N. California)
US West (Oregon)
Asia Pacific (Singapore)
Asia Pacific (Sydney)
Asia Pacific (Tokyo)
Europe (Ireland)
South America (São Paulo)
For a list of all the Amazon S3 supported Regions and endpoints, see Regions and Endpoints in the AWS General Reference.
ID -> (string)
The canonical user ID of the grantee.
Type -> (string)
Type of grantee
URI -> (string)
URI of the grantee group.
Permission -> (string)
Specifies the permission given to the grantee.
Tagging -> (structure)
The tag-set that is applied to the restore results.
TagSet -> (list)
A collection for a set of tags
(structure)
A container of a key value name pair.
Key -> (string)
Name of the tag.
Value -> (string)
Value of the tag.
UserMetadata -> (list)
A list of metadata to store with the restore results in S3.
(structure)
A metadata key-value pair to store with an object.
Name -> (string)
Name of the Object.
Value -> (string)
Value of the Object.
StorageClass -> (string)
The class of storage used to store the restore results.
JSON Syntax:
{
"Days": integer,
"GlacierJobParameters": {
"Tier": "Standard"|"Bulk"|"Expedited"
},
"Type": "SELECT",
"Tier": "Standard"|"Bulk"|"Expedited",
"Description": "string",
"SelectParameters": {
"InputSerialization": {
"CSV": {
"FileHeaderInfo": "USE"|"IGNORE"|"NONE",
"Comments": "string",
"QuoteEscapeCharacter": "string",
"RecordDelimiter": "string",
"FieldDelimiter": "string",
"QuoteCharacter": "string",
"AllowQuotedRecordDelimiter": true|false
},
"CompressionType": "NONE"|"GZIP"|"BZIP2",
"JSON": {
"Type": "DOCUMENT"|"LINES"
},
"Parquet": {
}
},
"ExpressionType": "SQL",
"Expression": "string",
"OutputSerialization": {
"CSV": {
"QuoteFields": "ALWAYS"|"ASNEEDED",
"QuoteEscapeCharacter": "string",
"RecordDelimiter": "string",
"FieldDelimiter": "string",
"QuoteCharacter": "string"
},
"JSON": {
"RecordDelimiter": "string"
}
}
},
"OutputLocation": {
"S3": {
"BucketName": "string",
"Prefix": "string",
"Encryption": {
"EncryptionType": "AES256"|"aws:kms",
"KMSKeyId": "string",
"KMSContext": "string"
},
"CannedACL": "private"|"public-read"|"public-read-write"|"authenticated-read"|"aws-exec-read"|"bucket-owner-read"|"bucket-owner-full-control",
"AccessControlList": [
{
"Grantee": {
"DisplayName": "string",
"EmailAddress": "string",
"ID": "string",
"Type": "CanonicalUser"|"AmazonCustomerByEmail"|"Group",
"URI": "string"
},
"Permission": "FULL_CONTROL"|"WRITE"|"WRITE_ACP"|"READ"|"READ_ACP"
}
...
],
"Tagging": {
"TagSet": [
{
"Key": "string",
"Value": "string"
}
...
]
},
"UserMetadata": [
{
"Name": "string",
"Value": "string"
}
...
],
"StorageClass": "STANDARD"|"REDUCED_REDUNDANCY"|"STANDARD_IA"|"ONEZONE_IA"|"INTELLIGENT_TIERING"|"GLACIER"|"DEEP_ARCHIVE"
}
}
}
--request-payer
(string)
Confirms that the requester knows that they will be charged for the request. Bucket owners need not specify this parameter in their requests. For information about downloading objects from requester pays buckets, see Downloading Objects in Requestor Pays Buckets in the Amazon S3 Developer Guide .
Possible values:
requester
--cli-input-json
| --cli-input-yaml
(string)
Reads arguments from the JSON string provided. The JSON string follows the format provided by --generate-cli-skeleton
. If other arguments are provided on the command line, those values will override the JSON-provided values. It is not possible to pass arbitrary binary values using a JSON-provided value as the string will be taken literally. This may not be specified along with --cli-input-yaml
.
--generate-cli-skeleton
(string)
Prints a JSON skeleton to standard output without sending an API request. If provided with no value or the value input
, prints a sample input JSON that can be used as an argument for --cli-input-json
. Similarly, if provided yaml-input
it will print a sample input YAML that can be used with --cli-input-yaml
. If provided with the value output
, it validates the command inputs and returns a sample output JSON for that command.
--cli-auto-prompt
(boolean)
Automatically prompt for CLI input parameters.
See ‘aws help’ for descriptions of global parameters.
To create a restore request for an object
The following restore-object
example restores the specified Amazon S3 Glacier object for the bucket my-glacier-bucket
for 10 days.
aws s3api restore-object \
--bucket my-glacier-bucket \
--key doc1.rtf \
--restore-request Days=10
This command produces no output.
RequestCharged -> (string)
If present, indicates that the requester was successfully charged for the request.
RestoreOutputPath -> (string)
Indicates the path in the provided S3 output location where Select results will be restored to.