2017-10-10 94 views
0

是否有命令行方式來恢復Glacier的數據? 到目前爲止,我曾嘗試:從Amazon Glacier恢復數據

s3cmd restore --recursive s3://mybucketname/folder/ 

aws s3 ls s3://<bucket_name> | awk '{print $4}' | xargs -L 1 aws s3api restore-object --restore-request Days=<days> --bucket <bucket_name> --key 

但是,沒有幫助那裏。 PS:我知道我們可以通過控制檯做到這一點。

+0

當你說 「這裏沒有幫助」,你是什麼意思?命令失敗了嗎?它成功了,但是你找不到S3對象?您應該從文檔中瞭解到標準冰川檢索時間通常爲3-5小時,除非您要求加快(在這種情況下您需要支付更多費用)。 – jarmod

回答

0

不幸的是,這是不可能的。您只能使用Amazon S3訪問已歸檔到Amazon Glacier的對象。

參見:http://docs.aws.amazon.com/AmazonS3/latest/user-guide/restore-archived-objects.html

+0

是的,我知道這種特殊的方式。 但是,即使通過控制檯的限制,你不能恢復文件夾。恢復是在對象層面上,而不是在桶級別上。 我認爲這不是一個好辦法。 –

+0

S3沒有文件夾。一個文件夾是一個邏輯概念,並不是S3中實際存在的東西。 S3中的所有對象都只是平面名稱空間中的鍵。文件夾出現的事實只是一個邏輯概念,使用文件名字符「/」作爲客戶端軟件中的分隔符。 –

-1

您可以使用下面的Python代碼冰川數據恢復到S3。

import boto3, botocore 
import subprocess, os, shutil, tempfile, argparse, sys, time, codecs 
from pprint import pprint 

sys.stdout = codecs.getwriter('utf8')(sys.stdout) 

parser = argparse.ArgumentParser() 
parser.add_argument('--max-rate-mb', action='store', type=int, default=10000, help='The maximum rate in MB/h to restore files at. Files larger than this will not be restored.') 
parser.add_argument('--restore-days', action='store', type=int, default=30, help='How many days restored objects will remain in S3.') 
parser.add_argument('--restore-path', action='store', help='The bucket/prefix to restore from') 
parser.add_argument('--pretend', action='store_true', help='Do not execute restores') 
parser.add_argument('--estimate', action='store_true', help='When pretending, do not check for already-restored files') 
args = parser.parse_args() 

if not args.restore_path: 
    print 'No restore path specified.' 
    sys.exit(1) 

BUCKET = None 
PREFIX = None 
if '/' in args.restore_path: 
    BUCKET, PREFIX = args.restore_path.split('/',1) 
else: 
    BUCKET = args.restore_path 
    PREFIX = '' 

RATE_LIMIT_BYTES = args.max_rate_mb * 1024 * 1024 

s3 = boto3.Session(aws_access_key_id='<ACCESS_KEY>', aws_secret_access_key='<SECRET_KEY>').resource('s3') 
bucket = s3.Bucket(BUCKET) 

totalsize = 0 
objects = [] 

objcount = 0 
for objpage in bucket.objects.filter(Prefix=PREFIX).page_size(100).pages(): 
    for obj in objpage: 
     objcount += 1 
     print obj 
     objects.append(obj) 
    print u'Found {} objects.'.format(objcount) 
print 

objects.sort(key=lambda x: x.size, reverse=True) 

objects = filter(lambda x: x.storage_class == 'GLACIER', objects) 

if objects: 
    obj = objects[0] 
    print u'The largest object found is of {} size: {:14,d} {:1s} {}'.format(('a restorable' if obj.size <= RATE_LIMIT_BYTES else 'an UNRESTORABLE'), obj.size, obj.storage_class[0], obj.key) 
    print 

while objects: 
    current_set = [] 
    current_set_total = 0 
    unreported_unrestoreable_objects = [] 
    i = 0 
    while i < len(objects): 
     obj = objects[i] 

     if obj.size > RATE_LIMIT_BYTES: 
      unreported_unrestoreable_objects.append(obj) 
     elif unreported_unrestoreable_objects: 
      # No longer accumulating these. Print the ones we found. 
      print u'Some objects could not be restored due to exceeding the hourly rate limit:' 
      for obj in unreported_unrestoreable_objects: 
       print u'- {:14,d} {:1s} {}'.format(obj.size, obj.storage_class[0], obj.key) 
      print 

     if current_set_total + obj.size <= RATE_LIMIT_BYTES: 
      if not args.pretend or not args.estimate: 
       if obj.Object().restore is not None: 
        objects.pop(i) 
        continue 
      current_set.append(obj) 
      current_set_total += obj.size 
      objects.pop(i) 
      continue 
     i += 1 

    for obj in current_set: 
     print u'{:14,d} {:1s} {}'.format(obj.size, obj.storage_class[0], obj.key) 
     #pprint(obj.Object().restore) 
     if not args.pretend: 
      obj.restore_object(RestoreRequest={'Days': args.restore_days}) 
     #sys.exit(0) 

    print u'{:s} Requested restore of {:d} objects consisting of {:,d} bytes. {:d} objects remaining. {:,d} bytes of hourly restore rate wasted'.format(time.strftime('%Y-%m-%d %H:%M:%S'), len(current_set), current_set_total, len(objects), RATE_LIMIT_BYTES - current_set_total) 
    print 
    if not objects: 
     break 
    if not args.pretend: 
     time.sleep(3690) 

命令來運行腳本:

python restore_glacier_data_to_s3.py --restore-path s3-bucket-name/folder-name/ 
+0

儘管這個鏈接可能回答這個問題,但最好在這裏包含答案的重要部分,並提供供參考的鏈接。如果鏈接頁面更改,則僅鏈接答案可能會失效。 - [來自評論](/ review/low-quality-posts/17930206) – Maraboc