This looks like it would be perfect for Tarsnap -- the data Tarsnap stores is almost always kept for 30+ days, and it's almost always in objects of 128 kB or more. The $0.01/GB for reads would be annoying (one of the reasons Tarsnap is hosted in EC2 is that it gets free data transfer to and from S3; data is regularly retrieved and stored back after filtering out blocks marked for deletion), but it would still be cheaper.
One thing concerns me however: Standard – IA has an availability SLA of 99%.
If this is just a reduced SLA but the actual availability is likely to be similar, that's fine. But if the actual availability is not expected to hit 99.9% -- say, if the backend implementation is "one copy in online storage, plus a backup in Glacier which gets retrieved if the online copy dies" -- that would be completely inadequate.
Thanks! This is a very important detail which isn't documented anywhere: Retries are likely to succeed. A service where 1% of requests fail but failures are completely uncorrelated is far more usable than a service where 0.01% of requests fail but they keep on failing no matter how many times you retry them.
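To make that concrete, here's a minimal sketch of the "just retry" approach, assuming boto3; the bucket and key names are hypothetical:

```python
import time

import boto3
from botocore.exceptions import ClientError

s3 = boto3.client("s3")

def get_with_retries(bucket, key, attempts=5):
    """Fetch an S3 object, retrying with exponential backoff.

    If failures are uncorrelated, each attempt is an independent coin flip,
    so even a 1% per-request failure rate almost never survives five
    attempts. If failures are correlated, no amount of retrying helps.
    """
    for attempt in range(attempts):
        try:
            return s3.get_object(Bucket=bucket, Key=key)["Body"].read()
        except ClientError:
            if attempt == attempts - 1:
                raise
            time.sleep(2 ** attempt)  # back off: 1s, 2s, 4s, ...

# Hypothetical usage:
# block = get_with_retries("tarsnap-blocks", "blocks/ab/cd/abcd1234")
```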
Additionally, assuming your block data is hash-addressed, i.e. the S3 objects never change once they are in S3, adding CloudFront in front of your buckets may go a long way toward increasing that percentage.
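A rough sketch of that, assuming boto3 (the bucket name and key scheme are made up): because a hash-addressed object never changes after it's written, you can mark it cacheable essentially forever, so a CloudFront distribution in front of the bucket rarely needs to hit S3 at all.

```python
import hashlib

import boto3

s3 = boto3.client("s3")

def put_immutable_block(bucket, data):
    """Store a content-addressed block with a very long cache lifetime.

    The key is derived from the content hash, so the object can never
    change once written and a CDN can safely cache it for a year.
    """
    key = "blocks/" + hashlib.sha256(data).hexdigest()
    s3.put_object(
        Bucket=bucket,
        Key=key,
        Body=data,
        CacheControl="public, max-age=31536000, immutable",
    )
    return key
```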
I had to read these sentences a few times to understand what you were trying to say: "You now have the choice of three S3 storage classes (Standard, Standard – IA, and Glacier) that are designed to offer 99.999999999% (eleven nines) of durability. Standard – IA has an availability SLA of 99%."
No availability is mentioned for the others, but I assume it's 100%? Perhaps a simple table would help readers scan and visually compare the two properties across the three storage classes?
When choosing between Standard and this, it would be helpful to understand the pros and cons. With the current description (below), it reads as if the difference is only in pricing, but I assume there is a technical difference as well.
Also, the availability number could be explained better -- why is it different?
> Standard – IA offers the high durability, throughput, and low latency of Amazon S3 Standard, with a low per GB storage price and per GB retrieval fee.
Based on the post, it seems availability drops but durability remains the same. You might need to retry to get an object, and you'll be successful eventually.
Right. But it matters how much availability drops, and also what the correlation is between failures -- if they're completely uncorrelated but there's a 1% failure rate, you just retry, but if 1% of objects are going to be unavailable for the next four hours, that's a problem.
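Back-of-the-envelope, assuming a 1% per-request failure rate:

```python
# Chance that every attempt fails, if failures are fully independent.
p_fail = 0.01
for attempts in (1, 2, 3, 4):
    print(attempts, "attempts, all fail:", p_fail ** attempts)
# Roughly 1e-2, 1e-4, 1e-6, 1e-8 -- retries win very quickly.
# If failures are correlated (the object is simply down for four hours),
# the probabilities don't multiply and retrying buys you nothing.
```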
> it’s vital that customers have immediate, instant access to any of [our photos] at a moment’s notice – even if they haven’t been viewed in years. [IA] offers the same high durability and performance ... so we can continue to deliver the same amazing experience for our customers.
The way this was phrased implies that this customer's use-case had a hard requirement that all of their data be in "online" storage at all times, and their satisfaction implies that IA does, in fact, hit this requirement.
S3 Classic: Your file gets replicated on 3 live HDs (and/or HDs backed by RAID arrays -- not sure about the internal S3 storage topology).
S3 Infrequent: Your file gets stored on 1 live HD (or a single hardware-redundancy component) plus a copy in Glacier. If the live HD dies, your file is automatically restored from Glacier onto a new HD (but your data may be inaccessible during that automatic re-deploy).
Glacier: offline Blu-ray discs combined with error correction, accessed by robot arms and temporarily restored to live HDs on demand.
Whichever they use, most disks are likely unpowered most of the time -- like Facebook's equivalent cold-storage setup, where only 1 out of 12 disks can be powered at any time.
I understand that 99% doesn't mean that 1% of the objects will always be inaccessible. Instead, I guess what they mean is that they allow themselves up to roughly 88 hours a year of downtime for any bucket.
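Rough arithmetic behind that reading, treating the SLA as a yearly downtime budget:

```python
# Downtime allowed per year if availability is read as "fraction of
# time the service may be down".
hours_per_year = 24 * 365  # 8760
for availability in (0.99, 0.999, 0.9999):
    allowed = hours_per_year * (1 - availability)
    print(f"{availability:.2%} -> up to {allowed:.1f} hours/year of downtime")
# 99% -> ~87.6 h, 99.9% -> ~8.8 h, 99.99% -> ~0.9 h
```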
> One thing concerns me however: Standard – IA has an availability SLA of 99%.
> If this is just a reduced SLA but the actual availability is likely to be similar, that's fine. But if the actual availability is not expected to hit 99.9% -- say, if the backend implementation is "one copy in online storage, plus a backup in Glacier which gets retrieved if the online copy dies" -- that would be completely inadequate.
Hopefully we'll get more details over time.