AWS Simple Storage Service
- Durability: This defines the percentage which tells what is chance of file not getting corrupted or lost
- Availability: This defines the percentage which tells what is chance of file being available for access
-
Amazon’s response to How durable is s3
- Amazon S3 Standard, S3 Standard–IA, S3 Intelligent-Tiering, S3 One Zone-IA, S3 Glacier Instant Retrieval, S3 Glacier Flexible Retrieval, and S3 Glacier Deep Archive are all designed to provide 99.999999999% (11 9’s) of data durability of objects over a given year.
- This durability level corresponds to an average annual expected loss of 0.000000001% of objects.
- For example, if you store 10,000,000 objects with Amazon S3, you can on average expect to incur a loss of a single object once every 10,000 years.
- S3 on Outposts is designed to durably and redundantly store data across multiple devices and servers on your Outpost.
- In addition, Amazon S3 Standard, S3 Standard-IA, S3 Glacier Instant Retrieval, S3 Glacier Flexible Retrieval, and S3 Glacier Deep Archive are all designed to sustain data in the event of an entire S3 Availability Zone loss.
- As with any environment, the best practice is to have a backup and to put in place safeguards against malicious or accidental deletion.
- For S3 data, that best practice includes secure access permissions, Cross-Region Replication, versioning, and a functioning, regularly tested backup.
- Refer Here for storage classes
- Versioning:
- Objects in s3 buckets can be versioned, so that the history of changes are retained.
- Versioning can be enabled in AWS s3 but not disabled we can only suspend versioning
- Static Website: A Website developed in html, css, javascript can be directly hosted on s3 buckets as s3 is http aware.
- Cross-Region Replication: Any object uploaded to s3 bucket can be backed up in other region. This is an asynchronous operation.
- Access Control:
- Anonymous (Public-Read access) => ACL
- Access to specific aws users => S3 bucket policy
- The policies are written in JSON
- AWS S3 from Command line
- aws s3
- aws s3api
- Glacier: Started as a storage class and now almost considered as a different storage service.
- AWS Athena: S3 is used as a Datastore in Big data / data engineering/ML, Athena can query the s3 objects Refer Here
- All the AWS logs are internally stored in S3 and when we enable custom loggin s3 buckets get created.
- AWS audit information from services like cloud trail gets stored in s3.
- AWS has lifecycle management to automatically move data from storage class to other.