/Storage Services (S3, EBS, EFS, Glacier)
Concept
Easy

Storage Services (S3, EBS, EFS, Glacier)

7 min read·s3ebsefsglacierstorage-classesinstance-storefsxsnow-familystorage-gatewayclf-c02

AWS offers three main storage paradigms: object storage (S3 — unlimited scale, 11-nines durability), block storage (EBS — persistent volumes attached to EC2 within a single AZ), and file storage (EFS — shared NFS mountable by thousands of Linux instances across multiple AZs). Understanding S3 storage classes, EBS snapshot behavior, and when to use each storage type is essential for CLF-C02.


1. AWS Storage Services Overview

Rendering diagram…

2. Amazon S3 — Object Storage

Core Concepts

  • Buckets — Containers for objects; name must be globally unique across all AWS accounts; created in a specific Region
  • Objects — Files plus metadata stored in S3; maximum object size is 5 TB
  • Durability: 99.999999999% (11 nines) — designed so data is not lost
  • Availability: 99.99% for Standard class — the data is accessible when you need it
  • By default, all S3 buckets and objects are private — access is granted via bucket policies or IAM

S3 Storage Classes

Rendering diagram…
Storage ClassRetrievalMin DurationAZsBest For
StandardMillisecondsNone3Frequently accessed data, websites, analytics
Intelligent-TieringMillisecondsNone3Unknown or changing access patterns
Standard-IAMilliseconds30 days3Backups, disaster recovery, infrequently accessed
One Zone-IAMilliseconds30 days1Non-critical reproducible infrequent data
Glacier InstantMilliseconds90 days3Medical images, news archives needing fast access
Glacier Flexible1 min–12 hrs90 days3Long-term backups, compliance archives
Glacier Deep Archive12 hours180 days3Regulatory data accessed once per year or less

S3 Key Features

FeatureDescription
VersioningKeep multiple versions of each object; protect against accidental deletion or overwrite
Lifecycle PoliciesAutomatically transition objects between storage classes or delete after a period
Cross-Region ReplicationAutomatically copy objects to another Region; requires versioning enabled
Pre-signed URLsTemporary URL granting access to a private object without AWS credentials
Server-Side EncryptionEncrypt data at rest using SSE-S3 (AWS-managed), SSE-KMS, or SSE-C (customer key)
S3 Block Public AccessAccount-level or bucket-level setting to prevent any public access
Object LockWORM protection — write once, read many; prevents deletion for a set period
Static Website HostingHost static HTML, CSS, JavaScript sites directly from an S3 bucket
Transfer AccelerationSpeed up uploads via CloudFront edge locations

3. Amazon EBS — Block Storage

Elastic Block Store provides persistent, network-attached block storage for EC2 instances — like a virtual hard drive.

Rendering diagram…

Key EBS properties:

  • Same AZ required — EBS volume must be in the same AZ as the EC2 instance it attaches to
  • Persists independently — data survives instance stop, start, and reboot (not termination unless configured)
  • One attachment — typically one EC2 instance at a time (except io1/io2 Multi-Attach)
  • Snapshots — point-in-time backups stored in S3; incremental; can copy across Regions

EBS Volume Types:

TypePerformanceUse Case
gp3 / gp2 (General Purpose SSD)Balanced IOPS and throughputBoot volumes, dev/test, most workloads
io1 / io2 (Provisioned IOPS SSD)Highest consistent IOPSHigh-performance databases
st1 (Throughput Optimized HDD)High sequential throughputBig data, log processing, data warehouses
sc1 (Cold HDD)Lowest costInfrequently accessed cold data

4. Instance Store — Ephemeral Storage

  • Block storage that is physically attached to the EC2 host server (not network-attached)
  • Extremely fast — higher throughput and lower latency than EBS because there is no network hop
  • Ephemeral — data is permanently lost when the instance stops, terminates, or fails
  • Cannot be detached or moved to another instance
  • Use case: Temporary buffers, caches, scratch data — never for data that must persist

5. Amazon EFS — Elastic File System

EFS provides a fully managed, elastic NFS (Network File System) for Linux workloads:

Rendering diagram…
FeatureEFSEBS
Storage typeFile (NFS)Block
Multi-instanceThousands simultaneouslyOne at a time
AZ scopeRegional (spans multiple AZs)Single AZ
ScalingElastic — grows and shrinks automaticallyFixed provisioned size
OS supportLinux onlyLinux and Windows
Use casesShared content repos, CMS, home directoriesBoot volumes, databases

6. Amazon FSx — Managed Third-Party File Systems

ServiceProtocolUse Case
FSx for Windows File ServerSMBWindows workloads; Active Directory integration
FSx for LustreLustreHigh-performance computing, ML training, media processing
FSx for NetApp ONTAPNFS, SMB, iSCSIEnterprise file storage with advanced features
FSx for OpenZFSNFSZFS-based file system with snapshots

7. AWS Snow Family — Offline Data Transfer

When migrating large datasets where network transfer is too slow or expensive:

DeviceCapacityForm FactorUse Case
Snowcone8 TB HDD or 14 TB SSDBackpack-portableRemote and edge locations; small migrations
Snowball Edge Storage80 TB usableRugged briefcase applianceLarge datacenter migrations
Snowball Edge Compute40 TB + GPU/EC2Rugged applianceEdge ML processing before shipping data
Snowmobile100 PB45-foot shipping container truckEntire datacenter migration

Rule: If transferring data over your network would take weeks → consider Snow Family.


8. AWS Storage Gateway — Hybrid Storage

Connects on-premises applications to AWS cloud storage:

Gateway TypeProtocolWhat It Does
File GatewayNFS / SMBOn-premises files stored as S3 objects; local cache for low-latency access
Volume GatewayiSCSIOn-premises block volumes backed by S3; cached or stored modes
Tape GatewayiSCSI VTLReplaces physical tape library with S3 and Glacier archives

9. CLF-C02 Quick Reference

TopicKey Fact
S3 durability99.999999999% (11 nines) across 3 AZs
S3 max object size5 TB
S3 bucket namingGlobally unique across all AWS accounts
S3 default accessPrivate — must explicitly grant access
S3 StandardFrequent access; highest cost; millisecond retrieval
S3 Intelligent-TieringAuto-moves tiers; monitoring fee; no retrieval fee
S3 Glacier Deep ArchiveLowest cost; 12-hr retrieval; 180-day minimum
S3 VersioningKeeps all versions; protects against accidental deletion
EBSBlock storage; same AZ as EC2; persists on stop; one instance
EBS snapshotsPoint-in-time; incremental; stored in S3; can copy cross-Region
Instance StoreEphemeral; lost on stop/terminate; very fast; no network hop
EFSManaged NFS; multi-instance mount; regional; Linux only; elastic
EFS vs EBSEFS: multi-instance, regional; EBS: single instance, single AZ
FSx for WindowsSMB; Windows and Active Directory integration
Snowball Edge80 TB physical device for large migrations
Snowmobile100 PB container truck for datacenter migrations
Storage GatewayHybrid bridge from on-premises to S3 and Glacier

Practice Questions10

easy

Q1. What type of storage does Amazon S3 provide?


Select one answer before revealing.

easy

Q2. Which S3 storage class provides the LOWEST cost for data that is accessed at most once or twice per year and can tolerate retrieval times of up to 12 hours?


Select one answer before revealing.

easy

Q3. What is the durability guarantee of Amazon S3?


Select one answer before revealing.

easy

Q4. A company needs block storage that persists independently after an EC2 instance is stopped or terminated and can be re-attached to a different instance. Which AWS service should they use?


Select one answer before revealing.

medium

Q5. What is the key difference between Amazon EBS and EC2 Instance Store?


Select one answer before revealing.

medium

Q6. A company has 200 EC2 instances spread across multiple AZs that all need to read and write to the same shared file system simultaneously. Which storage service supports this?


Select one answer before revealing.

medium

Q7. A company needs to transfer 80 TB of on-premises data to AWS. Their internet connection is 100 Mbps and they calculate the transfer would take approximately 73 days. What is the MOST practical solution?


Select one answer before revealing.

medium

Q8. Which AWS service connects on-premises storage environments to AWS cloud storage, supporting use cases like backup to the cloud, file sharing, and tape replacement?


Select one answer before revealing.

medium

Q9. Which TWO Amazon S3 storage classes are specifically designed for infrequently accessed data but provide millisecond retrieval (unlike Glacier)? (Select TWO — more than one answer may be correct)


Select one answer before revealing.

easy

Q10. Which S3 feature keeps multiple versions of the same object to protect against accidental deletion and overwrites?


Select one answer before revealing.