AWS Block, File, and Hybrid Storage

AWS-STORAGE-HYBRID

EBS, EFS, FSx, Storage Gateway, and Snow Family — how AWS handles block, file, and hybrid on-premises storage scenarios.

awsebsefsfsxstorage-gatewaysnow-familyhybrid-storage

Overview

AWS offers three distinct storage paradigms — block, file, and object — each optimised for different workload characteristics. Where object storage (S3) is addressed separately, this article covers EBS (block), EFS and FSx (file), and the services that bridge AWS storage into on-premises environments: Storage Gateway and the Snow Family.


EBS — Elastic Block Store

EBS provides persistent block storage volumes for EC2 instances. A block volume behaves like a physical disk attached to a server: the operating system formats it with a filesystem (ext4, NTFS, XFS) and treats it as a device. EBS volumes persist independently of the EC2 instance lifecycle — stopping or terminating an instance does not delete its EBS volume unless the “Delete on Termination” flag is set.

Key characteristics:

Volume Types

TypeCategoryMax IOPSMax ThroughputBest For
gp3General Purpose SSD16,0001,000 MB/sDefault choice; IOPS/throughput set independently
gp2General Purpose SSD (legacy)16,000250 MB/sLegacy; IOPS tied to volume size (3 IOPS/GB)
io2Provisioned IOPS SSD64,0001,000 MB/sLatency-sensitive OLTP, databases
io2 Block ExpressProvisioned IOPS SSD256,0004,000 MB/sSAP HANA, mission-critical databases
st1Throughput HDD500500 MB/sSequential large I/O: log streaming, Kafka, data warehouse ingest
sc1Cold HDD250250 MB/sInfrequently accessed archives, lowest cost HDD

gp3 is now the default and recommended general-purpose volume. Unlike gp2, gp3 allows IOPS and throughput to be configured independently — you can provision 10,000 IOPS on a 100 GB volume without paying for a larger volume.

io2 Multi-Attach allows a single io2 volume to be attached to up to 16 EC2 instances simultaneously within the same AZ. The application must manage concurrent writes — typically a clustered database engine or distributed file system that handles its own locking.

EBS Snapshots

Snapshots are point-in-time backups stored in S3 (managed by AWS; you never see the underlying bucket). They are incremental: only changed blocks since the last snapshot are stored. Despite incremental storage, any snapshot can be used independently to restore a full volume.

Snapshots can be:

EBS Encryption

EBS encryption uses AES-256 with a KMS key. When enabled:

Encryption is set at volume creation and cannot be changed after creation. To encrypt an existing unencrypted volume: take a snapshot → copy the snapshot with encryption enabled → restore the encrypted snapshot copy as a new volume.


EFS — Elastic File System

EFS is a managed NFS v4.1/v4.2 file system. It is serverless, fully elastic (capacity automatically grows and shrinks), and designed for Linux workloads that need shared file storage across multiple EC2 instances, containers, or Lambda functions simultaneously.

Unlike EBS, EFS is multi-AZ by default: mount targets are created in each AZ’s subnet, and data is stored redundantly across multiple AZs.

Performance Modes

Throughput Modes

Storage Classes and Lifecycle Management

Lifecycle management moves files to IA after a configurable period of inactivity (7, 14, 30, 60, or 90 days). Files are moved back to Standard/One Zone on access.

EFS Access Points

Access points are application-specific entry points into an EFS file system. Each access point can enforce:

Lambda functions benefit particularly from access points: each function can have an access point that enforces a specific home directory and user identity, even when multiple functions share the same EFS file system.


FSx — Managed High-Performance File Systems

FSx is AWS’s family of managed file systems for specialised use cases where NFS (EFS) or SMB (basic S3 file gateway) are insufficient.

FSx for Windows File Server

Fully managed Windows-native SMB file shares backed by Windows Server. Features:

Use FSx for Windows when migrating Windows workloads that depend on SMB, Windows ACLs, or DFS — scenarios where EFS (NFS, Linux permissions) is not compatible.

FSx for Lustre

A high-performance parallel file system designed for compute-intensive workloads. Lustre disaggregates metadata and data operations, allowing clients to read/write directly to storage nodes in parallel without a single bottleneck.

Capabilities:

Use FSx for Lustre for HPC, ML training (feeding GPUs from a shared high-throughput store), video processing, and genomics workflows.

FSx for NetApp ONTAP

A managed version of NetApp’s ONTAP storage operating system. Provides:

Designed for lifting and shifting NetApp on-premises environments to AWS without re-platforming applications.

FSx for OpenZFS

Managed ZFS (Zettabyte File System) with NFS access. Provides ZFS snapshots (instantaneous, space-efficient), writable clones from snapshots, compression (LZ4, ZSTD), and data integrity checking (checksumming). Access via NFS from Linux and macOS clients. Suited for migrating on-premises OpenZFS or Oracle Solaris ZFS workloads.


AWS Storage Gateway

Storage Gateway is a hybrid storage service: a virtual appliance (VMware ESXi, Hyper-V, KVM, or a physical AWS hardware appliance) deployed on-premises that presents a local storage interface to applications while storing data durably in AWS.

Storage Gateway solves the “we have on-premises workloads that can’t move to the cloud, but we want cloud economics and durability for our data” problem.

S3 File Gateway

Presents NFS and SMB mount points to on-premises clients. Files written to the mount point are stored as native S3 objects in your specified bucket. The gateway maintains a local cache of recently accessed files, so reads of recent data are served locally. S3 objects are directly accessible from AWS services — applications can write through the gateway and then have AWS services (Glue, Athena, Lambda) process the data directly from S3.

Active Directory authentication is supported for SMB shares.

FSx File Gateway

Provides a local cache for FSx for Windows File Server. On-premises SMB clients mount the gateway, which maintains a cache of frequently accessed data locally and routes all traffic to FSx. Reduces WAN latency for FSx access. Supports DFS namespaces.

Volume Gateway — Cached Mode

Presents iSCSI block devices (volumes) to on-premises servers. The primary copy of data lives in S3 (as EBS snapshots). Frequently accessed data is cached on-premises for low-latency access. Minimises on-premises storage hardware while keeping hot data locally accessible. Snapshots can be mounted in AWS as EBS volumes for restore or test/dev.

Volume Gateway — Stored Mode

The full dataset lives on-premises (the gateway presents locally-stored iSCSI volumes). Data is asynchronously backed up to S3 as EBS snapshots. Used for disaster recovery: on-premises has full performance access, AWS holds the DR copy. In a failure, mount the latest snapshot as an EBS volume on EC2.

Tape Gateway

Presents a Virtual Tape Library (VTL) via iSCSI to backup software (Veeam, Veritas NetBackup, Commvault, Arcserve). Virtual tapes write to S3 (active tapes). Archived tapes move to S3 Glacier or Glacier Deep Archive, appearing as an offline tape vault in the backup software. Replaces physical tape libraries with no application changes.

On-Prem NFS Clients
S3 File Gateway
NFS write (file share mount)
Application writes as if to local NFS
Cache recent files locally
Reads served from local cache if warm
Upload as S3 object (async)
Native S3 objects, AWS services can read directly
Evaluate transition rule
After configured inactivity period
Transition to Glacier storage class
Cost reduction for aged archive data

Snow Family

The Snow Family addresses scenarios where transferring data over the internet is impractical: the dataset is too large, the network is too slow, bandwidth is too expensive, or data residency requirements prevent cloud transit. AWS ships a physical encrypted device; you copy data locally, ship it back, and AWS imports it into S3.

All Snow devices use AWS KMS for encryption (AES-256). The encryption key is never stored on the device. Data is inaccessible without the KMS key even if the device is lost or stolen.

Snowcone

The smallest Snow device. Available in HDD (8 TB usable) and SSD (14 TB usable) variants. Ruggedized and battery-powered. Designed for edge computing and disconnected environments where carrying a larger device is impractical.

Snowcone runs EC2-compatible edge compute (via AWS IoT Greengrass or EC2 API). It also runs a DataSync agent — data can be transferred back to AWS via DataSync over the network if connectivity is available, or by shipping the device.

Snowball Edge Storage Optimized

80 TB of usable storage (128 TB raw). Designed for large-scale data migration. Includes limited compute capability (24 vCPU, 32 GB RAM) for pre-processing data locally before import. Cluster mode: 5–10 devices act as a distributed storage cluster for very large migrations.

Snowball Edge Compute Optimized

28 TB of usable storage (with optional 7.68 TB of NVMe SSD) combined with 52 vCPU, 208 GB RAM, and an optional NVIDIA Tesla V100 GPU. Designed for edge machine learning inference, local video analysis, and compute-heavy preprocessing at sites without AWS connectivity. Storage is secondary to compute.

Snowmobile

A 45-foot shipping container on a truck, capable of transferring up to 100 PB of data. Used for exabyte-scale data centre relocations where even multiple Snowball Edge clusters would take too long. AWS drives the truck to your data centre, you connect a fibre cable, copy data at up to 10 Gbps, and AWS drives it back.

Snow Device Workflow

  1. Order device through AWS Console
  2. AWS ships device (encrypted, locked to your account’s KMS key)
  3. Connect to your network, copy data using the Snowball client or S3-compatible API
  4. Ship device back using the pre-paid shipping label
  5. AWS receives device, imports data into your S3 bucket, and destroys the device securely
  6. AWS sends an import job completion notification with data transfer audit logs

Choosing the Right Storage Service

NeedService
Block storage for EC2 (boot, databases)EBS
Shared NFS for Linux workloadsEFS
Windows SMB with AD integrationFSx for Windows File Server
HPC / ML / video parallel I/OFSx for Lustre
Lift-and-shift NetApp ONTAPFSx for NetApp ONTAP
Object storage, unlimited scaleS3
Hybrid: cloud backup of on-prem file sharesS3 File Gateway
Hybrid: iSCSI block volumes with S3 backupVolume Gateway
Replace physical tape libraryTape Gateway
Offline bulk data migration (under 100 TB)Snowball Edge
Edge compute in disconnected locationsSnowcone / Snowball Compute Optimized
Exabyte-scale data centre migrationSnowmobile

References