Azure Data Lake Gen2 Storage Account Replication Options for High Availability

Choose the solution with minimum cost.

Question

You are implementing an Azure Data Lake Gen2 storage account.

You need to ensure that data will be accessible for write and read operations both even if an entire data center (zonal or non-zonal) becomes unavailable.

Which kind of replication would you use for the storage account? (Choose the solution with minimum cost)

Answers

Explanations

Click on the arrows to vote for the correct answer

A. B. C. D.

Correct Answer: B

Zone-redundant storage replicates the Azure Storage data in a synchronous manner across 3 Azure availability zones in the primary region.

With Zone-redundant storage, the data remains accessible for write and read operations both even if a zone is not available.

The following table describes the durability and availability by outage scenario:

Outage scenario

Anode within a data center becomes unavailable

‘An entire data center (zonal or non-zonal) becomes unavailable
‘A region-wide outage occurs in the primary region

Read access to the secondary region is available if the primary region
becomes unavailable

LRS

Yes

No

No

No

ZRS

Yes

Yes

No

No

GRS/RA-GRS

Yes

Yes!

Yes (with RA-
GRS)

GZRS/RA-GZRS

Yes

Yes

Yes!

Yes (with RA-
GZRS)

Account failover is needed for restoring the write availability if the primary region becomes unavailable.

Option A is incorrect.

LRS ensures availability only if a node within a data center becomes unavailable.

Option B is correct.

Zone-redundant storage replicates the Azure Storage data in a synchronous manner around 3 Azure availability zones within primary region.

Option C is incorrect.

GRS is not a cost-effective method.

ZRS will be a more suitable option in the given scenario.

Option D is incorrect.

GZRS will also ensure availability but it is not the redundant method with minimum cost.

ZRS will achieve the goal in the given scenario.

To know more about Azure Storage Redundancy, please visit the below-given link:

To ensure that data will be accessible for both write and read operations even if an entire data center becomes unavailable, we need to choose a replication option that provides redundancy across multiple data centers.

The available replication options for Azure Data Lake Gen2 storage account are:

A. Locally-redundant storage (LRS) B. Zone-redundant storage (ZRS) C. Geo-redundant storage (GRS) D. Geo-zone-redundant storage (GZRS)

Locally-redundant storage (LRS) replicates data within a single data center to provide high durability within that data center. This means that if an entire data center becomes unavailable, data stored in that data center will not be accessible, so LRS is not the appropriate option for our requirements.

Zone-redundant storage (ZRS) provides redundancy within a single region, across multiple availability zones. This means that if one availability zone becomes unavailable, data can still be accessed from the remaining availability zones. However, if the entire region becomes unavailable, data stored in that region will not be accessible. Since we need to ensure access to data even if an entire data center (zonal or non-zonal) becomes unavailable, ZRS is also not the appropriate option for our requirements.

Geo-redundant storage (GRS) provides redundancy for data across multiple data centers in different regions. This means that if an entire data center or region becomes unavailable, data can still be accessed from another data center in a different region. However, read and write operations will be slower because data needs to be retrieved from a different region. GRS can be a more expensive option than LRS or ZRS because it involves storing multiple copies of the data in multiple regions.

Geo-zone-redundant storage (GZRS) provides redundancy across multiple regions and availability zones within those regions. This means that if an entire region or availability zone becomes unavailable, data can still be accessed from another region or availability zone. GZRS is a more cost-effective option than GRS because it does not involve storing multiple copies of the data in multiple regions.

Therefore, the appropriate replication option for our requirements is Geo-zone-redundant storage (GZRS).