Storage at Microsoft

Cluster size recommendations for ReFS and NTFS

Microsoft

Apr 10, 2019

First published on TECHNET on Jan 13, 2017
Microsoft’s file systems organize storage devices based on cluster size. Also known as the allocation unit size, cluster size represents the smallest amount of disk space that can be allocated to hold a file. Because ReFS and NTFS don’t reference files at a byte granularity, the cluster size is the smallest unit of size that each file system can reference when accessing storage. Both ReFS and NTFS support multiple cluster sizes, as different sized clusters can offer different performance benefits, depending on the deployment.
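
As a rough illustration of what "smallest amount of disk space that can be allocated" means in practice, the minimal Python sketch below rounds a file's size up to whole clusters and reports the unused remainder. The function names are hypothetical, and the model is simplified: it ignores cases where a file system keeps very small files inside its own metadata.

```python
KB = 1024

def allocated_bytes(file_size: int, cluster_size: int) -> int:
    """Disk space reserved for a file when space is handed out in whole clusters."""
    clusters = -(-file_size // cluster_size)  # integer ceiling division
    return clusters * cluster_size

def slack_bytes(file_size: int, cluster_size: int) -> int:
    """Allocated space that holds no file data."""
    return allocated_bytes(file_size, cluster_size) - file_size

if __name__ == "__main__":
    file_size = 10 * KB  # a hypothetical 10K file
    for cluster_size in (4 * KB, 64 * KB):
        print(
            f"{cluster_size // KB}K clusters: "
            f"{allocated_bytes(file_size, cluster_size) // KB}K allocated, "
            f"{slack_bytes(file_size, cluster_size) // KB}K unused"
        )
```

Under these assumptions, a 10K file occupies 12K on a volume with 4K clusters and 64K on a volume with 64K clusters.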

In the past couple of weeks, we’ve seen some confusion regarding the recommended cluster sizes for ReFS and NTFS. This post aims to clear up previous recommendations and explain why certain cluster sizes are recommended for certain scenarios.

IO amplification

Before jumping into cluster size recommendations, it’s important to understand what IO amplification is and why minimizing it matters when choosing a cluster size:

IO amplification refers to the broad set of circumstances where one IO operation triggers other, unintentional IO operations. Though it may appear that only one IO operation occurred, in reality, the file system had to perform multiple IO operations to successfully service the initial IO. This phenomenon can be especially costly when considering the various optimizations that the file system can no longer make:
- When performing a write, the file system could perform this write in memory and flush this write to physical storage when appropriate. This helps dramatically accelerate write operations by avoiding accessing slow, non-volatile media before completing every write.
- Certain writes, however, could force the file system to perform additional IO operations, such as reading in data that is already written to a storage device. Reading data from a storage device significantly delays the completion of the original write, as the file system must wait until the appropriate data is retrieved from storage before making the write.

ReFS cluster sizes

ReFS offers both 4K and 64K clusters. 4K is the default cluster size for ReFS, and we recommend using 4K cluster sizes for most ReFS deployments because it helps reduce costly IO amplification:

In general, if the cluster size exceeds the size of the IO, certain workflows can trigger unintended IOs to occur. Consider the following scenarios where a ReFS volume is formatted with 64K clusters:
- Consider a tiered volume. If a 4K write is made to a range currently in the capacity tier, ReFS must read the entire cluster from the capacity tier into the performance tier before making the write. Because the cluster size is the smallest granularity that the file system can use, ReFS must read the entire cluster, which includes an unmodified 60K region, to be able to complete the 4K write.
- If a cluster is shared by multiple regions after a block cloning operation occurs, ReFS must copy the entire cluster to maintain isolation between the two regions. So if a 4K write is made to this shared cluster, ReFS must copy the unmodified 60K of the cluster before making the write.
- Consider a deployment that enables integrity streams. A sub-cluster granularity write will cause the entire cluster to be re-allocated and re-written, and the new checksum must be computed. This represents additional IO that ReFS must perform before completing the new write, which adds latency to the IO operation.

Choosing 4K clusters instead of 64K clusters reduces the number of IOs that are smaller than the cluster size, so costly IO amplification occurs less often.
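
The arithmetic behind these scenarios can be sketched in a few lines of Python. This is a simplified model, not a description of ReFS internals: it assumes that every cluster touched by a write must be handled in full, and the function names are hypothetical.

```python
KB = 1024

def clusters_touched(offset: int, length: int, cluster_size: int) -> int:
    """Number of clusters that a write of `length` bytes at `offset` overlaps."""
    first = offset // cluster_size
    last = (offset + length - 1) // cluster_size
    return last - first + 1

def unmodified_bytes(offset: int, length: int, cluster_size: int) -> int:
    """Bytes in the touched clusters that the write itself does not change."""
    return clusters_touched(offset, length, cluster_size) * cluster_size - length

if __name__ == "__main__":
    write = 4 * KB
    # Aligned 4K write on a 64K-cluster volume: 60K of the cluster is untouched
    # by the write but still has to be read or copied in this model.
    print(unmodified_bytes(0, write, 64 * KB))  # 61440
    # The same write on a 4K-cluster volume covers the cluster exactly.
    print(unmodified_bytes(0, write, 4 * KB))   # 0
```

In this model a 4K write on 64K clusters handles 16 times as much data as was requested, while the same write on 4K clusters handles only what was requested.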

Additionally, 4K cluster sizes offer greater compatibility with Hyper-V IO granularity, so we strongly recommend using 4K cluster sizes with Hyper-V on ReFS. 64K clusters are applicable when working with large, sequential IO, but otherwise, 4K should be the default cluster size.

NTFS cluster sizes

NTFS offers cluster sizes from 512 to 64K, but in general, we recommend a 4K cluster size on NTFS, as 4K clusters help minimize wasted space when storing small files. We also strongly discourage the usage of cluster sizes smaller than 4K. There are two cases, however, where 64K clusters could be appropriate:

4K clusters limit the maximum volume and file size to 16TB
- 64K cluster sizes can offer increased volume and file capacity, which is relevant if you’re hosting a large deployment on your NTFS volume, such as hosting VHDs or a SQL deployment.
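
The 16TB figure follows from simple multiplication. The sketch below assumes a limit of 2^32 - 1 clusters per volume, a number taken from the reader discussion at the end of this post rather than from the article body; the constant and function names are hypothetical.

```python
KB = 1024
TIB = 2**40

# Assumption: the current NTFS implementation addresses at most 2^32 - 1 clusters.
MAX_CLUSTERS = 2**32 - 1

def max_volume_bytes(cluster_size: int) -> int:
    """Largest volume (or file) size reachable with the assumed cluster limit."""
    return MAX_CLUSTERS * cluster_size

def max_volume_tib(cluster_size: int) -> float:
    """Same limit expressed in binary terabytes (TiB)."""
    return max_volume_bytes(cluster_size) / TIB

if __name__ == "__main__":
    for cluster_size in (4 * KB, 64 * KB):
        print(f"{cluster_size // KB}K clusters: about {max_volume_tib(cluster_size):.0f} TB")
```

With these assumptions, 4K clusters top out at roughly 16TB and 64K clusters at roughly 256TB.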

NTFS has a fragmentation limit, and larger cluster sizes can help reduce the likelihood of reaching this limit
- Because NTFS is backward compatible, it must use internal structures that weren’t optimized for modern storage demands. Thus, the metadata in NTFS prevents any file from having more than ~1.5 million extents.
  - One can, however, use the “format /L” option to increase the fragmentation limit to ~6 million. Read more here.
- 64K cluster deployments are less susceptible to this fragmentation limit, so 64K clusters are a better option if the NTFS fragmentation limit is an issue. (Data deduplication, sparse files, and SQL deployments can cause a high degree of fragmentation.)
  - Unfortunately, NTFS compression is not supported with clusters larger than 4K, so 64K clusters aren’t suitable when using NTFS compression. Consider increasing the fragmentation limit instead, as described in the previous bullets.
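
To see why larger clusters make the fragmentation limit harder to reach, consider a deliberately pessimistic model in which every extent of a file is a single cluster. The Python sketch below uses the approximate limits quoted above (~1.5 million extents, ~6 million with “format /L”); the function name and the worst-case assumption are illustrative, and real files are rarely this fragmented.

```python
KB = 1024
GIB = 2**30

DEFAULT_EXTENT_LIMIT = 1_500_000  # approximate figure from the text above
LARGE_EXTENT_LIMIT = 6_000_000    # approximate figure with "format /L"

def worst_case_bytes(extent_limit: int, cluster_size: int) -> int:
    """File size at which the extent limit is hit if each extent is one cluster."""
    return extent_limit * cluster_size

if __name__ == "__main__":
    for limit in (DEFAULT_EXTENT_LIMIT, LARGE_EXTENT_LIMIT):
        for cluster_size in (4 * KB, 64 * KB):
            size_gib = worst_case_bytes(limit, cluster_size) / GIB
            print(f"{limit:>9,} extents, {cluster_size // KB:>2}K clusters: "
                  f"{size_gib:.1f} GiB")
```

In this model, moving from 4K to 64K clusters lets a fully fragmented file grow 16 times larger before hitting the same extent limit.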

While a 4K cluster size is the default setting for NTFS, there are many scenarios where 64K cluster sizes make sense, such as Hyper-V, SQL, deduplication, or volumes where most of the files are large.

Updated Apr 10, 2019


dwalling
Copper Contributor
Jul 18, 2022
Does using the 4k cluster size on ReFS limit the maximum volume size like it does on NTFS? The wording of this article implies that the max ReFS volume size is the same whether 4k or 64k clusters are used. It would be helpful if this article confirmed it explicitly.
Steve_Reczek
Copper Contributor
Feb 25, 2023
4K clusters limit the maximum volume and file size to be 16TB
This is because the maximum number of clusters that NTFS supports is (2^32 – 1). Thus a 64 KB AUS allows a largest volume (or file) size of 256 TB.

Since Windows Server 2019 and Windows 10, version 1709, a 2048 KB AUS is supported. Whereas 64 KB was the earlier max cluster size for older OS versions, a 2048 KB AUS now allows a largest volume (or file) size of 8 PB (8 * 2^50 Bytes).

For more information, see Support for large volumes.

Clarifying Terminology:
A cluster (MS-specific terminology) consists of one or more consecutive physical sectors and represents the smallest allocatable unit, the Allocation Unit Size (AUS). The file system considers the AUS the smallest addressable unit. Thus, file sizes on disk are rounded up to a multiple of the AUS, even though I/O is still rounded up to a multiple of the physical sector size.
A physical sector is the smallest physical storage unit on the disk, i.e. the minimum quanta of data that the HDD can read or write. The hard drive considers the physical sector the smallest addressable unit.
A physical sector is always equal to or larger than the cluster size.

If someone wants to clarify the relationship between "cluster" and "Logical Sector Size," I'd appreciate it. It seems to me that a logical sector is the more OS-independent term for what Microsoft refers to as a cluster, but I cannot be sure.
Игорь Петрович Лейко
Copper Contributor
Mar 02, 2024
The cluster size of the host drive has no effect on VHDX operation. Note that the cluster is used only for disk space allocation and is not involved in I/O operations.
MFT zone reservation is made by the ntfs.sys driver, not by NTFS itself. Since Vista and WS2008, the maximum reserved zone is 800 MB.

As far as I know, there are no special recommendations for very large drives with a huge number of files.
Игорь Петрович Лейко
Copper Contributor
Mar 31, 2020
On modern 4KN drives, NTFS uses large file records by default, so the /L option is not needed if one has a 4KN drive.
Игорь Петрович Лейко
Copper Contributor
Feb 16, 2024
NTFS itself supports 2^64-1 clusters. It is the current implementation of the ntfs.sys driver that supports only 2^32-1 clusters.

A physical sector may be larger or smaller than the cluster size, but the logical sector is equal to or smaller than the cluster size. E.g., if one has an Advanced Format 512e drive, the physical sector is 4 KB but the logical one is 512 B.
vovannovig
Copper Contributor
Mar 02, 2024
Please advise.
Based on the article, is it recommended to format the volume where the VHDX file is stored (size 16-18TB, a single disk without RAID) with a cluster size of 64K?
But if millions of small files are stored inside the virtual hard disk, then the cluster size there should be 4K.
Will this cause wasted I/O?

I have more than 50 million files stored on my virtual disk with a disk size of 16-18TB.
The file system is very heavily fragmented, the MFT is very large.
What are the recommendations for this application model?
Is it possible to immediately reserve 200GB at the beginning of the disk for MFT so that it does not become fragmented?