In Linux filesystems, the maximum number of inodes varies depending on the filesystem type and its configuration. Here's a breakdown for Ext4, XFS, and Btrfs:
Ext4
- Maximum Inodes: Ext4 supports a maximum of 4 billion inodes (2^32) due to its use of a 32-bit inode number.
- Configuration: The number of inodes can be specified at the time of filesystem creation, and the default is usually set to 1 inode per 16 KB of disk space.
XFS
- Maximum Inodes: XFS does not have a fixed maximum number of inodes like Ext4. Instead, it allows for a much larger number of objects since it uses a 64-bit inode number. The practical limit on the number of inodes is determined by the available disk space and the filesystem's configuration.
- Configuration: XFS dynamically allocates inodes as needed, so the limit is generally very high, theoretically allowing for billions of inodes as long as there is sufficient storage.
Btrfs
- Maximum Inodes: Btrfs also uses a 64-bit inode number, allowing it to support a very large number of inodes, theoretically in the billions, similar to XFS.
- Configuration: Btrfs dynamically manages inodes and can allocate them based on usage patterns, meaning the practical limit can be very high, akin to XFS.
Summary
- Ext4: Up to 4 billion inodes (32-bit limit).
- XFS: Very high limit, practically determined by disk space (64-bit inode number).
- Btrfs: Very high limit, practically determined by disk space (64-bit inode number).
In summary, both XFS and Btrfs can support a much larger number of inodes compared to the fixed limit of Ext4, making them more scalable for environments with a massive number of files.
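If you want to see or adjust these numbers on ext4 in practice, the knobs are on mkfs.ext4 and df. A minimal sketch, with a placeholder device and mount point:
# mkfs.ext4 -i 8192 /dev/sdX1
# mkfs.ext4 -N 20000000 /dev/sdX1
# df -i /srv/files
The -i option sets the bytes-per-inode ratio (8 KB here instead of the usual 16 KB default, roughly doubling the inode count), -N sets an explicit inode count instead, and df -i shows inodes used and free on a mounted filesystem. XFS and Btrfs need no such tuning because they allocate inodes dynamically.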
ext2/3/4 hold inode numbers in a 32-bit on-disk structure, so yes, the maximum possible number of inodes on an ext2/3/4 filesystem is 2^32, or about 4 billion.
Most modern filesystems like XFS use 64-bit structures for this sort of thing, so XFS can in theory go to 2^64 inodes. But with a minimum inode size of 256 bytes, that would mean 4096 Exabytes of disk space for the inodes alone. In reality, theoretical/design limits such as this are not usually attainable (or advisable) in practice.
Every file system has its own limits. A better question is why you would want that many files on a single file system in the first place. If you do, the files are most probably small, so there is a lot of overhead in storage and processing. I'd consider using a database for such a case.
BTRFS has 2^64 (64-bit) inode numbers. Actually reaching that many files, even on a server, is practically impossible. If you did, you'd be the first person ever to break past the 18,446,744,073,709,551,616-inode mark. Good luck!
ZFS and Btrfs are copy-on-write file systems. Ext4 is an in-place file system.
Both ZFS and Btrfs offer some very advanced file system features on Linux. They have features for creating snapshots, volume management and checksums for data. One essential difference between ZFS and Btrfs is the way they treat data: ZFS maintains data in the form of blocks, while Btrfs maintains data in the form of "data extents". Both file systems maintain a tree structure for their internal metadata. One important advantage of ZFS is its support for RAID-Z for data reliability, which is different from the standard RAID 0/1/10/4/5/6 levels that other file systems provide. Further, for data recovery, it supports scrubbing (zpool scrub), which runs periodically to detect and repair damaged data. A serious flaw with ZFS is that it has no offline file system integrity checker of the kind available in other standard file systems like ext4, Btrfs, XFS and F2FS. Btrfs provides online correction of data and metadata, i.e. if some corrupt data is found while accessing a data or metadata block, btrfs scrub corrects the data on the fly. Btrfs also comes with an integrity checker, btrfsck, which ZFS lacks.
The ext4 file system is the product of two major revisions of the early Linux ext2 file system. Ext4 is built to handle large files with less frequent metadata updates than its predecessor, ext3. In ext3, each indirect block contains pointers to individual data blocks, so a file containing 1000 data blocks needs 1000 pointer entries in its indirect blocks (I am disregarding direct block pointers here). This leads to excessive file system metadata in ext3, which is inefficient. Instead, ext4 maintains a tree of "extents", where it records only ranges of physical blocks: for a 1000-block file, ext4 would store, say, 3 entries covering the physical block ranges 50000-50600, 90100-90400 and 11100-11200 where the file's 1000 data blocks live. This way, only 3 entries need to be maintained in the file system metadata instead of the 1000 entries ext3 would need. Real-world workloads exhibit spatio-temporal locality, so it is highly likely that consecutive blocks are read, written or updated together; an extent-based design therefore works to ext4's advantage. A serious flaw with ext4 is that it does not provide data checksums (which are present in Btrfs and ZFS). This is disastrous for critical data storage: corruption of data is neither detected nor corrected, because checksums are not maintained. For metadata corruption detection and correction, ext4 has a file system checker.
The XFS file system is excellent for large files and claims to have great parallel I/O capability. One drawback of XFS is internal fragmentation: with large block sizes, small files and files that are not block-aligned leave a large amount of space unused at the end of their last blocks. A fix for non-block-aligned files is tail packing, where the last blocks of multiple files are stored together in one block, which reduces internal fragmentation. However, this file system is still not recommended for workloads consisting of small files or for workloads with a bimodal size distribution (a mix of small and large files).
You are correct that the size of metadata in filesystems dictates the various limits imposed on the filesystem. Specifically, the size of the inode number, which is used to uniquely reference inodes, places a hard limit on the number of files on the filesystem.
In most Linux filesystems—including ext3 and ext4—the inode number is represented by a 32-bit unsigned integer. Thus the maximum inode number is 2^32 - 1. Assuming zero is a valid inode number, that gives a total of 2^32 files, which is just over 4 billion.
Incidentally, much of the metadata in ext4 is 48 or even 64 bits wide. For example, the maximum file size is 16 TB and the maximum filesystem size is a whopping 1 EB. The fact that the ext4 developers chose not to widen the inode number beyond 32 bits suggests the 4 billion file limit isn't a bottleneck for most users.
I was kind of forced to move to btrfs, so let me explain how I learned about it and how it helps me deal with our application's issues.
At that time we used only New Relic to monitor our available disk space. One day our application stopped accepting new document uploads, and the errors told us there was no space left on the device. I checked with df and the disk was only about 50% full, or at least that's what it looked like. But trying to create even a new empty file gave me the same error. After a bit of research I learned that there's a limit on the number of inodes available on a given ext2/ext3/ext4 partition, and indeed that limit had been reached in our case. My first attempt was to add a new disk and tune the new ext4 partition to allow more inodes, since we had tons of small files in that partition. We also added a monitoring check of our own for free inodes, and we noticed that my attempts to increase the number of inodes didn't work out that well, so I started looking for alternatives to ext4. That's how I learned about btrfs.
So that was the first issue we fixed by switching to btrfs: there's no inode limit with btrfs, so this part was solved. Another problem we had at that time was that backing up those documents hourly used to take very long, because we had to copy too many files on each hourly backup. So we moved our offsite backup server to btrfs as well, and now we can just take a snapshot of the previous backup and only sync the changes. Taking a snapshot is instantaneous.
We also take advantage of snapshots to set up other environments for our application such as staging, cert, experimental and so on. We create those environments based on a recent snapshot from the production database. Documents don’t change often, so we end up creating a snapshot of the production’s documents subvolume when we want to restore a recent dump to one of those environments. That makes it not only really fast to replicate the production environment to another one but it also saves us a lot of space that would be used otherwise if we had to copy all those documents.
Even though we don't currently use this feature, btrfs also supports incremental backups in a very efficient way (see the btrfs incremental backup documentation).
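For reference, the snapshot-and-sync workflow described above boils down to a few commands; a minimal sketch with made-up paths and a made-up backup host:
# btrfs subvolume snapshot -r /data/documents /data/snapshots/2024-01-01
# btrfs send /data/snapshots/2024-01-01 | ssh backup btrfs receive /backup/snapshots
# btrfs send -p /data/snapshots/2024-01-01 /data/snapshots/2024-01-02 | ssh backup btrfs receive /backup/snapshots
The first send transfers the whole read-only snapshot; the second, with -p, sends only the differences between the two snapshots, which is what makes frequent offsite backups cheap.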
I hope to have illustrated how useful btrfs can be, and how it could be one of the few options you have available when you feel the need for a COW filesystem.
The hierarchy is about this:
It simply defines how directories and files are organized. For example you know that “/” is the root directory, the first, most upper-level directory, basically. Then under “/” you have the “home” directory. And in the home directory you can place personal directories for different users using this system.
So if you have a Linux computer, you will know that in “/home/john” you will find John’s personal files and in “/home/jane” you will find Jane’s files. Two users, two home directories. The core thing being that the hierarchy makes it clear that personal directories for every user should exist within the “/home” directory. Without a hierarchy, everyone would place personal directories wherever they “feel” like it. Then you, as the administrator, would have to look in many different (random) places for these, wasting a lot of time.
Something called the FHS (Filesystem Hierarchy Standard) also tells you that in the “/etc” directory you can find configuration files. Under “/usr/bin” you can find binary (executable) files, basically programs.
And so on, and so on. That’s the hierarchy. Just like in Windows you know that you will find programs installed under the “Program Files” directory.
The filesystem on the other hand is much more… mysterious and advanced. As the name suggests, it’s a system designed to store your files (and directories, data in general).
Why is such a system needed? Because stored data is actually a big mess of ones and zeroes (0,1). If you take a look at a hard disk, from start to finish, it's just a biiiiig loooong enumeration of zeroes and ones.
So if you scan a disk from start to finish all you will see is:
010010010010001001001101101101001010101010111001010101010101010101…
Billions and billions of bits.
And in this long mess of zeroes and ones we have our files, our directories, our pictures, programs, all our data! So there's a need for a system (a filesystem) to make sense of this mess of continuous zeroes and ones. The system also has to know where exactly to place zeroes and ones when we create a file. And it has to know which zeroes and ones belong to a file when we open it.
So there’s a whole complicated filesystem that keeps track of all of this stuff so we, the users, don’t have to. All we do is double click on “smaug-the-dog.jpg” and a picture shows up on our screen. The filesystem goes through quite a list of complicated steps to give us the data belonging to that file.
And why are there so many filesystems? Because each filesystem is good at certain stuff, but not so good at other stuff. There’s always a balance of advantages plus disadvantages. So engineers came up with many different filesystems that organize data in different ways. A filesystem might be better for gaming, another might be better to store databases on it, another could be better for systems with very little processing power (for example, small Internet of Things devices), and so on.
Hope this demystified these concepts a little bit.
While ZFS is definitely the more robust of the two, it and BTRFS have similar functionality. One issue with BTRFS is that its RAID-like functionality is still in the development phase. E.g. mirroring (like RAID 1) and striping (RAID 0) work well, but parity (like ZFS's RAID-Z, or RAID 5/6) was quite buggy until around March this year - they've since fixed a big problem with recovery on these. It's still in the testing phase, so I wouldn't go for it in production.
ZFS is much longer in the tooth. That's not a bad thing in itself; in this case it means far more of its bugs have had time to be fixed. That coin does have a second face, though: it was revolutionary in 2005, but it seems to be stuck in a rut, with little change since then - no updates for new hardware, no adjustments to make it work well on SSDs, etc. However, its Linux implementation is rather recent (around the same age as BTRFS). At least it didn't have to start from scratch, though even just a few years ago it was still very buggy - you were better off using it on a true Unix like BSD instead. Now (as of around two years ago) its Linux implementation is in fact ready for production.
However, for a normal desktop most people would not want to use ZFS, nor "really" BTRFS. They're rather overkill, and they require a lot more resources (especially RAM) to work well, else they can cause drastic slowdowns - especially if you turn on de-duplication. BTRFS's memory requirements are a lot lower for similar functionality. But both are very technical - not exactly automated, nor set-and-forget.
Also just throwing a file system at a server without regard for what the server is doing is not a good idea. E.g. using ZFS (or for that matter BTRFS) on a server running a database is likely not a good choice. One of the problems is their CoW (Copy on Write) functionality. A database is going to write and re-write portions in one or more files constantly - CoW is going to cause performance issues for this, and likely force a lot of fragmentation as well. CoW really only works well if overwriting entire files instead of just portions within them, forget de-duplication for this scenario. ZFS / BTRFS may be a very good idea for a file server, but not the first choice on a DB backend server.
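As an aside (not something the paragraph above mentions): if you do end up hosting a database directory on Btrfs, copy-on-write can be disabled for just that directory with the commonly suggested chattr workaround. A sketch with a made-up path; the attribute only affects files created after it is set:
# mkdir /srv/db
# chattr +C /srv/db
New files under /srv/db are then rewritten in place like on a conventional filesystem, at the cost of losing Btrfs checksumming for that data.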
Note also, there's an "issue" with ZFS for just your average computer. It doesn't work too well without ECC RAM, and nearly all desktops & laptops a normal person is going to buy come with non-ECC RAM, sometimes to the point where they cannot be upgraded to ECC anyway. It's supposed to not be an issue, but my own experience, as well as some complaints in forums, does indicate that it tends to have more problems with "normal" RAM than other file systems do. For discussions and research on this, read the comments below my answer here: Irné Barnard's answer to Why does Windows still use NTFS? Why not ext4, the file system for Linux, since it actively prevents disk fragmentation?
Both these file systems are really intended more as a replacement for the very old and basically defunct RAID technologies from the 80s/90s. Their feature sets (at least most of them) only really make sense on more than one disk; some features may be useful even on just one disk, but then it's not much better than something like EXT4. Most desktops are installed on only a single disk, so I'd avoid them if not running a server. And once you're on a server, you should really be using ECC RAM in any case - which removes the arguably problematic corruptions of ZFS anyway. Not to mention, you tend to run a server with enough hardware to suit its purpose, and you don't want to do anything else on it.
What I can state, though: if a desktop (instead of a server) is going for multiple disks in RAID, it is definitely a better idea to use something like ZFS instead of LVM (or motherboard-based hardware RAID). Even BTRFS is a whole lot better than the more simplistic RAID 1 / 0 setups. But I say this because setting up a RAID (and more importantly maintaining it) requires daily maintenance anyway. It is NOT something you can just leave to work as it is - that's just begging to lose your data. So if anyone's thinking of a RAID, they should already be thinking of technical maintenance, at which stage it makes a lot more sense to use something with more capabilities than a standard RAID setup provides. Else (if they're unwilling to spend the effort on maintenance) they'd be better off using something like a union file system over something like EXT4 instead.
If Ubuntu is really going to drop their current default of EXT4 for ZFS, I fear they're going to lose the desktop market. ZFS is simply not a non-tech-friendly file system. It may be good for servers to default to a more robust and fault-tolerant file system, but desktop users are going to balk at the complexities of ZFS's pools and volumes; the difficulties (or even impossibilities) of adding, removing and expanding are going to make lots of people very confused and/or unhappy. BTRFS isn't a lot easier to use, though it is more flexible in that pools and volumes are allowed to grow after creation instead of needing re-creation from scratch. But for the normal user - the big reason Ubuntu is even in the game - this is stuff they'll never even think about. As long as Ubuntu can use ZFS (or whatever) with some automated setups and auto-adjusting maintenance, it might just work. But as soon as some technical issue pops up, like a scrub failure, most would uninstall Ubuntu.
Thus, for data storage in a file server ZFS makes the most sense. Otherwise it’s not ideal. BTRFS is slightly more in line with desktop and/or modern hardware, but still more a server focused file system. Perhaps some in-between idea might make more sense, e.g. Apple’s APFS is very similar to how ReiserFS (BTRFS’s predecessor) worked, only they’ve designed it more from a desktop client perspective. Not too sure of Windows’s ReFS, they’ve even discontinued it on all licenses except server and workstation.
Anyhow, ZFS I think is probably the best of the lot for large file servers, at least at this point in time, simply because it's the most well-tested alternative offering most of the required features. If BTRFS's tests and bug fixes are ever finally complete, I'll change my stance and say goodbye to ZFS: it's been great, but you're stuck in the mud and not moving with the times.
One thing I'm keeping my eye on is bcachefs. Effectively it's designed to be similar to ZFS and BTRFS but built on a caching idea where SSDs are used to hybridize a bunch of HDDs for performance. It does seem to be robust and fault-tolerant, provides much the same features as ZFS / BTRFS, and allows for even greater flexibility than BTRFS. At least it seems to be a move in the right direction, though I think it's a half-step instead of looking 20 years ahead.
Btrfs and ZFS are so-called copy-on-write file systems with built-in volume management. ZFS is the more mature of the two in general, but many people say they've used Btrfs in production. ZFS is released under the CDDL and the Linux people don't like that (it's considered GPL-incompatible), so there is no official ZFS in the kernel. It is however possible to run OpenZFS on Linux via the ZFS on Linux project. ZFS is native in Solaris and illumos, as well as being the next-gen file system for FreeBSD.
ext4 is a descendant of ext (the Extended File System), which begat ext2, then ext3, and finally ext4, the Fourth Extended File System. It's a modernization of a UFS-style file system and has always been considered the Linux standard. Red Hat recently endorsed SGI's XFS (the old IRIX file system), which supposedly has some benefits over ext4. These two are more traditional file systems, on the surface at least.
I hope this helps somewhat. Check the documentation for additional info. The benefits of CoW are data integrity and overall speed. But the main drawbacks of ZFS and Btrfs are related to how much memory they want: several gigabytes of memory make the system faster, and the more the better.
inode numbers are like pointers within the file system, and are stored within the file system itself. (In particular, a directory entry is the combination of a name and an inode, and you can see this using ls -i.) File descriptor numbers, on the other hand, are not stored anywhere within the file system, and are dynamically generated by the kernel when you call open() (or certain other system calls). They are pointers into the kernel's file descriptor table for a particular process.
An inode number always refers to something on a device somewhere. A file descriptor may also refer to an anonymous pipe, a socket, or some other kind of resource, such as the result of a call to epoll_create() on Linux.
An inode number unambiguously identifies a file or directory on a given device, but two files on different mounts may have the same inode. A file descriptor does not unambiguously identify anything by itself; in combination with a process ID it unambiguously identifies some resource on the system, even if you don't know which device it's on.
Every file or directory on a given device has a unique inode number. (If two files on the same device have the same inode number, then they are really the same file with two different names.) On the other hand, a file or directory may be opened several times by the same process or by different processes, and thus have multiple different file descriptors. Additionally, files or directories that are not currently open by any process do not have any file descriptors referring to them.
A valid file descriptor is associated with file mode flags and offset. That is, it grants the possessing process the right to read or write the file (or both), based on how the file descriptor was obtained; it also remembers some position within the file. However, it does not contain any metadata associated with the file itself, such as timestamps or Unix permission bits. An inode contains timestamps and Unix permission bits, but no file mode flags or offset.
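A quick way to see the difference from a shell (any readable file will do; fd 3 is just an arbitrary descriptor number):
$ ls -i /etc/hostname
$ exec 3< /etc/hostname
$ ls -l /proc/$$/fd/3
$ exec 3<&-
The first command prints the file's inode number, which lives on disk and is the same for every process. The exec line opens the file, so the shell now holds file descriptor 3; the /proc listing shows that descriptor existing only inside this particular process; closing it leaves the on-disk inode untouched.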
About the topic of stability, the BTRFS developers bury their heads in the sand, officially (see the FAQ) giving a non-answer ("stability? what is it? does it really exist?").
Assuming you mean stabler than EXT4, the answer is certainly not. There is no comparison when it comes to user base and code maturity. That, of course, doesn't imply that a home user can't use it in relative safety (I experienced three bugs, two of which made my system unbootable).
The EXT filesystems are very different from BTRFS/ZFS, as the latter have many features that the former lack (a quick Google search will tell you a lot).
If you want a “new-generation” system, go for ZFS. ZFS is stable.
Erez explained what btrfs is but since by coincidence I’ve been using it for a few weeks now I’ll take this opportunity to explain why.
I currently work on Android. Each Android development source tree takes a lot of disk space, something like 110 gigabytes when “clean” and after a build about 150 gigabytes. For efficiency I have these on an SSD (two, actually) so space is at a premium (I have a terabyte of space in total). As it happens I need several work areas (different Android versions and products) so something like ext4 would fill up quickly. It also takes about two hours to do a full build and sometimes this is required.
Btrfs solves these problems: I have a source-only tree that is automatically updated (repo sync) every night. The same script creates a snapshot of this source and does a full build. This way I have a fresh, full build available every morning. If I need the very latest code, or if my current build directory gets messed up and needs a clean/build-all combo to work again, I can instead just make a snapshot of the build directory, which takes a second (two if I need to type my sudo password)! Then I just copy whatever I did in the old directory to the new one, either with Beyond Compare or using Git tools (merge or cherry-pick), and I'm good to go. Creating snapshots is extremely quick; just copying the whole source tree would probably take ten minutes, never mind using the repo tool to recreate it.
So not only do I get to use disk space more efficiently, I'm actually working much more efficiently! Bye-bye to the two-hour compilation breaks because the damn thing won't compile anymore without running a full build. And I also have the very latest version ready every day to be taken into use in a clean environment, rather than the environment I've been smashing away at for the past several weeks.
If you are interested in the nitty-gritty, I explained how to do this here: Using btrfs to speed up Android development. My setup is such that I save the old daily build directories and delete them manually, just in case something urgent pops up from yesterday.
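For the curious, the core of such a nightly setup is a single command, assuming the source tree was created as a btrfs subvolume (paths here are made up):
$ sudo btrfs subvolume snapshot /ssd/aosp-source /ssd/aosp-build-$(date +%F)
The nightly repo sync runs in the source-only tree first, the full build then runs inside the snapshot, and only the blocks the build changes consume additional space.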
The advantage of btrfs over ZFS is that btrfs is a native Linux filesystem merged into the mainline Linux kernel, so it is usually compiled into the distribution's kernel and available out of the box.
In contrast, ZFS isn't in the Linux kernel tree, so you either need to use ZFS via FUSE, which gives lower performance, or use an out-of-tree module built for your kernel version to enable ZFS support.
Btrfs also uses Linux's page cache, while the ZFS kernel module implements its own ARC cache, which again can affect performance.
Over ext3/ext4 advantages of btrfs are:
- The copy-on-write design, which allows two different files to share the same physical storage while acting as separate files. Internally, only the changes to one of the files are saved instead of keeping two distinct copies.
- Snapshot support — copies of subvolumes (specially marked folders) that initially don't consume any disk space, and only future modifications are additionally saved. Snapshot creation is also very fast, usually less than 1 second.
- Disk quotas support — limiting disk space usage per subvolume
- The number of files is not practically limited, unlike ext3/4 filesystems, which have a fixed number of inodes
- Transparent compression — files are compressed on disk, but system views them as uncompressed when reading data from them. This saves disk space.
- Volume management built into the filesystem — one filesystem can utilize multiple physical drives in an arrangement called a RAID array. There are several types of RAID; for details on the status of btrfs RAID, look at the documentation (see the command sketch after the warning below).
WARNING: As of writing this answer, RAID 5/6 aren't stable on btrfs. RAID 0 and 1 are supposed to work with no problems. Look up the current state in the documentation.
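A minimal sketch of the multi-device and compression features mentioned above (device names and mount point are placeholders; zstd compression needs a reasonably recent kernel):
# mkfs.btrfs -d raid1 -m raid1 /dev/sdb /dev/sdc
# mount -o compress=zstd /dev/sdb /mnt/data
# btrfs subvolume create /mnt/data/projects
# btrfs subvolume snapshot /mnt/data/projects /mnt/data/projects-snap
The first command mirrors both data (-d) and metadata (-m) across the two disks, the mount option enables transparent compression, and the last two commands create a subvolume and an instant snapshot of it.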
FAT32 is quite an old filesystem. All the btrfs advantages over ext3/4 apply over FAT32 too, plus additionally:
- No practical limit on file size. FAT32 supports a maximum file size of 4 GB
- Power loss can't corrupt the filesystem (not the case on FAT32)
- No practical limit on volume size. FAT32 supports only volumes up to 2TB.
- Btrfs is better at handling a large number of files while maintaining performance
Btrfs is the only COW (copy on write) filesystem that you can easily install via the graphical install program of fedora (and probably other distributions). I'll explain the significance of COW below, but for now, the important thing is that if you want COW and you don't want to fiddle around with extra repositories, kernel driver installs and other somewhat complicated installation steps, then btrfs is really your only choice.
Now, why would you want COW?
Copy on write means that instead of modifying blocks in place every time you update your data, your filesystem will transparently create a whole new block and repoint the metadata pointers to the new block, leaving the old block in place. That might seem inefficient, but actually has some pretty big benefits:
- Since you don’t “repoint” until the new block is completely written out, if you lose power or crash in the middle of a write, you end up with either the old block or the new block, but not a half-written corrupted block. So you don’t need to fsck filesystems on startup and you somewhat lower your risk of data corruption.
- You can “snapshot” the filesystem at any point, creating a snapshot entry in the metadata with the current set of pointers. This protects old blocks from being garbage collected later on and allows the filesystem to present a volume as it was during the snapshot. (in other words, you have instant rollback capabilities). You can even clone that volume to make it a writable volume based on the snapshot.
Btrfs also has block checksums and block mirroring capabilities (or even RAID5-like parity redundancy), which means that it can protect you from device failures (like RAID) and also do bitrot protection and self-healing (unlike RAID). If a block is read and its checksum doesn’t match, Btrfs will attempt to read the block from another device and if that works, it will write the block back to the “bad” device, correcting the error in place automatically.
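If you want to exercise the checksum and self-healing machinery yourself, a scrub can be started manually; a small sketch, assuming the filesystem is mounted at /mnt/data:
# btrfs scrub start -B /mnt/data
# btrfs device stats /mnt/data
The -B flag keeps the scrub in the foreground and prints statistics when it finishes; device stats shows per-device counters of corruption and I/O errors that were found (and, where redundant copies exist, repaired).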
I’ve used Btrfs on Fedora for a couple of years, and it seems very stable so far. However, on our servers, we use ZFS on Linux because a few years ago, when Btrfs was much less mature, we had some serious stability issues with it, switched to ZFS on Linux, and never tried Btrfs on the server-side again.
TL;DR
Btrfs is an “easy install”, copy-on-write, snapshot-enabled, self-healing filesystem for Linux that seems pretty stable. Your other choice is ZFS on Linux, which may be more stable, but requires a few more steps to install on typical Linux distributions.
The number of inodes varies from filesystem to filesystem and is set at the time of filesystem creation. You can view the number of inodes (free/used) with the GNU df command.
Example output:
# df -i
Filesystem Inodes IUsed IFree IUse% Mounted on
/dev/hda3 18364896 240571 18124325 2% /
/dev/hda1 130560 36 130524 1% /boot
tmpfs 240945 1 240944 1% /dev/shm
For USB flash drives, Ext3 is still the best choice. As much as I hate VFAT, I love my friends more.
*** recommendation below is very specific to flash drives ***
Ext2 - Yes - too old and no journaling support. It will work fine, but there is no compelling reason to stick to it.
Ext3 - No - very stable, journaling support, capacity up to 16TB, supported by all GNU/Linux distros. Ext3's journaling is not flash friendly. It helps with fast recovery though.
Ext4 - No - stable in newer kernel versions, faster file system check, improved large file performance (extent support), journaled checksums. RHEL announced stable Ext4 support only in 5.6 release. e2fsprogs is still limited to 16TB capacity. I use Ext4 with Debian distro for my laptop backup drives. It is my personal #1 choice.
XFS - No - overkill, not much portability, journaling support, up to 100TB, designed for storage servers. Red Hat revived XFS in RHEL 5.2. Most of the ex-SGI XFS developers are now with Red Hat. It is the officially recommended filesystem for servers with > 16TB capacity. Fast mkfs and fsck. Excellent large file support. Linux 2.6.37/39 got some really nice XFS metadata performance improvements.
ReiserFS - No - dead project.
VFAT - Yes - most portable across operating systems, no other good reason.
Btrfs - No - Linux's answer to ZFS. Not ready yet, but most promising.
I've never used or contributed to btrfs, though looking at the literature, ZFS was very WAFL-inspired [2, 3, 4], and btrfs feels very ZFS-inspired [1], so a lot of the benefits of using both filesystems from a user and administrator perspective are very similar.
At the highest level, I'd split the benefits of btrfs versus any other filesystem into 3 categories.
1. Benefits arising out of using a copy on write tree with no in place updates as storage primitive.
2. Benefits arising out of a separation of logical and physical units of data management.
3. Everything else
The first two buckets are the ones to pay attention to. If you take a look at the literature describing btrfs, WAFL and ZFS, the first two are the real game changers for performance, reliability and data management - most of the other features can be found in or ported to other filesystems.
Tree structure based benefits
1. Creating snapshots of your data is extremely efficient
This is the single most important benefit - snapshots are incredibly efficient to create and have very little performance impact. This is because creating a snapshot is as simple as creating a new root pointer to your data set and incrementing reference counts on metadata. You can create snapshots in the midst of other activity on your system without adding any significant load to it.
2. Calculating incremental deltas for backups is extremely efficient.
Calculating deltas between two different versions of trees, be it snapshots or writeable clones is highly efficient, since it relies on a pure metadata comparison of reference counts in different snapshots.
3. Creating writeable clones is extremely efficient
Creating a clone is very similar to creating a snapshot and is thus similarly efficient, as it simply involves creating a new root pointer to a tree and adding bookkeeping for reference counting.
This is a killer feature in environments where you have many copies of almost the same data, such as large test databases or virtual machine images.
4. Rolling back to a given snapshot is super efficient
Rolling back to a given older version of the filesystem is highly efficient as it primarily involves swapping a pointer to an older version of a tree.
5. The filesystem has transactional semantics
Non-transactional semantics for a filesystem are bad. They can expose you to corruption when operations update different blocks in different places non-atomically.
For example, to create a new directory entry, you need to allocate an inode and also make a directory entry point to it. If this is not done atomically, depending on your implementation, you could end up with a leaked inode or a directory entry that points to an unallocated inode.
With btrfs and similar filesystems, you never write in place, and an update to a new version of the filesystem is only complete when you update the root pointer of the new tree. This means that all your filesystem operations move you from one consistent state to another.
If you crash in the middle of an update, since you never wrote in place and the final tree root pointer update never hit the disks, there is nothing required to get your filesystem back to a consistent state.
This class of benefits is the most important - NetApp shipped a product based on this idea in 1994, and it was a huge competitive differentiator with respect to its success. It was also one of the primary similarities cited when NetApp sued Sun for ZFS [7].
Benefits arising from having a separation of volumes and subvolumes
One key fundamental of building a high-performing file system is making full use of the I/O bandwidth of a large collection of disks. For performance reasons, you want to have as many spindles as possible to parallelize I/O.
The initial solution was to simply create one giant filesystem that had a one to one map to a large pool of disks to increase disk bandwidth and also amortize the cost of your RAID parity disk(s).
However, doing so creates a data management nightmare - you can't for example snapshot or backup just your important data at a high frequency without also doing the same for your low priority data.
The solution that addressed both the performance needs and the data management needs was to make the tree structures live inside what was a "traditional" filesystem that mapped directly to physical RAID volumes (known as aggregates, pools and volumes in WAFL, ZFS and btrfs respectively), while user data lives inside logical volumes (known as flexible volumes, volumes and subvolumes in WAFL, ZFS and btrfs respectively).
The benefits of this are as below:
1. High performance IO for even small filesystems by sharing spindles with larger filesystems
The idea here is your volume can be a direct mapping to your RAID layout, but your subvolume can be much smaller and still benefit from the raw IO bandwidth.
2. Instant, easy on the fly partition resizing
Partitions in the scheme of subvolumes are just quotas.
Specifically, different subvolumes actually share free space, so if you find your volumes filling up in a way different from what you planned, you can just adjust the partitions on the fly.
This makes it easier to aggregate your free space, improve storage utilization, and adjust your space allocation on the fly if your partitions are filling up in a manner different to how you had planned (see the sketch after this list).
3. Decoupling the unit of data management from the unit of physical IO bandwidth / redundancy
By having subvolumes that are distinct from physical volumes, you can do things such as set up different snapshot and backup schedules for part of your data without applying them to all of it.
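A minimal sketch of how this looks with btrfs subvolumes and quota groups (paths and sizes are made up for illustration):
# btrfs subvolume create /pool/important
# btrfs subvolume create /pool/scratch
# btrfs quota enable /pool
# btrfs qgroup limit 50G /pool/scratch
# btrfs subvolume snapshot -r /pool/important /pool/important-hourly
The two subvolumes share the pool's free space, the qgroup limit acts as an adjustable "partition size" for the scratch area, and the read-only snapshot covers only the important data, not the scratch data.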
Miscellany:
btrfs also uses extents to prevent fragmentation (unlike ZFS and WAFL, btrfs seems to reference-count extents rather than blocks [6]), and has block checksums, built-in compression and various other features, but these are typically the class of features that can be adapted to other filesystems without changing their fundamental architecture.
[1]: http://lwn.net/Articles/342892/
[2]: http://www.usenix.org/event/usenix08/tech/full_papers/edwards/edwards.pdf
[3]: http://www.cs.fsu.edu/~awang/courses/cop5611_s2007/wafl.pdf
[4]: http://solaris-training.com/classp/200_HTML/docs/zfs_wp.pdf
[5]: http://hub.opensolaris.org/bin/download/Community+Group+zfs/docs/zfslast.pdf
[6]: http://video.linux.com/video/1787
[7]: http://blogs.netapp.com/dave/2007/09/netapp-sues-sun.html
Btrfs is an improved version, and the good thing about it is that it was rebuilt from scratch. It offers greater volumes (up to 16 EB) and improved fault tolerance, with deduplication and mirroring, but it doesn't offer faster speed.
For enterprises who use very large programs with very large databases, having a seemingly continuous file system across multiple hard drives would be very beneficial because it can make consolidation of data much easier. Data deduplication would reduce the amount of actual space data would occupy, and data mirroring would become easier with btrfs as well when there is a single, broad file system that needs to be mirrored.
So the answer to your question is: it's not faster, and it's not yet officially declared stable.
Yes.
I expect most of the answers to this question will read like other great debates (vi vs. emacs).
From my camp, ZFS is a battle-tested file system that has been around for more than 10 years. Compared to most other UNIX file systems, it solves the problem as a whole rather than as a large series of pieces (it unifies the volume manager and the file system). Few file systems implement the level of checksum usage that ZFS does (it checks on all reads as well as writes, and can transparently fix latent bit-rot errors on the fly). It has a background process called "scrub" that reads the data on the drive and fixes any blocks that have "gone bad" [failed the checksum] before the failure is hit by a read request from an application.
It has snapshots, replication and snapshot cloning. Combined with a boot environment manager, this can allow for risk-free updates to production systems (you can have hourly COW snapshots of your running, live production file systems without blowing your disk budget).
Because ZFS unifies the volume manager with the file systems, you can have all the "partitions" you want without having to worry about over- and under-allocating space. Disk space in ZFS is allocated like virtual memory in RAM. This makes life so much nicer. It also has an inline compression algorithm named lz4 that can provide good compression with minimal CPU overhead.
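A minimal sketch of what that looks like in practice (pool name, device names and sizes are made up):
# zpool create tank mirror /dev/sdb /dev/sdc
# zfs create -o compression=lz4 tank/home
# zfs set quota=100G tank/home
# zfs snapshot tank/home@before-upgrade
The pool acts as both volume manager and filesystem; tank/home is a "partition" that draws from the pool's shared free space, gets lz4 compression, an adjustable quota rather than a fixed size, and an instant snapshot.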
Btrfs is still a bit new and not as well tested and battle-hardened. I have the impression that Btrfs was not written with the level of paranoia about data safety/loss prevention that ZFS's authors had. When you read about Btrfs, it is not uncommon for someone to say "it has ZFS features" or "it will have that ZFS feature in the future."
Right now, I'd say ZFS is the gold standard and Btrfs is an up-and-comer that hopes to compete / is competing against ZFS. Actually, for Btrfs the more proper comparison would be with a recent version of NTFS.
Once upon a time, I was a Slackware user.
The thing that Slackware users have is an itch to go beyond what we had. In this case, the stock Slackware 8.1 ext3 filesystem felt terribly slow to me. So I went for a manual kernel compile and started trying other filesystems. At that time I chose XFS, which felt way faster than ext3 on the same hard disk drive.
Fast forward a few years, and ext4 became the new standard. No longer a Slackware user, and not using the same hardware, I could feel the difference between ext4 and ext3. Ext4 is even faster than XFS in my experience, and I have not been pulling my hair out using ext4.
So my respect to ext4 developers.
Neither really.
Ext4 is the 4th generation of Ext and backwards compatible. NTFS doesn't advertise the generation in its name, but it's also at its 4th (maybe 5th, depending on who's counting) generation and backwards compatible. Neither of them is forward compatible from old versions. Both originated in the same time period (the early 1990s) and both have kept adding features. If you compare the two feature by feature, it's pretty much a wash to say one is more advanced. Each has a few unique features, as well as drawbacks.
It's not really a useful comparison either. A Windows system is going to use NTFS. A Linux system is going to run Ext. Interoperability is abstracted, meh.
In my limited experience, no.
I've had no fewer than three power outages completely destroy an entire BTRFS partition, and despite being able to recover most of the data, no repair tools - official or otherwise - were able to fix the filesystem, even when the corruption affected only a handful of files and was restricted to a single subvolume.
Another time I had a problem with a bent SATA cable that caused one drive to randomly stall when anything moved it slightly out of place. One such stall happened right in the middle of backing the drive up while debugging the issue, and the partition could no longer be unmounted. In the end I had no choice but to do a hard reboot, resulting in the same type of filesystem corruption as mentioned before.
Bad cables aside, power outages are not a rare thing, especially on laptops. Having to plug in another storage device of the same or greater capacity, recovering the files by hand, reformatting the damaged filesystem, and copying the data back is, needless to say, extremely inconvenient, possibly expensive, definitely time-consuming, and a major drawback of BTRFS.
If you value your time and data, stay away from BTRFS, at least on production machines or until these issues have been fully solved. It promises lots of features, but they are too often half-baked or not performing well in practice.
XFS is the default filesystem in RHEL-based distros, and Ext4 is the standard filesystem in Debian and Ubuntu distributions.
Maximum supported size for Ext4 is 16TB in both Red Hat Enterprise Linux 5 and Red Hat Enterprise Linux 6 and 50TB in Red Hat Enterprise Linux 7.
Ext4 handles many small files well and ensures metadata is correctly written even if the write cache loses power.
XFS is indeed better than ext4 in many respects. XFS supports file system sizes of up to 1 PB.
Advantages:
XFS scales to exabytes, Red Hat’s maximum supported XFS file system image is 100TB for Red Hat Enterprise Linux 5, 300TB for Red Hat Enterprise Linux 6, and 500TB for Red Hat Enterprise Linux 7. XFS is routinely measured as one of the highest performing file systems on large systems with enterprise workloads. For instance, a large system would be one with a relatively high number of CPUs, multiple HBAs, and connections to external disk arrays. XFS also performs well on smaller systems that have a multi-threaded, parallel I/O workload. The XFS backup and recovery process can be interrupted and resumed without freezing the file system.
Disadvantages:
XFS has relatively poor performance for single-threaded, metadata-intensive workloads, for example a workload that creates or deletes large numbers of small files in a single thread. Finally, you cannot shrink (reduce) XFS file systems in size, so you should take extra care not to over-allocate storage to an existing file system.
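To illustrate that last point: you can grow XFS online, but there is no shrink operation, so the only direction is up. A sketch assuming the filesystem sits on an LVM volume (names are hypothetical):
# lvextend -L +100G /dev/vg0/data
# xfs_growfs /srv/data
xfs_growfs expands the mounted filesystem to fill the enlarged device; there is no corresponding shrink tool.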
If you want to know more, see here
This may come down to semantics to some degree.
When you do a directory listing on ext4, you will get a list of filenames. Each of these filenames (ignoring symlinks for now) is associated with an inode on disk, which is a piece of metadata which describes the attributes of that file - its size, its timestamps, the location of its data blocks, etc.
As an example let’s get a directory listing for a directory with a single file in it, named “a”:
- # strace -v -e getdents ls testdir
- getdents(3, {{d_ino=2756063, d_off=3966251527637366025, d_reclen=24, d_name="."} {d_ino=2755343, d_off=4843559106273266539, d_reclen=24, d_name="a"} {d_ino=2752513, d_off=9223372036854775807, d_reclen=24, d_name=".."}}, 32768) = 72
We can see that the file “a” has inode number 2755343. If we stat that file, we get its inode number and some of its other metadata, all of which is stored in the inode:
- # stat testdir/a
- File: `testdir/a'
- Size: 4 Blocks: 8 IO Block: 4096 regular file
- Device: fd00h/64768d Inode: 2755343 Links: 1
- Access: (0644/-rw-r--r--) Uid: ( 0/ root) Gid: ( 0/ root)
- Access: 2020-03-10 09:50:44.388028747 -0500
- Modify: 2020-03-10 09:52:59.659817195 -0500
- Change: 2020-03-10 09:52:59.659817195 -0500
If we want to know where that inode’s actual data lives, we can ask for that information, which is also found via the inode’s metadata:
- [root@bp-05 ~]# filefrag -v testdir/a
- Filesystem type is: ef53
- File size of testdir/a is 4 (1 block, blocksize 4096)
- ext logical physical expected length flags
- 0 0 11436385 1 eof
- testdir/a: 1 extent found
In any case, for every file a user sees in a directory listing, there is ultimately one inode on disk which contains all of the information about that file, and pointers to that file’s data on disk. I’ll leave it to you to decide whether or not that means they are “the same”.
Inodes are allocated when the admin/user runs mkfs to format the file system. With ext3, for example, you can specify the inode count using the mkfs.ext3 -N option. You can view the current settings via the tune2fs command.
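As a rough sketch of those commands (the device name and mount point are hypothetical):
- # mkfs.ext3 -N 1000000 /dev/sdX1   # create the filesystem with roughly one million inodes
- # tune2fs -l /dev/sdX1 | grep -i inode   # show the inode count, free inodes, inode size, etc.
- # df -i /mnt/data   # once mounted, report inode usage per filesystem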
Can you please provide a comment or reference as to how they are inaccurate? Please update with a comment on what specific distribution (and version if you know that as well) and I’ll provide a more detailed response.
It has been a while since I set quotas per user, so I did some generic digging and found this in the Red Hat documentation for their quota management:
The hard block limit is the absolute maximum amount of disk space that a user or group can use. Once this limit is reached, no further disk space can be used.
The soft block limit defines the maximum amount of disk space that can be used. However, unlike the hard limit, the soft limit can be exceeded for a certain amount of time. That time is known as the grace period. The grace period can be expressed in seconds, minutes, hours, days, weeks, or months.
Another thing to note is that their quota accounting is done PER BLOCK, not per byte.
So if you are talking about quotas on a Red Hat based system (Fedora, CentOS, and others), this is likely where the confusion comes from. If you are talking about another distro or another filesystem type, again, just provide me some context and I’ll help if I can.
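To make the block-based soft/hard limits concrete, here is a hedged example using the standard Linux quota tools; the user name, limits, and mount point are made up, and the limits are expressed in 1 KiB blocks:
- # setquota -u alice 500000 600000 0 0 /home   # ~500 MB soft and ~600 MB hard block limits, no inode limits
- # edquota -t   # edit the grace period allowed for exceeding a soft limit
- # repquota -s /home   # report current usage and limits in human-readable units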
A file system hierarchy is a set of directories (and perhaps a set of rules for what goes where in it). A file system is how you store it.
Storage devices, like hard drives, are just a long long series of bytes you can read and write; they have absolutely no structure on their own. A file system is a set of rules for where you start reading, how you store “a directory” (byte for byte), how you say that a file is in a directory, how you specify where on the disk the contents of that file are, and how you specify what kind of a file it is. (A directory is typically just a special kind of file, which makes it a bit easier to implement.)
This means that the two are independent: You can implement a linux file system hierarchy on any file system (or even on paper, if you’re just deciding how you lay things out in your new distro) — and you can put whatever directory structure you want on a file system.
This ext4 feature allows the mapping of logical block numbers for a particular inode to physical blocks on the storage device to be stored using an extent tree, which is a more efficient data structure than the traditional indirect block scheme used by the ext2 and ext3 file systems.
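If you want to see whether extents are in play on a given system, a hedged sketch (the device and file names are hypothetical):
- # tune2fs -l /dev/sdX1 | grep -i features   # an "extent" entry in the feature list means the filesystem can use extent trees
- # lsattr somefile   # an 'e' in the attribute flags means this particular file is mapped with extents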
Q: What is the maximum file size in EXT4? Why does it have that limit?
16 Terabytes in a single file, 1 Exabyte in a single volume - ext4 - Wikipedia
Realistically though there are a lot of limitations you’ll hit in the operating system before you can create a file that large. The EXT4 filesystem and its integration into linux in a vanilla installation and kernel aren’t sized and tuned to work with files of this size out of the box. Not saying it can’t be done, but you’ll have to do a lot of tuning before you can actually save a file that large.
As for why it has this limit: because of the way the filesystem is constructed, there is basically a map of which parts of your storage make up the file, and this map has a finite number of entries. The same is true of directories; there is a finite number of entries that you can put in a directory, which translates to a maximum number of files you can put into a directory.
Regarding the system limitations I mentioned, there are a lot of additional components that go into storing your data under a modern Linux-based operating system and filesystem, for example the device mapper, the logical volume manager, the quota system, and so on. These also have to be configured to work properly with large files and large directories.
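As a hedged illustration of the per-file limit (the paths are hypothetical, and this assumes a 4 KiB block size, where ext4 tops out at 16 TiB per file), creating a sparse file past the limit fails immediately:
- # truncate -s 15T /mnt/data/sparse_ok   # within the limit; succeeds without allocating data blocks
- # truncate -s 17T /mnt/data/sparse_too_big   # beyond the per-file limit; fails with "File too large" (EFBIG)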
If you want snapshots, on-the-fly compression, data integrity checks, and subvolumes, then go with btrfs. If you are running a large database or virtual machines you will need to make some settings changes for the directories homing them, or use xfs instead. Ext4 is basically obsolete, replaced by btrfs and xfs in the main Linux distributions as the default file system.
ZFS and BTRFS are both modern copy-on-write file systems. Ext is just an old-style fs that has been superseded by modern alternatives.
Here’s an extremely simplified view of a file system.
So, you have a directory entry that contains a name and an inode number.
That inode number is used as an index into the inode table, to find a particular inode. The inode contains at least a reference count, and some sort of pointer to storage blocks that hold the data for the file.
The reference count in the inode tracks the number of directory entries pointing to that inode. When you unlink a file, that doesn’t directly delete the file. Rather, it decrements the reference count in the inode, and removes the directory entry.
When/if the inode’s reference count reaches zero, that means there’s no longer any directory entry pointing to the inode, so the storage consumed by that file can be released for reuse.
This means you can have a (nearly) arbitrary number of directory entries pointing to the same inode, so the same data can appear under different names, and/or in different directories, at the same time, while consuming essentially no extra storage space (only the space for a directory entry).
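A minimal demonstration of that point (the file names are made up): create a file, hard-link it, and watch the inode number and link count:
- $ echo hello > original
- $ ln original alias   # add a second directory entry pointing at the same inode
- $ ls -li original alias   # both names show the identical inode number and a link count of 2
- $ rm original   # removes one name; the data remains reachable via 'alias' until the count reaches zero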
I’ll repeat one point for emphasis though: this is an extremely simplified view of things.
“a hard drive that stores files only”
As opposed to what other things? Peanuts?
Here’s a tip, all things on a computer, especially anything in a Unix/Linux computer (since you specify EXT4), are “files”. Even on other systems, “files” are what you store on a hard drive. You’d have to go way out of your way to do anything else with a hard drive, and arguably end up with just a different form of “file” (e.g. use a raw disk as a database engine’s storage location).
I’m assuming you mean just data files: no programs, no system, settings, cache, or temporary files, nothing else like that. Just things like pictures, documents, audio, video, etc.
In that case: Could I ask a follow-up question?
How are you using these files?
I mean, are they just sitting there? Perhaps one year later, you may want to open and view one. But, other than that, they never change?
Do you add new files periodically? Do you change existing ones? How often?
What about the organization of the files? All in one folder? Or do you have multiple folders and subfolders to group files into some form of “category” for your own ease of finding them later?
All of that combines to shift the answer anywhere from “roughly 50% of the total drive space needs to stay free, or you start seeing performance degradation” to “fill it to the brim, it makes no difference”, and every level in between.
Per the following description of extents:
“Extents reduce the amount of metadata needed to keep track of the data blocks for large files. Instead of storing a list of every individual block which makes up the file, the idea is to store just the address of the first and last block of each continuous range of blocks. These continuous ranges of data blocks (and the pairs of numbers which represent them) are called extents.”
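Continuing that thought, filefrag reports how many extents a file actually occupies, which for a mostly contiguous file is far fewer than its block count (the file path below is hypothetical):
- # filefrag /mnt/data/bigfile.iso   # prints something like "bigfile.iso: 3 extents found" for a file stored in 3 contiguous ranges
- # filefrag -v /mnt/data/bigfile.iso   # list each extent’s logical and physical block ranges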