Here are few important features of HDFS:
- HDFS doesn't need highly expensive storage devices
- Uses off the shelf hardware.
- Rapid Elasticity
- Need more capacity, just assign some more nodes
- Scalable
- Can add or remove nodes with little effort or reconfiguration.
- Runs entirely in userspace
- The file system is dynamically distributed across multiple computers
- Allows for nodes to be added or removed easily
- Highly scalable in a horizontal fashion
- Highly Fault Tolerant: Automatic Failure Recovery System
- High aggregate throughput for streaming large files
- Supports replication and locality features
- Designed to work with systems with vary large file (files with size in TB) and few in number.
- Provides streaming access to file system data. It is specifically good for write once read many kind of files (for example Log files).
Tags