In general Hadoop tools do not take replication into account unless you are talking about disk space…
So hdfs dfsadmin -report
Will tell you the amount of HDFS disk space used, including replication, but something like
Hadoop fs -du -s -h “someingorother”
Will tell you the unreplicated size of that file or dir
↧
Reply To: Data size on HDFS
↧