Using HDP 2.2 Sandbox on the Ambari Dashboard I notice that HDFS Disk Usage circle was red and the usage was 95%. I don’t have all that much data. So I set out to find out what was using it all and I found:
$ sudo -u hdfs hadoop fs -du / 7497031 /app-logs 229265468 /apps 881591 /demo 413489640 /hdp 0 /mapred 0 /mr-history 20616255563 /ranger 0 /system 4599988 /tmp 239285045 /user
Ranger? It looks like it’s all audit files.
$ sudo -u hdfs hadoop fs -du /ranger/audit/ 274172 /ranger/audit/hbaseMaster 3027170361 /ranger/audit/hbaseRegional 17578061921 /ranger/audit/hdfs 0 /ranger/audit/knox 0 /ranger/audit/storm
Why so many?