I am experiencing (what I think is) the same basic issue, with different details.
I’m running the 2.1 Sandbox using the VirtualBox image from Hortonworks.
I have Pentaho 5.1 set up with the HDP21 ‘shim’ to integrate with Hadoop 2.1, running on the (Windows 8) host machine.
The transfer does create the file in HDFS on the sandbox VM, but the new file always ends up empty.
The error in the log is:
Caused by: File /user/pdi/weblogs/raw/weblogs_rebuild.txt could only be replicated to 0 nodes instead of minReplication (=1). There are 1 datanode(s) running and 1 node(s) are excluded in this operation.
I am able to upload the same weblogs_rebuild.txt file via the File Uploader tool in the ‘Hue’ interface, so I don’t think this is a storage issue.
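One check that might narrow this down is whether the host can open a TCP connection to the DataNode as well as the NameNode, since this error is commonly reported when the client can reach the NameNode but not the DataNode. Below is a rough sketch I could run from the Windows host. The address and port numbers are assumptions, not values taken from my setup: 127.0.0.1 assumes NAT port forwarding to the VM, and 8020 (NameNode RPC) and 50010 (DataNode data transfer) are the usual Hadoop 2.x defaults, which the sandbox may override.

```python
import socket


def port_is_open(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts, and unreachable hosts.
        return False


def check_ports(host, ports):
    """Map each service name in `ports` to whether its port is reachable."""
    return {name: port_is_open(host, port) for name, port in ports.items()}


if __name__ == "__main__":
    # Assumption: the sandbox is reached through NAT port forwarding on localhost.
    SANDBOX_HOST = "127.0.0.1"
    # Assumption: Hadoop 2.x default ports; adjust to the sandbox's actual config.
    PORTS = {"namenode_rpc": 8020, "datanode_transfer": 50010}

    for name, reachable in check_ports(SANDBOX_HOST, PORTS).items():
        status = "reachable" if reachable else "NOT reachable"
        print(f"{name} ({SANDBOX_HOST}:{PORTS[name]}): {status}")
```

If the NameNode port shows as reachable and the DataNode port does not, that would at least be consistent with an empty file being created and the write then failing, though it would not prove that is the cause.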
Some of the forums I have found suggest that I need to change the network config of the VM image from “NAT” to “Bridged Adapter” (along the lines of the IP Address mapping issues mentioned in this thread).
However, when I do this, I am unable to access the machine at all (via Hue, Pentaho, etc.).
I’m guessing (hoping) that there’s a set of changes I can apply to the network settings to overcome this issue.
Is there any documentation or guidance on the recommended network settings for accessing the sandbox from the host machine?
Thanks!