What is going OOM? The DataNode process? Are you also running YARN apps on the DataNode boxes? Are they using up all the memory when the OOM occurs? What does Ganglia tell you is going on?
How did you calculate YARN memory settings? Did you use Hortonworks’ magic formula?
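In case it helps to compare against your settings, here is a rough sketch of that formula as I remember it from the Hortonworks HDP docs. Treat it as an assumption and check it against the docs for your HDP version. The function name and its parameters are mine, not part of any Hadoop tool, and the reserved memory and minimum container size are inputs you look up yourself in the Hortonworks tables for your node size.

```python
def yarn_memory_settings(cores, disks, total_ram_mb, reserved_mb, min_container_mb):
    """Hypothetical helper: estimate per-node YARN/MapReduce memory settings.

    Sketch of the Hortonworks sizing formula as recalled, not an official tool.
    reserved_mb is the memory set aside for the OS and other daemons
    (DataNode, HBase, etc.); min_container_mb depends on total node RAM.
    """
    available_mb = total_ram_mb - reserved_mb

    # containers = min(2 * cores, 1.8 * disks, available RAM / min container size)
    # max(1, ...) is my own guard so a tiny node still gets one container.
    containers = max(1, int(min(2 * cores,
                                1.8 * disks,
                                available_mb / min_container_mb)))

    # RAM per container = max(min container size, available RAM / containers)
    ram_per_container = max(min_container_mb, available_mb // containers)

    return {
        "containers": containers,
        "yarn.nodemanager.resource.memory-mb": containers * ram_per_container,
        "yarn.scheduler.minimum-allocation-mb": ram_per_container,
        "yarn.scheduler.maximum-allocation-mb": containers * ram_per_container,
        "mapreduce.map.memory.mb": ram_per_container,
        "mapreduce.reduce.memory.mb": 2 * ram_per_container,
        # Java heap is kept at about 80% of the container size
        "mapreduce.map.java.opts": "-Xmx%dm" % int(0.8 * ram_per_container),
        "mapreduce.reduce.java.opts": "-Xmx%dm" % int(0.8 * 2 * ram_per_container),
    }


if __name__ == "__main__":
    # Example node: 12 cores, 12 disks, 48 GB RAM, 6 GB reserved, 2 GB min container
    settings = yarn_memory_settings(12, 12, 48 * 1024, 6 * 1024, 2048)
    for key, value in settings.items():
        print(key, "=", value)
```

If yarn.nodemanager.resource.memory-mb plus the DataNode heap plus whatever else runs on the box adds up to more than the physical RAM, that would be one place to look for the OOM.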