Hi,
I'm a student in a Master's program in medical information systems management in Brazil.
I'm a PHP web developer in my country.
I'm doing a thesis on information drift and machine learning.
I have installed Hortonworks in a virtual machine running Linux Mint.
But I have a problem: I do not know the best way to build the following architecture:
1- Collect many tweets (I'm using twitter4j and inserting them into MongoDB). Is this a good approach?
2- Use Hadoop with Mahout and apply deep learning / boosting algorithms.
3- Store the filtered data in a new DB.
So I'm a little lost as to whether this is the right way to build this architecture. I have also read about Apache Storm.
Could someone give me some help?