Hi everyone,
I’m currently trying to implement batch puts into HBase (ver 1.0.0 but it can be changed if needed) from Spark Streaming.
I’ve read in the HBase: The Definitive Guide about batch operations. Basically it shows that you can put a bunch of different actions in a List and then perform table.batch on that row.
I’ve got several questions regarding this approach:
1. It throws a checked InterruptedException from the batch, there’s no indication of why it is throwing it. After digging a little bit through the source code I see that it is executed asynchronously and so my question is: Who/What can interrupt the batch execution?
2. Could you explain (or share a link) about how this batch process is actually executed?
Also is it the best way to perform batch insertions to HBase (from Spark Streaming) or is there another preferred way to perform the batch?
Thanks in advance