Quantcast
Channel: Hortonworks » All Replies
Viewing all articles
Browse latest Browse all 3435

Query on bucketed table stored as ORCfile

$
0
0

This is a cross-post from the Sandbox forum – Please let me know if this is against the rules, I apologize in advance for doing this – only trying to make sure my question gets to the appropriate audience.

Hello – I want to test out the new ACID transaction capabilities of Hive 0.14 so I’ve got the HDP 2.2 Preview Sandbox up and running, I was able to import my data into HDFS and create an external table on my delimited data.

Next I created a bucketed ORCfile table:
create table diskavailable_orc_bucketed (location string,gbtotal string,gbfree string,servertype string,updated string)
partitioned by (ds string)
clustered by (updated) into 256 buckets
stored as orc;

and now I’m having issues copying my data into it with this query:
CREATE TEMPORARY FUNCTION rowSequence AS ‘org.apache.hadoop.hive.contrib.udf.UDFRowSequence';
insert overwrite table diskavailable_orc_bucketed PARTITION (ds) select rowSequence() as ds, * from diskavailable_orc;

I am using the hive-contrib-0.14.0.jar file as it has the rowSequence capability I need for the partitioning. When I run the two lines above using Hue interface it just keeps saying “Waiting for query…” (screenshot here: http://postimg.org/image/wp0k5qygd/ ) and never seems to execute.

Does it look like I’ve taken some wrong steps or is something happening with my Sandbox? I’m not getting the expected – data copied into the diskavailable_orc_bucketed table.

Any help would be greatly appreciated. Thank you.


Viewing all articles
Browse latest Browse all 3435

Trending Articles



<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>