I have a JSON file that I loaded into HDFS and then into a Hive table, as shown below, but I get an error when I run a select. I have tried various JSON SerDes with no luck so far. The issue seems to be with how the JSON file is formatted. Each row in the file is a JSON object. When I run a select on the table, the error says it expects a "," to separate name and value.
JSON file sample:
{"user":{"userlocation":"BANDUNG-INDONESIA","id":236827129,"name":"Powerpuff Girls","screenname":"TasyootReyoot","geoenabled":true},"tweetmessage":"RT @viddyVR: Kok gitu sih RT @TasyootReyoot: #HelloKitty itu aneh pala gede bekumis pula badan kecil kayak org kena polio\"","createddate":"2013-06-20T12:08:45","geolocation":null}
{"user":{"userlocation":"Bolton, UK","id":14141159,"name":"Chris Beckett","screenname":"ChrisBeckett","geoenabled":true},"tweetmessage":"vCOps people – Does Advanced Edition == 5 VMs? I know Std has UI and analytics VM, but what does rest use? Hyperic etc? #vmware #vcops","createddate":"2013-06-20T12:08:46","geolocation":null}
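Since each row is supposed to be a standalone JSON object, one way to narrow the problem down is to check the file line by line outside Hive before loading it. The following is only a minimal Python sketch and not part of the setup described here: the function name `find_bad_json_lines` and the sample lines are made up for illustration. It relies on Python's standard `json` module, whose parser may not behave exactly like the SerDe's, so a clean result does not guarantee the SerDe will accept the file.

```python
import json


def find_bad_json_lines(lines):
    """Return (line_number, message) pairs for lines that are not a single JSON object.

    Blank lines are skipped. Line numbers start at 1.
    """
    problems = []
    for number, line in enumerate(lines, start=1):
        text = line.strip()
        if not text:
            continue
        try:
            value = json.loads(text)
        except json.JSONDecodeError as exc:
            # exc.msg describes the failure; exc.colno is the 1-based column.
            problems.append((number, f"{exc.msg} (column {exc.colno})"))
            continue
        if not isinstance(value, dict):
            problems.append((number, "valid JSON, but not an object"))
    return problems


# Illustrative input only: line 2 is missing a comma, line 3 uses curly quotes.
sample_lines = [
    '{"user": {"screenname": "ChrisBeckett"}, "geolocation": null}',
    '{"user": {"screenname": "TasyootReyoot"} "geolocation": null}',
    '{“user”: {“screenname”: “ChrisBeckett”}}',
]

for number, message in find_bad_json_lines(sample_lines):
    print(f"line {number}: {message}")
```

To check a real file, the same function can be given an open file object for a local copy of the data, since iterating over a text file yields its lines.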
CREATE EXTERNAL TABLE sample_twitter_data (
  user STRUCT<
    userlocation:STRING,
    id:STRING,
    name:STRING,
    screenname:STRING,
    geoenabled:STRING>,
  tweetmessage STRING,
  createddate STRING
)
ROW FORMAT SERDE 'org.apache.hive.hcatalog.data.JsonSerDe'
LOCATION '/user/hdfs/sample_twitter_data';

LOAD DATA INPATH '/user/hdfs/sample_twitter_data.txt' OVERWRITE INTO TABLE sample_twitter_data;

select * from sample_twitter_data where user.screenname='ChrisBeckett';
I am wondering if anybody else has gotten this working. Any help is highly appreciated.