Cheat Code
import pyodbc
import pandas as pd
pyodbc.autocommit = True
con = pyodbc.connect('DRIVER={Hortonworks Hive ODBC Driver};Host=MySuperCluster.azurehdinsight.net;Port=443;AuthMech=6;SSL=1;UID=SuperAdmin;PWD=[Can'tTellYou];HTTPPath=/hive2;', autocommit=True)
sql = "SELECT * FROM mydb.mytable limit 10"
data = pd.read_sql(sql,con)
print(data)
Another day another Python learning.
Today I tried to read the table from Hive which connected using PyODBC into Pandas Data Frame.
The idea is, once data available on Pandas Data Frame, I can run my queries and run my exploratory.
Python FTW!
