Ranger is clearly configured correctly on HDP3,

but Hive tables are not visible in Zeppelin or Spark.

They were definitely visible back on 2.6,

so HDP 3 seems to have become quite unfriendly.

The answer is very simple: on the client node, copy hive-site.xml into the spark2 conf directory.

  cp /etc/hive/conf/hive-site.xml /etc/spark2/conf

This all used to be configured automatically, but with HDP3 it apparently has to be done by hand now.
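A quick way to check whether the copy took effect (a sketch, assuming spark-shell is launched on the same client node and `spark` is the session it provides):

```scala
// After copying hive-site.xml, the Hive metastore databases should be
// listed here, instead of only Spark's own "default" database.
spark.sql("SHOW DATABASES").show()
```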



https://community.hortonworks.com/questions/221974/zeppelin-not-showing-hive-databasetables-in-hdp30.html


Zeppelin : Not able to connect Hive Databases (through spark2) HDP3.0

I have installed Hortonworks hdp3.0 and configured Zeppelin as well.

When I run spark or sql, Zeppelin only shows me the default database (this is Spark's default database, whose location is '/apps/spark/warehouse', not Hive's default database). This is probably because the hive.metastore.warehouse.dir property is not being read from hive-site.xml, and Zeppelin is picking it up from the Spark config (spark.sql.warehouse.dir) instead.
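The warehouse path the session actually resolved can be inspected directly; a minimal diagnostic sketch, assuming the `spark` session object available in the notebook:

```scala
// If this prints /apps/spark/warehouse, the session never saw hive-site.xml
// and is falling back to Spark's own default warehouse location.
println(spark.conf.get("spark.sql.warehouse.dir"))
```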

I had a similar issue with Spark as well, and it was due to the hive-site.xml file in the spark-conf dir. I was able to resolve it by copying hive-site.xml from the hive-conf dir to the spark-conf dir.

I did the same for Zeppelin as well: copied hive-site.xml into the Zeppelin conf dir (where zeppelin-site.xml lives) and also into the zeppelin-external-dependency-conf dir.

But this did not resolve the issue.

*** Edit#1 - adding some additional information ***

I have created the Spark session with Hive support enabled through enableHiveSupport(), and even tried setting the spark.sql.warehouse.dir config property, but this did not help.

  import org.apache.spark.sql.SparkSession

  val spark = SparkSession.builder
    .appName("Test Zeppelin")
    .config("spark.sql.warehouse.dir", "/apps/hive/db")
    .enableHiveSupport()
    .getOrCreate()

Through some online help, I learned that Zeppelin uses only Spark's hive-site.xml file. I can view all Hive databases through Spark; it is only in Zeppelin (through spark2) that I am not able to access the Hive databases.

Additionally, Zeppelin is not letting me choose the programming language; by default it creates a session with Scala. I would prefer a Zeppelin session with pyspark.

Any help on this will be highly appreciated.


Answer by Shantanu Sharma

After copying hive-site.xml from the hive-conf dir to the spark-conf dir, I restarted the Spark services, which reverted those changes. I copied hive-site.xml again, and it's working now.


  namioto (http://namioto.github.io) 2019.05.02 11:54

    Hello. But if you do this, won't the configuration get overwritten whenever Spark is restarted from Ambari??

    • Yuika eizt (https://redeyesofangel.tistory.com) 2019.05.06 17:57

      To be precise, if you do that, the Spark Thrift Server (or was it the History Server?) dies while restarting.

      There were a few other concerns as well. At the time, besides this, there was another problem: Spark could not read ACID Hive tables (it looks like a new driver has come out since then).

      So for now I have gone back to the original configuration, and instead switched to keeping the data in external tables and having Spark process the ORC or Parquet files directly.
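The workaround described in this comment can be sketched roughly as follows. The path and table name are made-up examples, and it assumes the external table's files sit under a known HDFS location:

```scala
// Sketch: bypass the Hive ACID reader by reading the external table's
// ORC (or Parquet) files directly with Spark. Hypothetical path below.
val df = spark.read.orc("/warehouse/tablespace/external/hive/mydb.db/mytable")
df.createOrReplaceTempView("mytable")
spark.sql("SELECT COUNT(*) FROM mytable").show()
```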