Cassandra and Spark

When you want to join a Cassandra table with data in Spark

https://github.com/datastax/spark-cassandra-connector/blob/master/doc/14_data_frames.md

Use the Spark connector made by DataStax, which you can loosely think of as the Cassandra vendor.

val createDDL =
  """CREATE TEMPORARY VIEW words
    |USING org.apache.spark.sql.cassandra
    |OPTIONS (
    |  table "words",
    |  keyspace "test",
    |  cluster "Test Cluster",
    |  pushdown "true")""".stripMargin
spark.sql(createDDL) // Creates Catalog Entry registering an existing Cassandra Table

Map the Cassandra table to a temporary view (or table), then query it with Spark SQL.
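Once the view is registered, the join itself is ordinary Spark SQL. The following is a minimal sketch, not a definitive implementation. It assumes a Cassandra node reachable at 127.0.0.1, that the `words` table in keyspace `test` has columns `word` and `count`, and that the connector is on the classpath. The `labels` DataFrame and its columns are hypothetical stand-ins for whatever Spark-side data you want to join.

```scala
import org.apache.spark.sql.SparkSession

object CassandraJoinSketch {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("cassandra-join-sketch")
      // Assumption: Cassandra is reachable on localhost.
      .config("spark.cassandra.connection.host", "127.0.0.1")
      .getOrCreate()
    import spark.implicits._

    // Register the existing Cassandra table as a temporary view.
    spark.sql(
      """CREATE TEMPORARY VIEW words
        |USING org.apache.spark.sql.cassandra
        |OPTIONS (
        |  table "words",
        |  keyspace "test",
        |  cluster "Test Cluster",
        |  pushdown "true")""".stripMargin)

    // Hypothetical Spark-side data to join against.
    val labels = Seq(("foo", "greeting"), ("bar", "other")).toDF("word", "label")
    labels.createOrReplaceTempView("labels")

    // Join the Cassandra-backed view with the in-memory view.
    // Assumption: the Cassandra table has columns `word` and `count`.
    val joined = spark.sql(
      """SELECT w.word, w.`count`, l.label
        |FROM words w
        |JOIN labels l ON w.word = l.word""".stripMargin)

    joined.show()
    spark.stop()
  }
}
```

Because both sides are plain views in the Spark catalog, the same join could also be written with the DataFrame API, for example `spark.table("words").join(labels, "word")`.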

License: Attribution, NonCommercial, NoDerivatives
