
Add support for Spark (reusing Hive code) #14

Open
karth295 opened this issue Sep 10, 2020 · 1 comment

Comments

@karth295
Contributor

Spark ships a fork of HiveServer2 (the Spark Thrift Server) to support JDBC: https://jaceklaskowski.gitbooks.io/mastering-spark-sql/content/spark-sql-thrift-server.html. Clients interact with it using Hive's JDBC driver.
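For context, a minimal sketch of talking to a Spark Thrift Server with the plain Hive JDBC driver (assumes `hive-jdbc` is on the classpath; the host is a placeholder, and 10000 is the server's default port):

```java
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.ResultSet;
import java.sql.Statement;

public class SparkThriftServerExample {
    public static void main(String[] args) throws Exception {
        // The Spark Thrift Server speaks the HiveServer2 wire protocol,
        // so a standard Hive JDBC URL works; 10000 is the default port.
        String url = "jdbc:hive2://localhost:10000/default";
        try (Connection conn = DriverManager.getConnection(url, "", "");
             Statement stmt = conn.createStatement();
             ResultSet rs = stmt.executeQuery("SHOW TABLES")) {
            while (rs.next()) {
                System.out.println(rs.getString(1));
            }
        }
    }
}
```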

This means that the bulk of what we need is already done. Here are the remaining TODOs, afaik:

  1. Create and document an init action to start a Spark thrift server. Part of that init action will be to configure Knox to expose the Spark thrift server.

  2. Change the JDBC connector to accept jdbc:dataproc://spark and translate it to the Component Gateway path for Spark (see the sketch after this list).

  3. Update the README to reflect this.
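For item 2, a rough sketch of what the translation might look like. The class name, regex, and gateway paths here are assumptions for illustration, not the connector's actual code; the real httpPath depends on how Knox is configured:

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class DataprocUrlTranslation {
    // Matches jdbc:dataproc://hive or jdbc:dataproc://spark, with
    // optional ;key=value properties (ignored in this sketch).
    private static final Pattern DATAPROC_URL =
            Pattern.compile("jdbc:dataproc://(hive|spark)(;.*)?");

    static String toHiveJdbcUrl(String url, String gatewayHost) {
        Matcher m = DATAPROC_URL.matcher(url);
        if (!m.matches()) {
            throw new IllegalArgumentException("Not a dataproc JDBC URL: " + url);
        }
        // Assumed Knox topology paths -- the actual paths depend on how
        // the init action configures Knox to expose each thrift server.
        String httpPath = m.group(1).equals("spark")
                ? "gateway/default/sparkhive2"
                : "gateway/default/hiveserver2";
        return "jdbc:hive2://" + gatewayHost + ":443/;ssl=true;"
                + "transportMode=http;httpPath=" + httpPath;
    }

    public static void main(String[] args) {
        System.out.println(toHiveJdbcUrl("jdbc:dataproc://spark", "example.gateway.host"));
    }
}
```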

@karth295
Contributor Author

Init action: gs://hive-http-mode-init-action/spark-http-config.sh. Note that it disables the regular hive-server2 and runs Spark's hive-server2 in its place.

Now it's just a matter of documenting this init action in the README -- I'll leave this issue open for that.
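Once documented, README usage might look something like this. The connection properties and their names are placeholders (assuming the Spark variant takes the same properties as the existing Hive support), not the connector's confirmed interface:

```java
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.ResultSet;
import java.sql.Statement;

public class DataprocSparkJdbcExample {
    public static void main(String[] args) throws Exception {
        // Placeholder URL: project, region, and cluster values, and the
        // property names themselves, are hypothetical.
        String url = "jdbc:dataproc://spark;projectId=my-project;"
                + "region=us-central1;clusterName=my-cluster";
        try (Connection conn = DriverManager.getConnection(url);
             Statement stmt = conn.createStatement();
             ResultSet rs = stmt.executeQuery("SELECT 1")) {
            while (rs.next()) {
                System.out.println(rs.getInt(1));
            }
        }
    }
}
```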
