A RESTful job server for Apache Spark that provides a service interface for submitting and managing Spark jobs, jars, and contexts.
Spark Job Server provides a RESTful HTTP interface for submitting and managing Apache Spark jobs, jars, and contexts. It lets developers drive Spark clusters through API calls, simplifying job deployment and management while supporting features such as persistent contexts and named object sharing.
Data engineers and developers working with Apache Spark who need a scalable, service-oriented way to submit and manage Spark jobs across clusters, especially in environments requiring REST APIs or multi-tenant job management.
Developers choose Spark Job Server because it offers a production-ready, extensible REST API for Spark, reducing the overhead of job submission and context management while providing enterprise features like authentication, high availability, and support for multiple Spark deployment modes.
REST job server for Apache Spark
Provides a comprehensive HTTP API for submitting, monitoring, and managing Spark jobs; abstracts away cluster-manager details; and supports both synchronous and asynchronous job execution for flexibility.
Compatible with Scala, Java, and Python jobs, allowing diverse teams to use their preferred language without modifying the core infrastructure.
Enables creation of long-running Spark contexts for low-latency job execution and resource sharing, as shown in the WordCount example with reusable contexts.
Facilitates caching and retrieval of RDDs or DataFrames by name, improving data reuse across jobs and reducing computation overhead.
Integrates with LDAP/Shiro and Keycloak for authentication and authorization, making it suitable for secure, multi-tenant environments.
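The REST workflow behind these features (uploading a jar, running a job synchronously or asynchronously, and creating a long-running context for reuse) can be sketched as a few URL builders. This is a minimal sketch: the endpoint paths, query parameters, and default port 8090 follow the project's documented API, while the helper names and the example app and class names are illustrative.

```python
import urllib.parse

# Spark Job Server's default port; adjust for your deployment.
BASE_URL = "http://localhost:8090"

def upload_jar_url(app_name):
    """POST a job jar (as the request body) to /jars/<appName>."""
    return f"{BASE_URL}/jars/{app_name}"

def run_job_url(app_name, class_path, context=None, sync=False):
    """POST to /jobs to start a job.

    sync=True blocks until the job finishes and returns its result;
    otherwise the server replies immediately with a job ID to poll.
    Passing a context name routes the job to an existing long-running
    SparkContext, avoiding per-job startup cost.
    """
    params = {"appName": app_name, "classPath": class_path}
    if context:
        params["context"] = context
    if sync:
        params["sync"] = "true"
    return f"{BASE_URL}/jobs?{urllib.parse.urlencode(params)}"

def create_context_url(name, settings=None):
    """POST to /contexts/<name> to create a persistent context.

    Settings such as num-cpu-cores or memory-per-node pass through
    as query parameters and size the shared SparkContext.
    """
    query = urllib.parse.urlencode(settings or {})
    return f"{BASE_URL}/contexts/{name}" + (f"?{query}" if query else "")

# Illustrative curl equivalents:
#   curl --data-binary @wordcount.jar localhost:8090/jars/wc
#   curl -d "" "localhost:8090/contexts/test-context?num-cpu-cores=4&memory-per-node=512m"
#   curl -d "input.string = a b c" \
#     "localhost:8090/jobs?appName=wc&classPath=spark.jobserver.WordCountExample&context=test-context&sync=true"
```

A low-latency synchronous run against a named context, for example, would use `run_job_url("wc", "spark.jobserver.WordCountExample", context="test-context", sync=True)`.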
HA deployment is labeled as beta in the README, indicating it may lack stability or full production readiness for critical fault-tolerant systems.
Requires manual configuration, multiple scripts (e.g., server_deploy.sh), and careful setup across different cluster managers, which can be error-prone and time-consuming.
In context-per-JVM mode, context processes do not shut down automatically and require manual cleanup, adding operational burden, as noted in the README's known issues.
Documentation is scattered across multiple markdown files and includes outdated links (e.g., Bintray migration), making it harder for new users to get started efficiently.