A REST interface for interacting with Apache Spark from anywhere, enabling remote code execution and job submissions.
Apache Livy is an open-source REST service that provides a web interface for interacting with Apache Spark clusters remotely. It allows users to execute Spark code snippets, submit batch jobs, and manage Spark sessions via HTTP calls, eliminating the need for direct cluster access. Livy acts as a gateway that simplifies Spark integration into applications, scripts, or tools that can communicate over REST.
Data engineers, data scientists, and developers who need to programmatically interact with Apache Spark clusters from external systems, web applications, or automated pipelines.
Livy offers a standardized, language-agnostic REST API for Spark, enabling remote job submissions and interactive sessions without modifying existing Spark code. Its multi-user support and impersonation features make it suitable for shared, production environments where secure, scalable Spark access is required.
Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.
Provides a standardized REST API, enabling Spark interactions from any environment that can make HTTP calls, as highlighted in the description for remote job submissions and sessions.
Supports impersonation for shared clusters, allowing multiple users to securely access the same server, which is essential for production environments with team-based workflows.
Works with existing Spark programs without modifications, reducing integration effort and making it easy to adopt in legacy systems, as stated in the features.
Offers interactive shells and batch submissions in Scala, Python, R, and Java, catering to diverse data science and engineering teams with varied language preferences.
Requires pre-installed Spark, JDK, Python, R, and multiple dependencies like cloudpickle and requests-kerberos, making initial setup and maintenance non-trivial.
As an Apache incubator project, it may experience more breaking changes, less stable documentation, and slower community support compared to established Spark components.
Adds a REST layer between clients and Spark, which can introduce latency and resource usage compared to direct Spark submissions, especially for high-frequency or large-scale jobs.