How to migrate from KCL 1.x to 3.x?

Follow the official migration guide linked in the README, which involves updating interfaces and security credential providers. AWS provides a detailed blog post and documentation outlining steps, but it requires code changes and testing due to breaking changes.

KCL vs Apache Kafka Consumer for AWS streaming?

KCL is optimized for Amazon Kinesis Data Streams and abstracts AWS-specific complexities like shard management, while Kafka Consumer is for self-managed or Confluent Kafka clusters. Choose KCL for tight AWS integration and managed features, but it locks you into AWS.

Does KCL support exactly-once processing?

No, KCL provides at-least-once delivery semantics, as stated in the Fault Tolerance feature. For exactly-once, you need to implement idempotency or deduplication logic in your application, which adds complexity.

How to set up KCL with Java for a new project?

Add the Maven dependency from the README, configure AWS credentials via profile or IAM roles, create a Kinesis stream in the AWS console, and implement the IRecordProcessor interface. The README links to a developer guide for step-by-step tutorials.

Can I use KCL with Python without Java?

Yes, via the MultiLangDaemon which allows writing processors in Python. AWS provides a Python wrapper library that abstracts the Java daemon details, but it still relies on Java for the underlying daemon process.

What monitoring tools does KCL integrate with?

KCL integrates with Amazon CloudWatch for consumer-level monitoring, providing metrics on checkpointing, record processing, and error rates automatically. This is built-in and requires minimal setup, as mentioned in the Monitoring feature.

amazon-kinesis-client — Java Client for Kinesis Data Streams

What is amazon-kinesis-client?

Amazon Kinesis Client Library (KCL) is a Java library for building applications that consume and process data from Amazon Kinesis Data Streams. It solves the problem of handling the operational complexities of distributed stream processing, such as load balancing, fault tolerance, and checkpointing, allowing developers to focus on their data processing logic.

Target Audience

Java developers building real-time data processing applications on AWS, particularly those who need to consume high-volume, streaming data from Kinesis Data Streams with reliability and scalability.

Value Proposition

Developers choose KCL because it provides a managed, production-ready framework for stream consumption that handles scaling, fault tolerance, and checkpointing automatically, reducing the need to build and maintain custom distributed systems infrastructure.

Overview

Client library for Amazon Kinesis

Use Cases

Best For

Building real-time analytics pipelines that consume data from Kinesis Data Streams
Creating fault-tolerant event processing applications on AWS
Developing scalable data consumers that need automatic load balancing
Implementing stream processing applications that require checkpointing for stateful processing
Writing multi-language stream processors using the MultiLangDaemon
Applications that need to handle Kinesis shard splits and merges automatically

Not Ideal For

Projects using non-AWS streaming platforms like Apache Kafka or Google Pub/Sub
Simple event processing with low data volume where lightweight consumers suffice
Teams seeking a fully serverless solution without managing worker instances
Applications requiring exactly-once message delivery semantics

Pros & Cons

Pros

Managed Scalability

Dynamically scales processing across workers, supporting manual or auto-scaling without load redistribution, as highlighted in the Scalability feature section of the README.

Robust Fault Tolerance

Provides at-least-once delivery and continuous processing during worker failures, with built-in mechanisms ensuring reliability even in distributed failures.

Automatic Stream Adaptation

Handles shard splits and merges seamlessly, maintaining ordering by processing child shards only after parent completion, as described in the Stream-Level Change Handling feature.

Multi-Language Flexibility

Supports Java natively and enables other languages through MultiLangDaemon, allowing diverse development teams to write processors in languages like Python without Java expertise.

Cons

Complex Version Migration

KCL 1.x is nearing end-of-support, requiring non-trivial interface and credential provider updates to migrate to 3.x, as warned in the IMPORTANT notice, adding maintenance overhead.

AWS SDK Dependency Risks

Specific AWS SDK for Java versions (2.27.19-2.27.23) cause DynamoDB exceptions with KCL 3.x, necessitating careful version management and updates, as highlighted in the warning box.

Vendor Lock-in

Tightly integrated with Amazon Kinesis and AWS ecosystem, making it unsuitable for multi-cloud or on-premises deployments without significant rework.

amazon-kinesis-client

What is amazon-kinesis-client?

Overview

Use Cases

Best For

Not Ideal For

Pros & Cons

Pros

Cons

Frequently Asked Questions

Related Projects

Found a gem we're missing?

amazon-kinesis-client

What is amazon-kinesis-client?

Overview

Use Cases

Best For

Not Ideal For

Pros & Cons

Pros

Cons

Frequently Asked Questions

Related Projects

Found a gem we're missing?