How do I set up automatic scaling for Kinesis with this utility?

Deploy the WAR file to a Java app server like Tomcat on Elastic Beanstalk, create a JSON configuration file with your scaling policies, and set the config-file-url to point to it. The app will then monitor CloudWatch metrics and adjust shards based on your thresholds.

Can this tool manage multiple Kinesis streams at once?

Yes, the autoscaling configuration supports an array of streamMonitor objects in the JSON file, allowing a single deployment to monitor and scale multiple streams with individual policies for each.

Kinesis scaling utility vs AWS on-demand capacity: which is better?

This utility offers more customization with configurable thresholds and cool-off periods, while AWS on-demand capacity is fully managed but with less granular control. Choose based on whether you need policy flexibility or hands-off management.

What happens if the autoscaling service fails or has errors?

From version .9.5.9, the service exits on fatal errors by default, but you can suppress this with the suppress-abort-on-fatal configuration. Check the application logs for specific error details and troubleshooting.

How to configure scaling based on both PUT and GET metrics?

In the JSON configuration, set scaleOnOperation to include both PUT and GET, and the utility will use a logic table to decide scaling actions based on combined utilization, as described in the README.

Is there a way to test my scaling policies before deploying?

You can use the manual ScalingClient for dry runs or reports, and review the TestScalingUtils.java file for examples of scaling behavior, but full testing requires deployment and monitoring of logs.

amazon-kinesis-scaling-utils — Amazon Kinesis Scaling Utility

What is amazon-kinesis-scaling-utils?

Amazon Kinesis Scaling Utility is an open-source tool that enables dynamic scaling of Amazon Kinesis Streams. It allows users to manually adjust shard counts or deploy an auto-scaling service that monitors stream metrics and automatically scales capacity up or down based on demand. It solves the problem of manually managing Kinesis stream sharding to match fluctuating data ingestion or consumption rates.

Target Audience

AWS developers and data engineers who manage Kinesis Streams with variable throughput and need to optimize cost and performance through automated scaling.

Value Proposition

Developers choose this utility because it provides a programmatic, auto-scaling approach to Kinesis stream management—similar to EC2 Auto Scaling—reducing operational overhead and ensuring streams have adequate capacity without manual intervention.

Overview

The Kinesis Scaling Utility is designed to give you the ability to scale Amazon Kinesis Streams in the same way that you scale EC2 Auto Scaling groups – up or down by a count or as a percentage of the total fleet. You can also simply scale to an exact number of Shards. There is no requirement for you to manage the allocation of the keyspace to Shards when using this API, as it is done automatically.

Use Cases

Best For

Automatically scaling Kinesis Streams based on PUT or GET rate thresholds
Reducing Kinesis costs by scaling down underutilized streams
Handling sudden spikes in data ingestion to Kinesis
Managing multiple Kinesis Streams with centralized auto-scaling policies
Simplifying shard management without manual keyspace allocation
Integrating Kinesis scaling with SNS for operational alerts

Not Ideal For

Streams with static, predictable throughput where manual scaling via AWS Console is sufficient
Organizations mandating AWS-native managed services for compliance or support reasons
Teams using infrastructure-as-code tools like Terraform that prefer integrated, declarative scaling modules
Applications requiring sub-minute scaling latency, as checks are interval-based (e.g., every 300 seconds)

Pros & Cons

Pros

Flexible Scaling Policies

Supports manual scaling by shard count or percentage and automatic scaling with configurable thresholds, cool-off periods, and SNS notifications via JSON, similar to EC2 Auto Scaling patterns.

Automatic Keyspace Management

Handles shard splitting and merging automatically during scaling operations, eliminating the need for manual partitioning and reducing operational complexity.

Dual Metric Monitoring

Scales based on both PUT and GET utilization rates, with configurable logic for combined metrics, ensuring comprehensive capacity adjustment for varied workloads.

Easy Deployment Options

Provides pre-built WAR files deployable on Elastic Beanstalk or any Java app server, with region-specific S3 URLs for quick setup and reduced manual effort.

Cons

Complex Initial Configuration

Requires deploying a Java web application and managing detailed JSON configuration files, which is more involved than using AWS native tools or simple scripts.

Potential Scaling Latency

Scaling decisions rely on periodic metric checks (e.g., configurable intervals like 300 seconds), so it may not instantly respond to rapid traffic spikes.

Version-Specific Behaviors

The README highlights specific behaviors in version .9.8.8, such as scalePct logic, which could lead to confusion and require careful testing and validation.

amazon-kinesis-scaling-utils

What is amazon-kinesis-scaling-utils?

Overview

Use Cases

Best For

Not Ideal For

Pros & Cons

Pros

Cons

Frequently Asked Questions

Related Projects

Found a gem we're missing?

amazon-kinesis-scaling-utils

What is amazon-kinesis-scaling-utils?

Overview

Use Cases

Best For

Not Ideal For

Pros & Cons

Pros

Cons

Frequently Asked Questions

Related Projects

Found a gem we're missing?