Rate Limiting

Explore our guide on API rate limiting to prevent misuse and ensure efficient operation. Dive into classifications, enforcement, and management across our product suite's API categories.

To safeguard our APIs from potential misuse due to coding errors, suboptimal integrations, and malicious activities, we implement proactive rate limiting. This document outlines the classifications, enforcement, and management of rate limits across various API categories critical to our product suite.

API operation categories

We classify API actions based on their relevance to the purchase process and the resources they consume. This classification helps minimize interference between processes, ensuring efficient operation. For example, we strive to prevent extensive settlement report processes from impacting new payment transaction capabilities.

The rate limit category for an API call is documented in the description of the operation.

API rate limit action categories are as follows:

Routing Ratelimit Categories	Description
Payment Transaction Capture	Includes crucial actions for capturing payment transactions.
Payment Transaction Management	Operations needed to manage a payment, and some optional operations for a payment capture. Covers actions related to managing payment transactions that are not essential for the capture process.
Dispute	Specifically addresses operations related to handling disputes.
Partner Management	Operations used to manage Partner accounts and additional Partner operations that are not related to onboarding.
Account Onboarding	Operations necessary to onboard a new Partner account. Includes resource-intensive tasks such as creating accounts, requiring synchronous calls to other APIs.
Settlement	Operations needed for processing of settlements. Includes actions that could affect rate limits in other areas, identified through access log analysis.
On Site Messaging	Operations needed for using the on-site-messaging functionality.
Customer Post Purchase	Operations focused on functionality for post-purchase for end-users.

Rate limit enforcement

Requests to the global API endpoint are processed in the closest data center relative to the caller's location. The rate limit configuration for a given Partner and Acquiring Partner is the same across all locations within a specific environment. However, the rate limit quota is enforced by each data center. Therefore, requests for the same Partner from different parts of the world may experience different rate limit statuses.

The rate limit mechanism tracks rate limits for API operations at two distinct levels:

Acquiring Partner level: This encompasses all operations, including those on sub-Partners.
Partner level: These limits apply uniquely to each Partner.

If either limit is reached, a request will be subject to rate limiting:

Rate Limit by API Category in Production	Partner	Acquiring Partner
account-onboarding	0/s	10/s
payment-transaction-capture	50/s	200/s
payment-transaction-management	200/s	500/s
dispute	20/s	50/s
settlement	0/s	150/s
Partner-management	20/s	50/s
on-site-messaging	200/s	500/s

Rate Limit by API Category in Test	Partner	Acquiring Partner
account-onboarding	0/s	5/s
payment-transaction-capture	12/s	50/s
payment-transaction-management	50/s	125/s
dispute	5/s	12/s
settlement	0/s	37/s
Partner-management	5/s	12/s
on-site-messaging	50/s	125/s

Handling rate limits

A rate-limited request returns an HTTP-status code 429 and headers indicating the remaining quota:

X-Ratelimit-Limit: information on the rate limits and the metering interval. The first item is the quota that is closest to being exceeded. This is followed by one or more rate limit quota policy descriptions.
X-Ratelimit-Reset: time in seconds until the current metering interval resets. For per-second rate limiting, this will always be “1”.
X-Ratelimit-Remaining: the approximate remaining quota of the rate limit.

When a rate limit is reached, we require implementing a retry mechanism with exponential back-off to prevent retry attempts from retriggering rate limiting; add jitter to the back-off as retrying all attempts together is likely to draw out the issue. In addition, Klarna suggests that a token bucket algorithm is used to control the global flow of requests from the calling system to the API. This will help to reduce the likelihood of rate limiting in future.

Rate limit quota policy

A rate limit quota policy element is describing a rate limit that has been evaluated for the request. More than one can be active at the same time. In the current API there will be two, one for the Acquiring Partner-level rate limit and one for the Partner rate limit.

Example of a single quota policy:

JAVASCRIPT

40;w=1;name="ratelimit-name"

The “40” is how many units are available within a refresh-interval.
The “w=1” describes the length in seconds of interval after which the rate limit quota is reset.
The “name” component is optional and should be seen as informational and can change without notice. Typical values for the name will include the level where the rate limit applies and the rate limit operation category.

Example (values approximate, provided on a best-effort basis):

JSON

X-Ratelimit-Limit: 40, 40;w=1;name="account_payment-request-capture",100;w=1;name="psp_payment-request-capture"
X-Ratelimit-Remaining: 15
X-Ratelimit-Reset: 1

The rate limit quota closest to being reached for the interval is 40 requests per second with the descriptive name “account_payment-request-capture”.
The quota window interval is 1 second.
There are 15 requests remaining within the 1-second time window before further requests are rejected.
There is 1 second until this quota resets.

Common causes for rate limiting

We aim to set rate limiting quotas to ensure that the majority of Partners experience no rate limitations for legitimate traffic. However, you may encounter rate limits in the following scenarios:

High request rate: Exceeding the defined rate limits due to a rapid influx of requests within a short timeframe may trigger rate limiting. To address this, it is advised to distribute requests evenly over the interval specified by informational headers and employ retries with exponential backoff.
Traffic surges: A sudden and substantial increase in traffic, such as during a flash sale, can deplete the rate limit quota. If you anticipate an upcoming event that may surpass your request quota, kindly reach out to Partner Support or contact your Account Manager for assistance.
Bot-generated requests: The presence of bots on a website may result in an increased frequency of requests, potentially leading to rate limiting as a preventive measure against web scraping or data harvesting. It is recommended to implement bot-detection methods and, if suspicious bot activity is detected, initiate authentication before making API calls.

Rate limiting change management

Adjustments to rate limiting protocols are managed per the Klarna change management process as outlined in Versioning and deprecation. Changes may occur without prior notice in response to abuse or consistent deviation from Klarna’s requirements.

It is crucial to monitor the rate limiting information returned in response headers to adapt to any adjustments effectively.

Testing

Live Environment

Running any performance tests in the live environment is not allowed. This prohibition covers scenarios such as load tests, stress tests, and other evaluations that might negatively impact live operations.

Test Environment

In the test environment, you may run limited performance tests to confirm that your merchant integration behaves as expected when a rate limit is reached. Note that the deliberately lower rate limits in this environment are designed to trigger such events more easily. These tests are only intended for verifying integration behavior and should not be used for capacity testing.