
Fast Inventory Processing At Scale With Inventory Accelerator

Author: Steve Zhang

5 min read


At Fluent Commerce, we strive to provide our customers with the best solutions to stay ahead of their game. As a distributed order management platform, inventory management is crucial for all of our customers.

Failure to process inventory quickly and accurately can result in frequent occurrences of underselling or overselling, which in turn can lead to a cascade of negative consequences. These repercussions include cancelled orders, lost sales opportunities, and ultimately, dissatisfied customers.

We have introduced Global Inventory as a solution to meet our customers' diverse needs. With Global Inventory, customers have the flexibility to customize the logic and perform various actions, such as reserving, pre-selling, or updating inventory, based on their specific requirements.

Executing this logic can pose significant computational challenges, particularly when dealing with large datasets such as 20 million records. The processing speed may be compromised, and valuable resources could be wasted due to the repetitive nature of the data, where a small percentage (e.g., 5%) represents new information. This computational burden not only affects the efficiency of inventory processing for individual customers but can also have a negative impact on other customers operating within the same multi-tenant environment. When extensive resources are dedicated to processing redundant or unchanging data, it can disrupt the overall order and inventory management processes, impeding the smooth operation of other customers' businesses.

Introducing The Inventory Accelerator

To enhance the speed of inventory processing while optimizing computing resources, we recently introduced a solution called Inventory Accelerator, specifically designed to address these challenges. This feature employs data filtering techniques to examine incoming data before it undergoes further processing. By identifying and ignoring redundant or unchanged records, Inventory Accelerator significantly reduces the computational overhead associated with inventory processing. This also means that customers can process hundreds of millions of inventory updates efficiently, ensuring a seamless and expedited inventory processing workflow.

Before going into the details of the Inventory Accelerator, let us take a 30,000-foot view of the Fluent Commerce platform that is responsible for processing inventory batches.

The following diagram depicts the high-level execution flow of inventory batch processing with the Inventory Accelerator, followed by a short description of each step.

[Diagram: high-level execution flow of inventory batch processing with the Inventory Accelerator]
  • A customer sends one or more inventory batch requests via the batch APIs in JSON format.
  • The Batch API streams the files to an AWS S3 bucket and also records metadata in the customer’s database. Each customer’s inventory data is stored separately.
  • Whenever new inventory batch files arrive in the inventory batch S3 bucket, the bucket emits notifications via AWS EventBridge. This event-driven approach allows us to decouple downstream applications and provide better scalability. The Inventory Accelerator subscribes to the event notifications emitted by the inventory batch S3 bucket; the types of events it listens to are configured with an Amazon EventBridge rule.
  • The primary task of the Accelerator is to identify the inventory items that have actually changed based on certain criteria. For this, it uses the latest snapshot data from the customer’s inventory data and the uploaded inventory batch items to find the items that need to be processed.
  • Once the inventory batch items to be processed are identified, an event is generated for each inventory batch item and is fed into an SQS queue.
  • The Fluent Commerce Orchestration engine picks up these events and executes the customer-specific Global Inventory workflows. A workflow is a blueprint of what needs to happen and carries the customer-specific processing logic; in this case, the workflow for processing inventory is executed.
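To make the event-routing step concrete, here is a toy illustration of how an EventBridge rule pattern selects S3 "Object Created" events. The bucket name, pattern fields, and the `matches` helper are all assumptions for illustration, not the platform's actual configuration; real EventBridge pattern matching is richer than this sketch.

```python
# Hypothetical EventBridge event pattern: route "Object Created"
# notifications from the inventory batch bucket (names are illustrative).
EVENT_PATTERN = {
    "source": ["aws.s3"],
    "detail-type": ["Object Created"],
    "detail": {"bucket": {"name": ["inventory-batch-bucket"]}},
}

def matches(pattern: dict, event: dict) -> bool:
    """Tiny subset of EventBridge pattern matching: every pattern key must
    exist in the event; nested dicts recurse, scalar values must be in the
    pattern's allowed list."""
    for key, allowed in pattern.items():
        if key not in event:
            return False
        if isinstance(allowed, dict):
            if not matches(allowed, event[key]):
                return False
        elif event[key] not in allowed:
            return False
    return True

event = {
    "source": "aws.s3",
    "detail-type": "Object Created",
    "detail": {"bucket": {"name": "inventory-batch-bucket"},
               "object": {"key": "tenant-a/batch-0001.json"}},
}
print(matches(EVENT_PATTERN, event))  # True: this event would reach the Accelerator
```

An event from another source (or another bucket) simply fails the pattern check and never triggers the Accelerator, which is what keeps the downstream processing decoupled from unrelated S3 activity.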

How Does The Inventory Accelerator Work

The Inventory Accelerator (aka Accelerator) is built on top of the AWS Glue service. AWS Glue is a fully managed extract, transform, and load (ETL) service provided by Amazon Web Services (AWS). It is designed to simplify and automate the process of preparing and loading data for analytics, machine learning, and other data-driven tasks.

The Accelerator consists of two main components that work together, each implemented using Apache Spark.

  • The data syncing module - This module is responsible for constantly syncing inventory data (such as inventory catalogue, inventory position, and inventory quantity records) from the customer’s database and storing it in an efficient storage format called Apache Hudi to be used in the next steps.
  • Inventory Deduplication module - This module compares the inventory batch items the customer uploaded with the up-to-date inventory data snapshot maintained by the above module. The output will include inventory batch items that require further processing.
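A toy illustration of the two modules working together (a plain-Python stand-in for the Spark jobs; Hudi's upsert is mimicked by a dict merge, and all field names are assumptions):

```python
# Module 1 (data syncing): maintain an up-to-date snapshot, Hudi-style,
# by upserting the latest records keyed by inventory position.
def upsert_snapshot(snapshot, latest_records):
    for rec in latest_records:
        snapshot[rec["position_ref"]] = rec["on_hand"]
    return snapshot

# Module 2 (deduplication): keep only batch items whose quantity differs
# from the snapshot value (a simplification of the real criteria).
def filter_changed(batch_items, snapshot):
    return [i for i in batch_items
            if snapshot.get(i["position_ref"]) != i["qty"]]

snapshot = upsert_snapshot({}, [{"position_ref": "SKU-1:STORE-1", "on_hand": 10}])
batch = [{"position_ref": "SKU-1:STORE-1", "qty": 10},   # unchanged, dropped
         {"position_ref": "SKU-1:STORE-1", "qty": 12}]   # changed, kept
print(filter_changed(batch, snapshot))
```

In the real system both steps run as parallel Spark jobs over millions of rows; the point of the sketch is only the division of labour between keeping a current snapshot and comparing incoming items against it.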

Here is how the components are tied together in the Accelerator. The flow starts from left to right.

[Diagram: the data syncing and deduplication modules within the Accelerator, flowing left to right]

The Accelerator execution is triggered either by the arrival of new inventory batch items or by a fixed scheduler, whichever happens first. Then, the data syncing module runs multiple parallel jobs to capture the latest snapshot of inventory data in the relational database. Once the three entity snapshots are loaded, the deduplication module identifies the "changed inventory data" and produces events for those batch items.

When Does An Inventory Item Meet The Changed Inventory Data Definition

Before discussing the definition of a changed inventory data item, let us briefly introduce some terms used in the inventory management domain.

The inventory catalogue serves as a structure that facilitates the grouping of inventory position data. Each inventory position calculates the "on-hand" inventory available based on its associated quantities and location. To represent specific actions that occurred during a particular period, various types of inventory quantities are utilized, including SALE, CORRECTION, LAST_ON_HAND, or DELTA. For example, the SALE inventory quantity type records a sale event associated with the respective inventory position. In contrast, the CORRECTION type indicates a faulty item or any other issue that prevents the item from being sold. Lastly, the LAST_ON_HAND value represents the most recent on-hand value provided by the customer.
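To make the quantity types concrete, here is a hypothetical on-hand calculation. The arithmetic and type semantics below are illustrative assumptions, not the platform's exact formula: we start from LAST_ON_HAND and treat SALE and CORRECTION as subtractions.

```python
# Illustrative only: start from the customer's LAST_ON_HAND, then apply
# later events (SALE and CORRECTION both remove units from availability).
def on_hand(quantities):
    base = next(q["value"] for q in quantities if q["type"] == "LAST_ON_HAND")
    deductions = sum(q["value"] for q in quantities
                     if q["type"] in {"SALE", "CORRECTION"})
    return base - deductions

qs = [
    {"type": "LAST_ON_HAND", "value": 100},  # customer's last reported on-hand
    {"type": "SALE", "value": 3},            # 3 units sold since then
    {"type": "CORRECTION", "value": 2},      # 2 faulty units removed
]
print(on_hand(qs))  # 95 under these illustrative semantics
```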

The Accelerator filters out (i.e., skips) an inventory batch item if the item meets both of the conditions below.

  • the incoming batch item quantity value is the same as the last known value of LAST_ON_HAND inventory quantity type
  • the associated inventory position does not have SALE, CORRECTION, or DELTA inventory quantity types in ACTIVE or CREATED status
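The two conditions can be expressed as a simple predicate. This is a sketch: the type and status names follow the text above, while the field names and data shapes are assumptions.

```python
BLOCKING_TYPES = {"SALE", "CORRECTION", "DELTA"}
OPEN_STATUSES = {"ACTIVE", "CREATED"}

def should_filter_out(batch_item, position):
    """True if the Accelerator can safely skip this batch item."""
    # Condition 1: incoming quantity equals the last known LAST_ON_HAND value.
    same_on_hand = batch_item["qty"] == position["last_on_hand"]
    # Condition 2 (negated): no open SALE/CORRECTION/DELTA quantities exist.
    has_open_adjustments = any(
        q["type"] in BLOCKING_TYPES and q["status"] in OPEN_STATUSES
        for q in position["quantities"]
    )
    return same_on_hand and not has_open_adjustments

position = {"last_on_hand": 10,
            "quantities": [{"type": "SALE", "status": "COMPLETE"}]}
print(should_filter_out({"qty": 10}, position))  # True: unchanged, no open adjustments
```

If either the uploaded quantity differs from LAST_ON_HAND or an open adjustment exists, the item is kept and flows on to the Global Inventory workflow.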

Does This Approach Suit All Scenarios

The current implementation is primarily designed to be compatible with the out-of-the-box Global Inventory workflows, offering limited customization options. It supports certain customizations, such as customizing transient and active statuses of inventory quantities.

Performance Improvement

To evaluate the improvements, a set of performance tests was designed, implemented, and executed using 2,000 batches containing a total of 1 million batch items. Let's now examine the results obtained from these tests.

| Scenario | E2E execution time w/o the accelerator | E2E execution time with the accelerator | Improvement |
| --- | --- | --- | --- |
| 99% Redundant Inventory Data | 54 minutes | 34 minutes | 🔻37.00% |
| 92% Redundant Inventory Data | 58 minutes | 40 minutes | 🔻31.00% |
| 80% Redundant Inventory Data | 80 minutes | 57 minutes | 🔻28.75% |
| 50% Redundant Inventory Data | 65 minutes | 57 minutes | 🔻12.30% |

As the results show, the more redundant data a batch contains, the greater the performance gain delivered by the accelerator.

Steve Zhang

Senior Software Engineer II

Disclaimer: All information and resources found on community.fluentcommerce.com are based on the opinions of the author (unless otherwise noted). All information is intended to inspire and motivate well-thought-through technical decisions in line with Fluent Commerce recommended practices and principles.

Copyright © 2025 Fluent Commerce
