Advanced Queue Patterns

This example demonstrates how to build several advanced queue patterns with DBOS. For the full queues documentation, check out the queues tutorial.

All source code is available on GitHub.

Fair Queueing

Often, you have a queue with limited capacity and need to fairly divide that capacity among multiple tenants. For example, suppose your application can only process 5 workflows at a time and you don't want a single tenant to monopolize all that capacity. With fair queueing, you can limit each tenant to 1 workflow at a time while still allowing up to 5 workflows total across all tenants.

You can implement fair queueing in DBOS by combining a partitioned queue with a regular (non-partitioned) queue. You enforce per-tenant limits on the partitioned queue and global limits on the non-partitioned queue. To do that, first let's register the two queues and define a workflow:

DBOS.register_queue("concurrency-queue", concurrency=5)
DBOS.register_queue("partitioned-queue", partition_queue=True, concurrency=1)

# This workflow is fairly queued: at most five workflows can run concurrently,
# but no more than one per tenant.
@DBOS.workflow()
def fair_queue_workflow():
    time.sleep(5)

Next, let's create an endpoint to enqueue the workflow. It does not enqueue the workflow directly, but instead enqueues a "concurrency manager" workflow to the partitioned queue to enforce per-tenant limits:

@api.post("/workflows/fair_queue")
def submit_fair_queue(tenant_id: str):
    # Enqueue a "concurrency manager" workflow to the partitioned
    # queue to enforce per-partition limits.
    with SetEnqueueOptions(queue_partition_key=tenant_id):
        DBOS.enqueue_workflow("partitioned-queue", fair_queue_concurrency_manager)

The "concurrency manager" bridges the two queues, enqueueing the workflow on the non-partitioned queue and waiting for it to complete:

@DBOS.workflow()
def fair_queue_concurrency_manager():
    # The "concurrency manager" workflow enqueues the
    # workflow on the non-partitioned queue and
    # awaits its results to enforce global flow control limits.
    return DBOS.enqueue_workflow("concurrency-queue", fair_queue_workflow).get_result()

Because the "concurrency manager" has the same lifetime as the actual workflow, this pattern ensures both the partitioned queue's per-tenant limits and the non-partitioned queue's global concurrency limits are respected. You can adapt this pattern to combine any per-tenant limits with any global limits.

Rate Limiting

Sometimes, you need to rate limit a workflow, limiting the number of workflows that can start in a given period. This is especially useful when using a rate-limited API, like many LLM APIs. You can do this by applying a rate limit to a queue. For example, here's a rate-limited queue and workflow:

DBOS.register_queue("rate-limited-queue", limiter={"limit": 2, "period": 10})

# This workflow is rate-limited: No more than two workflows can start per 10 seconds
@DBOS.workflow()
def rate_limited_queue_workflow():
    time.sleep(5)

If a rate-limit is defined with limit X and period Y, no more than X workflows can start per Y seconds. Rate limits are global across all DBOS processes using a queue.

You can enqueue a workflow on a rate-limited queue like with any other queue:

@api.post("/workflows/rate_limited_queue")
def submit_rate_limited_queue():
    DBOS.enqueue_workflow("rate-limited-queue", rate_limited_queue_workflow)

Debouncing

Sometimes, you may receive many requests to start a workflow in quick succession, but you only want to start it once. For example, if a user is editing an input field, you may want to start a processing workflow only after some time has passed since the last edit.

Debouncing delays a workflow's execution until some time has passed since it was last called. To debounce a workflow, we define the workflow and queue and create a debouncer for the workflow:

DBOS.register_queue("debouncer-queue")

@DBOS.workflow()
def debouncer_workflow(tenant_id: str, input: str):
    print(f"Executing debounced workflow for tenant {tenant_id} with input {input}")
    time.sleep(5)

debouncer = Debouncer.create(debouncer_workflow, queue="debouncer-queue")

Then, we submit the workflow with the debouncer. This delays the workflow until a set time has passed since the last input is submitted for a tenant. When the workflow starts, it uses the last input receieved by the debouncer.

# Each time a new input is submitted for a tenant, debounce debouncer_workflow.
# The debouncer waits until 5 seconds after input stops being submitted for the tenant,
# then enqueues the workflow with the last input submitted.
@api.post("/workflows/debouncer")
def submit_debounced_workflow(tenant_id: str, input: str):
    debounce_key = tenant_id
    debounce_period_sec = 5
    debouncer.debounce(debounce_key, debounce_period_sec, tenant_id, input)

Learn more about debouncing in the reference.

Try it Yourself!

Clone and enter the dbos-demo-apps repository:

git clone https://github.com/dbos-inc/dbos-demo-apps.git
cd python/queue-patterns

Then follow the instructions in the README to run the example.

Fair Queueing​

Rate Limiting​

Debouncing​

Try it Yourself!​

Fair Queueing

Rate Limiting

Debouncing

Try it Yourself!