
Limiting incoming requests queue #212

Open
kinnalru opened this issue Oct 12, 2023 · 5 comments

Comments

@kinnalru

Is it possible to limit incoming request queue by queue length or timeout?
Something like max_request_queue_size and max_request_queue_time from Phusion Passenger?
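Falcon has no built-in equivalent today, but a rough sketch of what such a limit could look like at the Rack layer (the `RequestLimiter` name and `limit:` option are hypothetical, not Falcon API) is a middleware that sheds load once too many requests are in flight, similar in spirit to Passenger's `max_request_queue_size`:

```ruby
# Hypothetical sketch: a Rack middleware that rejects requests with 503
# once more than `limit` are in flight, instead of queueing indefinitely.
# Names are illustrative; this is not part of Falcon's API.
class RequestLimiter
  def initialize(app, limit: 64)
    @app = app
    @limit = limit
    @mutex = Mutex.new
    @in_flight = 0
  end

  def call(env)
    admitted = @mutex.synchronize do
      if @in_flight < @limit
        @in_flight += 1
        true
      else
        false
      end
    end

    # Shed load immediately when over the cap.
    return [503, {"content-type" => "text/plain"}, ["Server busy\n"]] unless admitted

    begin
      @app.call(env)
    ensure
      @mutex.synchronize { @in_flight -= 1 }
    end
  end
end
```

Usage in `config.ru` would be the usual `use RequestLimiter, limit: 64`. A timeout-based variant (like `max_request_queue_time`) would additionally need to park waiters and expire them, which is where server support would help.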

@ioquatix
Member

At this time, there is no such feature, and I agree it's a deficiency. I am planning to address this (configurable load balancing, rate limiting, etc.), but it may be a paid feature. How do you feel about that?

@kinnalru
Author

@ioquatix

It would be great if Falcon started bringing you money, but maintaining both versions (enterprise and free) is difficult.

@slewsys

slewsys commented Jan 7, 2024

> At this time, there is no such feature and I agree it's a deficiency. I am planning to address this (configurable load balancing, rate limiting, etc), but it may be a paid feature. How do you feel about that?

Nginx offers rate limiting and load balancing. You might have more success offering a hosting service, rather than charging for tooling.

@platbr

platbr commented Oct 9, 2024

> Is it possible to limit incoming request queue by queue length or timeout? Something like max_request_queue_size and max_request_queue_time from Phusion Passenger?

HAProxy offers a queue feature via maxconn. NGINX open source doesn't have a queue feature; only the Plus version does.
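For reference, a minimal HAProxy sketch of that maxconn-based queue (backend and server names are placeholders): each server accepts at most `maxconn` concurrent connections, excess requests wait in HAProxy's queue, and `timeout queue` bounds how long they wait before a 503 is returned.

```
backend falcon_app
    timeout queue 5s                         # give up on queued requests after 5s
    server app1 127.0.0.1:9292 maxconn 32    # requests beyond 32 in flight are queued
```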

@platbr

platbr commented Oct 11, 2024

Thoughts on Queueing Requests/Scaling

We moved from Puma to Falcon last week because it is much easier to handle scaling without needing to worry about the number of necessary threads or how CPU resources are allocated. We have five applications working together to provide all the functionality of our website (we sell tickets).

Before Falcon, we scaled based on Puma's backlog and CPU usage, but now we scale solely based on CPU. This change allows us to ensure that CPU resources are fully utilized. However, we had to drastically increase the amount of memory, which I assume is because Falcon handles more requests simultaneously.

Since Falcon doesn't have a request queue, I’ve been considering placing it behind HAProxy. However, this might reintroduce the same issue we faced with Puma, where we couldn’t fully utilize available CPU. On the other hand, it could help limit the number of simultaneous requests.

I believe a good approach for queueing requests could be to base it on available CPU. Is it possible to control the queue this way? Can available CPU be monitored effectively in a shared cloud environment (Kubernetes, inside a container, etc.)?
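On the "available CPU inside a container" question: one way to estimate it is to read the cgroup v2 quota. This is only a sketch under stated assumptions (the path `/sys/fs/cgroup/cpu.max` is the cgroup v2 layout; cgroup v1 uses different files, and the helper names here are made up):

```ruby
require "etc"

# /sys/fs/cgroup/cpu.max contains "<quota> <period>" in microseconds,
# or "max <period>" when the container has no CPU quota.
def parse_cpu_max(text)
  quota, period = text.split
  return nil if quota == "max"    # unlimited: caller should fall back
  quota.to_f / period.to_f        # effective core count, e.g. 200000/100000 = 2.0
end

# Estimate cores available to this process: cgroup quota if present,
# otherwise the host's processor count.
def available_cpus(path: "/sys/fs/cgroup/cpu.max")
  if File.readable?(path)
    parse_cpu_max(File.read(path)) || Etc.nprocessors
  else
    Etc.nprocessors
  end
end
```

A queue limit could then be derived from that number (e.g. some multiple of effective cores), though utilization-based feedback would need sampling over time rather than a one-off read.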

@ioquatix , Do you think this would be a good approach?
