Replies: 11 comments 17 replies
-
Hey @rickfish , Interesting question.
Yes, we run Conductor on AWS. And No, we use Cassandra for persistence and Queuing layer built on top of Redis.
I guess running benchmarks are the only way to find the answer. But, some of the theoretical limitations we could think of while scaling Conductor are:
Please let us know your findings if you pursue this; We're very curious to learn more and see if we can apply these learning where possible. |
Beta Was this translation helpful? Give feedback.
-
@rickfish : any news on this? I'm interested in the numbers. We also run conductor (with many things added on top) on openshift, but with an Oracle db so far. |
Beta Was this translation helpful? Give feedback.
-
@rickfish 100M per day is certainly possible with Conductor. The key things to consider here are going to:
Curious to know if you ran some benchmarks and would love to see the numbers if possible. cc: @Jiehong |
Beta Was this translation helpful? Give feedback.
-
+1 with the right setup, 100M is easy on this platform. |
Beta Was this translation helpful? Give feedback.
-
How to define the num of workflow? If 3 workflowId use one workflow name, is the num 1 or 3 ? |
Beta Was this translation helpful? Give feedback.
-
@rickfish Can you please also detail out your configuration using which you are running 300,000 workflows per day. An insight into the following will be very helpful
Thanks |
Beta Was this translation helpful? Give feedback.
-
@coderrr22 by HTTP service requests I don't mean HTTP tasks, I mean any of the Conductor REST service requests. We have two Conductor instances deployed on Kubernetes (OpenShift). Each is configured differently. One only runs the background threads and is allocated 2 pods (40 sweeper threads each) and one handles the REST service requests and has 4 pods. We use the following config (using 2.31 version of conductor) to disable the background threads on the REST instance: The background thread instance doesn't get forwarded any REST requests. |
Beta Was this translation helpful? Give feedback.
-
@mariomartucci 2 pods is a configuration setting in the kubernetes deployment config file that we use when deploying to RedHat OpenShift (the spec.replicas attribute I think) |
Beta Was this translation helpful? Give feedback.
-
@mark91m12 I got those properties while going through the code, not sure if they are all documented somewhere. Keep in mind this is the 2.31 version of Conductor not the 3.x so these property names probably changed. |
Beta Was this translation helpful? Give feedback.
-
Hi Marco. We can meet if you like. Not sure what software to use for that.
You decide. Keep in mind that the work I did in Conductor was for my former
client so I will have to remember some things...
…On Mon, Jul 18, 2022 at 4:50 AM Marco Amato ***@***.***> wrote:
hi @rickfish <https://github.com/rickfish>, first of all thank you for
your reply, yes i saw all these configuration variables in the CHANGELOG
file (with new names relative to the version 3.x), i really would
appriciate if you give me more information about your installation through
a direct channel, can we schedule a meeting or something like that?
—
Reply to this email directly, view it on GitHub
<#2299 (reply in thread)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AJMH6WBNGRA4J6MC77JMUFTVUULE5ANCNFSM46MP2RYA>
.
You are receiving this because you were mentioned.Message ID:
***@***.***>
|
Beta Was this translation helpful? Give feedback.
-
How many of you will be interested in having a conductor as a k8s operator? I can spend some time to start off the ground if there is good interest in it. |
Beta Was this translation helpful? Give feedback.
-
Currently we are using Conductor with a Postgres backend and Conductor is deployed on OpenShift. We are processing about 300,000 workflows per day that create around 1.5 million tasks in total. Performance is good and there is room for additional growth.
We have recently been approached by a team in our organization that wants to use Conductor to start 100 million workflows per day consistently. I am pretty sure that our current setup will not handle that volume as far as Postgres and our http infrastructure goes.
My question is: I believe Netflix has Conductor deployed on AWS and is using DynamoDB as the data store. Is this correct? If so, do you think it is possible to construct an environment that can handle this kind of volume and, if so, what do we need to consider?
Beta Was this translation helpful? Give feedback.
All reactions