
[GLUTEN-8060][CORE] GlutenShuffleManager as a registry of shuffle managers #8084

Open. Wants to merge 9 commits into base: main.
Conversation

@zhztheplayer (Member) commented Nov 28, 2024

#8060

Prepare GlutenShuffleManager as the new shuffle manager implementation of Gluten.

The shuffle manager is simply a router for all shuffle managers registered through the ShuffleManagerRegistry.get().register API.

About backward compatibility:

  1. Use of ColumnarShuffleManager / CelebornShuffleManager / UniffleShuffleManager will be deprecated in the future (perhaps in the next few PRs). Once a Spark configuration like spark.shuffle.manager=....ColumnarShuffleManager is specified explicitly, Gluten will still run but will raise a warning indicating that the option is deprecated.
  2. Instead, use of GlutenShuffleManager will be recommended. GlutenShuffleManager will maintain a mechanism (either automatic or configuration-driven) to route shuffle manager calls to the underlying registered shuffle managers. Hence, spark.shuffle.manager=....GlutenShuffleManager will become the recommended setting whether or not RSS is required; see the config sketch after this list.

This PR is only API preparation, with essential unit tests. No existing production code is affected.

@github-actions bot added the CORE label (works for Gluten Core) on Nov 28, 2024
@zhztheplayer zhztheplayer marked this pull request as ready for review November 29, 2024 00:51
/** The internal shuffle manager instance used by GlutenShuffleManager. */
private class ShuffleManagerRouter(lookup: ShuffleManagerLookup) extends ShuffleManager {
  import ShuffleManagerRouter._
  private val cache = new Cache()
Member

Will this status be sensed by the Spark executor?

Member Author

I think so. The cache key is the shuffle ID, which is shared between the driver and executors.

Member

Will the mapping saved in cache.store also be synchronized to the executor? I am worried that calling cache.get(shuffleId) on the executor will not find the ShuffleManager.

Member Author

I see. It's likely a problem. Let me test, thanks.

Member Author, Nov 29, 2024

Perhaps a simple solution could be registering ShuffleHandle -> ShuffleManager mappings as well, which can be used for lookup on the executor side. Do you think that is feasible?

Member Author, Nov 29, 2024

@wForget I assume the current code is fine, given that the shuffle dependency builder (namely #getDependencies) is usually carried by the shuffle RDD and called from task code. I will run some tests later to prove this.

https://github.com/apache/spark/blob/master/core/src/main/scala/org/apache/spark/rdd/ShuffledRDD.scala

Member

> Perhaps a simple solution could be registering ShuffleHandle -> ShuffleManager mappings as well, which can be used for lookup on the executor side. Do you think that is feasible?

Sounds feasible.

@kerwin-zk (Contributor)

@zhztheplayer Although spark.shuffle.manager can be uniformly configured as GlutenShuffleManager, additional configurations are still required to choose between ColumnarShuffleManager, CelebornShuffleManager, or UniffleShuffleManager. Essentially, this is no different from the current setup where spark.shuffle.manager is configured to different managers. Moreover, from a certain perspective, the introduction of GlutenShuffleManager couples the various ShuffleManagers together, whereas the current fully decoupled setup seems clearer.

override def getWriter[K, V](
    handle: ShuffleHandle,
    mapId: Long,
    context: TaskContext,
    metrics: ShuffleWriteMetricsReporter): ShuffleWriter[K, V] = {
  // Route to the shuffle manager cached under this shuffle ID.
  cache.get(handle.shuffleId).getWriter(handle, mapId, context, metrics)
}
Member

It may be better to determine the appropriate ShuffleManager based on the ShuffleHandle.

Member Author

ShuffleHandle is not passed to ShuffleBlockResolver, so it could be tricky to use it as the key. Would you expect any potential issues if we use the shuffle ID as the key? Maybe we need another solution to avoid them.


@zhztheplayer (Member Author) commented Nov 29, 2024

@kerwin-zk

> Moreover, from a certain perspective, the introduction of GlutenShuffleManager couples the various ShuffleManagers together, whereas the current fully decoupled setup seems clearer.

I haven't quite understood where the new code coupling comes from. Ideally, this work lets us remove the proxied calls from the RSS shuffle managers to the default shuffle manager (each shuffle manager can be registered for the shuffle dependency types it can handle). So I see this as a decoupling of code rather than a coupling.

> Essentially, this is no different from the current setup where spark.shuffle.manager is configured to different managers.

But I think I understand your concern, especially this one. The reason I am proposing this is not simply code refactoring. The solution is a trade-off for letting users enable different backends at the same time; see the umbrella issue #6920. For that purpose, we have to develop a way to register the shuffle managers together so that Gluten can choose the right one to use. For example, if we introduce a GPU backend, we should give developers the flexibility to not make the GPU backend rely on the Velox backend, while still letting the query planner use Velox shuffle when the query plan is not offloaded to the GPU.

Maybe we can put off deprecating the current use of CelebornShuffleManager / UniffleShuffleManager? We can keep support for the legacy configurations for a considerably long period. I agree it is very normal for a user to want to specify a shuffle manager explicitly via Spark config. Does that work for you?

@zhztheplayer (Member Author)

> Do you originally want to implement a mixture of ESS and RSS like https://github.com/apache/incubator-uniffle/blob/master/client-spark/spark3/src/main/java/org/apache/spark/shuffle/DelegationRssShuffleManager.java and https://github.com/apache/celeborn/blob/3bf91929b6bd02974b5d15b4d6804c9b2b01cfc0/client-spark/spark-3/src/main/java/org/apache/spark/shuffle/celeborn/SparkShuffleManager.java#L165?

No, it's not intended as a refactor of the current shuffles. It's to prepare the base support for mixed backends: #6920.

@kerwin-zk (Contributor)

@zhztheplayer Thank you very much for your detailed explanation. I believe that #6920 indeed requires this feature. At the same time, retaining the current option that allows users to specify the ShuffleManager through configuration is also necessary. I think this is acceptable.

class ShuffleManagerRegistry private[ShuffleManagerRegistry] {
  import ShuffleManagerRegistry._
  private val all: mutable.Buffer[(LookupKey, String)] = mutable.Buffer()
  private val routerBuilders: mutable.Buffer[RouterBuilder] = mutable.Buffer()
Contributor

Could you explain in what cases there would be multiple RouterBuilders? Thanks!

Member Author

I think we usually have a single router builder at a time, so this practice can be simplified in the future with some kind of assertion (something like: if the router was already created, throw an error). Though it's benign at the moment.

Member Author, Nov 29, 2024

The reason we don't simply adopt lazy initialization here is that the instantiation of RouterBuilder is parameterized: it takes SparkConf and isDriver in its constructor. So we'd like to guard our code against a second invocation with different parameters, whether or not that is practically possible.

@zhztheplayer (Member Author)

cc @zzcclp @baibaichen
