
Improve transport.Post Do method #3373

Merged 3 commits from lkeix:master into 99designs:master on Nov 15, 2024
Conversation

@lkeix (Contributor) commented Nov 13, 2024

Description

This PR optimizes the Do method in transport.Post by addressing memory allocation inefficiencies. Specifically:

  1. Removed unnecessary []byte to string conversions in the Do method, which were causing additional memory allocations.
  2. Introduced sync.Pool for graphql.RawParams to enable memory reuse, reducing overall allocations (see the sketch below).

These changes aim to improve the performance of the gqlgen server, especially in high-throughput scenarios.
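
For illustration, here is a minimal sketch of the pooling pattern described in point 2. It is not the merged code, just the general shape of reusing graphql.RawParams values across requests; the helper names getRawParams and putRawParams are made up for this example.

package transport

import (
	"sync"

	"github.com/99designs/gqlgen/graphql"
)

var rawParamsPool = sync.Pool{
	New: func() any {
		return &graphql.RawParams{}
	},
}

func getRawParams() *graphql.RawParams {
	return rawParamsPool.Get().(*graphql.RawParams)
}

func putRawParams(p *graphql.RawParams) {
	// Reset the struct before returning it to the pool so the next request
	// does not observe stale query text or variables.
	*p = graphql.RawParams{}
	rawParamsPool.Put(p)
}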

Related Issue

Resolves #3372

Benchmark

I ran a simple benchmark 5 times.
The schema is the one generated by project initialization.

Resolver implementation:

// CreateTodo is the resolver for the createTodo field.
func (r *mutationResolver) CreateTodo(ctx context.Context, input model.NewTodo) (*model.Todo, error) {
	return &model.Todo{}, nil
}

// Todos is the resolver for the todos field.
func (r *queryResolver) Todos(ctx context.Context) ([]*model.Todo, error) {
	return []*model.Todo{}, nil
}
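
For reference, a benchmark in the shape described above might look like the following. This is a minimal sketch, not the exact code used in the PR; it assumes NewExecutableSchema, Config, and Resolver all live in a single graph package as produced by a default gqlgen init todo project (the layout varies between gqlgen versions), and the function name matches the BenchmarkReponse labels in the results below.

package graph

import (
	"net/http/httptest"
	"strings"
	"testing"

	"github.com/99designs/gqlgen/graphql/handler"
)

func BenchmarkReponse(b *testing.B) {
	// Build the stock gqlgen HTTP handler around the generated schema.
	srv := handler.NewDefaultServer(NewExecutableSchema(Config{Resolvers: &Resolver{}}))
	body := `{"query":"{ todos { id text done } }"}`

	b.ReportAllocs()
	for i := 0; i < b.N; i++ {
		req := httptest.NewRequest("POST", "/query", strings.NewReader(body))
		req.Header.Set("Content-Type", "application/json")
		w := httptest.NewRecorder()
		srv.ServeHTTP(w, req)
	}
}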

Result

Before

BenchmarkReponse-16    	    4557	    260146 ns/op	   21297 B/op	     157 allocs/op
BenchmarkReponse-16    	    3516	    308304 ns/op	   21351 B/op	     157 allocs/op
BenchmarkReponse-16    	    3307	    353268 ns/op	   21485 B/op	     157 allocs/op
BenchmarkReponse-16    	    3379	    343946 ns/op	   21434 B/op	     157 allocs/op
BenchmarkReponse-16    	    3438	    400341 ns/op	   21276 B/op	     157 allocs/op

After

BenchmarkReponse-16    	    4507	    253248 ns/op	   21395 B/op	     155 allocs/op
BenchmarkReponse-16    	    3798	    311202 ns/op	   21364 B/op	     155 allocs/op
BenchmarkReponse-16    	    3756	    312710 ns/op	   21268 B/op	     155 allocs/op
BenchmarkReponse-16    	    3168	    362442 ns/op	   21387 B/op	     155 allocs/op
BenchmarkReponse-16    	    2730	    523350 ns/op	   21286 B/op	     155 allocs/op

Checklist

I have:

  • Added tests covering the optimization (see testing)
  • Updated any relevant documentation (see docs)

@coveralls

Coverage Status: 59.578% (+0.03%) from 59.549% when pulling c4ba63b on lkeix:master into aaf44f5 on 99designs:master.

Review thread on the changed hunk in the Post transport's Do method:

 	resp := exec.DispatchError(ctx, gqlerror.List{gqlErr})
 	writeJson(w, resp)
 	return
 }

-	bodyReader := io.NopCloser(strings.NewReader(bodyString))
+	bodyReader := bytes.NewReader(bodyBytes)
 	if err = jsonDecode(bodyReader, &params); err != nil {
Collaborator

I'm a little nervous about removing the io.NopCloser here. We used to have quite a few issues with accidentally closing io.Reader more than once, and it was pretty hard to track down.

Collaborator

Ok, nevermind. It looks like that doesn't apply at all here (or at least it doesn't anymore). That was a different part of the codebase.

@StevenACoffman (Collaborator)

I greatly appreciate you looking into performance improvements, but I want to ask how you have picked which places to optimize here and in your other PR.

It would be extremely valuable to the entire project to identify where the biggest sources of allocations or CPU usage come from during a realistic load scenario. You may have already done that, but even if this is the most important place, I would like to hear what you found were the second or third, and why, as others in the community may be able to help with those.

Also, I'm not sure how far to take performance improvements in this area at the expense of readability and maintainability.

For instance, io.ReadAll is much more readable than using io.Copy, but it does produce more allocations. Take a made-up 10 MB text file (sample.txt):

package main

import (
    "bytes"
    "io"
    "os"
    "testing"
)

func BenchmarkReadAll(b *testing.B) {
    for i := 0; i < b.N; i++ {
        in, _ := os.Open("sample.txt")
        io.ReadAll(in)
        in.Close()
    }
}

func BenchmarkCopy(b *testing.B) {
    for i := 0; i < b.N; i++ {
        buf := &bytes.Buffer{}
        in, _ := os.Open("sample.txt")
        io.Copy(buf, in)
        in.Close()
    }
}

Output:

goos: linux
goarch: amd64
pkg: example.org
cpu: Intel(R) Xeon(R) CPU E5-2620 v4 @ 2.10GHz
BenchmarkReadAll-32           26      45445764 ns/op    102635996 B/op        43 allocs/op
BenchmarkCopy-32              48      23985133 ns/op    67108544 B/op         21 allocs/op

Or take for example this article: https://klotzandrew.com/blog/speeding-up-json-processing-in-go/
This is very easily read and maintained:

  bodyBytes, _ := ioutil.ReadAll(body)
  box := BoxType{}
  _ = json.Unmarshal(bodyBytes, &box)

But it can definitely be made faster and allocate much less at the expense of complicating it as in the article.
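
As one hedged illustration of that trade-off (not necessarily the exact technique from the article), decoding straight from the stream avoids the intermediate []byte that ReadAll plus Unmarshal requires, at the cost of a slightly less obvious flow. BoxType here is a placeholder type:

package main

import (
	"encoding/json"
	"io"
)

type BoxType struct {
	Name string `json:"name"`
}

func decodeBox(body io.Reader) (BoxType, error) {
	var box BoxType
	// json.Decoder reads from the stream directly instead of buffering the
	// whole body into memory first.
	err := json.NewDecoder(body).Decode(&box)
	return box, err
}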

The added complexity is worth it if the gains are significant in comparison to everything else, but if this is only a small part of the overall cost, then it is not.

With this PR, I can't tell how much your performance improvement in this area relates to the overall CPU/memory performance. If you have already worked that out, I would love it if you could share that.

@StevenACoffman merged commit 288848a into 99designs:master on Nov 15, 2024 (16 of 17 checks passed).
@StevenACoffman (Collaborator)

I'm merging the PR here, but I would still appreciate someone sharing an analysis of the most performance critical areas of gqlgen runtime.

@lkeix (Contributor, Author) commented Nov 20, 2024

@StevenACoffman
Apologies for the delayed response.
Thank you for merging this PR.

I will continue to share insights on gqlgen runtime bottlenecks in this issue.
