New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

SEAB-5604: submit github delivery event to s3 #162

Merged

hyunnaye merged 18 commits into develop from feature/seab-5604

Apr 18, 2024

Contributor

hyunnaye commented Mar 27, 2024 •

edited

Loading

Description
This PR adds code to the existing github lambda to send the message to a S3 bucket. The lambda uploads the event in the path date/deliveryid

Issue
https://ucsc-cgl.atlassian.net/browse/SEAB-5604

Security
If there are any concerns that require extra attention from the security team, highlight them here.

Please make sure that you've checked the following before submitting your pull request. Thanks!

Ensure that the PR targets the correct branch. Check the milestone or fix version of the ticket.


          initial

hyunnaye self-assigned this

github-advanced-security bot found potential problems

View reviewed changes

upsertGitHubTag/deployment/index.js Fixed Show fixed Hide fixed

hyunnaye added 5 commits

March 27, 2024 15:33


          unused

03cb382


          import fix

48ced08


          import fix

ca13b79


          if case

cf42ead


          Merge branch 'develop' into feature/seab-5604

02f3af1

hyunnaye marked this pull request as ready for review

April 12, 2024 20:07

hyunnaye requested review from coverbeck, denis-yuen, kathy-t and svonworl

April 12, 2024 20:10

coverbeck reviewed

View reviewed changes

upsertGitHubTag/deployment/index.js Outdated Show resolved Hide resolved

upsertGitHubTag/deployment/index.js Outdated Show resolved Hide resolved

upsertGitHubTag/deployment/index.js Outdated Show resolved Hide resolved

upsertGitHubTag/deployment/index.js Outdated

+                  const client = new S3Client({});
+                  const command = new PutObjectCommand({
+                    Bucket: process.env.BUCKET_NAME,
+                    Key: deliveryId,

Contributor

coverbeck Apr 12, 2024

This will put all objects in the "root" of the bucket. Is that what we want? If we know the deliveryId, then it works. If we don't know it, then it's going to be hard to find.

Is it worth putting them in keys by date/org or org/date?

I don't know the answer, just raising the question. It depends on how we expect to use this.

Contributor Author

hyunnaye Apr 15, 2024

It is now changed to date/repository/deliveryid

Contributor Author

hyunnaye Apr 18, 2024

I had to change it to date/deliveryid as there can be multiple repos in a single delivery.

kathy-t reviewed

View reviewed changes

upsertGitHubTag/deployment/index.js Show resolved Hide resolved

upsertGitHubTag/deployment/index.js

@@ @@ -253,6 +254,23 @@ function processEvent(event, callback) { @@
                       " from GitHub.",
                   });
                 }
+                // If bucket name is not null (had to put this for the integration test)
+                if (process.env.BUCKET_NAME) {

Contributor

kathy-t Apr 15, 2024

This is called after we invoke callback above...should it be before? Doesn't the lambda use callback to return a result? (general question, callbacks confuse me)

Contributor

coverbeck Apr 16, 2024

It looks like it will still execute after the callback: https://stackoverflow.com/questions/49688927/how-do-i-stop-execution-of-a-aws-lambda-after-a-callback. Although maybe you want to log earlier to avoid confusion. It doesn't look like it will affect performance either way

Contributor

coverbeck Apr 16, 2024

In looking at Kathy's question, I noticed the method is getting pretty big (we don't have a linter in place), so I'd optionally suggest creating a method out of this if block, e.g., logPayloadToS3().

Contributor Author

hyunnaye Apr 16, 2024 •

edited

Loading

How strongly do we feel about moving my s3 code to be before the callback? I avoided putting my code to s3 before the callback to avoid too many if-blocks because we want to avoid submitting the event if the event type is not supported. So, I put return in the else condition (line 257) above such that the new s3 code wouldn't be ran.


          pr review

cdd4480

github-advanced-security bot found potential problems

View reviewed changes

upsertGitHubTag/deployment/index.js Fixed Show fixed Hide fixed


          forgot semicolon

7a8df45

hyunnaye requested review from kathy-t and coverbeck

April 15, 2024 20:09

denis-yuen reviewed

View reviewed changes

Member

denis-yuen left a comment

a couple comments

upsertGitHubTag/deployment/index.js Outdated Show resolved Hide resolved

upsertGitHubTag/deployment/index.js Outdated Show resolved Hide resolved

coverbeck approved these changes

View reviewed changes

Contributor

coverbeck left a comment

Agreed with Denis' comments. Since today is my last day of the sprint and proposed changes seem pretty minor, approving.

upsertGitHubTag/deployment/index.js

@@ @@ -253,6 +254,23 @@ function processEvent(event, callback) { @@
                       " from GitHub.",
                   });
                 }
+                // If bucket name is not null (had to put this for the integration test)
+                if (process.env.BUCKET_NAME) {

Contributor

coverbeck Apr 16, 2024

It looks like it will still execute after the callback: https://stackoverflow.com/questions/49688927/how-do-i-stop-execution-of-a-aws-lambda-after-a-callback. Although maybe you want to log earlier to avoid confusion. It doesn't look like it will affect performance either way

upsertGitHubTag/deployment/index.js

@@ @@ -253,6 +254,23 @@ function processEvent(event, callback) { @@
                       " from GitHub.",
                   });
                 }
+                // If bucket name is not null (had to put this for the integration test)
+                if (process.env.BUCKET_NAME) {

Contributor

coverbeck Apr 16, 2024

In looking at Kathy's question, I noticed the method is getting pretty big (we don't have a linter in place), so I'd optionally suggest creating a method out of this if block, e.g., logPayloadToS3().

hyunnaye added 6 commits

April 16, 2024 15:03


          pr review

80e9063


          pr review

adf9b6e


          pr review

aa12586


          pr review

d176c4a


          eslint

cdb6b7d


          eslint

d02b3d2

hyunnaye requested a review from denis-yuen

April 16, 2024 19:51

coverbeck reviewed

View reviewed changes

upsertGitHubTag/deployment/index.js

+                  return;
+                }
+                // If bucket name is not null (had to put this for the integration test)
+                if (process.env.BUCKET_NAME) {

Contributor

coverbeck Apr 16, 2024

I make this part of the logPayloadToS3 method, logPayloadtoS3(body, bucketPath, deliveryId).

How strongly do we feel about moving my s3 code to be before the callback? I avoided putting my code to s3 before the callback to avoid too many if-blocks because we want to avoid submitting the event if the event type is not supported. So, I put return in the else condition (line 257) above such that the new s3 code wouldn't be ran.

Then if you want to do it before the callback, or in each of the conditions, you're only adding one line.

I don't feel too strongly about going before. Maybe add a comment before the method invocation that it will execute even if the callback has been invoked, since that caused some confusion in the PR review.

denis-yuen approved these changes

View reviewed changes

Member

denis-yuen left a comment •

edited

Loading

With @coverbeck and @kathy-t comments (my feedback has been addressed)


          folder path fix

de3f3a9

svonworl approved these changes

View reviewed changes

Contributor

svonworl left a comment

Thanks for persevering, the existing code is super-duper confusing. If I remember correctly, there's actually two varieties of callbacks, both referenced as callback and invoked at various points in the code. And then, to muddy things further, there's the handleCallback function, a better name for which would be handleResponse.

kathy-t approved these changes

View reviewed changes

hyunnaye added 3 commits

April 18, 2024 10:12


          new helper function

ebbbcfc


          new helper function

8d8381d


          eslint

91f627b

hyunnaye merged commit 1385866 into develop

12 checks passed

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet