
Refactor the Run upload process #2172

Open
lampajr wants to merge 3 commits into master from run_upload_refactoring
Conversation

@lampajr (Member) commented Nov 15, 2024

Refactor the process such that runs and related datasets get created/persisted synchronously, whereas the dataset processing (label values calculation and so on) is deferred to the async process by making use of the dataset-event queue.

Fixes Issue

Fixes #1976

Changes proposed

This change proposes to slightly update how the Run upload is managed, moving from a completely synchronous process to a hybrid one.

The process can be summarized as follows:

  1. Persist the Run and any related datasets in the same transaction (assuming everything goes fine, of course).
  2. Once the run and datasets are flushed, send dataset events to the dataset-event queue so that label values (and change point detection and so on) are processed asynchronously.
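The two steps above can be sketched in a framework-free way. This is a hypothetical illustration of the pattern, not Horreum's actual code: the `Run`, `Dataset`, `DatasetEvent`, and `persist` names are invented, and a `BlockingQueue` stands in for the AMQ-backed dataset-event queue.

```java
import java.util.List;
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.LinkedBlockingQueue;

public class RunUploadSketch {
    record Dataset(int id) {}
    record Run(int id, List<Dataset> datasets) {}
    record DatasetEvent(int datasetId) {}

    // Stand-in for the dataset-event queue (backed by AMQ in the real system).
    static final BlockingQueue<DatasetEvent> datasetEventQueue = new LinkedBlockingQueue<>();

    static int uploadRun(Run run) {
        // Step 1: persist the run and its datasets synchronously
        // (in the real system this is a single DB transaction).
        persist(run);
        // Step 2: after the flush succeeds, enqueue one event per dataset;
        // label-value calculation happens asynchronously on the consumer side.
        for (Dataset ds : run.datasets()) {
            datasetEventQueue.add(new DatasetEvent(ds.id()));
        }
        // The caller gets the run id back immediately, without waiting
        // for the label values to be computed.
        return run.id();
    }

    static void persist(Run run) { /* DB write elided in this sketch */ }

    public static void main(String[] args) {
        int id = uploadRun(new Run(42, List.of(new Dataset(1), new Dataset(2))));
        System.out.println(id + " " + datasetEventQueue.size()); // 42 2
    }
}
```

The key property is that a failure in step 1 rolls back both the run and its datasets, while step 2 only runs once the data is safely persisted.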

TODOs/Open points:

  • What to do with the datastore uploads: should we follow the same process even if more than 10 runs are passed?
  • Change the API to always return 202 when the process succeeds, together with the runId that was created.
  • Adapt tests
  • Adapt the UI (nothing to do here)

Check List (Check all the applicable boxes)

  • My code follows the code style of this project.
  • My change requires changes to the documentation.
  • I have updated the documentation accordingly.
  • All new and existing tests passed.

@lampajr lampajr force-pushed the run_upload_refactoring branch 4 times, most recently from 40b5218 to 95d8918 Compare November 18, 2024 11:46
@lampajr (Member, Author) commented Nov 18, 2024

The behavior will change as follows:

  • Uploading a new Run
    • If the datastore has more than 10 runs, all of them will be processed asynchronously and the API will return 202 with an empty array.
    • If there are fewer than 10 runs, the process will persist the runs and their datasets and queue the dataset recalculation so that it is processed asynchronously. The API will return the list of run ids.
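The response rule described above can be sketched as follows. This is a hypothetical, self-contained illustration: the `ASYNC_THRESHOLD` constant, the `Response` record, and the `upload` method are invented names, not Horreum's actual API.

```java
import java.util.List;
import java.util.stream.IntStream;

public class UploadResponseSketch {
    // Assumed threshold from the comment above: more than 10 runs -> fully async.
    static final int ASYNC_THRESHOLD = 10;

    record Response(int status, List<Integer> runIds) {}

    static Response upload(List<Integer> runIds) {
        if (runIds.size() > ASYNC_THRESHOLD) {
            // Everything is deferred: the API answers 202 with an empty array.
            return new Response(202, List.of());
        }
        // Runs and datasets persisted synchronously (elided), dataset
        // recalculation queued for async processing (elided); the ids of
        // the persisted runs are returned to the caller.
        return new Response(202, List.copyOf(runIds));
    }

    public static void main(String[] args) {
        System.out.println(upload(IntStream.rangeClosed(1, 12).boxed().toList()).runIds()); // []
        System.out.println(upload(List.of(1, 2, 3)).runIds()); // [1, 2, 3]
    }
}
```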

Given that the transform() method is also called by other processes, if isRecalculation is set to true (which is not the case for a new Run upload) the dataset recalculation is processed synchronously.
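That branch can be sketched as below. This is a hypothetical stand-in, not Horreum's transform() signature: the queues and recalcLabelValues are invented names used only to show the sync-versus-queued split driven by the isRecalculation flag.

```java
import java.util.ArrayDeque;
import java.util.Queue;

public class TransformSketch {
    // Stand-ins for the dataset-event queue and the sync recalculation path.
    static final Queue<Integer> queued = new ArrayDeque<>();
    static final Queue<Integer> processedSync = new ArrayDeque<>();

    static void transform(int datasetId, boolean isRecalculation) {
        if (isRecalculation) {
            // Explicit recalculation requested by another flow: stay synchronous.
            recalcLabelValues(datasetId);
        } else {
            // New Run upload: defer the label-value work to the queue.
            queued.add(datasetId);
        }
    }

    static void recalcLabelValues(int datasetId) { processedSync.add(datasetId); }

    public static void main(String[] args) {
        transform(1, false);
        transform(2, true);
        System.out.println(queued + " " + processedSync); // [1] [2]
    }
}
```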

@lampajr lampajr force-pushed the run_upload_refactoring branch 3 times, most recently from f1661fe to 5fdd97d Compare November 19, 2024 10:00
@lampajr lampajr marked this pull request as ready for review November 19, 2024 11:16
@lampajr (Member, Author) commented Nov 19, 2024

Given that with the new implementation uploading a run is no longer fully synchronous, I had to change some tests to explicitly recalculate the labelValues.

Moreover, I created a different in-memory resource for AMQ so that we can skip processing events in the AMQ broker when not actually required, especially because they would throw errors since they do not run in the same transaction as the test.

Refactor the process such that runs and related datasets
get created/persisted synchronously whereas the dataset
processing (label values calculation and so on) is deferred
to the async process by making use of the dataset-event queue

Signed-off-by: Andrea Lamparelli <a.lamparelli95@gmail.com>
Signed-off-by: Andrea Lamparelli <a.lamparelli95@gmail.com>
With this resource we can have more control over async processing
and avoid async processing entirely when it is not needed

Signed-off-by: Andrea Lamparelli <a.lamparelli95@gmail.com>
@johnaohara (Member) commented Nov 21, 2024

Moreover, I created a different in-memory resource for AMQ

What do you mean by this? What happens in the event of a crash? We have an external AMQ instance to persist any messages that have not been processed, so we can recover in the event of a crash. Why do we need an in-memory resource?

@lampajr (Member, Author) commented Nov 21, 2024

Moreover, I created a different in-memory resource for AMQ

What do you mean by this? What happens in the event of a crash? We have an external AMQ instance to persist any messages that have not been processed, so we can recover in the event of a crash. Why do we need an in-memory resource?

Sorry, I should have mentioned that this is just for testing purposes, see horreum-backend/src/test/java/io/hyperfoil/tools/horreum/test/AMQPInMemoryResource.java. All tests that do not require or do not check async processing can use it. This avoids seeing some exceptions during the tests that are "expected" but can cause confusion.

@johnaohara (Member)

Sorry, I should have mentioned that this is just for testing purposes, see horreum-backend/src/test/java/io/hyperfoil/tools/horreum/test/AMQPInMemoryResource.java. All tests that do not require or do not check async processing can use it. This avoids seeing some exceptions during the tests that are "expected" but can cause confusion.

ok, thanks for clarifying

Successfully merging this pull request may close these issues:

Transaction killed while uploading a run