Automate incidents creation #586

JenySadadia · 2024-10-03T09:33:50Z

Automate the creation of incidents based on issues using the kcidb_match tool.

kcidb/tools/kcidb_match.py

JenySadadia · 2024-10-04T06:10:09Z

@helen-fornazier Thanks for the comments.
It would have been easier if you had committed the changes directly to this PR.
But no worries, I'll do it.

tales-aparecida

I'm so happy with how fast this is moving

kcidb/tools/kcidb_match.py

helen-fornazier · 2024-10-14T14:52:29Z

Proposal for command line tool usage:

cat issue.json | kcidb-issues --pattern-db-file=DB_NAME # this would save the patterns in `DB_NAME`

cat tests.json | kcidb-issues --pattern-db-file=DB_NAME # this would dump a JSON with the incidents (if any were created) matched against `DB_NAME`

With this, we can use the tool to debug, and also to validate an issue and a test example in the editor UI before submitting them to kcidb.

@tales-aparecida @JenySadadia @spbnick what do you think?

so we can drop all the options I had previously added there

helen-fornazier · 2024-10-14T15:35:39Z

> Proposal for command line tool usage:
>
> cat issue.json | kcidb-issues --pattern-db-file=DB_NAME # this would save the patterns in `DB_NAME`
>
> cat tests.json | kcidb-issues --pattern-db-file=DB_NAME # this would dump a JSON with the incidents (if any were created) matched against `DB_NAME`
>
> With this, we can use the tool to debug, and also to validate an issue and a test example in the editor UI before submitting them to kcidb.
>
> @tales-aparecida @JenySadadia @spbnick what do you think?
>
> so we can drop all the options I had previously added there

Following up on this comment: #586 (comment)

I think we can leave it to a second PR, no problem for me

kcidb/tools/kcidb_match.py

spbnick

Sorry to leave a possibly overwhelming review, asking to change things a lot. Let's discuss this and see what's viable at this moment. However, this would all be for nothing if I don't fix that performance problem I was working on before Plumbers, and don't enable notifications. This code would simply not be called without that. So I'll go and work on that, instead of going further into detailed review 🙈

kcidb/tools/kcidb_match.py

spbnick · 2024-10-15T12:15:18Z

kcidb/tools/kcidb_match.py

+        If snippet_lines == 0: the full log
+        If snippet_lines > 0: the first snippet_lines log lines
+        If snippet_lines < 0: the last snippet_lines log lines
+    """


Could you please document classes/methods/functions fully, including the arguments, following the conventions seen everywhere else in the code? Also, if you have a multi-line docstring, could you please start the text on the next line from the """? Thank you 🙏
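For illustration, a fully documented helper could look roughly like the sketch below. The function name `get_log_snippet` and the `log_text` argument are hypothetical; only the `snippet_lines` semantics are taken from the docstring quoted above, and the Args/Returns layout is an assumption about the conventions used elsewhere in the code.

```python
def get_log_snippet(log_text, snippet_lines):
    """
    Extract a snippet from a log.

    Args:
        log_text:       The full text of the log, a string.
        snippet_lines:  The number of lines to extract, an integer.
                        If zero, return the full log.
                        If positive, return that many first lines.
                        If negative, return that many last lines.

    Returns:
        The extracted snippet, a string.
    """
    assert isinstance(log_text, str)
    assert isinstance(snippet_lines, int)
    if snippet_lines == 0:
        return log_text
    # Keep line endings so the snippet is an exact slice of the log
    lines = log_text.splitlines(keepends=True)
    if snippet_lines > 0:
        return "".join(lines[:snippet_lines])
    # A negative count slices from the end of the list
    return "".join(lines[snippet_lines:])
```

Note that the multi-line docstring text starts on the line after the opening `"""`, as requested.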

spbnick · 2024-10-15T12:21:33Z

kcidb/tools/kcidb_match.py

+    """
+    try:
+        response = requests.get(url, timeout=60)
+        response.raise_for_status()


Do you think you could try to fetch the URL from the artifact cache first? Similarly to how it's done in the cache redirector:

kcidb/main.py, lines 576 to 577 in 37961dc:

    cache_client = get_cache_client()
    cache = cache_client.map(url_to_fetch, ttl=CACHE_REDIRECT_TTL)

This way we could save a little inbound traffic now, and once we have the cache fully working we could save a lot.
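A minimal sketch of the cache-first idea, kept independent of the real clients so it can run on its own. The names `fetch_cache_first`, `map_to_cache` and `fetch` are hypothetical: `map_to_cache` stands in for something like `cache_client.map(url, ttl=...)` and is assumed to return a cache URL, or None when the URL is not cached; `fetch` stands in for the existing `requests.get()`-based download.

```python
def fetch_cache_first(url, map_to_cache, fetch):
    """
    Fetch a URL's contents, preferring a cached copy when one exists.

    Args:
        url:            The original URL to fetch.
        map_to_cache:   A function mapping the original URL to the URL of
                        its cached copy, or to None, if it's not cached
                        (an assumption about the cache client's behavior).
        fetch:          A function downloading a URL and returning its
                        contents, raising OSError on failure.

    Returns:
        The fetched contents.
    """
    cached_url = map_to_cache(url)
    if cached_url is not None:
        try:
            return fetch(cached_url)
        except OSError:
            # The cached copy is unavailable, fall back to the origin
            pass
    return fetch(url)
```

With `requests`, a `fetch` wrapper that calls `raise_for_status()` would fit here, since `requests.RequestException` derives from `OSError`.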

spbnick · 2024-10-15T12:38:04Z

kcidb/tools/kcidb_match.py

+        return None
+    try:
+        raw_bytes = gzip.decompress(response.content)
+        text = raw_bytes.decode('utf-8')


This should not be necessary, nor possible, as the log URL should point to a plain log file. This should be fixed in Maestro. E.g. by configuring the web server to specify Transfer-Encoding: gzip and serving the files compressed transparently. Python requests would then decompress that automatically. Otherwise this not only hurts our code complexity, but also the experience of users who download those files and then have to decompress them manually.

@nuclearcat, @pawiecz, how hard would that be to fix?
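To make the suggestion concrete, here is a rough sketch of transparent compression in plain Python, without any web server. It assumes the compression is advertised through the `Content-Encoding` response header, which is the header HTTP clients such as `requests` act on when they decompress a body automatically. Both function names are hypothetical, and a real web server would do the first half through configuration rather than code.

```python
import gzip


def encode_log_response(log_bytes, accept_encoding):
    """
    Build response headers and body for a plain-text log file.

    Args:
        log_bytes:          The uncompressed log contents, bytes.
        accept_encoding:    The value of the client's Accept-Encoding
                            header, a string (empty, if absent).

    Returns:
        A tuple with a dictionary of headers, and the body bytes.
    """
    headers = {"Content-Type": "text/plain; charset=utf-8"}
    body = log_bytes
    # Simplified negotiation: ignores quality values such as "gzip;q=0"
    if "gzip" in accept_encoding.lower():
        body = gzip.compress(log_bytes)
        headers["Content-Encoding"] = "gzip"
    headers["Content-Length"] = str(len(body))
    return headers, body


def decode_log_response(headers, body):
    """
    Decode a response the way a decompressing HTTP client would.

    Args:
        headers:    The dictionary of response headers.
        body:       The response body bytes.

    Returns:
        The log text, a string.
    """
    if headers.get("Content-Encoding") == "gzip":
        body = gzip.decompress(body)
    return body.decode("utf-8")
```

Either way the URL names a plain log file: clients which accept gzip get it compressed on the wire, and everyone ends up with the same text without a manual decompression step.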

spbnick · 2024-10-15T13:01:54Z

kcidb/tools/kcidb_match.py

+    """Pattern validator class"""
+    def __init__(self):
+        self.schema = copy.deepcopy(kcidb.io.SCHEMA.json)
+        self.remove_required_fields(self.schema)


This doesn't need to be a class at all. You don't need multiple of them and it doesn't have any parameters. Please remake it into a module (this or a separate one). Also, it doesn't only validate patterns.
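As a sketch of what the module-level version could look like: plain functions plus one schema prepared at import time. `TOY_SCHEMA` is a made-up stand-in for `kcidb.io.SCHEMA.json`, and `relaxed_copy`, `NAME_MAP_KEYWORDS` and `PATTERN_SCHEMA` are hypothetical names; only `remove_required_fields` and the `copy.deepcopy()` call come from the code under review.

```python
import copy

# Keywords whose values map arbitrary names to subschemas. A name in such
# a map may itself be "required" and must not be taken for the keyword.
NAME_MAP_KEYWORDS = ("properties", "patternProperties", "definitions", "$defs")


def remove_required_fields(schema):
    """
    Remove all "required" keywords from a JSON schema, in place.

    Args:
        schema: The schema (or a part of one) to modify.
    """
    if isinstance(schema, dict):
        schema.pop("required", None)
        for keyword, value in schema.items():
            if keyword in NAME_MAP_KEYWORDS and isinstance(value, dict):
                # Keys here are names, only the values are schemas
                for subschema in value.values():
                    remove_required_fields(subschema)
            else:
                remove_required_fields(value)
    elif isinstance(schema, list):
        for item in schema:
            remove_required_fields(item)


def relaxed_copy(schema):
    """
    Create a copy of a JSON schema with no required fields.

    Args:
        schema: The schema to copy. Not modified.

    Returns:
        The relaxed copy of the schema.
    """
    relaxed = copy.deepcopy(schema)
    remove_required_fields(relaxed)
    return relaxed


# Toy stand-in for kcidb.io.SCHEMA.json
TOY_SCHEMA = {
    "type": "object",
    "properties": {
        "tests": {
            "type": "array",
            "items": {
                "type": "object",
                "properties": {
                    "id": {"type": "string"},
                    "required": {"type": "boolean"},
                },
                "required": ["id"],
            },
        },
    },
    "required": ["tests"],
}

# Prepared once at import time, no class or instance needed
PATTERN_SCHEMA = relaxed_copy(TOY_SCHEMA)
```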

spbnick · 2024-10-15T14:07:07Z

kcidb/tools/kcidb_match.py

+        kcidb_io_object = {"tests": [test._data],
+                           "builds": [test.build._data],
+                           "checkouts": [test.build.checkout._data]}
+        return self.generate_incidents_from_db(kcidb_io_object)


You cannot make an I/O object from an OO object in general, even if it looks like you can. You just cannot count on that. An OO object is a processed I/O object, and there can be data loss. Instead deal with OO objects directly everywhere. Where you need to process I/O objects from stdin and command-line interface, load them into an sqlite database using the database client, get them as OO objects from there, and then process.

OO objects were specifically made to make the things you're doing here easier to do, like walking related objects and so on. We can have a call this week and go over all the concerns and options regarding this.

BTW, there's also a schema for the raw OO data in kcidb.orm.data.

spbnick · 2024-10-15T14:08:52Z

kcidb/monitor/subscriptions/create_incidents.py

+    client = get_client()
+    if client:
+        incident_generator = kcidb_match.IncidentGenerator()
+        incidents = incident_generator.generate_incidents_from_test(test)


Considering that IncidentGenerator encapsulates a database client connection, why not create and get it similarly to get_client(), instead of creating a new one for every object matched?
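One possible shape for this is a lazily-created module-level instance behind a getter. The sketch below assumes `get_client()` follows a similar pattern; `get_incident_generator` is a hypothetical name, and the `IncidentGenerator` class here is only a stand-in which counts how many times it gets constructed.

```python
# The cached generator instance, or None if not created yet
_INCIDENT_GENERATOR = None


class IncidentGenerator:
    """Stand-in for kcidb_match.IncidentGenerator (assumed costly to create)"""

    # Number of instances created so far, for demonstration only
    instances_created = 0

    def __init__(self):
        IncidentGenerator.instances_created += 1


def get_incident_generator():
    """
    Create or retriethe cached incident generator.

    Returns:
        The incident generator shared by all calls within the process.
    """
    global _INCIDENT_GENERATOR
    if _INCIDENT_GENERATOR is None:
        _INCIDENT_GENERATOR = IncidentGenerator()
    return _INCIDENT_GENERATOR
```

The subscription would then call `get_incident_generator()` for each matched object, and the underlying database connection would be opened once per process instead of once per object.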

JenySadadia force-pushed the auto-create-incidents branch 2 times, most recently from e41e75b to b2c5392 Compare October 3, 2024 13:31

helen-fornazier reviewed Oct 3, 2024

JenySadadia force-pushed the auto-create-incidents branch 2 times, most recently from 7f7da72 to 0fe4538 Compare October 4, 2024 07:35

tales-aparecida reviewed Oct 4, 2024

JenySadadia force-pushed the auto-create-incidents branch 2 times, most recently from 1a29241 to a5f7531 Compare October 14, 2024 13:37

helen-fornazier previously approved these changes Oct 14, 2024

JenySadadia dismissed helen-fornazier’s stale review via c01e741 October 14, 2024 18:08

JenySadadia force-pushed the auto-create-incidents branch 2 times, most recently from c01e741 to 0c1eee0 Compare October 14, 2024 18:22

tales-aparecida reviewed Oct 14, 2024

JenySadadia force-pushed the auto-create-incidents branch from 0c1eee0 to 2c9a632 Compare October 15, 2024 05:28

JenySadadia had a problem deploying to staging October 15, 2024 05:44 — with GitHub Actions Failure

JenySadadia force-pushed the auto-create-incidents branch from 2c9a632 to 0a1babc Compare October 15, 2024 05:46

JenySadadia had a problem deploying to staging October 15, 2024 06:01 — with GitHub Actions Failure

JenySadadia force-pushed the auto-create-incidents branch 2 times, most recently from 9552aec to 094c755 Compare October 15, 2024 06:57

JenySadadia had a problem deploying to staging October 15, 2024 07:12 — with GitHub Actions Failure

JenySadadia requested a review from helen-fornazier October 15, 2024 07:29

helen-fornazier previously approved these changes Oct 15, 2024

spbnick requested changes Oct 15, 2024

JenySadadia dismissed helen-fornazier’s stale review via 0228687 October 15, 2024 14:11

JenySadadia force-pushed the auto-create-incidents branch from 094c755 to 0228687 Compare October 15, 2024 14:11

helen-fornazier and others added 2 commits October 17, 2024 16:26

kcidb: add kcidb_match tool

3de95ac

Add `tools` directory and add `kcidb_match.py` script there.

Co-authored-by: Jeny Sadadia <jeny.sadadia@collabora.com>
Signed-off-by: Jeny Sadadia <jeny.sadadia@collabora.com>

Automate incidents creation

5064d37

Add a subscription module `create_incidents.py` to create incidents automatically when builds and tests objects match with issue patterns.

Signed-off-by: Jeny Sadadia <jeny.sadadia@collabora.com>

JenySadadia force-pushed the auto-create-incidents branch from 0228687 to 5064d37 Compare October 17, 2024 10:56
