fix: Don't encode `@` in project name #228

dominik003 · 2024-01-22T15:37:52Z

T4C uses an intersting encoding for project names described in more detail here. Since our projects shouldn't use any special characters anyways, we can avoid reproducing their encoding by simply adding characters to the safe list if encountering a project with such a character.

Resolves #142

T4C uses an intersting encoding for project names described in more detail [here](https://en.wikipedia.org/wiki/Internationalized_Resource_Identifier). Since our projects shouldn't use any special characters anyways, we can avoid reproducing their encoding by simply adding characters to the safe list if encountering a project with such a character.

MoritzWeber0 · 2024-01-22T17:00:41Z

In the references article, it is stated:

While URIs are limited to a subset of the US-ASCII character set (characters outside that set must be mapped to octets according to some unspecified character encoding, then percent-encoded), IRIs may additionally contain most characters from the Universal Character Set (Unicode/ISO 10646)

Therefore, we can just take all ISO_10646 characters and pass them as safe chars.

Since our projects shouldn't use any special characters anyways, we can avoid reproducing their encoding by simply adding characters to the safe list if encountering a project with such a character.

They SHOULDN'T in theory, but they do in practise. Therefore, I would prefer a more stable solution.

dominik003 · 2024-01-23T08:16:30Z

In the references article, it is stated:

While URIs are limited to a subset of the US-ASCII character set (characters outside that set must be mapped to octets according to some unspecified character encoding, then percent-encoded), IRIs may additionally contain most characters from the Universal Character Set (Unicode/ISO 10646)

Therefore, we can just take all ISO_10646 characters and pass them as safe chars.

Since our projects shouldn't use any special characters anyways, we can avoid reproducing their encoding by simply adding characters to the safe list if encountering a project with such a character.

They SHOULDN'T in theory, but they do in practise. Therefore, I would prefer a more stable solution.

Seems like I put the wrong link in there (not the actual link to the eclipse page which is the following: URI (EMF Documentation)). There it is stated:

This implementation uses Java's Unicode char and String representations, and makes no attempt to encode characters 0xA0 and above. Characters in the range 0x80-0x9F are still escaped. In this respect, EMF's notion of a URI is actually more like an IRI (Internationalized Resource Identifier), for which an RFC is now in draft form.

So as far as I see there is not just a clear set of characters we can ignore when quoting or how would you do that?

MoritzWeber0

Looks good for now, however we should investigate it in more detail: #229

dominik003 requested a review from MoritzWeber0 as a code owner January 22, 2024 15:37

MoritzWeber0 mentioned this pull request Feb 16, 2024

Backup image fails for projects with special characters in the project name #229

Open

MoritzWeber0 approved these changes Feb 16, 2024

View reviewed changes

MoritzWeber0 merged commit cdf5ec7 into main Feb 16, 2024
18 checks passed

MoritzWeber0 deleted the fix-import-project-name-encoding branch February 16, 2024 10:43

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: Don't encode `@` in project name #228

fix: Don't encode `@` in project name #228

dominik003 commented Jan 22, 2024

MoritzWeber0 commented Jan 22, 2024 •

edited

Loading

dominik003 commented Jan 23, 2024

MoritzWeber0 left a comment

fix: Don't encode @ in project name #228

fix: Don't encode @ in project name #228

Conversation

dominik003 commented Jan 22, 2024

MoritzWeber0 commented Jan 22, 2024 • edited Loading

dominik003 commented Jan 23, 2024

MoritzWeber0 left a comment

Choose a reason for hiding this comment

fix: Don't encode `@` in project name #228

fix: Don't encode `@` in project name #228

MoritzWeber0 commented Jan 22, 2024 •

edited

Loading