revising lists on methods
ebeshero committed Jul 17, 2024
1 parent 9555e2b commit da81ef4
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion src/pages/method.astro
@@ -22,7 +22,7 @@ import {Image} from "@astrojs/image/components";
<p>We had to negotiate the different paradigms of markup in these digital editions, and doing so became the foundation of our method for machine-assisted collation. The collation had to include the XML tags that mark chapter and paragraph boundaries while ignoring the markers of lineation and surface boundaries. We also had to bundle the S-GA page XML files into clusters that align roughly with the structural divisions of the print editions. </p>

<p>We followed <a href="https://collatex.net/doc/">the Gothenburg Model</a> of computer-aided textual collation, which requires clarity on how we would:</p>
-<ul>
+<ul class="explan">
<li><strong>Tokenize</strong> the strings of text to be compared into base units: We would compare words and punctuation, and decide on markup to include as tokens supplying crucial data for comparison;</li>
<li><strong>Normalize</strong> certain literal differences as not meaningful: for example, we must instruct the collation algorithm that ”&amp;” is the same as ”and”, and also that the <code>&lt;milestone/&gt;</code> markup in the Shelley-Godwin Archive indicating the start of a new paragraph is the same as a start tag for a <code>&lt;p&gt;</code> element in the editions of the print texts. Our list of normalizing algorithms became very extensive over the course of this project.</li>
<li><strong>Align</strong> the texts by dividing them into 33 regular portions that share starting and ending points across the five editions. This was complicated by gaps in the manuscript, by heavily revised passages, and by alterations in the chapter divisions in the 1831 text. The alignment challenges with the MS notebooks are explained and visualized later.</li>
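The tokenize and normalize steps described in the list items above can be illustrated with a minimal Python sketch. This is not the project's actual code: the regular expression, the function names (`tokenize`, `normalize`, `normalized_tokens`), and the two normalization rules are assumptions chosen only to mirror the two examples given in the text (the ampersand and the paragraph milestone).

```python
import re

# Hypothetical token pattern: an XML tag, a character entity such as &amp;,
# a run of word characters, or a single punctuation mark.
TOKEN_RE = re.compile(r"<[^>]+>|&\w+;|\w+|[^\w\s]")


def tokenize(text):
    """Split a string into tag, entity, word, and punctuation tokens."""
    return TOKEN_RE.findall(text)


def normalize(token):
    """Map literal variants that should not count as differences to one form.

    Only the two rules mentioned in the text are sketched here; a real
    rule list would be much longer.
    """
    if token in ("&", "&amp;"):
        return "and"
    # Assumption: any <milestone .../> tag marks a paragraph start.
    if token.startswith("<milestone") or token == "<p>":
        return "<p>"
    return token


def normalized_tokens(text):
    """Tokenize a string and normalize each token for comparison."""
    return [normalize(t) for t in tokenize(text)]


if __name__ == "__main__":
    manuscript = "<milestone/>Victor &amp; Elizabeth."
    print_edition = "<p>Victor and Elizabeth."
    # Both witnesses reduce to the same token list after normalization.
    print(normalized_tokens(manuscript) == normalized_tokens(print_edition))
```

With rules like these, a collation tool compares the normalized forms while the original tokens remain available for display.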
