Extract Lexeme IDs in SPARQL Queries for Language Totals #110

mhmohona · 2024-03-19T02:37:07Z

Contributor checklist

This pull request is on a separate branch and not the main branch

Description

The SELECT statement in the query has been updated to include (REPLACE(STR(?lexeme), "http://www.wikidata.org/entity/", "") as ?lexemeID) as the first element. This modification ensures that the query returns the lexeme ID alongside the word category and its counts, aligning with the requirements outlined in the issue discussion and the specific approach suggested by Andrew.
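For anyone checking query output locally, here is a minimal Python sketch of what the `REPLACE(STR(?lexeme), ...)` expression does to each result. The helper name `lexeme_id_from_uri` and the sample ID `L123` are hypothetical and only for illustration; they are not part of Scribe-Data. Note that SPARQL's `REPLACE` treats its pattern as a regular expression, whereas this sketch uses a literal string replacement, which should give the same result for ordinary entity URIs.

```python
# Hypothetical helper (not part of Scribe-Data) mirroring the SPARQL expression
# REPLACE(STR(?lexeme), "http://www.wikidata.org/entity/", "").
WIKIDATA_ENTITY_PREFIX = "http://www.wikidata.org/entity/"


def lexeme_id_from_uri(lexeme_uri: str) -> str:
    """Return the bare lexeme ID from a Wikidata entity URI.

    Strings that do not contain the entity prefix are returned unchanged.
    """
    return lexeme_uri.replace(WIKIDATA_ENTITY_PREFIX, "")


if __name__ == "__main__":
    # "L123" is a made-up lexeme ID used only as sample input.
    print(lexeme_id_from_uri("http://www.wikidata.org/entity/L123"))
```

The query-side version has the advantage that downstream formatting code receives the short ID directly and does not need to know the entity URI prefix.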

Related issue

closes Option to grab the Wikidata lexemes for queried words #101

github-actions · 2024-03-19T02:37:26Z

Thank you for the pull request!

The Scribe team will do our best to address your contribution as soon as we can. The following is a checklist for maintainers to make sure this process goes as well as possible. Feel free to address the points below yourself in further commits if you realize that actions are needed :)

If you're not already a member of our public Matrix community, please consider joining! We'd suggest using Element as your Matrix client, and definitely join the General and Data rooms once you're in. It'd be great to have you!

Maintainer checklist

- The commit messages for the remote branch should be checked to make sure the contributor's email is set up correctly so that they receive credit for their contribution
  - The contributor's name and icon in remote commits should be the same as what appears in the PR
  - If there's a mismatch, the contributor needs to make sure that the email they use for GitHub matches what they have for git config user.email in their local Scribe-Data repo
- The CHANGELOG has been updated with a description of the changes for the upcoming release and the corresponding issue (if necessary)

andrewtavis · 2024-03-19T08:08:10Z

Thanks for this, @mhmohona! Will look to review it as soon as I can :) :)

andrewtavis · 2024-03-19T23:01:57Z

Hey @mhmohona 👋 Am realizing the directions here weren't quite what they should have been. The three queries you edited are actually the only three that don't need this change :) Specifically if you go into the extract_transform/languages directory and then find queries like extract_transform/languages/French/nouns/query_nouns.sparql, these are the files we want to add this line to 😊

Can you go through and remove the edits to the current files and send along versions of all instances of query_nouns.sparql, query_verbs.sparql and query_prepositions.sparql that have the lexemeID line included?

mhmohona · 2024-03-20T01:12:11Z

Oops! 🫠 Let me update it.

mhmohona · 2024-03-20T02:12:07Z

@andrewtavis this PR is up for review now!

mhmohona · 2024-03-20T02:14:35Z

src/scribe_data/extract_transform/languages/Arabic/nouns/query_nouns.sparql

@@ -1,7 +1,9 @@
 # All Arabic (Q13955) nouns.
 # Enter this query at https://query.wikidata.org/.

-SELECT DISTINCT ?lexeme ?noun WHERE {


@andrewtavis just one thing: this is where I'm supposed to make the change, isn't it?

Yes, that'd be great, @mhmohona :)

So are the changes I made sufficient?

This will be fine, @mhmohona :) I'll go through and do the review and add in the line for you 😊

andrewtavis

Changes I sent along were just minor formatting, @mhmohona 😊 Thanks for all the help here! 🚀

adding lexemes

db1380c

andrewtavis self-requested a review March 19, 2024 08:07

mhmohona added 5 commits March 20, 2024 07:33

update till danish

6dc50ae

update the rest

96ce930

fix small details

c919f3d

smaller fix

4fbbd08

smallest fix

9bf03d7

scribe-org#101 sparql query formatting and adding Greek verbs

bdf0bc2

andrewtavis approved these changes Mar 20, 2024

andrewtavis merged commit f6f593d into scribe-org:main Mar 20, 2024
2 checks passed

andrewtavis mentioned this pull request Jun 7, 2024

Simplify formatting process to lexeme based outputs rather than string based #142

Closed

2 tasks

mhmohona deleted the lexemes branch August 28, 2024 01:29
