This repository has been archived by the owner on Sep 27, 2022. It is now read-only.
Issues: appledora/mwparserfromhtml
Allow raw article html strings to be passed without all the additional metadata in the dump
low priority
#46
opened Sep 21, 2022 by appledora
Make sure change to external links is not breaking
low priority
#45
opened Sep 20, 2022 by appledora
Add logging to indicate mismatch between HTML spec version and html dumps version
low priority
on-going
#44
opened Sep 20, 2022 by appledora
Ensure clear connection between HTML nodes and plaintext
low priority
#43
opened Sep 15, 2022 by appledora
Contribution Guideline and Tutorial Notebook
low priority
review
#42
opened Sep 1, 2022 by appledora
Handle inline transclusion differently in plaintext extraction
low priority
#41
opened Aug 29, 2022 by appledora
Write a function to create a hierarchy tree of the HTML tags
low priority
#13
opened Jun 29, 2022 by appledora
Decide and Handle Categories that appear under WikiLink relations
low priority
#9
opened Jun 24, 2022 by appledora
Write a Link Normalization method with better coverage of different use cases
medium priority
#8
opened Jun 23, 2022 by appledora
Determine appropriate level of processing on instantiation of Article object
low priority
#7
opened Jun 23, 2022 by appledora
Decide whether to treat interwikilinks as internal or external links
low priority
#3
opened Jun 23, 2022 by appledora