You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello,
The following warning is issued when processing one of the .xml from the example data:
Processing: paperetl/file/data/0.xml
/usr/local/lib/python3.10/dist-packages/paperetl/file/tei.py:35: XMLParsedAsHTMLWarning: It looks like you're parsing an XML document using an HTML parser. If this really is an HTML document (maybe it's XHTML?), you can ignore or filter this warning. If it's XML, you should know that using an XML parser will be more reliable. To parse this document as XML, make sure you have the lxml package installed, and pass the keyword argument features="xml" into the BeautifulSoup constructor.
soup = BeautifulSoup(stream, "lxml")
Any clue how to avoid/correct that?
Thanks a lot.
The text was updated successfully, but these errors were encountered:
Hello,
The following warning is issued when processing one of the .xml from the example data:
Processing: paperetl/file/data/0.xml
/usr/local/lib/python3.10/dist-packages/paperetl/file/tei.py:35: XMLParsedAsHTMLWarning: It looks like you're parsing an XML document using an HTML parser. If this really is an HTML document (maybe it's XHTML?), you can ignore or filter this warning. If it's XML, you should know that using an XML parser will be more reliable. To parse this document as XML, make sure you have the lxml package installed, and pass the keyword argument
features="xml"
into the BeautifulSoup constructor.soup = BeautifulSoup(stream, "lxml")
Any clue how to avoid/correct that?
Thanks a lot.
The text was updated successfully, but these errors were encountered: