Hi @OlgaGKononova, can you explain how you made the symbol table?
I assumed the entries are HTML special symbols (named character entities), but some of them do not seem to render in Chrome. Please see this fiddle: https://jsfiddle.net/vfj3hw0q/1/ So I'm not sure that replacements such as &Agr; -> Α really work for our project. Maybe the table needs updating.
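One way to check the table without a browser is to compare its entity names against the HTML5 named character references that ship with Python's standard library. This is only a rough sketch: `sample_table` is a made-up excerpt, not the real table from this repo, and `is_html5_entity` is a hypothetical helper name. Names such as `Agr` look like they come from the older ISO 8879 Greek entity set, which would explain why a browser leaves them untouched.

```python
from html import unescape
from html.entities import html5


def is_html5_entity(name: str) -> bool:
    """Return True if &name; is a named character reference in HTML5."""
    return f"{name};" in html5


# Hypothetical excerpt of a symbol table: entity name -> intended character.
sample_table = {"Agr": "Α", "Alpha": "Α", "agr": "α", "alpha": "α"}

# Entity names that an HTML5 parser (and so a browser) will not resolve.
unsupported = sorted(name for name in sample_table if not is_html5_entity(name))
print(unsupported)

# html.unescape follows the HTML5 rules, so unknown names pass through unchanged.
print(repr(unescape("&Agr;")), repr(unescape("&Alpha;")))
```

Running the same check over the full table would list every entry that needs either a different entity name or a direct Unicode replacement.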
In addition, I found this package, https://github.com/chartbeat-labs/textacy, to be extremely useful. It does essentially the same thing as this repo, but with a nicer and more standard interface. Here is an example:
import textacy
text= 'ℏ. Then the mixtures were placed in alumina crucibles and sintered at 1200 ° C for 4 h in air. '
print(repr(text))
text = textacy.preprocess_text(text, fix_unicode=True)
print(repr(text))
Output:
'ℏ. Then the mixtures were placed in alumina crucibles and sintered at 1200\xa0° C for 4\xa0h in air. '
'ℏ. Then the mixtures were placed in alumina crucibles and sintered at 1200 ° C for 4 h in air.'
So I suggest looking into this package.
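For comparison, the whitespace part of the cleanup shown above can be approximated with the standard library alone. This is a minimal sketch and not textacy's implementation: `normalize_spaces` is a hypothetical helper, and the set of no-break space characters it handles is my own assumption.

```python
import re

# Assumed set of no-break space variants: U+00A0, U+2007, U+202F.
NBSP_RE = re.compile("[\u00a0\u2007\u202f]")


def normalize_spaces(text: str) -> str:
    """Replace no-break spaces with plain spaces and trim the ends."""
    text = NBSP_RE.sub(" ", text)
    # Collapse runs of spaces or tabs left behind by the replacement.
    text = re.sub(r"[ \t]{2,}", " ", text)
    return text.strip()


raw = "sintered at 1200\xa0° C for 4\xa0h in air. "
print(repr(normalize_spaces(raw)))
```

It does not attempt any of the encoding repair that `fix_unicode=True` performs, so it is not a replacement for the package.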