Some key_words have multiple different clean_names #71
For example:
from flashtext import KeywordProcessor
keyword_processor = KeywordProcessor()
# "CNN" is listed under two different clean names
keyword_dict = {"news_channel": ["CNN", "CCTV", "BBC"], "neural_network": ["CNN", "RNN"]}
keyword_processor.add_keywords_from_dict(keyword_dict)
keyword_processor.extract_keywords('I like CNN')
We would like to get the following result:
"news_channel_|_neural_network"
The individual clean names can then be recovered with str.split():
"news_channel_|_neural_network".split('_|_') ==> ["news_channel", "neural_network"]
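
The idea can be sketched in plain Python without touching the library. This is only an illustration and not the code in this pull request: the helper names `merge_clean_names` and `split_clean_names` and the `SEPARATOR` constant are hypothetical, and keywords are compared case-sensitively for simplicity.

```python
from collections import defaultdict

# Assumed separator, taken from the example above.
SEPARATOR = "_|_"


def merge_clean_names(keyword_dict, separator=SEPARATOR):
    """Map each keyword to all of its clean names, joined by the separator.

    Hypothetical helper: clean names are kept in the order in which they
    appear in keyword_dict, and duplicates are skipped.
    """
    names_by_keyword = defaultdict(list)
    for clean_name, keywords in keyword_dict.items():
        for keyword in keywords:
            if clean_name not in names_by_keyword[keyword]:
                names_by_keyword[keyword].append(clean_name)
    return {
        keyword: separator.join(names)
        for keyword, names in names_by_keyword.items()
    }


def split_clean_names(joined, separator=SEPARATOR):
    """Recover the individual clean names from a joined clean name."""
    return joined.split(separator)


keyword_dict = {
    "news_channel": ["CNN", "CCTV", "BBC"],
    "neural_network": ["CNN", "RNN"],
}
merged = merge_clean_names(keyword_dict)
cnn_names = split_clean_names(merged["CNN"])
```

Here `merged["CNN"]` holds the joined clean name and `cnn_names` holds the two names again, while keywords with a single clean name such as "BBC" are left unchanged. A separator like this only works if no clean name contains it.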