As we could see, these sites have more categorized domains, which makes them ideal for complementing the categorization of domains. They use categories different from those used by Alexa (DMOZ) to classify domains.
Therefore, it is necessary to find a relationship between the categories of Alexa and those of the other sites. See this paper for reference (http://arxiv.org/pdf/1411.5281v1.pdf), and more specifically the Leacock-Chodorow similarity, which computes a similarity coefficient between pairs of words.
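As a rough illustration of how category matching could work: the Leacock-Chodorow measure scores two concepts in a taxonomy as `-log(len / (2 * D))`, where `len` is the number of nodes on the shortest path between them and `D` is the maximum depth of the taxonomy. The sketch below is only an assumption-laden example: the toy taxonomy, the category names, and the helper functions (`lch_similarity`, `best_match`) are hypothetical and not taken from the paper or from any of the sources.

```python
import math

# Hypothetical toy taxonomy (child -> parent). Not from the paper or any real source.
PARENT = {
    "root": None,
    "leisure": "root",
    "commerce": "root",
    "sports": "leisure",
    "games": "leisure",
    "shopping": "commerce",
    "finance": "commerce",
}


def path_to_root(node):
    """Return the list of nodes from `node` up to the root, inclusive."""
    path = []
    while node is not None:
        path.append(node)
        node = PARENT[node]
    return path


def max_depth():
    """Maximum depth of the taxonomy, counted in nodes."""
    return max(len(path_to_root(n)) for n in PARENT)


def shortest_path_edges(a, b):
    """Number of edges on the shortest path between a and b in the tree."""
    steps_from_a = {n: i for i, n in enumerate(path_to_root(a))}
    for j, n in enumerate(path_to_root(b)):
        if n in steps_from_a:  # first shared ancestor is the lowest one
            return steps_from_a[n] + j
    raise ValueError("no common ancestor")


def lch_similarity(a, b):
    """Leacock-Chodorow similarity: -log(len / (2 * D)), len counted in nodes."""
    length = shortest_path_edges(a, b) + 1
    return -math.log(length / (2.0 * max_depth()))


def best_match(category, candidates):
    """Pick the candidate category most similar to `category`."""
    return max(candidates, key=lambda c: lch_similarity(category, c))


print(best_match("sports", ["shopping", "games", "finance"]))
```

In practice the taxonomy would more likely come from a lexical database such as WordNet rather than a hand-written dictionary; the toy tree only keeps the example self-contained.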
Alexa offers limited coverage of websites, so we need to complement it with other sources that will help us improve and extend that coverage.
For that, we should use a combination of four other sources, as explained in this paper: http://arxiv.org/pdf/1411.5281v1.pdf
These are: Cyren, Google AdWords, McAfee and WebPulse.
Pointers to those sources are given in the paper. If we have any doubts, we can write to or call the paper's authors; I know them.
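One possible way to combine the sources is a simple fallback lookup: try Alexa first and consult the other sources only when Alexa has no category for the domain. This is a minimal sketch under stated assumptions: the in-memory dictionaries, the domains, the category labels, and the `categorize` function are all hypothetical placeholders, and the real sources would have to be queried through whatever access each of them provides.

```python
# Hypothetical in-memory tables standing in for the real sources.
# Domains and category labels are made up for illustration only.
ALEXA = {"example.com": "Shopping"}

OTHER_SOURCES = {
    "Cyren": {"example.org": "News"},
    "Google AdWords": {},
    "McAfee": {"example.net": "Games"},
    "WebPulse": {},
}


def categorize(domain):
    """Return (source_name, category) for a domain, or None if no source covers it."""
    if domain in ALEXA:
        return ("Alexa", ALEXA[domain])
    # Fall back to the complementary sources, in the order listed above.
    for source_name, table in OTHER_SOURCES.items():
        if domain in table:
            return (source_name, table[domain])
    return None


print(categorize("example.org"))
```

Keeping the source name next to the category makes it clear which taxonomy a label comes from, which matters because the category still has to be mapped back to Alexa's categories.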