Automated Spam Removal

Webometric Analyst automatically removes URLs from a few web sites as likely to be spam. In URLs removed as spam can be found in a file with name ending in .spam.txt. The "long results" file ending in ".raw.txt" is the original data with the spam.

To override the automatic spam removal or to add your own domains, create a file with a name starting with the search file name and ending in .spam.txt. E.g., if the query file is wolverhampton.txt then the spam file would be wolverhampton.spam.txt. In this file enter a list of domain names of URLs to be eliminated as spam, one per line. For example, to remove all links from facebook as spam, one line could be