Combating Web Spam with TrustRank - Zoltan Gyongyi, Hector Garcia-Molina, Stanford University, and Jan Pedersen, Yahoo. Proceedings of the 30th VLDB Conference, 2004. The authors propose techniques which allow to semi-automatically identify reputable pages and then discover more good pages - http://dbpubs.stanford.edu:8090/pub/showDoc.Fulltext?lang=en&doc=2004-52&format=pdf&compression=&name=2004-52.pdf
Topical TrustRank: Using Topicality to Combat Web Spam - Baoning Wu, Vinay Goel and Brian D. Davison propose to partition the seed set used in TrustRank by topic and calculate trust scores for each topic separately, making use of the Open Directory Project. Paper presented to the 15th International World Wide W - http://www2006.org/programme/files/xhtml/3115/fp3115-wu/fp3115-wu-xhtml.html
Web Spam Taxonomy - By Zoltán Gyöngyi and Hector Garcia-Molina, Stanford University. First International Workshop on Adversarial Information Retrieval on the Web, May 2005. Offers a definition of spam and an overview on current spamming techniques. The ODP guidelines are q - http://airweb.cse.lehigh.edu/2005/gyongyi.pdf
A Reference Collection for Web Spam - Castillo, Carlos, et al.: Documentation of the Webspam-UK2006 collection, a publicly available reference collection for Web spam research. The crawl was seeded with ODP data. - http://www.dcc.uchile.cl/~ccastill/papers/castillo_2006_reference_collection_spam.pdf