{"id":168,"date":"2007-02-25T02:33:45","date_gmt":"2007-02-25T02:33:45","guid":{"rendered":"https:\/\/spampoison.com\/?p=168"},"modified":"2023-02-23T02:43:13","modified_gmt":"2023-02-23T02:43:13","slug":"spam-archive-the-largest-public-library-of-junk-email-on-the-internet","status":"publish","type":"post","link":"https:\/\/spampoison.com\/blog\/spam-archive-the-largest-public-library-of-junk-email-on-the-internet\/","title":{"rendered":"Spam Archive: the largest public library of junk e-mail on the Internet"},"content":{"rendered":"<p>Is your spouse dissatisfied with the size of your spam? A brand-new website has made several hundred thousand pieces of unsolicited commercial e-mail available for you to download today. Act now!<\/p>\n<p>After a quiet online debut in 2002, the <a href=\"http:\/\/www.spamarchive.org\/\">Spam Archive<\/a> is making quick strides toward becoming the largest public library of junk e-mail on the Internet.<\/p>\n<p>Paul Judge, director of research and development for CipherTrust, the e-mail security firm backing the project, says the site received roughly 5,000 forwarded messages a day during its first week.<\/p>\n<p>He predicts the archive will amass a corpus of 10 million unsolicited commercial e-mails over the next eight years. The archive&#8217;s FTP site will begin to make its spam available, 10,000 at a time, starting Dec. 4, 2002.<\/p>\n<p>People have never been so excited to get junk e-mail.<\/p>\n<p>&#8220;Its sheer size will make it an invaluable tool,&#8221; said programming language designer Paul Graham, who first made an open call for such an undertaking in his widely circulated treatise on spam filtering, A Plan For Spam, published online in August 2002.<\/p>\n<p>Filter builder William Yerazunis applauds the undertaking. He says antispammers need a common source of fresh spam.<\/p>\n<p>&#8220;I don&#8217;t retain spam that&#8217;s over a month old,&#8221; he said. 
&#8220;Spam has the same shelf life as fresh food.&#8221;<\/p>\n<p>Yerazunis created CRM114, a remarkably accurate filter, using his own private junk mail stash. But he said the archive will advance filter research.<\/p>\n<p>&#8220;You have to have repeatability&#8221; in producing and testing antispam software, he said. &#8220;It&#8217;s absolutely necessary for good science to get done.&#8221;<\/p>\n<p>Although a bevy of newsgroups and individual archives have been gathering spam for years, experts say they are too small and disorganized to provide researchers with meaningful data.<\/p>\n<p>On the other hand, the FTC maintains an enormous database of spam that receives 40,000 new e-mails every day.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Is your spouse dissatisfied with the size of your spam? A brand-new website has made several hundred thousand pieces of unsolicited commercial e-mail available for you to download today. Act now! After a quiet online debut in 2002, the Spam Archive is making quick strides toward becoming the largest public library of junk e-mail on 
[&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":169,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[2,6],"tags":[44,48,46,45,43,47],"class_list":["post-168","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-knowledge","category-news","tag-ciphertrust","tag-crm114","tag-junk-email","tag-paul-graham","tag-spam-archive","tag-william-yerazunis"],"_links":{"self":[{"href":"https:\/\/spampoison.com\/blog\/wp-json\/wp\/v2\/posts\/168","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/spampoison.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/spampoison.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/spampoison.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/spampoison.com\/blog\/wp-json\/wp\/v2\/comments?post=168"}],"version-history":[{"count":1,"href":"https:\/\/spampoison.com\/blog\/wp-json\/wp\/v2\/posts\/168\/revisions"}],"predecessor-version":[{"id":170,"href":"https:\/\/spampoison.com\/blog\/wp-json\/wp\/v2\/posts\/168\/revisions\/170"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/spampoison.com\/blog\/wp-json\/wp\/v2\/media\/169"}],"wp:attachment":[{"href":"https:\/\/spampoison.com\/blog\/wp-json\/wp\/v2\/media?parent=168"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/spampoison.com\/blog\/wp-json\/wp\/v2\/categories?post=168"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/spampoison.com\/blog\/wp-json\/wp\/v2\/tags?post=168"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}