SlideShare ist ein Scribd-Unternehmen logo
1 von 46
Downloaden Sie, um offline zu lesen
Webspam
  Dirk Haun
www.geeklog.net
Geeklog, Spam & me
• Geeklog:
  ‣ since Jan. 2002
  ‣ as a maintainer
    since 2004

• Spam as a problem:
  ‣ since mid-2004
  ‣ End of 2004: Poker
    Spam
Agenda


• What is webspam?
• What to do about it?
• Outlook
Types of Webspam


• Comment Spam
• Trackback Spam
• Referrer Spam
• more subtle ways
Comment Spam


• Comments
• Forums
• Guest books
Very good site...

Hi all!

[url=...]100% Free Lesbian Video[/url]
[url=...]Lesbian Teen[/url]
[url=...]Asian Teen Lesbian[/url]
[url=...]Mature Lesbian[/url]
[url=...]Woman Naked Pussy Lesbian[/url]
[url=...]Shemale Lesbian Sex Vidoes[/url]
[url=...]Skinny Lesbian Girls Having Sex[/url]
[url=...]Teen Blonde Lesbian[/url]
[url=...]Twins Sisters Video Lesbian[/url]
[url=...]xxx Free Lesbian Movie[/url]




Just the usual ...
[url=.../index.html]underground sex[/url]
[url=.../page=2.html]underlolitas[/url]
[url=.../page=3.html]underpants[/url]
[url=.../page=4.html]underwater erotica[/url]
[url=.../page=5.html]underwater fucking[/url]
[url=.../page=12.html]underwear models[/url]
[url=.../page=13.html]undies[/url]
[url=.../page=14.html]uniform porn[/url]
[url=.../page=15.html]uniform sex[/url]
[url=.../page=16.html]unique baby boys names
[/url]
[url=.../page=23.html]united airlines tickets
flights[/url]
[url=.../page=490.html]wellbutrin xl[/url]
[url=.../page=491.html]wellness dog food[/url]



All-in-one spam
This Website contains sexually-oriented adult
  content which may include visual images and
  verbal descriptions of nude adults, adults
  engaging in sexual acts, and other audio and
  visual materials of a sexually-explicit nature.

  Permission to enter this Website and to view
  and download its contents is strictly limited
  only to consenting adults who affirm that the
  following conditions apply:

  1. That you are at least 18 years of age or
  older, and that you are voluntarily choosing
  to view and access such sexually-explicit (...)




Spam with disclaimer
Wiki Spam


• everbody can edit -
  including spammers

• Spam sometimes
  hidden in older
  revisions
Trackback Spam


• in blogs: cross-site
  comments

• XML-RPC, clearly
  defined protocol

• similar: Pingback
  (URL only)
Referrer Spam


• faked referrers
• Blogs used to display
  them on their
  homepage

• usually invisible in
  the webserver logfile
66.49.223.233 - - [02/Jun/2007:04:11:07 -0400] quot;GET /
forum/viewtopic.php?showtopic=73271 HTTP/1.1quot; 403 26
quot;http://www.kzcarinsurance.info/12868-71-0.htmlquot; quot;Mozilla/
4.0 (compatible; MSIE 6.0; Windows NT 5.1)quot;

216.185.128.200 - - [02/Jun/2007:04:37:01 -0400] quot;GET /
forum/viewtopic.php?showtopic=21070 HTTP/1.1quot; 200
18384 quot;http://www.kzcarinsurance.info/38645-71-0.htmlquot;
quot;Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)quot;

66.49.223.233 - - [02/Jun/2007:05:02:14 -0400] quot;GET /
forum/viewtopic.php?showtopic=68994 HTTP/1.1quot; 403 26
quot;http://www.kzcarinsurance.info/62898-71-0.htmlquot; quot;Mozilla/
4.0 (compatible; MSIE 6.0; Windows NT 5.1)quot;

216.185.128.200 - - [02/Jun/2007:09:00:23 -0400] quot;GET /
article.php/To-do_20050606 HTTP/1.1quot; 200 20169 quot;http://
www.kzcarinsurance.info/224400-71-0.htmlquot; quot;Mozilla/4.0
(compatible; MSIE 6.0; Windows NT 5.1)quot;




Referrer Spam
More subtle spam

• Profile Spam
  ‣ List of members in
    forums

• almost on-topic posts
  ‣ Kudos, jokes,
    general questions
Stumbled onto geeklog.info for the
  first time today looks like
  someplace I needed to find a while
  ago.

  Just went from a slow dial up
  system to at DSL so I don't have
  to wait several minutes for a
  picture to arrive



Harmless posting ...
Stumbled onto geeklog.info for the first time today
looks li[url=http://webmeds.iespana.es/amoxicilin]
k[/url][url=http://webmeds.iespana.es/rogaine]e[/
url] [url=http://webmeds.iespana.es/seroquel]s[/
url][url=http://webmeds.iespana.es/oxycontin]o[/
url][url=http://webmeds.iespana.es/oxycodone]m[/
url][url=http://webmeds.iespana.es/viagra]e[/url]
[url=http://webmeds.iespana.es/celebrix]p[/url]
[url=http://webmeds.iespana.es/welbutrin]l[/url]
[url=http://webmeds.iespana.es/stop-smoking]a[/
url][url=http://webmeds.iespana.es/quit-smoking]c
[/url][url=http://webmeds.iespana.es/skelaxin]e[/
url] [url=http://webmeds.iespana.es/atenolol]I[/
url] [url=http://webmeds.iespana.es/fluconazole]n[/
url][url=http://webmeds.iespana.es/diflucan]e[/url]
[url=http://webmeds.iespana.es/ciales]e[/url]
[url=http://webmeds.iespana.es/xanex]d[/url]
[url=http://webmeds.iespana.es/aciclovir]e[/url]

... or maybe not
[url=http://webmeds.iespana.es/adderol]d[/url]
Motivation


• Pagerank
• Clickthroughs
• Test Spam
Pagerank


• not that much quot;mass
  spamquot; any more

• takes time to build
• Spamming older posts
Clickthroughs

• Get people onto their
  site

  ‣ Sale, Ads, Affiliate
• Throw-away domains
  ‣ Redirects
• Throw-away URLs
  ‣ old forums, etc.
Spam topics
                      24.-31. March 2007 (356 Spam posts)


     Pills                                                  137

    Porn                                         102

 Finance               23

Software         13

Ringtones        11

    misc.                              70

             0                    50               100            150
Spam topics

               misc.
               20%



Pills
38%               Ringtones
                     3%
                   Software
                      4%
                   Finance
                     6%




        Porn
        29%
Compare with email
        spam

• Keywords not
  obfuscated (V14gr4)

• No stock spam
  (time?)

• No spam in images
How they're spamming
• Spambots
  ‣ hijacked PCs or
     webservers

  ‣ Bulletproof hosting
  ‣ open proxies
• manual spam: very
  rarely

• quot;We'll spam for youquot;
I am amazed by the skills of some people here

#file=D:XRumerfreewebtown-general.txt




            Oops ...
I am amazed by the skills of some people here

  Hi..!! everyone!

  This is my first post on Yours site. Thank you in
  [url=http://www.freewebtown.com/topweb/louis-
  vuitton]a[/url](...)[url=http://
  www.freewebtown.com/topweb/credit-equity-home-
  line].[/url]
  I am From Canada
  Nice day is it today, but I have a question for all...

  In first , how i post message to PM...???

  Thank you very much!
  Mark. G..!!




... let's try that again
XRumer
I offer you the services in advertising in internet: (...)

 3. Forum spam.
 Opportunities of posting:
 - Registration at a forum with editing a profile of the user
 - Dispatch on the forums supporting a guest input
 - Notices on e-mail about answers at a forum or private messages
 - the Opportunity of registration without posting (increases PR Google)

 On the ending of dispatch you receive the report on the done work -
 direct references to your announcement.

 The prices for mass dispatch on forums:

 2)1000 forums - $35/1000
 3)4000-6000 forums - $33/1000
 4)7000-9000 forums - $31/1000
 5)10000-13000 forums - $30/1000
 5)20000 forums and more - $20/1000

 Total of Russian forums - 40.000
 Amount of English-speaking forums - 70.000




We'll spam for you
Agenda


• What is webspam?
• What to do about it?
• Outlook
IP Addresses

• Block IP
  ‣ dynamic IPs
  ‣ Bulletproof Hosting
• Speedlimit
  ‣ only helps with
    individual IPs
Word filters


• surprisingly effective      viagra
• depends on topics and       xanax
  languages                specialist
• Beware of False             phentermine
  Positives                   tramadol
Moderation


• takes up time
• full moderation queue
• Mixed approach:
  moderate first post
Registration

• only let registered
  users post

  ‣ and how many
     visitors will that
     drive away?

• OpenID
• automatic
  registration from bots
CAPTCHA

• Try to tell humans
  and bots apart

  ‣ doesn't have to be a
     picture!

• often hard to read for
  humans, too

• arms race
  ‣ PWNtcha
Blacklists: manual

• update manually:
  takes time

  ‣ MT-Blacklist (RIP)
  ‣ spam-merge
    ✴ MoinMoin,
       TWiki,
       MediaWiki
Blacklists: automatic

• dynamically
• recognize URLs
  showing up often

• centralised
  ‣ Akismet
  ‣ SLV
Detecting spambots

• Bad Behavior
  ‣ known bots
  ‣ bad HTTP requests
• Project Honeypot
  ‣ dynamic IP
    blacklist
Abuse Reports


• Takes time and work
• not a lot of success
• ISPs and hosters
  aren't aware of the
  problem
rel=quot;nofollowquot;

• Don't rank links with
  that attribute

• concerted effort of all
  big search engines

• promised to end web
  spam

• didn't change
  anything
Example: Spam-X
• Spamfilter in Geeklog
• modular, extensible
  ‣ new modules for
    the spammer's new
    tricks

  ‣ new modules for
    new services

• Downside: yes/no
  decisions only
Agenda


• What is webspam?
• What to do about it?
• Outlook
R.I.P. - Success stories


• Trackback Spam
  ‣ through technical
    measures

• Referrer Spam
  ‣ simply not effective
State of things

• a big portion can be
  filtered easily

• the rest is starting to
  become a problem

  ‣ Total amount of
     spam increases

• there will alway be
  some spam
Solutions?

• not CAPTCHA!
  ‣ at least not as
    graphics

  ‣ OCR improvements
    for email spam will
    help break
    CAPTCHAs
Solutions?

• Bayes-Filter?
  ‣ Who wants to train
    them?

• We need user-friendly
  solutions!

• centralized systems
  may be not accurate
  enough
Solutions?

• Cooperation?
  ‣ not much
  ‣ quot;Spam is not a
    problem any morequot;

• Where are the
  commercial
  solutions?
Resources

• Webspam in general
  ‣ spamhuntress.com
• Wiki-Spam
  ‣ chongqed.org
• My blog
  ‣ spam.tinyweb.net
Credits

  • Photos via flickr.com,
    thanks to: freezelight,
    Hopkinsii, striatic, chotda,
    lagiuspo, It'sGreg, lorZ, YnR,
    kevinthoule, acagamic, R80o
    (Mark Strozier), Kevin,
    loungerie, brappy!,
    ^Sandra^, longwayround,
    sheeshoo, Orgasmic kmlz,
    awinn233, teotwawki,
    Hugo*, rofanator, gyst,
    Gigglejuice, manuki




Hint: Pictures and keywords are hyperlinked!

Weitere ähnliche Inhalte

Was ist angesagt?

"The Web Is Broken" by Bipin Upadhyay
"The Web Is Broken" by Bipin Upadhyay"The Web Is Broken" by Bipin Upadhyay
"The Web Is Broken" by Bipin UpadhyayBipin Upadhyay
 
OpenID Security
OpenID SecurityOpenID Security
OpenID Securityeugenet
 
The page and the desktop
The page and the desktopThe page and the desktop
The page and the desktopGlenn Jones
 
Making Mobile Sites Faster
Making Mobile Sites FasterMaking Mobile Sites Faster
Making Mobile Sites FasterAndy Davies
 
Re-using social media data
Re-using social media dataRe-using social media data
Re-using social media dataGlenn Jones
 
Progressive Enhancement 2.0 (jQuery Conference SF Bay Area 2011)
Progressive Enhancement 2.0 (jQuery Conference SF Bay Area 2011)Progressive Enhancement 2.0 (jQuery Conference SF Bay Area 2011)
Progressive Enhancement 2.0 (jQuery Conference SF Bay Area 2011)Nicholas Zakas
 
Real World Web Standards
Real World Web StandardsReal World Web Standards
Real World Web Standardsgleddy
 
Mobile Web Performance - Getting and Staying Fast
Mobile Web Performance -  Getting and Staying FastMobile Web Performance -  Getting and Staying Fast
Mobile Web Performance - Getting and Staying FastAndy Davies
 
How to optimise TTFB - BrightonSEO 2020
How to optimise TTFB - BrightonSEO 2020How to optimise TTFB - BrightonSEO 2020
How to optimise TTFB - BrightonSEO 2020Roxana Stingu
 
How to connect social media with open standards
How to connect social media with open standardsHow to connect social media with open standards
How to connect social media with open standardsGlenn Jones
 
BNC Tech Forum 09: Lexcycle Stanza demo
BNC Tech Forum 09: Lexcycle Stanza demoBNC Tech Forum 09: Lexcycle Stanza demo
BNC Tech Forum 09: Lexcycle Stanza demoBookNet Canada
 
Keypoints html5
Keypoints html5Keypoints html5
Keypoints html5dynamis
 
Are Today’s Good Practices… Tomorrow’s Performance Anti-Patterns?
Are Today’s Good Practices… Tomorrow’s Performance Anti-Patterns?Are Today’s Good Practices… Tomorrow’s Performance Anti-Patterns?
Are Today’s Good Practices… Tomorrow’s Performance Anti-Patterns?Andy Davies
 
Stefan Judis "Did we(b development) lose the right direction?"
Stefan Judis "Did we(b development) lose the right direction?"Stefan Judis "Did we(b development) lose the right direction?"
Stefan Judis "Did we(b development) lose the right direction?"Fwdays
 
GFW-official-list-cashed in Google
GFW-official-list-cashed in GoogleGFW-official-list-cashed in Google
GFW-official-list-cashed in GoogleAkkad
 
The web is too slow
The web is too slow The web is too slow
The web is too slow Andy Davies
 

Was ist angesagt? (16)

"The Web Is Broken" by Bipin Upadhyay
"The Web Is Broken" by Bipin Upadhyay"The Web Is Broken" by Bipin Upadhyay
"The Web Is Broken" by Bipin Upadhyay
 
OpenID Security
OpenID SecurityOpenID Security
OpenID Security
 
The page and the desktop
The page and the desktopThe page and the desktop
The page and the desktop
 
Making Mobile Sites Faster
Making Mobile Sites FasterMaking Mobile Sites Faster
Making Mobile Sites Faster
 
Re-using social media data
Re-using social media dataRe-using social media data
Re-using social media data
 
Progressive Enhancement 2.0 (jQuery Conference SF Bay Area 2011)
Progressive Enhancement 2.0 (jQuery Conference SF Bay Area 2011)Progressive Enhancement 2.0 (jQuery Conference SF Bay Area 2011)
Progressive Enhancement 2.0 (jQuery Conference SF Bay Area 2011)
 
Real World Web Standards
Real World Web StandardsReal World Web Standards
Real World Web Standards
 
Mobile Web Performance - Getting and Staying Fast
Mobile Web Performance -  Getting and Staying FastMobile Web Performance -  Getting and Staying Fast
Mobile Web Performance - Getting and Staying Fast
 
How to optimise TTFB - BrightonSEO 2020
How to optimise TTFB - BrightonSEO 2020How to optimise TTFB - BrightonSEO 2020
How to optimise TTFB - BrightonSEO 2020
 
How to connect social media with open standards
How to connect social media with open standardsHow to connect social media with open standards
How to connect social media with open standards
 
BNC Tech Forum 09: Lexcycle Stanza demo
BNC Tech Forum 09: Lexcycle Stanza demoBNC Tech Forum 09: Lexcycle Stanza demo
BNC Tech Forum 09: Lexcycle Stanza demo
 
Keypoints html5
Keypoints html5Keypoints html5
Keypoints html5
 
Are Today’s Good Practices… Tomorrow’s Performance Anti-Patterns?
Are Today’s Good Practices… Tomorrow’s Performance Anti-Patterns?Are Today’s Good Practices… Tomorrow’s Performance Anti-Patterns?
Are Today’s Good Practices… Tomorrow’s Performance Anti-Patterns?
 
Stefan Judis "Did we(b development) lose the right direction?"
Stefan Judis "Did we(b development) lose the right direction?"Stefan Judis "Did we(b development) lose the right direction?"
Stefan Judis "Did we(b development) lose the right direction?"
 
GFW-official-list-cashed in Google
GFW-official-list-cashed in GoogleGFW-official-list-cashed in Google
GFW-official-list-cashed in Google
 
The web is too slow
The web is too slow The web is too slow
The web is too slow
 

Andere mochten auch

Rewriting not recommended
Rewriting not recommendedRewriting not recommended
Rewriting not recommendedDirk Haun
 
Is C going the way of the Dodo?
Is C going the way of the Dodo?Is C going the way of the Dodo?
Is C going the way of the Dodo?Dirk Haun
 
Kurzeinführung: Atom Publishing Protocol
Kurzeinführung: Atom Publishing ProtocolKurzeinführung: Atom Publishing Protocol
Kurzeinführung: Atom Publishing ProtocolDirk Haun
 
Google Summer of Code 2011 (English)
Google Summer of Code 2011 (English)Google Summer of Code 2011 (English)
Google Summer of Code 2011 (English)Dirk Haun
 
Using Geeklog as a Web Application Framework
Using Geeklog as a Web Application FrameworkUsing Geeklog as a Web Application Framework
Using Geeklog as a Web Application FrameworkDirk Haun
 
Google Summer of Code 2010 (in English)
Google Summer of Code 2010 (in English)Google Summer of Code 2010 (in English)
Google Summer of Code 2010 (in English)Dirk Haun
 
Atom Publishing Protocol
Atom Publishing ProtocolAtom Publishing Protocol
Atom Publishing ProtocolDirk Haun
 
People & Performance UK
People & Performance UKPeople & Performance UK
People & Performance UKtn
 
Google Summer of Code: Neue Mitstreiter mit Geld (und T-Shirts) gewinnen - kl...
Google Summer of Code: Neue Mitstreiter mit Geld (und T-Shirts) gewinnen - kl...Google Summer of Code: Neue Mitstreiter mit Geld (und T-Shirts) gewinnen - kl...
Google Summer of Code: Neue Mitstreiter mit Geld (und T-Shirts) gewinnen - kl...Dirk Haun
 
Google Summer of Code™ (in English; neutral version)
Google Summer of Code™ (in English; neutral version)Google Summer of Code™ (in English; neutral version)
Google Summer of Code™ (in English; neutral version)Dirk Haun
 
Ribbit for Salesforce - General
Ribbit for Salesforce - GeneralRibbit for Salesforce - General
Ribbit for Salesforce - GeneralChris Cranis
 
Will Stacy Talks Productivity from Sales 2.0
Will Stacy Talks Productivity from Sales 2.0Will Stacy Talks Productivity from Sales 2.0
Will Stacy Talks Productivity from Sales 2.0Chris Cranis
 
Open Source is good for you
Open Source is good for youOpen Source is good for you
Open Source is good for youDirk Haun
 
Continuous Integration - Does it scale?
Continuous Integration - Does it scale?Continuous Integration - Does it scale?
Continuous Integration - Does it scale?Dirk Haun
 
What's our Status?
What's our Status?What's our Status?
What's our Status?Dirk Haun
 
Continuous Integration in der Praxis
Continuous Integration in der PraxisContinuous Integration in der Praxis
Continuous Integration in der PraxisDirk Haun
 
Braindump - How to leave your Knowledge when leaving your Job
Braindump - How to leave your Knowledge when leaving your JobBraindump - How to leave your Knowledge when leaving your Job
Braindump - How to leave your Knowledge when leaving your JobDirk Haun
 

Andere mochten auch (18)

Rewriting not recommended
Rewriting not recommendedRewriting not recommended
Rewriting not recommended
 
Is C going the way of the Dodo?
Is C going the way of the Dodo?Is C going the way of the Dodo?
Is C going the way of the Dodo?
 
Kurzeinführung: Atom Publishing Protocol
Kurzeinführung: Atom Publishing ProtocolKurzeinführung: Atom Publishing Protocol
Kurzeinführung: Atom Publishing Protocol
 
Google Summer of Code 2011 (English)
Google Summer of Code 2011 (English)Google Summer of Code 2011 (English)
Google Summer of Code 2011 (English)
 
Send Sms
Send SmsSend Sms
Send Sms
 
Using Geeklog as a Web Application Framework
Using Geeklog as a Web Application FrameworkUsing Geeklog as a Web Application Framework
Using Geeklog as a Web Application Framework
 
Google Summer of Code 2010 (in English)
Google Summer of Code 2010 (in English)Google Summer of Code 2010 (in English)
Google Summer of Code 2010 (in English)
 
Atom Publishing Protocol
Atom Publishing ProtocolAtom Publishing Protocol
Atom Publishing Protocol
 
People & Performance UK
People & Performance UKPeople & Performance UK
People & Performance UK
 
Google Summer of Code: Neue Mitstreiter mit Geld (und T-Shirts) gewinnen - kl...
Google Summer of Code: Neue Mitstreiter mit Geld (und T-Shirts) gewinnen - kl...Google Summer of Code: Neue Mitstreiter mit Geld (und T-Shirts) gewinnen - kl...
Google Summer of Code: Neue Mitstreiter mit Geld (und T-Shirts) gewinnen - kl...
 
Google Summer of Code™ (in English; neutral version)
Google Summer of Code™ (in English; neutral version)Google Summer of Code™ (in English; neutral version)
Google Summer of Code™ (in English; neutral version)
 
Ribbit for Salesforce - General
Ribbit for Salesforce - GeneralRibbit for Salesforce - General
Ribbit for Salesforce - General
 
Will Stacy Talks Productivity from Sales 2.0
Will Stacy Talks Productivity from Sales 2.0Will Stacy Talks Productivity from Sales 2.0
Will Stacy Talks Productivity from Sales 2.0
 
Open Source is good for you
Open Source is good for youOpen Source is good for you
Open Source is good for you
 
Continuous Integration - Does it scale?
Continuous Integration - Does it scale?Continuous Integration - Does it scale?
Continuous Integration - Does it scale?
 
What's our Status?
What's our Status?What's our Status?
What's our Status?
 
Continuous Integration in der Praxis
Continuous Integration in der PraxisContinuous Integration in der Praxis
Continuous Integration in der Praxis
 
Braindump - How to leave your Knowledge when leaving your Job
Braindump - How to leave your Knowledge when leaving your JobBraindump - How to leave your Knowledge when leaving your Job
Braindump - How to leave your Knowledge when leaving your Job
 

Ähnlich wie Webspam (English Version)

Be Afraid. Be Very Afraid. Javascript security, XSS & CSRF
Be Afraid. Be Very Afraid. Javascript security, XSS & CSRFBe Afraid. Be Very Afraid. Javascript security, XSS & CSRF
Be Afraid. Be Very Afraid. Javascript security, XSS & CSRFMark Stanton
 
Building Twitter in Drupal
Building Twitter in DrupalBuilding Twitter in Drupal
Building Twitter in DrupalJeff Eaton
 
Rise of the Autobots: Into the Underground of Social Network Bots
Rise of the Autobots: Into the Underground of Social Network BotsRise of the Autobots: Into the Underground of Social Network Bots
Rise of the Autobots: Into the Underground of Social Network BotsTom Eston
 
Microapps for Fun and <s>profit</s>
Microapps for Fun and <s>profit</s>Microapps for Fun and <s>profit</s>
Microapps for Fun and <s>profit</s>guesta2b753
 
Defeating Comment Spam
Defeating Comment SpamDefeating Comment Spam
Defeating Comment SpamAndrew Hedges
 
OpenID Intro @ Barcamp Brussels 3
OpenID Intro @ Barcamp Brussels 3OpenID Intro @ Barcamp Brussels 3
OpenID Intro @ Barcamp Brussels 3Frank Louwers
 
Microblogging via XMPP
Microblogging via XMPPMicroblogging via XMPP
Microblogging via XMPPStoyan Zhekov
 
White Lightning Sept 2014
White Lightning Sept 2014White Lightning Sept 2014
White Lightning Sept 2014Bryce Kunz
 
Api anti patterns
Api anti patternsApi anti patterns
Api anti patternsMike Pearce
 
6 site migration fails and how to avoid them - BrightonSEO September 2018 - J...
6 site migration fails and how to avoid them - BrightonSEO September 2018 - J...6 site migration fails and how to avoid them - BrightonSEO September 2018 - J...
6 site migration fails and how to avoid them - BrightonSEO September 2018 - J...Oban International
 
So you want to be a red teamer
So you want to be a red teamerSo you want to be a red teamer
So you want to be a red teamerJorge Orchilles
 
Country domination - Causing chaos and wrecking havoc
Country domination - Causing chaos and wrecking havocCountry domination - Causing chaos and wrecking havoc
Country domination - Causing chaos and wrecking havocTiago Henriques
 
High Performance Kick Ass Web Apps (JavaScript edition)
High Performance Kick Ass Web Apps (JavaScript edition)High Performance Kick Ass Web Apps (JavaScript edition)
High Performance Kick Ass Web Apps (JavaScript edition)Stoyan Stefanov
 
Don't make me wait! or Building High-Performance Web Applications
Don't make me wait! or Building High-Performance Web ApplicationsDon't make me wait! or Building High-Performance Web Applications
Don't make me wait! or Building High-Performance Web ApplicationsStoyan Stefanov
 
Scaling Twitter 12758
Scaling Twitter 12758Scaling Twitter 12758
Scaling Twitter 12758davidblum
 
Special:Contributions/newbies
Special:Contributions/newbiesSpecial:Contributions/newbies
Special:Contributions/newbiesBrianna Laugher
 

Ähnlich wie Webspam (English Version) (20)

Be Afraid. Be Very Afraid. Javascript security, XSS & CSRF
Be Afraid. Be Very Afraid. Javascript security, XSS & CSRFBe Afraid. Be Very Afraid. Javascript security, XSS & CSRF
Be Afraid. Be Very Afraid. Javascript security, XSS & CSRF
 
Building Twitter in Drupal
Building Twitter in DrupalBuilding Twitter in Drupal
Building Twitter in Drupal
 
Rise of the Autobots: Into the Underground of Social Network Bots
Rise of the Autobots: Into the Underground of Social Network BotsRise of the Autobots: Into the Underground of Social Network Bots
Rise of the Autobots: Into the Underground of Social Network Bots
 
Microapps for Fun and <s>profit</s>
Microapps for Fun and <s>profit</s>Microapps for Fun and <s>profit</s>
Microapps for Fun and <s>profit</s>
 
Defeating Comment Spam
Defeating Comment SpamDefeating Comment Spam
Defeating Comment Spam
 
OpenID Intro @ Barcamp Brussels 3
OpenID Intro @ Barcamp Brussels 3OpenID Intro @ Barcamp Brussels 3
OpenID Intro @ Barcamp Brussels 3
 
Microblogging via XMPP
Microblogging via XMPPMicroblogging via XMPP
Microblogging via XMPP
 
White Lightning Sept 2014
White Lightning Sept 2014White Lightning Sept 2014
White Lightning Sept 2014
 
Spam Wars
Spam WarsSpam Wars
Spam Wars
 
Api anti patterns
Api anti patternsApi anti patterns
Api anti patterns
 
20081123-web2.0class
20081123-web2.0class20081123-web2.0class
20081123-web2.0class
 
6 site migration fails and how to avoid them - BrightonSEO September 2018 - J...
6 site migration fails and how to avoid them - BrightonSEO September 2018 - J...6 site migration fails and how to avoid them - BrightonSEO September 2018 - J...
6 site migration fails and how to avoid them - BrightonSEO September 2018 - J...
 
So you want to be a red teamer
So you want to be a red teamerSo you want to be a red teamer
So you want to be a red teamer
 
Country domination - Causing chaos and wrecking havoc
Country domination - Causing chaos and wrecking havocCountry domination - Causing chaos and wrecking havoc
Country domination - Causing chaos and wrecking havoc
 
High Performance Kick Ass Web Apps (JavaScript edition)
High Performance Kick Ass Web Apps (JavaScript edition)High Performance Kick Ass Web Apps (JavaScript edition)
High Performance Kick Ass Web Apps (JavaScript edition)
 
Don't make me wait! or Building High-Performance Web Applications
Don't make me wait! or Building High-Performance Web ApplicationsDon't make me wait! or Building High-Performance Web Applications
Don't make me wait! or Building High-Performance Web Applications
 
Scaling Twitter 12758
Scaling Twitter 12758Scaling Twitter 12758
Scaling Twitter 12758
 
Special:Contributions/newbies
Special:Contributions/newbiesSpecial:Contributions/newbies
Special:Contributions/newbies
 
Web Design
Web DesignWeb Design
Web Design
 
Reification
ReificationReification
Reification
 

Mehr von Dirk Haun

Reverse Bildersuche mit TinEye
Reverse Bildersuche mit TinEyeReverse Bildersuche mit TinEye
Reverse Bildersuche mit TinEyeDirk Haun
 
Vorsicht, Kamera!
Vorsicht, Kamera!Vorsicht, Kamera!
Vorsicht, Kamera!Dirk Haun
 
Vorsicht Kamera!
Vorsicht Kamera!Vorsicht Kamera!
Vorsicht Kamera!Dirk Haun
 
Botschaften optimieren für Erinnerung und Verbreitung
Botschaften optimieren für Erinnerung und VerbreitungBotschaften optimieren für Erinnerung und Verbreitung
Botschaften optimieren für Erinnerung und VerbreitungDirk Haun
 
Smile, you're on camera!
Smile, you're on camera!Smile, you're on camera!
Smile, you're on camera!Dirk Haun
 
What's our Status?
What's our Status?What's our Status?
What's our Status?Dirk Haun
 
Google Summer of Code 2012
Google Summer of Code 2012Google Summer of Code 2012
Google Summer of Code 2012Dirk Haun
 
Geeklog: The secure CMS.
Geeklog: The secure CMS.Geeklog: The secure CMS.
Geeklog: The secure CMS.Dirk Haun
 
Google Summer of Code 2011 (German)
Google Summer of Code 2011 (German)Google Summer of Code 2011 (German)
Google Summer of Code 2011 (German)Dirk Haun
 
Apple iPad als Reisebegleiter
Apple iPad als ReisebegleiterApple iPad als Reisebegleiter
Apple iPad als ReisebegleiterDirk Haun
 
Verteilte Versionskontrolle in der Praxis
Verteilte Versionskontrolle in der PraxisVerteilte Versionskontrolle in der Praxis
Verteilte Versionskontrolle in der PraxisDirk Haun
 
Verteilte Versionskontrolle in der Praxis
Verteilte Versionskontrolle in der PraxisVerteilte Versionskontrolle in der Praxis
Verteilte Versionskontrolle in der PraxisDirk Haun
 
Adventures in QA
Adventures in QAAdventures in QA
Adventures in QADirk Haun
 
Google Summer of Code 2010 (in German)
Google Summer of Code 2010 (in German)Google Summer of Code 2010 (in German)
Google Summer of Code 2010 (in German)Dirk Haun
 
GSoC@Webmontag (in German)
GSoC@Webmontag (in German)GSoC@Webmontag (in German)
GSoC@Webmontag (in German)Dirk Haun
 
Google Summer of Code™ (in German)
Google Summer of Code™ (in German)Google Summer of Code™ (in German)
Google Summer of Code™ (in German)Dirk Haun
 
Google Summer of Code™ (in German; neutral version)
Google Summer of Code™ (in German; neutral version)Google Summer of Code™ (in German; neutral version)
Google Summer of Code™ (in German; neutral version)Dirk Haun
 

Mehr von Dirk Haun (17)

Reverse Bildersuche mit TinEye
Reverse Bildersuche mit TinEyeReverse Bildersuche mit TinEye
Reverse Bildersuche mit TinEye
 
Vorsicht, Kamera!
Vorsicht, Kamera!Vorsicht, Kamera!
Vorsicht, Kamera!
 
Vorsicht Kamera!
Vorsicht Kamera!Vorsicht Kamera!
Vorsicht Kamera!
 
Botschaften optimieren für Erinnerung und Verbreitung
Botschaften optimieren für Erinnerung und VerbreitungBotschaften optimieren für Erinnerung und Verbreitung
Botschaften optimieren für Erinnerung und Verbreitung
 
Smile, you're on camera!
Smile, you're on camera!Smile, you're on camera!
Smile, you're on camera!
 
What's our Status?
What's our Status?What's our Status?
What's our Status?
 
Google Summer of Code 2012
Google Summer of Code 2012Google Summer of Code 2012
Google Summer of Code 2012
 
Geeklog: The secure CMS.
Geeklog: The secure CMS.Geeklog: The secure CMS.
Geeklog: The secure CMS.
 
Google Summer of Code 2011 (German)
Google Summer of Code 2011 (German)Google Summer of Code 2011 (German)
Google Summer of Code 2011 (German)
 
Apple iPad als Reisebegleiter
Apple iPad als ReisebegleiterApple iPad als Reisebegleiter
Apple iPad als Reisebegleiter
 
Verteilte Versionskontrolle in der Praxis
Verteilte Versionskontrolle in der PraxisVerteilte Versionskontrolle in der Praxis
Verteilte Versionskontrolle in der Praxis
 
Verteilte Versionskontrolle in der Praxis
Verteilte Versionskontrolle in der PraxisVerteilte Versionskontrolle in der Praxis
Verteilte Versionskontrolle in der Praxis
 
Adventures in QA
Adventures in QAAdventures in QA
Adventures in QA
 
Google Summer of Code 2010 (in German)
Google Summer of Code 2010 (in German)Google Summer of Code 2010 (in German)
Google Summer of Code 2010 (in German)
 
GSoC@Webmontag (in German)
GSoC@Webmontag (in German)GSoC@Webmontag (in German)
GSoC@Webmontag (in German)
 
Google Summer of Code™ (in German)
Google Summer of Code™ (in German)Google Summer of Code™ (in German)
Google Summer of Code™ (in German)
 
Google Summer of Code™ (in German; neutral version)
Google Summer of Code™ (in German; neutral version)Google Summer of Code™ (in German; neutral version)
Google Summer of Code™ (in German; neutral version)
 

Kürzlich hochgeladen

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 

Kürzlich hochgeladen (20)

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 

Webspam (English Version)

  • 1. Webspam Dirk Haun www.geeklog.net
  • 2. Geeklog, Spam & me • Geeklog: ‣ since Jan. 2002 ‣ as a maintainer since 2004 • Spam as a problem: ‣ since mid-2004 ‣ End of 2004: Poker Spam
  • 3. Agenda • What is webspam? • What to do about it? • Outlook
  • 4. Types of Webspam • Comment Spam • Trackback Spam • Referrer Spam • more subtle ways
  • 5. Comment Spam • Comments • Forums • Guest books
  • 6. Very good site... Hi all! [url=...]100% Free Lesbian Video[/url] [url=...]Lesbian Teen[/url] [url=...]Asian Teen Lesbian[/url] [url=...]Mature Lesbian[/url] [url=...]Woman Naked Pussy Lesbian[/url] [url=...]Shemale Lesbian Sex Vidoes[/url] [url=...]Skinny Lesbian Girls Having Sex[/url] [url=...]Teen Blonde Lesbian[/url] [url=...]Twins Sisters Video Lesbian[/url] [url=...]xxx Free Lesbian Movie[/url] Just the usual ...
  • 7. [url=.../index.html]underground sex[/url] [url=.../page=2.html]underlolitas[/url] [url=.../page=3.html]underpants[/url] [url=.../page=4.html]underwater erotica[/url] [url=.../page=5.html]underwater fucking[/url] [url=.../page=12.html]underwear models[/url] [url=.../page=13.html]undies[/url] [url=.../page=14.html]uniform porn[/url] [url=.../page=15.html]uniform sex[/url] [url=.../page=16.html]unique baby boys names [/url] [url=.../page=23.html]united airlines tickets flights[/url] [url=.../page=490.html]wellbutrin xl[/url] [url=.../page=491.html]wellness dog food[/url] All-in-one spam
  • 8. This Website contains sexually-oriented adult content which may include visual images and verbal descriptions of nude adults, adults engaging in sexual acts, and other audio and visual materials of a sexually-explicit nature. Permission to enter this Website and to view and download its contents is strictly limited only to consenting adults who affirm that the following conditions apply: 1. That you are at least 18 years of age or older, and that you are voluntarily choosing to view and access such sexually-explicit (...) Spam with disclaimer
  • 9. Wiki Spam • everbody can edit - including spammers • Spam sometimes hidden in older revisions
  • 10. Trackback Spam • in blogs: cross-site comments • XML-RPC, clearly defined protocol • similar: Pingback (URL only)
  • 11. Referrer Spam • faked referrers • Blogs used to display them on their homepage • usually invisible in the webserver logfile
  • 12. 66.49.223.233 - - [02/Jun/2007:04:11:07 -0400] quot;GET / forum/viewtopic.php?showtopic=73271 HTTP/1.1quot; 403 26 quot;http://www.kzcarinsurance.info/12868-71-0.htmlquot; quot;Mozilla/ 4.0 (compatible; MSIE 6.0; Windows NT 5.1)quot; 216.185.128.200 - - [02/Jun/2007:04:37:01 -0400] quot;GET / forum/viewtopic.php?showtopic=21070 HTTP/1.1quot; 200 18384 quot;http://www.kzcarinsurance.info/38645-71-0.htmlquot; quot;Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)quot; 66.49.223.233 - - [02/Jun/2007:05:02:14 -0400] quot;GET / forum/viewtopic.php?showtopic=68994 HTTP/1.1quot; 403 26 quot;http://www.kzcarinsurance.info/62898-71-0.htmlquot; quot;Mozilla/ 4.0 (compatible; MSIE 6.0; Windows NT 5.1)quot; 216.185.128.200 - - [02/Jun/2007:09:00:23 -0400] quot;GET / article.php/To-do_20050606 HTTP/1.1quot; 200 20169 quot;http:// www.kzcarinsurance.info/224400-71-0.htmlquot; quot;Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)quot; Referrer Spam
  • 13. More subtle spam • Profile Spam ‣ List of members in forums • almost on-topic posts ‣ Kudos, jokes, general questions
  • 14. Stumbled onto geeklog.info for the first time today looks like someplace I needed to find a while ago. Just went from a slow dial up system to at DSL so I don't have to wait several minutes for a picture to arrive Harmless posting ...
  • 15. Stumbled onto geeklog.info for the first time today looks li[url=http://webmeds.iespana.es/amoxicilin] k[/url][url=http://webmeds.iespana.es/rogaine]e[/ url] [url=http://webmeds.iespana.es/seroquel]s[/ url][url=http://webmeds.iespana.es/oxycontin]o[/ url][url=http://webmeds.iespana.es/oxycodone]m[/ url][url=http://webmeds.iespana.es/viagra]e[/url] [url=http://webmeds.iespana.es/celebrix]p[/url] [url=http://webmeds.iespana.es/welbutrin]l[/url] [url=http://webmeds.iespana.es/stop-smoking]a[/ url][url=http://webmeds.iespana.es/quit-smoking]c [/url][url=http://webmeds.iespana.es/skelaxin]e[/ url] [url=http://webmeds.iespana.es/atenolol]I[/ url] [url=http://webmeds.iespana.es/fluconazole]n[/ url][url=http://webmeds.iespana.es/diflucan]e[/url] [url=http://webmeds.iespana.es/ciales]e[/url] [url=http://webmeds.iespana.es/xanex]d[/url] [url=http://webmeds.iespana.es/aciclovir]e[/url] ... or maybe not [url=http://webmeds.iespana.es/adderol]d[/url]
  • 17. Pagerank • not that much quot;mass spamquot; any more • takes time to build • Spamming older posts
  • 18. Clickthroughs • Get people onto their site ‣ Sale, Ads, Affiliate • Throw-away domains ‣ Redirects • Throw-away URLs ‣ old forums, etc.
  • 19. Spam topics 24.-31. March 2007 (356 Spam posts) Pills 137 Porn 102 Finance 23 Software 13 Ringtones 11 misc. 70 0 50 100 150
  • 20. Spam topics misc. 20% Pills 38% Ringtones 3% Software 4% Finance 6% Porn 29%
  • 21. Compare with email spam • Keywords not obfuscated (V14gr4) • No stock spam (time?) • No spam in images
  • 22. How they're spamming • Spambots ‣ hijacked PCs or webservers ‣ Bulletproof hosting ‣ open proxies • manual spam: very rarely • quot;We'll spam for youquot;
  • 23. I am amazed by the skills of some people here #file=D:XRumerfreewebtown-general.txt Oops ...
  • 24. I am amazed by the skills of some people here Hi..!! everyone! This is my first post on Yours site. Thank you in [url=http://www.freewebtown.com/topweb/louis- vuitton]a[/url](...)[url=http:// www.freewebtown.com/topweb/credit-equity-home- line].[/url] I am From Canada Nice day is it today, but I have a question for all... In first , how i post message to PM...??? Thank you very much! Mark. G..!! ... let's try that again
  • 26. I offer you the services in advertising in internet: (...) 3. Forum spam. Opportunities of posting: - Registration at a forum with editing a profile of the user - Dispatch on the forums supporting a guest input - Notices on e-mail about answers at a forum or private messages - the Opportunity of registration without posting (increases PR Google) On the ending of dispatch you receive the report on the done work - direct references to your announcement. The prices for mass dispatch on forums: 2)1000 forums - $35/1000 3)4000-6000 forums - $33/1000 4)7000-9000 forums - $31/1000 5)10000-13000 forums - $30/1000 5)20000 forums and more - $20/1000 Total of Russian forums - 40.000 Amount of English-speaking forums - 70.000 We'll spam for you
  • 27. Agenda • What is webspam? • What to do about it? • Outlook
  • 28. IP Addresses • Block IP ‣ dynamic IPs ‣ Bulletproof Hosting • Speedlimit ‣ only helps with individual IPs
  • 29. Word filters • surprisingly effective viagra • depends on topics and xanax languages specialist • Beware of False phentermine Positives tramadol
  • 30. Moderation • takes up time • full moderation queue • Mixed approach: moderate first post
  • 31. Registration • only let registered users post ‣ and how many visitors will that drive away? • OpenID • automatic registration from bots
  • 32. CAPTCHA • Try to tell humans and bots apart ‣ doesn't have to be a picture! • often hard to read for humans, too • arms race ‣ PWNtcha
  • 33. Blacklists: manual • update manually: takes time ‣ MT-Blacklist (RIP) ‣ spam-merge ✴ MoinMoin, TWiki, MediaWiki
  • 34. Blacklists: automatic • dynamically • recognize URLs showing up often • centralised ‣ Akismet ‣ SLV
  • 35. Detecting spambots • Bad Behavior ‣ known bots ‣ bad HTTP requests • Project Honeypot ‣ dynamic IP blacklist
  • 36. Abuse Reports • Takes time and work • not a lot of success • ISPs and hosters aren't aware of the problem
  • 37. rel=quot;nofollowquot; • Don't rank links with that attribute • concerted effort of all big search engines • promised to end web spam • didn't change anything
  • 38. Example: Spam-X • Spamfilter in Geeklog • modular, extensible ‣ new modules for the spammer's new tricks ‣ new modules for new services • Downside: yes/no decisions only
  • 39. Agenda • What is webspam? • What to do about it? • Outlook
  • 40. R.I.P. - Success stories • Trackback Spam ‣ through technical measures • Referrer Spam ‣ simply not effective
  • 41. State of things • a big portion can be filtered easily • the rest is starting to become a problem ‣ Total amount of spam increases • there will alway be some spam
  • 42. Solutions? • not CAPTCHA! ‣ at least not as graphics ‣ OCR improvements for email spam will help break CAPTCHAs
  • 43. Solutions? • Bayes-Filter? ‣ Who wants to train them? • We need user-friendly solutions! • centralized systems may be not accurate enough
  • 44. Solutions? • Cooperation? ‣ not much ‣ quot;Spam is not a problem any morequot; • Where are the commercial solutions?
  • 45. Resources • Webspam in general ‣ spamhuntress.com • Wiki-Spam ‣ chongqed.org • My blog ‣ spam.tinyweb.net
  • 46. Credits • Photos via flickr.com, thanks to: freezelight, Hopkinsii, striatic, chotda, lagiuspo, It'sGreg, lorZ, YnR, kevinthoule, acagamic, R80o (Mark Strozier), Kevin, loungerie, brappy!, ^Sandra^, longwayround, sheeshoo, Orgasmic kmlz, awinn233, teotwawki, Hugo*, rofanator, gyst, Gigglejuice, manuki Hint: Pictures and keywords are hyperlinked!