when bad things happen to good blogs

update: i failed to include the url for the ebiquity research group study on the splogosphere. here it is — read it: it’s really good stuff!

i am amazed at the innovative research going on with the ebiquity guys. one really phenomenal piece of research is their work on splogs. what are splogs? before i answer that question, here are some facts:

these are some fascinating statistics. there is clearly a lot of activity and opportunity in this space. but, there are also some challenges — SPAM! back to my earlier question: what are splogs? wikipedia explains it like this:

splogs are weblog sites which the author uses only for promoting affiliated websites. The purpose is to increase the PageRank of the affiliated sites, get ad impressions from visitors, and/or use the blog as a link outlet to get new sites indexed. Content is often nonsense or text stolen from other websites with an unusually high number of links to sites associated with the splog creator which are often disreputable or otherwise useless websites.

a corollary to splogs are spings, which are ping spam or spam sent from spam blogs or splogs. it’s important to emphasize the reasons or motivations behind splogs: The purpose is to increase the PageRank of the affiliated sites, get ad impressions from visitors, and/or use the blog as a link outlet to get new sites indexed. many people have chimed in on splogs including pirillo and mark cuban. others seem to think that splogs linking to them is a sign of growth. how are splogs created anyway? there is software that can automate blogs, pings, and the like. the software releases a bot that crawls the blogosphere and post-jacks content, copies it, and creates a splog of its own. interestingly, the post title is always used in the splog. this is important to remember when i discuss splog baits. the folks at ebiquity have done amazing research into the splogosphere. they have studied weblogs data and innovatively created a splog bait. this bait was picked-up by many splogs shortly after the post was published. this splog bait contains jibberish — garbage — but the text has key words that rank highly on the adwords bid tool, which can be found here and the postitle in the splog is the exact same is it is in the authentic post title. this is one way to find out who has splogged you. here’s the conclusion from the ebiquity study:

ebiquity presents a unique solution, which includes characterizing splog features versus authentic blog features and probabilistic training to automatically determine which is which. for their study, they looked at weblogs data and blogpulse data sets. they arrive at some fascinating findings about ping times over a time series of splogs versus blogs, word counts of splogs versus blogs, and word distribution in splogs versus blogs. to read more about their great work, please go to their website. i’ll be posting a splog bait here shortly. i’ll report on how quickly and by how many splog bots have hijacked my splog bait. this will be an interesting study.


Short URL: http://bit.ly/Q0Gw

Share This Post:



  • Digg
  • Twitter
  • Facebook
  • Google Bookmarks
  • del.icio.us
  • StumbleUpon
  • LinkedIn
  • MySpace
  • HackerNews
  • Reddit
  • Live
  • email

2-pizza teams (10)
3 C's (3)
37signals (1)
5S (38)
A3 Report (9)
adoption (7)
agile/software (59)
ajax (4)
amazon (53)
apple (3)
apple iphone (7)
axiom (3)
Aza Raskin (9)
backcountry.com (2)
berlin (1)
bill gates (1)
bill marriott (1)
blog tag (1)
book reviews (4)
bullwhip effect (5)
business (397)
business plans (3)
busm361 (13)
BzzAgent (12)
call center and queueing (11)
car buying (2)
Carbonite (1)
change management (5)
chicago (1)
click fraud (1)
click-to-ship (21)
clocky (2)
colin powell (2)
community (2)
company interviews (18)
company interviews (6)
complexity (32)
costs (8)
culture (7)
customer experience (10)
customer obsession (52)
customer recovery function (1)
customer segmentation (8)
customer service (17)
design thinking (14)
digg (4)
drum-buffer-rope (38)
dublin (1)
dynamic systems (24)
eBay (6)
economics (3)
efficiency (4)
ethnography (29)
family (18)
featuritis (15)
flexibility (1)
forecasting (2)
four performance dimensions (2)
Fun With The 2×2 Matrix (1)
game theory (7)
Gemba (67)
genchi genbutsu (68)
general (135)
germany (1)
google (15)
Gretchen Rubin (1)
heijunka (65)
holidays (1)
hoshin kanri (1)
how to be a human (1)
IDEO (2)
image uploading (1)
interviews (4)
iphone (5)
ishikawa (69)
IT at Toyota (67)
jason fried (1)
just-in-time (4)
kaizen (4)
kanban (46)
law of instinct (1)
Leadership (46)
lean (167)
Lean Consumption Maps (98)
learning curve (1)
licketyship (1)
mark cuban (1)
martin luther king (1)
mary poppendieck (1)
metrics (73)
microsoft (6)
milton friedman (1)
moving average (1)
muda (68)
nba fines (1)
net promoter score (nps) (1)
obeya (39)
Off-Topic (1)
onstar (1)
operations (108)
pageviews (3)
pareto principle (39)
patent (1)
peanut butter manifesto (2)
philosophy (3)
Poka-Yoke (6)
poppendieck (3)
powerpoint sucks (2)
private equity (4)
process measures (6)
product development (20)
productivity (4)
quality (41)
quasimodal design (1)
queueing theory (41)
Raffle (1)
rational choice (2)
regression analysis (18)
respect for people (6)
root cause analysis (60)
sarah+palin (2)
seth godin (1)
simplicity principle (10)
six sigma (128)
snowboarding (2)
social media (3)
spam (1)
statistical process control (46)
strategy (46)
suburban (1)
supply chain (24)
takt time (8)
teaching (2)
team size (9)
technology (104)
the beer distribution game (1)
The Happiness Project (1)
the profit tree (7)
The Visual Factory (11)
theory of constraints (41)
time (2)
timeline (3)
tony+hsieh (11)
toyota (75)
travel (1)
trump bankruptcy (1)
turnaround (5)
twitter (8)
uspto (1)
utah deal flow (2)
variation (69)
venture capital (1)
Visual Management (11)
waste (59)
website traffic (2)
Wing Chun (2)
wisdom of crowds (1)
wisdom teeth (1)
word-of-mouth marketing (18)
yahoo (2)
zappos.com (12)
zero defects (3)

WP Cumulus Flash tag cloud by Roy Tanck and Luke Morton requires Flash Player 9 or better.


If you enjoyed this post, please consider to leave a comment or subscribe to the feed and get future articles delivered to your feed reader.

Comments

Very interesting post regarding splogs. The market disruption this creates actually impacts a large amount of industries. For instance, let’s imagine that someone creates a splog bot that mirrors certain blog topics. Then, they sign up to multiple CPA Affiliate Networks. There is an amazing amount of content to plagiarize on the web, their obvious goal is to plagiarize content that ranks highly on Google and Yahoo, and most people will not be able to tell the difference between the genuine article and the splog – especially if the splog is ranked higher. Now, all they have to do is create as many splogs as possible, place well chosen advertisements from CPA networks, and reap in the cash.

On a side note, where exactly do you find these statistics:

* 57% of online us teens generate content, 40% read blogs, and 20% have are blog publishers (pew november 2005)
* 53% of companies are blogging (guideware, october 2005)
* myspace accounts of 33% of ALL web clicks (hendler, 2006)

I’m very interested in writing an article about trends in the blogosphere, fraud tactics, and overall internet strength – but I am having a hard time finding the relevant statistics to back up my claims.

+ Alex

Alex,

Thanks for reading. Splog research and Trust in the Social Web is a fascinating research area. Below is the URL where you can find some great data: the Ebiquity Research Group at The University of Maryland is doing great research in this area. Thanks for reading:

http://ebiquity.umbc.edu/paper/html/id/299

Peter Abilla

[...] Shmula was attacked by 30 Spings today. I’ve previously writtend about Splogs and Spings here, here, and the splog bait is here. Splogs and Spings put an undue burden on the blogosphere by eating up resources that should be used for genuine blogs and genuine comments. Moreover, Splogs really jeopardizes the advertising model of the internet, especially contextual ads like adsense, chitika, and others. [...]

[...] Shmula introduced me to two new terms today:  splogs and spings.  A splog is a blog only used to promote another site, and spings are the pings generated by such blogs.  The purpose is to generate ad clicks and to increase page ranking.  Shmula also pointed me to the UMBC eBiquity Research Group and their effort to detect spogs and spings.  According to their recent research, 75% of all new pings are spings.  Tags: blogosphere, spings, splogsThese icons link to social bookmarking sites where readers can share and discover new web pages. [...]

Leave a comment

(required)

(required)


Additional comments powered by BackType