Twitter as Combinatorics

a.jpg Ever since I discovered Twitter, I’ve been amazed at @ev, @biz, and @jack’s idea of simplicity and usefulness.  Lately, @windley (Phil Windley’s article), @monkchips, and JP have approached Twitter from a more theoretical perspective.  This article is my contribution to that healthy conversation (this blog post will be followed by a short tweet, of course).

~~~~~~~~~~~~~~

James Governor attempts to define a pattern found in Twitter and other social media and calls it “Asymmetric Follow”.  He defines Asymmetric Follow as the following:

Asymmetric Follow is a core pattern for Web 2.0, in which a social network user can have many people following them without a need for reciprocity.  Asymmetric Follow is unlike email for example, which tends to be within small groups, with all users knowing each other (newsletters are a clear exception here). If you see a social network where someone has 5000 followers and only follows 150 back – that’s Asymmetric Follow.

So, if I were to explain James’ definition to a teenager 1 , I might say something like:

“Asymmetric Follow is when the popular kid in school is admired by a bunch of less popular kids and when the popular kid speaks, everyone usually listens and when the less-popular kids speak, the popular kid can choose to listen or respond or do nothing”.

If I’ve been charitable in my understanding and summary of James’ definition of Asymmetric Follow, then his explanation and definition makes sense to me.  From my experience as a former High School student of the less-popular type, that’s how life was.

I have to note, however, that James’ definition confounds the behavioral economics definition of Information Asymmetry — but he means something different.  The historical definition has more to do with the direction of communication and the “stickiness” of that information and how that “stickiness” can impact decisions and rational choice.  James Governor doesn’t address the definition in behavioral economics but does use a similar term.

Twitter as Combinatorics

But, let’s quantify what James is talking about.  In fact, when we do, we’ll find out that it’s actually basic combinatorics.

Suppose there are persons A and B, who follow each other.  In this scenario there are 2 communication links (AB, BA).  Add person C who follows and is followed-by persons A and B, now we have 6 communication links, (ABC, ACB, BCA, BAC, CAB, CBA).  So, inductively, as inter-followership 2 permutation grows, the raw combinatorial communication link counts grows quadratically, not linearly.

To demonstrate this, we use basic statistics of the form n-choose-r, where !, such as n!, is equivalent to n factorial, to arrive at the formula for how many pairs or permutations we can choose from n items:

a.jpg

For the number of pairs, we can reduce the above formula to the following:

b.jpg

Visually, as inter-followership grows, the communication links grows non-linearly, but quadratically (n! grows exponentially) — in either case, the function is clearly not linear:

twitter-as-combinatorics.jpg

Mutually Exclusive, Comprehensively Exhaustive (MECE)

JP runs a really fun experiment that validates his hypothesis that tweets in the universe of Twitter are comprehensively exhaustive 3.  What his experiment does not show is the exclusiveness of the tweets — that is, their uniqueness from each other.  On its face, this is not a big deal, but in scientific inquiry, being able to compartmentalize objects in unique buckets is helpful.

One reason it is difficult to classify tweets as mutually exclusive in content is because there are Replies and Retweets.  There is probably an innovative way to find the unique and mutually exclusive clusters in the corpus that is Twitter — that would be fun work for a computational linguist.

For this post, this is not a big deal, but I just make this point for clarity — really great experiment, JP.

A Conclusion

Ummm, I don’t really have a conclusion or a point, except for that I think Twitter is pretty amazing and that Twitter can and should encourage computer scientists, computational linguists, behavioral economists, combinatorial mathematicians, set theory geekzoids, game theory freakonomica, cultural anthropologists, and others to participate in and learn from this massively human experiment.

I really like Twitter — oh, by the way — retweet this post…

  1. my personal criteria for an atomic and pragmatic definition of a concept is if it can be explained to normally-functioning-and-average human that is 15 human years or younger
  2. I make a distinction between inter-followership and intra-followership where the former is a set where each member follows each other and the latter is a set where the followership is disjointed.  However, for the purposes of Twitter, inter-followership and intra-followership doesn’t matter so much since a follower has the same rank as the non-follower to the one being followed — their voices are not weighted differently
  3. This is my term that I use to explain his point, but he does not use the terms Mutually Exclusive or Comprehensively Exhaustive in his writings.

Short URL: http://bit.ly/tPY3d

Share This Post:



  • Digg
  • Facebook
  • del.icio.us
  • Suggest to Techmeme via Twitter
  • StumbleUpon
  • LinkedIn
  • email

2-pizza teams (10)
3 C's (3)
5S (38)
A3 Report (9)
adoption (7)
agile/software (59)
ajax (4)
amazon (53)
apple (3)
apple iphone (7)
axiom (3)
Aza Raskin (9)
backcountry.com (2)
berlin (1)
bill gates (1)
bill marriott (1)
blog tag (1)
book reviews (4)
bullwhip effect (5)
business (394)
business plans (3)
busm361 (13)
BzzAgent (12)
call center and queueing (11)
car buying (2)
Carbonite (1)
change management (5)
chicago (1)
click fraud (1)
click-to-ship (21)
clocky (2)
colin powell (2)
community (2)
company interviews (18)
company interviews (6)
complexity (32)
costs (8)
culture (7)
customer experience (10)
customer obsession (52)
customer recovery function (1)
customer segmentation (8)
customer service (17)
design thinking (14)
digg (4)
drum-buffer-rope (38)
dublin (1)
dynamic systems (24)
eBay (6)
economics (3)
efficiency (4)
ethnography (29)
family (18)
featuritis (15)
flexibility (1)
forecasting (2)
four performance dimensions (2)
Fun With The 2×2 Matrix (1)
game theory (7)
Gemba (67)
genchi genbutsu (68)
general (135)
germany (1)
google (15)
heijunka (65)
holidays (1)
hoshin kanri (1)
how to be a human (1)
IDEO (2)
image uploading (1)
iphone (5)
ishikawa (69)
IT at Toyota (67)
just-in-time (4)
kaizen (4)
kanban (46)
law of instinct (1)
Leadership (43)
lean (165)
Lean Consumption Maps (98)
learning curve (1)
licketyship (1)
mark cuban (1)
martin luther king (1)
mary poppendieck (1)
metrics (73)
microsoft (6)
milton friedman (1)
moving average (1)
muda (68)
nba fines (1)
net promoter score (nps) (1)
obeya (39)
Off-Topic (1)
onstar (1)
operations (108)
pageviews (3)
pareto principle (39)
patent (1)
peanut butter manifesto (2)
philosophy (3)
Poka-Yoke (6)
poppendieck (3)
powerpoint sucks (2)
private equity (4)
process measures (6)
product development (20)
productivity (4)
quality (41)
quasimodal design (1)
queueing theory (41)
Raffle (1)
rational choice (2)
regression analysis (18)
respect for people (6)
root cause analysis (60)
sarah+palin (2)
seth godin (1)
simplicity principle (10)
six sigma (128)
snowboarding (2)
social media (3)
spam (1)
statistical process control (46)
strategy (46)
suburban (1)
supply chain (24)
takt time (8)
teaching (2)
team size (9)
technology (104)
the beer distribution game (1)
the profit tree (7)
The Visual Factory (11)
theory of constraints (41)
time (2)
timeline (3)
tony+hsieh (11)
toyota (75)
travel (1)
trump bankruptcy (1)
turnaround (5)
twitter (8)
uspto (1)
utah deal flow (2)
variation (69)
venture capital (1)
Visual Management (11)
waste (59)
website traffic (2)
Wing Chun (2)
wisdom of crowds (1)
wisdom teeth (1)
word-of-mouth marketing (18)
yahoo (2)
zappos.com (12)
zero defects (3)

WP Cumulus Flash tag cloud by Roy Tanck and Luke Morton requires Flash Player 9 or better.


If you enjoyed this post, please consider to leave a comment or subscribe to the feed and get future articles delivered to your feed reader.

Comments

RT: twitter with a scientific, mathematical twist: http://tinyurl.com/9hu8vy by @shmula Nice, geeky read.

RT @scottjedwards RT: twitter with a scientific, mathematical twist: http://tinyurl.com/9hu8vy by @shmula Nice, geeky read.

Your equations seem solid enough. What may be interesting is to figure out the ranking of someone’s ‘assymetric following’, which could be an indicator of their popularity.
1. If we assumed that the baseline model would be that everyone in a group connects to everyone else in that group, then we would have the n(n-1)/2 model. Let’s call this number B (for Baseline).
2. However, if someone has a huge following, then their number of connections would deviate from that baseline. As you state: [n r] [n!/r!(n-r)!]
3. Because r would be lower, r!(n-r)! would be smaller; the overall equation would be higher number. Let’s call this number C (for Cool factor).
4. The difference between B and C would be the degree to which someone has an assymetric following. It may be a way to rank their cult-like personality.

Mathematic implications of being followed versus following on Twitter. http://is.gd/dZLG

Great Henri Poincaré’s beard, you have made my day! I see you have more like this; amazing. Looking forward to plunging into the archives.

@superfactory @leanblog @lssacademy twitter does have some serious geekiness behind it – even combinatorics: http://is.gd/dZLG

Leave a comment

(required)

(required)


Additional comments powered by BackType