Metric Mail uses data provided by Google Analytics. We wanted to estimate the share of websites that use it. There are some studies about the market share of analytics solutions, but they are using rather small samples.
Alexa provides a list of the Top 1 million domains…
We used their data as a starting point for our research. There may be issues about the reliablilty and accuracy of their data, but it is still the best source that is easily available.
Next we imported the data to Google App Engine’s data store…. importing a million rows takes a long time ;) Around two days. We needed to retrieve each site and then look for patterns in Google Analytics data. There is
- the identifier UA-12345-123 and
For more info on the Mapper API check out theirproject site
and the fantastic article onNick Johnson’s blog
Here are the results (8+ days of processing):
- Number of websites successfully checked: 883194 (responded within 10 seconds)
- Number of Google Analytics profiles: 441207
- Market share in the top one million: 49.95%
Have you ever tried dying to a pleasure voluntarily, not forcibly? Ordinarily when you die you don’t want to; death comes and takes you away; it is not a voluntary act. But have you ever tried dying voluntarily, easily, felt that sense of the abandonment of pleasure? Obviously not! Life is living, abundance, fullness, abandonment, not a sense of the ‘I’ having significance… If you experiment with dying to little things… you will see that your mind is capable of dying to many things, dying to all memories. Machines are taking over the functions of memory- but the human mind is something more…. But it cannot be that something else if it does not die to everything it knows…
- J. Krishnamurti, The Book of Life
I do not want to go gently into that good night! I want to fight the dying of the light!
That was my initial reaction. Then I re-read the passage. A mind burdened with memories is pinned to the past, cannot make room emotionally or cognitively for the present and future. Yes, some of the past, of memory, can and must be relinquished, die without agonizing over it. Far easier to do so with events that are mundane, not too personally significant. Yet some of the good memories as well as the bad need to die, fall off the stack of memory.
I interpreted the passage differently on my first reading. I thought it was metaphor, polite euphemism.
Book of Life: Have you ever tried dying to a pleasure voluntarily, not forcibly?
Me: Yes! Not EVERYONE watches Wired Pussy on the internet and does that forced cum thing!” Not day in and day out.
B of L: Ordinarily when you die you don’t want to
Me: Ordinarily when I climax, I DO want to.
B of L: But have you ever tried dying voluntarily, easily, felt that sense of the abandonment of pleasure?
Me: Yes! I HAVE tried to climax voluntarily, easily. Usually, I do! But it is the very opposite of “the abandonment of pleasure.” The pleasure is opulent and thick.
B of L: Life is living, abundance and fullness.
Now I am as confused again as when I started writing this.
IBM distinguished engineer Jeff Jonas kicked off GigaOm’s Big Data conference with Scary Data: Cellphones are generating 600 billion geolocation records a day….“The data is being de-identified, but they know where you spend your time and who you spend it with.”
Jonas highlighted what data privacy advocates have been ringing alarm bells about for years: We’re tracking ourselves, in ways that would terrify us if a government tried it. Anyone who owns a smartphone carries in their pocket a tracking device that knows — and broadcasts — where you are. And we don’t really know who is getting hold of those records.
One of his clients is experimenting with using geolocation logs to track how often and for how long people visit various retail outlets. If store traffic has declined in recent months, they can detect that pattern — before the retailer reports its quarterly earnings.
True anomymization is hard to pull off with large data sets. It took academic statistical researchers two weeks to ID individual Netflix subscribers in a supposedly anomymized set of 100 million movie reviews. See slideshow deck available here: “Re-identification is somewhat trivial.”
Geolocation logs can show where you spend your time, and who you spend it with. Jonas quipped: “I can give you a list of the 10 friends you’re around the most, and if you don’t recognize one of the names, they’re following you.”
Jonas says … it is playing a role in his project code-named G2 …