Technical people complain about tech reporters. They gloss over details. They ask dumb questions. They don’t understand why we are the most important people they’ve ever talked to.
Some of those criticisms are baseless. Technical reporters have a really hard job. They write about complicated subjects for a really broad audience, so they have to try to both get it right and make it interesting (or at least comprehensible). The sad fact is that most tech journalists fail on both accounts. This happens in part because tech journalists just don’t have the serious technology background that they probably need to write accurately and coherently about the things they write about. And it also happens because in an effort to be more comprensible and relevant, many journalists gloss over or flat out change details that matter.
A funny, and frustrating, example of this occurred recently when Jason Perlow tried to explain why Limelight Networks was better than Akamai at serving Olympics content for NBC. Perlow stacks generalization upon inaccuracy and ends up mistating the facts and miseducating readers. All of this can be clearly, and simply detangled.
Perlow is trying to explain why NBC chose Limelight Networks (Autonomous System Number 22822) over Akamai Technologies (ASN 12222 and 20940 among others) to serve streaming content of the Olympics. Perlow writes:
“Akamai uses a centralized data hosting infrastructure with big Internet pipes that mirrors content that is hosted on a customer’s own servers. Usually with the aid of a special caching appliance installed at the customer’s ISP or edge network, the request to download that content is re-directed to Akamai’s own servers and fat Internet pipes. When you download big ISO CD and DVD images from MSDN, its going right to Akamai’s data centers over the public Internet. As fat as Akamai’s pipes are, I’ve seen MSDN’s downloads slow to a crawl during peak download periods, such as the days following Windows XP SP3 and Windows Vista SP1’s release. So like my colleagues here at ZDNet, I was expecting the worst.”
So far, Perlow is not even wrong, a phrase coined by Wolfgang Pauli, but which I heard first from Thomas Pogge, a stellar and compelling philosopher who tried valiantly to teach me some Kant long, long ago. That is to say, Perlow isn’t making enough sense yet to be clearly refutable, since it’s unclear what he could possibly mean. The gulf between the truth and what he writes is so large that the brain of any reasonbly technical person struggles to reinterpret everything he writes in some way that might be closer to reality.
Akamai most certainly does not have a “centralized data hosting infrastructure”. In fact, Akamai has one of the mostly widely distributed hosting infrastructures (probably the most widely distributed hosting infrastructure) on the planet. Akamai grew up during the go-go days of the late 1990s when space and power were cheap and bandwidth was expensive. Akamai pitched Internet Service Providers (IPSs) on a value proposition of allowing Akamai to put servers in your space and in exchange serving cached/distributed content to your customers (thereby saving bandwidth).
When I was Chief Technology Officer at Oso Grande Technologies, we ultimately cut a deal with Akamai to host servers with us because we thought it would save us bandwidth. It did. Hundreds of other ISPs made the same decision for the same reason. According to Akamai, “Akamai has the most pervasive platform for content delivery and application acceleration – 34000 servers in 70 countries within nearly 950 networks.” That’s not centralized, hoss.
Limelight, on the other hand, tends to pay for colocation and put their servers in a relatively smaller number of locations (although still a massive number of servers). This difference in architecture is probably for at least two reasons. The first is due to when they scaled out. While Akamai was building out a Content Distribution/Delivery Network (CDN) in the late 1990s and early 2000s, Limelight was still just a Phoenix-based ISP who mostly just resold Global Crossing bandwidth. Limelight didn’t find their true identity as a CDN until much later. So they were faced with a very different economic and industry landscape. By the time Limelight was building out latency was lower overall, speeds were higher, bandwidth was cheaper and power more expensive.
The second reason is that Limelight specializes in streaming large files: videos and downloads. These files are much less sensitive to latency than the small picture and html files that Akamai was originally architected for and may also benefit from centralization of storage (to improve cache-locality). That last is a slightly subtle point and if it isn’t obvious to you, ignore it. Doesn’t matter. Anyway…
Perlow goes on to write:
Where Limelight differs from Akamai and why the Internet didn’t “melt” is quite simple — they are completely “off the cloud”. In other words, unlike Akamai and similar content caching providers, their system isn’t deployed over the public Internet.
There are many differences between Limelight and Akamai. Some of those differences are even relevant to performance and scale. Not only is that difference not related to performance or scale, it isn’t even completely true. Let’s review what he might be talking about.
CDNs replicate content “closer” to users and serve that content up quickly—more quickly than the originator could have served it to that user. CDNs serve that content on the Internet, of course, so obviously any CDN (including Limelight) is “deployed over the public Internet”. It would be idiocy to suggest otherwise.
What Perlow might mean is that Limelight replicates to each cluster over their backbone whereas Akamai replicates over the Internet. That much is true. He got something almost right! Akamai does not have a backbone and rather they rely on their colocation and ISP partners (and transit-providers’) bandwidth to replicate content to their server clusters. Who cares? In order for that to matter you would have to believe in a world where the first copy of the source content to the server cluster was routinely failing. That doesn’t make any sense.
The source content is copied out to many, many server cluster who then serve that content hundreds of thousands of times to customers. Those server clusters are heavy on the outbound-bandwidth and very, very light on inbound bandwidth, since the bits going out go to customers and the bits coming in go to the servers. Since data circuits on the Internet (except for at your house) are symmetrical, there is always a surfeit of inbound bandwidth available. Whether you copy the data across a backbone or across the Internet, it’s going to get there. This difference between the two companies has nothing to do with whether end-user performance is slow or fast.
In the end, we’re left to wonder: does Perlow understand anything about Internet routing and CDNs at all, or did he just confuse users by glossing over details that he actually does understand? At this point, we don’t know…
[Disclaimer: I know and have long relationships with people from both Akamai and Limelight in my day job at Renesys. They're both good companies doing interesting work. I don't have a particular opinion over which CDN is better since I use neither. But both deserve more accurate coverage than this.]
5 responses so far ↓
1 Burgher Jon // Aug 20, 2008 at 7:31
Interesting, Perlow is a colleague of mine. I know little about CDNs, but it was odd to see a picture of Justin on the web.
Apparently he’s a better consultant (where I am a colleague of his) then he is ZDnet CDN reviewer.
2 todd // Aug 20, 2008 at 8:43
I hope I wasn’t too harsh on Perlow. He just happened to write something really, really confused on a subject I happen to know a fair bit about. Jon may have hit on another factor making the technical press inaccurate and confused: the need to offer broad coverage. It’s possible that Perlow is, as Jon says he is, very talented and clueful in some areas of technology. It seems likely that those areas do not include large-scale network engineering or content delivery. But when that’s the news and there’s a column due, maybe he just gets stuck writing about it.
I used to write reviews for ZD Net back when they had a separate Linux coverage area. Mostly I dealt with server clustering and load balancing product reviews. It was fun work, but I have to admit that I knew a lot more about those products after writing the reviews (and playing with a bunch of different solutions) than I did before.
3 Jason Perlow // Aug 20, 2008 at 10:33
While I agree with your notion that technical reporters are frequently under deadline and miss important details, I have to contest your assertions that I do not understand the CDN industry or the implementation of large enterprise networks.
For what its worth, Limelight fiercely contests Akamai’s response to me, which I published in full on my blog.
http://blogs.zdnet.com/perlow/?p=9224
While it is true that Akamai is a “Pull” infrastructure using distributed edge appliances, it is certainly not possible for them to cache huge amounts of videos or very large files at the ISP level, which is why a solution like Limelight was preferred for the Olympics video streaming. Only the most frequently requested files are cached on the appliances — so in effect, Akamai does sometimes have to go back to their centralized infrastructure in order to replicate very large files and serve them out. For an architecture such as MSDN, where you are talking about ISOs that everyone is trying to download at once, it makes sense to have distributed appliances doing content pulls. For the Olympics video streaming where you are talking about massive amounts of data that would have to be hosted on an edge server it just wouldn’t work. However, its obvious that even Microsoft is seeing the advantages of Limelight over Akamai because they have switched to them for Patch Tuesdays. It would be great if MSDN utilized them as well, because what they have implemented using Akamai is not working during volume high-demand situations.
4 Content Delivery Network Blog | CDN Evangelist » Blog Archive » Technical Accuracy: Why the Tech Press Just Can’t Get It (even close to) Right // Aug 20, 2008 at 11:29
[...] http://rustvalley.com/2008/08/19/technical-accuracy-why-the-tech-press-just-cant-get-it-even-close-t... This entry was posted on Wednesday, August 20th, 2008 at 10:29 am and is filed under CDN. You can [...]
5 todd // Aug 20, 2008 at 11:30
Thanks for clarifying, Jason. And thanks for pointing out the Akamai response and the Limelight meta-response. It’s a shame that neither is particularly relevant to the critique that I raised. Both responses are myopic and neither clarifies the central critique you raised: is it the case that Akamai’s architecture prevents it from being able to reliably serve large files?
Let’s go back and review the claims. First you argue that Akamai is “centralized”. Then you say that it’s the very fact that it’s distributed that is the problem. Which is exactly what Limelight is saying. Specifically, Limelight and Akamai are both acknowledging that Akamai has a much more distributed architecture. Let’s dig into that.
I referred to this exact point in my posting: Akamai, by virtue of its incredibly distributed architecture has potential problems with cache locality for large files. That is to say, since there is a limited amount of storage at each local node, it may be somewhat less likely that content that a user wants is already cached at the most local node (assuming there is some variant of a LRU (least recently used) caching algorithm in effect).
But now you’re making my point. It is Limelight, not Akamai, that is more centralized in a smaller number of locations. This isn’t a slight against either company, just a difference. Moreover, Akamai has been busily adding higher-storage-capacity more “centralized” nodes for years in order to deal with big storage customers. Like iTunes (all the songs of which are still served by Akamai, as far as I know). My understanding is that Akamai solves this problem by having some requests punted to regionally distributed (but not as far out to the edge) nodes that have larger storage capacity. This overlay starts to look suspiciously like Limelight’s architecture, at some point.
Jason implies, again, that Limelight somehow scales better than Akamai due to architectural reasons. I’ve never seen this claim before and I believe it to be patently untrue. Akamai’s customers are extremely demanding (as are Limelight’s) and it’s fairly ridiculous to assume that Akamai has some kind of endemic performance problem with large files.
It is true that Limelight specializes in hosting streaming video (growing up from their original pornography customers into mainstream business) and that their network architecture is optimized to host regionally centralized large files. They do a great job with that.
At some higher level of detail, this seems like a marketing dust-up that Jason got caught in the middle of. It sounds like Limelight is putting out PR saying “we got the olympics because Akamai just couldn’t handle it”. And it seems like some people in the press bought that. Too bad, cause it’s simply not true. Either company could easily handle the Olympics in terms of network and server scale. There are lots of other CDNs that could not, but these are not they.
The decision by NBC was almost certainly not made on the basis of ability to meet the requests. It was made on a combination of peformance, features and ultimately cost. Limelight is known for being quite a bit cheaper than Akamai and my guess is that played a big role in the decision. This is likewise true for Microsoft’s decision to use Limelight for some downloads.
Leave a Comment