The failure of “Web 2.0″
I think this post by Greg Linden (pointing to two other posts, one by Jason Calcanis and the other by Xeni Jardin) highlights the fundamental problem with “Web 2.0″, which is that any system that gains...
View ArticleRead replication with MySQL
I have been following the thread about the death of read replication over on the Planet MySQL weblog with interest. In with this issue the notion of caching is thrown in to illustrate that it can be...
View ArticleRead replication with MySQL – part deux
Following up on my last post on read replication with MySQL, I read this post by Greg Linden on the subject of caching which mirrors my thinking on the matter (except that his is better written): My...
View ArticleMySQL engines, MyISAM vs. Innodb
I think Narayan Newton does a very good job of summarizing the pros and cons of MyISAM and Innodb in this post “MySQL engines, MyISAM vs. Innodb”. I have seen a lot written about this before but I...
View ArticleCrawling is indeed harder than it looks
Greg Linden (a must-read blog because he picks up new publications very quickly) has a good post aggregating a number of papers from WWW 2008 on crawling and why crawling is hard. I wrote the version...
View ArticleScaling MySQL at Facebook
By way of Greg Linden, some interesting notes and figures from various high traffic web sites on scaling MySQL. As Greg points out, Facebook’s strategy is to partition the data and spread it across a...
View ArticleCaching and system optimization
Greg Linden mentions an interesting paper out of Yahoo Research and presented at SIGIR 2008 “ResIn: A Combination of Results Caching and Index Pruning for High-performance Web Search Engines”. The...
View ArticleMySQL 5.1.29 Released
MySQL 5.1.29 was just released, the final release candidate on the way to general availability. I have been running 5.1.28 for a while with no issues. The main change I see here (for me at least) is...
View ArticleMySQL 5.1 Goes GA, Finally…
I was happy to see that MySQL 5.1 went GA on November 27th, it has taken a very long time to get there. I use it for my current project, in fact I selected it over 5.0 in the expectation that it would...
View ArticleDetecting Spam just from HTTP headers
By way of Geeking with Greg, a paper on detecting spam just from HTTP headers, “Predicting Web Spam with HTTP Session Information” (PDF). This is very interesting and I will be taking a close look at...
View Article
More Pages to Explore .....