the median of a trillion numbers

the question

the base algorithm

distributing

generating test data

ruby implementation

brutally short introduction to erlang

single process erlang implementation

multiple process erlang implementation

performance comparisons

running on amazon ec2

conclusion

 

code available at github.com/matpalm/median

other projects

nov 2008
me on twitter
me on google+