At OnLamp: Top Ten Data Crunching Tips and Tricks, a cursory overview of some data crunching and munging techniques (though maybe not the top ten). The reader comments there are awfully cranky, but it’s still a good read.
Every day, all over the world, programmers have to recycle legacy data, translate from one vendor’s proprietary format into another’s, check configuration files, and yank data out of web server logs. This kind of programming is usually called data crunching, and while it’s not glamorous, knowing how to do it with the least amount of effort can make the difference between meeting a deadline and making another pot of coffee. These ten tips will take the headache out of crunching data.