Some clarifications. 1. Java and Python are good languages for data mining, due ...

rjdagost · on Jan 5, 2013

Excel is the most common statistical analysis package on the planet, bar none. When you need to deliver code to people who are numerate but don't program, Excel is the best thing to use. Sometimes you just need to give someone (a manager) a tool where they can test out a bunch of different scenarios, but you don't have time to make a polished application. Even for "big data" applications, sometimes seeing the numbers in front of you in cells is quite useful.

elchief · on Jan 5, 2013

Yes, it is popular, but that doesn't make it good for statistical analysis. It also sucks compared to JMP.

On the accuracy of statistical procedures in Microsoft Excel 2007

http://or.nps.edu/faculty/PaulSanchez/oa4333/handouts/Excel/...

berkeleyjess · on Jan 5, 2013

Thanks for the corrections. I have changed the post to reflect the Hadoop/Hive/MapReduce mistake. I am still new at this tech thing and obviously don't know all the terminology yet.

elchief · on Jan 5, 2013

No worries, thanks for the post!

jwilliams · on Jan 5, 2013

1. I wouldn't discount Perl. Depending on the industry it can be quite prevalent & it's very common as glue in others.

elchief · on Jan 5, 2013

Which Perl data mining libraries do you use/recommend?

jwilliams · on Jan 6, 2013

The only one I've ever touched is BioPerl. It's well regarded in the bioinformatics sphere.