BigData White Papers
I don't know about you, but I always like to read the white papers that originate OpenSource projects (when available of course :) ).   I have been working with BigData quite a lot lately and this area is mostly dominated by Apache OpenSource projects.    So, naturally (given the nerd that I am) I tried to investigate their history. I created a list of articles and companies that originated most BigData Apache projects.   Here it is! Hope you guys find it interesting too. :)      Apache Hadoop      Based on: Google MapReduce and GFS    Papers:   https://static.googleusercontent.com/media/research.google.com/en//archive/mapreduce-osdi04.pdf   https://static.googleusercontent.com/media/research.google.com/en//archive/gfs-sosp2003.pdf        Apache Spark       Created by: University of California, Berkeley    Papers:      http://people.csail.mit.edu/matei/papers/2012/nsdi_spark.pdf   http://people.csail.mit.edu/matei/papers/2010/hotcloud_spark.pdf   http://peo...
 
