w3bookmarks.com - bookmarks of web development and design tutorials
Data scientists working with R and Python, as well as anybody looking for interesting, new-ish, high-performance programming languages should look into the...
Data Science for Business – by Foster Provost and Tom Fawcett
In a nutshell: If you are looking for a simple (but not
simplistic) introduction to nearly...
In my last post I discussed the count-min sketch data structure that can be used to process data streams using sub-linear space. In this post I will...
Some writings worth reading:
“Spam names” http://andrewgelman.com/… via @cynorrhodon“Degrees of Value: Making College Pay Off”...
The index I created for the exercise is just a text file, sorted by
the indexed key. When doing a search by a human, that makes it very easy
to work with....
 



When I pulled the over 5,000 datasets from 22 federal agencies after the implementation of OMB Memorandum M-13-13 Open Data...
The interesting thing about this problem is that I was very careful
in how I phrased things. I said what I wanted to happen, but didn’t
specify what...
Apache Spark is generating quite some buzz right now. Databricks, the company founded to support Spark raised $14M from Andreessen Horowitz, Cloudera has...
What is Pligg?
Pligg is an open source content management system that lets you easily create your own user-powered website.