The CRM114 discriminator

“… Now, in order to prevent the enemy from issuing fake or confusing orders, the CRM114 Discriminator is designed not to receive at all… That is, not unless the message is preceded by the proper three-letter code group.”

–George C. Scott, playing the role of General Buck Turgidson, in Stanley Kubrick’s Dr. Strangelove

One of my favorite stories about Dr. Strangelove is that Kubrick apparently asked George C. Scott to do several takes of each scene and, in one of those takes, to be completely over the top, totally outrageous. Kubrick then used the over-the-top takes almost exclusively in the final edit of the film.

What that story has to do with the automated machine learning library CRM114, other than the obvious link to Dr. Strangelove, I don’t know. However, CRM114 is fricken amazing. In the words of its creator, Bill Yerazunis, CRM114 is like “grep was bitten by some radioactive spider”.

Essentially, if you have some stream of documents, data, bits or whatever and you’d like to automatically classify it into categories, CRM114 is your new bicycle. CRM114 learns, and it learns fast, how to accurately categorize your stream of data, with accuracies of up to 99.9% (with a little bit of love). It’s a Swiss Army knife of machine learning.

