Dec 3 2012

pbdR Updates - Distributed lm.fit() and More

wrathematics
Over the weekend, we updated all of the pbdR packages currently available on the CRAN.  The updates include tons of internal housecleaning as well as many new features. Notably, pbdBASE_0.1-1 and pbdDMAT_0.1-1 were released, which contain lm.fit() methods.  This function in particular has been available at my github for over a month, but didn't make its way to the CRAN until recently because of Read more >>

Oct 16 2012

R at 12,000 Cores

wrathematics
I am very happy to introduce a new set of packages that has just hit the CRAN. We are calling it the Programming with Big Data in R Project, or pbdR for short (or as I like to jokingly refer to it, 'pretty bad for dyslexics'). You can find out more about the pbdR project at http://r-pbd.org/ The packages are a natural programming framework that are, from the user's point of view, a very simple Read more >>

Aug 14 2012

Some Quirks of the R Language

wrathematics
R is my favorite programming language.  It's just so useful for getting work done.  Sometimes people will complain that R is a difficult language.  To me, this begs the questions:  difficult for what?  And for whom?  I personally think R is just about the easiest thing in the world for prototyping.  Meaning if you want to quickly crank out some result, R is king.  Now when you get into optimization, Read more >>

Jun 11 2012

Autoplot: Graphical Methods with ggplot2

wrathematics
Background As of ggplot2 0.9.0 released in March 2012, there is a new generic function autoplot.  This uses R's S3 methods (which is essentially oop for babies) to let you have some simple overloading of functions.  I'm not going to get deep into oop, because honestly we don't need to. The idea is very simple.  If I say "I'm sending a letter to my sister", you wouldn't be all, "Oh, which letter?  Read more >>

May 17 2012

Visualizing the CRAN: Graphing Package Dependencies

wrathematics
I had been meaning to start toying with the igraph package for a while. So a few weeks ago (lay off, I'm busy), I decided to grab a bunch of CRAN data about package dependencies. The easiest way that I could think to get this information was to just grab the html files for all the package descriptions and chop through them. Quick note before I forget: I'm not looking at any base packages. Only Read more >>

Apr 23 2012

Project Euler...in LaTeX?

wrathematics
I've been joking for a while now that I was going to start solving project euler problems in LaTeX.  Then today I finally did one.  So let's talk about solving Project Euler problem number 1 (the easy one) using only LaTeX. The problem asks you to sum up all the positive integers below 1000 which are divisible by 3 or 5 (or both).  Doing this in R is easy.  You could efficiently do which Read more >>

Apr 12 2012

Statistical Software Popularity on Google Scholar

wrathematics
Background (probably boring) Several months ago, my boss and I were discussing how he got the data for his software popularity article; the rest of the background discussion pertains to those plots, so I would recommend going over to take a look before continuing on (or just skip to the next section if you're impatient).  Specifically, we were talking about his figures 7 and 11.  Basically he was Read more >>

Mar 15 2012

A No BS Guide to the Basics of Parallelization in R

wrathematics
What is parallelization?Parallelization is using multiple processing cores to, hopefully, make your programs run faster than serial code, which is the use of just one processing core. Parallel code is not always faster than its serial counterpart (but if you're doing it right and you're careful about what you parallelize, it will be --- remember, that's your goal here).  Don't rush to parallelize Read more >>