Subtitles section Play video
As I have an engineering background,
I started programming with C, then I went to Matlab,
and eventually to Python.
I usually use Matlab, C, and C++,
but for data science I use Python.
- R and Python.
- R.
- I'm an R evangelist.
I was at the useR Conference last year
and I think it's got one of the best communities,
I'm also very fond of SQL and I think people don't spend
enough time appreciating it in its various incarnations.
- I primarily work with R and Stata.
I do not work a lot with big data so for the kind
of data sets I have, there are a few million observations,
even the hundreds of millions of observations
then I can work with with the existing Stata and R and SPSS,
I don't have a problem with it,
but as I said, if I were to work with large data sets,
I would use different tools.
My preferred tools are the three; R, Stata, and SPSS.
I also work with spacial data a lot
so these are data sets which have a geographical
component to it, so imagine 40 million Californians
and 40 million people, some of them in California,
some of them in the neighboring states,
and what if I know the exact home address
of each and everyone of them and where they work.
And that would be an amazing GIS,
spacial geographic information systems database.
So I work with those as well and my tool that I use
is called Maptitude and MapInfo,
these are the two I use the most.