Friday, August 16, 2019

Python vs R Debate #analytics

One of the modules I teach is called "Programming for Big Data". It is normal when creating module descriptors not to be specific on what technologies will be used. So the module descriptor for this module does not specify what language will be used - it just states in general that "...programming languages such as R, Python, Java, etc" will be used. Last year I took over this module and decided to create a brand new set of course resources (I always do this when taking over a module, I just cannot use other lecturer's notes). As R is the preferred language in most other modules on our data analytics courses, and I was far more familiar with it - I decided to switch from Python, which had been used previously, to R.

Last evening at an Information Session for incoming students I was asked about this again, and would the students be learning Python. The answer is "No", and that we are continuing with R (students can choose in their Project module to use any language they wish). Today I decided to take a quick look at the 20th annual KDnuggets Software Poll (which had over 1,800 participants - so good sample size), and Python stands out the leading language. In 2017 Python and R were neck-and-neck (59% and 57% respectively), but this has changed in 2019 (66% for Python, and 47% for R). I am very glad to see that other technologies that we use in the Data Analytics programmes (RapidMiner, Excel/SQL/Tableau) feature strongly in the poll.

Image source: KDnuggets.
In this coming academic year, our Higher Diploma in Data Analytics (which I teach on), is due for its five yearly programmatic review. This means that the academic year  2019/2020 will be the last for the "old" version of our programme, and from September 2020 we will be starting with a "new" version. One of the biggest decisions we will have to make is whether we continue with R, or switch to Python as the main programming language. I am already favouring such a switch mainly for two reasons: first, the survey results below show a dramatic change, and secondly - Python is regarded as an easier language to learn than R. This is an important consideration in a one year conversion course like a Higher Diploma where a huge number of students will not have done programming before. 

Much debate ahead!

No comments:

Post a Comment