Create isolated Jupyter IPython kernels with pyenv and virtualenv

Everyone loves isolation: it makes our lives easier and our systems much more robust. Jupyter notebooks are no exception. Maybe you want to try a cutting-edge scientific library, or maybe your latest project's dependencies are simply incompatible with your current system setup. Whatever your situation, follow me in this simple tutorial on how to […]

Cross Validation done wrong

Cross validation is an essential tool in statistical learning for estimating the accuracy of your algorithm. Despite its great power, it carries some fundamental risks when done wrong, and these can badly bias your accuracy estimate. In this blog post I'll demonstrate, using the Python scikit-learn framework, how to avoid the biggest and […]

Data manipulation primitives in R and Python

Both R and Python are incredibly good tools for manipulating data, and their integration is becoming increasingly important. The latest tool for data manipulation in R is dplyr, whilst Python relies on pandas. In this blog post I'll show you the fundamental primitives for manipulating your dataframes with both libraries, highlighting their major advantages […]