Some of our blog post contain affiliate links.

Recent Posts



Wine Quality Prediction

Problem Here I’m looking at samples of Vinho Verde red wines from Portugal. The idea is to work out which of the variables is most important in determining the quality of the red wine on a scale from 0-10, this is the output variable. The input variables based are: Input variables (based on physicochemical tests): 1 - fixed acidity 2 - volatile acidity 3 - citric acid 4 - residual sugar 5 - chlorides 6 - free sulfur dioxide 7 - total sulfur dioxide 8 - density 9 - pH 10 - sulphates 11 - alcohol Applications There are a few wineries, gin factories and rum factories around where I live. They would employ lab tech

Become A Senior Data Scientist

How can you go from data scientist to senior data scientist? You need to work on these skills: communication & presentation, mentor othe

Build Yourself a Digital Bank

There’s plenty of lazy money to be made in banking, either retail banking or commercial banking. There, I said it! Here I step you through

Machine Learning A-Z™: Hands-On Python & R In Data Science Part 2

Hi Data Friends! So, I was able to finish the second half of the course, missed a week when my folks visited and we went on a holiday. I was at 51% before, but powered through the rest on the weekend. What I covered Part 4 Clustering K-Means Clustering Hierarchical Clustering Part 5 Association Rule Learning Apriori Eclat Part 6 Reinforcement Learning Upper Confidence Bound (UCB) Thompson Sampling Part 7 Natural Language Processing Part 8 Deep Learning Artificial Neural Networks Convolutional Neural Networks Part 9 Dimensionality Reduction Principal Component Analysis Linear Discriminant Analysis Kernel PCA Part 10 Model Selection and Boosting XGBoost What did I get out of it So, as a data s

Is it a Startup or Small Business?

People seem to be really confused about the difference between a 'startup' and a 'small business' to the demise of everyone involved. There are so many examples of people claiming to be a founder of a startup, but they are actually running a small business. It isn't their fault, they just don't understand what the difference between a startup and a small business is and being a 'founder' of a 'startup' is ultra chic these days. A few years ago I took a wonderful course called “Startup Engineering” on Coursera. The course no longer exists, but here are some notes and the video lectures:

Skill Up for Data Science

Learning how to learn Learning how to learn is a real skill. Two courses I found to be very helpful with everything from planning, deliberate practice, dealing with procrastination and so much more: Learning How to Learn: Powerful mental tools to help you master tough subjects Mindshift: Break Through Obstacles to Learning and Discover Your Hidden Potential Google is waaaay smart If you have a problem, or you can’t work out how to do something just Google it. Seriously, you won’t lose marks. More than likely some other poor traveller has been down the same path and arrived at the same issue. Leverage the collective power of the crowd to solve your problems! Basic skills all data scientists n

Visual Studio for Data Science

Should you use VS for Data Science? Actually, no – you shouldn’t use Visual Studio if you are doing exploratory work, but if you are shipping data products into a production system then yes you really should! If you are doing exploratory work or investigative work there are better IDEs out there for you to use. R Studio or Spyder will do nicely. If you would like to document your code, graphs and text in one place for reproducibility, team learning etc, well use Jupter labs or Jupyter notebooks or R Markdown. What are the benefits of Visual Studio for Data Science? Visual Studio is an IDE for the developer, data scientists are not developers, but in some roles the line between dev and data s