How did I become a data scientist?

New to data science?

Data Science

On my YouTube channel (Data Professor), I am often asked the following questions about how to break into data science:

  • How to become a Data Scientist?
  • What is the roadmap to being a Data Scientist?
  • What courses should I take to learn Data Science?

So I thought it would be a great idea to write an article about it, and here it is. Note that the 10 things I wish I knew about learning data science are based on my personal journey as a self-taught data scientist. …

Study Tips

One of the biggest challenges facing aspiring data scientists is consistency. If you are able to acquire incremental knowledge and skills in data science on a consistent basis, you’ll be amazed at how much knowledge you’ll accrue in a year or two.

If you want to succeed in your learning journey but keep doing the same thing you’ve always done, how can success find you? To get a different result, you need to take a different approach.

In this article, we will be exploring the 7 tips that you can use right now to effectively study…

Python for Data Science

So you’re embarking on your journey into data science and everyone recommends that you start by learning how to code. You decided on Python and are now paralyzed by the large pile of learning resources at your disposal. Perhaps you are overwhelmed and, owing to analysis paralysis, are putting off your first steps in learning how to code in Python.

In this article, I’ll be your guide and take you through the bare minimum of Python that you need to master in order to get started in data science. I will assume that…

Learning Resources

Are you preparing for a data interview, whether as a data scientist, data analyst or data engineer? YouTube is a great starting point, as there is a ton of free educational content that can help you in your data journey. With great abundance also comes the burden of choosing among the hundreds or thousands of channels out there. I’ve explored the YouTube space and curated a list of top channels that I think are great starting points in preparing for your career as a data professional.

In spite of the fact that…

The advent of big data and advances in computing have led to the emergence of Data Science as an all-encompassing field that can help extract knowledge and value from data. In recent years, its popularity has grown exponentially, and the accompanying hype has contributed to a large market need for data professionals. In light of this, there is a large void as to the best curriculum for training the new generation of data scientists. Data science is a unique field in that it is interdisciplinary and may take on a different meaning depending on the beholder.

In 2017, the Association for Computing…

Step-by-Step Tutorial

A picture is worth a thousand words, and so are the insights provided by graphs and plots. Data visualization is an important part of any data science project, as it allows effective data storytelling in the form of graphs and plots. If even static plots can convey important information and provide immense value, imagine what an animated plot can do to highlight particular aspects of the data.

Hans Rosling’s animated plot of data from Gapminder (which he founded), shown at his TED talks, has captivated us all as it brings data to life.

In this article, you…

Automated machine learning (AutoML) lowers the barrier to entry for machine learning model building by streamlining the process, thereby allowing non-technical users to harness the power of machine learning. At the same time, AutoML frees up the time that data scientists would otherwise have spent on repetitive pre-processing or model building tasks, allowing them to explore other areas of the data analytics pipeline.

In a nutshell, users can supply an input dataset to the AutoML system that it uses for model building (feature transformation, feature selection…

A common theme is that the people we know and the networks we keep are closely tied to who we become.

“Show Me Your Friends, and I’ll Tell You Who You Are.” — The Wah Wah Collective

“Show Me Your Friends and I’ll Show You Your Future.” — Chaplain Ronnie Melancon

“You’re the average of the five people you spend most of your time with.” — Jim Rohn

This applies not only to your friends in real life but also to those you follow on social media. In fact, influencers on social media are individuals who have established a…

Oftentimes, you build machine learning models using the default parameters. Yet in just a few blocks of code you can search for the best hyperparameters for your models. Why bother? Because the optimal set of hyperparameters can go a long way toward boosting the performance of your models.

In this article, you will learn how to perform hyperparameter tuning of the random forest model in Python using the scikit-learn library.

Note: This article was inspired by a YouTube video I made some time ago (Hyperparameter Tuning of Machine Learning Model in Python).
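
To give a rough idea of what such a search can look like, here is a minimal sketch that uses scikit-learn's GridSearchCV with a random forest classifier. It is not the code from the article or the video: the synthetic dataset, the choice of n_estimators and max_features as the hyperparameters to tune, and the candidate values are all assumptions made for illustration.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV, train_test_split

# Synthetic data stands in for a real dataset (assumption for illustration).
X, y = make_classification(
    n_samples=200, n_features=10, n_informative=5, random_state=42
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42
)

# Candidate values to try; this small grid is an arbitrary example.
param_grid = {
    "n_estimators": [10, 50],
    "max_features": [2, 4],
}

# Evaluate every combination in the grid with 3-fold cross-validation.
grid = GridSearchCV(
    estimator=RandomForestClassifier(random_state=42),
    param_grid=param_grid,
    cv=3,
)
grid.fit(X_train, y_train)

# Best combination found and its mean cross-validated accuracy.
print("Best parameters:", grid.best_params_)
print("Best CV accuracy:", grid.best_score_)

# By default the search refits the best model on the full training set,
# so it can be scored directly on the held-out test split.
test_accuracy = grid.score(X_test, y_test)
print("Test accuracy:", test_accuracy)
```

A grid search tries every combination, so its cost grows quickly with the number of candidate values. For larger search spaces, scikit-learn's RandomizedSearchCV samples a fixed number of combinations instead.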

1. Hyperparameters

In applied machine learning, tuning the…

