DevOps for Data Science – Automated Testing

I have a series of posts on DevOps for Data Science where I am covering a set of concepts for a DevOps “Maturity Model” – a list of things you can do, in order, that will set you on the path for implementing DevOps in Data Science. In this article, I'll cover the next maturity … Continue reading DevOps for Data Science – Automated Testing

Sponsored Post Learn from the experts: Create a successful blog with our brand new courseThe WordPress.com Blog

Are you new to blogging, and do you want step-by-step guidance on how to publish and grow your blog? Learn more about our new Blogging for Beginners course and get 50% off through December 10th.

DevOps for Data Science – Continuous Integration

In the previous post in this series on DevOps for Data Science, I covered the first the concept in a DevOps “Maturity Model” – a list of things you can do, in order, that will set you on the path for implementing DevOps in Data Science. The first thing you can do in your projects … Continue reading DevOps for Data Science – Continuous Integration

DevOps for Data Science – Infrastructure as Code

In the previous post in this series on DevOps for Data Science, I explained that it’s often difficult to try and implement all of the DevOps practices and tools at one time. I introduced the concept of a “Maturity Model” – a list of things you can do, in order, that will set you on … Continue reading DevOps for Data Science – Infrastructure as Code

Ethics and the Importance of Being an Information Skeptic

Whenever I teach or present a session on Artificial Intelligence, I start with Ethics. We've created a site where you can quickly walk through a few of the major principles we follow at Microsoft for AI here: http://aka.ms/ai-ethics. I walk through these principles before I show how to design a Machine Learning solution, and then … Continue reading Ethics and the Importance of Being an Information Skeptic

The Keys to Effective Data Science Projects – Part 10: Project Close-Out with the TDSP

Data Science projects have a lot in common with other IT projects in general, and with Business Intelligence in particular. There are differences, however, and I’ve covered those for you here in this series on The Keys to Effective Data Science Projects. One of those areas where general projects and Data Science projects are similar … Continue reading The Keys to Effective Data Science Projects – Part 10: Project Close-Out with the TDSP

The Keys to Effective Data Science Projects – Part 9: Testing and Validation

We’re continuing our discussion of the series of the Keys to Effective Data Science Projects,  this time focusing on Testing and Validating the Model. We're in the general phase in the Team Data Science Process called "Customer Acceptance". "Testing" in the general sense is the same in Data Science projects and any other typical software project - … Continue reading The Keys to Effective Data Science Projects – Part 9: Testing and Validation

The Keys to Effective Data Science Projects – Part 8: Operationalize

We’re in part eight on our journey through the series of the Keys to Effective Data Science Projects -"Operationalization" - a term only a marketer could love. It really just means "people using your solution". And it's this part of the process that is quite possibly the most complicated, and usually the one done with the … Continue reading The Keys to Effective Data Science Projects – Part 8: Operationalize

The Keys to Effective Data Science Projects – Part 7: Create and Train the Model

We’re in part seven on our series of the Keys to Effective Data Science Projects.  This is the section that most people think of when they think of "Data Science". It's where we take the question, the source data which has been turned into the proper Features (and potentially Labels), and select an algorithm or two … Continue reading The Keys to Effective Data Science Projects – Part 7: Create and Train the Model

The Keys to Effective Data Science Projects – Part 6: Feature Selection

We're in part six on our series of the Keys to Effective Data Science Projects. I won't cover basic Feature Engineering in this article - it's a huge topic and central to working in Machine Learning areas. I do recommend you check out as many articles as you can find on the subject, and once … Continue reading The Keys to Effective Data Science Projects – Part 6: Feature Selection

The Keys to Effective Data Science Projects – Part 5: Update the Data

In this series on the “Keys to Effective Data Science Projects”, we've seen a process we can use, we've determined what we want to know, and we've ingested the data. In the last step we explored the data, and in a different way than we might be used to when working with in a database … Continue reading The Keys to Effective Data Science Projects – Part 5: Update the Data