Reducing (although sadly not eliminating) bias in sample gathering

(Complete Table of Contents here: http://aka.ms/backyarddatascience) What, Why, How: To obtain the data for an analysis, a Data Scientist has two options: get all of the data (called a population, or "X") or a subset of the data (called a sample, or "x"). Most of the time the information …

The Data Scientist’s Computer

Journal: Everyone uses a computer for lots of things, from e-mail to chat, from gaming to office work. And yet, there are some specific needs a Data Scientist has for their primary system. While I don’t recommend a specific brand or model (these things change too quickly to make …

Microsoft R

What, Why, How: One of the most distinctive features of Data Science, as opposed to working with databases, Business Intelligence, or other data professions, is its heavy use of statistical methods. When computing science first appeared, programs and algorithms were created to deal with the large amounts …