- Descriptive - describe a set of data
- Descriptions cannot be generalized without adding statistical modelling
- Exploratory - find relationships you didnt know about
- Exploratory analysis should not be alone used for generalizing/predicting
- Correlation doesnt imply causation
- Inferential - use a relatively small sample of data to say something about the bigger population
- Inference is the common goal of statistical analysis
- Inference involves estimating both the quantity we care about and the certainty of that estimate
- Inference depends heavily on both the population and the sampling scheme
- Predictive - To use the data in some objects to predict the data in other objects
- If X predicts Y then it doesnt mean that X causes Y
- More and more data works well with reasonable models
- Causal - To find what happens to one variable when you change another variable
- Usually randomized variables are used for causation
- There are approaches to infering causation in non-randomized studies, but they are complicated and sensitive to assumptions
- Causal relationships are identified as average effects, but may not apply to every individual
- Causal models are usually the gold standard for data analysis
- Mechanistic - physics
This blog is about my learnings in big data, product management and digital advertising.
Tuesday, August 5, 2014
Types of Questions asked to Data Scientists
Subscribe to:
Post Comments (Atom)
No comments:
Post a Comment