Everybody wants to jump on the data science wagon and use all this data they have been collecting for years. Few know what data science entails, and what data you need in order to solve their issue. This is a talk for everyone that wants to add a data science team to their company, is looking to cash in on the whole machine learning deal, or has had to experience the challenges of not getting the data they need.
- Most often you will not get the data you need or want
- Most IRL data is bad. You have to learn to live with that.
- What the business wants is different from what you will do.
- Data is proxy for something real. There are many proxies for the same thing