#109 Joshua Sheppard, Data Science is Hard

Summary
Joshua Sheppard of Infinite Campus tells my about their data science and machine learning projects and how you can start your own.

Details
Who he is, what he does. What is data science, is a data scientist a role or a team, what skills are needed. Data vs big data. When does SQL + math become science, how to get started, Python, R and other languages; trying to follow software engineering principles when doing data science, testing, source control, etc. Azure and AWS machine learning, getting your data in to the cloud. Moving to production, scaling. Josh's data and insights into the school districts in Kentucky. Applying insights to other locations. Home baking your data science project vs leveraging the cloud platforms, it's all about access to data. Future of the field.

Links
Joshua's homepage
Joshua's Twitter

Download mp3 of podcast

#32 Eliot Knudsen, Tamr and a Brave New World of Data

Summary
Eliot Knudsen, field engineer at Tamr talks to me about their machine learning tool and a new way of examining data.

Details
Who he is and what he does; what is Tamr; working with data sources, the traditional way, the Tamr way, machine learning combined with human guidance;data quality and foreign languages; Thompson Reuters example, curating data, increasing speed; deploying Tamr; how Tamr works, db, java, web client; competitors; future work

Download mp3 of podcast