• Skip to main content
  • Skip to footer

Starbridge Partners

Data Science Recruiters

  • About Us
  • Open Data Science Roles
  • Data Science Report!
  • Testimonials
  • Contact Us

Edu-Video |”Building a Production Machine Learning Infrastructure” – Josh Wills /Dir. Data Scientist

Go Back

February 9, 2017 By Ted OBrien

josh wills bannerThis talk was given at Midwest.io 

VIDEO BELOW

Josh Wills, Director of Data Science at Cloudera has a gift for making fairly complicated technology explanations very digestible to the novice and intermediary techie. What I most love about this video is how Josh explains -very clearly – the issue of translating analytics Machine Learning on a large set of data records (see: individuals) and making it work in a production environment on one individual (think eCommerce).  It’s going from a SQL/R/SAS type of environment (pure analysis) to a Java, Scala, C++ programming environment (actual site) and how to deal with that effectively.

“The Data Science Team at Cloudera has a simple mission: build an analytics infrastructure so awesome that it makes Google’s Ads Quality Team seethe with jealousy. To that end, I’ll give an overview of Cloudera’s current data science tools, including Oryx and Spark for building and serving machine learning models, Gertrude for multivariate testing, and Impala for ludicrously high-performance SQL queries against HDFS.” – Josh

About the Speaker

Josh Wills is Cloudera’s Senior Director of Data Science, working with customers and engineers to develop Hadoop-based solutions across a wide-range of industries. He is the founder and VP of the Apache Crunch project for creating optimized MapReduce pipelines in Java and lead developer of Cloudera ML, a set of open-source libraries and command-line tools for building machine learning models on Hadoop. Prior to joining Cloudera, Josh worked at Google, where he worked on the ad auction system and then led the development of the analytics infrastructure used in Google+.

Share this:

  • Click to share on Twitter (Opens in new window)
  • Click to share on Facebook (Opens in new window)
  • More
  • Click to share on LinkedIn (Opens in new window)
  • Click to share on Reddit (Opens in new window)

Filed Under: Analytics, Data Science, eCommerce, Edu-Videos, Events & Meetups, How To..., Machine learning, What Now? How to get started Tagged With: Cloudera, How To, josh wills, Machine Learning

Go Back

Footer

Starbridge Partners LLC

Starbridge Partners is a specialty executive search firm focused entirely on data science recruiting  for our clients.

office: (646) 535-9533

Subscribe For Free

Enter your email address to subscribe to this blog and receive notifications of new posts by email.

Important Links

  • Contact Us
  • About Us
  • View Job Listings

Copyright © 2021 Starbridge Partners

Small Business Web Design - JSMT Media