Last weekend I took the Lynda course “Understanding Data Science ” with Doug Rose. The class is designed for those that do not intend to be Data Scientists but want to become familiar with the process and terminology. The class has a running time of 1 hour and 16 minutes and covers includes the following […]
VIDEOs From the DataEngConf NYC in November. “Demystifying Deep Learning with Visualizations” (44 min) AND “Python Data Wrangling: Preparing for the Future” (37 min).
(Reposted due to popular demand) Another great video from Josh Wills. Josh is Sr. Director of Data Science at Cloudera and has a gift for making fairly complicated technology explanations very digestible to the novice and intermediary techie.
What I most love about this video is how Josh explains -very clearly – the issue of translating analytics Machine Learning on a large set of data records (many individuals) and making it work well in a “real life” production environment on a single individual (think eCommerce). Watch Video
“Edward”: A library for probabilistic modeling, inference, and criticism Abstract https://www.meetup.com/NYC-Machine-Learning/events/236943279/ http://www.zentation.com/viewer2/webcast/NAPNgdDUBF/Dustin-Tran—Spotify-Talk-(2017-01-19) Probabilistic modeling is a powerful approach for analyzing empirical information. In this talk, I will provide an overview of Edward, a software library for probabilistic modeling. Formally, Edward is a probabilistic programming system built on computational graphs,supporting compositions of both models and inference […]
The University of Illinois’ Coordinated Science Laboratory had it student run conference last month. This video is Dr. Andy Feng – VP Architecture at Yahoo! He leads the architecture and design of big data and machine learning initiatives. “In this talk, we illustrate Yahoo use cases and datasets, and explain the evolution of big-data technology stack.” Watch Video
“This list of 500+ was started in 2012, updated in 2014 and also very recently according to the author. It was compiled by 101.datascience.community, and broken down by degree (master / bachelor / certificate / doctorate) and location (online / on-site.)” – Source Data Science Central Read More
In this episode of the O’Reilly Data Show, O’Reilly’s online managing editor Jenn Webb speaks with Natalino Busa on the topic of predictive analytics, the challenges of feature engineering, and a new class of techniques that is enabling features to emerge from patterns within the data.
They also discuss the relationship between predictive techniques and high-quality microservices, and how machine learning is being used to improve financial services. Listen to Podcast
In this Video by Ryan Compton, Head of Data Science at Clarifai, talks about using convolutional neural networks to deal with the problem of nudity detection. Watch Video
In this video, Tyler Akida presents a whirlwind tour of the evolution of massive-scale data processing at Google, from the original MapReduce paradigm to the high-level pipelines of Flume to the streaming approach of MillWheel to the portable, unified streaming/batch model of Google Cloud Dataflow and Apache Beam (incubating).
Tyler also highlights similarities and differences with related open source systems such as Flink, Spark, Storm, and Gearpump, calling out ways in which they’re converging on and diverging from the Beam model and what that means when running Beam pipelines on their respective runners. Watch Video
At the Machine Learning Meetup in NYC, Dan Melamed gave a machine learning talk titled: “How To Learn From What Your Users Might Not See”. This talk will focus on contextual bandits and their applications.
In this tutorial, Dan will show how to learn from such data in a principled, efficient, and unbiased manner. The techniques that he will describe were largely responsible for a click-thru rate gain of over 25% on MSN.com. Watch Video