From PyData Amsterdam, April 2017. Giovanni Lanzani gives a talk on the Data Science Process and where things can go wrong.
Kaggle is a community of almost 450K data scientists who have built nearly 2 million machine learning models to participate in its competitions. Data scientists come to Kaggle to learn, collaborate and develop the state of the art in machine learning. This talk will cover some of the lessons on winning techniques we have learned from the Kaggle community. Watch Video
Apache Spark’s popularity as part of big data analytics solutions is exploding. Spark is an open-source data analytics cluster computing framework originally developed in the AMPLab at UC Berkeley. Spark fits into the Hadoop open-source community, building on top of the Hadoop Distributed File System (HDFS). However, Spark promises performance up to 100 times faster than Hadoop MapReduce for certain applications…and that’s why you should care!
Spark’s in-memory cluster computing is very well suited to machine learning algorithms. These Videos will give you a nice introduction to Spark, how it’s being used in business and why you should care…Watch Spark Videos…
Read about 10 Big Data Case Studies | by NATHAN GOLIA, CHRIS MCMAHON
These 10 insurance companies developed cross-enterprise big data strategies, hired the right data scientists and staff members, and delivered impressive results. READ MORE
What’s Hot in Data Science? Well, there has been a lot of talk recently about DEEP-LEARNING, a subset of Machine Learning, which allows machines to classify what they perceive.
Adam Gibson (Data Scientist and Co-Founder, Blix.io) presents his open-source, distributed deep-learning framework, Deeplearning4j. He demos sentiment analysis and facial recognition tools. If you are using or learning Machine Learning then you should watch this video. Watch Video
Here are 2 researched articles by Randy Bartlett on the value of the Chief Analytics Officer (CAO) role.
The first discusses the benefits of a CAO: “3 Good Reasons You Need a CAO”. He then invested about 50 hours interviewing people for a unique case study illustrating what happens when a corporation loses its CAO: “The Dissolution of an Analytics Team” Here they are…Read More
Learn how Pivotal’s Data Science Team has developed several methods to analyze traffic information from real-time data sources. Using a variety of methods on a massively parallel analytics database system, the team will also demonstrate a traffic disruption model that can predict the duration of recent incidents – learning the disruption patterns of a major city. Watch Webinar Replay
Insurer Bankers Financial wins a CIO 100 award for a new system that helps sales agents generate more accurate quotes that customers are more likely to accept. Read More
Discover the breakthrough tool your company can use to make winning decisions
This forward-thinking book addresses the emergence of predictive business analytics, how it can help redefine the way your organization operates, and many of the misconceptions that impede the adoption of this new management capability. Filled with case examples. Read More
The editors at SearchDataManagement.com have reviewed their most popular Hadoop-related stories of 2013, and taken together, they form a narrative of Hadoop in 2013. The content followed the path of Hadoop and related software tools, such as HBase, as they gained footholds in the enterprise.
Read about the Top 5 Enterprise Hadoop Stories here by Jack Vaughan Read More