Here is Martin Odersky, the creator of Scala – with his Big Data Scala By The Bay keynote. Discussing “How Spark is a logical extension of Scala. Read More / Watch Video
“Spark Summit Europe 2015” wrapped up yesterday in Amsterdam. Here’s a couple of the top videos from the last few days. Juliet Houghland, a Data Scientist from Cloudera talks about the client-side need/demand for PySpark, and Aaron Davidson from Databricks talks about some more recent problems he sees emerging. Watch Videos
You can always tell when you are listening to someone who knows what they are talking about. In this Edu-video, Charles Martin, an excellent Chief Scientist with over 15 years of ground breaking data science experience with top firms, talks to a class at Cal Berkeley’s Haas Business School. Given the audience, it’s less techie and very practical in nature. Enjoy! Read More
Happy Monday to All. Here are a set of compelling articles, videos, webinars on Data Science & Big Data Analytics. Also see the Top 5 jobs in Data Science that we are working vigorously to fill at our clients. Read More
Here are few sets of interesting articles, videos, and our top jobs in Data Science & Analytics that stood out this week. Read More
So what is HBase? Yes, it’s the Bigtable-like structured storage for Hadoop HDFS, but how exactly does it work? What is the architecture? When is a good time to use it and when is not? This post will help inform those questions. More…
This guide by Robert Schneider, who also wrote Hadoop for Dummies, created this guide give you everything you need to know about choosing the right Hadoop distribution For Production Read More
MIT’s Adjunct Professor Michael Stonebraker, discusses the future of Hadoop. Hadoop is technically the open source version of MapReduce, created by Google/Yahoo.
Google has long since dropped MapReduce in favor of better solutions, so will the market follow suit?Read More
EARLY RELEASE Available Now via O’Reilly books. Print Copy not out until next Spring.
Get expert guidance on architecting end-to-end data management solutions with Apache Hadoop. While many sources explain how to use various components in the Hadoop ecosystem, this practical book takes you through architectural considerations necessary to tie those components together into a complete tailored application, based on your particular use case. Read More
Watch this pre-recorded webinar to learn what Machine Learning is, why you should use machine learning algorithms, what the common challenges of machine learning are, and how Cloudera’s enterprise data hub supports machine learning. More…