Videos tagged with Hadoop
Adrian Cole discusses his jclouds project, an open source library that helps Java developers get started in the cloud and reuse their Java development skills. Cole also talks about some of the challenges of creating a cloud-agnostic library, such as the use of different hypervisors and the fact that various cloud implementations are written in different languages, such as VB, Python, Ru...
Research and Education in the Clouds: Experience at the Univ
Hadoop, the open-source implementation of MapReduce, provides unprecedented opportunities for both research and education. On the research side, academics are now able to explore problems at scale with modest resource investments, either with inexpensive commodity clusters or through utility computing services. On the education side, Hadoop provides a nice vehicle to teach students about large-...
Cloudera Hadoop Training #10: Pig Tutorial Screencast
This tutorial is a screencast that shows you exactly how to work with Pig. We'll walk through the examples discussed in the Pig Introduction.
Cloudera Hadoop Training #9: Introduction to Pig
Pig is a platform for analyzing large data sets that consists of a high-level language for expressing data analysis programs, coupled with infrastructure for evaluating these programs. This lecture includes a work-along exercise.
Cloudera Hadoop Training #8: Eclipse for Hadoop Development
A screencast illustrating how to configure Eclipse for Hadoop Development.
Cloudera Hadoop Training #7: Hive Tutorial Screencast
This tutorial is a screencast that shows you exactly how to work with Hive. We'll walk through importing data, creating tables, and making queries.
Cloudera Hadoop Training #6: Introduction to Hive
Hive is a powerful data warehousing application built on top of Hadoop that allows you to use SQL to access your data. This lecture gives an overview of Hive and its query language, and includes a work-along exercise. The next video is a screencast of a user performing this exercise.
Cloudera Hadoop Training #5: MapReduce Algorithms
Now that we've introduced you to the tools, it's time to learn how to use them efficiently. Algorithms designed for running on MapReduce look a little different from those you've written before. We'll introduce you to some widely used algorithms, common idioms to use when designing your own, and techniques for implementing these in Java MapReduce and scripting languages via Hadoop Streaming...
Cloudera Hadoop Training #4: Programming with Hadoop
Learn how to get started writing programs against Hadoop's API.
Cloudera Hadoop Training #3: Hadoop Ecosystem Tour
An introduction to other projects surrounding Hadoop, which complete the greater ecosystem of available large-data processing tools.