Videos tagged with MapReduce


Velocity 09: Richard Crowley, "Building OpenDNS Stats"

Velocity 09: Richard Crowley, "Building OpenDNS Stats"

Posted in Conferences, Databases, Development, Networking, MySQL, PHP, C and C++, DNS

The old OpenDNS Stats system was built when we were doing 1 billion queries a day and had far outlived its usefulness. Playing hot potato with load on overworked servers all struggling to keep up gets old after a while, doesn’t it? This gave me the opportunity to start from a blank slate and build the system we need to serve us at 8 billion queries a day and scale to 16 or 24 billion. We ...

Tags: Networking, DNS, Conferences, PHP, Databases, MySQL, Scalability, MapReduce, C and C++, Performance, Development, ...



Research and Education in the Clouds: Experience at the Univ

Research and Education in the Clouds: Experience at the Univ

Posted in Conferences, Companies, Development, Techtalks, Google

Hadoop, the open-source implementation of MapReduce, provides unprecedented opportunities for both research and education. On the research side, academics are now able to explore problems at scale with modest resource investments, either with inexpensive commodity clusters or through utility computing services. On the education side, Hadoop provides a nice vehicle to teach students about large-...

Tags: Techtalks, Google, Conferences, Scalability, MapReduce, Cloud Computing, Google Tech Talks, Hadoop, Development, Companies


Application Informed Tuning of Virtualized Environments

Application Informed Tuning of Virtualized Environments

Posted in Conferences, Technologies, Companies, Techtalks, Google

Virtualization is currently being used in cloud computing environments and traditional IT environments to improve the flexibility and manageability of the computing infrastructure, and to enable the sharing of computing resources. This means that applications (such as database systems) are increasingly being run on virtual machines and using virtualized storage. The performance of an applicatio...

Tags: Techtalks, Google, Conferences, Technologies, MapReduce, Google Tech Talks, Virtualization, Companies, virtualized, environments, storage, ...







Cloudera Hadoop Training #6: Introduction to Hive

Cloudera Hadoop Training #6: Introduction to Hive

Posted in Development, Broadcasting, Screencasts

  Hive is a powerful data warehousing application built on top of Hadoop which allows you to use SQL to access your data. This lecture will give an overview of Hive and the query language. This lecture includes a work-along exercise. The next video is a screencast of a user performing this exercise.  

Tags: Scalability, apache, SQL, MapReduce, Hadoop, training, Cloudera, Bigdata, Hdfs, Hive, Cloudera Hadoop Training, ...


Cloudera Hadoop Training #5: MapReduce Algorithms

Cloudera Hadoop Training #5: MapReduce Algorithms

Posted in Development, Broadcasting, Screencasts

  After we've introduced you to the tools, it's time to learn how to use them efficiently. Algorithms designed for running on MapReduce look a little different than those you've written before. We'll introduce you to some widely-used algorithms, common idioms to use when designing your own, and techniques for implementing these in Java MapReduce and scripting languages via HadoopStreaming....

Tags: Scalability, apache, MapReduce, Hadoop, training, Cloudera, Bigdata, Hdfs, Algorithms, Cloudera Hadoop Training