Hadoop is an Open Source Apache project that provides a framework for running applications on large clusters of commodity hardware. The Hadoop framework transparently provides applications both reliability and data motion, making it easy to write applications that process hundreds of terabytes of data efficiently. Hadoop provides both a distributed file system, similar to GFS, and Map/Reduce for managing distributed applications. O'Malley will cover the motivation and approach for Hadoop, an overview of the components and architecture, and an overview of how Hadoop is used by Yahoo! on 17,000 computers to store and process petabytes of data.
Owen O'Malley is a Software Architect in Yahoo's Grid Computing team and is the chair of the Program Management Committee for Apache Hadoop. He has been a Hadoop committer since March of 2006 and more than 200 of his patches have been committed to Hadoop. Before working on Hadoop, he worked on Yahoo Search's Webmap that builds and analyzes the graph of the World Wide Web. Prior to Yahoo, he worked at NASA Ames Research Center on software model checking and at Sun working on a distributed version control system. He received his PhD in Software Engineering from University of California, Irvine.
Home page: http://people.apache.org/~omalley
email: [email protected]
Free for eBig Members, Guests $10.
Official Website: http://www.ebig.org
Added by FullCalendar on September 26, 2008