Marco will be presenting an overview of the Apache Hadoop open source
project. Hadoop is platform software aimed at solving Big Data problems. It
is two distributed systems that cooperatively work together using the same
set of computers: a distributed file system (HDFS) and a Map-Reduce
framework. We'll go over the strengths and weaknesses of the design choices
made by the development team, go through a brief demo of Hadoop in action,
and discuss how to implement Hadoop grids within your own enterprise. If
time permits, we'll take a sneak peek at two of the new open source projects
to which the Yahoo! Grid Computing team is contributing effort.
Marco is the manager of Yahoo! Grid Services Operations. He has over a dozen
years of experience managing Unix clusters at Walmart.com, Inktomi, GNN (a
subsidiary of AOL) and UC Berkeley. His responsibilities have ranged from
installations supporting over 1 billion transactions per month to systems of
over 1,000 nodes. Marco holds a Bachelor's Degree from the University of
California at Berkeley in Cognitive Science.
One way to put it is that there are many user groups, but we are the sysadmins group. BayLISA includes system and network administrators across a range of skill levels. BayLISA meets to discuss topics of interest to administrators and managers. The meetings are free and open to the public.
BayLISA grew out of an after-hours discussion among attendees of the USENIX LISA IV conference. The idea was to provide a forum for Sysadmin professionals in the San Francisco Bay area to get together and exchange ideas, hear speakers address topics of interest and most importantly, socialize.
Official Website: http://www.baylisa.org