This is the slide deck for the talk I gave at SouthWest Big Data (UK) and Hadoop Summit EMEA 2013.
Video is at: http://www.youtube.com/watch?v=Hw-V7-T3GmE
Goals: - Fix the performance - Make the system operationally sound
Goals: - Corporate decision to switch to Linux - Start prep for security
we use cobbler to control our kickstart installs. key features: * template engine * snippet system * RPM repo sync * both command line and programmable APIs * and, most importantly, great support for a “netboot always” environment. This means that we always have our hosts boot from the network and, if that fails, local disk. We generally always re-install the machine after a disk failure so that we can start it from a clean slate, cleaning any excess cruft and restoring any host specific parts like Kerberos keytabs. What may be surprising is that our kickstart environment serves primarily to do three things: * partition disks * get enough of the OS installed to troubleshoot a broken kickstart * bootstrap our configuration management tool
Born out of the HPC community in 2004 Python BSD License Love the community Works with everything, not just the Hadoop ecosystem Services based methodology with conflict resolution Awesome reporting engine
Goals: - Deploy secure Hadoop - Reduce user friction
A talk in and of itself Highlights: - another cultural shift - finding many bugs in what was considered stable code - forking the kerberos web filter due to poor code quality