Wednesday, August 22, 2012
04:15 PM - 04:45 PM
Level: Technical - Introductory
Processing terabytes of data daily has become a common need.
Unless you have already implemented architectures that use NoSQL databases and frameworks for data-intensive distributed applications, the myriad options are probably something of an enigma.
This workshop, presented as a case study, will prepare you to make a NoSQL column-based database like Cassandra work together with a data-intensive distributed framework like Hadoop.
• The problem: the need to collect, process, and store 5 TB of data daily
• The solution: A combination of Cassandra, Hadoop and some MySQL
• Best practices for modeling your data in Cassandra
• Best practices for scaling Hadoop Clusters
• Lessons learned about Cassandra’s strong and weak areas
• Lessons learned from implementing Map/Reduce
• Anti-Patterns (things to avoid)
Renat Khasanshyn is CEO of Altoros Systems and Venture Partner at Runa Capital. Renat is a subject matter expert on open source database infrastructure software. Most recently, Renat scaled an idea called "Altoros" into a global software services organization with 200+ engineers in Belarus, the US, the UK, Denmark, and Norway. In the past, Mr. Khasanshyn was selected as a finalist for the 2007 Emerging Executive of the Year award by the Massachusetts Technology Leadership Council and won the 2007 IBM Business Mashup Challenge. In 2006, Mr. Khasanshyn founded Apatar, the world's first on-demand, open source data mashup software company. Prior to founding Altoros in 2001, Renat was VP of Engineering for Tampa-based insurance company PriMed, Inc. Renat has a passion for emerging technologies. Mr. Khasanshyn is a founder of the Silicon Valley NewSQL User Group and a co-founder of the Belarusian Java User Group, and he studied engineering at the Belarusian National Technical University.