A survey of the landscape shows the types of tools remain the same, but interfaces continue to improve. [A version of this post appears on the O’Reilly Radar.] As data projects become complex and as data teams grow in size, individuals and organizations need tools to efficiently manage data projects. A while back, I wroteContinue reading “More tools for managing and reproducing complex data projects”
Monthly Archives: April 2015
The Spark Spot at Spark Summit East 2015 – NYC
I had a series of conversations with a few lead developers and users of Apache Spark, at the recent Spark Summit NYC conference. You can view them on Youtube or using the player below:
Coming full circle with Bigtable and HBase
The O’Reilly Data Show Podcast: Michael Stack on HBase past, present, and future. [A version of this post appears on the O’Reilly Radar.] Subscribe to the O’Reilly Data Show to explore the opportunities and techniques driving big data and data science. At least once a year, I sit down with Michael Stack, engineer at Cloudera,Continue reading “Coming full circle with Bigtable and HBase”
Spark Summit East panel
Here’s a video of our well-received panel from a few weeks ago, featuring Abhishek Mehta , Tresata – Founder & CEO George Mathew , Alteryx – President & COO Patrick Wendell , Databricks – Co-founder Martin Van Ryswyk , DataStax – VP of Engineering
Building big data systems in academia and industry
[A version of this post appears on the O’Reilly Radar blog.] The O’Reilly Data Show Podcast: Mikio Braun on stream processing, academic research, and training. Mikio Braun is a machine learning researcher who also enjoys software engineering. We first met when he co-founded a real-time analytics company called streamdrill. Since then, I’ve always had greatContinue reading “Building big data systems in academia and industry”
A real-time processing revival
[A version of this post appears on the O’Reilly Radar blog.] Things are moving fast in the stream processing world. There’s renewed interest in stream processing and analytics. I write this based on some data points (attendance in webcasts and conference sessions; a recent meetup), and many conversations with technologists, startup founders, and investors. Certainly,Continue reading “A real-time processing revival”