I hosted a series of webcasts on data science and data engineering for O’Reilly:
- Alexander Ulanov, Distributed Deep Learning on Spark (2016-06-15)
- Evan Sparks, KeystoneML: Optimized large-scale machine-learning pipelines on Apache Spark (2016-05-17)
- Evan Chan and Helena Edelson, Fast and Simplified Streaming, Ad-Hoc and Batch Analytics with FiloDB and Spark Streaming (2016-03-16)
- Natalino Busa, How to build an anomaly detection engine with Spark, Akka and Cassandra (2015-12-09)
- Haoyuan Li and Shaoshan Liu, Introduction to Tachyon and a deep dive into Baidu’s production use case (2015-09-14)
- Josh Rosen, Deep Dive into Project Tungsten: Bringing Apache Spark Closer to Bare Metal (2015-09-03)
- Patrick Wendell, Apache Spark 1.4 (2015-07-08)
- Olivier Grisel and Andreas Mueller, News from Scikit-Learn 0.16 and Soon-To-Be Gems for the Next Release (2015-04-02)
- Kay Ousterhout, Making Sense of Spark Performance (2015-04-01)
- Patrick Wendell, Spark 1.3 and Spark’s New Dataframe API (2015-03-25)
- Adam Marcus, Crowdsourcing at GoDaddy: How I Learned to Stop Worrying and Love the Crowd (2015-01-22)
- Patrick Wendell, Apache Spark 1.2 and Beyond! (2015-01-13)
- Andreas Antonopoulos, Bitcoin and the Future of Money (2014-12-17)
- Kieren James-Lubin, The Future of Bitcoin: A Data-Driven Perspective (2014-12-03)
- Sameer Farooqui, Spark + Cassandra: Technical Integration Details (2014-11-12)
- Mark Harwood, Revealing the Uncommonly Common with Elasticsearch (2014-10-30)
- Patrick Wendell, Apache Spark 1.1 and Beyond! (2014-10-02)
- Chuck Yarbrough, Building a Data Refinery (2014-09-23); this webcast was sponsored by Pentaho
- Lukas Biewald, Real-world Active Learning (2014-08-21)
- Olivier Grisel, What’s New in Scikit-learn 0.15 and What’s Cooking in the Development Branch? (2014-08-13)
- Alex Bordei, Getting the Most Out of Your NoSQL DB: Best Practices for Optimizing Infrastructure Performance and Budget (2014-08-07)
- Pete Warden, How to Get Started with Deep Learning in Computer Vision (2014-07-24)
- Jodok Batlogg, Super Simple Real Time Big Data Backend: Crate Data (2014-07-08)
- Alice Zheng, Scalable Data Science on a Laptop (2014-06-24)
- Mikio Braun, Data Analysis on Streams (2014-06-12)
- Jay Kreps, I ♥ Logs: Apache Kafka and Real-time Data Integration (2014-05-21)
- Michael Armbrust, Performing Advanced Analytics on Relational Data with Spark SQL (2014-04-29)