Speak at Ray Summit 2025

One of my favorite conferences takes place November 3-5 in San Francisco! This year’s conference spotlights the critical layers of AI development: open source infrastructure, multimodal data, post-training optimization, and scalable ML platforms, highlighted by a full track dedicated to vLLM. This is the definitive gathering for the community of builders, operators, and innovators shapingContinue reading “Speak at Ray Summit 2025”

Notes from the 2020 Spark+AI Summit

I recently live-tweeted some of the keynotes from the 2020 #SparkAISummit and I collected the series of Twitter Threads in this short post: Matei Zaharia on Spark 3.0 Ali Ghodsi on Lakehouses Reynold Xin on Delta Engine and Photon Clemens Mewald on the Data Science Workspace Matei Zaharia on MLflow Kim Hazelwood on deep learningContinue reading “Notes from the 2020 Spark+AI Summit”

Time-turner: Strata San Jose 2017, day 2

There are so many good talks happening at the same time that it’s impossible to not miss out on good sessions. But imagine I had a time-turner necklace and could actually “attend” 3 (maybe 5) sessions happening at the same time. Taking into account my current personal interests and tastes, here’s how my day wouldContinue reading “Time-turner: Strata San Jose 2017, day 2”

Time-turner: Strata San Jose 2017, day 1

There are so many good talks happening at the same time that it’s impossible to not miss out on good sessions. But imagine I had a time-turner necklace and could actually “attend” 3 (maybe 5) sessions happening at the same time. Taking into account my current personal interests and tastes, here’s how my day wouldContinue reading “Time-turner: Strata San Jose 2017, day 1”

Building the next-generation big data analytics stack

[A version of this post appears on the O’Reilly Radar.] The O’Reilly Data Show Podcast: Michael Franklin on the lasting legacy of AMPLab. Subscribe to the O’Reilly Data Show Podcast to explore the opportunities and techniques driving big data, data science, and AI. Find us on Stitcher, TuneIn, iTunes, SoundCloud, RSS. In this episode IContinue reading “Building the next-generation big data analytics stack”

Strata NYC 2016 is next week

1/ We have great keynotes next week at #StrataHadoop NYC🗽 Here are a few to watch for: — Ben Lorica (@bigdata) September 21, 2016 2/ .@Pagankennedy author of Inventology ⇢ The art and science of serendipity #StrataHadoop NYC🗽 https://t.co/fJMitfeFk7 — Ben Lorica (@bigdata) September 21, 2016 3/ Jill Lepore of @Harvard_History & @NewYorker ⇢ TheContinue reading “Strata NYC 2016 is next week”

Hardcore Data Science, London 2016

The first edition of Hardcore Data Science in Europe featured 12 outstanding sessions. My co-organizer, Angie Ma of ASI deserves much of the credit for recruiting many of the speakers. We had talks on machine learning techniques (deep / transfer / reinforcement / ensemble / semisupervised) applied to a variety of data sets (images, text,Continue reading “Hardcore Data Science, London 2016”

Time-turner: Strata San Jose 2016, day 2

There are so many good talks happening at the same time that it’s impossible to not miss out on good sessions. But imagine I had a time-turner necklace and could actually “attend” 3 (maybe 5) sessions happening at the same time. Taking into account my current personal interests and tastes, here’s how my day wouldContinue reading “Time-turner: Strata San Jose 2016, day 2”