The quest for high-quality data

Machine learning solutions for data integration, cleaning, and data generation are beginning to emerge. By Ihab Ilyas and Ben Lorica. “AI starts with ‘good’ data” is a statement that receives wide agreement from data scientists, analysts, and business owners. There has been a significant increase in our ability to build complex AI models for predictions,…

Applications of data science and machine learning in financial services

The O’Reilly Data Show Podcast: Jike Chong on the many exciting opportunities for data professionals in the U.S. and China. In this episode of the Data Show, I spoke with Jike Chong, chief data scientist at Acorns, a startup focused on building tools for micro-investing. Chong has extensive experience using analytics and machine learning in…

Becoming a machine learning company means investing in foundational technologies

Companies successfully adopt machine learning either by building on existing data products and services, or by modernizing existing models and algorithms. In this post, I share slides and notes from a keynote I gave at the Strata Data Conference in London earlier this year. I will highlight the results of a recent survey on machine…

How AI and machine learning are improving customer experience

From data quality to personalization, to customer acquisition and retention, and beyond, AI and ML will shape the customer experience of the future. By Ben Lorica and Mike Loukides. What can artificial intelligence (AI) and machine learning (ML) do to improve customer experience? AI and ML already have been intimately involved in online shopping since, well,…

Real-time entity resolution made accessible

The O’Reilly Data Show Podcast: Jeff Jonas on the evolution of entity resolution technologies. In this episode of the Data Show, I spoke with Jeff Jonas, CEO, founder and chief scientist of Senzing, a startup focused on making real-time entity resolution technologies broadly accessible. He was previously a fellow and chief scientist of context computing…

What data scientists and data engineers can do with current generation serverless technologies

The O’Reilly Data Show Podcast: Avner Braverman on what’s missing from serverless today and what users should expect in the near future. In this episode of the Data Show, I spoke with Avner Braverman, co-founder and CEO of Binaris, a startup that aims to bring serverless to web-scale and enterprise applications. This conversation took place…

Specialized tools for machine learning development and model governance are becoming essential

Why companies are turning to specialized machine learning tools like MLflow. By Ben Lorica and Mike Loukides. A few years ago, we started publishing articles (see “Related resources” at the end of this post) on the challenges facing data teams as they start taking on more machine learning (ML) projects. Along the way, we described a…

It’s time for data scientists to collaborate with researchers in other disciplines

The O’Reilly Data Show Podcast: Forough Poursabzi-Sangdeh on the interdisciplinary nature of interpretable and interactive machine learning. In this episode of the Data Show, I spoke with Forough Poursabzi-Sangdeh, a postdoctoral researcher at Microsoft Research New York City. Poursabzi works in the interdisciplinary area of interpretable and interactive machine learning. As models and algorithms…

Algorithms are shaping our lives – here’s how we wrest back control

[A version of this post appears on the O’Reilly Radar.] The O’Reilly Data Show Podcast: Kartik Hosanagar on the growing power and sophistication of algorithms. In this episode of the Data Show, I spoke with Kartik Hosanagar, professor of technology and digital business, and professor of marketing at The Wharton School of the University of…

You created a machine learning application. Now make sure it’s secure.

[A version of this post appears on the O’Reilly Radar.] The software industry has demonstrated, all too clearly, what happens when you don’t pay attention to security. By Ben Lorica and Mike Loukides. In a recent post, we described what it would take to build a sustainable machine learning practice. By “sustainable,” we mean projects…