Data Science Archives - Page 4 of 22

How social science research can inform the design of AI systems

[A version of this post appears on the O’Reilly Radar.] The O’Reilly Data Show Podcast: Jacob Ward on the interplay between psychology, decision-making, and AI systems. In this episode of the Data Show, I spoke with Jacob Ward, a Berggruen Fellow at Stanford University. Ward has an extensive background in journalism, mainly covering topics in science andContinue reading “How social science research can inform the design of AI systems”

Why it’s hard to design fair machine learning models

[A version of this post appears on the O’Reilly Radar.] The O’Reilly Data Show Podcast: Sharad Goel and Sam Corbett-Davies on the limitations of popular mathematical formalizations of fairness. In this episode of the Data Show, I spoke with Sharad Goel, assistant professor at Stanford, and his student Sam Corbett-Davies. They recently wrote a surveyContinue reading “Why it’s hard to design fair machine learning models”

Building accessible tools for large-scale computation and machine learning

[A version of this post appears on the O’Reilly Radar.] In this episode of the Data Show, I spoke with Eric Jonas, a postdoc in the new Berkeley Center for Computational Imaging. Jonas is also affiliated with UC Berkeley’s RISE Lab. It was at a RISE Lab event that he first announced Pywren, a frameworkContinue reading “Building accessible tools for large-scale computation and machine learning”

How privacy-preserving techniques can lead to more robust machine learning models

[A version of this post appears on the O’Reilly Radar.] The O’Reilly Data Show Podcast: Chang Liu on operations research, and the interplay between differential privacy and machine learning. In this episode of the Data Show, I spoke with Chang Liu, applied research scientist at Georgian Partners. In a previous post, I highlighted early toolsContinue reading “How privacy-preserving techniques can lead to more robust machine learning models”

Data collection and data markets in the age of privacy and machine learning

While models and algorithms garner most of the media coverage, this is a great time to be thinking about building tools focused on data. In this post I share slides and notes from a keynote I gave at the Strata Data Conference in London at the end of May. My goal was to remind the dataContinue reading “Data collection and data markets in the age of privacy and machine learning”

Data regulations and privacy discussions are still in the early stages

[A version of this post appears on the O’Reilly Radar.] The O’Reilly Data Show Podcast: Aurélie Pols on GDPR, ethics, and ePrivacy. In this episode of the Data Show, I spoke with Aurélie Pols of Mind Your Privacy, one of my go-to resources when it comes to data privacy and data ethics. This interview tookContinue reading “Data regulations and privacy discussions are still in the early stages”

Managing risk in machine learning models

[A version of this post appears on the O’Reilly Radar.] The O’Reilly Data Show Podcast: Andrew Burt and Steven Touw on how companies can manage models they cannot fully explain. In this episode of the Data Show, I spoke with Andrew Burt, chief privacy officer at Immuta, and Steven Touw, co-founder and CTO of Immuta.Continue reading “Managing risk in machine learning models”

The real value of data requires a holistic view of the end-to-end data pipeline

[A version of this post appears on the O’Reilly Radar.] The O’Reilly Data Show Podcast: Ashok Srivastava on the emergence of machine learning and AI for enterprise applications. In this episode of the Data Show, I spoke with Ashok Srivastava, senior vice president and chief data officer at Intuit. He has a strong science andContinue reading “The real value of data requires a holistic view of the end-to-end data pipeline”

The evolution of data science, data engineering, and AI

[A version of this post appears on the O’Reilly Radar.] The O’Reilly Data Show Podcast: A special episode to mark the 100th episode. This episode of the Data Showmarks our 100th episode. This podcast stemmed out of video interviews conducted at O’Reilly’s 2014 Foo Camp. We had a collection of friends who were key membersContinue reading “The evolution of data science, data engineering, and AI”

Companies in China are moving quickly to embrace AI technologies

[A version of this post appears on the O’Reilly Radar.] The O’Reilly Data Show Podcast: Jason Dai on the first year of BigDL and AI in China. In this episode of the Data Show, I spoke with Jason Dai, CTO of Big Data Technologies at Intel, and one of my co-chairs for the AI ConferenceContinue reading “Companies in China are moving quickly to embrace AI technologies”