One Simple Chart: where do consumers prefer AI data be processed

With machine learning and AI being embedded in a growing number of products and systems, privacy and security become central for users and companies. Every company now has a Privacy Policy to comply with regulations like GDPR and CCPA. And in the not-so-distant future, companies will have teams focused on managing risks stemming from data…

Machine learning on encrypted data

[A version of this post appears on the O’Reilly Radar.] The O’Reilly Data Show Podcast: Alon Kaufman on the interplay between machine learning, encryption, and security. In this episode of the Data Show, I spoke with Alon Kaufman, CEO and co-founder of Duality Technologies, a startup building tools that will allow companies to apply analytics…

How machine learning can be used to write more secure computer programs

[A version of this post appears on the O’Reilly Radar.] The O’Reilly Data Show Podcast: Fabian Yamaguchi on the potential of using large-scale analytics on graph representations of code. In this episode of the Data Show, I spoke with Fabian Yamaguchi, chief scientist at ShiftLeft. His 2015 Ph.D. dissertation sketched out how the combination of…

Building machine learning solutions that can withstand adversarial attacks

[A version of this post appears on the O’Reilly Radar.] The O’Reilly Data Show Podcast: Parvez Ahammad on minimal supervision, and the importance of explainability, interpretability, and security.

Using Apache Spark to predict attack vectors among billions of users and trillions of events

[A version of this post appears on the O’Reilly Radar.] The O’Reilly Data Show Podcast: Fang Yu on data science in security, unsupervised learning, and Apache Spark. In this episode of…

Analytic engines that factor in security labels

[A version of this post appears on the O’Reilly Strata blog.] Originated by the NSA, Apache Accumulo is a BigTable-inspired data store known for being highly scalable and for its interesting security model. Federal agencies and defense contractors have deployed Accumulo on clusters of a thousand or more servers. It also uses “cell-level” security…