text mining Archives - Gradient Flow

Building a natural language processing library for Apache Spark

[A version of this post appears on the O’Reilly Radar.] The O’Reilly Data Show Podcast: David Talby on a new NLP library for Spark, and why model development starts after a model gets deployed to production. When I first discovered and started using Apache Spark, a majority of the use cases I used it forContinue reading “Building a natural language processing library for Apache Spark”

Language understanding remains one of AI’s grand challenges

[A version of this post appears on the O’Reilly Radar.] The O’Reilly Data Show Podcast: David Ferrucci on the evolution of AI systems for language understanding. Subscribe to the O’Reilly Data Show Podcast to explore the opportunities and techniques driving big data, data science, and AI. Find us on Stitcher, TuneIn, iTunes, SoundCloud, RSS. InContinue reading “Language understanding remains one of AI’s grand challenges”

From search to distributed computing to large-scale information extraction

Subscribe to the O’Reilly Data Show Podcast to explore the opportunities and techniques driving big data and data science. February 2016 marks the 10th anniversary of Hadoop — at a point in time when many IT organizations actively use Hadoop, and/or one of the open source, big data projects that originated after, and in someContinue reading “From search to distributed computing to large-scale information extraction”

Topic Models: Past, Present, Future

[A version of this post appears on the O’Reilly Radar blog.] The O’Reilly Data Show Podcast: David Blei, co-creator of one of the most popular tools in text mining and machine learning. I don’t remember when I first came across topic models, but I do remember being an early proponent of them in industry. IContinue reading “Topic Models: Past, Present, Future”