Metadata services can lead to performance and organizational improvements

[A version of this post appears on the O’Reilly Radar.]

The O’Reilly Data Show podcast: Joe Hellerstein on data wrangling, distributed systems, and metadata services.

In this episode of the O’Reilly Data Show, I spoke with one of the most popular speakers at Strata + Hadoop World: Joe Hellerstein, Professor of Computer Science at UC Berkeley and co-founder/CSO of Trifacta. We talked about his past and current academic research (which spans HCI, databases, and systems), data wrangling, large-scale distributed systems, and his recent work on metadata services.

Data wrangling and preparation

Coordination and consistency in distributed systems

Vendor-neutral metadata services

As Hellerstein discussed, the use cases for metadata are varied and evolving. Some key uses of metadata and metadata stores include interpreting data, tracking data usage by multiple users, and surfacing patterns and associations among many data sets.

Use cases for metadata. Source: Joe Hellerstein, used with permission.

Joe Hellerstein will speak about metadata services and data wrangling at Strata + Hadoop World in San Jose this March.
