Keep up to date with every new upload!

Join free & follow O'Reilly Design Podcast - O'Re
Share
  • 3 years ago
Resolving transactional access and analytic performance trade-offs

Resolving transactional access and analytic performance trade-offs

The O’Reilly Data Show podcast: Todd Lipcon on hybrid and specialized tools in distributed systems.In recent months, I've been hearing about hybrid systems designed to handle different data management needs. At Strata + Hadoop World NYC last week, Cloudera's Todd Lipcon unveiled an open source storage layer — Kudu — that's good at both table scans (analytics) and random access (updates and inserts). During the latest episode of the O'Reilly Data Show Podcast, I sat down with Lipcon to discuss his new project a few weeks before it was released.HDFS and Hbase

[Hadoop is] more like a file store. It allows you to upload files onto an arbitrarily sized cluster with 20-plus petabytes, in single clusters. The thing is, you can upload the files but you can't edit them in place. To make any change, you have to basically put in a new file. What HBase does in distinction is that it has more of a tabular data model, where you can update and insert individual row-by- row data, and then randomly a

Comments