hadoop – Page 2 – Lester Martin (l11n)

hive acid transactions with partitions (a behind the scenes perspective)

let’s take a deeper look at what happens under the hood of hive on these “acid” activities such as insert, update and delete — including look at the actual directories and orc files created

viewing the content of ORC files (using the Java ORC tool jar)

a quick tutorial about finding and using the orc java tool jar for peering into the contents of the otherwise non humanly readable orc file format

topology supervision features of streaming frameworks (or lack thereof)

a smackdown of sort pitting kafka streams, spark streaming, and storm against each other — not for the features they give developers, but for the features they offer the operations side of the devops formula

presenting at hadoop summit (archiving evolving databases in hive)

overview of, and links to related artifacts for, my presentation at hadoop summit about strategies to handle changing data in hive’s immutable architecture

small files and hadoop’s hdfs (bonus: an inode formula)

a walk-thru of the infamous small files problem for hadoop coupled with a unique problem with inodes usage for mass quantities of extremely small files on hdfs

how do i load a fixed-width formatted file into hive? (with a little help from pig)

presents a couple of options for converting a fixed-width formatted file a a delimited one to prepare it to be exposed as a hive table