a quick tutorial on finding and using the orc java tool jar to peer into the contents of the otherwise non-human-readable orc file format
Tag Archives: hdp
topology supervision features of streaming frameworks (or lack thereof)
a smackdown of sorts pitting kafka streams, spark streaming, and storm against each other, not for the features they give developers, but for the features they offer the operations side of the devops formula
how do i load a fixed-width formatted file into hive? (with a little help from pig)
presents a couple of options for converting a fixed-width formatted file into a delimited one to prepare it to be exposed as a hive table
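to give a rough feel for what that conversion involves, here is a minimal python sketch. it is not the pig-based approach the post itself describes. the column offsets in `FIELD_SPANS`, the pipe delimiter, and both function names are assumptions made up purely for illustration.

```python
from typing import Iterable, List, Sequence, Tuple

# Hypothetical layout: (start, end) character offsets of each fixed-width column.
FIELD_SPANS: List[Tuple[int, int]] = [(0, 5), (5, 15), (15, 18)]


def fixed_width_to_delimited(
    line: str,
    spans: Sequence[Tuple[int, int]] = FIELD_SPANS,
    delimiter: str = "|",
) -> str:
    """Slice one fixed-width record into fields and join them with a delimiter."""
    # Padding inside each column is stripped so the output holds only the values.
    fields = [line[start:end].strip() for start, end in spans]
    return delimiter.join(fields)


def convert_lines(
    lines: Iterable[str],
    spans: Sequence[Tuple[int, int]] = FIELD_SPANS,
    delimiter: str = "|",
) -> List[str]:
    """Convert every non-blank record, dropping trailing newlines first."""
    return [
        fixed_width_to_delimited(line.rstrip("\n"), spans, delimiter)
        for line in lines
        if line.strip()
    ]
```

the delimited output could then be placed behind a hive table whose row format declares the same field delimiter; choose a delimiter that never appears in the data itself.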