it seems that folks who haven’t used hive in production are always quick to say that hive doesn’t have classic crud operations, much less the merge statement, and that simply isn’t true – this post shows you that you can create a hive acid table and mutate its contents with trino
Tag Archives: hortonworks
wrapping up my 8 year hortonworks – cloudera adventure (best job ever)
what an amazing eight years at hortonworks/cloudera — the technology, the focus, the use cases, the domains, the FUN and most importantly, the PEOPLE, made this the best job of my entire career and make it super hard to say goodbye to this role
viewing the content of ORC files (using the Java ORC tool jar)
a quick tutorial about finding and using the orc java tool jar for peering into the contents of the otherwise non humanly readable orc file format
topology supervision features of streaming frameworks (or lack thereof)
a smackdown of sort pitting kafka streams, spark streaming, and storm against each other — not for the features they give developers, but for the features they offer the operations side of the devops formula
how do i load a fixed-width formatted file into hive? (with a little help from pig)
presents a couple of options for converting a fixed-width formatted file a a delimited one to prepare it to be exposed as a hive table
visiting the computer history museum (yes, i’m a geek)
pictures and observations from my visit to the computer history museum in palo alto, ca