a walk-thru of the infamous small files problem for hadoop coupled with a unique problem with inodes usage for mass quantities of extremely small files on hdfs
Tag Archives: hdfs
how do i load a fixed-width formatted file into hive? (with a little help from pig)
presents a couple of options for converting a fixed-width formatted file a a delimited one to prepare it to be exposed as a hive table