i’ve noticed the is_current_ancestor column of the apache iceberg $history metadata table for a while now – it wasn’t until I got a direct question about it that i realized it was time to find out for sure
Tag Archives: trino
dbt cloud & starburst galaxy workshop (beta testers welcome)
interested in building a data pipeline with dbt cloud and starburst galaxy? if so, then this post presents recorded videos of 7 lab exercises plus the lab guide itself so you work through them on your own & at your pace
z-order (visualized)
when asked to compare sort-by with z-order for data lake tables i realized i finally needed to have a better understanding of what z-order is all about and my goal with this blog post is to present a simplified visualization of what’s going on and how it can help
ibis & trino (dataframe api part deux)
this is a port of the dataframe api code from my original pystarburst posting – this time i implemented the same scenarios with ibis, the portable python dataframe library, and had a blast doing it
viewing astronauts thru windows (more pystarburst examples)
i’ve got a fever and the only prescription is more pystarburst examples — this third installment is all about window functions via the dataframe api and like before, I present sql first for comparison
pystarburst analytics examples (querying aviation data part deux)
i had so much fun publishing my first pystarburst post and running it in starburst galaxy that i wanted to share some more examples – i ported my aviation dataset analytical queries to python and the dataframe api
hive acid transactions work on trino (can even update a partitioned column)
it seems that folks who haven’t used hive in production are always quick to say that hive doesn’t have classic crud operations, much less the merge statement, and that simply isn’t true – this post shows you that you can create a hive acid table and mutate its contents with trino
configuring the cache service (starburst enterprise)
showcasing a video walk-through of configuring and validating the caching service for starburst enterprise which enables table scan redirection, materialized views, and data products
pystarburst (the dataframe api)
the dataframe api is finally available for trino and starburst galaxy thanks to the pystarburst libraries — take a peek at some example usages in this quick validation run
building a sql-based data pipeline with trino & starburst (5 slick videos)
a collection of videos presented as an overview of how you could build a sql-based data transformation pipeline utilizing trino/starburst and automating it with dbt