Ensure your BigQuery tables have reliable data with Dataform · Reliable, accurate data is the foundation of data-driven decision-making. Poor data...
Building data pipelines in Dataform · Dataform is a tool that creates data pipelines using SQL. If you’re familiar with Dbt, Dataform is probably best...
Sometimes it so happens that we create or are using a table with data that is non-partitioned but we need to convert this into a partitioned table. A...
Cloud Run helps you deploy containerized workloads at scale. Using it as a backend for web server use cases is quite well known, but it also can be...
Concise list of ways to process nested data · This post attempts to document a small part of a Data Engineer's workflow along with some techniques that...
Demystifying how Parquet works internally · After a lot of theory it's finally to talk about the code. Since there is a lot going on in the codebase,...