Cloud Run helps you deploy containerized workloads at scale. Using it as a backend for web server use cases is quite well known, but it can also be...
Concise list of ways to process nested data · This post attempts to document a small part of a Data Engineer's workflow along with some techniques that...
Demystifying how Parquet works internally · After a lot of theory, it's finally time to talk about the code. Since there is a lot going on in the codebase,...
Demystifying how Parquet works internally · Previously, we talked about how Parquet writes data. In this article, we will talk about how Parquet reads...
Using S3 and Athena is great for data storage and retrieval using queries. But when I first started using it, one common problem that came up fairly...
Demystifying how Parquet works internally · In this article we will get into how data is written in Parquet format. To do that, one of the first things we...