Mohammad AftabA Comprehensive Guide to File Formats in Data Engineering:Understanding the Pros and Cons of using CSV, JSON, Parquet, Avro, and ORC file format in Data Engineering.Dec 15, 20231Dec 15, 20231
Neeraj KushwahaDeep Dive into Apache Parquet: Efficient Data Storage for AnalyticsIn today’s digital age, the amount of data being generated is growing at an unprecedented rate. This explosion of data has given rise to…Aug 27, 20231Aug 27, 20231
Ankush SinghComparing Data Storage: Parquet vs. ArrowData storage formats have significant implications on how quickly and efficiently we can extract and process data. In today’s blog, we’re…Jun 11, 20233Jun 11, 20233
ahLoading Parquet in PostgreSQL via DuckDB: Testing queries and exploring the CoreIn the realm of data management and analytics, PostgreSQL has long been a popular choice as an open-source relational database management…Nov 5, 20233Nov 5, 20233
InTDS ArchivebyAlon AgmonBoost Your Cloud Data Applications with DuckDB and Iceberg APIUse Iceberg API with DuckDB to optimize analytics queries on massive Iceberg tables in your cloud storageDec 23, 20223Dec 23, 20223
Dipankar MazumdarBuilding a Streamlit app on a Lakehouse using Apache Iceberg & DuckDBDeveloping full-stack data applications can be a massive overhead for data scientists or analysts who would need to spend a considerable…Apr 25, 20232Apr 25, 20232
InBetter ProgrammingbyMarin AglićLearning Apache Iceberg — Storing the Catalog to PostgresAn interesting experiment to see how Apache Iceberg organised the table data and metadata under the hoodMay 17, 20231May 17, 20231