#apache-iceberg

[ follow ]
Business intelligence
fromInfoWorld
18 hours ago

Why observability needs Apache Iceberg

Iceberg enables storing logs, metrics, and traces in the same lakehouse as business data, allowing SQL-based telemetry exploration without costly data transfers.
fromTechzine Global
1 week ago

Fabric gets real-time data mirroring from Oracle and BigQuery

Fabric was launched in 2023 as a unified cloud platform for data and analytics. Later that same year, mirroring was added, a feature that allows data from existing warehouses and databases to be added and managed without complex ETL processes or self-built data pipelines. With the latest update, organizations can replicate a snapshot of Oracle and BigQuery databases to OneLake, the lakehouse system within Fabric, where the copies remain synchronized with the source databases in near real time.
Data science
fromInfoQ
2 months ago

Building Reproducible ML Systems with Apache Iceberg and SparkSQL: Open Source Foundations

Time travel in Apache Iceberg allows users to precisely identify which data snapshot produced exceptional results, eliminating the need to sift through production logs.
Data science
fromDevOps.com
2 months ago

StarTree Bridges the Lakehouse Gap: Serving Apache Iceberg Data Directly to Applications - DevOps.com

This introduces latency, complexity and what we call 'bloat,' explains Chad Meley, SVP of Marketing at StarTree. We're collapsing that serving and query layer into one piece of the puzzle, significantly reducing the bloat and simplifying that architecture.
Data science
E-Commerce
fromInfoQ
2 months ago

Amazon S3 Adds Sort and Z-Order Compaction to Improve Apache Iceberg Query Performance

Amazon S3 now supports sort and z-order compaction for Apache Iceberg tables to improve query performance and reduce costs.
Business intelligence
fromTheregister
5 months ago

Delta Lake and Iceberg communities collide - in a good way

Databricks collaborates on the Iceberg table format, enhancing interoperability with its own Delta Lake format, aiming to optimize analytics performance.
[ Load more ]