[Wranglers] Apache Iceberg and Parquet now support GEO data types
Ali, Saqib
docbook.xml at gmail.com
Wed Jun 4 22:13:27 UTC 2025
A brief intro to the new geometry and geography (GEO) data type support in
Iceberg v3:
"Similarly, geospatial data — information associated with locations on the
Earth’s surface like roads, parks, or city boundaries — is also hard to
work with and query efficiently. Without geospatial types, customers had to
use binary columns to store geodata locations. However, this representation
did not support geographic searching, since binary columns cannot be
filtered to find objects within a given area. Iceberg v3 solves this
problem by introducing new geometry and geography data types. Geometry
types are for planar spatial data, whereas geography types are for global
data accounting for the curvature of the earth. With these types, customers
easily find data using bounding boxes that represent geographic regions and
efficiently locate geospatial objects."
source:
https://www.databricks.com/blog/apache-icebergtm-v3-moving-ecosystem-towards-unification
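
To make the bounding-box point concrete, here is a minimal Python sketch of
how per-file bounding boxes let a query skip files that cannot contain
matching objects. It is only an illustration of the idea, not Iceberg or
Parquet API: the BBox class, the file names, the coordinates, and the
files_to_scan helper are all made up for this example, and it assumes plain
lon/lat boxes that do not cross the antimeridian.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BBox:
    """Axis-aligned bounding box (hypothetical helper, lon/lat in degrees)."""
    xmin: float
    ymin: float
    xmax: float
    ymax: float

    def intersects(self, other: "BBox") -> bool:
        # Two boxes overlap only if they overlap on both axes.
        # Assumption: neither box wraps around the antimeridian.
        return (
            self.xmin <= other.xmax
            and other.xmin <= self.xmax
            and self.ymin <= other.ymax
            and other.ymin <= self.ymax
        )


# Made-up per-file bounding-box statistics, standing in for the kind of
# metadata a table format could keep for a geometry/geography column.
file_stats = {
    "part-0.parquet": BBox(-123.0, 37.0, -121.0, 38.5),  # San Francisco Bay area
    "part-1.parquet": BBox(-75.0, 40.0, -73.0, 41.5),    # New York area
    "part-2.parquet": BBox(-0.6, 51.2, 0.4, 51.8),       # London area
}


def files_to_scan(query: BBox, stats: dict) -> list:
    """Return only the files whose bounding box overlaps the query region."""
    return [name for name, box in stats.items() if box.intersects(query)]


# Query region: a small box over San Francisco.
query = BBox(-122.6, 37.6, -122.3, 37.9)
candidates = files_to_scan(query, file_stats)
print(candidates)  # ['part-0.parquet']
```

With an opaque binary column there is nothing like file_stats to consult, so
every file has to be read and decoded before any spatial filter can run. The
overlap test only narrows down the candidates; rows in the surviving files
still need an exact spatial check.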
On Sun, Feb 16, 2025 at 8:34 PM Ali, Saqib <docbook.xml at gmail.com> wrote:
> Both Iceberg and Parquet now support geometry and geography (GEO) data
> types:
>
> https://github.com/apache/iceberg/blob/8839c9bf1f1d8c9b718f9766302ff8a2018e515f/format/spec.md?plain=1#L51
>
> https://github.com/apache/parquet-format/blob/94b9d631aef332c78b8f1482fb032743a9c3c407/Geospatial.md?plain=1#L27
>
> This paves the way to make geospatial data a first-class citizen in
> analytics.