Community Extensions#
Want to build your own extension?
Start with the Extensions overview, the Python UDF docs, or the Native Extension Authoring Guide.
Community extensions are reusable packages that add functionality to Daft outside of the core repository. They may be pure-Python UDF-based extensions, native ABI extensions, or higher-level domain libraries that combine Daft expressions, UDFs, file types, model calls, and distributed execution patterns.
Projects listed here are maintained independently of Daft's release cadence. Each project installs and versions independently of Daft.
| Name | Kind | Repository | Description |
|---|---|---|---|
| daft-h3 | Native ABI | gweaverbiodev/daft-h3 | Native H3 geospatial indexing functions. |
| daft-lance | Python UDF-based | daft-engine/daft-lance | Lance-specific distributed operations for compaction, scalar indexing, column merging, and REST catalog operations. |
| daft-html | Native ABI | daft-engine/daft-html | Native HTML document processing functions exposed as Daft expressions. |
| daft-geo | Native ABI / Datatypes | daft-engine/daft-geo | Geospatial prototype showing native functions and extension-backed datatypes. |
To propose a new extension for this list, open a PR against this page.
daft-h3#
daft-h3 adds H3 geospatial indexing functions such as latitude/longitude to cell conversion, cell traversal, grid distance, parent resolution, and string conversion.
1 | |
1 2 3 4 5 6 7 8 9 10 11 12 | |
See the daft-h3 README for the full list of functions and behavior details.
daft-lance#
daft-lance extends Daft with Lance-specific distributed maintenance and data management operations. It is a Python UDF-based extension: internally, it uses Daft's Python UDF and class-UDF APIs to distribute Lance tasks across Daft queries, while users interact with simple Python functions.
1 | |
1 2 3 4 5 6 7 8 9 | |
See the daft-lance README for additional operations such as column merging and REST catalog writes.
daft-html#
daft-html adds native HTML processing functions such as html_to_text, html_extract_links, html_extract_tables, and CSS-selector extraction.
1 | |
1 2 3 4 5 6 7 8 9 10 | |
See the daft-html README for the full list of document and CSS-selector functions.
daft-geo#
daft-geo is a geospatial prototype showing native Daft functions and extension-backed datatypes. It defines Point2D and Point3D as DataType.extension(...) values and provides point construction, accessors, Euclidean distance, and haversine distance.
1 | |
Note
daft-geo is not yet published to PyPI. The command above installs the latest commit from the repository's default branch, so behavior may change between installs.
1 2 3 4 5 6 7 8 9 10 | |
See the daft-geo README for current capabilities and caveats.