terraflow.export¶

Re-indexes pipeline output to H3 hexagonal cells for interop with DeckGL, Kepler.gl, and h3pandas. The h3-py dependency is optional — install with pip install terraflow-agro[h3].

Public surface¶

to_h3(features, resolution=8) — convert a features DataFrame to an H3-indexed structure.
run_export(config, ...) — orchestrator used by terraflow export --format h3.

API Reference¶

`export` ¶

H3-indexed export adapter for TerraFlow pipeline output.

`run_export(config_path, resolution_override=None, format='h3')` ¶

Run H3 export on an existing pipeline run.

Parameters:

Name	Type	Description	Default
`config_path`	`Path \| str`	Path to YAML config file (must have an `export:` section).	required
`resolution_override`	`int \| None`	If provided, overrides `export.h3_resolution` from config for the H3 conversion and output filename. Does not affect the run directory (which is determined by the on-disk config fingerprint).	`None`
`format`	`str`	Export format. Currently only `"h3"` is supported.	`'h3'`

Returns:

Type	Description
`Path`	Path to the written H3 parquet artifact.

Raises:

Type	Description
`ValueError`	If format is unsupported, config has no `export:` section, or resolution is outside 0-15.
`FileNotFoundError`	If no pipeline run directory containing `features.parquet` is found.

Source code in terraflow/export.py

def run_export(
    config_path: Path | str,
    resolution_override: int | None = None,
    format: str = "h3",
) -> Path:
    """Run H3 export on an existing pipeline run.

    Parameters
    ----------
    config_path : Path | str
        Path to YAML config file (must have an ``export:`` section).
    resolution_override : int | None
        If provided, overrides ``export.h3_resolution`` from config for the
        H3 conversion and output filename. Does not affect the run directory
        (which is determined by the on-disk config fingerprint).
    format : str
        Export format. Currently only ``"h3"`` is supported.

    Returns
    -------
    Path
        Path to the written H3 parquet artifact.

    Raises
    ------
    ValueError
        If format is unsupported, config has no ``export:`` section, or
        resolution is outside 0-15.
    FileNotFoundError
        If no pipeline run directory containing ``features.parquet`` is found.
    """
    if format != "h3":
        raise ValueError(
            f"Unsupported export format: '{format}'. Supported formats: h3"
        )

    data = load_config_dict(config_path)
    cfg = build_config(data)

    if cfg.export is None:
        raise ValueError(
            "Config file has no 'export:' section. "
            "Add an export: block with h3_resolution (0-15). "
            "See TerraFlow documentation for details."
        )

    # Effective resolution: CLI override > config value (per D-01, D-12)
    effective_resolution = (
        resolution_override
        if resolution_override is not None
        else cfg.export.h3_resolution
    )

    if not (0 <= effective_resolution <= 15):
        raise ValueError(f"H3 resolution must be 0-15, got {effective_resolution}")

    # Run dir is determined by on-disk config fingerprint (per D-07)
    run_dir = resolve_run_dir(config_path)
    features_path = run_dir / "features.parquet"
    if not features_path.exists():
        raise FileNotFoundError(
            f"No pipeline run found at {run_dir}. "
            "Run `terraflow run -c config.yml` before exporting."
        )

    logger.info(f"Running H3 export (resolution={effective_resolution}) on {run_dir}")

    df = pd.read_parquet(features_path)
    h3_df = to_h3(df, resolution=effective_resolution)

    output_path = run_dir / f"h3_resolution_{effective_resolution}.parquet"
    _atomic_write_parquet(
        output_path,
        h3_df.reset_index(),
        {"export_format": "h3", "h3_resolution": str(effective_resolution)},
    )

    logger.info(f"H3 export written to {output_path}")
    return output_path

`to_h3(features, resolution=8)` ¶

Convert features DataFrame to H3-indexed DataFrame.