I/O#

Writing Data#

Schema.write_parquet(df, /, file, **kwargs)

Write a typed data frame with this schema to a parquet file.

Schema.sink_parquet(lf, /, file, **kwargs)

Stream a typed lazy frame with this schema to a parquet file.

Schema.write_delta(df, /, target, **kwargs)

Write a typed data frame with this schema to a Delta Lake table.

Reading Data#

Schema.read_parquet(source, *[, validation])

Read a parquet file into a typed data frame with this schema.

Schema.scan_parquet(source, *[, validation])

Lazily read a parquet file into a typed data frame with this schema.

Schema.read_delta(source, *[, validation])

Read a Delta Lake table into a typed data frame with this schema.

Schema.scan_delta(source, *[, validation])

Lazily read a Delta Lake table into a typed data frame with this schema.

Schema Serialization#

Schema.serialize()

Serialize this schema to a JSON string.

deserialize_schema(data[, strict])

Deserialize a schema from a JSON string.

read_parquet_metadata_schema(source)

Read a dataframely schema from the metadata of a parquet file.