Skip to content

Commit d2be387

Browse files
committed
Docs on raster write: samples, parquet
Signed-off-by: Jason T. Brown <[email protected]>
1 parent ea5a93a commit d2be387

File tree

3 files changed

+132
-10
lines changed

3 files changed

+132
-10
lines changed

pyrasterframes/src/main/python/docs/raster-io.md

Lines changed: 3 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -2,19 +2,18 @@
22

33
The standard mechanism by which any data is brought in and out of a Spark Dataframe is the [Spark SQL DataSource][DS]. RasterFrames provides specialized DataSources for geospatial raster data and maintains compatibility with existing general purpose DataSources, such as Parquet.
44

5-
Three types of DataSources will be introduced:
6-
75
* @ref:[Catalog Readers](raster-catalogs.md)
86
- `aws-pds-l8-catalog`: built-in catalog over [Landsat on AWS][Landsat]
97
- `aws-pds-modis-catalog`: built-in catalog over [MODIS on AWS][MODIS]
108
- `geotrellis-catalog`: for enumerating [GeoTrellis layers][GTLayer]
119
* @ref:[Raster Readers](raster-read.md)
12-
- `raster`: the standard reader for most raster data
10+
- `raster`: the standard reader for most raster data, including single raster files or catalogs
1311
- `geotiff`: a simplified reader for reading a single GeoTIFF file
1412
- `geotrellis`: for reading a [GeoTrellis layer][GTLayer])
1513
* @ref:[Raster Writers](raster-write.md)
16-
- `geotrellis`: for creating a [GeoTrellis layer][GTLayer]
14+
- @ref:[Tile](raster-write.md#tile-samples) and @ref:[DataFrame](raster-write.md#dataframe-samples) samples
1715
- `geotiff`: beta writer to GeoTiff file
16+
- `geotrellis`: creating a [GeoTrellis layer][GTLayer]
1817
- [`parquet`][Parquet]: general purpose writer
1918

2019
There is also support for @ref:[vector data](vector-data.md) for masking and data labeling.

pyrasterframes/src/main/python/docs/raster-read.pymd

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -12,13 +12,15 @@ RasterFrames registers a DataSource named `raster` that enables reading of GeoTI
1212

1313
## Single Raster
1414

15-
The simplest form is reading a single raster from a single URI:
15+
The simplest form is reading a single raster from a single URI.
1616

1717
```python read_one_uri
1818
rf = spark.read.raster('https://s22s-test-geotiffs.s3.amazonaws.com/luray_snp/B02.tif')
1919
rf.printSchema()
2020
```
2121

22+
The file at the address above is a valid [Cloud Optimized GeoTIFF (COG)](https://www.cogeo.org/), which RasterFrames fully supports. RasterFrames will take advantage of the optimizations in the COG format to enable more efficient reading compared to vanilla GeoTIFFs.
23+
2224
Let's unpack the `proj_raster` column and look at the contents in more detail. It contains a [_CRS_][CRS], a spatial _extent_ measured in that CRS, and a two-dimensional array of numeric values called a _tile_.
2325

2426
```python unpack_schema

0 commit comments

Comments
 (0)