Spatial analysis
Spatial analysis is the engine that derives new data from existing data: spatial operations (buffer / intersect / proximity / etc.), aggregate statistics, and AI-driven defect detection, change detection, and natural-language querying. Every analysis output is itself a feature layer, so results are visible on the map immediately and are valid input to the next operation. Everything is chainable.
Core data model
`geoprocessing_jobs`:
| Column | Purpose |
|---|---|
| `operation` | BUFFER / INTERSECT / UNION / DIFFERENCE / CLIP / PROXIMITY / SPATIAL_JOIN / HEATMAP / ISOCHRONE / STATS. |
| `input_layer_ids[]` | Array of input feature classes. |
| `parameters` (JSONB) | Operation-specific (buffer distance, proximity radius, heatmap bandwidth, isochrone time window, …). |
| `output_feature_class_id` | The new layer produced. |
| `status` | QUEUED / RUNNING / COMPLETED / FAILED. |
| `progress` | 0–100. |
`ai_jobs` (in development):
| Column | Purpose |
|---|---|
| Job kind | Semantic segmentation / defect detection / change detection / vegetation encroachment / mesh quality / scan-to-BIM / IoT anomaly / predictive maintenance / simulation surrogate / report narrative / NL querying / data valuation. |
| `model_version`, `confidence_threshold`, `status` | Reproducibility, tuning, lifecycle. |
`analysis_results` (in development): VOLUME / CHANGE_DETECTION / DEFECT_SCAN with JSONB output.
Async operations run via Kafka on a dedicated geoprocessing worker: buffer, intersect, union, difference, clip, proximity (nearest N features), spatial join (count-in-polygon, sum-in-polygon), heatmap density generation, attribute statistics within a region, and isochrones (drive / walk-time catchments via pgRouting). All results are produced as new feature layers, so every analysis is chainable.
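One way the worker could turn a queued job into PostGIS is sketched below. The table and column names (`features`, `geom`, `layer_id`) and the parameter keys are assumptions for illustration; the real worker writes results into a new feature class rather than returning a SELECT.

```python
def build_operation_sql(operation: str, params: dict) -> str:
    """Translate a queued geoprocessing job into illustrative PostGIS SQL."""
    if operation == "BUFFER":
        # Casting to geography makes ST_Buffer's distance argument metres.
        return (
            "SELECT ST_Buffer(geom::geography, {d})::geometry AS geom "
            "FROM features WHERE layer_id = '{layer}'"
        ).format(d=params["distance_m"], layer=params["layer_id"])
    if operation == "INTERSECT":
        return (
            "SELECT ST_Intersection(a.geom, b.geom) AS geom "
            "FROM features a JOIN features b ON ST_Intersects(a.geom, b.geom) "
            "WHERE a.layer_id = '{a}' AND b.layer_id = '{b}'"
        ).format(a=params["layer_a"], b=params["layer_b"])
    raise NotImplementedError(operation)
```

In production you would use bound query parameters rather than string formatting; the formatting here only keeps the sketch dependency-free.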
Why it matters
Spatial analysis is Stratumly's wedge against traditional desktop GIS: not the whole of geoprocessing, but enough of the 15–20% of operations operators run weekly (buffer, intersect, proximity, spatial join) to stop them opening a separate desktop tool at all. Public sharing is the contract-renewal hook: when an operator shares a flood-risk map with their council partner, the next budget cycle is yours.
Sample workflow:
The asset team is preparing a regulator submission. They open the bridges layer, filter to "concrete, built pre-1960, not inspected in 18 months": 42 bridges. They run a spatial analysis: buffer each by 500 m, intersect with a flood-risk layer, heat-map the result. A new `flood_risk_bridges` layer is produced. For the three highest-risk bridges, they dispatch drone surveys and let AI flag spalling and deformation. One of the three has a twin, so they run a structural-load simulation and attach the output to the regulator PDF. Three days of work collapsed to an afternoon.
Daily users
- GIS analyst / Asset manager: runs weekly spatial analyses against asset registers.
- Field engineer / Inspector: sees AI-flagged defects in the survey viewer, accepts or rejects them.
- Structural / mechanical engineer: runs simulations on twins.
Geoprocessing (shipped)
| Operation | Output |
|---|---|
| Buffer | Buffer features by distance (m / ft) → new POLYGON layer. |
| Intersect | Features from A overlapping with B → new layer. |
| Union | Merge overlapping features → new layer. |
| Difference | A minus B → new layer. |
| Clip | Clip features to a bounding geometry → new clipped layer. |
| Proximity (Nearest N) | Nearest N features from B to each feature in A, with distance → new layer. |
| Spatial join | Count-in-polygon / sum-in-polygon → new polygon layer with counts/sums as properties. |
| Heatmap | Density layer (POLYGON or RASTER). |
| Isochrone | Drive / walk-time catchment via pgRouting → POLYGON. |
| Attribute stats | Sum / mean / min / max / count over a region → aggregate value. |
- All outputs are new feature layers, chainable; no dead-end exports.
- Async via Kafka: operations are queued as `geoprocessing.jobs.queued` events.
- A Python `geoprocessing-worker` consumes the queue, runs PostGIS operations, and writes results as new feature classes.
- WebSocket `/ws/geoprocessing/{jobId}` streams live progress (0–100, log messages).
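A minimal sketch of the job lifecycle and of the progress frames the WebSocket could carry. The transition table matches the statuses above, but the JSON field names are assumptions, not the documented wire format:

```python
import json

# Legal lifecycle transitions for a geoprocessing job.
TRANSITIONS = {
    "QUEUED": {"RUNNING", "FAILED"},
    "RUNNING": {"COMPLETED", "FAILED"},
    "COMPLETED": set(),
    "FAILED": set(),
}


def advance(status: str, new_status: str) -> str:
    """Enforce QUEUED -> RUNNING -> COMPLETED/FAILED ordering."""
    if new_status not in TRANSITIONS.get(status, set()):
        raise ValueError(f"illegal transition {status} -> {new_status}")
    return new_status


def progress_frame(job_id: str, progress: int, message: str = "") -> str:
    """Serialize one live-progress update (0-100 plus an optional log line)."""
    if not 0 <= progress <= 100:
        raise ValueError("progress must be 0-100")
    return json.dumps({"jobId": job_id, "progress": progress, "message": message})
```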
Spatial analysis UI
Pick operation → select input layers → configure params → run → watch live progress → result becomes a new layer visible on the map immediately.
AI / ML workflows
The workflows below are on our roadmap: the model-driven analyses Stratumly will eventually run end-to-end on customer surveys. Today, the manual annotation workflow ships, and AI-assisted detection is being integrated incrementally.
| Workflow | Why it matters | Approach |
|---|---|---|
| Point-cloud semantic segmentation | Before defects can be detected, the AI needs to know what it's looking at: concrete crack ≠ dirt crack. | PointNet++ or RandLA-Net. |
| Defect detection | Manual review of millions of points per survey doesn't scale. AI flags, humans confirm. Per-industry variants: concrete cracks, rail deformation, pipe corrosion, road surface, embankment slippage. | 3D CNN on point-cloud patches + 2D CNN on orthoimage. |
| Change detection | Humans can't reliably spot 5 cm of subsidence between two surveys. | DTM surface differencing + learned threshold classification. Output: heatmap, area, max deformation, risk score. |
| Vegetation encroachment | Trees on powerlines, roots on pipes: leading cause of failures. | Segmentation + buffer-zone analysis. |
| Mesh quality assessment | Catch poor photogrammetry meshes before engineers waste time on them. | CNN on rendered mesh patches. |
| Scan-to-BIM | Manual BIM from scan takes weeks; AI does first pass in minutes. | 3D object detection + primitive fitting. |
| IoT anomaly detection | A drifting pressure sensor is an early warning. | LSTM / Transformer on time series. |
| Predictive maintenance scoring | Replace reactive maintenance with predictive. | Gradient-boosted trees on tabular features. |
| Simulation surrogate models | FEA simulations take hours; surrogates give an instant ballpark. | Neural net trained on simulation I/O. |
| Compliance report narrative | Engineers shouldn't write boilerplate. | LLM with industry context. |
| Natural-language querying | "Show me all critical defects in the last 6 months on the northern pipeline." | LLM with retrieval over the project database. |
| Marketplace data valuation | Contractors don't know what data is worth. | Regression on listing metadata + historical sales. |
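The change-detection row above (DTM surface differencing plus a threshold) can be sketched without the learned classifier. The grids, cell size, and 5 cm threshold below are made-up inputs for illustration:

```python
def detect_change(dtm_a, dtm_b, cell_area_m2=1.0, threshold_m=0.05):
    """Difference two digital terrain models (2D grids of elevations in
    metres) and summarize cells whose vertical change exceeds the threshold."""
    changed_cells = 0
    max_deformation = 0.0
    for row_a, row_b in zip(dtm_a, dtm_b):
        for z_a, z_b in zip(row_a, row_b):
            delta = abs(z_b - z_a)
            max_deformation = max(max_deformation, delta)
            if delta > threshold_m:
                changed_cells += 1
    return {
        "changed_area_m2": changed_cells * cell_area_m2,
        "max_deformation_m": round(max_deformation, 4),
    }
```

The real pipeline replaces the fixed threshold with a learned classifier and adds a heatmap and risk score; the differencing step is the same.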
Twin simulations
Twin simulations are on our roadmap.
- Flood simulation: water source + volume → propagate through terrain twin.
- Structural load simulation: apply forces → FEA → highlight stress points.
- Asset failure propagation: mark element failed → propagate impact through connected network.
- Proposed-change risk: edit twin → simulate modified version → compare risk scores.
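The failure-propagation item above amounts to a breadth-first walk over the twin's connectivity graph. The graph shape and element ids in this sketch are invented:

```python
from collections import deque


def propagate_failure(network, failed):
    """Return every element impacted by a failed element, including itself.

    `network` maps an element id to the list of elements that directly
    depend on it (e.g. a pump feeding two valves).
    """
    impacted, queue = {failed}, deque([failed])
    while queue:
        node = queue.popleft()
        for dependent in network.get(node, []):
            if dependent not in impacted:
                impacted.add(dependent)
                queue.append(dependent)
    return impacted
```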
How it links to the others
- Analysis → Map. Every geoprocessing output is a new feature layer, immediately visible and chainable as input to the next operation.
- Analysis ← Forms / Submissions. Submission locations and aggregated submission values are valid inputs (e.g. "buffer all defect submissions by 100 m, intersect with flood zones").
- Analysis → Dashboards. Aggregate stats and output layers feed KPI cards, charts, and embedded maps. Compliance metrics that used to be quarterly spreadsheet exercises become live widgets.
- Analysis ↔ Annotations. AI defect detection writes annotations into a system-managed `annotations` feature class; those defects are then themselves valid input to spatial analysis (heatmap by severity, proximity to critical infrastructure, …).
- Analysis ↔ Twins. Twin elements are queryable / aggregable via spatial analysis; simulation results (affected elements, risk score) aggregate back into dashboards. Structural load tests become "what-if" scenarios on the twin.
- Analysis → Compliance reports. Geoprocessing output + AI narrative generation produce regulator-ready PDFs with full audit trail back through submissions, features, edits, and runs.