Hot Spot Analysis

Identify statistically significant clusters of high and low values using Getis-Ord Gi* statistics.

Overview

Hot spot analysis uses the Getis-Ord Gi* statistic to identify statistically significant spatial clusters. Features are classified as:

99% Hot Spot: Very high values, 99% confidence
95% Hot Spot: High values, 95% confidence
90% Hot Spot: High values, 90% confidence
Not Significant: No significant clustering
90% Cold Spot: Low values, 90% confidence
95% Cold Spot: Low values, 95% confidence
99% Cold Spot: Very low values, 99% confidence

Inputs

Dataset: Point or polygon dataset
Value Field: Numeric field to analyze
Neighbor Type: Distance-based or K-nearest neighbors
Distance (if distance-based): Maximum neighbor distance
K Neighbors (if KNN): Number of nearest neighbors

Outputs

New dataset containing:

Original geometry
Gi Z-Score*: Standardized z-score
P-Value: Statistical significance
Hot Spot Class: Categorized class
Original attributes

Algorithm

Calculate spatial weights matrix based on neighbor configuration
Compute Getis-Ord Gi* statistic for each feature
Calculate z-scores and p-values
Categorize into hot spot classes
Store results in output dataset

Example

{
  "dataset_id": 123,
  "value_field": "population",
  "neighbor_type": "distance",
  "distance": 1000
}

Background Jobs

This analysis runs as a background job. See Hot Spot Analysis Worker for details.

Use Cases

Crime analysis
Disease clustering
Economic activity patterns
Environmental monitoring
Social phenomena analysis

Notes

Requires numeric field with sufficient variation
Distance should be appropriate for data scale
KNN method is generally faster for large datasets
Results depend on neighbor configuration