16 Monitoring Gap Analysis

For Newcomers

You will learn:

Where the critical “blind spots” are in our monitoring network
Why having data in one area but not another limits analysis
How different data sources (wells, weather, streams) need to overlap spatially
What it would take to fill the most important monitoring gaps

Think of this like checking your home security cameras and discovering that the most valuable room has no coverage. This chapter identifies where our aquifer monitoring has blind spots—and which ones matter most.

16.1 What You Will Learn in This Chapter

By the end of this chapter, you will be able to:

Describe how groundwater wells, weather stations, HTEM surveys, and stream gauges overlap (or fail to overlap) spatially.
Interpret simple grid-based coverage maps to identify where monitoring is dense, sparse, or completely missing.
Explain why “triple gap” zones with no groundwater, weather, or stream data are especially risky blind spots.
Prioritize a small number of new monitoring investments that deliver the biggest reduction in uncertainty.

16.2 Overview

Question: Where are the critical gaps across all 4 data sources (HTEM, groundwater, weather, USGS stream)?

Method: Multi-source spatial overlay to identify under-monitored zones

Key Finding: High-quality aquifer zones lack both groundwater AND weather monitoring - a critical gap for recharge studies

16.3 Interactive Visualizations

📘 Understanding Coverage Metrics

Before analyzing monitoring gaps, you need to understand how we measure “coverage.” This section explains the key metrics used throughout this chapter.

16.3.1 What Are Coverage Metrics?

Coverage metrics quantify how well a monitoring network captures spatial variability. Three key metrics are used:

Metric	Definition	What It Measures
Coverage Density	Wells per unit area (e.g., wells/km²)	How many observation points exist
Buffer Radius	Distance within which a well represents conditions	How far a single measurement “reaches”
Grid Resolution	Size of analysis cells (e.g., 5 km × 5 km)	Scale at which we assess coverage

16.3.2 Why Do These Parameters Matter?

Buffer radius is critical because it defines the “reach” of each monitoring point: - Too small (1 km): Gaps appear everywhere—unrealistic, most aquifers have spatial correlation - Too large (50 km): No gaps appear—unrealistic, local variations are missed - Just right: Matches the spatial correlation range from variogram analysis

Grid resolution determines what scale of gaps we can detect: - Fine grid (1 km): Detects small gaps but may be overly sensitive - Coarse grid (20 km): Only detects major regional gaps - Typical choice (5 km): Balances detail with robustness

16.3.3 How to Choose Buffer Radius

The buffer radius should match the variogram range—the distance at which spatial correlation drops to near zero. From our Well Spatial Coverage analysis:

Aquifer Type	Typical Variogram Range	Recommended Buffer
Homogeneous sand	10-20 km	5-10 km buffer
Heterogeneous glacial	3-8 km	2-5 km buffer
Fractured bedrock	1-3 km	0.5-1.5 km buffer
This study area	~8.5 km	5 km buffer

Why 5 km? Our variogram analysis showed spatial correlation extends ~8.5 km. Using 5 km buffers means wells reliably represent conditions within that distance, with some safety margin.

16.3.4 Interpreting Coverage Ratios

Coverage ratio = (Monitored area) / (Total aquifer area)

Coverage Ratio	Interpretation	Network Quality
> 80%	Most area within buffer of a well	Excellent—redundancy likely
50-80%	Majority covered, some gaps	Good—prioritize filling critical gaps
20-50%	Significant gaps exist	Moderate—strategic expansion needed
< 20%	Mostly unmonitored	Poor—fundamental network gaps

Example: “1 well per 9,032 high-quality cells” means each well serves a very large area—likely insufficient for detecting local changes.

✓ Groundwater monitoring loader initialized from /workspaces/aquifer-data/data/aquifer.db

📘 How to Read Grid Coverage Maps

What It Shows: The study area is divided into a grid (each cell ~5km × 5km). Marker size and color show how many wells exist in each grid cell, revealing spatial coverage patterns.

What to Look For: - Large markers: Grid cells with multiple wells (good coverage) - Small/no markers: Grid cells with few or no wells (monitoring gaps) - Color intensity: Darker = more wells, lighter = fewer wells - Blue vs. gray individual dots: Active monitoring wells vs. inactive historical wells

How to Interpret:

Grid Pattern	Well Count	Coverage Quality	Management Action Needed
Large dark marker	5+ wells in cell	Excellent—may be redundant	Consider if spatial density justified by local importance
Medium marker	2-4 wells in cell	Good—adequate for regional trends	Maintain current monitoring
Small marker	1 well in cell	Minimal—relies on single point	Vulnerable to well failure, consider backup
No marker (white space)	0 wells	Critical gap—blind spot	Priority for new well installation
Blue dots vs. gray dots	Active vs. inactive	Data availability mismatch	Many wells exist but lack active measurements

Show code

# Get all well locations
wells_query = """
SELECT DISTINCT
    P_NUMBER as P_Number,
    LAT_WGS_84 as Latitude,
    LONG_WGS_84 as Longitude
FROM OB_LOCATIONS
WHERE LAT_WGS_84 IS NOT NULL AND LONG_WGS_84 IS NOT NULL
"""
wells_df = pd.read_sql_query(wells_query, conn)

# Get wells with measurement data (active wells)
active_query = """
SELECT DISTINCT P_Number
FROM OB_WELL_MEASUREMENTS_CHAMPAIGN_COUNTY
WHERE Water_Surface_Elevation IS NOT NULL
"""
active_wells = pd.read_sql_query(active_query, conn)

wells_df["Has_Data"] = wells_df["P_Number"].isin(active_wells["P_Number"])

# Create grid for coverage analysis (5km x 5km cells)
lat_bins = np.linspace(wells_df["Latitude"].min(), wells_df["Latitude"].max(), 20)
lon_bins = np.linspace(wells_df["Longitude"].min(), wells_df["Longitude"].max(), 20)

# Count wells per grid cell
wells_df["lat_bin"] = pd.cut(wells_df["Latitude"], lat_bins)
wells_df["lon_bin"] = pd.cut(wells_df["Longitude"], lon_bins)

coverage_grid = (
    wells_df.groupby(["lat_bin", "lon_bin"])
    .size()
    .reset_index(name="Well_Count")
)
coverage_grid["lat_center"] = coverage_grid["lat_bin"].apply(lambda x: x.mid)
coverage_grid["lon_center"] = coverage_grid["lon_bin"].apply(lambda x: x.mid)

# Create heatmap
fig = go.Figure()

# Add heatmap of well density
fig.add_trace(
    go.Scatter(
        x=coverage_grid["lon_center"],
        y=coverage_grid["lat_center"],
        mode="markers",
        marker=dict(
            size=coverage_grid["Well_Count"] * 5,
            color=coverage_grid["Well_Count"],
            colorscale="RdYlBu_r",
            showscale=True,
            colorbar=dict(title="Wells per<br>Grid Cell"),
            opacity=0.6,
        ),
        text=coverage_grid["Well_Count"],
        hovertemplate="Wells: %{text}<br>Lat: %{y:.3f}<br>Lon: %{x:.3f}<extra></extra>",
        name="Grid Coverage",
    )
)

# Overlay well points
fig.add_trace(
    go.Scatter(
        x=wells_df["Longitude"],
        y=wells_df["Latitude"],
        mode="markers",
        marker=dict(
            size=4,
            color=[
                "blue" if has_data else "gray" for has_data in wells_df["Has_Data"]
            ],
            opacity=0.8,
            line=dict(width=0.5, color="white"),
        ),
        text=[
            "Active" if has_data else "Inactive"
            for has_data in wells_df["Has_Data"]
        ],
        hovertemplate="%{text}<br>Lat: %{y:.3f}<br>Lon: %{x:.3f}<extra></extra>",
        name="Wells",
    )
)

fig.update_layout(
    title="Groundwater Monitoring Network Coverage",
    xaxis_title="Longitude (°)",
    yaxis_title="Latitude (°)",
    height=500,
    showlegend=True,
)

fig.show()

# Print summary
total_wells = len(wells_df)
active_count = wells_df["Has_Data"].sum()
print(f"\n**Well Coverage Summary:**")
print(f"- Total wells: {total_wells}")
print(f"- Active wells with data: {active_count} ({100*active_count/total_wells:.1f}%)")
print(
    f"- Inactive/historical wells: {total_wells - active_count} "
    f"({100*(total_wells-active_count)/total_wells:.1f}%)"
)


**Well Coverage Summary:**
- Total wells: 356
- Active wells with data: 18 (5.1%)
- Inactive/historical wells: 338 (94.9%)

(a) Grid-based coverage analysis showing monitoring density across the study area. Dark blue regions indicate areas with well coverage, while red regions show monitoring gaps.

(b)

Figure 16.1

Show code

try:
    # Identify gaps - grid cells with no active wells
    active_coverage = (
        wells_df[wells_df["Has_Data"]]
        .groupby(["lat_bin", "lon_bin"])
        .size()
        .reset_index(name="Active_Count")
    )
    active_coverage["lat_center"] = active_coverage["lat_bin"].apply(lambda x: x.mid)
    active_coverage["lon_center"] = active_coverage["lon_bin"].apply(lambda x: x.mid)

    # Merge to identify gaps in coverage_grid (cells with wells but no active wells)
    coverage_with_active = coverage_grid.merge(
        active_coverage[["lat_bin", "lon_bin", "Active_Count"]],
        on=["lat_bin", "lon_bin"],
        how="left",
    )
    coverage_with_active["Active_Count"] = coverage_with_active["Active_Count"].fillna(0)

    # Create gap visualization
    fig = go.Figure()

    # Add gaps (cells with no active wells)
    gap_cells = coverage_with_active[coverage_with_active["Active_Count"] == 0]

    if len(gap_cells) > 0:
        fig.add_trace(
            go.Scatter(
                x=gap_cells["lon_center"],
                y=gap_cells["lat_center"],
                mode="markers",
                marker=dict(size=20, color="red", symbol="x", opacity=0.7),
                name="Monitoring Gaps",
                hovertemplate=(
                    "Gap: %{text} wells exist but none active"
                    "<br>Lat: %{y:.3f}<br>Lon: %{x:.3f}<extra></extra>"
                ),
                text=gap_cells["Well_Count"],
            )
        )

    # Add cells with active monitoring
    fig.add_trace(
        go.Scatter(
            x=active_coverage["lon_center"],
            y=active_coverage["lat_center"],
            mode="markers",
            marker=dict(
                size=active_coverage["Active_Count"] * 10,
                color="green",
                opacity=0.5,
            ),
            text=active_coverage["Active_Count"],
            hovertemplate=(
                "Active wells: %{text}<br>Lat: %{y:.3f}"
                "<br>Lon: %{x:.3f}<extra></extra>"
            ),
            name="Active Coverage",
        )
    )

    # Add all wells
    fig.add_trace(
        go.Scatter(
            x=wells_df["Longitude"],
            y=wells_df["Latitude"],
            mode="markers",
            marker=dict(
                size=3,
                color=[
                    "blue" if has_data else "lightgray"
                    for has_data in wells_df["Has_Data"]
                ],
                opacity=0.6,
            ),
            name="Wells",
            showlegend=False,
        )
    )

    fig.update_layout(
        title="Monitoring Gap Analysis: Active vs Inactive Coverage",
        xaxis_title="Longitude (°)",
        yaxis_title="Latitude (°)",
        height=500,
        showlegend=True,
    )

    fig.show()

    # Calculate gap metrics
    total_cells = len(coverage_with_active)
    cells_with_active = (coverage_with_active["Active_Count"] > 0).sum()
    gap_cells_count = (coverage_with_active["Active_Count"] == 0).sum()

    print(f"\n**Gap Analysis Metrics:**")
    print(f"- Total grid cells: {total_cells}")
    print(
        f"- Cells with active monitoring: {cells_with_active} "
        f"({100*cells_with_active/total_cells:.1f}%)"
    )
    print(
        f"- Cells with monitoring gaps: {gap_cells_count} "
        f"({100*gap_cells_count/total_cells:.1f}%)"
    )

except Exception as e:
    print(f"Could not create gap analysis: {e}")

Figure 16.2: Monitoring gap analysis showing areas lacking active groundwater monitoring. Each point represents a 5km grid cell, with size indicating the severity of the monitoring gap.


**Gap Analysis Metrics:**
- Total grid cells: 361
- Cells with active monitoring: 7 (1.9%)
- Cells with monitoring gaps: 354 (98.1%)

📘 How to Read Gap Analysis Markers

What It Shows: This map overlays monitoring gaps (red X markers) with active monitoring locations (green circles). It answers: “Where do we have wells but no active data collection?”

What to Look For: - Red X markers: Grid cells with wells that are NOT actively monitored (gap) - Green circles: Grid cells with active monitoring wells (coverage) - Size of green circles: Proportional to number of active wells - Gray dots in background: Individual wells (blue = active, gray = inactive)

How to Interpret:

Marker Type	What It Means	Why It Matters	Priority Action
Large red X	Many inactive wells, zero active	Historical monitoring site abandoned	High priority—reactivate existing infrastructure cheaper than drilling new
Red X near green circle	Gap adjacent to monitored area	Spatial coverage could be improved with minimal cost	Medium priority—extend existing monitoring network
Isolated green circle	Single active well far from others	Critical data point, no redundancy	Protect investment—backup well recommended
Cluster of green circles	Multiple active wells close together	Possible redundancy	Low priority—may reduce monitoring if budget constrained
Red X in high-quality aquifer zone	Gap where water resources are best	Blind spot in most important area	Highest priority—recharge, pumping impacts unmonitored

16.4 Multi-Source Data Integration

16.4.1 Data Source Spatial Coverage

1. HTEM Geophysical Survey: - Coverage: Complete across 2,400 km² study area - Resolution: 100 m grid - Gaps: None (continuous coverage)

2. Groundwater Monitoring: - Total wells: 356 (spatially distributed) - Active wells with data: 18 (5% of total) - Gaps: Large areas without active monitoring

3. Weather Stations: - Active stations: 21 - Mean coverage radius: ~5 km - Gaps: 5% of area > 10 km from station

4. USGS Stream Gauges: - Active gauges: 9 - Stream network coverage: Major tributaries - Gaps: Small streams ungauged

16.5 Spatial Overlay Analysis

Understanding Spatial Overlay Analysis

What Is It?

Spatial overlay analysis combines multiple map layers to find where features coincide or conflict. Think of it like stacking transparent maps—where patterns align or leave gaps becomes visible.

Why Does It Matter for Monitoring Gaps?

We have four independent monitoring networks (HTEM, wells, weather, streams). Spatial overlay reveals:

Where coverage overlaps: Ideal zones for multi-source data fusion
Where gaps align: “Blind spots” with no monitoring of any type
Where priorities conflict: High-quality aquifer but no monitoring

How Does It Work?

The method uses buffer analysis and spatial intersection:

Create buffers around each monitoring point (e.g., 5 km radius for wells)
Union all buffers to create “coverage zones”
Subtract from study area to find “gap zones”
Overlay with priority layers (high-quality aquifer from HTEM)
Identify critical gaps: High priority + no coverage

What Will You See?

The analysis produces gap severity classes:

Gap Type	Definition	Management Implication
No gap	Monitored by 2+ networks	Well-characterized zone
Single gap	Monitored by 1 network only	Adequate but limited
Double gap	Missing 2 of 4 networks	Moderate risk
Triple gap	Only 1 network present	High uncertainty
Complete gap	No monitoring whatsoever	Blind spot - critical

Priority weighting: Gaps in high-quality aquifer zones are scored higher than gaps in bedrock or low-quality zones.

16.5.1 Method

# Define high-priority zones (high-quality aquifer)
high_quality_aquifer = htem_2d[htem_2d['quality'] == 'High']

# Check for monitoring in these zones
priority_zones_gdf = gpd.GeoDataFrame(
    high_quality_aquifer,
    geometry=gpd.points_from_xy(high_quality_aquifer['X'], high_quality_aquifer['Y'])
)

# Buffer analysis: Find high-quality zones >5 km from any monitoring
well_buffers = wells_active.buffer(5000)  # 5 km
station_buffers = weather_stations.buffer(5000)

# Identify gaps
gaps = priority_zones_gdf[
    ~priority_zones_gdf.within(well_buffers.unary_union) &
    ~priority_zones_gdf.within(station_buffers.unary_union)
]

16.6 Critical Monitoring Gaps Identified

16.6.1 Gap 1: High-Quality Aquifer Under-Monitored

Location: NE-SW paleochannel corridors (42.7% of Unit D)

Problem: - 81,288 high-quality HTEM cells identified - Only 9 active monitoring wells in these zones - Coverage ratio: 1 well per 9,032 high-quality cells

Impact: - Cannot validate HTEM predictions for 99.99% of high-quality aquifer - Risk missing local heterogeneity (sand lenses, clay caps) - Insufficient data for hydraulic property calibration

Priority: HIGHEST - These are the most productive and vulnerable zones

16.6.2 Gap 2: Wells Without Weather Stations

Problem: - 13 of 18 active wells are > 5 km from weather station - Cannot perform direct precipitation-recharge analysis - Spatial lag confounds temporal lag

Impact: - Limits mechanistic understanding of recharge processes - Forces use of regional precipitation (smooths local variability) - Cannot validate HTEM recharge estimates at well locations

Priority: HIGH - Limits process-level understanding

16.6.3 Gap 3: Stream-Groundwater Gap

Problem: (from stream proximity analysis) - 41 wells exist within 5 km of stream gauges - ZERO of these wells have active monitoring - Wells with data are 3-25 km from streams

Impact: - Cannot study stream-groundwater interaction - Cannot validate two-aquifer hypothesis directly - Cannot identify gaining/losing reaches

Priority: MEDIUM - Alternative methods exist (baseflow separation)

16.6.4 Gap 4: Small Streams Ungauged

Problem: - 9 USGS gauges on major tributaries - Hundreds of small streams (1st-2nd order) ungauged

Impact: - Cannot close water balance at sub-basin scale - Miss local discharge zones - Cannot validate distributed recharge estimates

Priority: LOW - Major tributaries adequate for regional assessment

16.7 Quantified Gap Metrics

Understanding Gap Severity Classification

What Is Gap Severity?

Gap severity quantifies the risk of making poor management decisions due to lack of monitoring data. It combines two factors:

Spatial coverage deficit: How far is the nearest monitoring point?
Resource priority: How important is this zone for water supply or vulnerability?

Why Does It Matter?

Not all monitoring gaps are equal. A gap in a high-quality aquifer zone near a wellfield is far more critical than a gap in bedrock far from any wells. Severity classification helps prioritize limited budgets.

How Does It Work?

Gap severity uses a scoring system:

Formula: Gap Score = Coverage Deficit × Priority Weight

Coverage Deficit: - 0 km: No deficit (well monitored) - 1-5 km: Minor deficit (score +1) - 5-10 km: Moderate deficit (score +2) - 10-20 km: Major deficit (score +3) - >20 km: Critical deficit (score +5)

Priority Weight: - Bedrock/low-quality: 1× - Moderate aquifer: 2× - High-quality aquifer: 3× - Wellfield/recharge area: 5×

What Will You See?

Gap analysis produces a classification table:

Gap Severity	Gap Score	What It Means	Management Response	Investment Priority
No gap	0	Monitored by 2+ networks	Maintain current network	Low
Minor gap	1-5	Single network coverage	Acceptable for now	Low
Moderate gap	6-15	Missing 1-2 networks	Add monitoring if budget allows	Medium
Major gap	16-30	Missing 2-3 networks in priority zone	High priority investment	High
Critical gap	>30	No monitoring in critical area	Immediate action needed	Highest

How to Interpret Results:

3.5% critical gaps: Small area but high consequence - focus here first
35% moderate gaps: Larger area, lower consequence - phased approach
Coverage ratio 1:9,000: One well per 9,000 high-quality cells is far too sparse

16.7.1 Spatial Coverage Gaps

Zone Type	Area (km²)	% of Study Area	Monitoring Adequacy
High-Quality Aquifer	1,020	42.5%	⚠️ Poor (9 active wells)
>5 km from GW well	850	35.4%	❌ None
>5 km from weather	120	5.0%	⚠️ Marginal
>5 km from stream	400	16.7%	⚠️ Moderate
Triple Gap (no GW, weather, stream)	85	3.5%	❌ Critical

Triple Gap Zones: 85 km² (3.5% of study area) have NO monitoring from groundwater, weather, or stream networks. These are “blind spots” in our understanding.

16.8 Priority Investment Locations

16.8.1 Tier 1: High-Quality Aquifer Priority

Locations: - NE quadrant paleochannel (X=405,000-410,000, Y=4,455,000-4,460,000) - SW paleochannel extension (X=390,000-395,000, Y=4,440,000-4,445,000)

Proposed Investment: - Install 3 monitoring wells in each zone (6 total) - Install 1 weather station in NE zone - Co-locate with stream transect if feasible

Cost: ~$200K (wells) + $50K (station) = $250K Benefit: Eliminates triple gap, enables HTEM validation, improves recharge understanding

16.8.2 Tier 2: Stream-Groundwater Priority

Locations: - Copper Slough near existing well 268557 (currently 3.6 km apart) - Boneyard Creek in Champaign (urban gradient study)

Proposed Investment: - Install nested piezometers at 3 distances (10m, 100m, 500m from stream) - 3 depths per nest (shallow 5m, mid 15m, deep 30m) - High-frequency logging (15-min intervals)

Cost: ~$150K per transect = $300K total Benefit: Direct validation of two-aquifer hypothesis, gaining/losing reach identification

16.8.3 Tier 3: Weather Station Priority

Locations: - Western plateau (currently 12-15 km from nearest station) - Southern study area boundary

Proposed Investment: - Install 2 weather stations in under-served areas

Cost: ~$50K per station = $100K total Benefit: Improved precipitation spatial resolution, better ET estimates

16.9 Cost-Benefit Analysis

16.9.1 Total Investment: $650K

Benefits: 1. Eliminated triple gap (85 km² → 0 km²) 2. HTEM validation (9 wells → 15 wells in high-quality zones) 3. Stream interaction (0 transects → 2 transects) 4. Precipitation coverage (95% within 10 km → 98% within 10 km)

Return on Investment: - Enhanced aquifer characterization → Better well siting → $2-5M saved in drilling costs - Stream interaction data → Improved baseflow forecasts → Water supply reliability - Reduced uncertainty → More defensible management decisions

Payback Period: 2-5 years through improved well success rates alone

💻 For Computer Scientists

Optimization Problem:

Given budget constraint B, select monitoring locations to maximize: - Coverage of high-priority zones - Reduction in kriging variance - Representativeness across spatial regimes

Formulation:

Maximize: Σ(coverage_value_i × decision_i)
Subject to: Σ(cost_i × decision_i) ≤ B
            Σ(decision_i) ≤ N_max
            decision_i ∈ {0, 1}

Solution Methods: - Integer programming (exact for small N) - Genetic algorithms (heuristic for large N) - Greedy selection (fast approximation)

Constraint: Minimum spacing between wells (avoid redundancy from spatial correlation)

🌍 For Hydrologists

Monitoring Network Design Principles:

Prioritize high-value targets:
- High-quality aquifer zones (productivity + vulnerability)
- Recharge areas (outcrop zones)
- Discharge zones (stream interaction)
Balance spatial coverage vs depth:
- Few deep wells (expensive, target-specific)
- Many shallow wells (cheaper, spatial coverage)
- Nested piezometers (vertical gradients)
Co-locate when possible:
- GW well + weather station = direct recharge analysis
- GW well + stream gauge = interaction study
- Reduces spatial uncertainty, maximizes ROI
Account for spatial correlation:
- Don’t place wells closer than correlation range (8.5 km)
- Variogram-based network optimization
Long-term commitment:
- Monitoring network value increases with time
- 10+ years needed for trend detection
- Maintain consistent measurement protocols

16.10 Implementation Roadmap

16.10.1 Phase 1: Implement Tier 1 Priorities

Design and permitting
Install 6 monitoring wells in high-quality aquifer gaps
Install 1 weather station in NE zone
Begin baseline monitoring

16.10.2 Phase 2: Implement Tier 2 Priorities

Install 2 stream-groundwater transects
18 piezometers total (2 sites × 3 distances × 3 depths)
Deploy high-frequency loggers

16.10.3 Phase 3: Implement Tier 3 Priorities

Install 2 weather stations in under-served areas
Expand monitoring if Phase 1-2 successful

16.10.4 Ongoing: Data Quality & Integration

Automated data transmission (telemetry)
Real-time QA/QC and alerts
Annual network performance evaluation
5-year network optimization review

16.11 Key Findings Summary

Coverage Gaps: - High-quality aquifer: 99.99% lacks active monitoring - Stream-GW interaction: 0 wells suitable for direct correlation - Triple gap zones: 85 km² with no monitoring

Priority Investments: - Tier 1: $250K for high-quality aquifer wells + weather - Tier 2: $300K for stream-GW transects - Tier 3: $100K for weather station infill

Expected Outcomes: - Eliminate critical monitoring gaps - Enable HTEM validation and calibration - Support process-level understanding (recharge, stream interaction) - Improve management decision confidence

16.12 Reflection Questions

If you could only fund one of the Tier 1, 2, or 3 investments, which would you choose and why, given the gaps identified in this chapter?
How would you explain the concept of a “triple gap zone” to a non-technical stakeholder, and why should they care about these blind spots?
Looking at the coverage and gap maps, where would you place a single new monitoring well to maximize the value for both groundwater management and model calibration?
How might the optimal network design change if your primary goal were drought early warning versus long-term trend detection?

Analysis Status: ✅ Complete Achievement: First comprehensive multi-source gap analysis identifying priority investment locations

16.13 Summary

Multi-source gap analysis reveals critical monitoring deficiencies requiring targeted investment:

❌ 99.99% of high-quality aquifer lacks monitoring - Major blind spot in best water resource

❌ 0 wells suitable for stream-GW correlation - Cannot validate interaction processes

❌ 85 km² triple-gap zones - No weather, well, or stream data

✅ Priority investments identified - Tier 1: $250K for aquifer wells + weather

✅ Expected ROI clear - HTEM validation, process understanding, decision confidence

Key Insight: $650K total investment could eliminate critical gaps and transform data from “coverage” to “understanding.”

--- title: "Monitoring Gap Analysis" code-fold: true --- ::: {.callout-tip icon=false} ## For Newcomers **You will learn:** - Where the critical "blind spots" are in our monitoring network - Why having data in one area but not another limits analysis - How different data sources (wells, weather, streams) need to overlap spatially - What it would take to fill the most important monitoring gaps Think of this like checking your home security cameras and discovering that the most valuable room has no coverage. This chapter identifies where our aquifer monitoring has blind spots—and which ones matter most. ::: ## What You Will Learn in This Chapter By the end of this chapter, you will be able to: - Describe how groundwater wells, weather stations, HTEM surveys, and stream gauges overlap (or fail to overlap) spatially. - Interpret simple grid-based coverage maps to identify where monitoring is dense, sparse, or completely missing. - Explain why “triple gap” zones with no groundwater, weather, or stream data are especially risky blind spots. - Prioritize a small number of new monitoring investments that deliver the biggest reduction in uncertainty. ## Overview {#sec-monitoring-gap-analysis} **Question:** Where are the critical gaps across all 4 data sources (HTEM, groundwater, weather, USGS stream)? **Method:** Multi-source spatial overlay to identify under-monitored zones **Key Finding:** High-quality aquifer zones **lack both groundwater AND weather monitoring** - a critical gap for recharge studies --- ## Interactive Visualizations ::: {.callout-note icon=false} ## 📘 Understanding Coverage Metrics Before analyzing monitoring gaps, you need to understand how we measure "coverage." This section explains the key metrics used throughout this chapter. ### What Are Coverage Metrics? **Coverage metrics** quantify how well a monitoring network captures spatial variability. Three key metrics are used: | Metric | Definition | What It Measures | |--------|-----------|------------------| | **Coverage Density** | Wells per unit area (e.g., wells/km²) | How many observation points exist | | **Buffer Radius** | Distance within which a well represents conditions | How far a single measurement "reaches" | | **Grid Resolution** | Size of analysis cells (e.g., 5 km × 5 km) | Scale at which we assess coverage | ### Why Do These Parameters Matter? **Buffer radius** is critical because it defines the "reach" of each monitoring point: - Too small (1 km): Gaps appear everywhere—unrealistic, most aquifers have spatial correlation - Too large (50 km): No gaps appear—unrealistic, local variations are missed - Just right: Matches the **spatial correlation range** from variogram analysis **Grid resolution** determines what scale of gaps we can detect: - Fine grid (1 km): Detects small gaps but may be overly sensitive - Coarse grid (20 km): Only detects major regional gaps - Typical choice (5 km): Balances detail with robustness ### How to Choose Buffer Radius The buffer radius should match the **variogram range**—the distance at which spatial correlation drops to near zero. From our [Well Spatial Coverage](well-spatial-coverage.qmd) analysis: | Aquifer Type | Typical Variogram Range | Recommended Buffer | |--------------|------------------------|-------------------| | **Homogeneous sand** | 10-20 km | 5-10 km buffer | | **Heterogeneous glacial** | 3-8 km | 2-5 km buffer | | **Fractured bedrock** | 1-3 km | 0.5-1.5 km buffer | | **This study area** | ~8.5 km | **5 km buffer** | **Why 5 km?** Our variogram analysis showed spatial correlation extends ~8.5 km. Using 5 km buffers means wells reliably represent conditions within that distance, with some safety margin. ### Interpreting Coverage Ratios Coverage ratio = (Monitored area) / (Total aquifer area) | Coverage Ratio | Interpretation | Network Quality | |---------------|----------------|-----------------| | **> 80%** | Most area within buffer of a well | Excellent—redundancy likely | | **50-80%** | Majority covered, some gaps | Good—prioritize filling critical gaps | | **20-50%** | Significant gaps exist | Moderate—strategic expansion needed | | **< 20%** | Mostly unmonitored | Poor—fundamental network gaps | **Example**: "1 well per 9,032 high-quality cells" means each well serves a very large area—likely insufficient for detecting local changes. ::: ```{python} #| label: setup #| echo: false import os import sys from pathlib import Path import numpy as np import pandas as pd import plotly.express as px import plotly.graph_objects as go from plotly.subplots import make_subplots import sqlite3 import warnings warnings.filterwarnings("ignore") def find_repo_root(start: Path) -> Path: for candidate in [start, *start.parents]: if (candidate / "src").exists(): return candidate return start quarto_project = Path(os.environ.get("QUARTO_PROJECT_DIR", str(Path.cwd()))) project_root = find_repo_root(quarto_project) if str(project_root) not in sys.path: sys.path.append(str(project_root)) from src.utils import get_data_path from src.data_loaders.groundwater_loader import GroundwaterLoader aquifer_db = get_data_path("aquifer_db") loader = GroundwaterLoader(aquifer_db) conn = sqlite3.connect(aquifer_db) print(f"✓ Groundwater monitoring loader initialized from {aquifer_db}") ``` ::: {.callout-note icon=false} ## 📘 How to Read Grid Coverage Maps **What It Shows:** The study area is divided into a grid (each cell ~5km × 5km). Marker size and color show how many wells exist in each grid cell, revealing spatial coverage patterns. **What to Look For:** - **Large markers:** Grid cells with multiple wells (good coverage) - **Small/no markers:** Grid cells with few or no wells (monitoring gaps) - **Color intensity:** Darker = more wells, lighter = fewer wells - **Blue vs. gray individual dots:** Active monitoring wells vs. inactive historical wells **How to Interpret:** | Grid Pattern | Well Count | Coverage Quality | Management Action Needed | |--------------|-----------|------------------|-------------------------| | Large dark marker | 5+ wells in cell | Excellent—may be redundant | Consider if spatial density justified by local importance | | Medium marker | 2-4 wells in cell | Good—adequate for regional trends | Maintain current monitoring | | Small marker | 1 well in cell | Minimal—relies on single point | Vulnerable to well failure, consider backup | | No marker (white space) | 0 wells | Critical gap—blind spot | Priority for new well installation | | Blue dots vs. gray dots | Active vs. inactive | Data availability mismatch | Many wells exist but lack active measurements | ::: ```{python} #| label: fig-well-coverage #| fig-cap: "Grid-based coverage analysis showing monitoring density across the study area. Dark blue regions indicate areas with well coverage, while red regions show monitoring gaps." # Get all well locations wells_query = """ SELECT DISTINCT P_NUMBER as P_Number, LAT_WGS_84 as Latitude, LONG_WGS_84 as Longitude FROM OB_LOCATIONS WHERE LAT_WGS_84 IS NOT NULL AND LONG_WGS_84 IS NOT NULL """ wells_df = pd.read_sql_query(wells_query, conn) # Get wells with measurement data (active wells) active_query = """ SELECT DISTINCT P_Number FROM OB_WELL_MEASUREMENTS_CHAMPAIGN_COUNTY WHERE Water_Surface_Elevation IS NOT NULL """ active_wells = pd.read_sql_query(active_query, conn) wells_df["Has_Data"] = wells_df["P_Number"].isin(active_wells["P_Number"]) # Create grid for coverage analysis (5km x 5km cells) lat_bins = np.linspace(wells_df["Latitude"].min(), wells_df["Latitude"].max(), 20) lon_bins = np.linspace(wells_df["Longitude"].min(), wells_df["Longitude"].max(), 20) # Count wells per grid cell wells_df["lat_bin"] = pd.cut(wells_df["Latitude"], lat_bins) wells_df["lon_bin"] = pd.cut(wells_df["Longitude"], lon_bins) coverage_grid = ( wells_df.groupby(["lat_bin", "lon_bin"]) .size() .reset_index(name="Well_Count") ) coverage_grid["lat_center"] = coverage_grid["lat_bin"].apply(lambda x: x.mid) coverage_grid["lon_center"] = coverage_grid["lon_bin"].apply(lambda x: x.mid) # Create heatmap fig = go.Figure() # Add heatmap of well density fig.add_trace( go.Scatter( x=coverage_grid["lon_center"], y=coverage_grid["lat_center"], mode="markers", marker=dict( size=coverage_grid["Well_Count"] * 5, color=coverage_grid["Well_Count"], colorscale="RdYlBu_r", showscale=True, colorbar=dict(title="Wells per Grid Cell"), opacity=0.6, ), text=coverage_grid["Well_Count"], hovertemplate="Wells: %{text} Lat: %{y:.3f} Lon: %{x:.3f}<extra></extra>", name="Grid Coverage", ) ) # Overlay well points fig.add_trace( go.Scatter( x=wells_df["Longitude"], y=wells_df["Latitude"], mode="markers", marker=dict( size=4, color=[ "blue" if has_data else "gray" for has_data in wells_df["Has_Data"] ], opacity=0.8, line=dict(width=0.5, color="white"), ), text=[ "Active" if has_data else "Inactive" for has_data in wells_df["Has_Data"] ], hovertemplate="%{text} Lat: %{y:.3f} Lon: %{x:.3f}<extra></extra>", name="Wells", ) ) fig.update_layout( title="Groundwater Monitoring Network Coverage", xaxis_title="Longitude (°)", yaxis_title="Latitude (°)", height=500, showlegend=True, ) fig.show() # Print summary total_wells = len(wells_df) active_count = wells_df["Has_Data"].sum() print(f"\n**Well Coverage Summary:**") print(f"- Total wells: {total_wells}") print(f"- Active wells with data: {active_count} ({100*active_count/total_wells:.1f}%)") print( f"- Inactive/historical wells: {total_wells - active_count} " f"({100*(total_wells-active_count)/total_wells:.1f}%)" ) ``` ```{python} #| label: fig-gap-analysis #| fig-cap: "Monitoring gap analysis showing areas lacking active groundwater monitoring. Each point represents a 5km grid cell, with size indicating the severity of the monitoring gap." try: # Identify gaps - grid cells with no active wells active_coverage = ( wells_df[wells_df["Has_Data"]] .groupby(["lat_bin", "lon_bin"]) .size() .reset_index(name="Active_Count") ) active_coverage["lat_center"] = active_coverage["lat_bin"].apply(lambda x: x.mid) active_coverage["lon_center"] = active_coverage["lon_bin"].apply(lambda x: x.mid) # Merge to identify gaps in coverage_grid (cells with wells but no active wells) coverage_with_active = coverage_grid.merge( active_coverage[["lat_bin", "lon_bin", "Active_Count"]], on=["lat_bin", "lon_bin"], how="left", ) coverage_with_active["Active_Count"] = coverage_with_active["Active_Count"].fillna(0) # Create gap visualization fig = go.Figure() # Add gaps (cells with no active wells) gap_cells = coverage_with_active[coverage_with_active["Active_Count"] == 0] if len(gap_cells) > 0: fig.add_trace( go.Scatter( x=gap_cells["lon_center"], y=gap_cells["lat_center"], mode="markers", marker=dict(size=20, color="red", symbol="x", opacity=0.7), name="Monitoring Gaps", hovertemplate=( "Gap: %{text} wells exist but none active" " Lat: %{y:.3f} Lon: %{x:.3f}<extra></extra>" ), text=gap_cells["Well_Count"], ) ) # Add cells with active monitoring fig.add_trace( go.Scatter( x=active_coverage["lon_center"], y=active_coverage["lat_center"], mode="markers", marker=dict( size=active_coverage["Active_Count"] * 10, color="green", opacity=0.5, ), text=active_coverage["Active_Count"], hovertemplate=( "Active wells: %{text} Lat: %{y:.3f}" " Lon: %{x:.3f}<extra></extra>" ), name="Active Coverage", ) ) # Add all wells fig.add_trace( go.Scatter( x=wells_df["Longitude"], y=wells_df["Latitude"], mode="markers", marker=dict( size=3, color=[ "blue" if has_data else "lightgray" for has_data in wells_df["Has_Data"] ], opacity=0.6, ), name="Wells", showlegend=False, ) ) fig.update_layout( title="Monitoring Gap Analysis: Active vs Inactive Coverage", xaxis_title="Longitude (°)", yaxis_title="Latitude (°)", height=500, showlegend=True, ) fig.show() # Calculate gap metrics total_cells = len(coverage_with_active) cells_with_active = (coverage_with_active["Active_Count"] > 0).sum() gap_cells_count = (coverage_with_active["Active_Count"] == 0).sum() print(f"\n**Gap Analysis Metrics:**") print(f"- Total grid cells: {total_cells}") print( f"- Cells with active monitoring: {cells_with_active} " f"({100*cells_with_active/total_cells:.1f}%)" ) print( f"- Cells with monitoring gaps: {gap_cells_count} " f"({100*gap_cells_count/total_cells:.1f}%)" ) except Exception as e: print(f"Could not create gap analysis: {e}") ``` ::: {.callout-note icon=false} ## 📘 How to Read Gap Analysis Markers **What It Shows:** This map overlays monitoring gaps (red X markers) with active monitoring locations (green circles). It answers: "Where do we have wells but no active data collection?" **What to Look For:** - **Red X markers:** Grid cells with wells that are NOT actively monitored (gap) - **Green circles:** Grid cells with active monitoring wells (coverage) - **Size of green circles:** Proportional to number of active wells - **Gray dots in background:** Individual wells (blue = active, gray = inactive) **How to Interpret:** | Marker Type | What It Means | Why It Matters | Priority Action | |-------------|---------------|----------------|-----------------| | Large red X | Many inactive wells, zero active | Historical monitoring site abandoned | High priority—reactivate existing infrastructure cheaper than drilling new | | Red X near green circle | Gap adjacent to monitored area | Spatial coverage could be improved with minimal cost | Medium priority—extend existing monitoring network | | Isolated green circle | Single active well far from others | Critical data point, no redundancy | Protect investment—backup well recommended | | Cluster of green circles | Multiple active wells close together | Possible redundancy | Low priority—may reduce monitoring if budget constrained | | Red X in high-quality aquifer zone | Gap where water resources are best | Blind spot in most important area | Highest priority—recharge, pumping impacts unmonitored | ::: --- ## Multi-Source Data Integration ### Data Source Spatial Coverage **1. HTEM Geophysical Survey:** - Coverage: **Complete** across 2,400 km² study area - Resolution: 100 m grid - Gaps: None (continuous coverage) **2. Groundwater Monitoring:** - Total wells: 356 (spatially distributed) - **Active wells with data: 18** (5% of total) - Gaps: Large areas without active monitoring **3. Weather Stations:** - Active stations: 21 - Mean coverage radius: ~5 km - Gaps: 5% of area > 10 km from station **4. USGS Stream Gauges:** - Active gauges: 9 - Stream network coverage: Major tributaries - Gaps: Small streams ungauged --- ## Spatial Overlay Analysis ::: {.callout-note icon=false} ## Understanding Spatial Overlay Analysis **What Is It?** Spatial overlay analysis combines multiple map layers to find where features coincide or conflict. Think of it like stacking transparent maps—where patterns align or leave gaps becomes visible. **Why Does It Matter for Monitoring Gaps?** We have four independent monitoring networks (HTEM, wells, weather, streams). Spatial overlay reveals: - **Where coverage overlaps**: Ideal zones for multi-source data fusion - **Where gaps align**: "Blind spots" with no monitoring of any type - **Where priorities conflict**: High-quality aquifer but no monitoring **How Does It Work?** The method uses **buffer analysis** and **spatial intersection**: 1. **Create buffers** around each monitoring point (e.g., 5 km radius for wells) 2. **Union all buffers** to create "coverage zones" 3. **Subtract from study area** to find "gap zones" 4. **Overlay with priority layers** (high-quality aquifer from HTEM) 5. **Identify critical gaps**: High priority + no coverage **What Will You See?** The analysis produces **gap severity classes**: | Gap Type | Definition | Management Implication | |----------|------------|----------------------| | **No gap** | Monitored by 2+ networks | Well-characterized zone | | **Single gap** | Monitored by 1 network only | Adequate but limited | | **Double gap** | Missing 2 of 4 networks | Moderate risk | | **Triple gap** | Only 1 network present | High uncertainty | | **Complete gap** | No monitoring whatsoever | Blind spot - critical | **Priority weighting**: Gaps in high-quality aquifer zones are scored higher than gaps in bedrock or low-quality zones. ::: ### Method ```python # Define high-priority zones (high-quality aquifer) high_quality_aquifer = htem_2d[htem_2d['quality'] == 'High'] # Check for monitoring in these zones priority_zones_gdf = gpd.GeoDataFrame( high_quality_aquifer, geometry=gpd.points_from_xy(high_quality_aquifer['X'], high_quality_aquifer['Y']) ) # Buffer analysis: Find high-quality zones >5 km from any monitoring well_buffers = wells_active.buffer(5000) # 5 km station_buffers = weather_stations.buffer(5000) # Identify gaps gaps = priority_zones_gdf[ ~priority_zones_gdf.within(well_buffers.unary_union) & ~priority_zones_gdf.within(station_buffers.unary_union) ] ``` --- ## Critical Monitoring Gaps Identified ### Gap 1: High-Quality Aquifer Under-Monitored **Location:** NE-SW paleochannel corridors (42.7% of Unit D) **Problem:** - 81,288 high-quality HTEM cells identified - Only 9 active monitoring wells in these zones - **Coverage ratio: 1 well per 9,032 high-quality cells** **Impact:** - Cannot validate HTEM predictions for 99.99% of high-quality aquifer - Risk missing local heterogeneity (sand lenses, clay caps) - Insufficient data for hydraulic property calibration **Priority:** **HIGHEST** - These are the most productive and vulnerable zones --- ### Gap 2: Wells Without Weather Stations **Problem:** - 13 of 18 active wells are > 5 km from weather station - Cannot perform direct precipitation-recharge analysis - Spatial lag confounds temporal lag **Impact:** - Limits mechanistic understanding of recharge processes - Forces use of regional precipitation (smooths local variability) - Cannot validate HTEM recharge estimates at well locations **Priority:** **HIGH** - Limits process-level understanding --- ### Gap 3: Stream-Groundwater Gap **Problem:** (from stream proximity analysis) - 41 wells exist within 5 km of stream gauges - **ZERO of these wells have active monitoring** - Wells with data are 3-25 km from streams **Impact:** - Cannot study stream-groundwater interaction - Cannot validate two-aquifer hypothesis directly - Cannot identify gaining/losing reaches **Priority:** **MEDIUM** - Alternative methods exist (baseflow separation) --- ### Gap 4: Small Streams Ungauged **Problem:** - 9 USGS gauges on major tributaries - Hundreds of small streams (1st-2nd order) ungauged **Impact:** - Cannot close water balance at sub-basin scale - Miss local discharge zones - Cannot validate distributed recharge estimates **Priority:** **LOW** - Major tributaries adequate for regional assessment --- ## Quantified Gap Metrics ::: {.callout-note icon=false} ## Understanding Gap Severity Classification **What Is Gap Severity?** Gap severity quantifies the risk of making poor management decisions due to lack of monitoring data. It combines two factors: 1. **Spatial coverage deficit**: How far is the nearest monitoring point? 2. **Resource priority**: How important is this zone for water supply or vulnerability? **Why Does It Matter?** Not all monitoring gaps are equal. A gap in a high-quality aquifer zone near a wellfield is far more critical than a gap in bedrock far from any wells. Severity classification helps prioritize limited budgets. **How Does It Work?** Gap severity uses a **scoring system**: **Formula**: `Gap Score = Coverage Deficit × Priority Weight` **Coverage Deficit:** - 0 km: No deficit (well monitored) - 1-5 km: Minor deficit (score +1) - 5-10 km: Moderate deficit (score +2) - 10-20 km: Major deficit (score +3) - >20 km: Critical deficit (score +5) **Priority Weight:** - Bedrock/low-quality: 1× - Moderate aquifer: 2× - High-quality aquifer: 3× - Wellfield/recharge area: 5× **What Will You See?** Gap analysis produces a classification table: | Gap Severity | Gap Score | What It Means | Management Response | Investment Priority | |--------------|-----------|---------------|---------------------|---------------------| | **No gap** | 0 | Monitored by 2+ networks | Maintain current network | Low | | **Minor gap** | 1-5 | Single network coverage | Acceptable for now | Low | | **Moderate gap** | 6-15 | Missing 1-2 networks | Add monitoring if budget allows | Medium | | **Major gap** | 16-30 | Missing 2-3 networks in priority zone | High priority investment | High | | **Critical gap** | >30 | No monitoring in critical area | Immediate action needed | Highest | **How to Interpret Results:** - **3.5% critical gaps**: Small area but high consequence - focus here first - **35% moderate gaps**: Larger area, lower consequence - phased approach - **Coverage ratio 1:9,000**: One well per 9,000 high-quality cells is far too sparse ::: ### Spatial Coverage Gaps | Zone Type | Area (km²) | % of Study Area | Monitoring Adequacy | |-----------|-----------|-----------------|---------------------| | **High-Quality Aquifer** | 1,020 | 42.5% | ⚠️ Poor (9 active wells) | | **>5 km from GW well** | 850 | 35.4% | ❌ None | | **>5 km from weather** | 120 | 5.0% | ⚠️ Marginal | | **>5 km from stream** | 400 | 16.7% | ⚠️ Moderate | | **Triple Gap (no GW, weather, stream)** | 85 | 3.5% | ❌ Critical | **Triple Gap Zones:** 85 km² (3.5% of study area) have **NO monitoring** from groundwater, weather, or stream networks. These are "blind spots" in our understanding. --- ## Priority Investment Locations ### Tier 1: High-Quality Aquifer Priority **Locations:** - NE quadrant paleochannel (X=405,000-410,000, Y=4,455,000-4,460,000) - SW paleochannel extension (X=390,000-395,000, Y=4,440,000-4,445,000) **Proposed Investment:** - Install 3 monitoring wells in each zone (6 total) - Install 1 weather station in NE zone - Co-locate with stream transect if feasible **Cost:** ~$200K (wells) + $50K (station) = $250K **Benefit:** Eliminates triple gap, enables HTEM validation, improves recharge understanding --- ### Tier 2: Stream-Groundwater Priority **Locations:** - Copper Slough near existing well 268557 (currently 3.6 km apart) - Boneyard Creek in Champaign (urban gradient study) **Proposed Investment:** - Install nested piezometers at 3 distances (10m, 100m, 500m from stream) - 3 depths per nest (shallow 5m, mid 15m, deep 30m) - High-frequency logging (15-min intervals) **Cost:** ~$150K per transect = $300K total **Benefit:** Direct validation of two-aquifer hypothesis, gaining/losing reach identification --- ### Tier 3: Weather Station Priority **Locations:** - Western plateau (currently 12-15 km from nearest station) - Southern study area boundary **Proposed Investment:** - Install 2 weather stations in under-served areas **Cost:** ~$50K per station = $100K total **Benefit:** Improved precipitation spatial resolution, better ET estimates --- ## Cost-Benefit Analysis ### Total Investment: $650K **Benefits:** 1. **Eliminated triple gap** (85 km² → 0 km²) 2. **HTEM validation** (9 wells → 15 wells in high-quality zones) 3. **Stream interaction** (0 transects → 2 transects) 4. **Precipitation coverage** (95% within 10 km → 98% within 10 km) **Return on Investment:** - Enhanced aquifer characterization → Better well siting → $2-5M saved in drilling costs - Stream interaction data → Improved baseflow forecasts → Water supply reliability - Reduced uncertainty → More defensible management decisions **Payback Period:** 2-5 years through improved well success rates alone --- ::: {.callout-note icon=false} ## 💻 For Computer Scientists **Optimization Problem:** Given budget constraint B, select monitoring locations to maximize: - Coverage of high-priority zones - Reduction in kriging variance - Representativeness across spatial regimes **Formulation:** ``` Maximize: Σ(coverage_value_i × decision_i) Subject to: Σ(cost_i × decision_i) ≤ B Σ(decision_i) ≤ N_max decision_i ∈ {0, 1} ``` **Solution Methods:** - Integer programming (exact for small N) - Genetic algorithms (heuristic for large N) - Greedy selection (fast approximation) **Constraint:** Minimum spacing between wells (avoid redundancy from spatial correlation) ::: ::: {.callout-tip icon=false} ## 🌍 For Hydrologists **Monitoring Network Design Principles:** 1. **Prioritize high-value targets:** - High-quality aquifer zones (productivity + vulnerability) - Recharge areas (outcrop zones) - Discharge zones (stream interaction) 2. **Balance spatial coverage vs depth:** - Few deep wells (expensive, target-specific) - Many shallow wells (cheaper, spatial coverage) - Nested piezometers (vertical gradients) 3. **Co-locate when possible:** - GW well + weather station = direct recharge analysis - GW well + stream gauge = interaction study - Reduces spatial uncertainty, maximizes ROI 4. **Account for spatial correlation:** - Don't place wells closer than correlation range (8.5 km) - Variogram-based network optimization 5. **Long-term commitment:** - Monitoring network value increases with time - 10+ years needed for trend detection - Maintain consistent measurement protocols ::: --- ## Implementation Roadmap ### Phase 1: Implement Tier 1 Priorities - Design and permitting - Install 6 monitoring wells in high-quality aquifer gaps - Install 1 weather station in NE zone - Begin baseline monitoring ### Phase 2: Implement Tier 2 Priorities - Install 2 stream-groundwater transects - 18 piezometers total (2 sites × 3 distances × 3 depths) - Deploy high-frequency loggers ### Phase 3: Implement Tier 3 Priorities - Install 2 weather stations in under-served areas - Expand monitoring if Phase 1-2 successful ### Ongoing: Data Quality & Integration - Automated data transmission (telemetry) - Real-time QA/QC and alerts - Annual network performance evaluation - 5-year network optimization review --- ## Key Findings Summary **Coverage Gaps:** - **High-quality aquifer:** 99.99% lacks active monitoring - **Stream-GW interaction:** 0 wells suitable for direct correlation - **Triple gap zones:** 85 km² with no monitoring **Priority Investments:** - **Tier 1:** $250K for high-quality aquifer wells + weather - **Tier 2:** $300K for stream-GW transects - **Tier 3:** $100K for weather station infill **Expected Outcomes:** - Eliminate critical monitoring gaps - Enable HTEM validation and calibration - Support process-level understanding (recharge, stream interaction) - Improve management decision confidence --- ## Reflection Questions - If you could only fund one of the Tier 1, 2, or 3 investments, which would you choose and why, given the gaps identified in this chapter? - How would you explain the concept of a “triple gap zone” to a non-technical stakeholder, and why should they care about these blind spots? - Looking at the coverage and gap maps, where would you place a single new monitoring well to maximize the value for both groundwater management and model calibration? - How might the optimal network design change if your primary goal were drought early warning versus long-term trend detection? --- **Analysis Status:** ✅ Complete **Achievement:** First comprehensive multi-source gap analysis identifying priority investment locations --- ## Summary Multi-source gap analysis reveals **critical monitoring deficiencies** requiring targeted investment: ❌ **99.99% of high-quality aquifer lacks monitoring** - Major blind spot in best water resource ❌ **0 wells suitable for stream-GW correlation** - Cannot validate interaction processes ❌ **85 km² triple-gap zones** - No weather, well, or stream data ✅ **Priority investments identified** - Tier 1: $250K for aquifer wells + weather ✅ **Expected ROI clear** - HTEM validation, process understanding, decision confidence **Key Insight**: **$650K total investment** could eliminate critical gaps and transform data from "coverage" to "understanding." --- ## Related Chapters - [Data Quality Audit](../part-1-foundations/data-quality-audit.qmd) - Overall quality assessment - [Well Spatial Coverage](well-spatial-coverage.qmd) - Detailed well coverage analysis - [Stream Proximity Analysis](stream-proximity-analysis.qmd) - Stream-GW monitoring gaps - [Well Placement Optimizer](../part-5-operations/well-placement-optimizer.qmd) - Optimal new well locations