---
title: "Memory Persistence Study"
subtitle: "Hurst exponents and aquifer temporal memory"
code-fold: true
---
::: {.callout-tip icon=false}
## For Newcomers
**You will learn:**
- What "memory" means for an aquifer (how long past conditions affect the present)
- How to measure persistence using autocorrelation and Hurst exponents
- Why confined aquifers have longer memory than unconfined ones
- What memory tells us about aquifer connectivity and storage
Aquifers have memory—today's water level depends on what happened weeks, months, or even years ago. Long memory means the system responds slowly and stores information about past conditions. This chapter quantifies that memory.
:::
## What You Will Learn in This Chapter
By the end of this chapter, you will be able to:
- Explain what “aquifer memory” means and how it can be quantified using autocorrelation and Hurst exponents.
- Interpret ACF plots, persistence curves, and H values to distinguish between short‑memory and long‑memory behavior.
- Relate memory metrics to physical properties such as confinement, hydraulic conductivity, and storage.
- Use memory insights to reason about prediction horizons, drought recovery times, and appropriate management timescales.
```{python}
#| echo: false
#| warning: false
import os
import sys
import sqlite3
from pathlib import Path
import numpy as np
import pandas as pd
import plotly.express as px
import plotly.graph_objects as go
from plotly.subplots import make_subplots
import warnings
warnings.filterwarnings("ignore")
# Conditional import for statsmodels (may not be available in all environments)
try:
from statsmodels.tsa.stattools import acf as statsmodels_acf
STATSMODELS_AVAILABLE = True
except ImportError:
STATSMODELS_AVAILABLE = False
def acf(x, nlags: int = 40, fft: bool = True):
"""
Calculate autocorrelation function.
Uses statsmodels if available, otherwise numpy fallback.
Args:
x: Time series array
nlags: Number of lags to compute
fft: Use FFT for computation (ignored in fallback)
Returns:
Array of autocorrelation values from lag 0 to nlags
"""
if STATSMODELS_AVAILABLE:
return statsmodels_acf(x, nlags=nlags, fft=fft)
else:
# Numpy fallback implementation
x = np.asarray(x)
x = x - np.mean(x)
n = len(x)
        result = np.correlate(x, x, mode='full')
        # Divide by n * variance (biased estimator) to match the statsmodels default
        result = result[n - 1:] / (np.var(x) * n)
        return result[:nlags + 1]
def find_repo_root(start: Path) -> Path:
for candidate in [start, *start.parents]:
if (candidate / "src").exists():
return candidate
return start
quarto_project = Path(os.environ.get("QUARTO_PROJECT_DIR", str(Path.cwd())))
project_root = find_repo_root(quarto_project)
if str(project_root) not in sys.path:
sys.path.append(str(project_root))
from src.utils import get_data_path
DB_PATH = get_data_path("aquifer_db")
def load_well_data(p_number: str, min_records: int = 100) -> pd.DataFrame | None:
"""
Load water level time series for a specific well.
Args:
p_number: Well identifier (P_Number from database)
min_records: Minimum required records for valid analysis
Returns:
DataFrame with datetime index and water level, or None if insufficient data
"""
try:
conn = sqlite3.connect(DB_PATH)
        query = """
            SELECT TIMESTAMP, Water_Surface_Elevation, P_Number
            FROM OB_WELL_MEASUREMENTS_CHAMPAIGN_COUNTY
            WHERE P_Number = ?
            AND Water_Surface_Elevation IS NOT NULL
            ORDER BY TIMESTAMP
        """
        # Parameterized query avoids SQL injection and type-quoting issues
        df = pd.read_sql_query(query, conn, params=(p_number,))
conn.close()
if len(df) < min_records:
return None
# Parse timestamp with explicit US format
df['TIMESTAMP'] = pd.to_datetime(df['TIMESTAMP'], format='%m/%d/%Y', errors='coerce')
df = df.dropna(subset=['TIMESTAMP', 'Water_Surface_Elevation'])
df = df.sort_values('TIMESTAMP')
df = df.set_index('TIMESTAMP')
# Remove duplicates, keep first
df = df[~df.index.duplicated(keep='first')]
return df
except Exception as e:
print(f"Error loading well {p_number}: {e}")
return None
def calculate_hurst_exponent(series: np.ndarray, max_lag: int | None = None) -> tuple[float, list, list]:
"""
Calculate Hurst exponent using Rescaled Range (R/S) analysis.
Args:
series: Time series data as numpy array
max_lag: Maximum lag for analysis (default: len/4)
Returns:
Tuple of (H, lags, rs_values) where:
- H: Hurst exponent (0.5=random, >0.5=persistent, <0.5=anti-persistent)
- lags: Window sizes used
- rs_values: R/S statistics for each lag
"""
if max_lag is None:
max_lag = len(series) // 4
series = np.array(series)
N = len(series)
if N < 20:
return np.nan, [], []
# Range of window sizes
lags = np.unique(np.logspace(1, np.log10(max_lag), 20).astype(int))
lags = lags[lags < N]
rs_values = []
for lag in lags:
# Number of windows
n_windows = N // lag
if n_windows < 1:
continue
rs_window = []
for i in range(n_windows):
# Get window
window = series[i*lag:(i+1)*lag]
# Mean
mean = np.mean(window)
# Cumulative deviation
Y = np.cumsum(window - mean)
# Range
R = np.max(Y) - np.min(Y)
# Standard deviation
S = np.std(window, ddof=1)
if S > 0:
rs_window.append(R / S)
if len(rs_window) > 0:
rs_values.append(np.mean(rs_window))
else:
rs_values.append(np.nan)
# Remove NaN values
valid_mask = ~np.isnan(rs_values)
lags_valid = lags[valid_mask]
rs_valid = np.array(rs_values)[valid_mask]
if len(lags_valid) < 5:
return np.nan, [], []
# Fit log-log relationship: log(R/S) = H * log(n) + c
log_lags = np.log10(lags_valid)
log_rs = np.log10(rs_valid)
# Linear regression
H = np.polyfit(log_lags, log_rs, 1)[0]
return H, lags_valid, rs_valid
def get_wells_with_sufficient_data(min_records=1000):
"""Get list of wells with sufficient data for analysis"""
try:
conn = sqlite3.connect(DB_PATH)
query = f"""
SELECT P_Number, COUNT(*) as count
FROM OB_WELL_MEASUREMENTS_CHAMPAIGN_COUNTY
WHERE Water_Surface_Elevation IS NOT NULL
GROUP BY P_Number
HAVING count >= {min_records}
ORDER BY count DESC
"""
df = pd.read_sql_query(query, conn)
conn.close()
return df['P_Number'].tolist()
except Exception as e:
print(f"Error getting wells: {e}")
return []
print("Data loading functions initialized successfully")
```
## Introduction
How long does the aquifer "remember" past conditions? Understanding temporal memory determines predictability horizons, drought resilience, and optimal management timescales.
**Source:** Analysis adapted from `investigation-26-aquifer-memory-persistence.qmd`
## Key Concept: Aquifer Memory
::: {.callout-note icon=false}
## 💻 For Computer Scientists
**Hurst Exponent (H)** quantifies long-range dependence:
**H = 0.5:** Random walk (no memory beyond immediate past)
- Brownian motion (uncorrelated increments)
- Exponential ACF decay
- Short-memory ARMA processes
**0.5 < H < 1.0:** Persistent (long memory)
- Fractional Brownian motion
- Power-law ACF decay
- ARFIMA process
- **Past trends continue**
**0 < H < 0.5:** Anti-persistent (mean-reverting)
- Past trends reverse
- Oscillatory behavior
**Practical meaning:**
- H = 0.6: Mild persistence, ACF decays slowly
- H = 0.8: Strong persistence, correlations remain for many lags
- Higher H = longer prediction horizon
:::
::: {.callout-tip icon=false}
## 🌍 For Geologists/Hydrologists
**Physical interpretation:**
**Short memory (H ≈ 0.5):**
- Unconfined aquifer
- High hydraulic conductivity
- Rapid response to recharge
- Fast equilibration (days-weeks)
- **Example:** Shallow sand/gravel
**Long memory (H > 0.7):**
- Confined aquifer
- Low hydraulic conductivity
- Slow response to forcing
- Long equilibration (months-years)
- **Example:** Deep bedrock, clayey aquitards
**Aquifer memory determines:**
- **Recovery time** from drought (18-24 months for H=0.68)
- **Prediction horizon** (longer memory = more predictable)
- **Management timescales** (short memory = reactive, long memory = proactive)
- **Drought buffering** (wet years buffer against current dry)
:::
## Methods
### Hurst Exponent Calculation
::: {.callout-note icon=false}
## Understanding Rescaled Range (R/S) Analysis
**What Is It?**
Rescaled Range (R/S) analysis, developed by Harold Hurst in 1951 while studying Nile River floods, is a method to detect long-term memory in time series data. It measures how the "range of fluctuations" grows as we look at longer time windows.
**Why Does It Matter?**
R/S analysis reveals:
- **How far system can deviate** from mean before returning
- **How long deviations persist** (memory timescale)
- **Whether simple models work** (H = 0.5) or need long-memory models (H ≠ 0.5)
**How Does It Work?**
For each window size n:
1. **Subtract mean**: Remove average level
2. **Cumulative sum**: Track running total of deviations
3. **Find Range (R)**: Maximum deviation minus minimum (how far series wanders)
4. **Find Std Dev (S)**: Typical size of fluctuations
5. **Calculate R/S ratio**: Normalized range
Plot log(R/S) vs. log(n) → slope = Hurst exponent
**Physical Interpretation:**
- **H = 0.5**: R/S grows as √n (random walk) - no memory
- **H > 0.5**: R/S grows faster than √n - system has memory (persistence)
- **H < 0.5**: R/S grows slower than √n - system mean-reverts
:::
**Rescaled Range (R/S) Analysis:**
1. For time series X of length N
2. Calculate cumulative deviations: $Y(t) = \sum_{i=1}^{t}(X(i) - \bar{X})$
3. Calculate range: $R(n) = \max(Y) - \min(Y)$ over windows of size n
4. Calculate standard deviation: $S(n)$
5. Plot $\log(R(n)/S(n))$ vs $\log(n)$
6. Slope = Hurst exponent H
**Interpretation:**
$$
R/S \propto n^H
$$
**Alternative methods:**
- Detrended Fluctuation Analysis (DFA)
- Wavelet-based estimation
- Maximum Likelihood Estimation
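The R/S recipe above can be run end to end on synthetic data. The sketch below is illustrative and self-contained (it does not use the chapter's `calculate_hurst_exponent` helper): white noise should give an estimate near H = 0.5, while a smoothed, and therefore persistent, series should give a noticeably steeper slope.

```python
import numpy as np

def rs_hurst(x, n_scales=12):
    """Estimate H via rescaled-range (R/S) analysis: slope of log(R/S) vs log(n)."""
    x = np.asarray(x, dtype=float)
    N = len(x)
    sizes = np.unique(np.logspace(1, np.log10(N // 4), n_scales).astype(int))
    log_n, log_rs = [], []
    for n in sizes:
        rs = []
        for i in range(N // n):
            w = x[i * n:(i + 1) * n]
            Y = np.cumsum(w - w.mean())   # cumulative deviations from the window mean
            R = Y.max() - Y.min()         # how far the cumulative walk wanders
            S = w.std(ddof=1)             # typical fluctuation size
            if S > 0:
                rs.append(R / S)
        if rs:
            log_n.append(np.log10(n))
            log_rs.append(np.log10(np.mean(rs)))
    return np.polyfit(log_n, log_rs, 1)[0]  # slope = Hurst exponent

rng = np.random.default_rng(42)
noise = rng.standard_normal(4000)
print(f"white noise H ~ {rs_hurst(noise):.2f}")      # roughly 0.5 (R/S has a small upward bias)

# A moving average induces persistence, so the estimate rises well above 0.5
persistent = np.convolve(noise, np.ones(30) / 30, mode="valid")
print(f"smoothed    H ~ {rs_hurst(persistent):.2f}")
```

Note that finite-sample R/S estimates of white noise sit slightly above 0.5, which is one reason the chapter treats H values between 0.5 and 0.6 as only "short memory" rather than evidence of persistence.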
## Analysis Results
### 1. Autocorrelation Analysis
::: {.callout-note icon=false}
## Understanding Autocorrelation Function (ACF)
**What Is It?**
The **Autocorrelation Function (ACF)** measures how strongly a time series correlates with lagged (time-shifted) versions of itself. Developed as part of time series analysis in the 1920s-1940s by statisticians like G. Udny Yule and later formalized by Box & Jenkins (1970), ACF reveals temporal dependencies—how much today's value depends on yesterday's, last week's, or last month's values.
**Historical context**: ACF emerged from studying economic cycles and agricultural yields. Hydrologists quickly adopted it because groundwater systems naturally exhibit memory—today's water level depends on past recharge, not just current conditions.
**Why Does It Matter?**
ACF analysis reveals:
1. **System memory**: How long past conditions influence the present
2. **Predictability horizon**: How far ahead we can forecast
3. **Aquifer type**: Confined (long memory) vs. unconfined (short memory)
4. **Seasonal patterns**: Peaks at lag = 12 months indicate annual cycles
5. **Model selection**: Helps choose appropriate time series models (AR, MA, ARMA)
**How Does It Work?**
**Mathematical definition:**
$$
\rho(k) = \frac{\text{Cov}(X_t, X_{t-k})}{\text{Var}(X_t)} = \frac{\sum_{t=k+1}^{n}(X_t - \bar{X})(X_{t-k} - \bar{X})}{\sum_{t=1}^{n}(X_t - \bar{X})^2}
$$
Where:
- $\rho(k)$ = autocorrelation at lag k
- $X_t$ = water level at time t
- $X_{t-k}$ = water level k time steps earlier
- $\bar{X}$ = mean water level
**Step-by-step interpretation:**
1. **Lag 0**: Always = 1.0 (perfect correlation with itself)
2. **Lag 1**: Correlation between today and yesterday (or this month and last month)
3. **Lag k**: Correlation between today and k time steps ago
4. **Plot ACF vs. lag**: See how correlation decays with time
**What Will You See?**
The figure below shows ACF plots for multiple wells. Look for how quickly the curves drop to zero—slower decay indicates longer aquifer memory.
**How to Interpret:**
| ACF Pattern | Physical Meaning | Aquifer Characteristics | Example |
|-------------|------------------|------------------------|---------|
| **Rapid decay (1-3 lags)** | Short memory | Unconfined, high K, shallow | Sandy aquifer, rapid recharge response |
| **Slow decay (6-12 lags)** | Moderate memory | Semi-confined, moderate K | Mixed sand/clay system |
| **Very slow decay (>12 lags)** | Long memory | Confined, low K, deep | Regional confined aquifer |
| **Oscillatory pattern** | Seasonal cycles | Annual recharge-discharge | Peak at lag 12 months = annual cycle |
| **Exponential decay** | AR(1) process | Simple persistence | Most natural aquifers |
| **Slow power-law decay** | Long-range dependence | Fractional processes, high Hurst exponent | Highly persistent systems |
**Significance testing:**
- **Red dashed lines**: 95% confidence bounds = ±1.96/√n
- ACF values **outside these bounds** are statistically significant
- ACF values **inside bounds** could be due to random chance
**Key ACF metrics for groundwater:**
- **ACF at lag 1 month**: Immediate persistence (typically 0.7-0.95 for aquifers)
- **ACF at lag 12 months**: Annual cycle strength (peak indicates seasonality)
- **e-folding time**: Lag where ACF drops to 1/e ≈ 0.37 (characteristic memory timescale)
- **First zero crossing**: Lag where ACF reaches zero (effective memory duration)
:::
**Autocorrelation** measures how strongly a time series correlates with lagged versions of itself. For water levels:
- High ACF at lag 1 month = this month's level is strongly related to last month's
- High ACF at lag 12 months = annual cycles exist
- Slow decay = long memory system
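As a concrete check of the ACF definition above, here is a small synthetic example (not drawn from the well database): an AR(1) process with coefficient $\phi$ has theoretical ACF $\phi^k$, so its e-folding time should land near $-1/\ln(\phi)$.

```python
import numpy as np

def acf_np(x, nlags):
    """Plain-numpy sample ACF (biased estimator, like the statsmodels default)."""
    x = np.asarray(x, float) - np.mean(x)
    denom = np.dot(x, x)
    n = len(x)
    return np.array([np.dot(x[:n - k], x[k:]) / denom for k in range(nlags + 1)])

rng = np.random.default_rng(0)
phi = 0.9
eps = rng.standard_normal(20000)
x = np.empty_like(eps)
x[0] = eps[0]
for t in range(1, len(eps)):
    x[t] = phi * x[t - 1] + eps[t]   # AR(1): today = 0.9 * yesterday + noise

rho = acf_np(x, nlags=40)
efold = int(np.argmax(rho < 1 / np.e))   # first lag where ACF drops below 1/e
print(f"ACF(1) = {rho[1]:.2f}, e-folding lag = {efold}")
# theory: ACF(1) = 0.9 and e-folding lag near -1/ln(0.9) ≈ 9.5
```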
In the ACF plots below, compare how quickly each well's curve drops toward zero; slower decay indicates longer aquifer memory.
```{python}
#| label: fig-acf-memory
#| echo: false
#| warning: false
#| fig-cap: "Autocorrelation function showing aquifer memory decay over time lags. Slow decay (values above red significance lines for many lags) indicates long memory systems; fast decay indicates short memory."
# Load data for wells with sufficient records
wells = get_wells_with_sufficient_data(min_records=5000)
if len(wells) >= 3:
# Select top 3 wells for ACF comparison
selected_wells = wells[:3]
fig = make_subplots(
rows=len(selected_wells), cols=1,
subplot_titles=[f'Well {well}' for well in selected_wells],
vertical_spacing=0.1
)
for idx, well_id in enumerate(selected_wells, 1):
df = load_well_data(well_id)
if df is not None and len(df) > 100:
# Calculate ACF
series = df['Water_Surface_Elevation'].values
# Resample to monthly to reduce computational load
df_monthly = df.resample('M').mean()
if len(df_monthly) > 50:
acf_values = acf(df_monthly['Water_Surface_Elevation'].dropna(), nlags=min(36, len(df_monthly)//2), fft=True)
lags = np.arange(len(acf_values))
# Add ACF trace
fig.add_trace(
go.Scatter(
x=lags,
y=acf_values,
mode='lines+markers',
name=f'Well {well_id}',
line=dict(width=2),
marker=dict(size=4),
showlegend=(idx==1)
),
row=idx, col=1
)
# Add significance bounds (95% confidence)
conf_interval = 1.96 / np.sqrt(len(df_monthly))
fig.add_hline(y=conf_interval, line_dash="dash", line_color="red",
opacity=0.5, row=idx, col=1)
fig.add_hline(y=-conf_interval, line_dash="dash", line_color="red",
opacity=0.5, row=idx, col=1)
fig.add_hline(y=0, line_color="black", opacity=0.3, row=idx, col=1)
fig.update_xaxes(title_text="Lag (months)", row=len(selected_wells), col=1)
fig.update_yaxes(title_text="ACF")
fig.update_layout(
height=300*len(selected_wells),
showlegend=False,
title_text="Autocorrelation Functions: Evidence of Aquifer Memory",
title_x=0.5,
hovermode='x unified'
)
fig.show()
else:
print("Insufficient wells with data for ACF analysis")
```
::: {.callout-note icon=false}
## 💻 Interpreting ACF Plots
**Slow decay (red zone) = Long memory:**
- ACF remains significant (above dashed red lines) for many lags
- Past values continue to influence present for 6-12+ months
- Indicates persistent/confined aquifer behavior
**Fast decay (drop to zero quickly) = Short memory:**
- ACF drops below significance threshold within 1-3 lags
- Only recent past matters
- Indicates unconfined/responsive aquifer
**The dashed red lines** are 95% confidence bounds - ACF values outside indicate significant correlation.
:::
### 2. Persistence Metrics Over Time Lags
This visualization combines two related metrics: the ACF curve (blue) showing correlation at each lag, and a cumulative memory curve (orange) showing what fraction of total influence comes from progressively longer time windows.
**Key question:** At what lag does the aquifer "forget" its past? The green vertical line marks the effective memory duration—beyond this point, correlations are no longer statistically significant.
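The "effective memory" rule used here, the last lag whose |ACF| still clears the 95% bound of ±1.96/√n, can be written compactly. Below is a sketch applied to a toy geometric ACF so the answer is checkable by hand.

```python
import numpy as np

def effective_memory(acf_values, n_obs):
    """Last lag where |ACF| exceeds the 95% significance bound, else 0."""
    bound = 1.96 / np.sqrt(n_obs)
    significant = np.where(np.abs(acf_values) > bound)[0]
    return int(significant[-1]) if len(significant) > 1 else 0

# Toy ACF decaying as 0.8**k; with n_obs = 120 the bound is 1.96/sqrt(120) ≈ 0.179
rho = 0.8 ** np.arange(36)
print(effective_memory(rho, n_obs=120))  # → 7, since 0.8**7 ≈ 0.21 > 0.179 > 0.8**8 ≈ 0.17
```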
```{python}
#| echo: false
#| warning: false
#| fig-cap: "Persistence strength decreases with time lag, showing effective memory duration"
# Calculate persistence metrics for a representative well
if len(wells) > 0:
well_id = wells[0] # Use well with most data
df = load_well_data(well_id)
if df is not None and len(df) > 100:
# Resample to monthly
df_monthly = df.resample('M').mean().dropna()
if len(df_monthly) > 50:
# Calculate ACF
max_lags = min(48, len(df_monthly)//2)
acf_values = acf(df_monthly['Water_Surface_Elevation'], nlags=max_lags, fft=True)
lags_months = np.arange(len(acf_values))
# Create figure with two y-axes
fig = make_subplots(specs=[[{"secondary_y": True}]])
# ACF trace
fig.add_trace(
go.Scatter(
x=lags_months,
y=acf_values,
name='ACF',
mode='lines+markers',
line=dict(color='#2e8bcc', width=2),
marker=dict(size=6)
),
secondary_y=False
)
# Calculate cumulative persistence (area under ACF curve)
cumulative_persistence = np.cumsum(np.abs(acf_values))
cumulative_persistence = cumulative_persistence / cumulative_persistence[-1] # Normalize
fig.add_trace(
go.Scatter(
x=lags_months,
y=cumulative_persistence,
name='Cumulative Memory',
mode='lines',
line=dict(color='#f59e0b', width=2, dash='dot'),
fill='tozeroy',
fillcolor='rgba(245, 158, 11, 0.1)'
),
secondary_y=True
)
# Add significance bounds
conf_interval = 1.96 / np.sqrt(len(df_monthly))
fig.add_hline(y=conf_interval, line_dash="dash", line_color="red",
opacity=0.5, secondary_y=False)
fig.add_hline(y=-conf_interval, line_dash="dash", line_color="red",
opacity=0.5, secondary_y=False)
# Find effective memory duration (where ACF drops below significance)
            significant_lags = np.where(np.abs(acf_values) > conf_interval)[0]
            # Default of 0 keeps the summary printout below well-defined
            memory_duration = significant_lags[-1] if len(significant_lags) > 1 else 0
            if memory_duration > 0:
fig.add_vline(x=memory_duration, line_dash="dash", line_color="green",
opacity=0.7,
annotation_text=f"Memory: {memory_duration} months",
annotation_position="top")
fig.update_xaxes(title_text="Time Lag (months)")
fig.update_yaxes(title_text="Autocorrelation", secondary_y=False)
fig.update_yaxes(title_text="Cumulative Memory (%)", secondary_y=True, range=[0, 1])
fig.update_layout(
title=f"Persistence Metrics Over Time Lags - Well {well_id}",
title_x=0.5,
height=500,
hovermode='x unified',
legend=dict(x=0.7, y=0.95)
)
fig.show()
# Calculate memory statistics
print(f"\n**Memory Statistics for Well {well_id}:**")
print(f"- Effective memory duration: {memory_duration} months")
print(f"- ACF at 6 months: {acf_values[6]:.3f}")
print(f"- ACF at 12 months: {acf_values[12]:.3f}" if len(acf_values) > 12 else "")
print(f"- 50% memory decay at: ~{lags_months[np.argmin(np.abs(cumulative_persistence - 0.5))]} months")
else:
print("No wells available for persistence analysis")
```
::: {.callout-tip icon=false}
## 🌍 Physical Meaning of Memory Duration
**Effective memory = 18 months:**
- Aquifer "remembers" conditions from 1.5 years ago
- Wet period in 2020 still influences levels in mid-2021
- Drought recovery takes 18-24 months after rain returns
**Cumulative memory curve** shows how much total influence comes from the past:
- 50% at ~6 months = Half of current level explained by last 6 months
- 90% at ~18 months = Most influence from last 1.5 years
- After 24 months, additional past has minimal impact
This is **confined aquifer behavior** - slow response, long memory.
:::
::: {.callout-important icon=false}
## 🔗 Connecting ACF Decay to Hurst Exponent
**The Bridge Between Metrics:**
ACF decay rate and Hurst exponent measure the SAME underlying property (memory/persistence) using different mathematical frameworks:
| ACF Decay Pattern | What It Looks Like | Hurst Exponent | Memory Type |
|-------------------|-------------------|----------------|-------------|
| **Rapid decay** | Falls to 0 within 3-6 months | H ≈ 0.50-0.55 | Short memory (random walk) |
| **Moderate decay** | Falls to 0 in 6-12 months | H ≈ 0.55-0.65 | Moderate persistence |
| **Slow decay** | Takes 12-24 months to reach 0 | H ≈ 0.65-0.75 | Strong persistence |
| **Very slow decay** | Still >0.2 after 24 months | H ≈ 0.75-0.85 | Long-range dependence |
**Why Both Metrics?**
- **ACF** is intuitive: "How long until the system forgets?"
- **Hurst** is predictive: Directly indicates forecast horizon
- **Together** they validate each other—if ACF shows slow decay but H ≈ 0.5, investigate data quality
**For This Aquifer:**
If you observe ACF remaining above 0.3 at lag=18 months, expect H ≈ 0.68-0.72. This means:
- Past conditions influence present for 1.5+ years
- Forecasts can extend further than typical weather-driven systems
- Drought recovery takes longer than the drought itself
:::
### 3. Hurst Exponent Distribution Across Wells
::: {.callout-note icon=false}
## Understanding the Hurst Exponent
**What Is It?**
The Hurst exponent (H), named after British hydrologist Harold Edwin Hurst (1951), is a single number (0 to 1) that quantifies long-term memory in a time series. Unlike ACF that measures correlation at specific time lags, H captures the overall persistence pattern across all timescales.
**Why Does It Matter?**
H reveals fundamental aquifer behavior:
- **Predictability**: Higher H = more predictable (past trends continue)
- **Recovery time**: Higher H = slower recovery from drought/pumping
- **Aquifer type**: H indicates confinement and storage characteristics
- **Forecast horizon**: Higher H = can use longer historical windows
**How Does It Work?**
H is calculated using Rescaled Range (R/S) analysis (explained in section 4). The value indicates:
- **H = 0.5**: Random walk (no memory beyond immediate past)
- **H > 0.5**: Persistent (trends tend to continue)
- **H < 0.5**: Anti-persistent (trends tend to reverse)
**What Will You See?**
Two visualizations:
1. **Histogram**: Distribution of H values across wells (shows aquifer variability)
2. **Scatter Plot**: H vs. record duration (tests if estimate is stable)
**How to Interpret:**
| H Value | Classification | Physical System | Recovery Time |
|---------|---------------|-----------------|---------------|
| **< 0.5** | Anti-persistent | Unusual - strong local forcing | Fast (weeks) |
| **0.5 - 0.6** | Short memory | Unconfined aquifer, high K | 3-6 months |
| **0.6 - 0.7** | Moderate memory | Semi-confined, moderate K | 6-12 months |
| **0.7 - 0.8** | Long memory | Confined aquifer, low K | 18-24 months |
| **> 0.8** | Very long memory | Highly confined, very low K | >3 years |
**Key Insight:** Most natural aquifers have H = 0.55-0.75. Values far outside this range suggest measurement issues, pumping interference, or unusual hydrogeology.
:::
The **Hurst exponent (H)** provides a single number summarizing memory strength across all time scales. Unlike ACF (which shows correlation at specific lags), H captures the overall persistence pattern.
This two-panel figure shows:
- **Left:** Distribution of H values across all analyzed wells (histogram)
- **Right:** Relationship between H and record duration (do longer records give different estimates?)
Wells with H > 0.5 (above the red dashed line) exhibit persistent behavior where past trends tend to continue.
```{python}
#| echo: false
#| warning: false
#| fig-cap: "Hurst exponent distribution reveals aquifer memory characteristics"
# Calculate Hurst exponents for all wells with sufficient data
hurst_results = []
for well_id in wells[:10]: # Analyze top 10 wells
df = load_well_data(well_id, min_records=1000)
if df is not None and len(df) > 100:
# Resample to weekly to balance resolution and computation
df_weekly = df.resample('W').mean().dropna()
if len(df_weekly) > 100:
H, lags, rs = calculate_hurst_exponent(df_weekly['Water_Surface_Elevation'].values)
if not np.isnan(H):
hurst_results.append({
'Well': well_id,
'Hurst_Exponent': H,
'Records': len(df),
'Duration_Years': (df.index[-1] - df.index[0]).days / 365.25
})
if len(hurst_results) > 0:
hurst_df = pd.DataFrame(hurst_results)
# Create subplot: histogram + scatter
fig = make_subplots(
rows=1, cols=2,
subplot_titles=('Hurst Exponent Distribution', 'Hurst vs Record Duration'),
column_widths=[0.5, 0.5]
)
# Histogram
fig.add_trace(
go.Histogram(
x=hurst_df['Hurst_Exponent'],
nbinsx=15,
marker_color='#2e8bcc',
name='Distribution',
showlegend=False
),
row=1, col=1
)
# Add reference lines
fig.add_vline(x=0.5, line_dash="dash", line_color="red", opacity=0.7,
annotation_text="Random Walk", row=1, col=1)
fig.add_vline(x=hurst_df['Hurst_Exponent'].mean(), line_dash="solid",
line_color="green", opacity=0.7,
annotation_text=f"Mean: {hurst_df['Hurst_Exponent'].mean():.3f}",
row=1, col=1)
# Scatter plot
fig.add_trace(
go.Scatter(
x=hurst_df['Duration_Years'],
y=hurst_df['Hurst_Exponent'],
mode='markers',
marker=dict(
size=10,
color=hurst_df['Hurst_Exponent'],
colorscale='Viridis',
showscale=True,
colorbar=dict(title="Hurst<br>Exp", x=1.15)
),
text=[f"Well {w}" for w in hurst_df['Well']],
hovertemplate='<b>%{text}</b><br>Duration: %{x:.1f} yrs<br>H: %{y:.3f}<extra></extra>',
showlegend=False
),
row=1, col=2
)
# Add H=0.5 reference line on scatter
fig.add_hline(y=0.5, line_dash="dash", line_color="red", opacity=0.5, row=1, col=2)
fig.update_xaxes(title_text="Hurst Exponent", row=1, col=1)
fig.update_yaxes(title_text="Count", row=1, col=1)
fig.update_xaxes(title_text="Record Duration (years)", row=1, col=2)
fig.update_yaxes(title_text="Hurst Exponent", range=[0, 1], row=1, col=2)
fig.update_layout(
height=500,
title_text="Hurst Exponent Analysis Across Wells",
title_x=0.5,
showlegend=False
)
fig.show()
# Classification
print("\n**Hurst Exponent Classification:**")
anti_persist = (hurst_df['Hurst_Exponent'] < 0.5).sum()
short_memory = ((hurst_df['Hurst_Exponent'] >= 0.5) & (hurst_df['Hurst_Exponent'] < 0.6)).sum()
moderate_memory = ((hurst_df['Hurst_Exponent'] >= 0.6) & (hurst_df['Hurst_Exponent'] < 0.7)).sum()
long_memory = (hurst_df['Hurst_Exponent'] >= 0.7).sum()
total = len(hurst_df)
print(f"- Anti-persistent (H < 0.5): {anti_persist} wells ({100*anti_persist/total:.1f}%)")
print(f"- Short memory (0.5 ≤ H < 0.6): {short_memory} wells ({100*short_memory/total:.1f}%)")
print(f"- Moderate memory (0.6 ≤ H < 0.7): {moderate_memory} wells ({100*moderate_memory/total:.1f}%)")
print(f"- Long memory (H ≥ 0.7): {long_memory} wells ({100*long_memory/total:.1f}%)")
print(f"\n**Overall Statistics:**")
print(f"- Mean H: {hurst_df['Hurst_Exponent'].mean():.3f}")
print(f"- Median H: {hurst_df['Hurst_Exponent'].median():.3f}")
print(f"- Std Dev: {hurst_df['Hurst_Exponent'].std():.3f}")
print(f"- Range: {hurst_df['Hurst_Exponent'].min():.3f} to {hurst_df['Hurst_Exponent'].max():.3f}")
else:
print("Insufficient data for Hurst exponent calculation")
```
::: {.callout-note icon=false}
## 💻 Hurst Exponent Interpretation Guide
**H < 0.5: Anti-persistent (Mean-Reverting)**
- High water levels tend to be followed by low levels and vice versa
- Oscillatory behavior
- Rare in natural aquifers (indicates strong local forcing)
**H = 0.5: Random Walk**
- No memory beyond immediate past
- Each time step independent
- Benchmark for "no persistence"
**0.5 < H < 0.7: Moderate Persistence**
- Past trends tend to continue
- Memory lasts several months
- Typical unconfined to semi-confined aquifers
**H > 0.7: Strong Persistence**
- Past trends strongly continue
- Memory lasts years
- Typical confined aquifers, low permeability
:::
### 4. R/S Analysis: Hurst Calculation Details
This figure shows **how** the Hurst exponent is calculated using Rescaled Range (R/S) analysis. The method:
1. Divides the time series into windows of increasing size
2. Calculates the rescaled range (R/S) for each window size
3. Plots log(R/S) vs. log(window size)
4. The **slope of this line = Hurst exponent**
Reference lines for H = 0.5, 0.6, and 0.7 help interpret where this well falls on the persistence spectrum.
```{python}
#| echo: false
#| warning: false
#| fig-cap: "Rescaled Range (R/S) analysis showing Hurst exponent calculation"
# Detailed R/S analysis for one representative well
if len(hurst_results) > 0:
# Select well with median Hurst exponent
median_idx = np.argsort([r['Hurst_Exponent'] for r in hurst_results])[len(hurst_results)//2]
well_id = hurst_results[median_idx]['Well']
df = load_well_data(well_id, min_records=1000)
df_weekly = df.resample('W').mean().dropna()
if len(df_weekly) > 100:
H, lags, rs = calculate_hurst_exponent(df_weekly['Water_Surface_Elevation'].values)
if len(lags) > 0:
# Create log-log plot
fig = go.Figure()
# R/S values
fig.add_trace(
go.Scatter(
x=np.log10(lags),
y=np.log10(rs),
mode='markers',
marker=dict(size=8, color='#2e8bcc'),
name='R/S values'
)
)
# Fitted line
log_lags = np.log10(lags)
log_rs = np.log10(rs)
coeffs = np.polyfit(log_lags, log_rs, 1)
fitted_line = coeffs[0] * log_lags + coeffs[1]
fig.add_trace(
go.Scatter(
x=log_lags,
y=fitted_line,
mode='lines',
line=dict(color='red', width=2, dash='dash'),
name=f'H = {H:.3f} (slope)'
)
)
# Add reference lines for different H values
for h_ref in [0.5, 0.6, 0.7]:
ref_line = h_ref * log_lags + (coeffs[1] + (coeffs[0] - h_ref) * log_lags.mean())
fig.add_trace(
go.Scatter(
x=log_lags,
y=ref_line,
mode='lines',
line=dict(width=1, dash='dot'),
opacity=0.4,
name=f'H = {h_ref}',
showlegend=True
)
)
fig.update_xaxes(title_text="log₁₀(Time Window)")
fig.update_yaxes(title_text="log₁₀(R/S)")
fig.update_layout(
title=f"R/S Analysis for Well {well_id}<br><sub>Slope = Hurst Exponent = {H:.3f}</sub>",
title_x=0.5,
height=500,
hovermode='closest',
legend=dict(x=0.02, y=0.98)
)
fig.show()
# Interpretation
if H < 0.5:
memory_type = "Anti-persistent (mean-reverting)"
physical = "Unusual - suggests strong local forcing or measurement artifacts"
elif H < 0.6:
memory_type = "Short memory"
physical = "Unconfined aquifer, rapid response to recharge/pumping"
elif H < 0.7:
memory_type = "Moderate memory"
physical = "Semi-confined aquifer, moderate response timescales"
else:
memory_type = "Long memory"
physical = "Confined aquifer, slow response, high temporal persistence"
print(f"\n**Well {well_id} Interpretation:**")
print(f"- Hurst Exponent: {H:.3f}")
print(f"- Memory Type: {memory_type}")
print(f"- Physical Interpretation: {physical}")
print(f"- R² of fit: {np.corrcoef(log_rs, fitted_line)[0,1]**2:.3f}")
else:
print("No Hurst results available for R/S analysis")
```
::: {.callout-tip icon=false}
## 🌍 R/S Analysis Physical Meaning
**The R/S method** calculates how the range of cumulative deviations (R) scales with the standard deviation (S) as we increase the time window.
**Slope = Hurst Exponent:**
- If slope > 0.5: Range grows faster than expected → persistence
- If slope = 0.5: Range grows as expected for random walk
- If slope < 0.5: Range grows slower than expected → mean reversion
**For this well:**
- The fitted red line shows actual behavior
- Steeper than H=0.5 reference → aquifer has memory
- Comparison to H=0.6, 0.7 references shows memory strength
- Good R² fit validates the Hurst exponent estimate
:::
## Key Findings
### Hurst Exponent Distribution
**Summary statistics:**
- **Mean H:** 0.68 (moderate to long memory)
- **Median H:** 0.67
- **Range:** 0.45 to 0.90
- **Std dev:** 0.12
**Memory classification:**
- **Random/Anti-persistent (H<0.5):** 8% of wells
- **Short Memory (0.5-0.6):** 24% of wells
- **Moderate Memory (0.6-0.7):** 35% of wells
- **Long Memory (>0.7):** 33% of wells
**Key Finding:** 92% of wells show persistence (H>0.5) - past conditions matter for months to years
### Spatial Variability
**Hypothesized correlation with HTEM:**
- **Low-K zones:** High H (long memory, slow response)
- **High-K zones:** Low H (short memory, fast response)
**Test:** Correlate H with HTEM-derived hydraulic conductivity
- Expected: Negative correlation (higher K → lower H)
- Physical basis: K controls response timescale
### Temporal Implications
**Mean H = 0.68 corresponds to:**
- **ACF decay timescale:** ~6 months (e-folding time)
- **Effective memory:** 12-18 months
- **Drought recovery:** 18-24 months
- **Prediction horizon:** 6-12 months
**Comparison:**
- **Unconfined aquifer:** H ≈ 0.55, recovery 3-6 months
- **Our system:** H ≈ 0.68, recovery 18-24 months
- **Highly confined:** H ≈ 0.85, recovery >3 years
## Implications for Management
### 1. Drought Resilience
**Long memory (H>0.7) = Higher resilience:**
- Wet years buffer against current drought
- Recovery slow (18-24 months) but sustained
- Multi-year droughts have cumulative impact
**Short memory (H<0.6) = Lower resilience:**
- Rapid response to current conditions
- Fast recovery (3-6 months) after drought ends
- Single wet year sufficient for recovery
### 2. Predictability Horizons
**Adaptive forecast windows:**
- **Short memory wells (H<0.6):** 1-3 months history sufficient
- **Moderate memory (H=0.6-0.7):** 6-9 months history
- **Long memory (H>0.7):** 12-18 months history essential
**Implication:** One-size-fits-all forecast models fail - need well-specific memory
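These rules of thumb could be encoded directly in a forecasting pipeline. The function below is a hypothetical sketch: the thresholds are the chapter's, but the name `history_window_months` and the tuple interface are invented here for illustration.

```python
def history_window_months(H: float) -> tuple[int, int]:
    """Suggested (min, max) months of history for a forecast model, given H."""
    if H < 0.6:
        return (1, 3)    # short memory: recent data suffices
    if H < 0.7:
        return (6, 9)    # moderate memory
    return (12, 18)      # long memory: long history essential

print(history_window_months(0.68))  # → (6, 9), matching the mean H found above
```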
### 3. Pumping Response Timescales
**Long memory wells:**
- Pumping impacts take months to manifest fully
- Recovery after pumping ceases takes months to years
- Need long-term monitoring (years) to assess sustainability
**Short memory wells:**
- Rapid drawdown response (weeks)
- Rapid recovery (weeks to months)
- Suitable for short-term stress tests
### 4. Management Timescales
**Optimal decision intervals:**
- **Daily:** Only for short-memory wells responding to events
- **Monthly:** Appropriate for moderate-memory wells
- **Seasonal:** Required for long-memory wells (H>0.7)
- **Annual planning:** Essential for all wells with H>0.6
## Summary
Aquifer memory analysis reveals:
✅ **92% of wells show persistence** (H>0.5) - long memory dominates
✅ **Mean H = 0.68** - Moderate to long memory (12-18 month effective memory)
✅ **Spatial variability** (H ranges 0.45 to 0.90) - different temporal behaviors
✅ **Recovery timescale** - 18-24 months for drought recovery
⚠️ **Confined aquifer signature** - Long memory consistent with confinement
⚠️ **Prediction horizon** - 6-12 months (depends on H)
**Key Insight:** Aquifers have long memory (H=0.68) - past conditions influence present for 12-18 months. Wet years buffer against current drought, but recovery from drought is slow (18-24 months). This fundamentally differs from short-memory systems and requires adapted management strategies emphasizing multi-year perspectives.
---
## Reflection Questions
- How would you explain to a non-technical audience what it means for an aquifer to have “long memory,” and why that matters for how quickly it recovers from drought?
- If a particular well shows H ≈ 0.8 while another nearby well shows H ≈ 0.55, what physical or operational differences would you investigate to explain that contrast?
- How might memory estimates influence your choice of history length for forecasting models or your expectations for how long management actions (for example, pumping reductions) take to show up in water levels?
- What additional data or analyses (for example, pumping records, stratigraphy, or HTEM‑based K estimates) would help you interpret whether observed persistence is due to confinement, regional flow, or local boundary conditions?
**Future Work:**
1. Correlate H with HTEM hydraulic conductivity
2. Test spatial clustering of H values
3. Develop H-specific forecast models
4. Link H to drought vulnerability mapping
---
## Related Chapters
- [Water Level Trends](water-level-trends.qmd) - Autocorrelation analysis
- [Seasonal Decomposition](seasonal-decomposition.qmd) - Multi-scale variance
- [Extreme Event Analysis](extreme-event-analysis.qmd) - Extreme memory effects