This playbook is designed for everyone—from curious beginners to experienced professionals:
Complete newcomers with no water science or coding background
Students at any level exploring environmental data for the first time
Data practitioners comfortable with code but new to groundwater
Water professionals wanting to apply modern data analysis methods
Researchers conducting in-depth studies in environmental data science
This book does not require any programming knowledge. All code is provided and explained, but you can skip every code block and still fully understand the concepts through text, figures, and key takeaways. Think of the code as a recipe you can return to later if you want to try it yourself.
TipChoosing Your Pathway
Quick assessment: Answer these questions to find your best starting point:
Do you have groundwater/hydrogeology background? (Yes → Pathway 3; No → Continue)
Are you comfortable with Python and ML? (Yes → Pathway 2; No → Continue)
Do you need only high-level findings? (Yes → Pathway 1; No → Continue)
Are you doing research/thesis work? (Yes → Pathway 4; No → Pathway 0)
2.2 Learning Objectives
By the end of this chapter, you will be able to:
Select a learning pathway that matches your background, goals, and time commitment.
Map major chapters to your chosen pathway (understanding dependencies and recommended sequence).
Estimate time requirements for different engagement levels (skim vs. full implementation).
Identify support resources for terminology, data questions, and technical issues.
Estimated Time: 15-20 minutes (selecting pathway); varies by pathway chosen
Each pathway is presented both as a visual diagram and as a structured table/list to support different learning preferences and accessibility needs.
TipHow to Choose a Pathway
If you have no water background and may not run code, start with Pathway 0.
If you are comfortable with Python / ML but new to groundwater, follow Pathway 2 (Data Scientist).
If you are a groundwater or geophysics expert, use Pathway 3 (Hydrogeologist Track).
If you want only the big picture for decisions and ROI, use Pathway 1 (Quick).
If you plan to do research or a thesis based on this work, choose Pathway 4 (Full Deep).
2.3 Pathway 0: No-Water, No-Code On-Ramp
Target Audience: Curious readers, general science students, non-technical stakeholders
Prerequisites: None required
Time Commitment: 6-8 hours total (reading only)
Skill Level: Beginner (no coding or hydrology background needed)
Goal: Understand the big picture of the aquifer and what modern data analysis can reveal.
index.qmd – Read the Welcome, How to Use This Playbook, and Story of the Aquifer sections.
The flowchart and table above show a minimal, non-technical path through the book that still covers data foundations, fusion concepts, key findings, and economic value.
2.5 Pathway 2: Data Scientist Track
Target Audience: ML practitioners, data engineers, computational scientists, students interested in data science
Prerequisites:
Python programming (intermediate level)
Basic statistics and ML concepts
Familiarity with pandas, scikit-learn, or similar libraries
Time Commitment: 20-25 hours total (reading + running code)
Skill Level: Intermediate to Advanced (coding required)
Goal: Master environmental data engineering, time series ML, and multi-source fusion techniques.
Show code
flowchart TD A[Data Quality Audit] --> B[Well Network Analysis] B --> C[Temporal Fusion Engine] C --> D[Causal Discovery] D --> E[Material Classification ML] E --> F[Water Level Forecasting] F --> G[Explainable AI]
flowchart TD
A[Data Quality Audit] --> B[Well Network Analysis]
B --> C[Temporal Fusion Engine]
C --> D[Causal Discovery]
D --> E[Material Classification ML]
E --> F[Water Level Forecasting]
F --> G[Explainable AI]
The diagram and table together highlight how a data scientist can move from data quality checks through temporal fusion, causal discovery, supervised learning, forecasting, and interpretability.
2.6 Pathway 3: Hydrogeologist Track
Target Audience: Groundwater scientists, geophysicists, environmental engineers, students interested in earth sciences
Prerequisites:
Understanding of hydrogeology fundamentals (aquifers, recharge, flow systems)
Basic familiarity with geophysical methods (helpful but not required)
Python basics (helpful for code examples)
Time Commitment: 18-22 hours total (reading + selective code exploration)
Skill Level: Intermediate (domain expertise in water resources)
Goal: Apply quantitative data fusion methods to hydrogeological problems and decision-making.
Show code
flowchart TD A[HTEM Survey] --> B[3D Model] B --> C[Aquifer Material Map] C --> D[Water Balance] D --> E[Recharge Estimation] E --> F[Stream-Aquifer Exchange] F --> G[Well Placement]
flowchart TD
A[HTEM Survey] --> B[3D Model]
B --> C[Aquifer Material Map]
C --> D[Water Balance]
D --> E[Recharge Estimation]
E --> F[Stream-Aquifer Exchange]
F --> G[Well Placement]
This pathway is designed so that hydrogeologists first build a clear mental model of the subsurface structure, then connect that model to water balance, recharge, and stream-aquifer interactions before moving into well siting decisions.
Pathway 5 (Specific Topics) inherits prerequisites from the individual chapters you choose; check the “For Newcomers” or introduction sections of those chapters for details.
2.10 Time Estimates
Activity
Time
Read one chapter
20-40 min
Run code examples
10-20 min
Explore interactive plots
5-10 min
Modify and experiment
30-60 min
These time ranges are approximate and assume you skim figures at a comfortable pace and optionally run code in an environment that is already set up (see the main index.qmd chapter for environment and data configuration details).
---title: "Guided Learning Pathways"subtitle: "Choose Your Journey Through the Playbook"---## OverviewThis playbook is designed for **everyone**—from curious beginners to experienced professionals:- **Complete newcomers** with no water science or coding background- **Students** at any level exploring environmental data for the first time- **Data practitioners** comfortable with code but new to groundwater- **Water professionals** wanting to apply modern data analysis methods- **Researchers** conducting in-depth studies in environmental data scienceThis book **does not require any programming knowledge**. All code is provided and explained, but you can **skip every code block** and still fully understand the concepts through text, figures, and key takeaways. Think of the code as a recipe you can return to later if you want to try it yourself.::: {.callout-tip}## Choosing Your Pathway**Quick assessment:** Answer these questions to find your best starting point:1. Do you have groundwater/hydrogeology background? (Yes → Pathway 3; No → Continue)2. Are you comfortable with Python and ML? (Yes → Pathway 2; No → Continue)3. Do you need only high-level findings? (Yes → Pathway 1; No → Continue)4. Are you doing research/thesis work? (Yes → Pathway 4; No → Pathway 0):::## Learning ObjectivesBy the end of this chapter, you will be able to:1. **Select** a learning pathway that matches your background, goals, and time commitment.2. **Map** major chapters to your chosen pathway (understanding dependencies and recommended sequence).3. **Estimate** time requirements for different engagement levels (skim vs. full implementation).4. **Identify** support resources for terminology, data questions, and technical issues.**Estimated Time:** 15-20 minutes (selecting pathway); varies by pathway chosenEach pathway is presented both as a visual diagram and as a structured table/list to support different learning preferences and accessibility needs.::: {.callout-tip}## How to Choose a Pathway- If you have **no water background** and may not run code, start with **Pathway 0**.- If you are comfortable with Python / ML but new to groundwater, follow **Pathway 2 (Data Scientist)**.- If you are a groundwater or geophysics expert, use **Pathway 3 (Hydrogeologist Track)**.- If you want only the big picture for decisions and ROI, use **Pathway 1 (Quick)**.- If you plan to do research or a thesis based on this work, choose **Pathway 4 (Full Deep)**.:::## Pathway 0: No-Water, No-Code On-Ramp**Target Audience:** Curious readers, general science students, non-technical stakeholders**Prerequisites:** None required**Time Commitment:** 6-8 hours total (reading only)**Skill Level:** Beginner (no coding or hydrology background needed)**Goal:** Understand the big picture of the aquifer and what modern data analysis can reveal.1. `index.qmd` – Read the **Welcome**, **How to Use This Playbook**, and **Story of the Aquifer** sections.2. **Part 1 Foundations** - [Data Foundations Overview](parts/part-1-foundations/overview.qmd) - [HTEM Survey](parts/part-1-foundations/htem-survey-overview.qmd) - [Subsurface 3D Model](parts/part-1-foundations/subsurface-3d-model.qmd) - [Well Network](parts/part-1-foundations/well-network-analysis.qmd)3. **One spatial chapter** - [Aquifer Material Map](parts/part-2-spatial/aquifer-material-map.qmd) *or* - [Aquifer Vulnerability Map](parts/part-2-spatial/aquifer-vulnerability-map.qmd)4. **One temporal chapter** - [Water Level Trends](parts/part-3-temporal/water-level-trends.qmd)5. **One fusion and one operations chapter** - [Water Balance Closure](parts/part-4-fusion/water-balance-closure.qmd) - [Well Placement Optimizer](parts/part-5-operations/well-placement-optimizer.qmd)6. Use reference chapters as needed: - [Terminology Translation](parts/assets/terminology-translation.qmd) - [FAQ](parts/assets/faq.qmd)Reading tips:- When you see code blocks, you can **skim them** and focus on the narrative, figures, and “Key Takeaways” sections.- Use the **“For Newcomers”** callouts at the top of chapters to see what you will learn and which prerequisite chapters to read first.## Pathway 1: Quick Overview**Target Audience:** Executives, managers, decision-makers, busy professionals**Prerequisites:** None required**Time Commitment:** 2-3 hours total**Skill Level:** Beginner (high-level concepts only)**Goal:** Understand key findings, ROI, and decision implications without technical details.```{mermaid}flowchart LR A[Index] --> B[Part 1 Overview] B --> C[Part 4 Overview] C --> D[Synthesis Narrative] D --> E[Value of Information]```| Step | Chapter | Time | Key Takeaway ||------|---------|------|--------------|| 1 |[Welcome](index.qmd)| 10 min | What this project does || 2 |[Data Foundations](parts/part-1-foundations/overview.qmd)| 20 min | What data we have || 3 |[Data Fusion](parts/part-4-fusion/overview.qmd)| 20 min | Why fusion matters || 4 |[Synthesis](parts/part-5-operations/synthesis-narrative.qmd)| 40 min | Key findings || 5 |[ROI Analysis](parts/part-4-fusion/value-of-information.qmd)| 30 min | Economic value |The flowchart and table above show a **minimal, non-technical path** through the book that still covers data foundations, fusion concepts, key findings, and economic value.## Pathway 2: Data Scientist Track**Target Audience:** ML practitioners, data engineers, computational scientists, students interested in data science**Prerequisites:**- Python programming (intermediate level)- Basic statistics and ML concepts- Familiarity with pandas, scikit-learn, or similar libraries**Time Commitment:** 20-25 hours total (reading + running code)**Skill Level:** Intermediate to Advanced (coding required)**Goal:** Master environmental data engineering, time series ML, and multi-source fusion techniques.```{mermaid}flowchart TD A[Data Quality Audit] --> B[Well Network Analysis] B --> C[Temporal Fusion Engine] C --> D[Causal Discovery] D --> E[Material Classification ML] E --> F[Water Level Forecasting] F --> G[Explainable AI]```| Step | Chapter | Focus ||------|---------|-------|| 1 |[Data Quality](parts/part-1-foundations/data-quality-audit.qmd)| Data validation patterns || 2 |[Well Network](parts/part-1-foundations/well-network-analysis.qmd)| Time series data || 3 |[Temporal Fusion](parts/part-4-fusion/temporal-fusion-engine.qmd)| Multi-source ML || 4 |[Causal Discovery](parts/part-4-fusion/causal-discovery-network.qmd)| Granger causality || 5 |[ML Classification](parts/part-5-operations/material-classification-ml.qmd)| Supervised learning || 6 |[Forecasting](parts/part-5-operations/water-level-forecasting.qmd)| LSTM deep learning || 7 |[XAI](parts/part-5-operations/explainable-ai-insights.qmd)| SHAP interpretation |**Key Skills Developed:**- Handling messy environmental data- Time series with autocorrelation- Spatial cross-validation- Multi-source feature engineeringThe diagram and table together highlight how a data scientist can move from data quality checks through temporal fusion, causal discovery, supervised learning, forecasting, and interpretability.## Pathway 3: Hydrogeologist Track**Target Audience:** Groundwater scientists, geophysicists, environmental engineers, students interested in earth sciences**Prerequisites:**- Understanding of hydrogeology fundamentals (aquifers, recharge, flow systems)- Basic familiarity with geophysical methods (helpful but not required)- Python basics (helpful for code examples)**Time Commitment:** 18-22 hours total (reading + selective code exploration)**Skill Level:** Intermediate (domain expertise in water resources)**Goal:** Apply quantitative data fusion methods to hydrogeological problems and decision-making.```{mermaid}flowchart TD A[HTEM Survey] --> B[3D Model] B --> C[Aquifer Material Map] C --> D[Water Balance] D --> E[Recharge Estimation] E --> F[Stream-Aquifer Exchange] F --> G[Well Placement]```| Step | Chapter | Focus ||------|---------|-------|| 1 |[HTEM Survey](parts/part-1-foundations/htem-survey-overview.qmd)| Geophysical methods || 2 |[3D Model](parts/part-1-foundations/subsurface-3d-model.qmd)| Subsurface structure || 3 |[Material Map](parts/part-2-spatial/aquifer-material-map.qmd)| Aquifer delineation || 4 |[Water Balance](parts/part-4-fusion/water-balance-closure.qmd)| Water budget closure (P - ET - Q = dS) || 5 |[Recharge](parts/part-4-fusion/recharge-rate-estimation.qmd)| Recharge estimation || 6 |[SW-GW](parts/part-4-fusion/stream-aquifer-exchange.qmd)| Stream–aquifer (surface water–groundwater) interaction || 7 |[Well Siting](parts/part-5-operations/well-placement-optimizer.qmd)| Optimal locations |**Key Skills Developed:**- HTEM data interpretation- Aquifer characterization- Water balance closure- Hydrogeological decision makingThis pathway is designed so that hydrogeologists first build a clear mental model of the subsurface structure, then connect that model to water balance, recharge, and stream-aquifer interactions before moving into well siting decisions.## Pathway 4: Full Deep Dive**Target Audience:** Researchers, advanced students, anyone planning in-depth studies or publications**Prerequisites:**- Strong Python programming skills- Statistics and ML knowledge (regression, time series, causal inference)- Hydrogeology or environmental science background (or commitment to learn)- Research methodology experience**Time Commitment:** 30-40 hours total (complete reading + all code execution + experimentation)**Skill Level:** Advanced (graduate research level)**Goal:** Master all techniques, reproduce all analyses, and extend methods to new research questions.**Complete Sequential Reading:**1. **Part 1: Data Foundations** (4-5 hrs): Understand each data source in depth2. **Part 2: Spatial Patterns** (4-5 hrs): Spatial analysis methods and geostatistics3. **Part 3: Temporal Dynamics** (6-7 hrs): Time series analysis techniques4. **Part 4: Data Fusion Insights** (10-12 hrs): Multi-source fusion approaches5. **Part 5: Predictive Operations** (6-8 hrs): ML applications and decision systems6. **Reference Library** (2-3 hrs): Terminology, data dictionary, and FAQ**Research Opportunities:**- Extend methods to new study areas or aquifer systems- Integrate additional data sources (soil moisture, isotopes, geochemistry)- Develop novel ML models or fusion algorithms- Investigate specific hydrogeological questions in depth- Publish case studies or methodological advances## Pathway 5: Specific TopicsFor readers interested in specific analytical techniques or research questions, you can follow these focused mini-pathways:### Water Balance Focus**Time:** 6-8 hours | **Level:** Intermediate1. [Weather Data](parts/part-1-foundations/weather-station-data.qmd) (1.5 hrs)2. [Stream Gauges](parts/part-1-foundations/stream-gauge-network.qmd) (1.5 hrs)3. [Well Network](parts/part-1-foundations/well-network-analysis.qmd) (2 hrs)4. [Water Balance](parts/part-4-fusion/water-balance-closure.qmd) (2 hrs)### Aquifer Characterization Focus**Time:** 6-7 hours | **Level:** Intermediate1. [HTEM Survey](parts/part-1-foundations/htem-survey-overview.qmd) (2 hrs)2. [3D Model](parts/part-1-foundations/subsurface-3d-model.qmd) (1.5 hrs)3. [Material Map](parts/part-2-spatial/aquifer-material-map.qmd) (1.5 hrs)4. [Resistivity](parts/part-2-spatial/resistivity-distribution.qmd) (1.5 hrs)### Time Series Analysis Focus**Time:** 8-10 hours | **Level:** Intermediate to Advanced1. [Water Level Trends](parts/part-3-temporal/water-level-trends.qmd) (2 hrs)2. [Seasonal Decomposition](parts/part-3-temporal/seasonal-decomposition.qmd) (2 hrs)3. [Wavelet Analysis](parts/part-3-temporal/wavelet-spectral-analysis.qmd) (2.5 hrs)4. [Memory Persistence](parts/part-3-temporal/memory-persistence-study.qmd) (2.5 hrs)### Machine Learning Focus**Time:** 10-12 hours | **Level:** Advanced1. [Material Classification](parts/part-5-operations/material-classification-ml.qmd) (3 hrs)2. [Forecasting](parts/part-5-operations/water-level-forecasting.qmd) (3 hrs)3. [Anomaly Detection](parts/part-5-operations/anomaly-early-warning.qmd) (2.5 hrs)4. [Explainable AI](parts/part-5-operations/explainable-ai-insights.qmd) (2.5 hrs)## Prerequisites by Pathway| Pathway | Python | Statistics | Hydrogeology | ML ||---------|--------|------------|--------------|-----|| Pathway 0 – No-Water, No-Code | Not required | Not required | Not required | Not required || Pathway 1 – Quick | Not required | Not required | Not required | Not required || Pathway 2 – Data Scientist | Required | Basic | Helpful | Required || Pathway 3 – Hydrogeologist | Basic | Basic | Required | Helpful || Pathway 4 – Full Deep Dive | Required | Required | Required | Required |Pathway 5 (Specific Topics) inherits prerequisites from the individual chapters you choose; check the “For Newcomers” or introduction sections of those chapters for details.## Time Estimates| Activity | Time ||----------|------|| Read one chapter | 20-40 min || Run code examples | 10-20 min || Explore interactive plots | 5-10 min || Modify and experiment | 30-60 min |These time ranges are approximate and assume you skim figures at a comfortable pace and optionally run code in an environment that is already set up (see the main `index.qmd` chapter for environment and data configuration details).## Getting Help- **Technical Issues**: [GitHub Issues](https://github.com/ngcharithperera/aquifer-data/issues)- **Terminology**: [Translation Guide](parts/assets/terminology-translation.qmd)- **Data Questions**: [Data Dictionary](parts/assets/data-dictionary.qmd)- **Common Problems**: [FAQ](parts/assets/faq.qmd)## Pathway Comparison at a Glance| Feature | Pathway 0 | Pathway 1 | Pathway 2 | Pathway 3 | Pathway 4 ||---------|-----------|-----------|-----------|-----------|-----------|| **Audience** | Newcomers | Executives | Data Scientists | Hydrogeologists | Researchers || **Time** | 6-8 hrs | 2-3 hrs | 20-25 hrs | 18-22 hrs | 30-40 hrs || **Coding** | None | None | Required | Optional | Required || **Hydro Knowledge** | None | None | Helpful | Required | Required || **ML Skills** | None | None | Required | Helpful | Required || **Chapters** | 12 selected | 5 core | 7 focused | 7 focused | All 54 || **Outcomes** | Understanding | Key findings | ML mastery | Applied methods | Research ready |## Study Tips for Success**For All Pathways:**1. **Start with prerequisites**: Check the [Terminology Translation](parts/assets/terminology-translation.qmd) chapter when you encounter unfamiliar terms.2. **Use the FAQ**: Consult the [FAQ](parts/assets/faq.qmd) for common technical and conceptual questions.3. **Take breaks**: Environmental data science is complex—take time to absorb concepts between chapters.4. **Experiment**: If running code, try modifying parameters to see how results change.**For Coding Pathways (2, 4):**- Set up your environment first (see [index.qmd](index.qmd) for installation instructions)- Test data loading before diving into analysis chapters- Use the [Data Dictionary](parts/assets/data-dictionary.qmd) to understand data structures- Start with smaller data samples when experimenting**For Non-Coding Pathways (0, 1):**- Focus on figures, key takeaways, and narrative explanations- Code blocks can be safely skipped—insights are in the text- Interactive figures are accessible via browser—no installation needed- Use reflection questions at chapter ends to check understanding## Customizing Your PathwayFeel free to **mix and match** chapters from different pathways based on your interests:- **Add depth**: If following Pathway 1, add specific chapters from Parts 2-4 that interest you- **Skip sections**: If following Pathway 4, skip chapters outside your research focus- **Branch out**: Start with your primary pathway, then explore related topics from Pathway 5**Example Custom Path for Research Projects:**1. Start with Pathway 0 foundations (if new to hydrology)2. Move to Pathway 2 or 3 based on your background3. Deep dive into Pathway 5 chapters relevant to your project4. Reference Pathway 4 research opportunities for future work## Reflection Questions- Which pathway best matches your current background and time constraints?- What are your specific learning goals? (Understanding, application, research, decision-making?)- Do you need to review prerequisites before starting? (Python, statistics, hydrogeology?)- Which chapters outside your main pathway might complement your learning?- After reading this page, do you feel clear about where to start?