Goal: Select
| 1. Surrogate Model — Predicts values + uncertainty; must be differentiable | ![]() |
| 2. Acquisition Function — Scores candidates; guides placement | ![]() |
Pick
locations uniformly at random from candidates
In plain English:
"Close your eyes and point at the map
Greedily pick the location with highest uncertainty
In plain English:
"Where are we most uncertain? Put a sensor there."
Repeat
Pros: Fast
Cons: Doesn't maximize information gain with minimum sensors
Mutual Information: How much does knowing
reduce uncertainty about ?
Reading this: "Given existing sensors
For Gaussian outputs (GP/Neural Process):
Problem: Exact MI requires searching
Since exact MI is intractable, use greedy approximation: select one sensor at a time
Algorithm: For each of
Complexity:
| Evaluations | ||
|---|---|---|
| 1,000 | 50 | 50,000 |
| 20,000 | 100 | 2,000,000 |
For India (
): computationally infeasible!
Greedy MI gives best quality but doesn't scale
MaxVar scales but gives poor quality
Can we get both?
Scalable sensor placement via continuous optimization
Publication (AAAI 2026 — AI for Social Impact)
Scalable Air-Quality Sensor Placement via Gradient-Based Mutual Information Maximization
Zeel B Patel, Vinayak Rana, Nipun Batra
Key insight: Don't search discrete candidates — optimize coordinates directly!
| Greedy MI | GD-MI (Ours) | |
|---|---|---|
| Search space | Discrete ( |
Continuous (coordinates) |
| Optimization | Enumerate all | Gradient descent |
| Complexity | ||
| Scalability | ✓ Any |

Greedy MI: Search grid

GD-MI: Follow gradient
What we optimize:
Objective: Minimize average variance over target region
where
Algorithm: Initialize → Forward (get
Key: Model is frozen — only the
coordinates are optimized
GD-MI needs a model that provides:
| Requirement | Why? |
|---|---|
| Predictions | To impute values at proposed locations |
| Calibrated uncertainty | To compute information gain |
| Differentiable | To backpropagate through |
| Fast inference | To iterate quickly |
TNP-D (Transformer Neural Process) satisfies all!
| Model | RMSE ↓ | NLL ↓ |
|---|---|---|
| Gaussian Process | 5.16 | -0.19 |
| Convolutional GNP | 5.31 | -0.30 |
| Transformer NP | 4.90 | -0.44 |
Does GD-MI actually work?
Madhya Pradesh — a central Indian state (~size of Germany,
=308 candidates)
Results (k=9 sensors):
| Method | RMSE ↓ |
|---|---|
| Random | 7.2 |
| MaxVar | 6.3 |
| GD-MI | 5.8 |
| Greedy MI | 5.6 |
GD-MI within 0.2 RMSE of Greedy MI (gold standard)
Greedy MI infeasible (
) — GD-MI shines here
Key findings:
MaxVar (blue dots)
GD-MI (red dots)
MI objective guides placement for maximum coverage

Optimizes in continuous space
→ cannot enforce real-world deployment constraints
May select invalid / impractical locations
(e.g., inaccessible terrain)
Assumes uniform importance
→ ignores population / exposure differences
Non-convex optimization
→ solutions depend on initialization
Differentiable discrete selection under regional constraints
Publication (Under Review)
Large-Scale Air-Quality Sensor Placement via Joint Optimization under Regional Constraints
Vinayak Rana, Neerja Kasture, Anura Mantri, Nipun Batra

Sensors are allocated per state
Constraint:
exactly
GD-MI cannot enforce: "exactly
sensors in state "
Each sensor must choose one location from its state's candidate pool
Represent this as a probability distribution over candidates
argmax picks the highest probability location
But argmax is not differentiable — gradients cannot flow through it
Don't worry about the complexity — let's go term by term
— learnable logits, one row per sensor, one column per candidate location
— regional mask: sets out-of-state entries to → probability = 0
— Gumbel noise: constant in backward pass — this is the reparameterisation trick
— temperature: high → explore broadly, low → commit to one location

(a) Objective + temperature
(b) Location update count
No post-hoc projection needed — the relaxation self-stabilizes into a valid discrete solution
Does GSM actually work?

| Method | RMSE ↓ |
|---|---|
| Random | 6.50 |
| MaxVar | 6.41 |
| GSM (Ours) | 6.12 |
| Greedy MI | 5.74 |
GSM closes most of the gap to Greedy MI — at far lower cost

No state budget constraints — pure joint optimization
| Strategy | All-India RMSE ↓ |
|---|---|
| Existing network | 9.25 ± 0.00 |
| Random | 7.69 ± 0.15 |
| Greedy MaxVar | 7.46 ± 0.00 |
| GSM Indep (ours) | 7.42 ± 0.09 |
| GSM Joint (ours) | 7.27 ± 0.03 |
21.4% reduction in national prediction error over existing network
13/32 states where GSM Joint achieves lowest RMSE
Joint > Indep — cross-border gradients capture transboundary pollution structure

Uses a continuous relaxation
→ introduces approximation error
Sensitive to temperature (τ) scheduling
→ affects convergence stability
Non-convex optimization over logits
→ sensitive to initialization; multiple local optima
Equity-aware placement
→ weight by population, pollution, vulnerability
Dynamic sensing
→ mobile sensors, adaptive placement
Multi-pollutant optimization
→ PM₂.₅, NO₂, O₃ jointly
Improved discrete relaxations
→ Top-K, optimal transport (Sinkhorn)
• Scalable Air-Quality Sensor Placement via Gradient-Based Mutual Information Maximization (AAAI 2026 — AI for Social Impact)
Zeel B Patel, Vinayak Rana, Nipun Batra
• Large-Scale Air-Quality Sensor Placement via Joint Optimization under Regional Constraints (Under Review)
Vinayak Rana, Neerja Kasture, Anura Mantri, Nipun Batra
# The Coverage Gap <div class="cols"> <div class="col"> <img src="assets/images/india_urban_only_coverage.png" style="max-height:340px;" /> </div> <div class="col"> **Urban-only coverage** - Sensors in Delhi, Mumbai, Chennai, Bangalore, Kolkata - Rural India remains **invisible** **Hundreds of millions** unmonitored No data → No policy → No protection </div> </div> ---
# Global Comparison: Room for Growth | Country | Stations ↑ | People/Station ↓ | Stations/1000 km² ↑ | |---------|----------:|--------------:|-----------------:| | USA | 4,800 | 69K | 0.49 | | China | 5,000 | 280K | 0.52 | | Germany | 500 | 166K | 1.40 | | UK | 300 | 223K | 1.23 | | **India** | **611** | **2,290K** | **0.19** | India has **33× more people per station** than USA — significant opportunity for expansion ---
# From Prediction to Placement > Not all locations are equally informative <br> - Some regions are **well-understood** - Some regions are **uncertain** <br> > Place sensors where they reduce uncertainty the most ---
| | MaxVar | MI | |:--|:------:|:--:| | Optimizes for | Single point | <span class="blue">Entire target region</span> | | Variance | **at candidate** | **over target region** |
---

---
# Main Takeaways 1. **GD-MI** = First gradient-based MI maximization for sensor placement 2. **Scalability breakthrough:** $O(I)$ vs $O(n \cdot k)$ - Enables **continental-scale** optimization 3. **Quality preserved:** Matches Greedy MI where tractable, **4% better** than MaxVar at scale 4. **Real-world ready:** Deployed framework for India air quality monitoring
---
# Thank You! <div style="display:flex; justify-content:center; gap:3rem; margin:1.5rem 0;"> <div class="qr" style="text-align:center;"> <img src="assets/images/qr_paper.png" /> <div style="font-size:0.7em; margin-top:0.3rem;">Paper</div> </div> <div class="qr" style="text-align:center;"> <img src="assets/images/qr_lab.png" /> <div style="font-size:0.7em; margin-top:0.3rem;">Lab</div> </div> </div> **Sustainability Lab @ IIT Gandhinagar** Positions: PhD · Postdoc · RA · Intern {patel_zeel, vinayak.rana, nipun.batra}@iitgn.ac.in
---
_paginate: false # Backup Slides --- # Impact: Same Budget, Better Outcomes **The cascade effect:** Better placement (GD-MI) ↓ Better predictions (lower RMSE) ↓ Better pollution maps (policy-ready) ↓ Better health outcomes (targeted interventions) > **4% RMSE improvement** at national scale = **millions** of people better served --- # Dataset: WUSTL PM₂.₅ - **Source:** Washington University in St. Louis - **Resolution:** 0.1° × 0.1° (~11 km) - **Period:** 1998-2018 (21 years monthly) - **Split:** Train 98-08 | Val 09-10 | Test 11-18 --- # Full Model Benchmark | Model | NLL ↓ | RMSE ↓ | |-------|:-----:|:------:| | CNP | 0.48 | 11.46 | | Random Forest | -0.11 | 6.55 | | GP | -0.19 | 5.16 | | ConvCNP | -0.27 | 5.28 | | ConvGNP | -0.30 | 5.31 | | TabPFN | -0.37 | 5.09 | | **TNP-D** | **-0.44** | **4.90** | --- # Constraint: Keep Sensors on Land $$\mathcal{L}_{\text{OOR}} = \sum_i \exp\Big(\text{dist}(\color{#00a651}{x_i}, \text{land}) - \delta\Big) - 1$$ Soft penalty grows exponentially as <span class="green">sensors</span> drift toward ocean --- # Scalability Analysis  - GD-MI runtime **independent of pool size** - At 20K candidates: GD-MI is **100× faster** than Greedy MI - Enables continental-scale optimization --- # More Results: k = 50  --- # More Results: k = 100 
> Same Madhya Pradesh setup ($n=308$, $k=8$)