#!/usr/bin/env python
# coding: utf-8
# # Tutorial: Reduced Order Model with Graph Neural Networks
#
# [Open in Colab](https://colab.research.google.com/github/mathLab/PINA/blob/master/tutorials/tutorial22/tutorial.ipynb)
#
#
# > ##### ⚠️ ***Before starting:***
# > We assume you are already familiar with the concepts covered in the [Data Structure for SciML](https://mathlab.github.io/PINA/tutorial19/tutorial.html) tutorial. If not, we strongly recommend reviewing them before exploring this advanced topic.
#
# In this tutorial, we will demonstrate a typical use case of **PINA** for Reduced Order Modelling using Graph Convolutional Neural Networks. The tutorial is largely inspired by the paper [A graph convolutional autoencoder approach to model order reduction for parametrized PDEs](https://www.sciencedirect.com/science/article/pii/S0021999124000111).
#
# Let's start by importing the useful modules:
# In[1]:
## routine needed to run the notebook on Google Colab
try:
    import google.colab

    IN_COLAB = True
except ImportError:
    IN_COLAB = False

if IN_COLAB:
    get_ipython().system('pip install "pina-mathlab[tutorial]"')
    get_ipython().system('wget "https://github.com/mathLab/PINA/raw/refs/heads/master/tutorials/tutorial22/holed_poisson.pt" -O "holed_poisson.pt"')
import torch
from torch import nn
from torch_geometric.nn import GMMConv
from torch_geometric.data import (
    Data,
    Batch,
)  # alternatively, from pina.graph import Graph, LabelBatch
from torch_geometric.utils import to_dense_batch
import matplotlib.pyplot as plt
import warnings
warnings.filterwarnings("ignore")
from pina import Trainer
from pina.model import FeedForward
from pina.optim import TorchOptimizer
from pina.solver import ReducedOrderModelSolver
from pina.problem.zoo import SupervisedProblem
# ## Data Generation
#
# In this tutorial, we will focus on solving the parametric **Poisson** equation, a linear PDE. The equation is given by:
#
# $$
# \begin{cases}
# -\frac{1}{10}\Delta u = 1, &\Omega(\boldsymbol{\mu}),\\
# u = 0, &\partial \Omega(\boldsymbol{\mu}).
# \end{cases}
# $$
#
# In this equation, $\Omega(\boldsymbol{\mu}) = [0, 1]\times[0,1] \setminus [\mu_1, \mu_1+0.3]\times[\mu_2, \mu_2+0.3]$ represents the spatial domain, characterized by a parametrized hole defined via $\boldsymbol{\mu} = (\mu_1, \mu_2) \in \mathbb{P} = [0.1, 0.6]\times[0.1, 0.6]$. The geometrical parameters thus define the bottom-left corner of a square obstacle of side $0.3$. The problem is coupled with homogeneous Dirichlet conditions on both the internal and external boundaries. In this setting, $u(\mathbf{x}, \boldsymbol{\mu})\in \mathbb{R}$ is the value of the function $u$ at each point in space for a specific parameter $\boldsymbol{\mu}$.
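# To make the geometry concrete, here is a minimal sketch (not part of the original tutorial) of an indicator function for $\Omega(\boldsymbol{\mu})$: a point belongs to the domain if it lies in the unit square but outside the square obstacle whose bottom-left corner is $(\mu_1, \mu_2)$. The function name `in_domain` is hypothetical.

```python
def in_domain(px, py, mu1, mu2, side=0.3):
    # point inside the unit square...
    inside_square = 0.0 <= px <= 1.0 and 0.0 <= py <= 1.0
    # ...but outside the square obstacle with bottom-left corner (mu1, mu2)
    inside_hole = mu1 <= px <= mu1 + side and mu2 <= py <= mu2 + side
    return inside_square and not inside_hole
```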
#
# We have already generated data for different parameters. The dataset is obtained via the $\mathbb{P}^1$ finite element method, with equispaced sampling of 11 points in each direction of the parametric space (121 parameter instances in total).
#
# The goal is to build a Reduced Order Model that, given a new parameter $\boldsymbol{\mu}^*$, can predict the solution $u$ *for any discretization* $\mathbf{x}$. To this end, we will train a Graph Convolutional Autoencoder Reduced Order Model (GCA-ROM), as presented in [A graph convolutional autoencoder approach to model order reduction for parametrized PDEs](https://www.sciencedirect.com/science/article/pii/S0021999124000111). We will cover the architecture details later, but for now, let's start by importing the data.
#
# **Note:**
# The numerical integration is obtained using a finite element method with the [RBniCS library](https://www.rbnicsproject.org/).
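# As a rough sketch (assuming the equispaced 11-point sampling described above; variable names are illustrative), the parameter grid could be generated as follows:

```python
import torch

# 11 equispaced points per direction over [0.1, 0.6]
mu_1d = torch.linspace(0.1, 0.6, 11)
# Cartesian product -> 121 parameter pairs (mu_1, mu_2)
mu_grid = torch.cartesian_prod(mu_1d, mu_1d)  # shape [121, 2]
```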
# In[2]:
# === load the data ===
# x, y -> spatial discretization
# edge_index, triang -> connectivity matrix, triangulation
# u, params -> solution field, parameters
data = torch.load("holed_poisson.pt")
x = data["x"]
y = data["y"]
edge_index = data["edge_index"]
u = data["u"]
triang = data["triang"]
params = data["mu"]
# simple plot
plt.figure(figsize=(4, 4))
plt.tricontourf(x[:, 10], y[:, 10], triang, u[:, 10], 100, cmap="jet")
plt.scatter(params[10, 0], params[10, 1], c="r", marker="x", s=100)
plt.tight_layout()
plt.show()
# ## Graph-Based Reduced Order Modeling
#
# In this problem, the geometry of the spatial domain is **unstructured**, meaning that classical grid-based methods (e.g., CNNs) are not well suited. Instead, we represent the mesh as a **graph**, where nodes correspond to spatial degrees of freedom and edges represent connectivity. This makes **Graph Neural Networks (GNNs)**, and in particular **Graph Convolutional Networks (GCNs)**, a natural choice to process the data.
#
# <p align="center">
# <img src="http://raw.githubusercontent.com/mathLab/PINA/master/tutorials/static/gca_off_on_3_pina.png" alt="GCA-ROM" width="800"/>
# </p>
#
# To reduce computational complexity while preserving accuracy, we employ a **Reduced Order Modeling (ROM)** strategy (see picture above). The idea is to map high-dimensional simulation data $u(\mathbf{x}, \boldsymbol{\mu})$ to a compact **latent space** using a **graph convolutional encoder**, and then reconstruct it back via a **decoder** (offline phase). The latent representation captures the essential features of the solution manifold. Moreover, we can learn a **parametric map** $\mathcal{M}$ from the parameter space $\boldsymbol{\mu}$ directly into the latent space, enabling predictions for new unseen parameters.
#
# Formally, the autoencoder consists of an **encoder** $\mathcal{E}$, a **decoder** $\mathcal{D}$, and a **parametric mapping** $\mathcal{M}$:
# $$
# z = \mathcal{E}(u(\mathbf{x}, \boldsymbol{\mu})),
# \quad
# \hat{u}(\mathbf{x}, \boldsymbol{\mu}) = \mathcal{D}(z),
# \quad
# \hat{z} = \mathcal{M}(\boldsymbol{\mu}),
# $$
# where $z \in \mathbb{R}^r$ is the latent representation, with $r \ll N_h$ ($N_h$ being the number of spatial degrees of freedom), and the **hat notation** ($\hat{u}, \hat{z}$) indicates *learned or approximated quantities*.
#
# The training objective balances two terms:
# 1. **Reconstruction loss**: ensuring the autoencoder can faithfully reconstruct $u$ from $z$.
# 2. **Latent consistency loss**: enforcing that the parametric map $\mathcal{M}(\boldsymbol{\mu})$ approximates the encoder’s latent space.
#
# The combined loss function, averaged over the $N$ training snapshots, is:
# $$
# \mathcal{L}(\theta) = \frac{1}{N} \sum_{i=1}^N
# \big\| u(\mathbf{x}, \boldsymbol{\mu}_i) -
# \mathcal{D}\!\big(\mathcal{E}(u(\mathbf{x}, \boldsymbol{\mu}_i))\big)
# \big\|_2^2
# \;+\; \frac{1}{N} \sum_{i=1}^N
# \big\| \mathcal{E}(u(\mathbf{x}, \boldsymbol{\mu}_i)) - \mathcal{M}(\boldsymbol{\mu}_i) \big\|_2^2.
# $$
# This framework leverages the expressive power of GNNs for unstructured geometries and the efficiency of ROMs for handling parametric PDEs.
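# The two terms above can be sketched in plain PyTorch as follows. This is a simplified illustration, not PINA's actual implementation; `u_rec`, `z_enc`, and `z_map` are hypothetical names for the reconstructed field $\mathcal{D}(\mathcal{E}(u))$, the encoded latent $\mathcal{E}(u)$, and the mapped latent $\mathcal{M}(\boldsymbol{\mu})$.

```python
import torch


def gca_rom_loss(u_true, u_rec, z_enc, z_map):
    # reconstruction term: || u - D(E(u)) ||^2
    reconstruction = torch.mean((u_true - u_rec) ** 2)
    # latent consistency term: || E(u) - M(mu) ||^2
    consistency = torch.mean((z_enc - z_map) ** 2)
    return reconstruction + consistency
```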
#
# We will now build the autoencoder network, which is a `nn.Module` with two methods: `encode` and `decode`.
#
# In[3]:
class GraphConvolutionalAutoencoder(nn.Module):
    def __init__(
        self, hidden_channels, bottleneck, input_size, ffn, act=nn.ELU
    ):
        super().__init__()
        self.hidden_channels, self.input_size = hidden_channels, input_size
        self.act = act()
        self.current_graph = None
        # Encoder layers: GMM convolutions, then fully connected reduction
        self.fc_enc1 = nn.Linear(input_size * hidden_channels[-1], ffn)
        self.fc_enc2 = nn.Linear(ffn, bottleneck)
        self.encoder_convs = nn.ModuleList(
            [
                GMMConv(
                    hidden_channels[i],
                    hidden_channels[i + 1],
                    dim=1,
                    kernel_size=5,
                )
                for i in range(len(hidden_channels) - 1)
            ]
        )
        # Decoder layers: fully connected expansion, then GMM convolutions
        self.fc_dec1 = nn.Linear(bottleneck, ffn)
        self.fc_dec2 = nn.Linear(ffn, input_size * hidden_channels[-1])
        self.decoder_convs = nn.ModuleList(
            [
                GMMConv(
                    hidden_channels[-i - 1],
                    hidden_channels[-i - 2],
                    dim=1,
                    kernel_size=5,
                )
                for i in range(len(hidden_channels) - 1)
            ]
        )

    def encode(self, data):
        self.current_graph = data
        x = data.x
        h = x  # residual connection
        for conv in self.encoder_convs:
            x = self.act(conv(x, data.edge_index, data.edge_weight) + h)
        x = x.reshape(
            data.num_graphs, self.input_size * self.hidden_channels[-1]
        )
        return self.fc_enc2(self.act(self.fc_enc1(x)))

    def decode(self, z, decoding_graph=None):
        # fall back to the graph seen during encoding if none is given
        data = decoding_graph if decoding_graph is not None else self.current_graph
        x = self.act(self.fc_dec2(self.act(self.fc_dec1(z)))).reshape(
            data.num_graphs * self.input_size, self.hidden_channels[-1]
        )
        h = x  # residual connection
        for i, conv in enumerate(self.decoder_convs):
            x = conv(x, data.edge_index, data.edge_weight) + h
            if i != len(self.decoder_convs) - 1:
                x = self.act(x)
        return x
# Great! We now need to build the graph structure (a PyTorch Geometric `Data` object) from the numerical solver outputs.
#
# The solver provides the solution values $u(\mathbf{x}, \boldsymbol{\mu})$ for each parameter instance $\boldsymbol{\mu}$, along with the node coordinates $(x, y)$ of the unstructured mesh. Because the geometry is not defined on a regular grid, we naturally represent the mesh as a graph:
#
# - **Nodes** correspond to spatial points in the mesh. Each node stores the **solution value** $u$ at that point as a feature.
# - **Edges** represent mesh connectivity. For each edge, we compute:
# - **Edge attributes**: the relative displacement vector between the two nodes.
# - **Edge weights**: the Euclidean distance between the connected nodes.
# - **Positions** store the physical $(x, y)$ coordinates of the nodes.
#
# For each parameter realization $\boldsymbol{\mu}_i$, we therefore construct a PyTorch Geometric `Data` object:
#
# In[4]:
# number of nodes and number of graphs (parameter realizations)
num_nodes, num_graphs = u.shape
graphs = []
for g in range(num_graphs):
    # node positions
    pos = torch.stack([x[:, g], y[:, g]], dim=1)  # shape [num_nodes, 2]
    # edge attributes and weights
    ei, ej = pos[edge_index[0]], pos[edge_index[1]]  # [num_edges, 2]
    edge_attr = torch.abs(ej - ei)  # relative offsets
    edge_weight = edge_attr.norm(p=2, dim=1, keepdim=True)  # Euclidean distance
    # node features (solution values)
    node_features = u[:, g].unsqueeze(-1)  # [num_nodes, 1]
    # build PyG graph
    graphs.append(
        Data(
            x=node_features,
            edge_index=edge_index,
            edge_weight=edge_weight,
            edge_attr=edge_attr,
            pos=pos,
        )
    )
# ## Training with PINA
#
# Everything is now ready! We can use **PINA** to train the model, following the workflow from previous tutorials. First, we need to define the problem. In this case, we will use the [`SupervisedProblem`](https://mathlab.github.io/PINA/_rst/problem/zoo/supervised_problem.html#module-pina.problem.zoo.supervised_problem), which expects:
#
# - **Input**: the parameter tensor $\boldsymbol{\mu}$ describing each scenario.
# - **Output**: the corresponding graph structure (PyTorch Geometric `Data` object) that we aim to reconstruct.
# In[5]:
problem = SupervisedProblem(params, graphs)
# Next, we build the **autoencoder network** and the **interpolation network**.
#
# - The **Graph Convolutional Autoencoder (GCA)** encodes the high-dimensional graph data into a compact latent space and reconstructs the graphs from this latent representation.
# - The **interpolation network** (or parametric map) learns to map a new parameter $\boldsymbol{\mu}^*$ directly into the latent space, enabling the model to predict solutions for unseen parameter instances without running the full encoder.
# In[6]:
reduction_network = GraphConvolutionalAutoencoder(
    hidden_channels=[1, 1], bottleneck=8, input_size=1352, ffn=200, act=nn.ELU
)
interpolation_network = FeedForward(
    input_dimensions=2,
    output_dimensions=8,
    n_layers=2,
    inner_size=200,
    func=nn.Tanh,
)
# Finally, we will use the [`ReducedOrderModelSolver`](https://mathlab.github.io/PINA/_rst/solver/supervised_solver/reduced_order_model.html#pina.solver.supervised_solver.reduced_order_model.ReducedOrderModelSolver) to perform the training, as discussed earlier.
#
# This solver requires two components:
# - an **interpolation network**, which maps parameters $\boldsymbol{\mu}$ to the latent space, and
# - a **reduction network**, which in our case is the **autoencoder** that compresses and reconstructs the graph data.
# In[7]:
# This loss handles both PyG Data objects and torch.Tensors
class CustomMSELoss(nn.MSELoss):
    def forward(self, output, target):
        if isinstance(output, Data):
            output = output.x
        if isinstance(target, Data):
            target = target.x
        return torch.nn.functional.mse_loss(
            output, target, reduction=self.reduction
        )
# Define the solver
solver = ReducedOrderModelSolver(
    problem=problem,
    reduction_network=reduction_network,
    interpolation_network=interpolation_network,
    use_lt=False,
    loss=CustomMSELoss(),
    optimizer=TorchOptimizer(torch.optim.Adam, lr=0.001, weight_decay=1e-05),
)
# Training is performed as usual using the **`Trainer`** API. In this tutorial, we will use only **30% of the data** for training, and train for only $300$ epochs, to illustrate the workflow.
# In[ ]:
trainer = Trainer(
    solver=solver,
    accelerator="cpu",
    max_epochs=300,
    train_size=0.3,
    val_size=0.7,
    test_size=0.0,
    shuffle=True,
)
trainer.train()
# Once the model is trained, we can test the reconstruction by following two steps:
#
# 1. **Interpolate**: Use the `interpolation_network` to map a new parameter $\boldsymbol{\mu}^*$ to the latent space.
# 2. **Decode**: Pass the interpolated latent vector through the autoencoder (`reduction_network`) to reconstruct the corresponding graph data.
# In[9]:
# interpolate
z = interpolation_network(params)
# decode
batch = Batch.from_data_list(graphs)
out = reduction_network.decode(z, decoding_graph=batch)
out, _ = to_dense_batch(out, batch.batch)
out = out.squeeze(-1).T.detach()
# Let's compute the total error, and plot a sample solution:
# In[10]:
# compute error
l2_error = (torch.norm(out - u, dim=0) / torch.norm(u, dim=0)).mean()
print(f"L2 relative error {l2_error:.2%}")
# plot solution
idx_to_plot = 42
# Determine min and max values for color scaling
vmin = min(out[:, idx_to_plot].min(), u[:, idx_to_plot].min())
vmax = max(out[:, idx_to_plot].max(), u[:, idx_to_plot].max())
plt.figure(figsize=(16, 4))
plt.subplot(1, 3, 1)
plt.tricontourf(
    x[:, idx_to_plot],
    y[:, idx_to_plot],
    triang,
    out[:, idx_to_plot],
    100,
    cmap="jet",
    vmin=vmin,
    vmax=vmax,
)
plt.title("GCA-ROM")
plt.colorbar()
plt.subplot(1, 3, 2)
plt.title("True")
plt.tricontourf(
    x[:, idx_to_plot],
    y[:, idx_to_plot],
    triang,
    u[:, idx_to_plot],
    100,
    cmap="jet",
    vmin=vmin,
    vmax=vmax,
)
plt.colorbar()
plt.subplot(1, 3, 3)
plt.title("Squared Error")
plt.tricontourf(
    x[:, idx_to_plot],
    y[:, idx_to_plot],
    triang,
    (u - out).pow(2)[:, idx_to_plot],
    100,
    cmap="jet",
)
plt.colorbar()
plt.show()
# Nice! We can see that the network is correctly learning the solution operator, and the workflow was very straightforward.
#
# You may notice that the network outputs are not as smooth as the actual solution. Don’t worry — training for longer (e.g., ~5000 epochs) will produce a smoother, more accurate reconstruction.
#
# ## What's Next?
#
# Congratulations on completing the introductory tutorial on **Graph Convolutional Reduced Order Modeling**! Now that you have a solid foundation, here are a few directions to explore:
#
# 1. **Experiment with Training Duration** — Try different training durations and adjust the network architecture to optimize performance. Explore different integral kernels and observe how the results vary.
#
# 2. **Explore Physical Constraints** — Incorporate physics-informed terms or constraints during training to improve model generalization and ensure physically consistent predictions.
#
# 3. **...and many more!** — The possibilities are vast! Continue experimenting with advanced configurations, solvers, and features in PINA.
#
# For more resources and tutorials, check out the [PINA Documentation](https://mathlab.github.io/PINA/).