Temporal Graph Neural Networks for Failure and Cause Prediction in Transmission Networks

Overview

Developed temporal Graph Neural Network (GNN) models for large-scale transmission infrastructure in collaboration with Pacific Gas and Electric Company (PG&E).

The project focuses on transforming raw asset inventories, span connectivity data, and historical outage records into graph-structured datasets to enable structure-level failure localization and cause prediction under highly imbalanced, real-world operational conditions.

The framework supports proactive reliability planning by shifting from reactive failure analysis to predictive, topology-aware risk assessment.

Technical Contributions

1. Scalable Graph Construction Pipeline

Designed and implemented a utility-scale graph-building engine that converts raw utility records into structured power-grid graphs:

Nodes: transmission structures (poles/towers)
Edges: physical spans connecting structures
Node features: design attributes, age, environmental exposure layers, insulator aggregates
Edge features: conductor properties, span geometry

Constructed event-level graph instances with shared topology and event-specific failure labels, enabling temporal train/validation/test splits aligned with operational timelines.

2. Multi-Task Temporal GNN Modeling

Developed multi-task learning architectures to jointly perform:

Node-level failure localization (identify failed structure within large graph)
Graph-level failure cause classification (≈10–12 operational cause categories)

Core modeling components:

Graph Attention Network (GATv2) backbone
Temporal data splits based on outage timestamps
Pointer-style localization objective to handle extreme class imbalance
Candidate-set sampling using k-hop neighborhoods
Class-balanced and focal-style losses for rare-event learning

Designed topology-aware evaluation metrics including Recall@k and graph-distance-to-true-failure to reflect operational usefulness rather than naive accuracy.

3. Coarse-Graph and Feature Selection Experiments

Implemented supernode-based coarse graph representations to:

Study information bottlenecks under aggregation
Perform distance-aware soft labeling
Evaluate top-k performance across reduced graph resolutions

This experimentation quantified trade-offs between computational efficiency and spatial localization precision.

4. Asset Lifecycle and Insulator Analytics

Developed analytical modules to examine:

Insulator replacement vs addition detection
Age distribution and material evolution trends
Data currency validation across decades of records

Generated lifecycle summaries and structured reports to inform graph feature engineering and reliability insights.

5. Connectivity and GIS-Based Network Analytics

Conducted large-scale graph diagnostics:

Connected component analysis
Degree distributions and structural bottleneck identification
Temporal failure trend analysis
OpenStreetMap-based interactive GIS visualization of structures and failure clusters

Enabled domain experts to interpret learned patterns in spatial and network context.

Impact

Converted heterogeneous utility asset records into scalable graph datasets with ~10⁵ nodes per network instance
Enabled structure-level predictive failure localization under extreme class imbalance
Integrated topology-aware metrics aligned with utility operational decision-making
Provided interpretable spatial analytics linking network structure to failure patterns
Supported transition toward predictive reliability planning and early-warning systems

Relevance

Demonstrates deployment of advanced graph machine learning in real-world utility operations.

Bridges:

Infrastructure asset analytics
Graph deep learning
Temporal failure modeling
GIS-based network interpretation
Reliability engineering for large-scale power systems

Establishes a reusable blueprint for graph-based predictive maintenance in critical infrastructure networks.

Share on

Bluesky Facebook LinkedIn Mastodon X (formerly Twitter)

Kamiar Khayambashi