Temporal Graph Neural Networks for Failure and Cause Prediction in Transmission Networks
Overview
Developed temporal Graph Neural Network (GNN) models for large-scale transmission infrastructure in collaboration with Pacific Gas and Electric Company (PG&E).
The project focuses on transforming raw asset inventories, span connectivity data, and historical outage records into graph-structured datasets to enable structure-level failure localization and cause prediction under highly imbalanced, real-world operational conditions.
The framework supports proactive reliability planning by shifting from reactive failure analysis to predictive, topology-aware risk assessment.
Technical Contributions
1. Scalable Graph Construction Pipeline
Designed and implemented a utility-scale graph-building engine that converts raw utility records into structured power-grid graphs:
- Nodes: transmission structures (poles/towers)
- Edges: physical spans connecting structures
- Node features: design attributes, age, environmental exposure layers, insulator aggregates
- Edge features: conductor properties, span geometry
Constructed event-level graph instances with shared topology and event-specific failure labels, enabling temporal train/validation/test splits aligned with operational timelines.
2. Multi-Task Temporal GNN Modeling
Developed multi-task learning architectures to jointly perform:
- Node-level failure localization (identify failed structure within large graph)
- Graph-level failure cause classification (≈10–12 operational cause categories)
Core modeling components:
- Graph Attention Network (GATv2) backbone
- Temporal data splits based on outage timestamps
- Pointer-style localization objective to handle extreme class imbalance
- Candidate-set sampling using k-hop neighborhoods
- Class-balanced and focal-style losses for rare-event learning
Designed topology-aware evaluation metrics including Recall@k and graph-distance-to-true-failure to reflect operational usefulness rather than naive accuracy.
3. Coarse-Graph and Feature Selection Experiments
Implemented supernode-based coarse graph representations to:
- Study information bottlenecks under aggregation
- Perform distance-aware soft labeling
- Evaluate top-k performance across reduced graph resolutions
This experimentation quantified trade-offs between computational efficiency and spatial localization precision.
4. Asset Lifecycle and Insulator Analytics
Developed analytical modules to examine:
- Insulator replacement vs addition detection
- Age distribution and material evolution trends
- Data currency validation across decades of records
Generated lifecycle summaries and structured reports to inform graph feature engineering and reliability insights.
5. Connectivity and GIS-Based Network Analytics
Conducted large-scale graph diagnostics:
- Connected component analysis
- Degree distributions and structural bottleneck identification
- Temporal failure trend analysis
- OpenStreetMap-based interactive GIS visualization of structures and failure clusters
Enabled domain experts to interpret learned patterns in spatial and network context.
Impact
- Converted heterogeneous utility asset records into scalable graph datasets with ~10⁵ nodes per network instance
- Enabled structure-level predictive failure localization under extreme class imbalance
- Integrated topology-aware metrics aligned with utility operational decision-making
- Provided interpretable spatial analytics linking network structure to failure patterns
- Supported transition toward predictive reliability planning and early-warning systems
Relevance
Demonstrates deployment of advanced graph machine learning in real-world utility operations.
Bridges:
- Infrastructure asset analytics
- Graph deep learning
- Temporal failure modeling
- GIS-based network interpretation
- Reliability engineering for large-scale power systems
Establishes a reusable blueprint for graph-based predictive maintenance in critical infrastructure networks.
