AIware 2026
Mon 6 - Tue 7 July 2026 Montreal, Canada
co-located with FSE 2026

Neural network (NN) verifiers are increasingly used to certify safety properties such as robustness (i.e., small allowed perturbations to an input should not alter a model's decision). Because verifiers aim to prove the absence of violations across all specified behaviors, the soundness of their implementations is critical to guaranteeing correctness. Detecting unsoundness is particularly important and challenging because a verifier typically spans multiple components, including specifications, neural networks, operator semantics, and constraint solving, where subtle implementation bugs can silently lead to false certified results.

We present an approach for neural network robustness verifiers that detects and localizes soundness-relevant faults via two types of concrete–abstract consistency checks: (1) Counterexample-Based Refutation (CBR), which refutes a certified result whenever a concrete counterexample is found at runtime; and (2) Bounds-Based Localization (BBL), which audits per-neuron containment (concrete activations must lie within their abstract bounds, as an invariant) to pinpoint incorrect implementations at particular NN layers or operators. To reduce representation drift, we use specification-embedded models that wrap the core NN with its input and output specifications as two additional layers. We further develop an operator-aware NN generator that produces diverse NN models spanning a wide range of layer types, parameters, and architectures, enabling systematic exercise of different operator behaviors.
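To make the two checks concrete, the sketch below implements them for a tiny ReLU network with naive interval propagation standing in for the verifier's abstract domain. All names, weights, and the certified property (output stays positive) are illustrative assumptions, not the paper's actual implementation; the point is only the shape of the checks: BBL asserts per-layer containment of concrete activations in the abstract bounds, and CBR samples concrete inputs looking for a counterexample to a certified result.

```python
import numpy as np

# Hypothetical two-layer ReLU network; weights are illustrative stand-ins.
rng = np.random.default_rng(0)
W1, b1 = rng.normal(size=(4, 2)), np.zeros(4)
W2, b2 = rng.normal(size=(1, 4)), np.zeros(1)

def forward(x):
    """Return per-layer concrete activations for input x."""
    h = np.maximum(W1 @ x + b1, 0.0)   # hidden ReLU layer
    y = W2 @ h + b2                    # output layer
    return [h, y]

def interval_bounds(lb, ub):
    """Naive interval propagation: sound per-layer bounds for x in [lb, ub]."""
    def affine(W, b, lo, hi):
        c, r = (lo + hi) / 2, (hi - lo) / 2
        center = W @ c + b
        radius = np.abs(W) @ r          # interval arithmetic for W @ x + b
        return center - radius, center + radius
    l1, u1 = affine(W1, b1, lb, ub)
    l1, u1 = np.maximum(l1, 0.0), np.maximum(u1, 0.0)  # ReLU on bounds
    l2, u2 = affine(W2, b2, l1, u1)
    return [(l1, u1), (l2, u2)]

def bbl_check(x, lb, ub, eps=1e-9):
    """BBL-style audit: every concrete activation must lie inside its
    abstract bounds; a violation pinpoints the offending layer."""
    for i, (a, (lo, hi)) in enumerate(zip(forward(x), interval_bounds(lb, ub))):
        if np.any(a < lo - eps) or np.any(a > hi + eps):
            return f"containment violated at layer {i}"
    return "all layers contained"

def cbr_check(lb, ub, n=1000):
    """CBR-style audit: if the verifier certified y > 0 on [lb, ub], any
    sampled input with y <= 0 is a concrete counterexample."""
    for _ in range(n):
        x = rng.uniform(lb, ub)
        if forward(x)[-1][0] <= 0.0:
            return "certificate refuted by counterexample"
    return "no counterexample found"

lb, ub = np.array([-0.1, -0.1]), np.array([0.1, 0.1])
x = rng.uniform(lb, ub)
print(bbl_check(x, lb, ub))  # sound bounds -> "all layers contained"
print(cbr_check(lb, ub))
```

A buggy operator implementation (say, a transposed weight matrix in the bound propagation) would surface here as a containment violation at that layer, which is exactly the localization signal BBL provides when CBR's sampling fails to hit a counterexample in high dimensions.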

We evaluate neural network verifiers on three abstract domains using six mutation operators. Across 450 soundness-violating instances, our framework detects 73% of the injected violations. CBR mainly exposes input–output-level soundness failures when a concrete counterexample is found during input sampling, while BBL catches internal bound-containment violations and localizes them to specific layers or operators, even when CBR becomes ineffective for high-dimensional inputs. These results indicate that combining coarse refutation (CBR) with fine-grained invariant checking (BBL) provides practical assurance for robustness verifiers, and that operator-aware generation further improves both coverage and the discovery of unsoundness issues.