AIware 2026
Mon 6 - Tue 7 July 2026 Montreal, Canada
co-located with FSE 2026

This program is tentative and subject to change.

Neural network (NN) verifiers are increasingly used to certify safety properties such as robustness (i.e., small allowed perturbations to an input should not alter a model’s decision). Since verifiers aim to prove the absence of violations by considering all possible specified behaviors, the soundness of their implementations is therefore critical to guaranteeing correctness. Detecting unsoundness is particularly important and challenging, because a verifier typically spans multiple components, including specifications, neural networks, operator semantics, and constraint solving, where subtle implementation bugs can silently lead to \emph{false} certified results.

We present an approach for neural network robustness verifiers that detects and localizes soundness-relevant faults via two types of concrete–abstract consistency checks: (1) \textit{Counterexample-Based Refutation} (CBR), where a certification is supposed to be refuted if a concrete counterexample is found at runtime; and (2) \emph{Bounds-Based Localization} (BBL), which audits per-neuron containment (concrete activations must lie within abstract bounds as an invariant) to pinpoint incorrect implementations at particular NN layers or operators. To reduce representation drift, we use specification-embedded models that wrap the core NN with input and output specifications as two additional layers. We further develop an operator-aware NN generator that can produce diverse NN models spanning a wide range of layer types, parameters, and architectures, enabling systematic exposure and exercise of different operator behaviors.

We evaluate neural network verifiers on three abstract domains using six mutation operators. Across 450 soundness-violating instances, our framework detects 73% of injected soundness violations. CBR mainly exposes input-output-level soundness failures when a concrete counterexample is found during input sampling, while BBL catches internal bound-containment violations and localizes them to specific layers/operators, even when CBR becomes ineffective in high-dimensional inputs. These results indicate that combining coarse refutation (CBR) with fine-grained invariant checking (BBL) provides practical assurance for robustness verifiers, and operator-aware generation further boosts both coverage and discovery of unsoundness issues.

This program is tentative and subject to change.

Mon 6 Jul

Displayed time zone: Eastern Time (US & Canada) change

14:00 - 15:30
Trustworthy Code Generation, Reliability, and Engineering of AIware SystemsMain Track at MB 1.210
14:00
5m
Talk
VeriTrans: Fine-Tuned LLM-Assisted NL→PL Translation via a Deterministic Neuro-Symbolic Pipeline
Main Track
Xuan Liu , Dheeraj Kodakandla Pennsylvania State University, US, Kushagra Srivastva Pennsylvania State University, US, Mahfuza Farooque Pennsylvania State University, US
14:05
5m
Talk
Kubernetes Misconfigurations in the Wild: Taxonomy, Evolution, and Automated Repair with Large Language Models
Main Track
GHORAB Mostafa Anouar Université Laval, CA, Ahmad Abdellatif University of Calgary, Mohamed Aymen saied Laval University
14:10
5m
Talk
Quality and Security Signals in AI-Generated Python Refactoring Pull Requests
Main Track
Mohamed Almukhtar University of Michigan-Flint, Anwar Ghammam University of Michigan - Dearborn, Hua Ming
Pre-print
14:15
5m
Talk
From Assistance to Agency: Rethinking Autonomy and Control in CI/CD Pipelines
Main Track
Marcus Barnes University of Toronto, Taher A. Ghaleb Trent University, Safwat Hassan University of Toronto
Pre-print
14:20
5m
Talk
Beyond Translation Accuracy: Addressing False Failures in LLM-Based Code Translation
Main Track
Fazle Rabbi Concordia University, Soumit Kanti Saha Concordia University, CA, Jinqiu Yang Concordia University
14:25
5m
Talk
Executable but Unlearnable: Designing Code that Resists LLM-Based Learning
Main Track
Viraaji Mothukuri Kennesaw State University, Reza M. Parizi Kennesaw State University
14:30
5m
Talk
Detecting Unsoundness in Neural Network Verifiers via Concrete–Abstract Consistency
Main Track
Kaijie Liu University of New South Wales, Sydney, Yulei Sui University of New South Wales
Pre-print
14:35
5m
Talk
From Correctness to Consistency: Redefining Reliability for the Agentware Era
Main Track
Xue Qin Villanova University, Mauricio Gouvea Gruppi
14:40
5m
Talk
A Preliminary Study on Explaining Risk of Code Changes using LLM-based Prediction Models
Main Track
Yalin Liu Facebook, US, Kosay Jabre Meta Platforms, Inc., Rui Abreu Meta, Zachariah J Carmichael Facebook, US, Vijayaraghavan Murali Rice University, Akshay Patel Meta Platforms, Inc., Jun Ge Meta Platforms, Inc., Weiyan Sun Meta Platforms, Inc., Cong Zhang Southern Methodist University, Southern Methodist University, US, Audris Mockus The University of Tennessee, Knoxville / Vilnius University, David Khavari , Peter Rigby Concordia University; Meta, Nachiappan Nagappan Meta Platforms, Inc.
14:45
5m
Talk
When AI Coding Assistants Leak Training Data: Study Memorization in Code LLMs
Main Track
Xiaoyu Cheng , Kundi Yao Ontario Tech University, Pengyu Nie University of Waterloo, Weiyi Shang University of Waterloo
14:50
5m
Talk
Zombie Agents: Detecting Semantic Livelock in Long-Horizon Autonomous Software
Main Track
14:55
5m
Talk
Neural-Symbolic Multi-Objective Optimization for Performance-Aware ORM Database Design
Main Track
Sasan Azizian Bellevue University, Ayoub Hazrati The Vanguard Group, Artin Azizian McGill University, School of Computer Science, Elham Rastegari Creighton University, Hamid Bagheri University of Nebraska-Lincoln, Juan Cui University of Nebraska, Lincoln, US
15:00
5m
Talk
TriORM: Workload-Aware Neural--Symbolic Multi-Objective Optimization for ORM Mapping Design
Main Track
Sasan Azizian Bellevue University, Ayoub Hazrati The Vanguard Group, Artin Azizian McGill University, School of Computer Science, Elham Rastegari Creighton University
15:05
5m
Talk
Artifact Readiness Gates with Saturation Stop Rules and Host-Parity Admissibility for FM Release Evaluation
Main Track
Yanick Kanyiki InvarLock Inc., CA
15:10
5m
Talk
Towards Migrating Neural Network Implementations
Main Track
Nadia Daoudi Luxembourg Institute of Science and Technology, Iván Alfonso Luxembourg Institute of Science and Technology, Jordi Cabot Luxembourg Institute of Science and Technology
15:15
5m
Talk
From Code Review to Spec-Driven Contracts: A Vision for Auditable AIWare Systems
Main Track
Mohammad Hamdaqa Polytechnique Montreal, Moataz Chouchen Concordia University
15:20
10m
Live Q&A
Joint Q&A and Discussion
Main Track