Framework Verification Summary#

Overview#

MADDENING’s verification evidence is maintained through automated testing and registered verification benchmarks.

Test Suite#

  • Total tests: 500+ (and growing)

  • CI system: GitHub Actions

  • Test runner: pytest

  • Platforms tested:

    • OS: Ubuntu Linux

    • Python: 3.12

    • JAX: 0.4+ (pinned in CI)

    • Backend: CPU (GPU tests disabled due to MADD-ANO-001)

Test Organization#

Directory

Scope

tests/core/

Core framework: GraphManager, scheduling, coupling, adaptive, checkpoint, sweep

tests/nodes/

Physics node correctness: HeatNode, LBMPipeNode, RigidBody2DNode, SpringDamperNode

tests/surrogates/

Neural surrogate training, architectures, dataset generation

tests/api/

FastAPI server, WebSocket, binary encoding, server-side rendering

tests/viz/

Visualization backends, ZMQ transport, serialization

tests/compliance/

Compliance infrastructure: metadata, anomaly validator, stability decorator

tests/verification/

Registered verification benchmarks (analytical comparisons, convergence studies)

Registered Verification Benchmarks#

See maddening.compliance.get_benchmark_registry() for the machine-readable registry.

Benchmark ID

Node

Type

Acceptance Criteria

MADD-VER-001

HeatNode

Analytical

L2 error vs analytical solution < 1e-3

Registry grows as benchmarks are added in Phase 2+.