Aerospace & Space

ESA Anomaly Detection Benchmark — Mission 1 | Kotowski et al., 2024 Aerospace

9-Month Predictive Lead Time on Spacecraft Telemetry

The most demanding spacecraft telemetry dataset ever publicly released — 17.5 years of real satellite data produced by ESA, Airbus Defence and Space, and KP Labs. FSME Logic detected a four-sensor subsystem cascade 9 months before the first officially recorded anomaly. Every competing algorithm tested by ESA was classified as operationally insufficient.

9 mo

Predictive lead time

7.4min

8.4M rows processed

False positives

4/4

Channels flagged

FSME Logic audited four sensor channels from Group 13, Subsystem 6 — four physically linked sensors on the same spacecraft subsystem. The engine detected entropic deviation across all four channels within a narrow sequential window at the very start of the audit period, consistent with a physical failure propagating through connected hardware in topology order.

Three of four channels flagged within a 17-step detection window (steps 502–519). The fourth confirmed at step 754. This sequential propagation — detected independently across four separate channels with no cross-channel communication — maps the physical order a failure event would spread through a connected subsystem. A false-positive-generating algorithm flags channels in random, non-physical order. FSME mapped the cascade.

The first official ESA-annotated failure event was not recorded until October 2012. FSME detection steps correspond to approximately January 1–2, 2012. Lead time: 9 months.

During the full 6-month stable baseline window before the detection event, the engine produced zero false positive detections — correctly distinguishing commanded spacecraft events from genuine structural degradation. False positive rate was the ESA researchers' single highest operational priority, cited above detection rate and F-score in their published benchmark.

Algorithm	Training Time	Mission 1 Score	FSME Logic
Telemanom-ESA	13,115 sec (3.6 hrs)	F₀.₅ = 0.061 — Operationally Failed	9-month lead time confirmed
DC-VAE-ESA	13,466 sec (3.7 hrs)	F₀.₅ ≈ 0.008 — Operationally Failed
Windowed iForest	2,833 sec (0.8 hrs)	Concept drift failure
KNN	3,844 sec (1.1 hrs)	Out of memory — Failed

NASA Mars Science Laboratory — Curiosity Rover Actuator T-13 Aerospace

6.3-Hour Warning on NASA's Mars Rover

Detected mechanical actuator binding on the Curiosity Rover 382 data steps before NASA's official failure label — providing 6.3 hours of advance warning on a mission-critical planetary asset operating 225 million kilometres from the nearest repair crew.

6.3hr

Warning before failure

382

Steps ahead of NASA label

Training data used

The Curiosity Rover actuator failure is one of the most well-documented mechanical failure events in planetary exploration. FSME Logic detected the early stress accumulation pattern 382 steps before NASA's own failure annotation — representing 6.3 hours of advance warning on a system where no physical intervention is possible once failure occurs.

A 6.3-hour warning window on a planetary rover is the equivalent of detecting a fleet vehicle failure on Monday morning before the Tuesday breakdown — enough time to adjust mission parameters, redistribute workload, and execute controlled shutdown procedures rather than emergency response.

ESA OPS-SAT — Orbital CubeSat Sensor Degradation Aerospace

11.5-Minute Orbital Warning on Live Satellite Hardware

Flagged in-orbit CubeSat sensor degradation nearly 12 minutes before official ESA ground-truth failure timestamps — blind test, no prior access to the dataset, no training data.

11.5min

Before official timestamp

Blind

No prior dataset access

The OPS-SAT result was produced as a blind test — FSME Logic had no prior access to the dataset and no knowledge of when the official failure events were recorded. Detection results were compared against ESA ground-truth labels after the audit was complete. On in-orbit hardware operating in a high-radiation environment with no possibility of physical maintenance, an 11.5-minute early warning enables operational responses — safe-mode transitions, data preservation routines, ground station alerts — that would be impossible without advance notice.

Industrial & Fleet

NASA C-MAPSS — Commercial Modular Aero-Propulsion System Simulation Fleet

78% Detection Rate Across 509 Jet Engines

Successfully identified degradation across 509 commercial turbofan engines with an average 126-cycle warning advantage — enabling planned overhauls instead of emergency groundings across an entire operational fleet.

78%

Fleet detection rate

126

Avg cycle warning

509

Engines tested

The NASA C-MAPSS dataset is the industry benchmark for jet engine predictive maintenance — 509 turbofan engines run to failure under varying operating conditions and fault severities. FSME Logic detected degradation across 78% of the fleet with an average 126-cycle warning window.

Applied to a commercial trucking fleet: a 126-cycle equivalent warning on a vehicle running 5-day cycles represents a 630-day advance notice window — over 20 months of lead time to schedule maintenance, order components, and prevent unplanned downtime entirely.

CWRU Bearing Fault Benchmark — Case Western Reserve University Industrial

100% Fault Classification on Industrial Bearings

Successfully classified all 9 bearing fault types and 3 severity levels — on ARM edge hardware, fully offline, with no cloud infrastructure and no training phase on historical failure examples.

100%

Classification rate

Fault types classified

Severity levels

The CWRU bearing dataset is the most widely used benchmark in industrial rotating machinery fault detection. FSME Logic classified every fault type and severity level correctly — including inner race faults, outer race faults, and ball faults at 0.007, 0.014, and 0.021 inch defect diameters. This result was produced on ARM-based edge hardware equivalent to the Raspberry Pi devices used in FSME Logic's field deployment configuration.

Commercial Refrigeration — Condenser Unit Health Monitor Industrial

Defeating the Green Dashboard Fallacy

Detected a 54% internal stress deviation in refrigeration condenser hardware while the thermostat showed a completely normal reading — the clearest possible demonstration of why threshold monitoring misses real failures.

54%

Stress deviation detected

Normal

Thermostat reading

The refrigeration dataset demonstrates the core commercial value proposition directly. Standard threshold monitoring showed a completely normal operating temperature. FSME Logic detected a 54% deviation in the internal stress signature of the condenser unit — structural degradation that was completely invisible to the conventional sensor.

This is the Green Dashboard Fallacy in a real commercial system: the dashboard says green. The system is failing. FSME Logic reads the difference.

For fleet operators, food distribution companies, cold-chain logistics, and any operation relying on refrigeration equipment — the gap between "thermostat normal" and "condenser degrading" is exactly where unplanned downtime and product loss occur.

Methodology

How every result
was produced.

Every validation result follows the same principles — so you know exactly what the numbers mean.

Public Datasets Only

Every dataset used is publicly available and independently verifiable. NASA C-MAPSS, ESA-ADB, CWRU, and OPS-SAT are all accessible to any researcher who wants to reproduce these results.

Cold Deployment

No historical failure examples were provided before testing. The engine deployed cold on every dataset — the same way it deploys on a new client site with no prior failure history on record.

Ground Truth Comparison After the Fact

Detection results were produced first, then compared against official ground-truth labels. No parameter adjustment after seeing the results. No retrofitting to known outcomes.

Edge Hardware Throughout

All processing was performed on ARM-based edge hardware — the same class of device used in field deployments. No cloud servers, no GPU acceleration, no infrastructure unavailable to a standard audit engagement.

Forensic Reports

Full Forensic
Audit Reports.

Deep-dive validation reports from FSME Logic engagements. Each report documents the full detection timeline, channel-by-channel analysis, and forensic findings.

Note on sensitive reports: Reports containing proprietary methodology details or client-specific data are available under NDA only. Contact us directly to request access.

▸ NASA

SMAP Satellite Reaction Wheel Failure

NASA JPL Telemanom Benchmark Dataset

Lead Time84.7 hours (E-4), 68.6 hours (A-2)

DatasetSMAP orbital telemetry, channels E-4, A-2, D-12

False PositivesZero across D-12 healthy baseline

Full channel-by-channel detection timeline for NASA's SMAP satellite. Documents 3.5-day advance warning on orbital reaction wheel failure with zero false positives on verified stable channels.

▸ View Report ↓ Download

▸ NASA

Curiosity Rover Actuator Binding Detection

NASA MSL Curiosity Mission Dataset

Lead Time6.3 hours / 382 operational steps

AssetActuator T-13, Martian surface operation

EnvironmentExtreme temperature variance, terrain resistance

Detection of mechanical actuator binding on the Mars Curiosity Rover 382 operational steps before NASA's official failure label. Differentiates internal degradation from normal terrain load.

▸ View Report ↓ Download

▸ ESA

OPS-SAT CubeSat Blind Validation

ESA OPS-SAT 3U CubeSat Mission

Lead Time11.5 minutes orbital warning

DeploymentEdge-native, resource-constrained flight computer

Test TypeBlind — ground truth withheld until after detection

Blind validation on active ESA CubeSat mission data. Engine detected in-orbit sensor degradation 11.5 minutes before ESA's official monitoring systems flagged the event, with zero false positives.

▸ View Report ↓ Download

▸ IBM Quantum

IBM Processor Fleet Audit

IBM Boston, Athens, Melbourne, Santiago

Processors4 superconducting quantum processors

Volume66,747 circuit executions per processor

Key ResultTopological cascade mapping — exact failure propagation sequence

Four-processor quantum audit documenting hardware degradation cascades, environmental shock signatures, and sequential domino failure propagation across superconducting qubit systems.

▸ View Report ↓ Download

▸ Industrial

NASA C-MAPSS Fleet Detection

NASA Commercial Modular Aero-Propulsion System Simulation

Fleet Size509 commercial turbofan engines

Detection Rate78% fleet-wide

Avg Lead Time126 flight cycles advance warning

Fleet-scale detection across 509 turbofan engines. Documents 78% detection rate with 126 flight cycle average advance warning. Zero false positives on verified healthy baseline engines.

▸ View Report ↓ Download

▸ Multi-Domain

Full Capability Brief

All Validated Domains — NASA, ESA, IBM, Industrial

CoverageQuantum, aerospace, fleet, refrigeration

FormatExecutive summary + full case studies

AudienceCTOs, procurement, technical evaluators

Complete multi-domain validation brief covering all case studies, competitive landscape analysis, and commercial value proposition. The recommended starting point for technical evaluators.

▸ View Report ↓ Download

Real datasets.
Real hardware.
Real results.

9-Month Predictive Lead Time on Spacecraft Telemetry

6.3-Hour Warning on NASA's Mars Rover

11.5-Minute Orbital Warning on Live Satellite Hardware

78% Detection Rate Across 509 Jet Engines

100% Fault Classification on Industrial Bearings

Defeating the Green Dashboard Fallacy

The full picture
at a glance.

How every result
was produced.

Public Datasets Only

Cold Deployment

Ground Truth Comparison After the Fact

Edge Hardware Throughout

Full Forensic
Audit Reports.

SMAP Satellite Reaction Wheel Failure

Curiosity Rover Actuator Binding Detection

OPS-SAT CubeSat Blind Validation

IBM Processor Fleet Audit

NASA C-MAPSS Fleet Detection

Full Capability Brief

Want to see what FSME Logic
finds in your data?

Real datasets.Real hardware.Real results.

9-Month Predictive Lead Time on Spacecraft Telemetry

6.3-Hour Warning on NASA's Mars Rover

11.5-Minute Orbital Warning on Live Satellite Hardware

78% Detection Rate Across 509 Jet Engines

100% Fault Classification on Industrial Bearings

Defeating the Green Dashboard Fallacy

The full pictureat a glance.

How every resultwas produced.

Public Datasets Only

Cold Deployment

Ground Truth Comparison After the Fact

Edge Hardware Throughout

Full ForensicAudit Reports.

SMAP Satellite Reaction Wheel Failure

Curiosity Rover Actuator Binding Detection

OPS-SAT CubeSat Blind Validation

IBM Processor Fleet Audit

NASA C-MAPSS Fleet Detection

Full Capability Brief

Want to see what FSME Logicfinds in your data?

Real datasets.
Real hardware.
Real results.

The full picture
at a glance.

How every result
was produced.

Full Forensic
Audit Reports.

Want to see what FSME Logic
finds in your data?