Failure-Finding Intervals: How to Set Inspection Frequency for Protective Devices

Calendar
Duration:
9 min read
calendar today
Published on
June 11, 2026
Featured Image

A failure-finding interval (FFI) is the maximum allowable time between inspections of a protective device to ensure its hidden failure probability stays within an acceptable risk threshold. For protective devices like pressure relief valves, fire suppression systems, safety interlocks, and emergency shutdown valves, FFI is not optional — it is the calculation that keeps dormant failures from becoming catastrophic events. According to the U.S. Occupational Safety and Health Administration (OSHA), the majority of catastrophic process failures involve a safety system that had been in a failed state for an unknown period before the incident. Setting the right inspection frequency — and enforcing it — is the difference between a protective layer that works and one that only looks like it does on paper.

This guide covers the failure-finding interval formula, device-specific inspection frequencies, common calculation mistakes, and how a modern CMMS automates FFI scheduling to keep your protective layers functional and auditable.

What Is a Failure-Finding Interval?

What is a failure-finding interval — 4-card concept illustration showing hidden failure detection, scheduled proof tests, dormant risk window, and RCM strategy | Cryotos

A failure-finding interval is a scheduled inspection task designed specifically to detect whether a protective device has failed in a dormant (hidden) state. Unlike run-to-failure or time-based preventive maintenance tasks, an FFI task does not fix anything — it tests whether the device can still do its job when called upon. If the device fails the test, a corrective action is triggered. If it passes, the next FFI is scheduled for the calculated interval ahead.

The concept comes directly from reliability-centered maintenance (RCM), where failure modes are classified by whether they are evident or hidden to the operator under normal operating conditions. Protective devices — by design — only activate during an abnormal event. A pressure relief valve does not pop open during normal operation. A fire suppression system does not discharge on a normal working day. An emergency shutdown valve sits closed until a process upset demands it to open. This means a failure of any of these devices will remain completely invisible until the moment protection is actually needed.

Hidden and Evident Failures in Protective Devices

A failure is hidden when no operating crew member will notice it during normal operations. A pressure transmitter that feeds a high-pressure alarm can fail in a way that leaves the alarm permanently silenced — and no one knows until an overpressure event occurs with no warning. A fire damper that has seized open due to corrosion is invisible on the operations dashboard but catastrophically wrong when fire breaks out. These are precisely the failure modes that FFI tasks are designed to surface before they matter.

Why Protective Devices Need Dedicated Inspection Intervals

Most maintenance strategies focus on assets that fail in ways operators can detect — vibration rises, temperature increases, throughput drops. Protective devices are fundamentally different. They are designed to sit idle and activate only under the conditions they are protecting against. This idle-by-design nature means:

  • No degradation signal: A safety interlock that has failed open generates no alarm, no noise, and no observable change in process parameters. Without deliberate testing, it remains in a failed state indefinitely.
  • Compounding risk over time: The longer a hidden failure goes undetected, the higher the probability that a demand event (the process upset the device was meant to protect against) will occur while the device is unavailable. This is the core risk metric that failure-finding intervals are designed to control.
  • Regulatory requirement: Standards including IEC 61511 (Functional Safety of Safety Instrumented Systems) and NFPA 72 require documented proof-test intervals for protective devices. An undocumented or irregular FFI schedule is a compliance gap, not just an operational risk.
  • Independence from normal PM: Time-based lubrication and cleaning tasks do not verify function. A safety valve that has been cleaned and lubricated on schedule can still fail to lift at set pressure if the disc has corroded to the seat — and a standard PM task will not catch that.

The practical implication is that protective devices need their own maintenance strategy: one built on function testing at calculated intervals, with results tracked and correlated to failure rates over time. Preventive maintenance software that supports dynamic interval scheduling — driven by actual failure data — is the operational backbone of an effective FFI program.

How to Calculate Your Failure-Finding Interval (FFI Formula)

How to calculate failure-finding interval — 5-stage process flow from failure rate sourcing to interval refinement with field data | Cryotos

The standard RCM-derived formula for a failure-finding interval is based on balancing three variables: the unavailability target (the acceptable probability that the device is in a failed state at any given moment), the probability that a demand event will occur, and the mean time to restore or detect the failure. The simplified working formula used in most industrial FFI programs is:

FFI = 2 × (Target Unavailability ÷ Failure Rate)

Where:

  • Target Unavailability is the maximum acceptable fraction of time the protective device can be in a failed state. A common industry target for a Safety Integrity Level (SIL) 1 system is 0.01 to 0.1 (1%–10% unavailability).
  • Failure Rate is the probability that the device will fail in a dormant state per unit time, typically expressed as failures per hour (λ). This data comes from device manufacturer specifications, OREDA (Offshore Reliability Data) handbooks, or your own historical failure data.

For example, if a safety interlock has a dormant failure rate of 0.0001 failures per hour (one failure per ~10,000 hours) and your target unavailability is 5%, the calculation is: FFI = 2 × (0.05 ÷ 0.0001) = 1,000 hours, or approximately 42 days. You would inspect this device at least once every 42 days to maintain the 5% unavailability target.

Using the FFI Formula in Practice

For most maintenance teams, the exact dormant failure rate is not initially available from their own records. In this case, start with conservative published rates from reliability databases such as OREDA, then refine the interval based on actual proof-test results over time. If 20 consecutive proof tests show zero failures, your real failure rate is lower than assumed and you can justify extending the interval. If failures are found at a rate higher than assumed, shorten the interval immediately and investigate the cause with a root cause analysis. Use the failure rate calculator to help compute the right starting values for your specific devices.

Inspection Intervals by Protective Device Type

While the FFI formula gives you a site-specific calculated interval, the following reference table provides widely used starting-point inspection frequencies drawn from IEC 61511, API 689, NFPA standards, and industry practice. These are not substitutes for a site-specific FFI calculation — they are starting benchmarks to validate your calculations against.

Protective DeviceTypical FFI RangeKey Proof-Test ActionGoverning Standard
Pressure Relief Valve (PRV)12–24 monthsLift test at set pressure; inspect seat and discAPI 510 / API 576
Safety Instrumented System (SIS) Interlock3–12 months (SIL-dependent)Full functional test including trip initiationIEC 61511
Emergency Shutdown Valve (ESDV)3–6 monthsPartial stroke test; full stroke test at shutdownAPI 6D / IEC 61511
Fire Detection and Alarm System1–6 monthsDetector activation test; panel annunciation checkNFPA 72
Fire Suppression System (Sprinkler/Gaseous)6–12 monthsFlow test; control valve inspection; agent weight checkNFPA 25 / NFPA 2001
Gas Detection System1–3 monthsBump test with calibration gas; alarm setpoint verificationISA-TR84.00.04
Safety Relief Valve (Rupture Disk)24–36 months or process-dependentVisual inspection; pressure holding testASME Section VIII
Electrical Overcurrent Protection (Circuit Breaker)12–36 monthsTrip test at rated current; contact resistance checkNFPA 70B / IEEE C37.06

When your calculated FFI is significantly shorter than the typical range shown above, that is a signal to re-examine your failure rate assumption or evaluate whether the device specification is appropriate for the application. When your calculated FFI is significantly longer than the typical range, treat the typical range as an upper limit until you have accumulated enough proof-test history to justify the extension.

Common Mistakes That Undermine Failure-Finding Effectiveness

Even teams that calculate FFIs correctly often erode their protective value through flawed execution. These are the most common failure modes in failure-finding programs:

  • Partial testing instead of full proof tests: A visual inspection of a pressure relief valve is not the same as a lift test at set pressure. Partial tests give false confidence and do not verify that the device will perform under actual demand conditions. Every FFI task must verify the complete function the device is designed to perform.
  • No failure recording from proof tests: If proof test results are recorded as pass/fail with no detail, you cannot improve your FFI calculations over time. Record the actual measured response — lift pressure for a PRV, response time for an ESDV, gas concentration for a detector calibration — so failure rate trends can be tracked.
  • Using time-based intervals without recalculating: An FFI interval set five years ago based on a generic failure rate may be wrong today. Changes in process conditions, device age, environmental factors, or accumulated proof-test data all affect the optimal interval. Review FFI calculations at least every two to three years, or after any proof-test failure is found.
  • Treating the interval as the target rather than the limit: The FFI formula gives the maximum acceptable interval, not the ideal inspection frequency. For high-consequence protective devices in critical service, inspecting more frequently than calculated is acceptable and often prudent in the early stages of a program before your own failure rate data matures.
  • No corrective action loop for proof-test failures: When a proof test finds a failed device, the failure mode and corrective action must feed back into the FFI calculation. A device that fails at a higher rate than assumed needs a shorter interval — and that adjustment needs to happen before the next scheduled inspection, not at the next annual review.

These mistakes are preventable with the right process and the right tools. A workflow automation layer that ties proof-test results to automatic interval recalculation and corrective work order generation closes the loop that manual processes leave open.

How a CMMS Automates Failure-Finding Inspection Schedules

How CMMS automates failure-finding inspection schedules — 4 capabilities: dynamic PM scheduling, proof-test checklists, auto corrective work orders, audit-ready compliance history | Cryotos

Managing failure-finding intervals manually — through spreadsheets, paper logs, or calendar reminders — creates the same hidden failure problem you are trying to solve. A missed proof test due to a scheduling error is a period of uncontrolled unavailability. A result that never gets analyzed is a data point that never improves your FFI calculation. A corrective action that is not formally raised and tracked may or may not be completed before the next demand event.

A modern CMMS solves each of these gaps through four specific capabilities:

  • Dynamic PM scheduling with multiple trigger types: Cryotos supports time-based, meter-based, and condition-triggered PM schedules. For protective devices whose FFI is expressed in operating hours rather than calendar time (such as an emergency shutdown valve that should be tested every 2,000 hours of process operation), the CMMS generates the work order automatically when the meter reading reaches the threshold — without any manual tracking. The IoT meter reading integration feeds process counters directly into the scheduler.
  • Structured proof-test checklists with quantitative fields: Cryotos allows maintenance teams to build inspection checklists with numeric input fields for actual measured values — not just pass/fail checkboxes. A pressure relief valve proof test can require technicians to log the actual lift pressure in bar, the disc condition rating, and the seat leakage classification. These structured data points feed into failure rate tracking over time, supporting FFI recalculation with real field data rather than handbook assumptions. Use the asset inspection checklist templates as a starting framework for your protective device proof tests.
  • Automatic corrective work order generation on failures: When a technician marks a proof-test step as failed in the Cryotos mobile app, the system automatically generates a corrective work order with the failure details, links it to the same asset, and routes it to the appropriate team for resolution. The protective device is flagged as functionally unavailable in the asset record until the corrective work order is closed — giving operations and safety teams real-time visibility into whether their protective layer is active.
  • Audit-ready inspection history for compliance: Every proof test completed in Cryotos generates an immutable, timestamped record linked to the specific asset and the technician who performed the test. For IEC 61511 audits, OSHA PSM inspections, and insurance risk assessments, the complete proof-test history is retrievable in seconds from the BI dashboard or the asset maintenance history. There is no gap between what was scheduled and what was recorded — the CMMS maintains both in the same system.

Teams using Cryotos CMMS have reported a 30% reduction in unplanned downtime and 25% faster repair times after connecting their protective device inspection programs to structured PM workflows. For safety-critical assets where the failure mode is hidden, that reduction represents not just maintenance efficiency but real risk reduction across the operation. See how Cryotos maintenance management software supports protective device FFI programs from scheduling through compliance reporting.

Frequently Asked Questions

What is the difference between a failure-finding interval and a preventive maintenance interval?

A preventive maintenance interval targets failure modes that degrade over time and affect normal equipment operation — lubrication intervals, filter replacements, and calibration cycles. A failure-finding interval targets hidden failure modes in protective devices that only activate during abnormal events. The goal of a PM interval is to prevent failure. The goal of an FFI is to detect a failure that has already occurred but cannot be observed during normal operations.

How do I determine the dormant failure rate for a protective device if I have no historical data?

Start with published reliability databases. The OREDA (Offshore and Onshore Reliability Data) handbook is the most widely used source for process industry safety devices. The EXIDA database covers safety instrumented systems specifically. Use a conservative (higher) failure rate initially, which will produce a shorter inspection interval. As you accumulate proof-test results, recalculate using your actual observed failure rate. Within three to five years of data collection, your site-specific rate will be more reliable than any handbook estimate.

Can I use a single FFI for all protective devices of the same type?

Only if they operate in identical conditions and service environments. A pressure relief valve protecting a clean water system has a different dormant failure rate than an identical valve protecting a sour gas service with potential for seat corrosion and disc fouling. Device type is a starting point; service conditions, process fluid, environmental exposure, and maintenance history all affect the actual failure rate and therefore the correct FFI.

What happens if a proof test finds a failed protective device?

Three actions are required immediately: isolate or compensate for the loss of protection (put in place a compensating safety measure if the process must continue), raise a corrective work order to restore the device to functional status, and record the failure details for FFI recalculation. The time the device was in the failed state should be estimated and documented for your unavailability records. The FFI should be reviewed immediately after the corrective work order is closed — if the device failed before the scheduled FFI expired, the interval needs to be shortened.

How does IEC 61511 influence failure-finding intervals?

IEC 61511 (Functional Safety of Safety Instrumented Systems) requires that proof-test intervals for safety instrumented functions (SIFs) be calculated to maintain the required Safety Integrity Level (SIL). For a SIL 1 function, the overall probability of failure on demand (PFDavg) must be between 0.01 and 0.1. The proof-test interval is one of the primary variables in the PFDavg calculation. An FFI that is too long will push PFDavg above the SIL 1 limit, meaning the safety layer is no longer meeting its design intent — even if it passes every proof test it receives.

Setting the right failure-finding intervals for your protective devices requires calculation, field data, and a system that enforces the schedule without exception. If your current program relies on spreadsheets or manual calendars, you have a hidden availability risk in the maintenance process itself — separate from the device failure rates you are trying to manage. Cryotos CMMS gives maintenance and reliability teams the scheduling engine, structured proof-test checklists, automatic corrective action workflows, and compliance-ready audit trails to run a failure-finding program that actually maintains the protective value your safety layers were designed to deliver. Book a free demo today to see how Cryotos handles protective device FFI scheduling across your entire asset register.

Want to Try Cryotos CMMS Today?

Get Free Demo

Let AI Take Control of Your Maintenance

Cryotos AI predicts failures, automates work orders, and simplifies maintenance—before problems slow you down.

Try AI-Powered CMMS
🡢