<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:media="http://search.yahoo.com/mrss/" >

<channel>
	<title>SIL Safe</title>
	<atom:link href="https://silsafe.net/feed/" rel="self" type="application/rss+xml" />
	<link>https://silsafe.net</link>
	<description>Safer Communities, Resilient Operations.</description>
	<lastBuildDate>Sat, 06 Jun 2026 03:51:32 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=7.0</generator>

<image>
	<url>https://silsafe.net/wp-content/uploads/2025/07/cropped-SIL-Safe-Logo-master-white-favicon-square-150x150.png</url>
	<title>SIL Safe</title>
	<link>https://silsafe.net</link>
	<width>32</width>
	<height>32</height>
</image> 
	<item>
		<title>The Longview Chemical Tank Implosion: What We Know So Far</title>
		<link>https://silsafe.net/longview-chemical-tank-implosion/</link>
					<comments>https://silsafe.net/longview-chemical-tank-implosion/#respond</comments>
		
		<dc:creator><![CDATA[mamerten]]></dc:creator>
		<pubDate>Sat, 06 Jun 2026 03:51:31 +0000</pubDate>
				<category><![CDATA[Advanced]]></category>
		<guid isPermaLink="false">https://silsafe.net/?p=6777</guid>

					<description><![CDATA[On May 26, 2026, a 900,000-gallon white liquor tank imploded at the Nippon Dynawave mill in Longview, Washington, killing eleven workers. This piece covers the facility, the chemical, and why the tank likely sat outside OSHA PSM and EPA RMP, plus what the Longview chemical tank implosion reveals about tank vacuum protection.]]></description>
										<content:encoded><![CDATA[
<p class="wp-block-paragraph">On May 26, 2026, a tank holding roughly 900,000 gallons of white liquor imploded at the Nippon Dynawave Packaging mill in Longview, Washington, killing eleven workers. The Longview chemical tank implosion is now under investigation by the U.S. Chemical Safety Board (CSB). It also raises a practical question for anyone running process equipment: how do the major US process safety regulations apply to a tank like this, and what can be said while the investigation is still open? The CSB review has only just begun, and nothing here is a root cause analysis.</p>



<h2 class="wp-block-heading">What is the Nippon Dynawave facility?</h2>



<p class="wp-block-paragraph">The mill sits in Longview, Washington, in the Columbia River industrial corridor about 50 miles north of Portland, Oregon, sharing that stretch of riverfront with other timber, paper, and chemical operations. This is an industrial area, not a residential one, and no surrounding neighborhood was evacuated after the failure. Longview itself is a city of roughly 38,000.</p>



<p class="wp-block-paragraph">Nippon Dynawave runs a kraft pulp and paper mill that produces bleached paperboard, the stiff stock used in milk cartons, beverage containers, cups, plates, and food packaging, plus kraft pulp for tissue and printing paper. Around 1,000 people work in the facility, roughly 550 in the pulp and paper mill and 450 in the packaging plant. The site is a US subsidiary of Japan-based Nippon Paper, which bought it in 2016.</p>



<p class="wp-block-paragraph">The Longview site has made pulp since 1931, and this paperboard mill has run since 1953, originally built by Weyerhaeuser. The infrastructure is roughly seven decades old. The age of the specific tank that imploded is not publicly known.</p>



<h2 class="wp-block-heading">What is white liquor?</h2>



<p class="wp-block-paragraph">White liquor is the cooking chemical kraft mills pump into their digesters to break wood chips down into pulp. It dissolves lignin, the natural binder that holds wood fibers together, so the cellulose fibers can be freed and turned into paper. The name is standard industry shorthand, one of a set: white liquor is the fresh cooking chemical, black liquor is the spent liquor after cooking, and green liquor is the intermediate in the chemical recovery cycle.</p>



<p class="wp-block-paragraph">Chemically, it is a water-based solution of sodium hydroxide (NaOH), sodium sulfide (Na2S), and sodium carbonate (Na2CO3). The combination of high temperature and high alkalinity does the cooking work.</p>



<p class="wp-block-paragraph">White liquor is strongly caustic, meaning basic. It is not a solvent like paint thinner, and it is not flammable: no flash point, no flammable vapor. The danger it carries is corrosivity, severe caustic burns on contact and damage to eyes, skin, and lungs on exposure.</p>



<h2 class="wp-block-heading">What occurred on May 26, 2026?</h2>



<p class="wp-block-paragraph">Around 7:15 AM local time, the tank imploded, collapsing inward on itself under vacuum. It did not explode.</p>



<p class="wp-block-paragraph">That distinction matters, and the early public account struggled with it. In the first day, authorities and reporters reached for three different words: explosion, then implosion, then rupture. Implosion is the accurate one. The tank failed under underpressure, a vacuum pulling its walls inward, not under the outward force of an overpressure or a blast.</p>



<p class="wp-block-paragraph">The timing made it worse. A shift change about fifteen minutes earlier had put an unusually large number of workers in the immediate area, across operations, an administrative space, and a break room. It became the deadliest industrial disaster in Washington state since 1930.</p>



<p class="wp-block-paragraph">The cause of the vacuum has not been established publicly, and there is no way to know it with certainty until the investigation runs its course.</p>



<p class="wp-block-paragraph">Sadly, eleven workers were killed. Eight others were injured, seven workers and one responding firefighter, with chemical burns and inhalation injuries.</p>



<h2 class="wp-block-heading">What causes a tank to implode?</h2>



<p class="wp-block-paragraph">A white liquor storage tank is almost certainly an atmospheric tank. In kraft mills, white liquor is held at close to ambient pressure, in tanks built for the narrow band near atmospheric, not in pressure vessels and not in vacuum vessels. &#8220;Atmospheric&#8221; is a pressure rating, not a description of the lid. Plenty of atmospheric tanks are fully enclosed, with fixed roofs, vents, and conservation devices. The term describes how much pressure the shell can hold, which is very little.</p>



<p class="wp-block-paragraph">That last point is the whole story of an implosion. An atmospheric tank is far weaker against vacuum than against pressure. A few inches of water column of underpressure, a trivial amount, can buckle a fixed-roof tank inward. Modest internal pressure it shrugs off; modest vacuum it cannot.</p>



<p class="wp-block-paragraph">Vacuum builds in a storage tank through ordinary mechanisms: pumping liquid out faster than air can flow back in through the vent, the contents cooling and contracting, condensation after a steam-out, or a vent that is simply blocked or frozen shut.</p>



<p class="wp-block-paragraph">Tank vacuum protection, the vacuum-relief side of a tank&#8217;s venting, is what stands between routine operation and a collapse like this. The consensus standard for sizing it, alongside the pressure side, is API Standard 2000, &#8220;Venting Atmospheric and Low-Pressure Storage Tanks.&#8221; In practice, the vacuum side tends to get less attention than the overpressure side, both when the venting is first sized and in the maintenance that follows. The standard covers both cases. Field habits often do not.</p>



<p class="wp-block-paragraph">This is also where process hazard analysis (PHA) could fall short, a gap SIL Safe often sees in hazard study reviews. A hazard and operability study (HAZOP) or a layer of protection analysis (LOPA) could give overpressure scenarios full rigor and treat low pressure as less likely. Vacuum is exactly the low-frequency, high-consequence scenario those studies exist to catch.</p>



<h2 class="wp-block-heading">What regulations apply?</h2>



<p class="wp-block-paragraph">This is a US incident, so the analysis below is US-specific; readers elsewhere operate under their own frameworks. One thread runs through all of it: being regulated and being hazardous are not the same thing.</p>



<h3 class="wp-block-heading">Does OSHA PSM apply to the white liquor tank?</h3>



<p class="wp-block-paragraph">OSHA&#8217;s Process Safety Management (PSM) standard (29 CFR 1910.119) reaches a process through one of two doors: a chemical on its Appendix A list at or above a threshold quantity, or a flammable liquid or gas at or above 10,000 pounds.</p>



<p class="wp-block-paragraph">White liquor opens neither. Appendix A names 137 highly hazardous chemicals, and white liquor&#8217;s constituents, sodium hydroxide, sodium sulfide, and sodium carbonate, are not among them. OSHA has said so directly: its published interpretations confirm that sodium hydroxide is not an Appendix A chemical. The other two are not listed either. And because white liquor is aqueous and non-flammable, the flammable threshold never comes into play.</p>



<p class="wp-block-paragraph">OSHA PSM does not reach the white liquor tank.</p>



<h3 class="wp-block-heading">Does EPA RMP apply?</h3>



<p class="wp-block-paragraph">EPA&#8217;s Risk Management Program (RMP) rule (40 CFR 68.130) works off its own lists: regulated toxic substances and regulated flammable substances. White liquor&#8217;s components appear on neither. The toxic list is built around inhalation hazards, gases and volatile toxics such as chlorine (Cl2), ammonia (NH3), hydrogen sulfide (H2S), and methyl isocyanate. The flammable list covers flammable gases and volatile flammable liquids. A non-volatile, non-flammable aqueous caustic fits none of that.</p>



<p class="wp-block-paragraph">EPA RMP does not apply.</p>



<h3 class="wp-block-heading">Do paper mills fall under PSM and RMP?</h3>



<p class="wp-block-paragraph">Often, yes, but not because of white liquor. A kraft mill usually does have PSM-covered and RMP-covered processes, and they sit in the bleach plant and the chlorine dioxide generation area, not on the white liquor side. The chemicals that trigger coverage are chlorine dioxide (ClO2), chlorine, sulfur dioxide (SO2), and methanol. Chlorine dioxide is unstable and is typically generated on-site from sodium chlorate (NaClO3), methanol, and sulfuric acid (H2SO4); methanol is flammable. Those are what put a mill under federal coverage.</p>



<p class="wp-block-paragraph">The obvious follow-up: if part of the mill is covered, isn&#8217;t all of it? Under OSHA&#8217;s interpretations, an interconnected process is treated as a single process, and if any part of it holds a listed chemical above the threshold, the whole interconnected process is covered. But in a kraft mill, the white liquor system and the bleach plant are not interconnected in that sense. They are linked by pulp moving from the cook side to the bleach side, not by any PSM-listed chemical flowing between them. So even where the bleach plant is covered, the white liquor tank generally falls outside that covered process. That said, this is an interpretation of federal law, and a lawyer should weigh in on any specific case.</p>



<p class="wp-block-paragraph">The practical read: it is more likely than not that this part of the plant was not covered by PSM or RMP at all.</p>



<p class="wp-block-paragraph">Two federal duties still apply regardless. The OSHA General Duty Clause, Section 5(a)(1), requires employers to keep a workplace free of recognized hazards whether or not a chemical is listed. The EPA Clean Air Act General Duty Clause, Section 112(r)(1), requires facilities handling hazardous substances to design and maintain a safe operation, listed or not. Those are the backstop when a tank falls outside both PSM and RMP on a listing technicality.</p>



<h3 class="wp-block-heading">Any additional requirements in the State of Washington</h3>



<p class="wp-block-paragraph">Washington enforces workplace safety itself, as a state-plan state, through the Department of Labor and Industries (L&amp;I) and its Division of Occupational Safety and Health (DOSH), not federal OSHA. It has its own rule modeled on federal PSM, WAC 296-67, with the same listed-chemical structure and thresholds, but no separate state list of additional regulated chemicals the way California does with CalARP, its accidental-release program. So the state PSM rule pulls the white liquor tank in no further than the federal one does.</p>



<p class="wp-block-paragraph">Where Washington does reach this facility directly is through WAC 296-79, &#8220;Safety standards for pulp, paper, and paperboard mills,&#8221; an industry-specific standard that applies regardless of PSM coverage. The Department of Ecology also holds dangerous-waste and spill authority, and is already engaged in the environmental response.</p>



<h3 class="wp-block-heading">Does this trigger functional safety via IEC 61511?</h3>



<p class="wp-block-paragraph">Likely no, not through regulation. This tank sits outside both PSM and RMP, and its white liquor system most likely isn&#8217;t part of any covered process at the mill. Since PSM and RMP are the route by which US regulation pulls in IEC 61511, the recognized and generally accepted good engineering practice (RAGAGEP) for safety instrumented systems (SIS), a tank that neither rule reaches isn&#8217;t pulled into IEC 61511 by them either. That is a regulatory point, not an engineering one: if a facility chose to guard a tank like this with a safety instrumented function (SIF) rather than a mechanical relief device, IEC 61511 would still be the standard to design, verify, and proof test it to. It just isn&#8217;t compelled here.</p>



<h2 class="wp-block-heading">What we do not know at the time of writing (June 2026)</h2>



<p class="wp-block-paragraph">Until more comes out, most likely through the CSB investigation, the important questions stay open.</p>



<ul class="wp-block-list">

<li>What pulled the tank into vacuum. A blocked or plugged vent, pump-out outrunning the make-up air, thermal contraction, or condensation after a steam-out could each do it.</li>


<li>The condition of the tank: its corrosion history in caustic service, when it was last inspected, and the age of the specific tank that failed.</li>


<li>Whether the tank vacuum protection was present and working. Whether a pressure and vacuum relief device or vacuum breaker was installed, sized per API 2000, and maintained, and whether the vacuum scenario was ever identified in the facility&#8217;s PHA.</li>


<li>The longer-term environmental and health impact of the white liquor that escaped into the site&#8217;s storm-drain and dike system.</li>

</ul>



<h2 class="wp-block-heading">Frequently Asked Questions</h2>



<h3 class="wp-block-heading">Q1. I used to work at a paper mill and it definitely had a functional safety program. You&#8217;re telling me this tank may not have been covered? Why is that?</h3>



<p class="wp-block-paragraph">Both things can be true. Your mill almost certainly did have a functional safety program, and it was almost certainly built around the bleach plant and chlorine dioxide generation, where the listed chemicals live and where PSM requires it. The white liquor system is a separate process. It is tied to the rest of the mill by pulp, not by any PSM-listed chemical, so it usually sits outside the covered process even at a mill that takes PSM seriously everywhere it applies. The program you remember was real. It just was likely not pointed at this tank, because the rules that drive those programs were not pointed there either.</p>



<h3 class="wp-block-heading">Q2. Why do so many states have extra regulations on top of PSM and RMP? Is that just a US thing?</h3>



<p class="wp-block-paragraph">To a large degree, yes. The US splits authority between federal and state government, and occupational and process safety is one of the areas where states are allowed to run their own show. About half the states operate their own OSHA-approved safety programs instead of deferring to federal OSHA, and they can be stricter than the federal floor, never weaker. On top of that, a few states have built their own chemical-accident programs, California&#8217;s CalARP being the best known, that add requirements beyond federal RMP.</p>



<p class="wp-block-paragraph">Most other countries run process safety through a single national framework, so the patchwork across US states is, in fact, fairly distinctive.</p>



<h3 class="wp-block-heading">Q3. My plant has atmospheric tanks of non-flammable caustic. After reading this, what should I be checking on Monday morning?</h3>



<p class="wp-block-paragraph">Start with the vent path. For each tank, confirm there is a vacuum relief device, that it is sized for your worst-case outflow plus thermal effects per API 2000, and that it is actually maintained, not painted over, corroded shut, or screened off by a bird guard nobody has looked at in years. Then check pump-out rates against that vent capacity, because the fastest way to pull a vacuum is to draw liquid out faster than air can come back in. Last, pull the PHA and see whether vacuum or low pressure shows up as a deviation with a credited safeguard. If the study spent ten pages on overpressure and one line on vacuum, that is your gap. None of this needs the CSB report.</p>



<h3 class="wp-block-heading">Q4. I work with safety instrumented systems for a living. Could a SIF have prevented this?</h3>



<p class="wp-block-paragraph">Possibly, and it is the right question to ask, though not automatically the right answer. The first line of defense against tank vacuum is mechanical: a properly sized, properly maintained vacuum relief device. That is simpler and more reliable than instrumentation for the basic breathing case, and it is what API 2000 is built around. A safety instrumented function earns its place when the mechanical layer cannot cover the credible scenarios on its own, say a pump-out rate that can outrun any practical vent, where you might credit a vacuum or low-pressure instrument that trips the outflow before the tank is endangered.</p>



<h2 class="wp-block-heading">Further Reading</h2>



<p class="wp-block-paragraph"><strong>Internal (silsafe.net):</strong></p>



<ul class="wp-block-list">

<li><a href="https://silsafe.net/hazard-and-risk-analysis-methods/">Hazard and Risk Analysis Methods: How HAZOP, What-If, LOPA, Risk Graph, FTA, ETA, and Bowtie Fit Together</a></li>


<li><a href="https://silsafe.net/layer-of-protection-analysis-lopa/">Layer of Protection Analysis (LOPA): The Engineer&#8217;s Guide to SIL Selection</a></li>


<li><a href="https://silsafe.net/hazard-and-risk-assessment-hra/">Hazard and Risk Assessment (H&amp;RA): The Foundation of Functional Safety</a></li>

</ul>



<p class="wp-block-paragraph"><strong>External:</strong></p>



<ul class="wp-block-list">

<li><a href="https://ecology.wa.gov/spills-cleanup/spills/spill-preparedness-response/responding-to-spill-incidents/spill-incidents/nippon-dynawave-industrial-incident" target="_blank" rel="noopener">Washington Department of Ecology, Nippon Dynawave Longview incident page</a></li>


<li><a href="https://www.csb.gov/us-chemical-safety-board-opens-investigation-into-fatal-chemical-tank-implosion-at-nippon-dynawave-paper-mill-in-washington/" target="_blank" rel="noopener">U.S. Chemical Safety Board, investigation announcement</a></li>


<li><a href="https://en.wikipedia.org/wiki/2026_Longview,_Washington_paper_mill_implosion" target="_blank" rel="noopener">Wikipedia, &#8220;2026 Longview, Washington paper mill implosion&#8221;</a></li>


<li><a href="https://www.pbs.org/newshour/nation/crews-recover-remains-of-6-of-9-workers-missing-after-chemical-tank-rupture-in-washington" target="_blank" rel="noopener">PBS NewsHour (Associated Press), recovery and casualty coverage</a></li>


<li><a href="https://www.osha.gov/laws-regs/oshact/section5-duties" target="_blank" rel="noopener">OSHA, OSH Act Section 5 (General Duty Clause)</a></li>


<li><a href="https://www.lni.wa.gov/safety-health/safety-rules/chapter-pdfs/WAC296-79.pdf" target="_blank" rel="noopener">Washington L&amp;I, WAC 296-79, &#8220;Safety standards for pulp, paper, and paperboard mills&#8221;</a></li>

</ul>



<script type="application/ld+json">
{
  "@context": "https://schema.org",
  "@type": "FAQPage",
  "mainEntity": [
    {
      "@type": "Question",
      "name": "I used to work at a paper mill and it definitely had a functional safety program. You're telling me this tank may not have been covered? Why is that?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Both things can be true. Your mill almost certainly did have a functional safety program, and it was almost certainly built around the bleach plant and chlorine dioxide generation, where the listed chemicals live and where PSM requires it. The white liquor system is a separate process. It is tied to the rest of the mill by pulp, not by any PSM-listed chemical, so it usually sits outside the covered process even at a mill that takes PSM seriously everywhere it applies. The program you remember was real. It just was likely not pointed at this tank, because the rules that drive those programs were not pointed there either."
      }
    },
    {
      "@type": "Question",
      "name": "Why do so many states have extra regulations on top of PSM and RMP? Is that just a US thing?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "To a large degree, yes. The US splits authority between federal and state government, and occupational and process safety is one of the areas where states are allowed to run their own show. About half the states operate their own OSHA-approved safety programs instead of deferring to federal OSHA, and they can be stricter than the federal floor, never weaker. On top of that, a few states have built their own chemical-accident programs, California's CalARP being the best known, that add requirements beyond federal RMP."
      }
    },
    {
      "@type": "Question",
      "name": "My plant has atmospheric tanks of non-flammable caustic. After reading this, what should I be checking on Monday morning?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Start with the vent path. For each tank, confirm there is a vacuum relief device, that it is sized for your worst-case outflow plus thermal effects per API 2000, and that it is actually maintained, not painted over, corroded shut, or screened off by a bird guard nobody has looked at in years. Then check pump-out rates against that vent capacity, because the fastest way to pull a vacuum is to draw liquid out faster than air can come back in. Last, pull the PHA and see whether vacuum or low pressure shows up as a deviation with a credited safeguard. If the study spent ten pages on overpressure and one line on vacuum, that is your gap. None of this needs the CSB report."
      }
    },
    {
      "@type": "Question",
      "name": "I work with safety instrumented systems for a living. Could a SIF have prevented this?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Possibly, and it is the right question to ask, though not automatically the right answer. The first line of defense against tank vacuum is mechanical: a properly sized, properly maintained vacuum relief device. That is simpler and more reliable than instrumentation for the basic breathing case, and it is what API 2000 is built around. A safety instrumented function earns its place when the mechanical layer cannot cover the credible scenarios on its own, say a pump-out rate that can outrun any practical vent, where you might credit a vacuum or low-pressure instrument that trips the outflow before the tank is endangered."
      }
    }
  ]
}
</script>

]]></content:encoded>
					
					<wfw:commentRss>https://silsafe.net/longview-chemical-tank-implosion/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>Functional Safety Is Not the Same as Occupational Safety</title>
		<link>https://silsafe.net/functional-safety-vs-occupational-safety/</link>
					<comments>https://silsafe.net/functional-safety-vs-occupational-safety/#respond</comments>
		
		<dc:creator><![CDATA[mamerten]]></dc:creator>
		<pubDate>Sat, 30 May 2026 19:43:25 +0000</pubDate>
				<category><![CDATA[Beginner]]></category>
		<guid isPermaLink="false">https://silsafe.net/?p=6690</guid>

					<description><![CDATA[Occupational safety, process safety, and functional safety are not the same discipline, and treating them as interchangeable is a common mistake at process facilities. Different training, different credentials, different daily work. Where the boundaries blur, the resulting errors are predictable and costly.]]></description>
										<content:encoded><![CDATA[
<p class="wp-block-paragraph">Suppose a facility realizes functional safety applies to them. Maybe an auditor flagged it. Maybe a corporate directive came down. Maybe a near-miss made it impossible to ignore. The reaction is predictable: the Health, Safety, and Environment (HSE) team will handle it. It sounds reasonable on the face of it. Both have &#8220;safety&#8221; in the name. Both involve hazards. Both involve compliance. It is also wrong.</p>



<h2 class="wp-block-heading">The Disciplines Are Not the Same</h2>



<p class="wp-block-paragraph"><u>Occupational safety</u> is a discipline and a professional track. <u>Process safety</u> is an engineering discipline. <u>Functional safety</u> is an engineering discipline that follows a specific set of standards (IEC 61511 and IEC 61508) to implement risk reduction within process safety.</p>



<ul class="wp-block-list">
<li><strong>Occupational safety</strong> protects workers from workplace hazards: falls, struck-by, chemical exposure, ergonomics, noise, confined spaces, electrical contact.</li>



<li><strong>Process safety</strong> is the engineering discipline for preventing major accidents from the process itself, such as releases, fires, explosions, and runaway reactions.</li>



<li><strong>Functional safety</strong> sits inside process safety as the standards-based engineering discipline (IEC 61511 and IEC 61508) for reducing process risk through instrumented protective functions.</li>
</ul>



<p class="wp-block-paragraph">A Venn diagram makes the relationship clear. Occupational safety is a separate circle. Process safety is a larger circle. Functional safety is a circle inside process safety.</p>



<p class="wp-block-paragraph">Functional safety does not overlap with occupational safety as disciplines, though the two are adjacent and several site activities bring them together. They differ at the technical level, and skill sets do not transfer between them.</p>



<h2 class="wp-block-heading">The Professionals and Their Backgrounds</h2>



<p class="wp-block-paragraph">The discipline distinction shows up cleanly in who does the work, where they train, and what credentials or certifications they hold.</p>



<h3 class="wp-block-heading">Occupational safety professionals</h3>



<p class="wp-block-paragraph">The job type or discipline goes by several names depending on company and region: HSE (Health, Safety, and Environment), EHS (Environment, Health, and Safety), SHE (Safety, Health, and Environment), and HSSE (with Security added). The people doing the work carry an even longer list of job titles:</p>



<ul class="wp-block-list">
<li>Safety Manager</li>



<li>Safety Engineer</li>



<li>Safety Specialist</li>



<li>Safety Coordinator</li>



<li>HSE / EHS Manager</li>



<li>Health and Safety Officer</li>



<li>Industrial Hygienist</li>



<li>Occupational Health Specialist</li>



<li>Director of EHS / HSE</li>
</ul>



<p class="wp-block-paragraph">At SIL Safe, we refer to all of these roles collectively as occupational safety. Backgrounds run through occupational health and safety, industrial hygiene, environmental health, and sometimes engineering or kinesiology. Industrial hygiene is the field that figures out what workers are being exposed to on the job (chemicals, dust, noise, heat) and gets the exposure down to safe levels.</p>



<p class="wp-block-paragraph">Professional bodies include the Board of Certified Safety Professionals (BCSP), the American Industrial Hygiene Association (AIHA), the UK Institution of Occupational Safety and Health (IOSH), the British Occupational Hygiene Society (BOHS), and the International Occupational Hygiene Association (IOHA) as the global umbrella.</p>



<p class="wp-block-paragraph">Credentials include the Certified Safety Professional (CSP), the Certified Industrial Hygienist (CIH), and the Chartered Member of IOSH (CMIOSH).</p>



<p class="wp-block-paragraph">Day-to-day work is workplace chemical exposure monitoring (airborne contaminants, dust, vapors), Personal Protective Equipment (PPE) programs, ergonomic assessments, incident investigation for personnel injuries, and regulatory compliance for worker protection.</p>



<h3 class="wp-block-heading">Process safety professionals</h3>



<p class="wp-block-paragraph">Backgrounds are typically chemical engineering, though mechanical, electrical and controls, and other engineering disciplines with relevant process experience are also represented.</p>



<p class="wp-block-paragraph">Professional bodies include the Center for Chemical Process Safety (CCPS, part of the American Institute of Chemical Engineers, AIChE), the Institution of Chemical Engineers (IChemE) in the UK, and the Mary Kay O&#8217;Connor Process Safety Center on the academic side.</p>



<p class="wp-block-paragraph">Credentials include the Certified Process Safety Professional (CCPSC) via CCPS, IChemE professional registration, and often a Professional Engineer license (PE in the United States, P.Eng in Canada, CEng in the UK) in chemical, electrical, or other relevant engineering field.</p>



<p class="wp-block-paragraph">Day-to-day work is Hazard and Operability (HAZOP) and Layer of Protection Analysis (LOPA) facilitation, mechanical integrity programs, management of change, and Process Hazard Analysis (PHA) revalidation.</p>



<h3 class="wp-block-heading">Functional safety professionals</h3>



<p class="wp-block-paragraph">Functional safety professionals come from the same engineering backgrounds as process safety professionals, with deeper instrumentation, controls, and reliability engineering experience. The same professional bodies apply (AIChE, IChemE, and equivalents cover functional safety to a degree), with additional bodies that focus directly on it: TÜV Rheinland, TÜV SÜD, Exida, and the CFSE Governance Board.</p>



<p class="wp-block-paragraph">The directly relevant credentials are the Certified Functional Safety Expert (CFSE) and Certified Functional Safety Professional (CFSP) from Exida, and TÜV FS Eng. Additionally, the International Society of Automation (ISA) and Underwriters Laboratories (UL) offer certifications.</p>



<p class="wp-block-paragraph">Day-to-day work spans the full life-cycle: Safety Instrumented System (SIS) design and verification, proof testing oversight, functional safety audits, training, procedure maintenance, and competency matrix management, all governed by IEC 61511.</p>



<p class="wp-block-paragraph">Occupational safety credentials and process / functional safety credentials do not cross. A CIH is not on a track to become a CFSE. The training, the math, and the body of standards are different worlds. Process safety and functional safety credentials, on the other hand, do cross. The underlying engineering background is the same, and many practitioners hold both. A CCPSC who is also a CFSE is a common and useful profile.</p>



<h2 class="wp-block-heading">Where Functional Safety Meets Occupational Safety</h2>



<p class="wp-block-paragraph">The disciplines are not sealed off from each other.</p>



<ul class="wp-block-list">
<li><strong>Hazardous area classification.</strong> Process safety defines the zones (Class I Div 1/2, Zone 0/1/2) based on the flammable inventory and release sources. Occupational safety enforces the consequences in the field: no uncertified tools, no non-intrinsically-safe radios, no work without the right PPE and procedures. A worker cannot walk into a Zone 1 area with an off-the-shelf iPad or their phone in their pocket. The classification is a process safety output. The daily enforcement is an occupational discipline.</li>



<li><strong>Personal gas monitors (flammable gas / Lower Explosive Limit, hydrogen sulfide, oxygen).</strong> A worker wearing a personal monitor is an occupational control protecting that worker from a process hazard. The hazard originated in the process. The control is occupational. Both disciplines have a stake.</li>



<li><strong>Confined space entry.</strong> Occupational program (permits, attendants, retrieval), but the atmosphere being tested exists because of process residues and the equipment being entered is process equipment. The entry decision depends on understanding the process chemistry: what was last in the vessel, what reactions are possible, what residual hazards remain after isolation.</li>
</ul>



<p class="wp-block-paragraph">Overlap at the activity level is real and common practice in a healthy organization. Overlap at the competency level is minimal. The HSE professional running the confined space program is not qualified to verify a Safety Integrity Level 2 (SIL 2) Safety Instrumented Function (SIF). The functional safety engineer verifying that SIF is not qualified to write the confined space program. The activities cross. The competencies do not.</p>



<h2 class="wp-block-heading">Common Mistakes</h2>



<p class="wp-block-paragraph"><strong>Strong personnel safety record interpreted as process safety health.</strong> A facility logs years without a recordable injury, leadership concludes the safety program is working, and a major release blindsides them six months later. Texas City, Buncefield, and Jaipur all happened at facilities with strong personnel safety records. The Baker Panel report after Texas City made this pattern explicit: BP had been managing personal safety while neglecting process safety. Personnel injury rate measures slips, trips, hand cuts, and ergonomic strains. None of those metrics tell you anything about whether the SIS will work on demand or whether the relief valves are adequately sized.</p>



<ul class="wp-block-list">
<li><strong>Leadership asking for process safety KPIs and getting handed occupational ones.</strong> Boards and executives ask &#8220;how safe are we?&#8221; and HSE shows them Total Recordable Incident Rate (TRIR), Lost Time Incident Rate (LTIR), and Days Away, Restricted, or Transferred (DART) trending down. The metrics are real and the trend may be real, but they answer a different question. Process safety health is measured by leading indicators tied to Process Safety Management (PSM) and Risk-Based Process Safety (RBPS) elements: management of change (MOC) backlog, PHA action items overdue, proof test completion rate, SIF demand rate. Personnel injury statistics are not those indicators.</li>



<li><strong>Putting functional safety under the HSE function.</strong> Org chart logic (&#8220;safety is safety, put it under the safety guy&#8221;) or cost pressure pushes functional safety, the Safety Requirements Specification (SRS), and the Hazard and Risk Assessment (H&amp;RA) under HSE ownership. The work either gets skipped or gets done badly by someone with the wrong training, and the deliverables end up in the wrong hands organizationally.</li>
</ul>



<h2 class="wp-block-heading">Q&amp;A</h2>



<h3 class="wp-block-heading">Q1. My boss says our safety manager can handle the SIL stuff. What do I tell him?</h3>



<p class="wp-block-paragraph">Tell him that your safety manager is properly an Occupational Safety Manager by role, even though the actual title varies (Safety Manager, HSE Manager, EHS Director, and so on). An occupational safety professional is fundamentally a different animal than a functional safety engineer. The safety manager probably runs PPE programs, exposure monitoring, ergonomic assessments, and injury investigations. SIL work is a different domain: IEC 61511, SIS architecture, failure modes, reliability mathematics, proof testing. The training does not transfer, and the credentials do not overlap. A CSP is not on a track to a CFSE, and a few hours of online refresher will not bridge the gap.</p>



<p class="wp-block-paragraph">In the safety manager&#8217;s defense, occupational safety professionals carry a confusing array of job titles, and the word &#8220;safety&#8221; appearing in both disciplines does not help. The conflation is understandable. The substitution is not. The two roles can sit in the same organization and respect each other, but one cannot substitute for the other.</p>



<h3 class="wp-block-heading">Q2. Isn&#8217;t process safety just functional safety at a bigger scale?</h3>



<p class="wp-block-paragraph">It is the other way around. Process safety is the broader engineering discipline. Functional safety is a subset of process safety, tied to the standards (IEC 61511 and IEC 61508) that govern instrumented protective functions. HAZOP, LOPA, mechanical integrity, management of change, and Pre-Startup Safety Review are all process safety activities that are not functional safety.</p>



<h3 class="wp-block-heading">Q3. We have an excellent TRIR. Doesn&#8217;t that mean our process safety program is working?</h3>



<p class="wp-block-paragraph">No, but there is some overlap. A process safety incident with injuries does show up in TRIR. The metric itself is dominated by occupational events, though: slips, trips, hand cuts, strains. A facility can drive TRIR to industry-leading levels while its SIS portfolio is overdue for proof testing, its MOC backlog is unmanaged, and its layers of protection have eroded. The Baker Panel report after Texas City documented exactly that pattern at BP. TRIR is a useful occupational metric. It is not a leading indicator of functional safety or process safety health.</p>



<h3 class="wp-block-heading">Q4. If a personal gas monitor protects a worker from a process release, why isn&#8217;t it a LOPA credit?</h3>



<p class="wp-block-paragraph">A personal gas monitor can sometimes meet IPL (Independent Protection Layer) criteria within a functional safety LOPA, but the claim is often made without testing them. An IPL must satisfy three requirements:</p>



<ul class="wp-block-list">
<li><strong>Independence.</strong> The protection layer cannot share failure modes with the initiating cause or with other claimed layers.</li>



<li><strong>Specificity to the scenario.</strong> The monitor must detect the specific release and give the wearer enough time to act before harm.</li>



<li><strong>Auditability.</strong> Calibration records, bump test records, and battery and gas check records must be in place and reviewable.</li>
</ul>



<p class="wp-block-paragraph">Take two scenarios:</p>



<ul class="wp-block-list">
<li><strong>Slow flange leak detected on a routine round.</strong> The wearer has time to back away, and an audited monitor program can credibly support an IPL claim for that scenario.</li>



<li><strong>Line rupture that puts the wearer in a vapor cloud within seconds.</strong> The same monitor cannot stand as an IPL, because the detection-to-action time is not there.</li>
</ul>



<p class="wp-block-paragraph">The other common failures are claiming credit against the wrong consequence type (the monitor does nothing about equipment damage, fire escalation, or fence-line population) and double-counting the same monitor across multiple scenarios where independence does not hold. A personal gas monitor is a real occupational control that saves lives. It is not, by default, a LOPA credit, and the judgment can be tricky.</p>



<h3 class="wp-block-heading">Q5. Does the functional safety person need to work with and talk to the HSE / occupational safety team?</h3>



<p class="wp-block-paragraph">Absolutely, they should be considered colleagues. Hazardous area classification needs both sides aligned: process safety defines the zones, and HSE enforces the field consequences. Personal gas monitor programs cross both disciplines, with the monitor itself an occupational control protecting against a process hazard. Confined space work that involves SIS equipment needs functional safety input on bypass and recovery procedures, while the HSE program owns the entry permit itself. The relationship is collaborative, not hierarchical. Each side owns its discipline.</p>



<h2 class="wp-block-heading">Further Reading</h2>



<h3 class="wp-block-heading">Internal (SIL Safe)</h3>



<ul class="wp-block-list">
<li><a href="https://silsafe.net/hazard-and-risk-assessment-hra/">Hazard and Risk Assessment (H&amp;RA): The Foundation of Functional Safety</a></li>



<li><a href="https://silsafe.net/layer-of-protection-analysis-lopa/">Layer of Protection Analysis (LOPA): The Engineer&#8217;s Guide to SIL Selection</a></li>



<li><a href="https://silsafe.net/sil-verification-three-gates/">SIL Verification: The Three Gates Every SIF Must Clear</a></li>
</ul>



<h3 class="wp-block-heading">External</h3>



<ul class="wp-block-list">
<li><a href="https://webstore.iec.ch/en/publication/5527" target="_blank" rel="noopener">IEC 61511 standard page (IEC)</a></li>



<li><a href="https://www.hse.gov.uk/comah/" target="_blank" rel="noopener">UK Health and Safety Executive: Control of Major Accident Hazards (COMAH)</a></li>



<li><a href="https://www.aiha.org/ih-careers/discover-industrial-hygiene" target="_blank" rel="noopener">American Industrial Hygiene Association: Discover Industrial Hygiene</a></li>



<li><a href="https://ioha.net/" target="_blank" rel="noopener">International Occupational Hygiene Association (IOHA)</a></li>
</ul>



<p class="wp-block-paragraph">Functional safety is complex, and the stakes are high. If you have questions about your SIS design, SIL verification, or where to start with IEC 61511, the team at SIL Safe is here to help. Reach out to us today.</p>



<script type="application/ld+json">
{
  "@context": "https://schema.org",
  "@type": "FAQPage",
  "mainEntity": [
    {
      "@type": "Question",
      "name": "My boss says our safety manager can handle the SIL stuff. What do I tell him?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Tell him that your safety manager is properly an Occupational Safety Manager by role, even though the actual title varies (Safety Manager, HSE Manager, EHS Director, and so on). An occupational safety professional is fundamentally a different animal than a functional safety engineer. The safety manager probably runs PPE programs, exposure monitoring, ergonomic assessments, and injury investigations. SIL work is a different domain: IEC 61511, SIS architecture, failure modes, reliability mathematics, proof testing. The training does not transfer, and the credentials do not overlap. A CSP is not on a track to a CFSE, and a few hours of online refresher will not bridge the gap."
      }
    },
    {
      "@type": "Question",
      "name": "Isn't process safety just functional safety at a bigger scale?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "It is the other way around. Process safety is the broader engineering discipline. Functional safety is a subset of process safety, tied to the standards (IEC 61511 and IEC 61508) that govern instrumented protective functions. HAZOP, LOPA, mechanical integrity, management of change, and Pre-Startup Safety Review are all process safety activities that are not functional safety."
      }
    },
    {
      "@type": "Question",
      "name": "We have an excellent TRIR. Doesn't that mean our process safety program is working?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "No, but there is some overlap. A process safety incident with injuries does show up in TRIR. The metric itself is dominated by occupational events, though: slips, trips, hand cuts, strains. A facility can drive TRIR to industry-leading levels while its SIS portfolio is overdue for proof testing, its MOC backlog is unmanaged, and its layers of protection have eroded. The Baker Panel report after Texas City documented exactly that pattern at BP. TRIR is a useful occupational metric. It is not a leading indicator of functional safety or process safety health."
      }
    },
    {
      "@type": "Question",
      "name": "If a personal gas monitor protects a worker from a process release, why isn't it a LOPA credit?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "A personal gas monitor can sometimes meet IPL (Independent Protection Layer) criteria within a functional safety LOPA, but the claim is often made without testing them. An IPL must satisfy three requirements:"
      }
    },
    {
      "@type": "Question",
      "name": "Does the functional safety person need to work with and talk to the HSE / occupational safety team?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Absolutely, they should be considered colleagues. Hazardous area classification needs both sides aligned: process safety defines the zones, and HSE enforces the field consequences. Personal gas monitor programs cross both disciplines, with the monitor itself an occupational control protecting against a process hazard. Confined space work that involves SIS equipment needs functional safety input on bypass and recovery procedures, while the HSE program owns the entry permit itself. The relationship is collaborative, not hierarchical. Each side owns its discipline."
      }
    }
  ]
}
</script>
]]></content:encoded>
					
					<wfw:commentRss>https://silsafe.net/functional-safety-vs-occupational-safety/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>The Garden Grove Chemical Incident: What We Know So Far</title>
		<link>https://silsafe.net/garden-grove-chemical-incident/</link>
					<comments>https://silsafe.net/garden-grove-chemical-incident/#respond</comments>
		
		<dc:creator><![CDATA[mamerten]]></dc:creator>
		<pubDate>Tue, 26 May 2026 22:02:23 +0000</pubDate>
				<category><![CDATA[Advanced]]></category>
		<guid isPermaLink="false">https://silsafe.net/?p=6633</guid>

					<description><![CDATA[A tank of methyl methacrylate at a GKN Aerospace facility in Garden Grove, California has been the center of a multi-day chemical emergency. This article walks through the facility, the chemistry of the hazard, and the US regulatory analysis of whether OSHA PSM, EPA RMP, and California's CalARP apply.]]></description>
										<content:encoded><![CDATA[
<p class="wp-block-paragraph">A tank of methyl methacrylate (MMA) at a GKN Aerospace facility in Garden Grove, California has been the center of a multi-day chemical emergency that displaced tens of thousands of residents and put roughly nine square miles of Orange County under evacuation. The situation is active as of writing, and what follows is a US regulatory and functional safety perspective, not a root cause analysis.</p>



<h2 class="wp-block-heading">What is the GKN Aerospace facility?</h2>



<p class="wp-block-paragraph">The site is operated by GKN Aerospace Transparency Systems Inc., a subsidiary of GKN Aerospace, and sits on roughly 15.5 acres in Garden Grove, California. GKN has been at this location since 2004.</p>



<p class="wp-block-paragraph">The facility makes the windows and canopies of major aircraft. The Garden Grove site designs and produces cockpit canopies, fighter canopies, and passenger windows, including the F-35 canopy and transparencies for the Boeing 787 and 737, the Airbus A350, HondaJet, and the Bombardier C-Series. Those transparencies are made from polymethyl methacrylate (PMMA, also known as acrylic, Plexiglas, or Lucite), and that is why a substantial inventory of MMA is on site. MMA is the monomer feedstock for making the polymer.</p>



<h2 class="wp-block-heading">The tank and the chemical</h2>



<p class="wp-block-paragraph">Methyl methacrylate is a colorless flammable liquid monomer with a fruity odor. Its flash point is 36°F (2°C), which puts it in the same general &#8220;ignites at room temperature&#8221; category as gasoline (flash point roughly -45°F). For context on the other end of the common fuels, diesel sits around 125°F. The boiling point of MMA is 214°F (101°C), so it stays liquid through any realistic ambient condition.</p>



<p class="wp-block-paragraph">The tank has a capacity of approximately 34,000 gallons; reporting indicates an MMA inventory of 6,000 to 7,000 gallons at the time of the incident. From photos circulating in news coverage, the tank appears to be fully enclosed, fixed-roof construction with external insulation that has since been removed by responders to aid cooling. Enclosure alone does not determine whether the tank is &#8220;atmospheric&#8221; in the regulatory sense, which has a specific technical meaning that matters for the PSM analysis.</p>



<p class="wp-block-paragraph">The reason MMA is stored with care is that it is a reactive monomer. If temperature control is lost or if the inhibitor (typically methyl ether of hydroquinone, MEHQ, at low ppm) is depleted, MMA can undergo exothermic polymerization, where the chemical reacts with itself to form the polymer. That reaction releases heat, which accelerates the reaction, which releases more heat. That is the runaway pattern, and once it starts there is no easy way to stop it. It is the central hazard of bulk MMA storage and the reason a tank of this material is taken so seriously.</p>



<h2 class="wp-block-heading">What occurred in May 2026</h2>



<p class="wp-block-paragraph">The incident began on May 21, 2026, when the storage tank started overheating and venting vapors. Orange County Fire Authority responded that afternoon, and over the next several days the Garden Grove chemical incident escalated. The evacuation zone started as a roughly one-mile radius around the facility, then expanded to cover approximately nine square miles, ultimately displacing somewhere between 44,000 and 50,000 residents. The Governor of California declared a state of emergency.</p>



<p class="wp-block-paragraph">As of writing, stabilization efforts are ongoing. Reports indicate that a crack in the tank may have vented pressure and reduced the immediate explosion risk, but the data is preliminary. The Orange County District Attorney has opened a tip line for information related to GKN&#8217;s operations.</p>



<p class="wp-block-paragraph">The cause of the temperature excursion has not been publicly established, and we do not speculate.</p>



<h2 class="wp-block-heading">What regulations apply?</h2>



<p class="wp-block-paragraph">The Garden Grove chemical incident plays out under three US frameworks: OSHA&#8217;s Process Safety Management (PSM) standard, EPA&#8217;s Risk Management Program (RMP), and California&#8217;s CalARP. International readers will be familiar with frameworks like Seveso (EU) and COMAH (UK), which operate on a different basis and are not within scope here.</p>



<h3 class="wp-block-heading">Does OSHA PSM apply?</h3>



<p class="wp-block-paragraph">PSM (29 CFR 1910.119) has two pathways for coverage. One is the list of specifically named highly hazardous chemicals in Appendix A. MMA is not on that list. The other pathway, in 1910.119(a)(1)(ii), covers any process involving a flammable liquid with a flash point below 100°F in quantities of 10,000 lb or more at one location.</p>



<p class="wp-block-paragraph">MMA clears both criteria easily:</p>



<ul class="wp-block-list">
<li>Flash point of 36°F is well below the 100°F threshold.</li>



<li>At a density of 0.94 g/cm³, even the lower-end reported inventory of 6,000 gallons is roughly 47,000 lb.</li>
</ul>



<p class="wp-block-paragraph">The MMA storage at GKN satisfies the PSM flammable liquid trigger. The standard has an important exemption.</p>



<p class="wp-block-paragraph">The atmospheric tank exemption in 1910.119(a)(1)(ii)(B) excludes flammable liquids stored in atmospheric tanks kept below their normal boiling point without chilling or refrigeration. Two things to get right here:</p>



<ul class="wp-block-list">
<li>&#8220;Atmospheric tank&#8221; is a pressure-rating definition, not a description of the lid. Many atmospheric tanks have fixed roofs, vents, and conservation devices. A tank can be fully enclosed and still be atmospheric.</li>



<li>The &#8220;below normal boiling point without chilling&#8221; condition is satisfied by ordinary ambient storage of MMA. MMA boils at 214°F and an unrefrigerated tank at typical ambient temperature is well below that.</li>
</ul>



<p class="wp-block-paragraph">On the pressure-rating side, the GKN tank is almost certainly atmospheric. Bulk MMA storage in industry is standard atmospheric tank service. The vapor pressure of MMA at typical storage temperature is low, and there is no operational reason to put MMA in a pressure-rated vessel. The major MMA producers describe atmospheric storage practice on their technical data sheets. So the pressure-rating element of the test is not the issue. The issue, as covered further below, is the chilling and refrigeration clause.</p>



<p class="wp-block-paragraph">The atmospheric tank exemption is the reason every gas station in the US is not subject to PSM and RMP. OSHA&#8217;s view was that ordinary atmospheric storage of flammable liquids is adequately covered by NFPA 30, API 650, OSHA&#8217;s own 1910.106 flammable liquids standard, and local fire code.</p>



<p class="wp-block-paragraph">Interconnection cuts the other direction. If something else on the GKN site triggers PSM (a covered process, a covered chemical inventory elsewhere), and the MMA tank is interconnected to it through piping, the MMA tank gets pulled into scope by virtue of that connection. So even if the MMA tank by itself qualifies for the atmospheric tank exemption, the tank may still be PSM-covered through the rest of the facility. Whether that is the case at GKN depends on facility details that are not public.</p>



<h4 class="wp-block-heading">Regulatory uncertainty of a chiller</h4>



<p class="wp-block-paragraph">One more wrinkle, and this is where engineering ends and law begins. According to OCFA statements during the response, the MMA tank had an active cooling system designed to keep the contents at around 50°F, and the failure of that cooling system was central to the incident. The atmospheric tank exemption requires storage &#8220;kept below their normal boiling point without benefit of chilling or refrigeration.&#8221; That clause supports two reads:</p>



<ul class="wp-block-list">
<li>&#8220;No chilling of any kind.&#8221; Any chilling on the tank disqualifies the exemption.</li>



<li>&#8220;No chilling needed to keep below boiling.&#8221; Only chilling that exists to manage boiling point disqualifies the exemption. Chilling for other purposes (polymerization control, viscosity control) does not.</li>
</ul>



<p class="wp-block-paragraph">MMA boils at 214°F and would stay well below boiling at any reasonable ambient temperature with no cooling at all. The cooling system at GKN was almost certainly there to suppress the polymerization reaction, not to keep the liquid below boiling. So a refrigerated MMA tank sits squarely in the gap between those two readings. This is the kind of question engineers end up handing to lawyers, because regulatory interpretation isn&#8217;t an engineering call.</p>



<h3 class="wp-block-heading">Does EPA RMP apply?</h3>



<p class="wp-block-paragraph">EPA&#8217;s Risk Management Program (RMP, 40 CFR Part 68) works differently from PSM. Rather than a generic flammable-liquid trigger, RMP applies only to a curated list of regulated substances in 40 CFR 68.130. MMA is not on that list.</p>



<p class="wp-block-paragraph">The reason MMA is not on the list comes down to chemistry. EPA built Table 3 (the flammable substances portion) using the criteria for a flammability rating of 4 on the NFPA 704 fire diamond, which require flash point below 73°F AND boiling point below 100°F. MMA&#8217;s flash point of 36°F clears the first hurdle, but its boiling point of 214°F fails the second by a wide margin. The substances that did make Table 3 are predominantly gases or near-gases at ambient: propane, ethylene, butadiene, methane. These are the materials that flash to vapor on release and produce expanding vapor clouds. MMA does not fit that profile.</p>



<p class="wp-block-paragraph">Therefore, RMP does not apply to MMA storage at GKN.</p>



<h3 class="wp-block-heading">California regulation: CalARP</h3>



<p class="wp-block-paragraph">California has a state-level analog called CalARP (California Accidental Release Prevention Program, Title 19 CCR §5130.6). MMA is not on any CalARP table, so CalARP does not apply either.</p>



<p class="wp-block-paragraph">Even if any of these regulations did apply, none of them directly require IEC 61511. They require &#8220;recognized and generally accepted good engineering practices&#8221; (RAGAGEP) for safety-critical systems, and IEC 61511 is the RAGAGEP for safety instrumented systems in the process industry. The hook into functional safety is by reference, not by name.</p>



<h2 class="wp-block-heading">Was OSHA PSM actually being applied at GKN?</h2>



<p class="wp-block-paragraph">There is no way to know with certainty from outside the fence line.</p>



<p class="wp-block-paragraph">OSHA does not maintain a public registry of PSM-covered facilities. Coverage is self-determining; an operator evaluates its own processes against the standard, and verification happens through inspections.</p>



<p class="wp-block-paragraph">Prior OSHA inspections and worker complaints at the Garden Grove site have been reported in recent coverage, along with a 2018 California Department of Industrial Relations penalty involving machinery and fabrication concerns. None of that confirms whether PSM applied to the MMA storage. Those inspections may have addressed occupational safety topics like machine guarding, lockout-tagout, and fall protection rather than process safety. The two are distinct disciplines, often handled by different teams inside a facility and inspected against different standards.</p>



<p class="wp-block-paragraph">GKN&#8217;s public statement says the company &#8220;follows all standard safety protocols and processes and is regularly audited by numerous state and federal agencies.&#8221; The statement does not specifically address PSM coverage of the MMA storage.</p>



<h2 class="wp-block-heading">Is there evidence GKN has a functional safety program?</h2>



<p class="wp-block-paragraph">Not publicly. Functional safety programs (IEC 61511 SIS implementations, SIL verifications, safety requirements specifications) are not typically disclosed publicly by operators. They live in internal documentation and are reviewed by regulators or third-party assessors, not posted on company websites.</p>



<p class="wp-block-paragraph">The Orange County DA tip line and ongoing investigations suggest authorities are looking at safety system adequacy, but no findings are public.</p>



<h2 class="wp-block-heading">Has the CSB said anything?</h2>



<p class="wp-block-paragraph">The U.S. Chemical Safety and Hazard Investigation Board (CSB) is an independent federal agency that investigates industrial chemical accidents. As of writing, the CSB has not publicly announced a deployment to the Garden Grove chemical incident.</p>



<h2 class="wp-block-heading">What we do not know at the time of writing (May 2026)</h2>



<ul class="wp-block-list">
<li>The cause of the temperature excursion. Public reporting has not established what initiated the runaway, and we do not speculate.</li>



<li>What safety systems were in place at GKN, and how they performed.</li>



<li>Whether GKN&#8217;s specific tank configuration qualifies for the atmospheric tank exemption. That requires facility data not in public reporting.</li>



<li>Long-term health and environmental impact in the affected zone.</li>



<li>Anything that would require information held by GKN, OSHA, EPA, or the Orange County Fire Authority but not yet released publicly.</li>
</ul>



<h2 class="wp-block-heading">Q&amp;A</h2>



<h3 class="wp-block-heading">Q1: What is MMA and why is it stored in tanks like this?</h3>



<p class="wp-block-paragraph">MMA is methyl methacrylate, a flammable liquid organic compound. It is a monomer, meaning a small molecule that chemically links with copies of itself to form a long-chain polymer. The polymer in MMA&#8217;s case is polymethyl methacrylate (PMMA, also called acrylic or Plexiglas). Bulk MMA storage is common anywhere PMMA is manufactured.</p>



<h3 class="wp-block-heading">Q2: Is methyl methacrylate covered by OSHA PSM?</h3>



<p class="wp-block-paragraph">Possibly. MMA is not on the OSHA PSM Appendix A list of named highly hazardous chemicals, but PSM has a separate pathway for flammable liquids with flash point below 100°F in quantities of 10,000 lb or more. MMA meets both criteria. The complication is the atmospheric tank exemption, which excludes flammable liquids in atmospheric tanks kept below boiling point without chilling or refrigeration. For an unrefrigerated MMA tank at ordinary ambient conditions, the exemption likely applies. For a refrigerated MMA tank like the one at GKN, applicability is genuinely unsettled and depends on how the chilling clause is interpreted.</p>



<h3 class="wp-block-heading">Q3: The tank at GKN has a lid on it. Doesn&#8217;t that mean it&#8217;s not an atmospheric tank under PSM?</h3>



<p class="wp-block-paragraph">No. &#8220;Atmospheric tank&#8221; in 1910.119 is a pressure-rating definition, not a description of whether the tank is open to the air. Most atmospheric tanks have fixed roofs, vents, manways, and conservation devices. A tank can be fully enclosed and still be atmospheric. What disqualifies a tank from being atmospheric is being designed and rated for higher internal pressure than atmospheric service. The presence of a lid is not the deciding factor.</p>



<h3 class="wp-block-heading">Q4: Do engineers ever need lawyers to interpret regulatory language?</h3>



<p class="wp-block-paragraph">More often than people outside the field would guess. Regulations like PSM are written in language that looks straightforward but contains clauses with multiple defensible readings. The atmospheric tank exemption is a good example: the phrase &#8220;kept below their normal boiling point without benefit of chilling or refrigeration&#8221; can mean &#8220;no chilling of any kind&#8221; or &#8220;no chilling needed for boiling-point control,&#8221; and a tank that is refrigerated for a non-boiling-point reason sits squarely in the gap between those two readings. Engineers can describe the tank, the chemistry, and how the system operates. Whether those facts add up to the exemption applying or not is a regulatory interpretation question, and that is lawyer territory. On material questions, engineering and legal teams have to work together — the engineer establishes what the system is, and the lawyer establishes what the regulation says about it.</p>



<h2 class="wp-block-heading">Further Reading</h2>



<p class="wp-block-paragraph"><strong>Internal (silsafe.net):</strong></p>



<ul class="wp-block-list">
<li><a href="https://silsafe.net/hazard-and-risk-assessment-hra/">Hazard and Risk Assessment (H&amp;RA): The Foundation of Functional Safety</a></li>



<li><a href="https://silsafe.net/layer-of-protection-analysis-lopa/">Layer of Protection Analysis (LOPA): The Engineer&#8217;s Guide to SIL Selection</a></li>
</ul>



<p class="wp-block-paragraph"><strong>External:</strong></p>



<ul class="wp-block-list">
<li><a href="https://en.wikipedia.org/wiki/Garden_Grove_chemical_leak" target="_blank" rel="noopener">Garden Grove chemical leak — Wikipedia</a></li>



<li><a href="https://www.npr.org/2026/05/24/nx-s1-5833165/california-chemical-tank-malfunction-leak-explode-emergency-evacuate" target="_blank" rel="noopener">California chemical tank has cracked, causing state of emergency, thousands to evacuate — NPR</a></li>



<li><a href="https://www.nbclosangeles.com/news/local/live-updates-garden-grove-chemical-tank-emergency/3894473/" target="_blank" rel="noopener">Live updates: Garden Grove chemical tank emergency — NBC Los Angeles</a></li>



<li><a href="https://calepa.ca.gov/california-accidental-release-prevention/" target="_blank" rel="noopener">CalEPA — California Accidental Release Prevention (CalARP) Program</a></li>



<li><a href="https://www.csb.gov/" target="_blank" rel="noopener">U.S. Chemical Safety and Hazard Investigation Board</a></li>



<li><a href="https://www.cdc.gov/niosh/npg/npgd0426.html" target="_blank" rel="noopener">CDC/NIOSH Pocket Guide — Methyl Methacrylate</a></li>
</ul>



<script type="application/ld+json">
{
  "@context": "https://schema.org",
  "@type": "FAQPage",
  "mainEntity": [
    {
      "@type": "Question",
      "name": "What is MMA and why is it stored in tanks like this?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "MMA is methyl methacrylate, a flammable liquid organic compound. It is a monomer, meaning a small molecule that chemically links with copies of itself to form a long-chain polymer. The polymer in MMA's case is polymethyl methacrylate (PMMA, also called acrylic or Plexiglas). Bulk MMA storage is common anywhere PMMA is manufactured."
      }
    },
    {
      "@type": "Question",
      "name": "Is methyl methacrylate covered by OSHA PSM?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Possibly. MMA is not on the OSHA PSM Appendix A list of named highly hazardous chemicals, but PSM has a separate pathway for flammable liquids with flash point below 100°F in quantities of 10,000 lb or more. MMA meets both criteria. The complication is the atmospheric tank exemption, which excludes flammable liquids in atmospheric tanks kept below boiling point without chilling or refrigeration. For an unrefrigerated MMA tank at ordinary ambient conditions, the exemption likely applies. For a refrigerated MMA tank like the one at GKN, applicability is genuinely unsettled and depends on how the chilling clause is interpreted."
      }
    },
    {
      "@type": "Question",
      "name": "The tank at GKN has a lid on it. Doesn't that mean it's not an atmospheric tank under PSM?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "No. \"Atmospheric tank\" in 1910.119 is a pressure-rating definition, not a description of whether the tank is open to the air. Most atmospheric tanks have fixed roofs, vents, manways, and conservation devices. A tank can be fully enclosed and still be atmospheric. What disqualifies a tank from being atmospheric is being designed and rated for higher internal pressure than atmospheric service. The presence of a lid is not the deciding factor."
      }
    },
    {
      "@type": "Question",
      "name": "Do engineers ever need lawyers to interpret regulatory language?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "More often than people outside the field would guess. Regulations like PSM are written in language that looks straightforward but contains clauses with multiple defensible readings. The atmospheric tank exemption is a good example: the phrase \"kept below their normal boiling point without benefit of chilling or refrigeration\" can mean \"no chilling of any kind\" or \"no chilling needed for boiling-point control,\" and a tank that is refrigerated for a non-boiling-point reason sits squarely in the gap between those two readings. Engineers can describe the tank, the chemistry, and how the system operates. Whether those facts add up to the exemption applying or not is a regulatory interpretation question, and that is lawyer territory. On material questions, engineering and legal teams have to work together — the engineer establishes what the system is, and the lawyer establishes what the regulation says about it."
      }
    }
  ]
}
</script>



<p class="wp-block-paragraph"></p>
]]></content:encoded>
					
					<wfw:commentRss>https://silsafe.net/garden-grove-chemical-incident/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>SIL Verification: The Three Gates Every SIF Must Clear</title>
		<link>https://silsafe.net/sil-verification-three-gates/</link>
					<comments>https://silsafe.net/sil-verification-three-gates/#respond</comments>
		
		<dc:creator><![CDATA[mamerten]]></dc:creator>
		<pubDate>Sun, 10 May 2026 19:42:39 +0000</pubDate>
				<category><![CDATA[Advanced]]></category>
		<guid isPermaLink="false">https://silsafe.net/?p=6515</guid>

					<description><![CDATA[SIL verification under IEC 61511 is three independent gates, all of which must pass: PFDavg, systematic capability, and architectural constraints. The gate that gets the most attention rarely fails an audit. This article walks through each gate, where each one fails, and three worked scenarios that show how auditors actually catch problems.]]></description>
										<content:encoded><![CDATA[
<p class="wp-block-paragraph">When practitioners say &#8220;SIL verification,&#8221; they almost always mean one specific process: design-phase verification of a Safety Instrumented Function (SIF) against the three independent requirements in IEC 61511 Clause 11. Three gates, all independent, all of which must pass.</p>



<p class="wp-block-paragraph">The three gates are:</p>



<ol class="wp-block-list">

<li><strong>PFDavg / PFH</strong> — the quantitative gate. The SIF&#8217;s failure probability must meet its allocated SIL band.</li>


<li><strong>Systematic capability</strong> — every device in the SIF must be fit for the required SIL, by certification or by prior use.</li>


<li><strong>Architectural constraints</strong> — the SIF&#8217;s hardware fault tolerance must be sufficient for its target SIL.</li>

</ol>



<p class="wp-block-paragraph">A SIF that nails its PFDavg but fails on systematic capability or architecture is not verified. The gate that gets the most attention, PFDavg, is rarely the one that fails an audit.</p>



<p class="wp-block-paragraph">A note on terminology before going further. &#8220;Verification&#8221; in IEC 61511 is broader than what this article covers. It spans the whole life-cycle: H&#038;RA, SRS, design, FAT, operations. &#8220;SIL verification&#8221; is the industry shorthand for the design-phase SIF verification specifically, and that is what this article covers.</p>



<p class="wp-block-paragraph">The three worked scenarios at the end show the gates working together: one clean pass and two failure cases that mirror what auditors actually find.</p>



<h2 class="wp-block-heading">Gate 1: PFDavg / PFH</h2>



<p class="wp-block-paragraph">This is the quantitative gate, defined in IEC 61511 Clause 11.9. The SIF&#8217;s calculated probability of failing to act on demand must meet or exceed its allocated SIL band.</p>



<p class="wp-block-paragraph">Demand mode determines which metric applies. Low demand, where the SIF is called on less than once per year, covers most process industry shutdown SIFs and uses Probability of Failure on Demand average (PFDavg). High demand and continuous mode, more frequent than once per year, cover some compressor protection, fired equipment trips, and machinery safety, and use Probability of Failure per Hour (PFH). High demand and continuous are treated together for SIL verification purposes.</p>



<p class="wp-block-paragraph">This article is not a PFDavg tutorial. For depth on inputs and methods, see the dedicated PFDavg article in Further Reading.</p>



<h3 class="wp-block-heading">Why this is the gate everyone fixates on</h3>



<p class="wp-block-paragraph">It produces a number, and numbers feel definitive. Tools automate it. The other two gates require judgment, evidence, and documentation. The PFDavg report is the tangible deliverable a manager can hold up and point to.</p>



<p class="wp-block-paragraph">The trap is that it&#8217;s also the gate easiest to engineer to pass on paper while the SIF still fails on Gates 2 or 3.</p>



<h3 class="wp-block-heading">Where this gate fails</h3>



<p class="wp-block-paragraph">The PFDavg calculation is only as honest as its inputs and architecture assumptions. The same calculation can produce a false pass or a false fail depending on which way the assumptions lean.</p>



<ul class="wp-block-list">

<li><strong>False pass:</strong> optimistic TI, overstated Cpt, generic failure rates that don&#8217;t match the real device. The number passes, but real-world risk reduction is lower than the report claims.</li>


<li><strong>False fail:</strong> overly conservative inputs reject a SIF that would actually meet its target. The design ends up over-engineered relative to the real risk.</li>


<li><strong>Wrong architecture:</strong> sometimes the architecture is just wrong for the target SIL. No amount of input tweaking saves a wrong-fit architecture.</li>

</ul>



<h2 class="wp-block-heading">Gate 2: Systematic Capability</h2>



<p class="wp-block-paragraph">This gate is about whether the components of the SIF have an applicable level of quality and design rigor for the required SIL. It comes from IEC 61511 Clause 11.5, with the prior use path specifically governed by Clause 11.5.3, and the systematic capability concept itself defined in IEC 61508-2. Systematic capability is a property of the device, not the loop. It addresses systematic faults (design errors, software bugs, manufacturing defects) that redundancy can&#8217;t solve.</p>



<p class="wp-block-paragraph">The systematic capability rating of every device must equal or exceed the SIF&#8217;s SIL.</p>



<p class="wp-block-paragraph">There are two paths to demonstrate it.</p>



<h3 class="wp-block-heading">IEC 61508 certification (manufacturer-side)</h3>



<p class="wp-block-paragraph">The manufacturer has had the device assessed against IEC 61508 by a certification body. The outcome is a SIL Certificate stating the systematic capability rating, typically expressed as SC 2, SC 3, etc.</p>



<p class="wp-block-paragraph">Inside IEC 61508, certification can be achieved through different S routes. Route 1S (design-process rigor) is the most common. Route 2S (proven in use) is less common. Route 3S applies to software.</p>



<p class="wp-block-paragraph">For the facility verifier, what matters is the systematic capability rating on the certificate. The route the manufacturer used to achieve it is documented in the SIL Certificate. Software systematic capability is handled by Route 3S, applied to embedded firmware and application software during certification of devices like logic solvers. For a facility verifier using a certified logic solver, this is captured by the SC rating on the certificate. No separate software analysis required at the facility level.</p>



<h3 class="wp-block-heading">IEC 61511 prior use (facility-side)</h3>



<p class="wp-block-paragraph">This is IEC 61511&#8217;s alternative path when no IEC 61508 certification exists for the device, governed by Clause 11.5.3.</p>



<p class="wp-block-paragraph">The owner-operator demonstrates suitability through documented field history under similar operating conditions. No certificate of conformance is involved. Prior use is a facility-side justification, not a third-party attestation. The deliverable is a documented prior use file evaluated by the facility&#8217;s functional safety engineer and audited by an FSA team.</p>



<p class="wp-block-paragraph">It requires evidence of:</p>



<ul class="wp-block-list">

<li>Manufacturer quality management</li>


<li>Device identification and version control</li>


<li>Performance in similar operating environments</li>


<li>Sufficient volume of operating experience</li>

</ul>



<p class="wp-block-paragraph">Generally a more difficult and document-heavy path than using a certified component.</p>



<h3 class="wp-block-heading">Terminology traps to watch for</h3>



<p class="wp-block-paragraph"><strong>Prior use vs. proven in use.</strong> Prior use is the IEC 61511 facility-side path. Proven in use is a specific IEC 61508 route (Route 2S) used by manufacturers seeking certification without a full FMEDA-driven path. Even Goble&#8217;s books and many practitioners use &#8220;proven in use&#8221; loosely to describe both. Be precise in your own documentation.</p>



<p class="wp-block-paragraph"><strong>The H/S route split.</strong> The H routes (1H, 2H) belong to architectural constraints (Gate 3). The S routes (1S, 2S, 3S) belong to systematic capability (Gate 2). The H/S split distinguishes what kind of capability is being demonstrated, not which subsystem the route applies to. Practitioners regularly try to apply S routes to Gate 3 or H routes to Gate 2. Keep them separate.</p>



<h3 class="wp-block-heading">Where this gate fails</h3>



<ul class="wp-block-list">

<li>Legacy field devices with no certificate and no defensible prior use file. &#8220;We&#8217;ve used it for years&#8221; claimed without documentation, version control, or operating-condition records is the typical pattern.</li>


<li>Mismatched systematic capability rating, like using an SC 2 device in a SIL 3 SIF.</li>

</ul>



<h2 class="wp-block-heading">Gate 3: Architectural Constraints (Hardware Fault Tolerance)</h2>



<p class="wp-block-paragraph">This gate is about the inherent redundancy of the SIF: does it have enough fault tolerance to survive a single dangerous failure at its required SIL. It comes from IEC 61511 Clause 11.4, with route definitions inherited from IEC 61508-2 Clause 7.4.4.</p>



<p class="wp-block-paragraph">Hardware fault tolerance (HFT) is the number of dangerous failures a subsystem can tolerate before losing its safety function. A 1oo1 subsystem has no hardware fault tolerance (HFT = 0). A 1oo2 subsystem has HFT = 1.</p>



<p class="wp-block-paragraph">Two H routes are available, applied per element. Different elements in the same SIF can use different routes.</p>



<h3 class="wp-block-heading">Route 1H</h3>



<p class="wp-block-paragraph">Route 1H is based on Safe Failure Fraction (SFF) and Type A vs. Type B classification, using IEC 61508-2 Tables 2 and 3. It requires FMEDA-grade failure data to compute SFF.</p>



<p class="wp-block-paragraph">Type A devices have simple, well-understood failure modes (most mechanical devices). Type B devices are complex with embedded software or microprocessors. The Route 1H tables apply different SFF thresholds to each. Type A is treated more leniently than Type B at the same SIL.</p>



<p class="wp-block-paragraph">This is the most common route for modern certified components, particularly logic solvers, smart instruments, and modern positioners with documented FMEDAs.</p>



<p class="wp-block-paragraph">A note on diagnostics. SFF improves when diagnostics catch dangerous failures, but high SFF doesn&#8217;t strictly <em>require</em> diagnostics. A device with intrinsically safe failure modes (for example, a spring-return solenoid where loss of power drives the safe state) can hit high SFF with no diagnostics at all. In practice, though, most devices that achieve high SFF on Route 1H do so because of diagnostic coverage.</p>



<p class="wp-block-paragraph"><strong>Route 1H — Type A devices (IEC 61508-2 Table 2)</strong></p>



<figure class="wp-block-table"><table><thead><tr><th>SFF</th><th>HFT = 0</th><th>HFT = 1</th><th>HFT = 2</th></tr></thead><tbody><tr><td>&lt; 60%</td><td>SIL 1</td><td>SIL 2</td><td>SIL 3</td></tr><tr><td>60% to &lt; 90%</td><td>SIL 2</td><td>SIL 3</td><td>SIL 4</td></tr><tr><td>90% to &lt; 99%</td><td>SIL 3</td><td>SIL 4</td><td>SIL 4</td></tr><tr><td>&ge; 99%</td><td>SIL 3</td><td>SIL 4</td><td>SIL 4</td></tr></tbody></table></figure>



<p class="wp-block-paragraph"><strong>Route 1H — Type B devices (IEC 61508-2 Table 3)</strong></p>



<figure class="wp-block-table"><table><thead><tr><th>SFF</th><th>HFT = 0</th><th>HFT = 1</th><th>HFT = 2</th></tr></thead><tbody><tr><td>&lt; 60%</td><td>Not allowed</td><td>SIL 1</td><td>SIL 2</td></tr><tr><td>60% to &lt; 90%</td><td>SIL 1</td><td>SIL 2</td><td>SIL 3</td></tr><tr><td>90% to &lt; 99%</td><td>SIL 2</td><td>SIL 3</td><td>SIL 4</td></tr><tr><td>&ge; 99%</td><td>SIL 3</td><td>SIL 4</td><td>SIL 4</td></tr></tbody></table></figure>



<h3 class="wp-block-heading">Route 2H</h3>



<p class="wp-block-paragraph">Route 2H is based on hardware structural resilience and field reliability data. It does not require SFF and uses simpler tables driven by SIL alone.</p>



<p class="wp-block-paragraph">This is effectively the route IEC 61511 uses on its own. IEC 61511&#8217;s architectural constraint table is derived from IEC 61508 Route 2H, and Goble notes the two are essentially identical.</p>



<p class="wp-block-paragraph">Route 2H is most common for mechanical devices with no or minimal diagnostics (rack-and-pinion actuators, manual valves), legacy equipment that predates IEC 61508, and any device where FMEDA data is not available.</p>



<p class="wp-block-paragraph">Prior use data can contribute to other parts of the verification, most directly to Gate 2 systematic capability. The same field history that supports a Route 2H argument for hardware integrity may also support a prior use justification under Clause 11.5.3.</p>



<p class="wp-block-paragraph"><strong>Route 2H — minimum HFT by SIL</strong></p>



<figure class="wp-block-table"><table><thead><tr><th>SIL</th><th>Minimum HFT</th></tr></thead><tbody><tr><td>1</td><td>0</td></tr><tr><td>2</td><td>1 for high-demand or continuous mode, 0 for low-demand mode</td></tr><tr><td>3</td><td>2</td></tr><tr><td>4</td><td>Special requirements per IEC 61508</td></tr></tbody></table></figure>



<h3 class="wp-block-heading">Choosing an H route</h3>



<p class="wp-block-paragraph">For new SIFs designed with modern equipment, the route appears on the SIL Certificate for certified components, typically labeled &#8220;Route 2H Device&#8221; or similar. This is the most common situation a facility verifier will encounter.</p>



<p class="wp-block-paragraph">The decision is per-element and driven by available evidence:</p>



<ul class="wp-block-list">

<li>Devices with FMEDA data and meaningful diagnostics generally go Route 1H.</li>


<li>Devices without FMEDA data, or mechanical devices where diagnostics can&#8217;t catch the dominant failure modes, go Route 2H.</li>

</ul>



<p class="wp-block-paragraph">Both routes are legitimate. Route 2H is not a fallback or a downgrade. It is the appropriate route for hardware where the SFF-based approach doesn&#8217;t fit, and either route is acceptable evidence for SIL verification at the target SIL.</p>



<h3 class="wp-block-heading">Where this gate fails</h3>



<ul class="wp-block-list">

<li>Claiming Route 1H without the FMEDA data to back it.</li>


<li>Selecting a 1oo1 architecture for a device whose SFF and Type combination can&#8217;t reach the target SIL because of required HFT.</li>


<li>Using a Route 2H legacy device in a SIL 3 application where Route 2H can&#8217;t get there regardless of redundancy.</li>


<li>Mixing route claims across elements without verifying each element&#8217;s evidence stands on its own.</li>

</ul>



<h2 class="wp-block-heading">Worked SIL Verification Examples</h2>



<p class="wp-block-paragraph">Three scenarios to make the three-gate framing concrete. Each zooms in on a single component (or two in Scenario 2) within an otherwise-passing SIF. Each scenario opens with a scope statement, then walks through all three gates in whichever order makes the component&#8217;s story easiest to follow. These are SIL verification scenario walkthroughs, not PFDavg calculation tutorials.</p>



<h3 class="wp-block-heading">Scenario 1: A straightforward verification</h3>



<p class="wp-block-paragraph">For this scenario we focus on a smart pressure instrument on a SIL 2 SIF, used in a 1oo1 architecture. Assume the rest of the SIF verifies cleanly.</p>



<ul class="wp-block-list">

<li><strong>Gate 2:</strong> The device is IEC 61508 certified, with an SC 2 rating on the SIL Certificate. SC 2 matches the SIF&#8217;s required SIL — <strong>passes</strong>.</li>


<li><strong>Gate 1:</strong> The certificate provides manufacturer failure rate data. This instrument&#8217;s contribution to the SIF-level PFDavg, combined with the rest of the SIF&#8217;s components, clears the SIL 2 band — <strong>passes</strong>.</li>


<li><strong>Gate 3:</strong> The SIL Certificate confirms Route 1H, Type B, and a documented SFF of 93% (in the 90% to &lt;99% band). The Route 1H Type B table allows SIL 2 at HFT 0, so the 1oo1 architecture clears — <strong>passes</strong>.</li>

</ul>



<p class="wp-block-paragraph">The rest of the SIF (final element and logic solver) verifies via the same logic. This is what a modern and clean SIL verification looks like: a certified component, a matching SC rating, an architecture that fits the route&#8217;s table, and a PFDavg that lands in band.</p>



<h3 class="wp-block-heading">Scenario 2: A trickier case</h3>



<p class="wp-block-paragraph">For this scenario we focus on two components (a final element and an instrument) on a SIL 2 SIF in low-demand mode. Assume the rest of the SIF verifies cleanly, including the SIF-level PFDavg contribution from other components.</p>



<p class="wp-block-paragraph"><strong>Final element walkthrough, non-certified valve in a 1oo2 architecture:</strong></p>



<ul class="wp-block-list">

<li><strong>Gate 2:</strong> Non-certified valve with no IEC 61508 certificate. The facility attempted prior use, but the operating-history records don&#8217;t meet Clause 11.5.3 documentation requirements — <strong>fails</strong>.</li>


<li><strong>Gate 1:</strong> Failure rate data sourced from OREDA for this device class, applied to the SIF-level PFDavg — <strong>passes</strong>.</li>


<li><strong>Gate 3:</strong> Route 2H requires minimum HFT 0 for SIL 2 in low-demand mode. The 1oo2 architecture provides HFT 1, which exceeds the minimum — <strong>passes</strong>.</li>

</ul>



<p class="wp-block-paragraph"><strong>Instrument walkthrough, certified instrument in a 1oo1 architecture:</strong></p>



<ul class="wp-block-list">

<li><strong>Gate 2:</strong> The device is IEC 61508 certified with an SC 2 rating — <strong>passes</strong>.</li>


<li><strong>Gate 1:</strong> Failure rate from the SIL Certificate plugs into the SIF-level PFDavg — <strong>passes</strong>.</li>


<li><strong>Gate 3:</strong> Type B smart instrument with documented SFF in the 60% to &lt;90% band, in a 1oo1 architecture, is capped at SIL 1 by the Route 1H Type B table — <strong>fails</strong>.</li>

</ul>



<p class="wp-block-paragraph">Two components, two unrelated gate failures, both blocking SIL verification of the SIF as a whole. The final element passes its architecture check but fails systematic capability. The instrument passes systematic capability but fails its architecture check. This is why the gates are treated as independent: a SIF doesn&#8217;t get partial credit for clearing two of three on any given component, and clearing all three on most components doesn&#8217;t help if one element fails one gate.</p>



<h3 class="wp-block-heading">Scenario 3: The hardest one to catch</h3>



<p class="wp-block-paragraph">For this scenario we focus on a non-certified globe valve, used in a 1oo1 architecture. Assume the rest of the SIF verifies cleanly.</p>



<p class="wp-block-paragraph">The engineer specified this valve from a familiar vendor because it was the right size, available, and had been used in non-safety service at the site for years. To populate the verification report, the engineer pulled an SFF figure from the SIL-rated version of the same product family, same vendor, same valve series, but a different model.</p>



<ul class="wp-block-list">

<li><strong>Gate 2:</strong> No IEC 61508 certificate for the actual valve installed. The non-safety service history doesn&#8217;t qualify for prior use under Clause 11.5.3 — <strong>fails</strong>.</li>


<li><strong>Gate 1:</strong> PFDavg calculation passes using the borrowed SFF and a generic failure rate from a database — <strong>passes</strong>.</li>


<li><strong>Gate 3:</strong> Both Route 1H and Route 2H were considered. Route 1H requires FMEDA-grade failure data for the actual device installed. The borrowed SFF from a sibling SIL-rated model doesn&#8217;t qualify, and no FMEDA exists for this valve. Route 2H doesn&#8217;t require SFF, but does require documented hardware reliability evidence, and the non-safety service history doesn&#8217;t meet that bar either — <strong>fails</strong>.</li>

</ul>



<p class="wp-block-paragraph">The rest of the SIF verifies cleanly. One bad component sinks the SIL verification.</p>



<p class="wp-block-paragraph">The lesson here is that the gates are independent in principle but often connected in practice. A device with no certificate and no defensible field history typically fails on Gate 2 <em>and</em> leaves Gate 3 with no defensible route. And route claims on Gate 3 must be defended with evidence that&#8217;s specific to the device installed, not borrowed from a similar product.</p>



<h2 class="wp-block-heading">Life-cycle Placement</h2>



<p class="wp-block-paragraph">SIL verification sits in the design phase of the IEC 61511 safety life-cycle.</p>



<p class="wp-block-paragraph"><strong>What comes before:</strong> H&#038;RA and LOPA produce the SIL allocation for each SIF. The SRS captures the safety requirements. Conceptual SIS design proposes a candidate architecture and equipment selection. SIL verification then takes those inputs and tests them against the three gates.</p>



<p class="wp-block-paragraph"><strong>What comes after:</strong> continued detailed design, FAT, installation and commissioning. The verification report is a key input to the Functional Safety Assessment and to operational handover.</p>



<h2 class="wp-block-heading">Common Mistakes</h2>



<p class="wp-block-paragraph">The mistakes that surface during SIL verification audits are rarely about the math. They are about evidence, judgment, and which gate the engineer treated as optional.</p>



<ul class="wp-block-list">

<li><strong>Treating PFDavg as the only gate.</strong> The calculation passes, often using manufacturer numbers without scrutiny, and the audit fails on Gate 2 or Gate 3. This is the most common pattern, and it&#8217;s the spine of most of the failures below.</li>


<li><strong>Claiming prior use without the documentation to defend it.</strong> &#8220;We&#8217;ve run this valve for 15 years&#8221; is not a prior use file. Clause 11.5.3 wants quality management, version control, and operating-condition records, not informal recollection.</li>


<li><strong>Confusing prior use with proven in use,</strong> then defending neither correctly when challenged.</li>


<li><strong>Not realizing a route was claimed at all.</strong> Accepting whichever H route the calculation tool defaulted to without checking whether the device&#8217;s evidence supports it.</li>

</ul>



<p class="wp-block-paragraph">Most of these share a root cause: engineers default to whichever gate they&#8217;re most comfortable with and underweight the other two.</p>



<h2 class="wp-block-heading">Q&#038;A</h2>



<h3 class="wp-block-heading">Q1: I thought PFDavg was all we had to do. What&#8217;s the deal?</h3>



<p class="wp-block-paragraph">PFDavg is one of three independent gates, not the whole verification. IEC 61511 Clause 11 also requires systematic capability (the device is fit for the SIL by certification or prior use) and architectural constraints (the hardware fault tolerance is sufficient for the SIL). A SIF can have a beautiful PFDavg report and still fail SIL verification. PFDavg gets the attention because it produces a number and tools automate it, but the other two gates are where audits most often turn up problems.</p>



<h3 class="wp-block-heading">Q2: I heard we have to use SIL-certified devices. Is that correct?</h3>



<p class="wp-block-paragraph">Certified devices make your life easier and are recommended, but not strictly required. IEC 61511 also allows prior use as an alternative path under Clause 11.5.3. Prior use is harder and document-heavy. That is the trade-off for skipping certification. Most facilities find the cost of building defensible prior use files component by component exceeds the price premium on certified equipment. SIL Safe defaults to certified components on every project for exactly this reason.</p>



<h3 class="wp-block-heading">Q3: I&#8217;ve been using this family of PLCs for years and they&#8217;re great. Never had a problem. How can I use this in a SIF?</h3>



<p class="wp-block-paragraph">Possible in theory, expensive in practice, scrutinized at all three gates of SIL verification. On Gate 2, prior use under Clause 11.5.3 needs more than a track record. It needs documented manufacturer quality management, version control, and operating-environment records. On Gate 3, general-purpose PLCs typically lack the diagnostic coverage and architectural features that certified safety PLCs are designed around, so SFF and HFT arguments get hard to defend.</p>



<p class="wp-block-paragraph">On Gate 1, manufacturer failure rate data is rarely available for non-safety PLCs, and generic database values may not match the actual hardware. The same field history that might support prior use only contributes to the failure rate argument if the facility has safety-grade failure recording, which most general-purpose PLC installations don&#8217;t. Add it up and a certified safety PLC almost always wins on total cost.</p>



<h3 class="wp-block-heading">Q4: We just did all this work to use a valve under prior use, and I thought that was it. Now I need to calculate PFDavg too. Where does that data come from?</h3>



<p class="wp-block-paragraph">Prior use clears Gate 2 only. Gate 1 still needs failure rate data to support the PFDavg calculation. For a non-certified valve, that typically means industry databases (OREDA, Exida&#8217;s data handbook, or similar) applied to the device class. The data is generally more conservative than what an FMEDA would produce on a certified device, which is one of the practical reasons certified components ease SIL verification across all three gates simultaneously.</p>



<h3 class="wp-block-heading">Q5: Our FuSa engineer says we need 2oo3 voting on this SIF instead of a single instrument, which is going to triple our hardware cost. The PFDavg calculation passes fine on a 1oo1, so why do we need 2oo3?</h3>



<p class="wp-block-paragraph">PFDavg and HFT are independent gates. The PFDavg calculation can pass on a 1oo1 architecture, but Gate 3 is a separate check that the architecture has enough hardware fault tolerance to be trusted at the target SIL. If the instrument&#8217;s SFF and Type combination requires HFT ≥ 1 to reach the target SIL per the Route 1H tables, a 1oo1 architecture (HFT 0) is insufficient regardless of what the PFDavg math says.</p>



<p class="wp-block-paragraph">Two paths through this:</p>



<ul class="wp-block-list">

<li>Pick an instrument with higher SFF (typically smart, fail-safe-designed, or with stronger diagnostics) so 1oo1 clears Gate 3.</li>


<li>Accept the redundancy.</li>

</ul>



<p class="wp-block-paragraph">The expensive smart instrument frequently beats three cheaper instruments plus the wiring, I/O, and proof-testing overhead once total cost is honest.</p>



<h3 class="wp-block-heading">Q6: How do we at SIL Safe advise our clients on this topic?</h3>



<p class="wp-block-paragraph">Use IEC 61508-certified components wherever possible. It is the simplest path through all three gates.</p>



<p class="wp-block-paragraph">Gate 1 is easier because the manufacturer provides failure rate data, and that data is typically less conservative than generic database values for the same equipment type. The FMEDA process produces a component-specific number rather than a worst-case estimate across a heterogeneous population.</p>



<p class="wp-block-paragraph">Gate 2 collapses to a check that the SC rating matches the SIL.</p>



<p class="wp-block-paragraph">Gate 3 collapses to a check of the architecture against what the certificate documents (route, SFF where applicable, and Type). SFF on certified components tends to be relatively high because the FMEDA process and the certification market push toward devices with strong diagnostics, which makes the table lookup more likely to clear at the target SIL.</p>



<h3 class="wp-block-heading">Q7: Does the standard clearly state these three gates in those exact terms?</h3>



<p class="wp-block-paragraph">No. The three-gate framing for SIL verification is shorthand extracted from various clauses (11.4, 11.5, 11.9). The standard treats the requirements as separate within the design phase. The framing is a teaching device, not a quoted structure.</p>



<h2 class="wp-block-heading">Further Reading</h2>



<p class="wp-block-paragraph">Internal:</p>



<ul class="wp-block-list">

<li><a href="https://silsafe.net/layer-of-protection-analysis-lopa/">Layer of Protection Analysis (LOPA): The Engineer&#8217;s Guide to SIL Selection</a></li>


<li><a href="https://silsafe.net/hazard-and-risk-analysis-methods/">Hazard and Risk Analysis Methods: How HAZOP, What-If, LOPA, Risk Graph, FTA, ETA, and Bowtie Fit Together</a></li>


<li><a href="https://silsafe.net/pfdavg-explained/">PFDavg Explained: 6 Essentials for Getting Started with SIL Calculations</a></li>

</ul>



<p class="wp-block-paragraph">External:</p>



<ul class="wp-block-list">

<li><a href="https://webstore.iec.ch/en/publication/24241" target="_blank" rel="noopener">IEC 61511-1:2016+A1:2017 — official IEC publication page</a></li>


<li><a href="https://www.hse.gov.uk/eci/functional.htm" target="_blank" rel="noopener">HSE — Functional safety (BS EN 61511 reference page)</a></li>


<li><a href="https://www.isa.org/standards-and-publications/isa-standards/isa-84-standards" target="_blank" rel="noopener">ISA-84 Series of Standards</a></li>


<li><a href="https://61508.org/wp-content/uploads/2023/11/T6A024_Technical_Guide_-_Proven_in_Use_V8_-_e012023.pdf" target="_blank" rel="noopener">The 61508 Association — Proven in Use technical guide</a></li>

</ul>



<h2 class="wp-block-heading">Closing Synthesis</h2>



<p class="wp-block-paragraph">SIL verification is three independent gates, all of which must pass. Most failures come from underweighting two of the three, usually because the third produces a clean number on a report. Treat them as separate checks, defend each one with its own evidence, and the verification holds up under audit.</p>



<p class="wp-block-paragraph">Functional safety is complex, and the stakes are high. If you have questions about your SIS design, SIL verification, or where to start with IEC 61511, the team at SIL Safe is here to help. Reach out to us today.</p>



<script type="application/ld+json">
{
  "@context": "https://schema.org",
  "@type": "FAQPage",
  "mainEntity": [
    {
      "@type": "Question",
      "name": "Q1: I thought PFDavg was all we had to do. What's the deal?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "PFDavg is one of three independent gates, not the whole verification. IEC 61511 Clause 11 also requires systematic capability (the device is fit for the SIL by certification or prior use) and architectural constraints (the hardware fault tolerance is sufficient for the SIL). A SIF can have a beautiful PFDavg report and still fail SIL verification. PFDavg gets the attention because it produces a number and tools automate it, but the other two gates are where audits most often turn up problems."
      }
    },
    {
      "@type": "Question",
      "name": "Q2: I heard we have to use SIL-certified devices. Is that correct?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Certified devices make your life easier and are recommended, but not strictly required. IEC 61511 also allows prior use as an alternative path under Clause 11.5.3. Prior use is harder and document-heavy. That is the trade-off for skipping certification. Most facilities find the cost of building defensible prior use files component by component exceeds the price premium on certified equipment. SIL Safe defaults to certified components on every project for exactly this reason."
      }
    },
    {
      "@type": "Question",
      "name": "Q3: I've been using this family of PLCs for years and they're great. Never had a problem. How can I use this in a SIF?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Possible in theory, expensive in practice, scrutinized at all three gates of SIL verification. On Gate 2, prior use under Clause 11.5.3 needs more than a track record. It needs documented manufacturer quality management, version control, and operating-environment records. On Gate 3, general-purpose PLCs typically lack the diagnostic coverage and architectural features that certified safety PLCs are designed around, so SFF and HFT arguments get hard to defend."
      }
    },
    {
      "@type": "Question",
      "name": "Q4: We just did all this work to use a valve under prior use, and I thought that was it. Now I need to calculate PFDavg too. Where does that data come from?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Prior use clears Gate 2 only. Gate 1 still needs failure rate data to support the PFDavg calculation. For a non-certified valve, that typically means industry databases (OREDA, Exida's data handbook, or similar) applied to the device class. The data is generally more conservative than what an FMEDA would produce on a certified device, which is one of the practical reasons certified components ease SIL verification across all three gates simultaneously."
      }
    },
    {
      "@type": "Question",
      "name": "Q5: Our FuSa engineer says we need 2oo3 voting on this SIF instead of a single instrument, which is going to triple our hardware cost. The PFDavg calculation passes fine on a 1oo1, so why do we need 2oo3?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "PFDavg and HFT are independent gates. The PFDavg calculation can pass on a 1oo1 architecture, but Gate 3 is a separate check that the architecture has enough hardware fault tolerance to be trusted at the target SIL. If the instrument's SFF and Type combination requires HFT ≥ 1 to reach the target SIL per the Route 1H tables, a 1oo1 architecture (HFT 0) is insufficient regardless of what the PFDavg math says."
      }
    },
    {
      "@type": "Question",
      "name": "Q6: How do we at SIL Safe advise our clients on this topic?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Use IEC 61508-certified components wherever possible. It is the simplest path through all three gates."
      }
    },
    {
      "@type": "Question",
      "name": "Q7: Does the standard clearly state these three gates in those exact terms?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "No. The three-gate framing for SIL verification is shorthand extracted from various clauses (11.4, 11.5, 11.9). The standard treats the requirements as separate within the design phase. The framing is a teaching device, not a quoted structure."
      }
    }
  ]
}
</script>
]]></content:encoded>
					
					<wfw:commentRss>https://silsafe.net/sil-verification-three-gates/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>Hazard and Risk Analysis Methods: How HAZOP, What-If, LOPA, Risk Graph, FTA, ETA, and Bowtie Fit Together</title>
		<link>https://silsafe.net/hazard-and-risk-analysis-methods/</link>
					<comments>https://silsafe.net/hazard-and-risk-analysis-methods/#respond</comments>
		
		<dc:creator><![CDATA[mamerten]]></dc:creator>
		<pubDate>Sun, 03 May 2026 23:36:16 +0000</pubDate>
				<category><![CDATA[Advanced]]></category>
		<guid isPermaLink="false">https://silsafe.net/?p=6468</guid>

					<description><![CDATA[A practitioner's orientation to the methods that make up an IEC 61511 hazard and risk assessment — HAZOP, What-If, LOPA, risk graph, FTA, ETA, and bowtie — and how they fit together.]]></description>
										<content:encoded><![CDATA[
<p class="wp-block-paragraph">Most engineers learn hazard and risk analysis methods one at a time, in the order their careers happened to expose them. HAZOP in a class. LOPA when they joined a site that used it. Bowtie in a corporate training deck. Years pass and they end up fluent in a handful of methods without ever assembling a clean mental model of how the methods fit together.</p>



<p class="wp-block-paragraph">That gap creates real problems. Teams default to whatever their site does and can&#8217;t articulate why, can&#8217;t recognize when the wrong tool is being used, and outgrow their methodology without realizing it. IEC 61511 requires a hazard and risk assessment (H&#038;RA) but is method-agnostic. The choice of hazard and risk analysis methods is on you, and the right choice depends on your facility, your scenarios, and the depth of analysis your risk picture demands.</p>



<h2 class="wp-block-heading">The Two Halves of Hazard and Risk Assessment</h2>



<p class="wp-block-paragraph">Clause 8 of IEC 61511 requires both halves of an H&#038;RA: hazard identification and risk assessment.</p>



<p class="wp-block-paragraph"><strong>Hazard identification</strong> is discovery work: <em>what can go wrong here?</em> The output is a list of credible scenarios with causes, consequences, and existing safeguards.</p>



<p class="wp-block-paragraph"><strong>Risk assessment</strong> is evaluation work: <em>for each identified hazard, how bad is it, how likely is it, and how much risk reduction is needed to make it tolerable?</em> The output is a defensible determination of required risk reduction, often expressed as a Safety Integrity Level (SIL) when the reduction is allocated to a Safety Instrumented Function (SIF).</p>



<p class="wp-block-paragraph">Most method confusion comes from not recognizing which half a given tool serves. HAZOP is hazard identification. LOPA is mostly risk assessment. They do different cognitive work, and they aren&#8217;t interchangeable.</p>



<p class="wp-block-paragraph">Another source of confusion: some methods are treated differently by different practitioners. The same method can be identification-only at one site and identification-plus-assessment at another. This article names the common patterns, but your facility may run them differently.</p>



<h2 class="wp-block-heading">Hazard Identification Methods</h2>



<p class="wp-block-paragraph">The hazard and risk analysis methods in this section are collectively known as Process Hazard Analysis (PHA) methods in much of industry practice. PHA usually refers to the identification activity specifically, while H&#038;RA is the IEC 61511 phrasing for the full activity covering both halves.</p>



<h3 class="wp-block-heading">HAZOP</h3>



<p class="wp-block-paragraph">A Hazard and Operability Study (HAZOP) is a hazard identification method by design, but in practice it often extends into risk assessment. Two patterns are worth recognizing:</p>



<ul class="wp-block-list">

<li><strong>HAZOP-as-identification:</strong> the team identifies hazards, applies a risk ranking for prioritization, and hands the scenarios off to LOPA or risk graph for assessment. Cleaner pattern, dominant at larger facilities.</li>


<li><strong>HAZOP-as-everything:</strong> the team identifies hazards and uses a calibrated risk matrix to determine SIL requirements directly, with no separate risk assessment step. Common at smaller facilities and in revalidations.</li>

</ul>



<p class="wp-block-paragraph">The workshop mechanics are the same either way. A multidisciplinary team works node-by-node through the process, applying parameter and guideword combinations (no flow, more pressure, reverse flow) to surface deviations from design intent. HAZOP is time-intensive, depends heavily on facilitator skill, and works best on continuous processes.</p>



<h3 class="wp-block-heading">What-If (Typically Run as What-If/Checklist)</h3>



<p class="wp-block-paragraph">What-If is a hazard identification method. A team works through the process asking open-ended <em>what if</em> questions and brainstorms consequences and safeguards.</p>



<p class="wp-block-paragraph">In practice almost nobody runs pure What-If — the risk of missing something obvious is too high. What sites actually do is What-If/Checklist: the team brainstorms freely, then uses a backing checklist as a safety net to catch what was missed. When practitioners say &#8220;we did a What-If,&#8221; they almost always mean What-If/Checklist.</p>



<p class="wp-block-paragraph">What-If/Checklist is the practical alternative to HAZOP for smaller, simpler, or batch processes. It&#8217;s accepted under major regulatory frameworks (OSHA Process Safety Management in the US, COMAH in the UK, Seveso in the EU) and is the right call for Management of Change (MOC) reviews, Pre-Startup Safety Reviews, and revalidations of well-understood operations.</p>



<h3 class="wp-block-heading">Pure Checklist</h3>



<p class="wp-block-paragraph">Pure checklist is verification, not discovery — the team walks through a pre-built list of standard hazards or design conditions for that process type and confirms each one is addressed. It&#8217;s used for screening reviews, very simple operations, or as an MOC tool for minor changes.</p>



<p class="wp-block-paragraph">Example: a process safety engineer is evaluating a proposed minor MOC. They walk through a standard checklist (does it change relief valve sizing, introduce new hazardous materials, affect classified area boundaries, touch a SIF), and the checklist confirms whether a fuller PHA review is needed.</p>



<h3 class="wp-block-heading">Related but Out of Scope: FMEA and FMEDA</h3>



<p class="wp-block-paragraph">Failure Modes and Effects Analysis (FMEA) and Failure Modes, Effects and Diagnostic Analysis (FMEDA) don&#8217;t belong in the H&#038;RA toolkit. Equipment-level FMEA and FMEDA belong to device certification under IEC 61508, the manufacturer&#8217;s domain, with outputs consumed by IEC 61511 users during SIS design and SIL verification. Process FMEA exists as a hazard identification method in adjacent industries (pharma, food, automotive) but isn&#8217;t standard in the process industry. Not covered further.</p>



<h2 class="wp-block-heading">Risk Assessment Methods</h2>



<h3 class="wp-block-heading">LOPA</h3>



<p class="wp-block-paragraph">Layer of Protection Analysis (LOPA) is a risk assessment method. It serves the assessment half of the H&#038;RA, but it straddles the Clause 8 / Clause 9 boundary of IEC 61511. The analytical work is risk assessment under Clause 8, while the output feeds SIL allocation under Clause 9. That dual nature is why LOPA gets described both ways depending on context.</p>



<p class="wp-block-paragraph">The mechanics are semi-quantitative and scenario-based. For each scenario, the team multiplies the initiating event frequency by the probability of failure on demand of each Independent Protection Layer (IPL) credited against the scenario, then compares the residual risk to the tolerable risk criterion. Where residual risk exceeds tolerable, the gap defines the required risk reduction factor (RRF) and corresponding SIL.</p>



<p class="wp-block-paragraph">LOPA is the most widely adopted risk assessment method globally, the default tool for most facilities. Its strengths are visible math, defensible documentation, and scalability across hundreds of scenarios. Its limitations are real: order-of-magnitude resolution can hide meaningful differences, and IPL crediting requires real discipline. Sloppy IPL crediting produces inflated risk reduction credit and leaves real gaps.</p>



<h3 class="wp-block-heading">Risk Graph</h3>



<p class="wp-block-paragraph">Risk graph is a risk assessment method. A calibrated decision tree uses four parameters: consequence severity (C), frequency or exposure (F), possibility of avoidance (P), and demand rate (W). The path through the tree lands on a SIL.</p>



<p class="wp-block-paragraph">Risk graph compresses risk assessment and SIL selection into a single traversal: consequence severity, exposure, avoidance, and tolerable risk comparison are all baked into the calibrated tree. It&#8217;s common in European-influenced practice, in oil &#038; gas globally, and in machinery safety; it remains a fully legitimate IEC 61511-3 method.</p>



<p class="wp-block-paragraph">Strengths: speed, no failure rate data required for IPLs, easy to teach. Limitations: calibration is everything, sensitivity is poor, qualitative judgments hide assumptions that LOPA forces explicit.</p>



<h3 class="wp-block-heading">LOPA and Risk Graph Are Alternatives, Not Complements</h3>



<p class="wp-block-paragraph">Both methods occupy the same workflow slot: risk assessment with a SIL output. Running both on the same scenario set creates conflicting answers and an unauditable documentation trail. Sites pick one or the other as their site standard and apply it consistently.</p>



<h2 class="wp-block-heading">The Quantitative Deep-Dive Methods</h2>



<p class="wp-block-paragraph">FTA, ETA, and bowtie are risk assessment tools, not hazard identification tools. Like LOPA and risk graph, they require an upstream identification activity to define what gets analyzed. They cannot tell you what hazards you forgot to consider.</p>



<h3 class="wp-block-heading">Fault Tree Analysis (FTA)</h3>



<p class="wp-block-paragraph">FTA works top-down and deductively. Start with a top event and work backward through the failure combinations that produce it using Boolean logic. Logic gates (AND, OR), basic events with failure rates, and minimal cut sets give a quantified frequency for the top event.</p>



<p class="wp-block-paragraph">Strengths: rigorous quantification, handles complex failure logic, identifies common cause failure (CCF) contributors. Limitation: FTA only analyzes the top event you defined. It cannot surface hazards nobody named.</p>



<h3 class="wp-block-heading">Event Tree Analysis (ETA)</h3>



<p class="wp-block-paragraph">ETA is the structural mirror of FTA: bottom-up and inductive. Start with an initiating event and work forward through outcome paths, assigning branching probabilities at each protection layer or conditional modifier (ignition / no ignition, immediate vs. delayed, occupancy at the time).</p>



<p class="wp-block-paragraph">The output is the full consequence space mapped quantitatively. Limitation: ETA is only as good as the initiating event you picked.</p>



<h3 class="wp-block-heading">Bowtie Analysis: The Integrating Visual</h3>



<p class="wp-block-paragraph">A bowtie puts the hazardous event in the middle, with threats and causes on the left (FTA territory), consequences on the right (ETA territory), and barriers across both sides. It can be qualitative (illustrative) or quantitative (backed by FTA/ETA data).</p>



<p class="wp-block-paragraph">Bowtie&#8217;s strength is communication: management, operations, and maintenance can read a bowtie even if they can&#8217;t read a fault tree. It&#8217;s also powerful for barrier management, making preventive and mitigative barriers visible together. Limitation: a bowtie is only as rigorous as the analysis behind it. A whiteboard sketch is not an H&#038;RA.</p>



<h2 class="wp-block-heading">How a Complete H&#038;RA Comes Together</h2>



<p class="wp-block-paragraph">A complete H&#038;RA combines hazard and risk analysis methods from both halves: one for identification, one for assessment. Several pairings are common in practice.</p>



<h3 class="wp-block-heading">The Common Pairing: HAZOP + LOPA</h3>



<p class="wp-block-paragraph">HAZOP plus LOPA is the dominant H&#038;RA pairing globally. HAZOP identifies, LOPA assesses, with a clean handoff between the two halves. The pairing extends one step beyond H&#038;RA. LOPA&#8217;s output feeds Clause 9 SIL allocation directly.</p>



<h3 class="wp-block-heading">The Alternative Pairing: HAZOP + Risk Graph</h3>



<p class="wp-block-paragraph">Same handoff structure, different risk assessment tool. Common in European-influenced practice. Site-wide methodology choice. Risk graph outputs a SIL directly, also feeding Clause 9 allocation.</p>



<h3 class="wp-block-heading">The Quantitative Pairing: Identification + FTA/ETA + Bowtie</h3>



<p class="wp-block-paragraph">Reserved for high-consequence facilities or specific scenarios within an otherwise LOPA-based program. Identification can be HAZOP or another systematic method, but the upstream identification step cannot be skipped. The Common Mistakes section below walks through what happens when it is.</p>



<h3 class="wp-block-heading">The Small-Facility Pattern: What-If/Checklist (or HAZOP) + Calibrated Risk Matrix</h3>



<p class="wp-block-paragraph">A single workshop does identification, assessment, and SIL determination using a calibrated matrix. The identification method is What-If/Checklist for smaller operations; some sites use HAZOP and run it the same way. The matrix serves double duty: both the tolerable risk reference and the SIL determination output. Common at facilities that don&#8217;t have the scenario count to justify a separate LOPA program. The matrix&#8217;s resolution caps how defensible higher-SIL determinations can be.</p>



<h3 class="wp-block-heading">Why You Can&#8217;t Skip Hazard Identification</h3>



<p class="wp-block-paragraph">FTA and ETA both start from a known event. LOPA and risk graph both evaluate scenarios they&#8217;re given. None of them can tell you what hazards you forgot to analyze. Identification and assessment are different cognitive tasks, and skipping the identification step doesn&#8217;t reduce the rigor of the H&#038;RA. It produces a different activity that happens to share vocabulary with one.</p>



<h2 class="wp-block-heading">Common Mistakes</h2>



<p class="wp-block-paragraph">The mistakes below show up across all the hazard and risk analysis methods covered in this article. They&#8217;re field-grounded patterns, not generic warnings about following the standard.</p>



<p class="wp-block-paragraph"><strong>1. Doing a high-quality assessment on a hazard while forgetting that not all hazards have been identified.</strong></p>



<p class="wp-block-paragraph">A specialty chemical facility decides to do a quantitative analysis on a 50,000-gallon flammable solvent tank. The team builds a rigorous fault tree on loss of containment, an event tree on ignition and consequences, and a bowtie that ties it all together. The analysis is technically excellent. A SIL is assigned to the overfill protection SIF.</p>



<p class="wp-block-paragraph">Six months later, an operator opens a sample valve that sticks open. Several thousand gallons release into a containment area not designed for that volume, and an uncoordinated maintenance activity provides the ignition source. None of it was in the analysis. Nobody ever did systematic hazard identification. The team jumped straight to &#8220;tank fire&#8221; as the assumed hazard and built outward.</p>



<p class="wp-block-paragraph">FTA and ETA are powerful at analyzing hazards you&#8217;ve already named. They cannot tell you what hazards you forgot to name. That&#8217;s HAZOP&#8217;s job.</p>



<p class="wp-block-paragraph"><strong>2. Mixing LOPA and risk graph on the same scenario set.</strong> Two methods occupying the same workflow slot produces conflicting answers and unauditable documentation.</p>



<p class="wp-block-paragraph"><strong>3. Crediting IPLs in LOPA without independence and auditability discipline.</strong> Loose IPL crediting inflates risk reduction credit and leaves real gaps. Independence, auditability, and access integrity have to be earned, not assumed.</p>



<p class="wp-block-paragraph"><strong>4. Drawing a bowtie without the underlying analysis.</strong> A diagram with barriers in marker is a communication tool, not an assessment.</p>



<p class="wp-block-paragraph"><strong>5. Defaulting to whatever the site has always done.</strong> Methodology should match process complexity and scenario severity, not just inherit.</p>



<p class="wp-block-paragraph"><strong>6. Treating FTA and ETA as substitutes for HAZOP.</strong> They analyze hazards deeply. They don&#8217;t discover them.</p>



<h2 class="wp-block-heading">Q&#038;A</h2>



<h3 class="wp-block-heading">Q1. I sometimes hear PHA and sometimes H&#038;RA. Are they the same thing?</h3>



<p class="wp-block-paragraph">PHA is the umbrella term most practitioners use, particularly in CCPS-influenced documentation, and usually refers to the hazard identification activity specifically. H&#038;RA is the IEC 61511 phrasing for the full activity covering both identification and assessment. Practitioners sometimes use PHA loosely to mean the whole exercise, which is part of where the confusion comes from.</p>



<h3 class="wp-block-heading">Q2. How do I know if my site is using the wrong H&#038;RA method?</h3>



<p class="wp-block-paragraph">This is a hard question and often needs the judgment of experienced professionals to answer well. Some clues:</p>



<ul class="wp-block-list">

<li><strong>Mismatch:</strong> a small facility running a full quantitative program for routine scenarios is using more methodology than the risk profile warrants; a complex high-consequence facility relying on a calibrated risk matrix for SIL determination is using less methodology than the risk profile warrants</li>


<li><strong>Inconsistency:</strong> scenarios assessed by different methods with no documented basis for the choice, or IPLs credited differently across similar scenarios</li>

</ul>



<h3 class="wp-block-heading">Q3. Can FTA and ETA satisfy IEC 61511&#8217;s H&#038;RA requirement on their own?</h3>



<p class="wp-block-paragraph">No. FTA and ETA take hazardous events as a design input. The methods cannot generate the events themselves. Something upstream has to identify them, which is the work of a hazard identification method like HAZOP. Without that step, the quantitative analysis is rigorous about whatever events the team happened to think of, and silent about everything else.</p>



<h3 class="wp-block-heading">Q4. Is risk graph an acceptable method, or do I have to use LOPA?</h3>



<p class="wp-block-paragraph">Risk graph is a fully legitimate IEC 61511-3 method. The standard does not mandate LOPA. The right choice depends on facility context. Risk graph is common in European-influenced practice, oil &#038; gas globally, and machinery safety; LOPA dominates in many regulatory contexts, particularly under PSM/RMP. What&#8217;s not acceptable is mixing the two on the same scenario set.</p>



<h3 class="wp-block-heading">Q5. My boss says HAZOP takes too many people for too many days and costs too much. Why can&#8217;t we just do a What-If/Checklist?</h3>



<p class="wp-block-paragraph">Sometimes you can. What-If/Checklist is a legitimate hazard identification method, accepted under OSHA PSM, COMAH, Seveso, and similar frameworks. It&#8217;s the right tool for smaller, simpler, or batch processes, for MOC reviews, and for revalidations of well-understood operations. The choice should be based on process complexity and consequence severity, not on cost alone. Cost is a real factor, and not every scope warrants a full HAZOP. As process complexity, scenario count, or consequence severity increase, the cost-benefit shifts back toward HAZOP. Using What-If/Checklist on a scope that genuinely needs HAZOP is how facilities miss hazards that come back to bite them later.</p>



<h3 class="wp-block-heading">Q6. Where does LOPA actually sit in the IEC 61511 life-cycle, Clause 8 or Clause 9?</h3>



<p class="wp-block-paragraph">Both. The analytical work is risk assessment under Clause 8. The output (required risk reduction expressed as a SIL) is consumed by Clause 9 for protection layer allocation. LOPA spans the boundary, which is why it&#8217;s described both as an H&#038;RA method and a SIL determination method.</p>



<h3 class="wp-block-heading">Q7. Do I need a bowtie for every hazardous event?</h3>



<p class="wp-block-paragraph">No. Bowtie is most valuable for high-consequence scenarios where barrier management and stakeholder communication justify the effort. For routine scenarios handled in a HAZOP and LOPA workflow, a bowtie adds little beyond what the LOPA worksheet documents.</p>



<h2 class="wp-block-heading">Further Reading</h2>



<ul class="wp-block-list">

<li>SIL Safe: <a href="https://silsafe.net/hazard-and-risk-assessment-hra/">Hazard and Risk Assessment (H&#038;RA): The Foundation of Functional Safety</a></li>


<li>SIL Safe: <a href="https://silsafe.net/layer-of-protection-analysis-lopa/">Layer of Protection Analysis (LOPA): The Engineer&#8217;s Guide to SIL Selection</a></li>


<li><a href="https://webstore.iec.ch/en/publication/25480" target="_blank" rel="noopener">IEC 61511-3:2016</a>: informative annex on risk assessment methods</li>


<li>CCPS, <a href="https://www.aiche.org/resources/publications/books/guidelines-hazard-evaluation-procedures-3rd-edition" target="_blank" rel="noopener"><em>Guidelines for Hazard Evaluation Procedures, 3rd Edition</em></a></li>


<li>CCPS, <a href="https://ccps.aiche.org/publications/books/layer-protection-analysis-simplified-process-risk-assessment" target="_blank" rel="noopener"><em>Layer of Protection Analysis: Simplified Process Risk Assessment</em></a></li>


<li><a href="https://www.osha.gov/laws-regs/regulations/standardnumber/1910/1910.119" target="_blank" rel="noopener">OSHA PSM 29 CFR 1910.119</a>: PHA requirements for US-regulated facilities (paragraph (e))</li>


<li><a href="https://www.hse.gov.uk/comah/index.htm" target="_blank" rel="noopener">HSE COMAH guidance</a>: UK regulatory framework for major accident hazards</li>

</ul>



<p class="wp-block-paragraph">Functional safety is complex, and the stakes are high. If you have questions about your SIS design, SIL verification, or where to start with IEC 61511-1, the team at SIL Safe is here to help. Reach out to us today.</p>



<script type="application/ld+json">
{
  "@context": "https://schema.org",
  "@type": "FAQPage",
  "mainEntity": [
    {
      "@type": "Question",
      "name": "I sometimes hear PHA and sometimes H&RA. Are they the same thing?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "PHA is the umbrella term most practitioners use, particularly in CCPS-influenced documentation, and usually refers to the hazard identification activity specifically. H&RA is the IEC 61511 phrasing for the full activity covering both identification and assessment. Practitioners sometimes use PHA loosely to mean the whole exercise, which is part of where the confusion comes from."
      }
    },
    {
      "@type": "Question",
      "name": "How do I know if my site is using the wrong H&RA method?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "This is a hard question and often needs the judgment of experienced professionals to answer well. Some clues: Mismatch — a small facility running a full quantitative program for routine scenarios is using more methodology than the risk profile warrants; a complex high-consequence facility relying on a calibrated risk matrix for SIL determination is using less methodology than the risk profile warrants. Inconsistency — scenarios assessed by different methods with no documented basis for the choice, or IPLs credited differently across similar scenarios."
      }
    },
    {
      "@type": "Question",
      "name": "Can FTA and ETA satisfy IEC 61511's H&RA requirement on their own?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "No. FTA and ETA take hazardous events as a design input. The methods cannot generate the events themselves. Something upstream has to identify them, which is the work of a hazard identification method like HAZOP. Without that step, the quantitative analysis is rigorous about whatever events the team happened to think of, and silent about everything else."
      }
    },
    {
      "@type": "Question",
      "name": "Is risk graph an acceptable method, or do I have to use LOPA?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Risk graph is a fully legitimate IEC 61511-3 method. The standard does not mandate LOPA. The right choice depends on facility context. Risk graph is common in European-influenced practice, oil & gas globally, and machinery safety; LOPA dominates in many regulatory contexts, particularly under PSM/RMP. What's not acceptable is mixing the two on the same scenario set."
      }
    },
    {
      "@type": "Question",
      "name": "My boss says HAZOP takes too many people for too many days and costs too much. Why can't we just do a What-If/Checklist?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Sometimes you can. What-If/Checklist is a legitimate hazard identification method, accepted under OSHA PSM, COMAH, Seveso, and similar frameworks. It's the right tool for smaller, simpler, or batch processes, for MOC reviews, and for revalidations of well-understood operations. The choice should be based on process complexity and consequence severity, not on cost alone. Cost is a real factor, and not every scope warrants a full HAZOP. As process complexity, scenario count, or consequence severity increase, the cost-benefit shifts back toward HAZOP. Using What-If/Checklist on a scope that genuinely needs HAZOP is how facilities miss hazards that come back to bite them later."
      }
    },
    {
      "@type": "Question",
      "name": "Where does LOPA actually sit in the IEC 61511 life-cycle, Clause 8 or Clause 9?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Both. The analytical work is risk assessment under Clause 8. The output (required risk reduction expressed as a SIL) is consumed by Clause 9 for protection layer allocation. LOPA spans the boundary, which is why it's described both as an H&RA method and a SIL determination method."
      }
    },
    {
      "@type": "Question",
      "name": "Do I need a bowtie for every hazardous event?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "No. Bowtie is most valuable for high-consequence scenarios where barrier management and stakeholder communication justify the effort. For routine scenarios handled in a HAZOP and LOPA workflow, a bowtie adds little beyond what the LOPA worksheet documents."
      }
    }
  ]
}
</script>

]]></content:encoded>
					
					<wfw:commentRss>https://silsafe.net/hazard-and-risk-analysis-methods/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>Layer of Protection Analysis (LOPA): The Engineer&#8217;s Guide to SIL Selection</title>
		<link>https://silsafe.net/layer-of-protection-analysis-lopa/</link>
					<comments>https://silsafe.net/layer-of-protection-analysis-lopa/#respond</comments>
		
		<dc:creator><![CDATA[mamerten]]></dc:creator>
		<pubDate>Sat, 25 Apr 2026 17:57:26 +0000</pubDate>
				<category><![CDATA[Beginner]]></category>
		<guid isPermaLink="false">https://silsafe.net/?p=6422</guid>

					<description><![CDATA[LOPA — layer of protection analysis — is the most widely used method for SIL selection in the process industry. This guide covers how it works, where it fits in the IEC 61511 life-cycle, and two fully worked examples from a natural gas processing facility.]]></description>
										<content:encoded><![CDATA[
<p class="wp-block-paragraph">In the process industry, determining how much risk reduction a safety instrumented system (SIS) needs to provide is one of the most consequential decisions in SIS design. Layer of protection analysis, or LOPA, is the method most facilities use to make that determination. It is the most widely used safety integrity level (SIL) selection method in the process industry, and this article explains how it works, where it fits in the IEC 61511 safety life-cycle, and what it looks like applied to real scenarios at a natural gas processing facility.For the article schema what&#8217;s confusing is that</p>



<p class="wp-block-paragraph">LOPA can be conducted as a semi-quantitative or quantitative analysis. The vast majority of LOPA studies — roughly 80% — are semi-quantitative.</p>



<h2 class="wp-block-heading">LOPA Key Terms</h2>



<p class="wp-block-paragraph">These terms appear throughout every LOPA study — precision matters.</p>



<p class="wp-block-paragraph"><strong>Initiating event</strong> — the specific failure or error that starts the hazard sequence, such as a pump trip, a valve failing open, or an operator error.</p>



<p class="wp-block-paragraph"><strong>Independent protection layer (IPL)</strong> — a safeguard capable of preventing the hazard from escalating to a consequence, provided it meets all four criteria: specific, independent, dependable, and auditable.</p>



<p class="wp-block-paragraph"><strong>Risk reduction factor (RRF)</strong> — a measure of how much a protection layer reduces risk. An IPL with an RRF of 100 prevents the hazardous outcome on 99 out of 100 demands. RRF and probability of failure on demand (PFDavg) are inverses: RRF = 1 / PFDavg.</p>



<p class="wp-block-paragraph"><strong>Tolerable risk</strong> — the maximum frequency of a hazardous event the facility has determined is acceptable. LOPA compares mitigated event frequency against this threshold.</p>



<p class="wp-block-paragraph"><em>Additional terms specific to quantitative LOPA are introduced later in this article.</em></p>



<h2 class="wp-block-heading">How a LOPA Scenario Works</h2>



<p class="wp-block-paragraph">LOPA determines how much risk reduction is needed for each specific hazard scenario — not a facility-wide number. Each scenario is analyzed individually, which is what makes LOPA more rigorous than a risk graph or qualitative approach. The structure is simple: one initiating event, one consequence. That narrow scope is intentional.</p>



<p class="wp-block-paragraph">The core equation is:</p>



<p class="wp-block-paragraph">Mitigated Event Frequency = Initiating Event Frequency × (PFD of IPL-1) × (PFD of IPL-2) × &#8230; × modifiers</p>



<p class="wp-block-paragraph">The mitigated event frequency is compared to the facility&#8217;s tolerable risk threshold. If the mitigated frequency falls below the target, no additional protection is needed. If a gap remains, the required RRF is calculated and a SIL target assigned to close it. LOPA establishes the target — it does not calculate whether the SIS design achieves it. That is the job of SIL verification, which comes later in the life-cycle.</p>



<p class="wp-block-paragraph">LOPA was formalized by the Center for Chemical Process Safety (CCPS) in their 2001 book <em>Layer of Protection Analysis: Simplified Process Risk Assessment</em>. IEC 61511-3, Annex F (informative) incorporates it as a recognized semi-quantitative method within the standard.</p>



<h3 class="wp-block-heading">Initiating Events</h3>



<p class="wp-block-paragraph">Before evaluating whether safeguards are sufficient, a credible frequency for the initiating event is needed. Typical frequencies used in semi-quantitative LOPA:</p>



<ul class="wp-block-list">
<li>Control valve fails open or closed: 1E-1 per year</li>



<li>Pump trip (loss of flow): 1E-1 per year</li>



<li>Operator error on a routine task: 1E-2 per year</li>



<li>Cooling water failure: 1E-1 per year</li>



<li>Small bore pipe rupture: 1E-3 per year</li>
</ul>



<p class="wp-block-paragraph">These are starting points. Facility-specific data and operating history should inform what is reasonable for a given site.</p>



<p class="wp-block-paragraph">One important point: the basic process control system (BPCS) can itself be the initiating event. If a BPCS pressure control loop failure causes an overpressure scenario, that same loop cannot be claimed as an IPL for that scenario — it is not independent of the cause.</p>



<h3 class="wp-block-heading">Independent Protection Layers (IPLs)</h3>



<p class="wp-block-paragraph">To qualify as an IPL, a safeguard must satisfy all four criteria — and all four means all four:</p>



<ul class="wp-block-list">
<li><strong>Specific</strong> — capable of preventing the specific consequence being analyzed</li>



<li><strong>Independent</strong> — functions independently of the initiating event and of other IPLs claimed in the same scenario</li>



<li><strong>Dependable</strong> — performs with the reliability assumed in the analysis</li>



<li><strong>Auditable</strong> — supported by documentation proving it exists, works, and is maintained</li>
</ul>



<p class="wp-block-paragraph">Common IPL types and typical risk reduction credit in semi-quantitative LOPA:</p>



<ul class="wp-block-list">
<li>BPCS control loop: RRF of 10</li>



<li>Operator response to alarm: RRF of 10, with sufficient response time and documented procedures</li>



<li>Pressure safety valve (PSV): RRF of 10 to 100, depending on sizing and testing for the specific scenario</li>



<li>Safety instrumented function (SIF): RRF varies by SIL — this is what LOPA is sizing</li>
</ul>



<p class="wp-block-paragraph"><strong>The BPCS as an IPL: why RRF 10 is the limit</strong></p>



<p class="wp-block-paragraph">The cap of RRF 10 for a BPCS loop is based on real-world data. A control loop has many ways to fail beyond the primary sensing element — control valve degradation, signal wiring faults, software changes, maintenance errors, the loop being placed in manual. When all failure modes across the full loop are accounted for, historical plant data consistently supports an RRF of 10 as a reasonable and defensible credit. Claiming more is not supported by what facilities actually see in the field.</p>



<p class="wp-block-paragraph"><strong>What disqualifies a safeguard</strong></p>



<ul class="wp-block-list">
<li>A high-level alarm tied to the same transmitter as the control loop that caused the initiating event — fails independence</li>



<li>A spare pump with no documented proof test procedure — fails dependability and auditability</li>



<li>An operator response procedure requiring 45 minutes when the scenario allows 20 minutes — fails specificity</li>
</ul>



<h3 class="wp-block-heading">Enabling Conditions and Conditional Modifiers</h3>



<p class="wp-block-paragraph">Enabling conditions and conditional modifiers can legitimately reduce the calculated demand frequency or consequence likelihood in a LOPA scenario. An <strong>enabling condition</strong> is something that must be present for the consequence to occur but does not itself cause the initiating event — the presence of an ignition source, or a person in the hazard zone. A <strong>conditional modifier</strong> is a probability factor applied to account for conditions that affect whether the initiating event leads to the full consequence, such as probability of ignition or occupancy probability.</p>



<p class="wp-block-paragraph">In semi-quantitative LOPA, these are typically embedded in the initiating event frequency or handled with conservative round numbers. The risk of abuse is real: stacking multiple modifiers without documented evidence produces an analysis that is optimistic rather than conservative.</p>



<h2 class="wp-block-heading">Semi-Quantitative vs. Quantitative LOPA</h2>



<p class="wp-block-paragraph">Semi-quantitative LOPA uses order-of-magnitude estimates for initiating event frequencies and IPL PFD values, drawn from standardized tables rather than facility-specific reliability data. It is conservative by design and appropriate for the data quality available at most facilities.</p>



<p class="wp-block-paragraph">Quantitative LOPA assigns precise numerical values to each input and applies conditional modifiers explicitly as documented multipliers. The following terms are specific to quantitative LOPA and are unlikely to appear on a standard semi-quantitative worksheet:</p>



<ul class="wp-block-list">
<li><strong>Implicit vs. explicit modifiers</strong> — a modifier is implicit when its effect is already embedded in the source data; explicit when it is a named, separate multiplier on the worksheet</li>



<li><strong>Operational factor</strong> — the fraction of time the process is running in the hazardous state (e.g., a unit running 6 months per year applies a factor of 0.5 to the initiating event frequency)</li>



<li><strong>Use factor / occupancy factor</strong> — the fraction of time a person is present in the hazard zone, used to probability-weight a consequence</li>
</ul>



<p class="wp-block-paragraph">Mixing the two approaches is a common mistake — applying precise numerical modifiers in an otherwise semi-quantitative study creates false precision and can produce unconservative results. A future article will cover quantitative LOPA in depth.</p>



<h2 class="wp-block-heading">Where LOPA Fits in the IEC 61511 Safety Life-Cycle</h2>



<p class="wp-block-paragraph">IEC 61511 requires a documented, defensible basis for every SIL target. Layer of protection analysis is how most facilities satisfy that requirement. The standard does not mandate LOPA specifically — several methods are acceptable — but LOPA dominates in practice.</p>



<p class="wp-block-paragraph">Note that where LOPA sits can confuse practitioners as it is at the boundary between two phases of the IEC 61511 life-cycle. The scenario evaluation, IPL crediting, and risk gap calculation are Clause 8 risk H&amp;RA (specifically on the &#8220;A&#8221; side), while the resulting required risk reduction is consumed by Clause 9 when that risk reduction is allocated to a SIF and the SIL is assigned.</p>



<ol class="wp-block-list"></ol>



<h3 class="wp-block-heading">What Comes Before: The &#8220;H&#8221; of the H&amp;RA</h3>



<p class="wp-block-paragraph">The hazard and risk assessment (H&amp;RA) is a mandatory IEC 61511 life-cycle activity. The HAZOP (Hazard and Operability Study) is the most common method used to carry it out the &#8220;H&#8221; portion. The term process hazard analysis (PHA) refers to the same concept in OSHA PSM and EPA RMP contexts. The H&amp;RA identifies hazard scenarios, defines consequences, and establishes unmitigated risk.  LOPA does not re-identify hazards (the &#8220;H&#8221; term); it takes the HAZOP output and determines whether existing protection is sufficient.</p>



<p class="wp-block-paragraph">HAZOP and LOPA are often conducted together in the same study or meeting, and some facilities call the entire exercise a &#8220;HAZOP&#8221; even when LOPA is embedded within it. If a study only lists safeguards without comparing them to a tolerable risk threshold, it is a HAZOP — not a LOPA, regardless of what it was called.</p>



<h3 class="wp-block-heading">What Comes After: SIS Design Begins</h3>



<p class="wp-block-paragraph">The SIL target produced by LOPA is the starting point for SIS design. From there, engineers move into the full design process — developing the safety requirements specification (SRS), working through conceptual and detailed design, selecting equipment, performing SIL verification, and ultimately installing and validating the system. PFDavg calculations, spurious trip rate (STR) analysis, proof test interval (TI) optimization, and common cause failure (CCF) evaluation are all part of that design and verification work. LOPA hands off a SIL target; everything that follows is about proving the design meets it.</p>



<p class="wp-block-paragraph"><strong>A note on ALARP</strong>: LOPA establishes the minimum required risk reduction. As low as reasonably practicable (ALARP) is a separate obligation that asks whether risk can be reduced further at a reasonable cost — it is not a basis for doing less than LOPA requires. Facilities subject to ALARP obligations should treat LOPA output as the starting point, not the conclusion.</p>



<h2 class="wp-block-heading">Other SIL Selection Methods — and Why LOPA Dominates</h2>



<p class="wp-block-paragraph">IEC 61511 accepts several SIL selection methods. The right choice depends on the facility, available data, and scenario complexity.</p>



<p class="wp-block-paragraph"><strong>Risk graph</strong> — a parameter-based method that works through structured questions about consequence severity, occupancy, avoidability, and demand rate to arrive at a SIL target. Used in some sectors and regulatory contexts; works well when properly calibrated to the facility&#8217;s risk tolerance.</p>



<p class="wp-block-paragraph"><strong>Fault tree analysis (FTA) / quantitative risk analysis (QRA)</strong> — the most rigorous end of the spectrum. Appropriate when consequence modeling is required (toxic dispersion, blast overpressure) or when regulatory requirements demand precise probabilistic treatment.</p>



<p class="wp-block-paragraph">LOPA dominates because it fills the practical gap between a qualitative approach and a full QRA — rigorous enough to be defensible, practical enough to execute without a specialized modeling team. That said, a risk graph may serve better when a facility has built its procedure around one and knows how to use it well, and a full QRA may be necessary when consequence modeling is non-negotiable.</p>



<h2 class="wp-block-heading">LOPA in Practice — Two Examples</h2>



<h3 class="wp-block-heading">Setting the Scene</h3>



<p class="wp-block-paragraph">The facility is a mid-size natural gas processing plant subject to OSHA PSM. It processes raw natural gas to remove liquids, water, and acid gases (H₂S and CO₂) before pipeline delivery. Both examples are set in the amine treating unit — a section of the plant that uses a liquid amine solution to absorb H₂S from the gas stream through direct gas-liquid contact.</p>



<p class="wp-block-paragraph">The facility&#8217;s tolerable risk criteria:</p>



<ul class="wp-block-list">
<li>Potential single fatality: 1E-4 per year</li>



<li>Potential multiple fatalities: 1E-5 per year</li>
</ul>



<h3 class="wp-block-heading">Example 1: Existing Safeguards Are Sufficient — No SIF Required</h3>



<p class="wp-block-paragraph"><strong>Scenario</strong></p>



<p class="wp-block-paragraph">The gas-liquid contact vessel in the amine treating unit operates at elevated pressure. A control valve failing open on the high-pressure gas inlet could allow pressure to build beyond the normal operating range, leading to a flange leak or seal failure and H₂S release near the unit. Consequence: potential single fatality.</p>



<p class="wp-block-paragraph"><strong>Initiating event:</strong> Control valve fails open — 1E-1 per year</p>



<p class="wp-block-paragraph"><strong>Tolerable risk target:</strong> 1E-4 per year (single fatality)</p>



<p class="wp-block-paragraph"><strong>Required RRF before IPLs:</strong> 1E-1 / 1E-4 = 1,000</p>



<p class="wp-block-paragraph"><strong>IPL evaluation</strong></p>



<p class="wp-block-paragraph"><em>IPL 1 — BPCS pressure control loop</em> The BPCS loop is independent of the failed inlet valve. Functional, tested, and documented. Credit: RRF = 10</p>



<p class="wp-block-paragraph"><em>IPL 2 — High-pressure alarm with operator response</em> The alarm activates at 110% of normal operating pressure on a separate transmitter. The operator has a documented response procedure and approximately 25 minutes to respond before a credible consequence — sufficient for trained response. Credit: RRF = 10</p>



<p class="wp-block-paragraph"><em>IPL 3 — Pressure safety valve</em> The PSV is sized for the credible overpressure load from this specific scenario, including the valve fail-open case. Bench tested annually with documentation on file. Credit: RRF = 100</p>



<p class="wp-block-paragraph"><strong>Mitigated event frequency</strong></p>



<p class="wp-block-paragraph">1E-1 × (1/10) × (1/10) × (1/100) = 1E-5 per year</p>



<p class="wp-block-paragraph">Achieved RRF = 1E-1 / 1E-5 = 10,000</p>



<p class="wp-block-paragraph"><strong>Result:</strong> The mitigated event frequency of 1E-5 per year is below the tolerable risk target of 1E-4 per year for a single fatality event. <strong>No SIF is required.</strong></p>



<h3 class="wp-block-heading">Example 2: Existing Safeguards Fall Short — SIL 2 SIF Required</h3>



<p class="wp-block-paragraph"><strong>Scenario</strong></p>



<p class="wp-block-paragraph">The amine regenerator column — the vessel that thermally strips H₂S from the amine solution for reuse — operates at elevated temperature. Loss of cooling water to the overhead condenser allows column pressure to rise. If pressure exceeds the column design rating, a catastrophic release of hot amine and H₂S-bearing vapor is possible. Consequence: potential multiple fatalities.</p>



<p class="wp-block-paragraph"><strong>Initiating event:</strong> Cooling water failure — 1E-2 per year</p>



<p class="wp-block-paragraph"><strong>Tolerable risk target:</strong> 1E-5 per year (multiple fatalities)</p>



<p class="wp-block-paragraph"><strong>Required RRF before IPLs:</strong> 1E-2 / 1E-5 = 1,000</p>



<p class="wp-block-paragraph"><strong>IPL evaluation</strong></p>



<p class="wp-block-paragraph"><em>IPL 1 — BPCS pressure control loop</em> The BPCS loop is independent of the cooling water failure. Functional, tested, and documented. Credit: RRF = 10</p>



<p class="wp-block-paragraph"><em>Candidate IPL — High-pressure alarm with operator response</em> A high-pressure alarm is present. However, the response procedure requires approximately 18 minutes to complete, and the time available from alarm to potential consequence — based on process dynamics — is 12 minutes. The response time does not fit the scenario. <strong>This safeguard does not qualify as an IPL.</strong></p>



<p class="wp-block-paragraph"><em>Candidate IPL — Pressure safety valve</em> A PSV is installed on the column but was sized for a blocked outlet case. Documentation confirming adequacy for the loss-of-cooling scenario is not available. <strong>This safeguard does not qualify as an IPL for this scenario as evaluated.</strong></p>



<p class="wp-block-paragraph"><strong>Mitigated event frequency</strong></p>



<p class="wp-block-paragraph">1E-2 × (1/10) = 1E-3 per year</p>



<p class="wp-block-paragraph">Achieved RRF = 1E-2 / 1E-3 = 10</p>



<p class="wp-block-paragraph"><strong>Gap analysis</strong></p>



<p class="wp-block-paragraph">Required RRF: 1,000 | Achieved RRF: 10 | Remaining gap: RRF of 100</p>



<p class="wp-block-paragraph">An RRF of 100 falls within the SIL 2 range (RRF 100 to 1,000). <strong>A SIL 2 SIF is required.</strong></p>



<p class="wp-block-paragraph"><strong>Result:</strong> The mitigated event frequency of 1E-3 per year does not meet the tolerable risk target of 1E-5 per year for a multiple fatality event. Existing safeguards are insufficient. The LOPA identifies a SIL 2 SIF as the minimum required protection layer.</p>



<p class="wp-block-paragraph">This example also surfaces two recommendations: confirm whether the operator response procedure can be shortened — if the time gap closes, that safeguard may qualify as an IPL and reduce the SIL requirement. And confirm PSV sizing for this specific relief scenario — if adequate, it may also qualify as an IPL.</p>



<h2 class="wp-block-heading">Common LOPA Mistakes</h2>



<p class="wp-block-paragraph"><strong>Claiming IPL credit for untested or undocumented safeguards.</strong> If there is no proof test record and no written procedure, the safeguard fails the auditable criterion. No documentation means no credit, regardless of whether the safeguard is functioning.</p>



<p class="wp-block-paragraph"><strong>Double-counting BPCS-based safeguards.</strong> A BPCS pressure control loop and a BPCS high-pressure alarm on the same transmitter are not two independent IPLs. Independence means independence all the way through — sensing element, signal path, and control action.</p>



<p class="wp-block-paragraph"><strong>Misidentifying available operator response time.</strong> Response time in LOPA is the window between alarm activation and the point at which the consequence becomes unavoidable — not the total time available to complete a recovery procedure, and not the time from process deviation to alarm.</p>



<p class="wp-block-paragraph"><strong>Failing to evaluate IPL validity for each initiating event independently.</strong> An IPL that qualifies for one scenario may not qualify for another — this check must be done scenario by scenario, not at the system level. More critically, an IPL&#8217;s own failure mode can itself become an initiating event on a different accident path, a cross-scenario interaction that is easy to miss.</p>



<p class="wp-block-paragraph"><strong>Mixing semi-quantitative and quantitative practices.</strong> Applying precise numerical modifiers — occupancy factors, operational factors — in an otherwise semi-quantitative study creates false precision. If the data doesn&#8217;t support the specificity, the result can be unconservative.</p>



<p class="wp-block-paragraph"><strong>Not revisiting LOPA when the process changes.</strong> This is fundamentally a management of change (MOC) failure. Changes to operating conditions, throughput, equipment, or chemicals can invalidate initiating event frequencies and IPL credits — but if LOPA isn&#8217;t triggered as part of the MOC review, those changes go unexamined. LOPA is a point-in-time analysis. The MOC process is what keeps it current.</p>



<h2 class="wp-block-heading">LOPA Q&amp;A</h2>



<p class="wp-block-paragraph"><strong>Why do some facilities call it a HAZOP when they&#8217;re also doing LOPA?</strong></p>



<p class="wp-block-paragraph">Two reasons.  First, a HAZOP itself is treated slightly differently between facilities, some facilities go beyond the identification (the &#8220;H&#8221;) and do assessment activities.  It is not a consistent term. </p>



<p class="wp-block-paragraph">Second, because the two processes are often conducted together and many facilities never distinguished them in their terminology. A HAZOP team identifies scenarios and then — sometimes in the same meeting — evaluates whether safeguards are sufficient using LOPA methodology. The distinction matters because the two produce different outputs. A HAZOP identifies and characterizes scenarios; LOPA evaluates whether those scenarios require a SIF and at what SIL.</p>



<p class="wp-block-paragraph"><strong>I don&#8217;t see conditional modifiers on my LOPA worksheet. Does that mean something is wrong?</strong></p>



<p class="wp-block-paragraph">Not necessarily. In semi-quantitative LOPA, conditional modifiers are often implicit — their effect is already embedded in the initiating event frequencies or consequence categories being used, so they don&#8217;t appear as separate columns. In quantitative LOPA, these factors are explicit: each one is named, assigned a specific probability value, and multiplied through the calculation.</p>



<p class="wp-block-paragraph"><strong>Are there situations where LOPA is not the right choice?</strong></p>



<p class="wp-block-paragraph">In a few specific situations, another method is the better fit. For scenarios requiring consequence modeling — toxic dispersion, blast overpressure — LOPA doesn&#8217;t do that work; a full quantitative risk analysis (QRA) is needed. For highly complex scenarios with multiple interacting causes or common-cause dependencies, fault tree analysis can give a more accurate picture than LOPA&#8217;s simplified scenario structure. And for facilities where a risk graph has been well-calibrated and the team knows how to use it, that method may be entirely appropriate. The method should match the facility and the problem.</p>



<p class="wp-block-paragraph"><strong>Can an IPL be claimed for more than one scenario?</strong></p>



<p class="wp-block-paragraph">Yes, but each claim must be evaluated independently. A PSV that qualifies as an IPL for a blocked outlet overpressure scenario may or may not qualify for a fire case. The specificity criterion must be satisfied for every scenario in which an IPL is claimed.</p>



<p class="wp-block-paragraph"><strong>Does LOPA apply to continuous mode SIFs?</strong></p>



<p class="wp-block-paragraph">Standard LOPA methodology is designed for low-demand mode SIFs — functions called upon infrequently where PFDavg is the relevant performance measure. Continuous mode SIFs use probability of failure per hour (PFH) rather than PFDavg. Quantitative LOPA can be adapted for continuous mode by adjusting the variable in the calculation.</p>



<h2 class="wp-block-heading">Further Reading</h2>



<h3 class="wp-block-heading">External Links</h3>



<p class="wp-block-paragraph"><strong>Internal — related articles on this site:</strong></p>



<ul class="wp-block-list">

<li><a href="https://silsafe.net/garden-grove-chemical-incident/" data-type="post" data-id="6633">The Garden Grove Chemical Incident: What We Know So Far</a></li>

</ul>



<ul class="wp-block-list">
<li>CCPS <em>Layer of Protection Analysis: Simplified Process Risk Assessment</em> (2001) — <a href="https://www.aiche.org/ccps/resources/publications/books/layer-protection-analysis-simplified-process-risk-assessment" target="_blank" rel="noopener">aiche.org</a></li>



<li>IEC 61511-3 Annex F —<a href="https://webstore.iec.ch/en/publication/25480" target="_blank" rel="noopener"> iec.ch</a></li>



<li>University of Michigan Chemical Engineering – <strong><a href="https://safeche.engin.umich.edu/tutorials/lopa-tutorial/" target="_blank" rel="noopener">Layers of Protection</a></strong></li>



<li>Wikipedia article <strong><a href="https://en.wikipedia.org/wiki/Layers_of_protection_analysis" target="_blank" rel="noopener">Layer of Protection Analysis</a></strong></li>



<li>Sphera – <strong><a href="https://sphera.com/resources/glossary/what-is-the-difference-between-pha-hazop-lopa/" target="_blank" rel="noopener">What is the Difference Between PHA, HAZOP, &amp; LOPA</a></strong></li>



<li>Mary Kay O’Connor Process Safety Center – <strong><a href="https://risktec.tuv.com/wp-content/uploads/2018/10/on-the-use-of-lopa-and-risk-graphs-for-sil-determination-as-published-mkopsc-2014.pdf" target="_blank" rel="noopener">On the Use of LOPA and Risk Graphs for SIL Determination</a></strong></li>
</ul>



<p class="wp-block-paragraph">Functional safety is complex, and the stakes are high. If you have questions about your SIS design, SIL verification, or where to start with IEC 61511-1, the team at SIL Safe is here to help. Reach out to us today.</p>



<script type="application/ld+json">
{
  "@context": "https://schema.org",
  "@type": "FAQPage",
  "mainEntity": [
    {
      "@type": "Question",
      "name": "Why do some facilities call it a HAZOP when they're also doing LOPA?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Because the two are often conducted together and many facilities never distinguished them in their terminology. A HAZOP team identifies scenarios and then — sometimes in the same meeting — evaluates whether safeguards are sufficient using LOPA methodology. The distinction matters because the two produce different outputs. A HAZOP identifies and characterizes scenarios; LOPA evaluates whether those scenarios require a SIF and at what SIL."
      }
    },
    {
      "@type": "Question",
      "name": "I don't see conditional modifiers on my LOPA worksheet. Does that mean something is wrong?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Not necessarily. In semi-quantitative LOPA, conditional modifiers are often implicit — their effect is already embedded in the initiating event frequencies or consequence categories being used, so they don't appear as separate columns. In quantitative LOPA, these factors are explicit: each one is named, assigned a specific probability value, and multiplied through the calculation."
      }
    },
    {
      "@type": "Question",
      "name": "Are there situations where LOPA is not the right choice?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "In a few specific situations, another method is the better fit. For scenarios requiring consequence modeling — toxic dispersion, blast overpressure — LOPA doesn't do that work; a full quantitative risk analysis (QRA) is needed. For highly complex scenarios with multiple interacting causes or common-cause dependencies, fault tree analysis can give a more accurate picture than LOPA's simplified scenario structure. And for facilities where a risk graph has been well-calibrated and the team knows how to use it, that method may be entirely appropriate. The method should match the facility and the problem."
      }
    },
    {
      "@type": "Question",
      "name": "Can an IPL be claimed for more than one scenario?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Yes, but each claim must be evaluated independently. A PSV that qualifies as an IPL for a blocked outlet overpressure scenario may or may not qualify for a fire case. The specificity criterion must be satisfied for every scenario in which an IPL is claimed."
      }
    },
    {
      "@type": "Question",
      "name": "Does LOPA apply to continuous mode SIFs?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Standard LOPA methodology is designed for low-demand mode SIFs — functions called upon infrequently where PFDavg is the relevant performance measure. Continuous mode SIFs use probability of failure per hour (PFH) rather than PFDavg. Quantitative LOPA can be adapted for continuous mode by adjusting the variable in the calculation."
      }
    }
  ]
}
</script>



<p class="wp-block-paragraph"></p>
]]></content:encoded>
					
					<wfw:commentRss>https://silsafe.net/layer-of-protection-analysis-lopa/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>Hazard and Risk Assessment (H&#038;RA): The Foundation of Functional Safety</title>
		<link>https://silsafe.net/hazard-and-risk-assessment-hra/</link>
					<comments>https://silsafe.net/hazard-and-risk-assessment-hra/#respond</comments>
		
		<dc:creator><![CDATA[mamerten]]></dc:creator>
		<pubDate>Sat, 04 Apr 2026 21:07:09 +0000</pubDate>
				<category><![CDATA[Beginner]]></category>
		<guid isPermaLink="false">https://silsafe.net/?p=6213</guid>

					<description><![CDATA[A hazard and risk assessment (H&#038;RA) is the foundation of every IEC 61511 safety case. This guide covers what it is, how it works, the methodologies used to conduct one, and what PHA means in the context of OSHA PSM and EPA RMP — written for process safety engineers and functional safety practitioners.]]></description>
										<content:encoded><![CDATA[
<p class="wp-block-paragraph">If there is one activity in the functional safety life-cycle that sets the tone for everything that follows, it is the Hazard &amp; Risk Assessment (H&amp;RA). Get it right and you have a solid technical foundation for your Safety Instrumented System (SIS). Get it wrong — or skip it — and every decision downstream is built on sand.</p>



<h2 class="wp-block-heading">What Is a Hazard &amp; Risk Assessment (H&amp;RA)?</h2>



<p class="wp-block-paragraph">A <strong>hazard</strong> is a physical situation with the potential to cause harm. A <strong>risk</strong> is what you get when you combine two things: the <em>probability</em> that the hazard leads to a harmful event, and the <em>severity</em> of the consequences. Risk is never just one of those dimensions. A high-severity outcome with negligible probability may be entirely tolerable. A low-severity outcome that happens constantly may not be. Both dimensions must be assessed together — always.</p>



<p class="wp-block-paragraph">The H&amp;RA is the structured process of identifying hazards, evaluating the associated risks, and determining whether those risks are tolerable. It is the foundation of the <strong>functional safety life-cycle</strong> — the end-to-end engineering process defined in IEC 61511 that governs how Safety Instrumented Systems are designed, implemented, operated, and maintained. Without the H&amp;RA, there is no technical basis for any of what follows.</p>



<p class="wp-block-paragraph"><strong>A note on terminology</strong> — you will encounter several names and acronyms for this activity depending on the standard or industry context:</p>



<ul class="wp-block-list">
<li><strong>H&amp;RA</strong> (Hazard and Risk Assessment) — the term used in IEC 61511</li>



<li><strong>HRA</strong> and <strong>HARA</strong> — also widely used within the functional safety community; same activity, different shorthand</li>



<li><strong>PHA</strong> (Process Hazard Analysis) — the equivalent term under the OSHA PSM regulation (29 CFR 1910.119) and the EPA RMP regulation; same concept, different regulatory language</li>



<li>Other safety disciplines — machinery safety, for example — may use different terms again</li>
</ul>



<p class="wp-block-paragraph">The variation is mostly regulatory and organizational preference. The underlying activity is the same.</p>



<h2 class="wp-block-heading">When a Hazard and Risk Assessment Fits in the IEC 61511 Safety Life-Cycle</h2>



<p class="wp-block-paragraph">A hazard and risk assessment must be conducted early in the safety life-cycle — specifically under Clause 8 — before SIS design begins.</p>



<p class="wp-block-paragraph">The accepted practice is to conduct the H&amp;RA when the <strong>P&amp;IDs (Piping and Instrumentation Diagrams) are at Rev 0</strong> — the point at which the process design is sufficiently defined to support a meaningful hazard study, but not so advanced that changes identified during the study become costly or impractical to implement. Too early and the hazard picture is incomplete; too late and the window to influence design has closed.</p>



<p class="wp-block-paragraph">A poorly timed H&amp;RA compounds every problem that follows.</p>



<h2 class="wp-block-heading">Key Steps in Conducting an H&amp;RA</h2>



<p class="wp-block-paragraph">The H&amp;RA is not a single task — it is a structured sequence of activities. The order matters.</p>



<p class="wp-block-paragraph"><strong>Step 1 – Determine tolerable risk.</strong> Before you can assess whether any risk is acceptable, you need to define what &#8220;acceptable&#8221; means for your organization. This benchmark must be established first. It governs every risk judgment that follows. (See the next section for details.)</p>



<p class="wp-block-paragraph"><strong>Step 2 – Define the scope and boundaries.</strong> What process units, equipment, and operating modes are included in this H&amp;RA? Scope creep and scope gaps are both problems. A clearly documented boundary prevents both.</p>



<p class="wp-block-paragraph"><strong>Step 3 – Identify hazards.</strong> What physical situations exist that have the potential to cause harm? This is where the structured identification methodology — HAZOP, What-If, or similar — is applied.</p>



<p class="wp-block-paragraph"><strong>Step 4 – Identify hazardous events and demand scenarios.</strong> A hazard becomes a hazardous <em>event</em> when a specific initiating cause triggers it — the conditions under which a safety function would be called upon to act.</p>



<p class="wp-block-paragraph"><strong>Step 5 – Assess consequences.</strong> For each hazardous event, what is the worst credible outcome? Consequences are typically assessed in terms of harm to people, environmental impact, and asset damage.</p>



<p class="wp-block-paragraph"><strong>Step 6 – Assess likelihood/frequency.</strong> How often is the hazardous event expected to occur, without any protection layers in place? This is the <em>unmitigated</em> or <em>inherent</em> demand rate.</p>



<p class="wp-block-paragraph"><strong>Step 7 – Identify the risk gap.</strong> With both consequence severity and likelihood established, the assessed risk can be compared against the tolerable risk criteria and the calibrated risk matrix. Where the assessed risk exceeds tolerable risk, a gap exists. That gap is the direct trigger for a SIS.</p>



<h2 class="wp-block-heading">Determining Tolerable Risk — The Foundation of Step 1</h2>



<p class="wp-block-paragraph">Under IEC 61511, tolerable risk is the level of risk accepted in a given context based on the current values of society. It is <em>not</em> zero risk — every industrial process carries some inherent risk, and the goal of IEC 61511 is to ensure that risk is <em>identified</em>, <em>evaluated</em>, and <em>reduced to a tolerable level</em>, not eliminated entirely. The instinct to demand &#8220;no risk&#8221; is understandable but neither achievable nor the intent of the standard.</p>



<p class="wp-block-paragraph">Tolerable risk criteria must be documented — a specific IEC 61511 requirement and a common gap at smaller PSM and RMP facilities. Without documented criteria, every risk judgment becomes too subjective and the study loses its technical defensibility.</p>



<p class="wp-block-paragraph">A <strong>calibrated risk matrix</strong> is the standard tool for establishing and communicating tolerable risk criteria. It is a specific type of risk matrix in which the axis boundaries are anchored to actual numerical frequency and consequence values — not vague qualitative descriptors like &#8220;frequent&#8221; or &#8220;catastrophic.&#8221; Calibration reduces subjectivity as much as possible, improves consistency across the study, and makes the risk criteria defensible under regulatory or third-party scrutiny. Larger organizations and multi-site facilities often maintain multiple calibrated risk matrices — one for each consequence category or tailored to specific process units. The PEAR Model, discussed further in the Related Topics section, provides a useful framework for structuring those consequence categories.</p>



<h2 class="wp-block-heading">The Risk Gap and the Case for a SIS</h2>



<p class="wp-block-paragraph">The risk gap is the difference between the <em>unmitigated risk</em> of a hazardous event and the <em>tolerable risk</em> threshold. When a gap exists — when the assessed risk exceeds what is tolerable — a protection layer is required.</p>



<p class="wp-block-paragraph">Protection layers can take many forms:</p>



<ul class="wp-block-list">
<li>Inherently safer design</li>



<li>Basic process control</li>



<li>Physical relief devices</li>



<li>Operator response</li>



<li>Safety Instrumented Functions (SIFs)</li>
</ul>



<p class="wp-block-paragraph">When a hazard cannot be reduced to a tolerable level through other means, it requires a <strong>Safety Instrumented Function (SIF)</strong> — a specific automated safety action that brings the process to a safe state in response to a defined demand. A process will typically have multiple hazards that each require their own SIF. All of those SIFs together constitute the <strong>Safety Instrumented System (SIS)</strong> — the SIS does not exist independently of the SIFs that define it; it is the sum of them.</p>



<p class="wp-block-paragraph">This is why the hazard and risk assessment is the primary input to SIS design. Without it, there is no basis for knowing which SIFs are needed or what they must do. The H&amp;RA outputs flow directly into other life-cycle documents, such as the Safety Requirements Specification (SRS).</p>



<h2 class="wp-block-heading">The Link Between H&amp;RA and SIL Determination</h2>



<p class="wp-block-paragraph">Identifying that a SIF is needed is only the first step. The H&amp;RA provides the information that a SIL allocation is required — the size and nature of the risk gap determines how much risk reduction each SIF must deliver.</p>



<p class="wp-block-paragraph"><strong>Safety Integrity Level (SIL)</strong> is a discrete measure of the required risk reduction performance of a SIF. IEC 61511 defines three SIL levels for the process sector — SIL 1, SIL 2, and SIL 3 — each representing an order-of-magnitude increase in performance. The required SIL is determined by the size of the risk gap: the larger the gap, the higher the SIL required to close it.</p>



<p class="wp-block-paragraph"><strong>Layer of Protection Analysis (LOPA)</strong> is the most widely used SIL determination methodology in the process industry.  Note though that where LOPA sits can confuse practitioners as it is at the boundary between two phases of the IEC 61511 life-cycle. The scenario evaluation, IPL crediting, and risk gap calculation are Clause 8 risk H&amp;RA (specifically on the &#8220;A&#8221; side of that), while the resulting required risk reduction is consumed by Clause 9 when that risk reduction is allocated to a SIF and the SIL is assigned.</p>



<h2 class="wp-block-heading">H&amp;RA Methodologies</h2>



<p class="wp-block-paragraph">IEC 61511 does not prescribe a specific H&amp;RA methodology. The right choice depends on process complexity, the stage of design, available data, and the level of rigor required. What the standard does require is that the methodology is appropriate and applied systematically.</p>



<p class="wp-block-paragraph">All H&amp;RA methodologies fall into one of three umbrella categories:</p>



<p class="wp-block-paragraph"><strong>Qualitative</strong> — judgment-based and descriptive. No numerical failure frequencies or consequence magnitudes are required. The output is a structured list of hazards, causes, consequences, and existing safeguards. Qualitative methods are the most widely used in the process industry for H&amp;RA purposes.</p>



<p class="wp-block-paragraph"><strong>Semi-quantitative</strong> — structured scoring or ranking. Numbers are used to characterize risk, but a full probabilistic analysis is not performed. The output provides more consistency and defensibility than a purely qualitative approach.</p>



<p class="wp-block-paragraph"><strong>Quantitative</strong> — numerical frequency and consequence analysis with full probabilistic treatment. The output is a calculated risk value that can be compared directly against a numerical tolerable risk criterion.</p>



<h3 class="wp-block-heading">Qualitative Methods</h3>



<p class="wp-block-paragraph"><strong>HAZID (Hazard Identification Study)</strong> is an upstream screening tool used to identify major hazards early in a project, before detailed design is available. It is typically applied at the conceptual or FEED stage and is less structured than a HAZOP. The primary reference standard is ISO 17776.</p>



<p class="wp-block-paragraph"><strong>HAZOP (Hazard and Operability Study)</strong> is the dominant H&amp;RA methodology in the process industry. It uses a systematic, guide-word-driven approach applied by a multi-disciplinary team to identify deviations from design intent and evaluate their causes and consequences. HAZOP produces a structured, auditable record that serves as the primary H&amp;RA documentation. The governing standard is IEC 61882.</p>



<p class="wp-block-paragraph"><strong>What-If Analysis</strong> is a less formal brainstorming technique useful for simpler processes, preliminary reviews, or as a supplement to a more structured study.</p>



<p class="wp-block-paragraph"><strong>FMEA (Failure Mode and Effects Analysis)</strong> examines individual components to identify failure modes and their system-level effects. It is a bottom-up analysis most commonly used in equipment-focused assessments.</p>



<h3 class="wp-block-heading">Semi-Quantitative Methods</h3>



<p class="wp-block-paragraph"><strong>Risk Graph</strong> is a structured method that uses defined parameters — consequence severity, occupancy, probability of avoiding harm, and demand rate — to assign a SIL target. It is widely used as an initial SIL targeting tool where a full LOPA is not warranted.</p>



<h3 class="wp-block-heading">Quantitative Methods</h3>



<p class="wp-block-paragraph"><strong>Event Tree Analysis (ETA)</strong> models the sequences of events following an initiating cause, branching at each point where a safety barrier succeeds or fails. It is used to quantify outcome frequencies and is commonly paired with fault tree analysis in broader QRA work.</p>



<p class="wp-block-paragraph"><strong>Quantitative Risk Analysis (QRA)</strong> is a formal methodology combining frequency analysis and consequence modeling to produce a quantified risk picture — typically expressed as individual risk contours or F-N curves. QRA draws on ETA, fault tree analysis, and dispersion analysis as inputs and is the most rigorous and resource-intensive approach.</p>



<p class="wp-block-paragraph"><strong>FMEDA (Failure Mode, Effects and Diagnostic Analysis)</strong> is primarily a <strong>manufacturer and device certification tool</strong> conducted against IEC 61508. It produces quantitative failure rate data — including dangerous undetected failure rate and diagnostic coverage — that process facilities consume from equipment supplier safety manuals rather than generating themselves.</p>



<h2 class="wp-block-heading">Related Topics and Tools</h2>



<p class="wp-block-paragraph"><strong>Bow-Tie Analysis</strong> places the top event at the center, with threat pathways on the left and consequence pathways on the right. Barriers appear on both sides. Bow-tie diagrams are effective for communicating H&amp;RA outputs to management, regulators, and operating teams.</p>



<p class="wp-block-paragraph"><strong>Fault Tree Analysis (FTA)</strong> is a top-down tool that works backward from an undesired top event to identify the combinations of failures that could cause it. FTA can be used qualitatively as a logic map or quantitatively with failure rate data. It is often paired with ETA in QRA studies.</p>



<p class="wp-block-paragraph"><strong>Dispersion Analysis</strong> models the physical spread of a hazardous material release — gas cloud, toxic plume, or flammable vapor — to quantify the geographic extent and frequency of harm. It is a quantitative calculation that supports consequence assessment and is an essential input to QRA at facilities handling hazardous materials.</p>



<p class="wp-block-paragraph"><strong>The PEAR Model</strong> structures consequence assessment around four dimensions: <em>People</em>, <em>Environment</em>, <em>Assets</em>, and <em>Reputation</em>. It is particularly relevant where tolerable risk criteria extend beyond personnel safety — and in practice, many facilities maintain a separate calibrated risk matrix for each PEAR dimension.</p>



<p class="wp-block-paragraph"><strong>Hazardous Area Classification</strong> is a discipline that should be completed prior to the H&amp;RA. The already-established hazardous areas (or zones) can serve as an independent protection layer and may be credited as such during the risk assessment process.</p>



<p class="wp-block-paragraph"><strong>Security Risk Assessment (SRA)</strong> is a parallel requirement under IEC 61511 Clause 8 — the same clause that governs the H&amp;RA. Where the H&amp;RA addresses process hazards, the SRA addresses intentional threats and cybersecurity vulnerabilities to the SIS. Both are required to meet full Clause 8 obligations.</p>



<h2 class="wp-block-heading">Common Mistakes and Pitfalls</h2>



<p class="wp-block-paragraph"><strong>Timing it wrong.</strong> Too early and the hazard picture is incomplete; too late and design changes to address identified hazards may already be impractical. The H&amp;RA should be conducted when the P&amp;IDs are at Rev 0.</p>



<p class="wp-block-paragraph"><strong>No documented tolerable risk criteria.</strong> Without a defined, documented calibrated risk matrix, every risk judgment in the study is a personal opinion — and the study cannot be defended or verified by a third party.</p>



<p class="wp-block-paragraph"><strong>Poorly defined hazardous events.</strong> Vague events produce incorrect consequence and frequency assessments, which produce incorrect risk gap calculations. Each hazardous event needs a defined initiating cause, affected equipment, and demand scenario.</p>



<p class="wp-block-paragraph"><strong>Ignoring human factors and demand rates.</strong> Human error is a significant source of SIF demand. Underestimating it produces an overly optimistic risk picture and may result in incorrect SIL targets.</p>



<p class="wp-block-paragraph"><strong>Treating the H&amp;RA as a one-time exercise.</strong> The H&amp;RA is a living document. When the process changes, it must be reviewed and updated.</p>



<p class="wp-block-paragraph"><strong>Confusing inherent risk with residual risk.</strong> Inherent risk is assessed <em>before</em> protection layers. Residual risk is what remains <em>after</em>. Conflating the two leads to incorrect safeguard crediting and inaccurate risk gap calculations.</p>



<h2 class="wp-block-heading">Keeping Your Hazard and Risk Assessment Current (Management of Change)</h2>



<p class="wp-block-paragraph">IEC 61511 is explicit: the H&amp;RA must be reviewed whenever changes occur that could affect its validity. Triggers include process modifications, near misses, new or modified equipment, regulatory updates, and periodic revalidation obligations under the safety life-cycle. A formal Management of Change (MOC) process that flags these triggers is the most reliable way to keep the H&amp;RA aligned with the actual process.</p>



<h2 class="wp-block-heading">Frequently Asked Questions</h2>



<p class="wp-block-paragraph"><strong>Q1: I keep hearing different terms — PHA, HARA, HRA, H&amp;RA. Are these all the same thing?</strong></p>



<p class="wp-block-paragraph">H&amp;RA, HRA, and HARA all refer to the same IEC 61511 activity. Process Hazard Analysis (PHA) is the equivalent term under the OSHA PSM and EPA RMP regulations — different regulatory language, same concept. Other safety disciplines, such as machinery safety, may use different terms again — it can be a bit confusing.</p>



<p class="wp-block-paragraph"><strong>Q2: I&#8217;ve heard that LOPA and HAZOP are often done together in the same study. Is that correct, and if so, how does that work?</strong></p>



<p class="wp-block-paragraph">Yes — many organizations run HAZOP and LOPA back-to-back in the same workshop series for efficiency, completing the HAZOP for each node before immediately running the LOPA on the identified scenarios. They remain technically distinct activities: HAZOP is the &#8220;H&#8221; of the H&amp;RA.  LOPA is the &#8220;A&#8221; + SIL determination (thus a LOPA straddles two lifecycle phases).  Combining the workshops is a logistical choice, not a technical one.  Most tools now are setup for that purpose.</p>



<p class="wp-block-paragraph"><strong>Q3: Is HAZOP the standard way to conduct an H&amp;RA in the process industry?</strong></p>



<p class="wp-block-paragraph">In practice, yes — HAZOP is the de facto H&amp;RA methodology for most process facilities operating under IEC 61511, ISA 84, PSM, and RMP. That said, IEC 61511 does not mandate it; other methodologies such as What-If or QRA are appropriate depending on the project stage and complexity.</p>



<p class="wp-block-paragraph"><strong>Q4: Does IEC 61511 require a specific H&amp;RA methodology?</strong></p>



<p class="wp-block-paragraph">No — IEC 61511 requires that an appropriate methodology is applied systematically, but does not mandate a specific one. HAZOP is the most common choice in the process sector, but What-If and other approaches are all valid depending on the context.</p>



<p class="wp-block-paragraph"><strong>Q5: What is the difference between inherent risk and residual risk?</strong></p>



<p class="wp-block-paragraph">Inherent risk is the risk before any protection layers are applied. Residual risk is what remains after all protection layers — including the SIS — have been credited.</p>



<p class="wp-block-paragraph"><strong>Q6: Who should be involved in conducting an H&amp;RA?</strong></p>



<p class="wp-block-paragraph">An H&amp;RA is a multi-disciplinary team activity requiring input from process, operations, instrumentation, and safety disciplines at a minimum — these can be multi-day affairs, painful, particularly when LOPA is run back-to-back with the HAZOP. IEC 61511 requires that at least one team member holds functional safety competence; this role is often referred to in practice as the Team Leader or HAZOP Leader.</p>



<p class="wp-block-paragraph"><strong>Q7: How often does an H&amp;RA need to be revalidated?</strong></p>



<p class="wp-block-paragraph">IEC 61511 requires H&amp;RA review whenever changes occur that could affect its validity — process modifications, near misses, new equipment, or regulatory changes. Periodic revalidation is also required as part of the broader safety life-cycle review obligations.</p>



<h2 class="wp-block-heading">Further Reading</h2>



<p class="wp-block-paragraph"><strong>Internal — related articles on this site:</strong></p>



<ul class="wp-block-list">
<li><a href="https://silsafe.net/functional-safety-for-the-process-industry/" data-type="post" data-id="6100">What is Functional Safety?</a></li>



<li><a href="https://silsafe.net/proof-test-coverage-cpt-to-improve-pfdavg/" data-type="post" data-id="2784">Proof Testing and Management of Change</a></li>



<li><a href="https://silsafe.net/glossary/stated-risk/" data-type="glossary" data-id="898">Stated Risk</a> vs. <a href="https://silsafe.net/glossary/revealed-risk/" data-type="glossary" data-id="814">Revealed Risk</a></li>



<li>SIL Safe Glossary &#8211; <a href="https://silsafe.net/glossary-cat/hazard-and-risk-assessment/" data-type="page" data-id="3787">H&amp;RA Terms</a></li>



<li><a href="https://silsafe.net/hazard-and-risk-analysis-methods/" data-type="post" data-id="6468">Hazard and Risk Analysis Methods — How They Fit Together</a></li>



<li>HAZOP — A Deep Dive &#8211; <em>coming soon</em></li>



<li><a href="https://silsafe.net/layer-of-protection-analysis-lopa/" data-type="post" data-id="6422">Layer of Protection Analysis (LOPA): The Engineer’s Guide to SIL Selection</a></li>

<li><a href="https://silsafe.net/garden-grove-chemical-incident/" data-type="post" data-id="6633">The Garden Grove Chemical Incident: What We Know So Far</a></li>


<li><a href="https://silsafe.net/functional-safety-vs-occupational-safety/" data-type="post" data-id="6690">Functional Safety Is Not the Same as Occupational Safety</a></li>

</ul>



<p class="wp-block-paragraph"><strong>External authoritative references:</strong></p>



<ul class="wp-block-list">
<li><a href="https://www.isa.org/" target="_blank" rel="noopener">ISA — ISA 84 / IEC 61511 resources</a></li>



<li><a href="https://www.iec.ch/" target="_blank" rel="noopener">IEC — IEC 61511 standard</a></li>



<li><a href="https://www.osha.gov/" target="_blank" rel="noopener">OSHA — Process Safety Management regulation (29 CFR 1910.119)</a></li>



<li><a href="https://www.epa.gov/" target="_blank" rel="noopener">EPA — Risk Management Program (RMP)</a></li>
</ul>



<h2 class="wp-block-heading">Conclusion</h2>



<p class="wp-block-paragraph">A hazard and risk assessment is not a compliance checkbox. It is the technical foundation on which every downstream SIS decision is built — from SIF definition to SIL determination to verification. A well-conducted H&amp;RA, performed at the appropriate time, gives you a defensible, documented basis for your safety case. A poor one — or one conducted too early or too late — leaves your entire SIS design without a credible technical basis.</p>



<p class="wp-block-paragraph">For facilities operating under IEC 61511, PSM, or RMP, the message is straightforward: invest in getting the H&amp;RA right, time it correctly, and make sure it is conducted by people who understand both the process and the standard.</p>



<p class="wp-block-paragraph">Functional safety is complex, and the stakes are high. If you have questions about your SIS design, SIL verification, or where to start with IEC 61511, the team at SIL Safe is here to help. Reach out to us today.</p>



<script type="application/ld+json">
{
  "@context": "https://schema.org",
  "@type": "FAQPage",
  "mainEntity": [
    {
      "@type": "Question",
      "name": "I keep hearing different terms — PHA, HARA, HRA, H&RA. Are these all the same thing?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "H&RA, HRA, and HARA all refer to the same IEC 61511 activity. Process Hazard Analysis (PHA) is the equivalent term under the OSHA PSM and EPA RMP regulations — different regulatory language, same concept. Other safety disciplines, such as machinery safety, may use different terms again — it can be a bit confusing."
      }
    },
    {
      "@type": "Question",
      "name": "I've heard that LOPA and HAZOP are often done together in the same study. Is that correct, and if so, how does that work?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Yes — many organizations run HAZOP and LOPA back-to-back in the same workshop series for efficiency, completing the HAZOP for each node before immediately running the LOPA on the identified scenarios. They remain technically distinct activities: HAZOP is the "H" of the H&RA.  LOPA is the "A" + SIL determination (thus a LOPA straddles two lifecycle phases).  Combining the workshops is a logistical choice, not a technical one.  Most tools now are setup for that purpose."
      }
    },
    {
      "@type": "Question",
      "name": "Is HAZOP the standard way to conduct an H&RA in the process industry?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "In practice, yes — HAZOP is the de facto H&RA methodology for most process facilities operating under IEC 61511, ISA 84, PSM, and RMP. That said, IEC 61511 does not mandate it; other methodologies such as What-If or QRA are appropriate depending on the project stage and complexity."
      }
    },
    {
      "@type": "Question",
      "name": "Does IEC 61511 require a specific H&RA methodology?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "No — IEC 61511 requires that an appropriate methodology is applied systematically, but does not mandate a specific one. HAZOP is the most common choice in the process sector, but What-If and other approaches are all valid depending on the context."
      }
    },
    {
      "@type": "Question",
      "name": "What is the difference between inherent risk and residual risk?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Inherent risk is the risk before any protection layers are applied. Residual risk is what remains after all protection layers — including the SIS — have been credited."
      }
    },
    {
      "@type": "Question",
      "name": "Who should be involved in conducting an H&RA?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "An H&RA is a multi-disciplinary team activity requiring input from process, operations, instrumentation, and safety disciplines at a minimum — these can be multi-day affairs, painful, particularly when LOPA is run back-to-back with the HAZOP. IEC 61511 requires that at least one team member holds functional safety competence; this role is often referred to in practice as the Team Leader or HAZOP Leader."
      }
    },
    {
      "@type": "Question",
      "name": "How often does an H&RA need to be revalidated?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "IEC 61511 requires H&RA review whenever changes occur that could affect its validity — process modifications, near misses, new equipment, or regulatory changes. Periodic revalidation is also required as part of the broader safety life-cycle review obligations."
      }
    }
  ]
}
</script>
]]></content:encoded>
					
					<wfw:commentRss>https://silsafe.net/hazard-and-risk-assessment-hra/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>Functional Safety for the Process Industry: 10 Core Concepts Every Engineer Should Know</title>
		<link>https://silsafe.net/functional-safety-for-the-process-industry/</link>
					<comments>https://silsafe.net/functional-safety-for-the-process-industry/#respond</comments>
		
		<dc:creator><![CDATA[mamerten]]></dc:creator>
		<pubDate>Wed, 25 Mar 2026 01:12:24 +0000</pubDate>
				<category><![CDATA[Beginner]]></category>
		<guid isPermaLink="false">https://silsafe.net/?p=6100</guid>

					<description><![CDATA[Functional safety in the process industry is more than calculations—it's a lifecycle approach to reducing risk. This guide explains IEC 61511, SIS, SIL, LOPA, and how safety systems actually work in practice, helping engineers move from basic understanding to real-world application.]]></description>
										<content:encoded><![CDATA[
<h2 class="wp-block-heading">Functional Safety at the Most Basic Level</h2>



<p class="wp-block-paragraph">Functional safety is the engineering discipline that is about making sure that when something goes wrong, the systems intended to protect people and the environment work when they are needed.</p>



<p class="wp-block-paragraph">A simple illustration helps.</p>



<p class="wp-block-paragraph">Over roughly 20 years, I have had about six driver’s side window regulators fail. Annoying—but not dangerous.&nbsp; If the window is used about three times per week, that’s roughly 3,120 operations. Six failures over that period corresponds to a probability of failure of about 1.92E-3 per demand.&nbsp; Is this acceptable?&nbsp; It has been annoying for me, but never once was a safety issue.&nbsp; It also was not a cost risk for the manufacturer since they all failed post-warranty.&nbsp; Now, imagine the regulator is a life safety device, that failure rate would be too high.&nbsp; Functional safety is the program that takes that failure rate and lowers it multiple orders of magnitude, to perhaps 1.92E-5.&nbsp; One cannot do that by building a “better” regulator.&nbsp; You can get there, but it would be via a layer of controls throughout the lifecycle of the regulator.</p>



<p class="wp-block-paragraph">Driving failure probabilities down by orders of magnitude is the core driver of functional safety. It is not achieved by a single “better” device. It requires coordinated engineering: design choices, architecture, testing, diagnostics, maintenance, and governance across the life of the system.&nbsp;&nbsp;In the process industry, this structured approach is most commonly defined and implemented through IEC 61511, which provides the framework for how functional safety is applied across the lifecycle of a facility.</p>



<p class="wp-block-paragraph">Functional safety is implemented through active protection functions that span the full loop—sensor, logic solver, and final element. When a hazardous condition is detected, this chain must act correctly to move the process to a safe state, whether that means closing a valve, stopping a motor, or isolating energy.</p>



<p class="wp-block-paragraph">Just as important, that performance has to hold over time. Functional safety is not about working once—it is about continuing to meet required performance over years of operation, testing, maintenance, and change.</p>



<h2 class="wp-block-heading">Why Functional Safety is Applied in the Process Industry</h2>



<p class="wp-block-paragraph">Consider a reactor that begins to over‑pressurize.</p>



<p class="wp-block-paragraph">If nothing intervenes, the vessel could rupture, leading to injury, environmental release, or significant damage. Operators may not respond in time. Standard control systems can help, but systems designed for normal control are typically not sufficient for safety-critical action when consequences are severe.</p>



<p class="wp-block-paragraph">A system that fails 1% of the time may be acceptable for control. It is often unacceptable for protection.</p>



<p class="wp-block-paragraph">Functional safety via IEC 61511 is applied to reduce the probability that the protection fails when it is needed, thus making the process move to a safe state under abnormal conditions.</p>



<h2 class="wp-block-heading">What Is Functional Safety?</h2>



<p class="wp-block-paragraph">Functional safety is an engineering discipline that ensures safety‑significant functions perform correctly when required. It is not a single device, calculation, or tool. It is a structured approach to designing, implementing, and managing systems so that their performance is sufficiently reliable for the hazards they control.</p>



<p class="wp-block-paragraph">The core concept to understand is risk.&nbsp; A process will have hazards (such as a tank rupture) and that hazard has an associated level of risk.&nbsp; Risk is always the combination of probability and severity.</p>



<p class="wp-block-paragraph">Functional safety contributes to risk reduction by lowering the probability that a hazardous outcome occurs and, in some designs, by limiting its severity.</p>



<h2 class="wp-block-heading">How Functional Safety Reduces Risk</h2>



<p class="wp-block-paragraph">Every facility has hazards. But before the hazards are evaluated, the facility needs to determine what risks are acceptable.&nbsp; This is an odd concept for people new to process safety because one has to decide what is an acceptable amount of death or injury.&nbsp; This decision making is documented in a calibrated risk matrix.&nbsp; The next task is to understand those hazards, determine what risks they hold, are those risks tolerable, and reduce risk where they exceed the tolerable level.</p>



<p class="wp-block-paragraph">Functional safety provides this method of reducing this risk.</p>



<p class="wp-block-paragraph">The central metric is the probability of failure on demand (PFDavg). If a safety function has a PFDavg of 0.01, it will fail about 1% of the time when demanded and succeed about 99% of the time. This corresponds to a risk reduction factor (RRF) of 100.&nbsp; RRF = 1/PFDavg</p>



<p class="wp-block-paragraph">In practical terms, this safety function reduces the probability of a hazardous outcome by a factor of 100. This quantified reduction is the mechanism by which functional safety works.</p>



<h2 class="wp-block-heading">Standards Governing Functional Safety</h2>



<p class="wp-block-paragraph">Functional safety is applied across multiple industries, each with standards built on the same underlying principles. Examples include ISO 26262 for automotive systems, IEC 62061 for machinery, and EN 50126/50128/50129 for rail.</p>



<p class="wp-block-paragraph">At the foundation is IEC 61508, which defines the general framework for functional safety of electrical, electronic, and programmable electronic systems.</p>



<p class="wp-block-paragraph">IEC 61511 applies these principles to the process industry. It is not a separate concept; it is a sector-specific implementation of the IEC 61508 framework tailored to process facilities.</p>



<p class="wp-block-paragraph">Note that in my experience as an engineer, almost all codes and standards are applicable to a certain area.&nbsp; Such as a country or perhaps the European Union.&nbsp; But functional safety is truly one of the few global standards.&nbsp; In almost all counties, if they implement a process safety approach, it will be functional safety.&nbsp; This is one of the reasons SIL Safe exists.</p>



<h2 class="wp-block-heading">Hazard and Risk Assessment (H&amp;RA)</h2>



<p class="wp-block-paragraph">The functional safety lifecycle begins with understanding hazards in your process.&nbsp; This is called the&nbsp;hazard and risk assessment (H&amp;RA) which identifies scenarios that could lead to harm, estimates their risk (probability and severity), and determines whether risk is tolerable.&nbsp; Under IEC 61511, the H&amp;RA is the starting point of the safety lifecycle, and it directly drives the identification of Safety Instrumented Functions (SIFs) and their required performance.</p>



<p class="wp-block-paragraph">Typical activities in an H&amp;RA include identifying credible scenarios, estimating probability and severity, and deciding where additional protection is required.</p>



<p class="wp-block-paragraph">Common methods in the process industry include HAZOP and risk matrices. The outcome of this work will establish whether SIFs are needed.</p>



<p class="wp-block-paragraph">When this occurs in the design process is important.&nbsp; The H&amp;RA cannot be too early or too late in the process.&nbsp; If too early, much of the hazards would likely change, not exist, or only be a partial list.&nbsp; If done too late, you are asking for challenges as changes may need to be modified in existing equipment.&nbsp; Think cutting pipe to add a SIF.&nbsp; That is never fun.</p>



<p class="wp-block-paragraph">See this full article for<a href="https://silsafe.net/hazard-and-risk-assessment-hra/" data-type="post" data-id="6213"> deeper dive into an H&amp;RA</a>.</p>



<h2 class="wp-block-heading">Independent Protection Layers (IPLs)</h2>



<p class="wp-block-paragraph">Facilities rely on multiple layers of protection rather than a single safeguard.&nbsp; Meaning safeguards should be put in place BEFORE a SIF is needed.&nbsp;If this happens, then perhaps a SIF would not be needed or a SIF at a lower SIL level.&nbsp;</p>



<p class="wp-block-paragraph">These layers can include the basic process control system (BPCS), relief devices, operator response, and safety instrumented systems. For layers to count independently, they must not fail for the same reasons.&nbsp; This collection of independent items that can protect a SIF are called Independent Protection Layers (IPL).</p>



<p class="wp-block-paragraph">The key question is whether the existing layers reduce risk to tolerable levels. If they do not, additional protection is required.</p>



<p class="wp-block-paragraph">This evaluation is often formalized through a Layer of Protection Analysis (LOPA), which builds directly on H&amp;RA results.&nbsp; This is one of the most common methods used within IEC 61511 programs to determine the required SIL for each SIF.&nbsp;&nbsp;This asks&#8230;</p>



<ul class="wp-block-list">
<li>Is the risk associated with each hazard tolerable?&nbsp; This would assess the risk for each hazard against the calibrated risk matrix.</li>
</ul>



<ul class="wp-block-list">
<li>If not, are there IPLs present or can they be added?&nbsp; This could be extra pressure relief valves, other instruments in the BPCS.&nbsp;</li>



<li>Does each hazard scenario need a SIF?&nbsp; This compares the risk with the IPLs, against the tolerable risk.&nbsp; If the risk is not tolerable, a SIF is added to mitigate the risk of that hazard.</li>



<li>How much risk must each SIF reduce?&nbsp; Generally, this is thought of in orders of magnitude via a calibrated risk matrix.&nbsp; Each box the hazard moves is considered one order of magnitude.</li>



<li>What is the SIL level of each SIF?&nbsp; This is how many orders of magnitude the risk must be reduced to be tolerable.</li>
</ul>



<h2 class="wp-block-heading">Safety Instrumented Systems (SIS)</h2>



<p class="wp-block-paragraph">When existing protection is insufficient, a Safety Instrumented System (SIS) is the system that comes into play.&nbsp; The SIS is the core system involved in IEC 61511, although the standard also has lifecycle requirements.&nbsp; Meaning functional safety is not just the SIS.</p>



<p class="wp-block-paragraph">A SIS is an independent system designed to detect hazardous conditions and move the process to a safe state. It operates separately from the basic process control system (BPCS), which manages normal operation. A typical SIS consists of sensors, a logic solver, and final elements such as shutdown valves or motor isolation devices.&nbsp;&nbsp;</p>



<p class="wp-block-paragraph">A Safety Instrumented Function (SIF) is a single protection loop within the SIS. A facility may have one SIF or many, depending on the number of scenarios requiring protection.&nbsp; Again, it is the LOPA that dictates where the SIFs are and the SIL needed per SIF.</p>



<p class="wp-block-paragraph">For example, a simple SIF could be a pressure switch sensing high pressure and turning off a pump with a contactor via the controls in a PLC.&nbsp; Then another SIF can sense a high level and open a dump valve to prevent an overflow.&nbsp; It goes to the same PLC as the first SIF.&nbsp; This would have two SIFs, both using the same PLC and together they would be the SIS.</p>



<h2 class="wp-block-heading">Safety Integrity Levels (SIL)</h2>



<p class="wp-block-paragraph">Each SIF must meet the required level of reliability, expressed as a Safety Integrity Level (SIL).</p>



<p class="wp-block-paragraph">SIL is always determined after the H&amp;RA and during the allocation of safety layers, typically determined during the LOPA (which itself is typically done directly after the H&amp;RA, often in the same long meeting). In the process industry, most applications are SIL 1 or SIL 2, with occasional SIL 3. SIL 4 is rarely used.</p>



<p class="wp-block-paragraph">SIL is often associated with equipment ratings, which is technically correct. But more importantly, it defines the required performance of the safety function across design, implementation, and maintenance. For example, higher SIL levels will need more redundancy and would have different levels of controls and different rules about software.&nbsp; This can be complicated and best discussed elsewhere.</p>



<p class="wp-block-paragraph">Below is the standard relationship between SIL, Risk Reduction Factor (RRF), and PFDavg for low demand mode:</p>



<figure class="wp-block-table"><table class="has-fixed-layout"><thead><tr><td><strong>SIL</strong></td><td><strong>RRF (Risk Reduction Factor)</strong></td><td><strong>PFDavg Range</strong></td><td><strong>Interpretation</strong></td></tr></thead><tbody><tr><td>&nbsp;1</td><td>10 to 100</td><td>1E-1 to 1E-2</td><td>Reduces risk by 1–2 orders of magnitude</td></tr><tr><td>2</td><td>100 to 1,000</td><td>1E-2 to 1E-3</td><td>Reduces risk by 2–3 orders of magnitude</td></tr><tr><td>3</td><td>1,000 to 10,000</td><td>1E-3 to 1E-4</td><td>Reduces risk by 3–4 orders of magnitude</td></tr><tr><td>4</td><td>10,000 to 100,000</td><td>1E-4 to 1E-5</td><td>Rare in process industry; extreme risk reduction</td></tr></tbody></table></figure>



<h2 class="wp-block-heading">Other Design Considerations in Functional Safety</h2>



<p class="wp-block-paragraph">Designing a SIF involves more than selecting components.&nbsp; There are many other concepts that need to be considered, understood, and decided during the design phase.</p>



<ul class="wp-block-list">
<li>hardware fault tolerance (HFT) &#8211; Reliability requirements</li>



<li>voting architecture &#8211; Such as 1oo1 voting, 2oo3, and others</li>



<li>proof test interval (TI) &#8211; Duration between proof tests.  See <a href="https://silsafe.net/proof-testing-of-sifs/" data-type="post" data-id="33">this in-depth article</a>.</li>



<li>proof test coverage (Cpt) &#8211; How good proof testing is at detecting failures.  See this <a href="https://silsafe.net/proof-test-coverage-cpt-to-improve-pfdavg/" data-type="post" data-id="2784">deeper dive on Cpt</a>.</li>



<li>mean time to restore (MTTR) &#8211; Facility&#8217;s time to repair a problem, relates to available spare parts.  Read more about <a href="https://silsafe.net/mean-time-to-restore-mttr/" data-type="post" data-id="1129">MTTR</a>.</li>



<li>spurious trip rate (STR) &#8211; The balance between safety and facility uptime</li>



<li>diagnostic coverage (DC) &#8211; How good the on-board diagnostics are at detecting failures</li>



<li>common cause failure (beta factor) &#8211; Redundant systems may jointly fail due to a common design flaw.  Lean more on <a href="https://silsafe.net/how-to-apply-beta-factor-for-common-cause-failure/" data-type="post" data-id="3628">CCF in this article</a>.</li>
</ul>



<p class="wp-block-paragraph">These parameters all interact. There is a push and pull between the terms and even departments in a facility.&nbsp; Decisions about testing, architecture, spare parts, and maintenance can materially change achieved performance via PFDavg. Each topic is substantial and better explored individually.&nbsp;&nbsp;</p>



<p class="wp-block-paragraph">These concepts receive significant attention in functional safety engineering and certification exams.&nbsp; These will also take significant effort to decide and work through during the detailed design phase of the SIS.</p>



<h2 class="wp-block-heading">Verifying Safety Instrumented Function Performance (PFDavg)</h2>



<p class="wp-block-paragraph">Once a SIF is designed, IEC 61511 requires that it be verified against its SIL requirement, per SIF.&nbsp;&nbsp;PFDavg calculations quantify whether the design meets the target. They account for failure rates of the SIF components, architecture, testing intervals, coverage, and repair assumptions.</p>



<p class="wp-block-paragraph">The process of doing the calculations is complicated.&nbsp; There are simple equations and complex equations.&nbsp; Think of it as a specific PFDavg calculation stems from a series of decisions that are made by engineering.&nbsp; For example, if the SIF is tested only when the unit is shutdown versus bypassed &#8211; that impacts PFDavg in different ways.&nbsp; Generally, practitioners use software or Excel to facilitate.</p>



<p class="wp-block-paragraph">There is a related approach called Markov Analysis which we will not get into here as it is a more advanced approach.</p>



<p class="wp-block-paragraph">While central, PFDavg is only one step in a broader discipline. It verifies performance; it does not define the entire program.&nbsp; Inexperienced users may think PFDavg is all functional safety is about.&nbsp; But that is an over-simplification.</p>



<p class="wp-block-paragraph">The simplest PFDavg equation is shown below.&nbsp; This is for a 1oo1 architecture.&nbsp;</p>



<figure class="wp-block-image size-full"><img decoding="async" width="212" height="58" src="https://silsafe.net/wp-content/uploads/2025/04/pfdavg-basic-equation.png" alt="PFDavg basic equation" class="wp-image-188"/></figure>



<ul class="wp-block-list">
<li>TI &#8211; proof test interval</li>



<li>λ<sub>DU</sub> &#8211; the dangerous undetected failure rate.</li>
</ul>



<h2 class="wp-block-heading">The Functional Safety Life-Cycle</h2>



<p class="wp-block-paragraph">Functional safety is governed by a structured life-cycle, and this is one of the most important concepts to understand. The structure of this lifecycle is defined in IEC 61511, and following it is what distinguishes a complete functional safety program from isolated design efforts.&nbsp; Many engineers initially approach functional safety as a design activity, but that is only one portion of the overall process.</p>



<p class="wp-block-paragraph">The life-cycle defines how safety is managed from the earliest concept of a facility through to its eventual decommissioning. It ensures that safety functions are not only designed correctly, but also installed, operated, maintained, and periodically assessed in a consistent and auditable manner.</p>



<p class="wp-block-paragraph">Typical phases include:</p>



<ul class="wp-block-list">
<li>Hazard and risk assessment (H&amp;RA) such as a HAZOP</li>



<li>Allocation of safety functions (such as via a LOPA)</li>



<li>Design and engineering &#8211; detailed design of the SIFs and SIS</li>



<li>Verification and validation</li>



<li>Operation and maintenance</li>



<li>Functional safety assessment (FSA)</li>



<li>Decommissioning</li>
</ul>



<p class="wp-block-paragraph">Each of these phases has specific deliverables and expectations. For example, the design phase may define the architecture of a SIF, but the operation and maintenance phase ensures that proof testing is performed and that failures are addressed correctly.</p>



<p class="wp-block-paragraph">The key point is continuity. Functional safety is not a one-time effort. A system that is properly designed but poorly maintained will not achieve its required performance over time.</p>



<p class="wp-block-paragraph">An all-too-common scenario is that a compliant SIS is installed in a facility.&nbsp; The company is then bought by a firm in an adjacent industry that has never implemented, nor understands, IEC 61511.&nbsp; Over time, budgets can be cut or key people can navigate to other departments, companies, or retire.&nbsp; In this possible scenario various things could happen that could impact the SIS design.&nbsp; For example, proof tests being done at the wrong intervals or not in accordance with the requirements.&nbsp; Components could be replaced with non-SIL qualified versions, or the competency and training program could become weak.&nbsp; All of this is why the functional safety lifecycle is so important.</p>


<div class="wp-block-image">
<figure class="aligncenter size-full is-resized"><a href="https://silsafe.net/wp-content/uploads/2025/05/SIS-safety-lifecycle.webp"><img fetchpriority="high" decoding="async" width="757" height="981" src="https://silsafe.net/wp-content/uploads/2025/05/SIS-safety-lifecycle.webp" alt="SIS Safety Lifecycle. The very important diagram from IEC 61511 figure 7 which overlays the entire Functional Safety Process" class="wp-image-760" style="width:354px;height:auto" srcset="https://silsafe.net/wp-content/uploads/2025/05/SIS-safety-lifecycle.webp 757w, https://silsafe.net/wp-content/uploads/2025/05/SIS-safety-lifecycle-231x300.webp 231w" sizes="(max-width: 757px) 100vw, 757px" /></a></figure>
</div>


<h2 class="wp-block-heading">Regulatory Context</h2>



<p class="wp-block-paragraph">In the United States, regulatory frameworks such as OSHA’s Process Safety Management (PSM) and the EPA’s Risk Management Program (RMP) are the core two regulations.&nbsp; These get triggered if a certain weight of various materials is on site (called threshold quantity).&nbsp; These regulations do not prescribe exact methods for managing risk. Instead, they require that facilities follow sound engineering practices.&nbsp; IEC 61511 is not listed specifically as applicable in the Code of Federal Regulations (CFR), but both OSHA and EPA have stated in writing that using that standard is a sufficient and preferred way to meet the regulations.&nbsp; In other words, it is considered a RAGAGEP (recognized and generally acceptable good engineering practice).</p>



<p class="wp-block-paragraph">Therefore, IEC 61511 is often used to demonstrate that a facility is meeting those regulations. It provides a structured and well-understood approach that regulators, auditors, and engineers recognize.&nbsp; Other countries have similar laws and regulations requiring the standard.</p>



<p class="wp-block-paragraph">In practice, this means that even though IEC 61511 is not mandatory by law, it is frequently treated as if it were, because it defines what &#8220;good&#8221; looks like for functional safety in the process industry.</p>



<p class="wp-block-paragraph">In addition to RMP and PSM, other things could trigger using IEC 61511.&nbsp; This could be contracts between parties or insurance requirements.&nbsp; At times, projects for various reasons will not invoke IEC 61511 in full, but may require certain instruments or automated valve maintain a SIL certification.&nbsp; These SIL only requirements have become more typical as SIL rated components become more common.&nbsp; SIL Safe welcomes this change in the industry.</p>



<h2 class="wp-block-heading">Common Misconceptions About Functional Safety</h2>



<p class="wp-block-paragraph">Functional safety via IEC 61511 is often misunderstood, particularly by those who are new to the discipline or who have only been exposed to portions of it.</p>



<p class="wp-block-paragraph">One common misconception is that functional safety is primarily about calculations, especially PFDavg. While calculations are important, they are only one part of the overall process. Without proper hazard assessment, design, testing, and maintenance, calculations alone do not ensure safety.</p>



<p class="wp-block-paragraph">Another misconception is that functional safety is only relevant during design. In reality, long-term performance depends heavily on proof testing, maintenance practices, and how changes are managed over time.</p>



<p class="wp-block-paragraph">A third misconception is that higher SIL automatically means a better system. In practice, SIL is determined by the risk of the hazard along with the IPLs that exist. A higher SIL requirement often indicates a more severe hazard rather than a superior design choice.</p>



<p class="wp-block-paragraph">Understanding these misconceptions is important because they often lead to incomplete or ineffective implementations of functional safety programs.</p>



<h2 class="wp-block-heading">Why Functional Safety Programs Matter — and When Expertise Is Needed</h2>



<p class="wp-block-paragraph">Functional safety programs provide a structured way to manage risk across a facility. At a high level, they help ensure that hazards are identified, risks are evaluated, and appropriate protections are implemented and maintained over time.</p>



<p class="wp-block-paragraph">Effective programs reduce the probability and severity of major accidents, support regulatory compliance, and provide confidence that safety systems will perform when required.</p>



<p class="wp-block-paragraph">In practice, many organizations require additional expertise at certain points, such as:</p>



<ul class="wp-block-list">
<li>Implementing IEC 61511 for the first time</li>



<li>Performing SIL determination or verification</li>



<li>Preparing for and conducting functional safety assessments</li>



<li>Modifying or upgrading existing SIS implementations</li>
</ul>



<p class="wp-block-paragraph">These situations often involve complex decisions, tradeoffs, and documentation requirements that benefit from experienced practitioners.</p>



<h2 class="wp-block-heading">Q&amp;A Section</h2>



<ol start="1" class="wp-block-list">
<li>What is functional safety in simple terms?</li>
</ol>



<p class="wp-block-paragraph">Functional safety is the part of overall facility safety that depends on safety functions operating correctly when required. It focuses on ensuring that protection systems perform reliably enough to reduce risk to tolerable levels.&nbsp; It has the ability to take a typical failure rate of a safety system of perhaps 0.01 (1%) down multiple orders of magnitude.&nbsp; It does this through a layer of requirements throughout the lifecycle of the system.</p>



<p class="wp-block-paragraph">Functional Safety is applied to various industries.&nbsp; SIL Safe focuses on its application to the process industry via IEC 61511.</p>



<ol start="2" class="wp-block-list">
<li>What is the difference between a SIF and a SIS?</li>
</ol>



<p class="wp-block-paragraph">A Safety Instrumented System (SIS) is the overall system that performs safety functions. A Safety Instrumented Function (SIF) is a single protection loop within that system, typically consisting of a sensor, logic solver (think a PLC), and final element (like a valve or contactor).</p>



<ol start="3" class="wp-block-list">
<li>If I have hazardous scenarios, why can&#8217;t I just add an extra instrument?</li>
</ol>



<p class="wp-block-paragraph">Adding an extra instrument and connecting that to your BPCS does not necessarily reduce risk enough. Risk reduction must be quantified, and the resulting protection must meet the required performance level. Without it, the hazard may still exceed tolerable risk.</p>



<p class="wp-block-paragraph">For example, at times a SIF will have to have an architecture of 2oo3 (meaning three instruments at one point).&nbsp; One would not know that it was needed unless the process was followed and a PFDavg was calculated</p>



<ol start="4" class="wp-block-list">
<li>I&#8217;ve worked on projects where SIL 2 instruments were specified, but the facility was not doing functional safety in its entirety. What is happening there?</li>
</ol>



<p class="wp-block-paragraph">Some projects contractually require certain instruments or &#8220;safety instruments&#8221; to be SIL rated (for example SIL 2 transmitters or valves). This does not mean that the facility is implementing the full functional safety lifecycle. In many cases, companies attempt a compromise where equipment meets SIL capability requirements even if the broader IEC 61511 functional safety program is not fully implemented.</p>



<ol start="5" class="wp-block-list">
<li>What standards govern functional safety?</li>
</ol>



<p class="wp-block-paragraph">The foundational standard is IEC 61508, which defines the general framework for functional safety of electrical, electronic, and programmable electronic systems. For the process industry specifically, IEC 61511 defines how those principles are applied to Safety Instrumented Systems.</p>



<ol start="6" class="wp-block-list">
<li>What about machinery safety?</li>
</ol>



<p class="wp-block-paragraph">Machinery safety is important, of course.&nbsp; But it is distinct.&nbsp; Functional safety for the process industry is focused on reducing the risk (probability and severity) of a major accident.&nbsp; Machinery safety focuses on the user of the machine.</p>



<p class="wp-block-paragraph">However, &#8230;. as SIL ratings are more common, what is happening is machinery safety risk assessment now will often require a SIL rated instrument.&nbsp;&nbsp;SIL Safe fully supports this excellent use of SIL instruments.&nbsp; However, this should not be construed as functional safety.</p>



<h2 class="wp-block-heading">Conclusion</h2>



<p class="wp-block-paragraph">Functional safety via IEC 61511 combines hazard analysis, engineered protection systems, and structured life-cycle management to reduce the probability and severity of hazardous events.</p>



<p class="wp-block-paragraph">It is not a single calculation or device, but a coordinated engineering approach that spans the entire life of a facility.</p>



<p class="wp-block-paragraph">For engineers working in the process industry, understanding these concepts is essential to designing and operating safe systems.</p>



<h2 class="wp-block-heading">Call to Action</h2>



<p class="wp-block-paragraph">If your facility is implementing or improving a functional safety program, expert guidance can make the process significantly more effective.</p>



<p class="wp-block-paragraph">Contact SIL Safe to discuss consulting services for IEC 61511 programs, SIS design, and functional safety assessments.</p>



<h2 class="wp-block-heading">Additional Resources:</h2>



<p class="wp-block-paragraph"><strong>Internal — related articles on this site:</strong></p>



<ul class="wp-block-list">
<li><a href="https://silsafe.net/functional-safety-vs-occupational-safety/" data-type="post" data-id="6690">Functional Safety Is Not the Same as Occupational Safety</a></li>

</ul>



<p class="wp-block-paragraph"><strong>External authoritative references:</strong></p>



<ul class="wp-block-list">
<li><a href="https://webstore.iec.ch/en/publication/24241" target="_blank" rel="noopener">IEC 61511 Official Standard</a></li>



<li><a href="https://www.isa.org" target="_blank" rel="noopener">International Society of Automation (ISA)</a></li>



<li><a href="https://www.hse.gov.uk" target="_blank" rel="noopener">UK Health and Safety Executive (HSE)</a></li>



<li><a href="https://www.aiche.org/ccps" target="_blank" rel="noopener">CCPS Guidelines</a></li>
</ul>



<script type="application/ld+json">
{
  "@context": "https://schema.org",
  "@type": "FAQPage",
  "mainEntity": [
    {
      "@type": "Question",
      "name": "What is functional safety in simple terms?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Functional safety is the part of overall facility safety that depends on safety functions operating correctly when required. It focuses on ensuring that protection systems perform reliably enough to reduce risk to tolerable levels. It has the ability to take a typical failure rate of a safety system of perhaps 0.01 (1%) down multiple orders of magnitude. It does this through a layer of requirements throughout the lifecycle of the system. Functional safety is applied to various industries. SIL Safe focuses on its application to the process industry via IEC 61511."
      }
    },
    {
      "@type": "Question",
      "name": "What is the difference between a SIF and a SIS?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "A Safety Instrumented System (SIS) is the overall system that performs safety functions. A Safety Instrumented Function (SIF) is a single protection loop within that system, typically consisting of a sensor, logic solver (think a PLC), and final element (like a valve or contactor)."
      }
    },
    {
      "@type": "Question",
      "name": "If I have hazardous scenarios, why can't I just add an extra instrument?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Adding an extra instrument and connecting that to your BPCS does not necessarily reduce risk enough. Risk reduction must be quantified, and the resulting protection must meet the required performance level. Without it, the hazard may still exceed tolerable risk. For example, at times a SIF will have to have an architecture of 2oo3, meaning three instruments at one point. One would not know that it was needed unless the process was followed and a PFDavg was calculated."
      }
    },
    {
      "@type": "Question",
      "name": "I've worked on projects where SIL 2 instruments were specified, but the facility was not doing functional safety in its entirety. What is happening there?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Some projects contractually require certain instruments or safety instruments to be SIL rated, for example SIL 2 transmitters or valves. This does not mean that the facility is implementing the full functional safety lifecycle. In many cases, companies attempt a compromise where equipment meets SIL capability requirements even if the broader IEC 61511 functional safety program is not fully implemented."
      }
    },
    {
      "@type": "Question",
      "name": "What standards govern functional safety?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "The foundational standard is IEC 61508, which defines the general framework for functional safety of electrical, electronic, and programmable electronic systems. For the process industry specifically, IEC 61511 defines how those principles are applied to Safety Instrumented Systems."
      }
    },
    {
      "@type": "Question",
      "name": "What about machinery safety?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Machinery safety is important, of course, but it is distinct. Functional safety for the process industry is focused on reducing the risk, meaning probability and severity, of a major accident. Machinery safety focuses on the user of the machine. However, as SIL ratings are more common, machinery safety risk assessment will often require a SIL rated instrument. SIL Safe fully supports this excellent use of SIL instruments. However, this should not be construed as functional safety."
      }
    }
  ]
}
</script>



<p class="wp-block-paragraph"></p>
]]></content:encoded>
					
					<wfw:commentRss>https://silsafe.net/functional-safety-for-the-process-industry/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>Failure Rates in Functional Safety: A Practical Guide for Working Engineers</title>
		<link>https://silsafe.net/failure-rates-in-functional-safety/</link>
					<comments>https://silsafe.net/failure-rates-in-functional-safety/#respond</comments>
		
		<dc:creator><![CDATA[mamerten]]></dc:creator>
		<pubDate>Sun, 07 Dec 2025 23:07:26 +0000</pubDate>
				<category><![CDATA[Beginner]]></category>
		<guid isPermaLink="false">https://silsafe.net/?p=4221</guid>

					<description><![CDATA[Failure rates are central to SIL verification, device selection, diagnostics, and proof testing. This guide explains λDU, λDD, λSU, and λSD, where the values come from, and how functional safety engineers use them correctly under IEC 61511.]]></description>
										<content:encoded><![CDATA[
<h2 class="wp-block-heading" id="introduction-why-failure-rates-matter-in-functional-safety">Introduction: Why Failure Rates Matter in Functional Safety</h2>



<p class="wp-block-paragraph">Failure rates sit at the center of how we evaluate, verify, and maintain safety instrumented systems (SIS) under IEC 61511. They show up in SIL verification, PFDavg and STR calculations, equipment selection, and proof test strategy. If you understand the failure‑rate categories and how to obtain them correctly, you can avoid many of the mistakes that derail SIL verification or misrepresent SIS performance.</p>



<p class="wp-block-paragraph">This article explains the four failure‑rate categories, where the values come from, how to interpret them, and how a functional safety engineer uses them in practice.</p>



<h2 class="wp-block-heading" id="the-big-picture-what-we-mean-by-a-failure-rate">The Big Picture: What We Mean by a &#8220;Failure Rate&#8221;</h2>



<p class="wp-block-paragraph">In functional safety, <em>failure rate</em> is represented by <strong>λ (lambda)</strong>, typically shown in units of 1/hour or in FITs (failures per 1E9 hours). It represents the frequency of <em>random hardware failures</em>—the only failures that can be mathematically modeled.</p>



<h3 class="wp-block-heading" id="random-vs-systematic-failures">Random vs. Systematic Failures</h3>



<p class="wp-block-paragraph">Random hardware failures are the only failures that can be described by a rate. Systematic failures absolutely matter, but because they arise from design or process weaknesses, they cannot be represented by λ. You must manage them through quality processes and functional safety management—not statistics.</p>



<h3 class="wp-block-heading" id="constant-failure-rate-assumption">Constant Failure Rate Assumption</h3>



<p class="wp-block-paragraph">IEC 61511 modeling assumes a <em>constant</em> failure rate. Real‑world devices follow a classic bathtub curve: higher failures early in life (infant mortality), a long flat useful‑life period, and then increasing failures late in life. Failure‑rate data used for SIL verification assumes you are in that useful‑life region.</p>



<h3 class="wp-block-heading" id="statistical-nature-of-λ-values">Statistical Nature of λ Values</h3>



<p class="wp-block-paragraph">Certification bodies and data handbooks treat λ values as statistical estimates with confidence bounds. The SIL certificate condenses this into a single number, but it should be remembered that every published λ carries uncertainty.&nbsp; However, most of that is a bit behind the scenes for functional safety engineers.</p>



<h2 class="wp-block-heading" id="the-four-failure-rate-categories-in-functional-safety">The Four Failure Rate Categories in Functional Safety</h2>



<p class="wp-block-paragraph">Failure modes in functional safety fall into four buckets based on whether the failure is safe or dangerous, and whether it is detected or undetected by diagnostics:</p>



<ul class="wp-block-list">
<li><strong>λSD – Safe Detected</strong></li>



<li><strong>λSU – Safe Undetected</strong></li>



<li><strong>λDD – Dangerous Detected</strong></li>



<li><strong>λDU – Dangerous Undetected</strong></li>
</ul>



<p class="wp-block-paragraph">These categories determine whether the failure affects safety, reliability, or uptime—and how it appears in SIL and STR calculations.&nbsp; Note that &#8220;detected&#8221; means detected by diagnostics, not by proof tests.</p>



<h2 class="wp-block-heading" id="how-the-four-failure-rates-feed-functional-safety-calculations">How the Four Failure Rates Feed Functional Safety Calculations</h2>



<p class="wp-block-paragraph"><strong>λDU</strong> is the largest driver of safety risk. It represents failures that prevent the SIF from acting and are <em>not</em> discovered by diagnostics. This value is always used in <strong>PFDavg</strong>.&nbsp;&nbsp;<strong>λDD</strong> may be used in PFDavg if the detected failure notifies only (does not trip the SIF).</p>



<p class="wp-block-paragraph"><strong>λSU</strong> always contributes to <strong>spurious trip rate (STR).</strong>&nbsp;&nbsp;<strong>λDD and λSD</strong> contribute to <strong>STR</strong>&nbsp;if the control logic forces a safe‑state action when diagnostics detect a failure.&nbsp; See <a href="https://silsafe.net/spurious-trip-rate-explained/" data-type="post" data-id="3315">this article on STR </a>for more background.  </p>



<p class="wp-block-paragraph"><strong>λSU and λSD</strong> influence reliability and uptime but do not affect PFDavg.</p>



<p class="wp-block-paragraph">Finally, proof tests exist specifically to reveal <em>undetected dangerous failures</em>—the λDU portion. Some engineers misunderstand this and assume proof tests simply “detect failures,” but in functional safety terms, proof tests are how you manage the DU accumulation.&nbsp; See this article on <a href="https://silsafe.net/proof-testing-of-sifs/" data-type="post" data-id="33">proof testing</a>.</p>



<h2 class="wp-block-heading" id="where-failure-rates-come-from">Where Failure Rates Come From</h2>



<h3 class="wp-block-heading" id="where-the-functional-safety-engineer-actually-gets-failure-rates">Where the Functional Safety Engineer Actually Gets Failure Rates</h3>



<p class="wp-block-paragraph">In real SIS design work, most failure‑rate data comes from <strong>certified products</strong>, where a certification body (CB) has already performed a detailed IEC 61508 assessment. The FS engineer reads the <strong>SIL certificate</strong> and the <strong>Safety Manual</strong>, which provide the extracted λDU, λDD, λSU, and λSD values. These two documents are the authoritative sources for day‑to‑day engineering. The underlying FMEDA exists, but it is not normally reviewed or needed by the practitioner.</p>



<p class="wp-block-paragraph">When a certified device is not available, several alternate data routes exist. Each route has tradeoffs and requires engineering judgment:</p>



<ul class="wp-block-list">
<li><strong>Manufacturer‑supplied reliability data</strong> – useful when transparent and well‑supported, but assumptions must be confirmed.</li>



<li><strong>Validated site or company datasets</strong> – often the most realistic if maintenance and failure tracking are strong.</li>



<li><strong>User‑generated field data</strong> – applicable for legacy equipment with a long operating history.</li>



<li><strong>Industry sources such as OREDA</strong> – helpful when carefully matched to device type, service, and environment.</li>
</ul>



<p class="wp-block-paragraph">These alternatives are less typical in functional safety practice, but they require more scrutiny than certified data.</p>



<h3 class="wp-block-heading" id="how-failure-rates-are-determined-typical-scenario">How Failure Rates Are Determined (Typical Scenario)</h3>



<p class="wp-block-paragraph">For certified devices, IEC 61508 defines the process for establishing failure rates. Behind the scenes, the CB reviews or performs:</p>



<ul class="wp-block-list">
<li>FMEDA (failure‑mode analysis and diagnostic modeling)</li>



<li>Test campaigns and empirical validation</li>



<li>Diagnostic behavior evaluation</li>



<li>Environmental and installation assumption checks</li>
</ul>



<p class="wp-block-paragraph">The FS engineer does <strong>not</strong> redo this work. Instead, their responsibility is to:</p>



<ul class="wp-block-list">
<li>Use the published λ values correctly</li>



<li>Ensure the application matches the assumptions in the Safety Manual</li>



<li>Integrate diagnostics the way the certification expects</li>
</ul>



<p class="wp-block-paragraph">This is where many real‑world errors occur—not because the values are wrong, but because the application does not match the assumptions behind them.</p>



<h2 class="wp-block-heading" id="broader-reliability-concepts">Broader Reliability Concepts</h2>



<p class="wp-block-paragraph">Failure rates are not standalone constants; they are shaped by reliability principles that sit behind the λ numbers. A functional safety engineer must understand these broader ideas to avoid misusing published data.</p>



<p class="wp-block-paragraph"><strong>Systematic failures are not described by λ values.</strong> Random hardware failures can be modeled with rates; systematic failures cannot. They arise from design issues, configuration errors, software defects, or procedure gaps. They must be controlled through functional safety management (such as what IEC 61508 does), not reliability math.</p>



<p class="wp-block-paragraph"><strong>Failure‑rate uncertainty is always present.</strong> λ values are statistical estimates derived from limited testing, modeling, or field data. Certification bodies select a representative value for the SIL certificate, but there is natural variability behind every λ. The published number is not a perfect truth—it is a useful engineering approximation.</p>



<p class="wp-block-paragraph"><strong>Application and environment can change the true failure rate.</strong> A device used in corrosive service, high vibration, or aggressive cycling may experience a higher effective λ than the certified value. Likewise, poor installation, improper mounting, or low‑quality air supply (for valves) can shift failure behavior. The published λ applies only when the Safety Manual conditions are met.</p>



<p class="wp-block-paragraph"><strong>The Safety Manual controls the validity of the data.</strong> A λ value is only valid if the equipment is installed, wired, maintained, and operated according to the Safety Manual. If diagnostics are not used, if limits are exceeded, or if maintenance intervals differ from expectations, the certified failure rates no longer describe the real system.</p>



<h2 class="wp-block-heading" id="automated-valve-assemblies-and-how-their-failure-rates-combine">Automated Valve Assemblies and How Their Failure Rates Combine</h2>



<p class="wp-block-paragraph">Automated valves used as final elements are not single devices—they are assemblies made of several components, each with their own failure behavior. A functional safety engineer must gather λ values for <strong>each</strong> sub‑component and understand how they combine to represent the full final element.</p>



<p class="wp-block-paragraph">Typical valve‑assembly components include the valve body, actuator, solenoid, positioner, and any boosters or air relays. Each component contributes its own λDU, λDD, λSU, and λSD. Because the SIF fails if <strong>any one</strong> of these components cannot perform its intended action, the failure rates are combined using <strong>Boolean OR logic</strong>:</p>



<p class="wp-block-paragraph"><strong>λ_total ≈ λ₁ + λ₂ + λ₃ + …</strong></p>



<p class="wp-block-paragraph">In practice, most λDU comes from mechanical components such as the actuator and valve body. These parts typically lack meaningful diagnostics, so λDU remains the dominant contributor for the final element. Electronic components—like smart positioners—may reduce λDD by improving diagnostics, but they seldom reduce λDU in a significant way.</p>



<p class="wp-block-paragraph">Some manufacturers have begun certifying complete valve assemblies under IEC 61508. When available, this simplifies the engineer’s task: the assembly‑level λ values are already validated and consolidated under a single device boundary. See this example from Emerson: <a href="https://www.emerson.com/en-us/automation/valves/controlvalves/digital-isolation-solutions" target="_blank" rel="noopener">https://www.emerson.com/en-us/automation/valves/controlvalves/digital-isolation-solutions</a>. We at SIL Safe expect and hope this trend continues.</p>



<h2 class="wp-block-heading" id="practical-examples">Practical Examples</h2>



<h3 class="wp-block-heading" id="sensor-example-how-the-engineer-obtains-λ-values">Sensor Example: How the Engineer Obtains λ Values</h3>



<p class="wp-block-paragraph">A functional safety engineer begins by locating the λDU, λDD, λSU, and λSD values published in the device’s SIL certificate or Safety Manual. These documents reflect the certification body’s IEC 61508 assessment and define how the device behaves under expected diagnostic, installation, and environmental conditions. The engineer then confirms that the plant’s SIS logic and wiring actually use the diagnostic features assumed in the certification. Once these steps are complete, the engineer has the correct λ values that will be applied in later PFDavg or STR calculations.</p>



<h3 class="wp-block-heading" id="final-element-example-how-the-engineer-obtains-λ-values">Final Element Example: How the Engineer Obtains λ Values</h3>



<p class="wp-block-paragraph">For an automated valve assembly, the process is more involved because a final element is made of multiple components that must all function correctly. The engineer identifies each sub-component—such as the valve body, actuator, solenoid, positioner, and boosters—and retrieves λ values from each component’s SIL certificate or Safety Manual. The installation and diagnostic assumptions must match the application for the values to be valid. Because a failure in any single sub-component prevents the valve from performing its safety function, the engineer combines the λ values using OR logic to produce the total assembly failure rate. This assembled λ dataset will be used in downstream PFDavg and STR calculations.</p>



<h2 class="wp-block-heading" id="common-mistakes-engineers-make-with-failure-rates">Common Mistakes Engineers Make with Failure Rates</h2>



<p class="wp-block-paragraph">Even experienced engineers can misapply failure‑rate data if the context behind the numbers is not fully understood. A few issues show up repeatedly in real SIS design and verification work.</p>



<p class="wp-block-paragraph"><strong>Misinterpreting λDU vs. λDD.</strong> These two values behave very differently. λDU always goes into PFDavg because it represents failures that diagnostics cannot find. λDD may or may not impact STR or PFDavg depending on how diagnostics are integrated. Treating DD like DU—or assuming DD never matters—produces incorrect verification results.</p>



<p class="wp-block-paragraph"><strong>Using generic values without validating assumptions.</strong> Generic data tables, old spreadsheets, or handbook values can be misleading if the assumptions behind them do not match your application. Certified values come with defined conditions; generic values usually do not.</p>



<p class="wp-block-paragraph"><strong>Ignoring diagnostics.</strong> Sometimes diagnostics exist on the device but do not make it into the SIS logic or maintenance workflow. If a diagnostic bit is unwired, unmapped, filtered out, or simply ignored in operations, detected dangerous failures behave like undetected failures. In this case, λDD effectively becomes λDU.</p>



<p class="wp-block-paragraph"><strong>Treating λ values as universal constants.</strong> A λ value from a certificate is not automatically valid everywhere. Installation, environment, cycling, mounting, and maintenance determine whether the published λ truly reflects your plant’s conditions. Failure rates must be applied with engineering judgment, not copied blindly.</p>



<h2 class="wp-block-heading" id="when-diagnostics-exist-but-are-not-used">When Diagnostics Exist but Are Not Used</h2>



<p class="wp-block-paragraph">Diagnostics only add value when the SIS actually acts on them. A device may have excellent internal diagnostics, but if they are not used by the controls, the failure behaves as if it were <em>undetected</em>. In this situation, λDD is effectively added to λDU for purposes of SIL verification because the SIF remains impaired until someone actively responds.&nbsp; It could really impact PFDavg.</p>



<p class="wp-block-paragraph">This scenario is more common in brownfield facilities, older installations, poorly integrated SIS/BPCS architectures, or sites where diagnostics alarm but no work process exists to ensure timely repair. The lesson for the engineer is simple: <strong>diagnostics only help if the entire chain—from device to logic to maintenance—uses them correctly.</strong></p>



<h2 class="wp-block-heading" id="where-failure-rates-influence-sis-design-decisions">Where Failure Rates Influence SIS Design Decisions</h2>



<p class="wp-block-paragraph">Failure‑rate data influences several real‑world engineering choices throughout the SIS life‑cycle. Understanding how λ values behave helps the engineer select architectures, manage proof‑test strategies, and apply diagnostics intentionally—not blindly.</p>



<p class="wp-block-paragraph"><strong>Architecture selection.</strong> If λDU is high, additional redundancy may be required to achieve the target SIL. Failure‑rate data helps determine whether 1oo1, 1oo2, or 2oo3 architectures are appropriate for the SIF.</p>



<p class="wp-block-paragraph"><strong>Choosing the proof‑test interval (TI).</strong> Proof tests exist to reveal λDU—the part diagnostics cannot see. A higher λDU or lower proof‑test coverage (Cpt) typically requires a shorter TI. Failure‑rate data directly shapes the proof‑test strategy.&nbsp; See <a href="https://silsafe.net/proof-test-coverage-cpt-to-improve-pfdavg/" data-type="post" data-id="2784">this other article about CPT.</a></p>



<p class="wp-block-paragraph"><strong>Partial‑stroke testing for final elements.</strong> For valves that dominate λDU, partial‑stroke testing may reduce the exposure time of dangerous failures. This decision depends on understanding which failure modes are found by diagnostics versus proof tests.</p>



<p class="wp-block-paragraph"><strong>Diagnostic selection and integration.</strong> λDD and λSD only help if diagnostics are wired, mapped, and acted on. Understanding the diagnostic coverage of a device and the assumptions in the Safety Manual helps engineers design logic and maintenance workflows that truly reduce risk.</p>



<h2 class="wp-block-heading" id="summary-a-practical-way-to-think-about-failure-rates">Summary: A Practical Way to Think About Failure Rates</h2>



<p class="wp-block-paragraph">Failure rates are the foundation of how we model and manage random hardware failures in functional safety. λDU represents the portion of failures that silently erode the ability of a SIF to act when needed. λDD and λSD describe failures that diagnostics can reveal, informing how often a SIF may trip unnecessarily and how reliably it stays available. λSU supports reliability but does not influence risk directly.</p>



<p class="wp-block-paragraph">These four failure‑rate categories show up throughout the IEC 61511 safety life‑cycle: in equipment selection, architectural decisions, proof‑test strategy, diagnostic design, and SIL verification. If the failure‑rate assumptions in the SIL certificate and Safety Manual are respected—and if diagnostics are used correctly—then λ values become powerful tools for designing and maintaining a dependable SIS.</p>



<h2 class="wp-block-heading">Need Help With Failure Rates?</h2>



<p class="wp-block-paragraph">For more help applying failure‑rate data correctly—or for third‑party SIS verification—reach out through SIL Safe&#8217;s contact page.</p>



<h2 class="wp-block-heading">Further Reading</h2>



<p class="wp-block-paragraph">Internal:</p>



<ul class="wp-block-list">

<li><a href="https://silsafe.net/functional-safety-glossary/">SIL Safe Glossary</a></li>


<li><a href="https://silsafe.net/proof-testing-of-sifs/">Proof Testing of SIFs: Understanding Its 3 Purposes and Its Importance</a></li>


<li><a href="https://silsafe.net/pfdavg-explained/">PFDavg Explained: 6 Essentials for Getting Started with SIL Calculations</a></li>

</ul>



<p class="wp-block-paragraph">External:</p>



<ul class="wp-block-list">

<li><a href="https://www.isa.org/intech-home/2020/may-june/departments/updated-guidelines-for-using-isa-iec-61511-funct" target="_blank" rel="noopener">ISA — Updated guidelines for using ISA-IEC 61511</a></li>


<li><a href="https://www.emerson.com/en-us/automation/valves/controlvalves/digital-isolation-solutions" target="_blank" rel="noopener">Emerson — Certified automated valves</a></li>

</ul>



<h2 class="wp-block-heading" id="q-a-section">Q&amp;A Section</h2>



<ul class="wp-block-list">
<li><strong>What’s the practical difference between λDU and λDD?</strong><br>λDU drives PFDavg because the failure is both dangerous and <em>undetected</em>. λDD is dangerous but <em>detected</em>, so it typically results in an alarm or forced-safe trip and may contribute to STR depending on configuration.</li>



<li><strong>Do failure rates change over time in my facility?</strong><br>Yes. The published λ values assume controlled conditions; however, real‑world factors like environment, cycling, installation quality, and maintenance can shift the true failure rate up or down.&nbsp;However, we assume constant for modeling purposes.</li>



<li><strong>Why do certified products help for accurate λ values?</strong><br>Certified devices provide validated λ values and clearly defined DU/DD/SU/SD splits under IEC 61508. This reduces interpretation errors and ensures the assumptions behind the numbers are understood and controlled.</li>



<li><strong>Is it okay to use generic failure rates?</strong><br>Only if you confirm that they match your device and application. Generic values may not reflect your environment, proof-test strategy, or diagnostic coverage.</li>



<li><strong>What if the device doesn’t have a SIL certificate?</strong><br>You can still use it, but you must rely on credible manufacturer reliability data, validated site history, or reputable sources such as OREDA. These represent the alternate data routes when a certified data path is not available. Assumptions must match the application, and justification must be documented.</li>



<li><strong>Why do final elements almost always have higher λDU than sensors?</strong><br>Most λDU in a final element comes from mechanical components like the actuator and valve body, which lack strong diagnostics. Sensors generally have better diagnostic coverage and fewer mechanical wear points, so their λDU values are typically much lower.</li>



<li><strong>I understand why λDU impacts PFDavg, but why would λDD impact PFDavg?</strong><br>It depends on how the controls and diagnostics are set up. In many low-demand configurations, only λDU enters PFDavg. But if a dangerous detected failure leaves the SIF unable to perform its function—and the system does not act on or repair that diagnostic—then that portion of λDD effectively behaves like λDU and may need to be included in the PFDavg analysis.</li>



<li><strong>How are λ values used differently in high-demand or continuous modes?</strong><br>Failure rate is highly relevant but used differently.&nbsp; In low-demand mode (most common), we use λDU to calculate <strong>PFDavg</strong>.&nbsp;&nbsp;In continuous or high-demand modes, <strong>PFDavg is not used</strong>. Instead, SIL is based on <strong>PFH</strong> — the rate of dangerous failure per hour. In these cases, λDU is used to calculate PFH through a different series of equations than PFDavg.&nbsp;</li>
</ul>



<!-- FAQPage Schema (JSON-LD) — Verbatim conversion -->
<script type="application/ld+json">
{
  "@context": "https://schema.org",
  "@type": "FAQPage",
  "mainEntity": [
    {
      "@type": "Question",
      "name": "What’s the practical difference between λDU and λDD?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "λDU drives PFDavg because the failure is both dangerous and undetected. λDD is dangerous but detected, so it typically results in an alarm or forced-safe trip and may contribute to STR depending on configuration."
      }
    },
    {
      "@type": "Question",
      "name": "Do failure rates change over time in my facility?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Yes. The published λ values assume controlled conditions; however, real-world factors like environment, cycling, installation quality, and maintenance can shift the true failure rate up or down. However, we assume constant for modeling purposes."
      }
    },
    {
      "@type": "Question",
      "name": "Why do certified products help for accurate λ values?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Certified devices provide validated λ values and clearly defined DU/DD/SU/SD splits under IEC 61508. This reduces interpretation errors and ensures the assumptions behind the numbers are understood and controlled."
      }
    },
    {
      "@type": "Question",
      "name": "Is it okay to use generic failure rates?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Only if you confirm that they match your device and application. Generic values may not reflect your environment, proof-test strategy, or diagnostic coverage."
      }
    },
    {
      "@type": "Question",
      "name": "What if the device doesn’t have a SIL certificate?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "You can still use it, but you must rely on credible manufacturer reliability data, validated site history, or reputable sources such as OREDA. These represent the alternate data routes when a certified data path is not available. Assumptions must match the application, and justification must be documented."
      }
    },
    {
      "@type": "Question",
      "name": "Why do final elements almost always have higher λDU than sensors?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Most λDU in a final element comes from mechanical components like the actuator and valve body, which lack strong diagnostics. Sensors generally have better diagnostic coverage and fewer mechanical wear points, so their λDU values are typically much lower."
      }
    },
    {
      "@type": "Question",
      "name": "I understand why λDU impacts PFDavg, but why would λDD impact PFDavg?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "It depends on how the controls and diagnostics are set up. In many low-demand configurations, only λDU enters PFDavg. But if a dangerous detected failure leaves the SIF unable to perform its function—and the system does not act on or repair that diagnostic—then that portion of λDD effectively behaves like λDU and may need to be included in the PFDavg analysis."
      }
    },
    {
      "@type": "Question",
      "name": "How are λ values used differently in high-demand or continuous modes?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Failure rate is highly relevant but used differently. In low-demand mode (most common), we use λDU to calculate PFDavg. In continuous or high-demand modes, PFDavg is not used. Instead, SIL is based on PFH—the rate of dangerous failure per hour. In these cases, λDU is used to calculate PFH through a different series of equations than PFDavg."
      }
    }
  ]
}
</script>
]]></content:encoded>
					
					<wfw:commentRss>https://silsafe.net/failure-rates-in-functional-safety/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>How to Apply the Beta Factor: A Practical Guide to Common Cause Failures in SIL Verification</title>
		<link>https://silsafe.net/how-to-apply-beta-factor-for-common-cause-failure/</link>
					<comments>https://silsafe.net/how-to-apply-beta-factor-for-common-cause-failure/#respond</comments>
		
		<dc:creator><![CDATA[mamerten]]></dc:creator>
		<pubDate>Sun, 19 Oct 2025 16:44:46 +0000</pubDate>
				<category><![CDATA[Math Related]]></category>
		<category><![CDATA[PFDavg]]></category>
		<guid isPermaLink="false">https://silsafe.net/?p=3628</guid>

					<description><![CDATA[Learn how to apply the beta factor (β) in SIL verification and understand its link to common cause failures. This guide walks through where β values come from, how they affect PFDavg, and why small changes can shift your SIL rating.]]></description>
										<content:encoded><![CDATA[
<p class="wp-block-paragraph">When engineers perform SIL verification, most of the attention goes toward failure rates, proof test intervals, or diagnostics. But one input that carries enormous influence is <strong>beta factor (β)</strong> — the number that represents how much of your redundancy can be trusted to behave independently. Misunderstanding it can make even a perfect SIL calculation look better on paper than it really is in the field. This article walks through what β is, how it links to <strong>common cause failure (CCF)</strong>, and how to apply it correctly with practical math and examples.</p>



<hr class="wp-block-separator has-alpha-channel-opacity"/>



<h2 class="wp-block-heading">What Is It? (CCF vs. Beta Factor and Why It Matters)</h2>



<p class="wp-block-paragraph"><strong>Common cause failure (CCF)</strong>&nbsp;is a concept of multiple things failing for the same core reason. It is a concept in various industries and practices. In a Safety Instrumented System (SIS) it means multiple channels in redundant architectures fail together because of a shared cause. For example:</p>



<ul class="wp-block-list">
<li><strong>Sensors:</strong> Two pressure transmitters mounted side‑by‑side exposed to the same vibration or plugged impulse lines.</li>



<li><strong>Logic solver:</strong> Both processor cards affected by the same unknown software bug.</li>



<li><strong>Final element:</strong> Two solenoids sharing the same instrument air header that fail simultaneously.</li>
</ul>



<p class="wp-block-paragraph">The <strong>beta factor (β)</strong> is the fraction of all dangerous failures that occur from a CCF. It is the mathematical representation of the CCF concept. It converts the qualitative idea of “common cause” into a quantifiable term used in equations. Ignoring β will almost always make your <strong>average probability of failure on demand (PFDavg)</strong> appear lower than it really is — giving a false sense of security.</p>



<hr class="wp-block-separator has-alpha-channel-opacity"/>



<h2 class="wp-block-heading">Where to Get Beta Factor Values</h2>



<p class="wp-block-paragraph">There are two primary sources:</p>



<ol class="wp-block-list">
<li><strong>SIL certificates or FMEDA reports</strong> from manufacturers. These often include β values based on test data and modeling assumptions.</li>



<li><strong>IEC 61508‑6 Annex D</strong> – the formal method for determining β using dependent failure analysis. This process is complex and beyond the scope of this article, but it is the standard reference.</li>
</ol>



<p class="wp-block-paragraph">Other useful references include <strong>ISA TR84.00.02</strong>, <strong>OREDA</strong>, and <strong>CCPS</strong> reliability publications. Some companies maintain internal databases based on field performance.</p>



<p class="wp-block-paragraph">A practical approach:</p>



<ul class="wp-block-list">
<li>Start with the β from the SIL certificate.</li>



<li>Review Annex D factors to see if your installation justifies adjustment.</li>



<li>Document and justify the final value in your <strong>Safety Requirements Specification (SRS)</strong>.</li>
</ul>



<hr class="wp-block-separator has-alpha-channel-opacity"/>



<h2 class="wp-block-heading">How Architecture Affects β</h2>



<p class="wp-block-paragraph">The β‑factor is not constant; it changes with architecture. This is a common point of confusion within Functional Safety. Think of redundancy as a team of <strong>security guards protecting a building</strong>:</p>



<ul class="wp-block-list">
<li>In a <strong>1oo2</strong> architecture, either guard can respond to stop a robbery. If both guards eat the same lunch and get food poisoning, that’s a CCF — a shared vulnerability.</li>



<li>In a <strong>1oo4</strong> architecture, four guards must all get sick for the robbery to succeed. The common cause event would have to be much stronger or more universal, or &#8220;more common,&#8221; to take them all down. Think of a really widespread case of food poisoning. Therefore, the effective β is lower.</li>
</ul>



<p class="wp-block-paragraph">This creates a feedback loop: changing architecture alters the PFDavg equation, but it also justifies a new β — which again changes the PFDavg. It is a bit odd, but it is correct. IEC 61508‑6 Annex D Table D.5 gives architectural correction factors (roughly 0.3 – 1.74) that help account for this effect. Most β values published in SIL certificates are assumed for 1oo2 configurations.</p>



<hr class="wp-block-separator has-alpha-channel-opacity"/>



<h2 class="wp-block-heading">NooN Architectures and Why β Is Not Present</h2>



<p class="wp-block-paragraph">For <strong>NooN</strong> architectures (e.g., 2oo2 or 3oo3), β is normally not present in the mathematics. Any single channel failure will cause the function to fail because all channels must work. The total λDU (dangerous undetected failure rate) already includes both independent and common‑cause contributions. Applying a separate β term would double‑count the effect.</p>



<p class="wp-block-paragraph">Returning to the guard analogy: if three guards must all respond correctly and any one failure causes system failure, it doesn’t matter if their failures were independent or shared — the result is the same. For this reason, modeling tools and standards do not include β for NooN designs.</p>



<p class="wp-block-paragraph">Note that this can be thought of as &#8220;β is set to 0 for NooN&#8221; but that is not how we at SIL Safe think of this. Yes, you can use the MooN PFDavg equations, set&nbsp;β = 0, and get the same results. But as we discussed above,&nbsp;β is accounted for in λDU.</p>



<hr class="wp-block-separator has-alpha-channel-opacity"/>



<h2 class="wp-block-heading">Typical Values and Influencing Factors</h2>



<figure class="wp-block-table"><table class="has-fixed-layout"><thead><tr><th>Subsystem</th><th>Typical β Range</th></tr></thead><tbody><tr><td>Sensors</td><td>0.02 – 0.10</td></tr><tr><td>Logic Solvers</td><td>0.01 – 0.05</td></tr><tr><td>Final Elements</td><td>0.05 – 0.15</td></tr></tbody></table></figure>



<p class="wp-block-paragraph">Factors that increase β:</p>



<ul class="wp-block-list">
<li>Common environment (same enclosure, same power, same impulse lines)</li>



<li>Identical design and firmware</li>



<li>Shared maintenance procedures or simultaneous testing</li>
</ul>



<p class="wp-block-paragraph">Factors that decrease β:</p>



<ul class="wp-block-list">
<li>Physical separation and shielding</li>



<li>Vendor or technology diversity</li>



<li>Independent power and utilities</li>



<li>Separate calibration schedules</li>



<li>Staggered proof testing</li>
</ul>



<hr class="wp-block-separator has-alpha-channel-opacity"/>



<h2 class="wp-block-heading">PFDavg Equation Discussion</h2>



<p class="wp-block-paragraph">In a <strong>1oo2</strong> system, the total PFDavg is made up of independent and common‑cause terms. Using a simplified version of the equation:</p>



<figure class="wp-block-image size-full is-resized"><img decoding="async" width="560" height="78" src="https://silsafe.net/wp-content/uploads/2025/10/PFDavg-1oo2-basic.webp" alt="Equation showing how the beta factor (β) influences PFDavg in a 1oo2 safety architecture by combining independent and common cause failure probabilities." class="wp-image-3626" style="aspect-ratio:7.180925666199158;width:438px;height:auto" srcset="https://silsafe.net/wp-content/uploads/2025/10/PFDavg-1oo2-basic.webp 560w, https://silsafe.net/wp-content/uploads/2025/10/PFDavg-1oo2-basic-300x42.webp 300w" sizes="(max-width: 560px) 100vw, 560px" /></figure>



<p class="wp-block-paragraph">Where:</p>



<ul class="wp-block-list">
<li>λ<sub>DU</sub> = dangerous undetected failure rate (per hour)</li>



<li>TI = proof test interval (hours)</li>



<li>β = fraction of failures due to common causes</li>
</ul>



<p class="wp-block-paragraph">The first term (squared) represents two independent latent failures. The second (linear) term represents the probability that both channels fail from a shared cause. Because the independent term is squared, it becomes much smaller — making β a powerful driver in redundant designs.</p>



<hr class="wp-block-separator has-alpha-channel-opacity"/>



<h2 class="wp-block-heading">PFDavg Calculation Examples (1oo2)</h2>



<h3 class="wp-block-heading">Case A – Good β Factor (Target: SIL 2)</h3>



<p class="wp-block-paragraph"><strong>Given:</strong> β = 0.03, λ<sub>DU</sub> = 2E-6/hr, TI = 8,760 hr (1 year)</p>



<ol class="wp-block-list">
<li>Independent term:<br>(1/3) × [(1 − 0.03) × 2E-6 × 8,760]² = 9.63E-5</li>



<li>Common‑cause term:<br>(1/2) × 0.03 × 2E-6 × 8,760 = 2.63E-4</li>



<li><strong>Total PFDavg </strong>= 3.59E-4</li>



<li><strong>RRF = 1 / PFDavg </strong>= 2,785 → SIL 2</li>
</ol>



<p class="wp-block-paragraph"><strong>STR (spurious trip rate):</strong> assume λ<sub>SP</sub> = 1E-5/hr per channel. For 1oo2 (trip on any channel):<br>STR ≈ 2 × λ<sub>SP</sub> = 2E-5/hr → 0.175 trips per year ≈ a trip every 5.7 years.</p>



<h3 class="wp-block-heading">Case B – Poor β Factor (Target: SIL 1)</h3>



<p class="wp-block-paragraph"><strong>Given:</strong> β = 0.15, λ<sub>DU</sub> = 2E-6/hr, TI = 8,760 hr (same assumptions)</p>



<ol class="wp-block-list">
<li>Independent term:<br>(1/3) × [(1 − 0.15) × 2E-6 × 8,760]² = 7.4E-5</li>



<li>Common‑cause term:<br>(1/2) × 0.15 × 2E-6 × 8,760 = 1.31E-3</li>



<li><strong>Total PFDavg </strong>= 1.38E-3</li>



<li><strong>RRF = 1 / PFDavg </strong>= 725 → SIL 1</li>
</ol>



<p class="wp-block-paragraph"><strong>STR (same λ<sub>SP</sub>):</strong> 2E-5/hr → 0.175 trips/year or one trip every 5.7 years (same as Case A).</p>



<p class="wp-block-paragraph"><strong>Comparison:</strong> Raising β from 0.03 to 0.15 increased total PFDavg by ≈ 4×, dropping the SIL from 2 to 1.</p>



<hr class="wp-block-separator has-alpha-channel-opacity"/>



<h2 class="wp-block-heading">Impact of Common Cause Portions of PFDavg</h2>



<p class="wp-block-paragraph">Because the independent term is squared, it diminishes as reliability increases, while the common‑cause term remains linear. The result: CCF often dominates total PFDavg in redundant architectures.</p>



<ul class="wp-block-list">
<li><strong>Case A (β = 0.03)</strong>: PFDavg_ind = 9.63E-5, PFDavg = 2.63E-4 → CCF ≈ 73% of total.</li>



<li><strong>Case B (β = 0.15)</strong>: PFDavg_ind = 7.4E-5, PFDavg = 1.31E-3 → CCF ≈ 95% of total.</li>
</ul>



<p class="wp-block-paragraph">Even at modest β values, most of the total unavailability stems from common causes — which is why reducing β through independence and diversity is far more effective than upgrading the architecture. The same pattern holds true for higher architectures like 2oo3.</p>



<hr class="wp-block-separator has-alpha-channel-opacity"/>



<h2 class="wp-block-heading">Common Mistakes</h2>



<ul class="wp-block-list">
<li>Applying the same β to every subsystem without justification.</li>



<li>Assuming diagnostics lower β (they don’t; they reduce λ<sub>DU</sub>).</li>



<li>Forgetting to justify β selection in the SRS.</li>



<li>Focusing only on adding redundancy instead of lowering β.</li>
</ul>



<hr class="wp-block-separator has-alpha-channel-opacity"/>



<h2 class="wp-block-heading">Summary</h2>



<p class="wp-block-paragraph">The β‑factor is what connects real‑world dependencies to your math. A small change in β can shift a design from SIL 2 to SIL 1 — even when every other parameter is identical. Real independence, not just redundancy, drives reliability.</p>



<p class="wp-block-paragraph">Document your β assumptions, revisit them throughout the SIS life‑cycle, and when in doubt, verify them against IEC 61508‑6 Annex D or manufacturer data.</p>



<p class="wp-block-paragraph"><em>If you’d like a second set of eyes on your β assumptions or SIL verification model, contact <strong>SIL Safe</strong> for a practical review or training session.</em></p>



<hr class="wp-block-separator has-alpha-channel-opacity"/>



<h2 class="wp-block-heading">Q&amp;A</h2>



<p class="wp-block-paragraph"><strong>1. Does beta factor (β) apply only to 1oo2 architectures?</strong><br>No. The concept applies to any redundant system where channels can fail together, though it’s most visible in 1oo2.</p>



<p class="wp-block-paragraph"><strong>2. Can β ever be zero?</strong><br>Not realistically. There’s always some shared factor — environment, maintenance, or design — that introduces correlation.&nbsp; For NooN scenarios,&nbsp;<strong>β</strong> does not appear in the mathematics, so some say&nbsp; β is zero, but it is still included in the&nbsp;λ<sub>DU</sub> term.</p>



<p class="wp-block-paragraph"><strong>3. What does β actually represent?</strong><br>It’s the percentage of dangerous failures that are shared between channels rather than truly independent.</p>



<p class="wp-block-paragraph"><strong>4. How often should β be revisited?</strong><br>Any time design, environment, or procedures change — typically during periodic review or FSA Stage 4.&nbsp; It is common for&nbsp;β to change over the safety lifecycle.</p>



<p class="wp-block-paragraph"><strong>5. Can one simply apply the β in the SIL certificate?</strong><br>Use it as a starting point. Then compare against Annex D’s checklist and justify any adjustment.</p>



<p class="wp-block-paragraph"><strong>6. What portion can common cause contribute to total PFDavg?</strong><br>Often the majority. For 1oo2 with β ≈ 0.1–0.2, common cause failures can make up 70–95 % of total PFDavg — the same trend appears in higher architectures.</p>



<p class="wp-block-paragraph"><strong>7. I&#8217;ve heard that for NooN architectures, β is set to 0, but I thought that is not possible?</strong><br>β is involved in NooN, but it is already included in the&nbsp;λ<sub>DU</sub> term. Meaning we don&#8217;t want to double count. The NooN equations will not have any&nbsp;β terms. Thus, it can be thought of as they were set to zero, but that is not technically correct.</p>



<h2 class="wp-block-heading">Further Reading</h2>



<p class="wp-block-paragraph">Internal:</p>



<ul class="wp-block-list">

<li><a href="https://silsafe.net/functional-safety-glossary/">SIL Safe Glossary</a></li>


<li><a href="https://silsafe.net/pfdavg-explained/">PFDavg Explained: 6 Essentials for Getting Started with SIL Calculations</a></li>

</ul>



<p class="wp-block-paragraph">External:</p>



<ul class="wp-block-list">

<li><a href="https://ez.analog.com/ez-blogs/b/engineerzone-spotlight/posts/how-to-quantify-common-cause-failures" target="_blank" rel="noopener">Analog Devices — Guidance on quantification of common cause failure</a></li>


<li><a href="https://www.nrc.gov/docs/ml0729/ml072970404.pdf" target="_blank" rel="noopener">NRC — Common Cause Failure Guidance (PDF)</a></li>


<li><a href="https://www.nrc.gov/reading-rm/doc-collections/nuregs/staff/sr2225/index" target="_blank" rel="noopener">NRC — NUREG-2225 Index</a></li>

</ul>



<!-- FAQPage Schema (JSON-LD) — Verbatim conversion -->
<script type="application/ld+json">
{
  "@context": "https://schema.org",
  "@type": "FAQPage",
  "mainEntity": [
    {
      "@type": "Question",
      "name": "Does beta factor (β) apply only to 1oo2 architectures?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "No. The concept applies to any redundant system where channels can fail together, though it’s most visible in 1oo2."
      }
    },
    {
      "@type": "Question",
      "name": "Can β ever be zero?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Not realistically. There’s always some shared factor — environment, maintenance, or design — that introduces correlation. For NooN scenarios, β does not appear in the mathematics, so some say β is zero, but it is still included in the λDU term."
      }
    },
    {
      "@type": "Question",
      "name": "What does β actually represent?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "It’s the percentage of dangerous failures that are shared between channels rather than truly independent."
      }
    },
    {
      "@type": "Question",
      "name": "How often should β be revisited?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Any time design, environment, or procedures change — typically during periodic review or FSA Stage 4. It is common for β to change over the safety lifecycle."
      }
    },
    {
      "@type": "Question",
      "name": "Can one simply apply the β in the SIL certificate?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Use it as a starting point. Then compare against Annex D’s checklist and justify any adjustment."
      }
    },
    {
      "@type": "Question",
      "name": "What portion can common cause contribute to total PFDavg?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Often the majority. For 1oo2 with β ≈ 0.1–0.2, common cause failures can make up 70–95 % of total PFDavg — the same trend appears in higher architectures."
      }
    },
    {
      "@type": "Question",
      "name": "I’ve heard that for NooN architectures, β is set to 0, but I thought that is not possible?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "β is involved in NooN, but it is already included in the λDU term. Meaning we don't want to double count. The NooN equations will not have any β terms. Thus, it can be thought of as they were set to zero, but that is not technically correct."
      }
    }
  ]
}
</script>

]]></content:encoded>
					
					<wfw:commentRss>https://silsafe.net/how-to-apply-beta-factor-for-common-cause-failure/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
	</channel>
</rss>

<!--
Performance optimized by W3 Total Cache. Learn more: https://www.boldgrid.com/w3-total-cache/?utm_source=w3tc&utm_medium=footer_comment&utm_campaign=free_plugin

Page Caching using Disk: Enhanced 
Minified using Disk

Served from: silsafe.net @ 2026-06-08 14:20:41 by W3 Total Cache
-->