Oil Rig Explosion

By ThinkReliability Staff

On April 20, 2010 about 10 pm a huge explosion rocked a semi-submersible drilling oil rig about 40 miles off the coast of Louisiana in the Gulf of Mexico. The oil rig was called the Deepwater Horizon and was owned by Transocean Ltd and leased to the British Petroleum Company through September 2013.

The oil rig burned for about 36 hours before sinking.  126 people were on the oil rig at the time of the explosion.  Eleven are missing and presumed dead and 4 were critically injured. Oil continues to leak from the wellhead more than a mile underwater on the ocean floor at an estimated rate of 42,000 gallons a day.

Remotely operated submersible vehicles were used to examine the wellhead.  The vehicles were also used in an effort to manually trigger the blowout preventer, which would close the wellhead and prevent any farther release of oil.  The blowout preventer is a 450-ton valve installed at the wellhead that is designed to automatically shut to prevent oil leaks in the event of an accident.  Attempts to manually close the blowout preventer have not been successful.

The other containment options being explored are drilling a separate well nearby to plug the flow at a location below the blowout preventer and building underwater domes that would contain the oil until it could be safely pumped to the surface for disposal.  Both of these alternatives are being actively worked and will take months to complete.  It is estimated that 4.2 million gallons of oil will be released if the blowout preventer is not able to be closed.

The cause of the explosion is unknown at this time.  An investigation is underway by the Coast Guard and the Minerals Management Service.

A preliminary root cause analysis can be started using the information that is known and details can be added as they become available.  The analysis can be documented using a Cause Map which is a simple, intuitive format that visually lays out all known causes for an incident.  The first step in building a Cause Map is to determine how the organizational goals were impacted by the incident.  Causes for each impacted goal are determined to begin building the Cause Map.

In this case, the safety goal was impacted because 11 people were killed and several injured.  The environmental goal was impacted because there was a significant oil release.  The materials goal was impacted because the $700 million oil rig is a complete loss and the production/schedule goal was impacted because the oil drilling operation is shut down.

Click on the “Download PDF” button above to view an initial Cause Map.

The Future of NASA

By Kim Smiley

A previous blog discussed a shortfall in the National Aeronautics and Space Agency (NASA) budget.  The lack of funding put NASA’s organization goals in jeopardy, including a planned return mission to the moon.  Then-President George W. Bush had tasked NASA to return to the moon five years ago and NASA has been working toward this goal since.

President Obama announced his vision for NASA during a speech Kennedy Space Center on April 15.  He canceled plans for a moon mission and redirected NASA to focus on sending astronauts to an asteroid and work toward an eventual Mars landing.  The proposed budget would boost NASA funding by six billion over the next five years.

President Obama’s plan calls for private companies to fly to the space station using their own rockets and ships, freeing up NASA resources for basic research and development of technologies for trips beyond earth’s orbit.  The final space shuttle mission is scheduled for September 2011 after which the US will depend entirely on Russia to carry astronauts to the space station until a replacement for the space shuttle is developed.  Additionally, the space station’s life would be extended by five years as part of the Obama plan.

The planning necessary to achieve a goal of this complexity is mind boggling.   There are many new technical issues to consider and brand new equipment will need to be designed.  There are many, many potential problems that could arise during this design process and mission.

Cause Mapping is often used to perform a root cause analysis of an incident that has occurred, but it can also be used to proactively approach a problem by building a map that captures failures that could happen.  Identifying potential problems before they happen would allow NASA to mitigate risks and allocate resources efficiently.

Cause Maps could be built to any level of detail that was deemed appropriate.  Cause Maps could be developed to capture all potential failure modes for something as small as a single component or for something as large the entire mission.

Chinatown Fire NYC

By ThinkReliability Staff

On April 11, 2010, a fire broke out in a store on the first level of an apartment building on the 200 block of Grand Street in Chinatown, New York City. The fire would eventually reach 7 alarms, requiring 250 firefighters to fight. Once firefighters were able to enter the building the next day, they found one body.  33 people, including 29 firefighters, were injured and approximately 200 were left homeless, as the blaze left three buildings needing to be demolished and at least two more severely damaged.

For years the buildings affected (which were more than a century old) had been neglected, including violations for missing smoke detectors and a boiler which released smoke into the buildings.  At this point it’s unclear how these violations may have contributed to the fire and its aftermath.  At the time of the fire, the buildings were for sale for over $9 million, although no offers had been made.    There were many goals impacted by the fire, but the loss of human life and number of injuries are the focus for our investigation.

The injuries (many of which were smoke inhalation) were caused by a seven-alarm fire.  The fire was able to reach seven alarms because the fire was able to quickly spread through the six-story building.  In order for a fire to start heat, fuel and oxygen are required.  There’s no shortage of fuel and oxygen in an apartment building, due to necessities for people to live there.  The heat (or ignition source) may have been provided by exposed wiring that many residents have complained of, or the boiler previously cited for neglect.  Or, it may have been something else altogether.  (However, arson is not suspected at this point.)

The fire was able to spread so quickly due to a large number of voids and shafts in the building – a function of its  age.  Another cause that may have contributed to the death was a potential lack of warning of the fire due to the missing smoke detectors for which the building had also been previously cited.

Throughout an investigation there may be additional tools that help to clarify the incident.  Here we use a timeline to show the sequence of events.  A timeline is especially useful for complex events such as this.

A thorough root cause analysis built as a Cause Map can capture all of the causes in a simple, intuitive format that fits on one page.  In fact, the outline, Cause Map and timeline for this event easily fit on one page.  (View them by clicking “Download PDF” above.)   Even more detail can be added to this Cause Map as more information is released about the incident. As with any investigation the level of detail in the analysis is based on the impact of the incident on the organization’s overall goals.

Deadly Mine Explosion in West Virginia

By Kim Smiley

Around 3 pm on April 5, 2010 in Montcoal, West Virginia, a huge explosion rocked the Upper Big Branch South mine owned by Massey Energy Company.  At least 25 miners were killed, both from the explosion itself and suffocation caused by high levels of dangerous gases.

There are still 4 miners missing.  The missing miners were working farther back in the mine and the hope is that they were able to reach one of the airtight chambers stocked with enough food, water and oxygen for several days.  Rescue efforts are underway and drilling efforts are ongoing to add additional ventilation so that the gas levels can be reduced to safe levels to allow rescue workers to enter the mine.

This is the worst mine accident in the US in over 20 years. If the 4 missing miners are not found alive, this accident will have the highest number of fatalities since a 1970 mine killed 38 in Hyden, Kentucky.

What triggered this explosion is not known at this time, but both state and federal agencies have initiated investigations.

Even though many details are still unknown, a root cause analysis can be started by building an initial Cause Map.  There was an explosion which means there must have been an ignition source, flammable material and oxygen present.

The source of the flammable material is known since there were high methane gas levels in the mine.  Methane gas is naturally occurring in coal mines and must be continually vented.  It can also be assumed that the mine ventilation was inadequate for some reason since the gas levels built up.  Coal dust accumulation may have also contributed to the accident since powdered combustible material in an enclosed space is a very explosive combination.

The source of the spark that ignited the explosion is still unknown.

More information will become available as the investigation proceeds and a more detailed Cause Map can be built as additional causes are added.

Media reports about the accident have discussed past safety violations cited at the mine, but it won’t be clear if the accident was preventable until the investigation is completed.  What is known that in March 2010, the Mine Safety and Health Administration cited the Upper Big Branch mine for 53 safety violations.  In additional to the recent citations, there was also a troubling increasing trend in citations, which more than doubled between 2008 and 2009.

Hopefully, the information obtained during the investigation will provide useful lessons learned that can be implemented to prevent a similar accident in the future.

Oil Refinery Explosion Rocks Anacortes Washington

By ThinkReliability Staff

Early this morning (Friday, April 2nd, 2010), an explosion at an oil refinery rocked the town of Anacortes, Washington.  The cause of the explosion is not yet known.  However, even with very little information a root cause analysis of the event can be started.  It is extremely helpful to gather information regarding an incident as soon as possible after it occurs.  More information can always be added as the investigation continues.

In this case, the date and approximate time are known.  It’s not clear if there was anything different or unusual at the refinery this morning, so we’ll put a question mark here for now.  Detailed information regarding the exact location of the incident has been released, so we can record that the explosion occurred in an Anacortes, Washington oil refinery, at the catalytic reformer hydrotreater unit while maintenance work was being performed.

We also know that some of the company’s goals have been impacted.  One worker was killed, four workers were seriously injured, and three workers are missing.  These are all impacts to the safety goal.  Because of the severe impact to the safety goal and the loss of human life, the other goals are far less important.  However, we can record the impacts for assistance in performing the analysis.

Reports of black smoke in the area indicate pollution which is an impact to the environmental goal.  There are reports of some damage to nearby buildings, which could be considered an impact to the customer service goal.  The damage to the plant, and possible delay in production as a result, are impacts to the production/schedule and property goals.  Additionally, the emergency response is an impact to the labor goal.

The costs resulting from the impacts to the goals and the frequency of events such as these are not immediately known.  This is information that can be filled in as the root cause analysis continues.   As more information is released regarding the incident, we can continue our investigation.