ML18101B205

From kanterella
Jump to navigation Jump to search
Insp Repts 50-272/95-81 & 50-311/95-81 on 951005-1113. Violations Noted.Major Areas Inspected:Ep,Emergency Response Equipment Area,Engineering,Operations,Training Organizations & Mgt Oversight Area
ML18101B205
Person / Time
Site: Salem  PSEG icon.png
Issue date: 01/23/1996
From: Calvert J, Ruland W
NRC OFFICE OF INSPECTION & ENFORCEMENT (IE REGION I)
To:
Shared Package
ML18101B203 List:
References
50-272-95-81, 50-311-95-81, NUDOCS 9602080027
Download: ML18101B205 (27)


See also: IR 05000272/1995081

Text

./ **

DOCKET/REPORT NOS.

LICENSEE:

FACILITY:

DATES:

INSPECTORS:

LEAD INSPECTOR:

APPROVED BY:

9602080027 960131 ..

PDR

ADOCK OS000272

G

PDR

U. S. NUCLEAR REGULATORY COMMISSION

REGION I

50-272/95-81

50-311/95-81

Public Service Electric and Gas Company

Hancocks Bridge, New Jersey 08038

Salem Nuclear Generating Station

October 5, 1995 - November 13, 1995

Lonny L. Eckert, Radiation Specialist

Jenifer M. Shannon, Electrical Engineer

n . Ca vert, Reactor ngineer

Electrical Engineering Branch

Division of Reactor Safety

LJ~~-1'1-~

Willfam H. Ruland, Chie

Electrical Engineering Branch

Division of Reactor Safety

01;/~/9--&

Date

.-

TABLE OF CONTENTS

Page Nos.

EXECUTIVE SUMMARY .

1.0

INTRODUCTION

. . . . . . . . . . .

1.1

Special Inspection (IP 92903) .

1.2

Significant Event Response Team

1.3

Overhead Annunciator System ..

1.4

Event Sequence ....... .

. . .

(SERT)

2.0

EVALUATION OF EMERGENCY PREPAREDNESS ....

2.1

Emergency Assessment and Classification ...... .

2.1.1 Evaluation of Event Recognition and Declaration .

.

2.1.2 Evaluation of the Initial Notification of the Loss of

Annunciator Event .............. .

2.1.3 Evaluation of the Appropriateness of the Alert

Declaration ................... .

2.2

Emergency Action Level Scheme ............. .

2.2.1 Appropriateness of the Loss of Annunciator EAL

2.2.2 Discussion of the EAL Scheme With the States of New

Jersey and Delaware ............ .

2.3

Emergency Response Organization (ERO) Augmentation ..

2.3.1 Emergency Operations Facility (EOF) ....... .

2.3.2 Operations Support Center (OSC) ......... .

2.3.3 Technical Support Center (TSC) .......... .

2.3.4 Licensee Emergency Response Organization Notification

Process ..................... .

2.3.5 Emergency Response Organization (ERO) Augmentation

Summary . . . . . . . . . . . . . . . . . . . . . .

2.4

Communications/Reporting/Notifications ........ .

2.4.1 Notification and Reporting of Events ...... .

2.4.2 Communications with Off-Site Response Organizations

3.0

SIGNIFICANT EVENT RESPONSE TEAM (SERT) PERFORMANCE

3.1

Emergency Preparedness Review ...... .

3.2

Human Performance and Procedural Review .. .

3.3

Integration and Usability of Operational Procedures

3.4

Overall SERT ...... .

4.0

TECHNICAL ROOT CAUSE PERFORMANCE

4.1

Causal Factors Analysis ..

4.2

Root Cause ....... .

4.3

Operability Determination and Compensatory Periodic Tests 4.4

Comparison With the 1992 Salem Unit 2 OHA Event

4.5

Generic Implications and Notifications

5.0

MANAGEMENT OVERSIGHT

6.0

OVERALL CONCLUSIONS .

7.0

EXIT MEETING AND TELEPHONE CALLS

ii

i i i

1

1

1

2

3

4

4

4

6

6

7

7

8

10

10

10

10

11

11

12

12

12

13

13

14

14

16

16

16

17

17

18

19

19

20

21

..

EXECUTIVE SUMMARY

Salem Inspection Reports 50-272/95-81; 50-311/95-81

October 5, 1995 - November 13, 1q95

This special inspection evaluated the performance of PSE&G in response to a

30 minute undetected loss of the Salem Unit 1 overhead annunciator (OHA)

system.

The inspection methods involved independent oversight of licensee

activities associated with this event, walkdown of control room OHA equipment,

and interviews with some of the personnel involved in the event.

The

inspectors also examined PSE&G's significant event response team (SERT) and

root cause investigation reports.

The event occurred on October 4, 1995, at 10:35 p.m., when Salem Unit 1

experienced a failure of the OHA system that was not indicated to the

operators.

The operating crew recognized the failure approximately 30 minutes

into the event, and reset the system 30 minutes after recognition.

The reset

was not entirely successful in that certain anomalies were still observed with

the system.

An alert was declared on October 5, 1995, at 1:38 a.m.

Following

another OHA system reset and diagnostic testing performed by the system

engineer that indicated proper functioning, the alert was terminated at

5:22 a.m.

Before, during, and after the event, Salem Unit 1 was shutdown and defueled.

Salem Unit 2 was in cold shutdown.

The OHA systems are identical for Unit 1

and Unit 2.

There were no turnovers in the operating crew during the event.

In the emergency preparedness (EP) area, the inspectors found that although

the conditions for an alert declaration had been exceeded and identified by

the operating crew, the operating crew decided not to declare an alert. The

operating crew initially dispositioned the event as a I-hour report to the

NRC, but the report was not initiated in a timely manner.

The event was

eventually classified as an alert at 1:38 a.m., approximately ~h hours after

the OHA system was observed to be inoperable.

The technical support center

(TSC) activation was not timely, and PSE&G did not meet their emergency plan

staffing requirements for the TSC.

After the event, PSE&G appropriately

revised the emergency action levels (EALs) covering loss of annunciator

events, but failed to discuss and seek agreement with the States prior to the

implementation of the revised EALs.

Three violations based on these emergency

preparedness findings were issued.

Also in the EP area, the inspectors noted that the operations support center

(OSC) activation was timely.

The emergency operations facility (EOF) manning

was a conservative action on the part of PSE&G and was within the level of

discretion provided by their emergency plan.

The licensee's emergency

response organization (ERO) call-out process was weak as shown by, for

example, the fact that a duty call list was not maintained, and a significant

number of those people that were called failed to respond or questioned the

need to respond.

The inspectors also noted that with the exception of the

State of New Jersey, representatives from offsite response organizations found

ii i

  • -*

the communications provided by the licensee to be commendable.

PSE&G's use of

the phrase, "5-minute alert," hindered the understanding of the State of

New Jersey Bureau of Nuclear Engineering (BNE) regarding the event.

Iri the emergency respons1~ equipment area, the inspectc rs found that there was

no indication to operators of major OHA system problems that would prevent

alarm processing. This type of undetected failure also occurred in the

December 1992 Salem Unit 2 OHA event, and therefore this is the second

occurrence of this type of "silent" malfunction without failover to the

backup. This made the equipment unavailable to perform the OHA function, and

did not provide the operators with sufficient information to determine that

unavailability.

One violation based on the above finding was issued.

The inspectors noted that PSE&G's determination of the most likely root cause

was technically sound and was the best fit with the actual indications of

failure. Operator errors in keystrokes and interface switch settings were not

a factor in this event due to effective corrective actions for these areas

taken after the 1992 Salem Unit 2 OHA event.

The OHA operability

determination and the compensatory tests were sufficient for use in the

respective plant modes.

The inspectors observed an overall weakness in the effectiveness of the

engineering, operations, and training organizations to support the plant

operators with a unified set of OHA knowledge, skills, and abilities to

recognize failure indications, to determine operability, and to take proper

corrective action.

In the management oversight area. the inspectors concluded that management was

actively involved in the alert declaration and overall direction of the

failure analysis process .

iv

- *.~.. _,

~

DETAILS

1.0

INTRODUCTION

1.1

Special Inspection (IP 92903)

On October 4, 1995, at 10:35 p.m., Salem Unit 1 experienced an unannunciated

failure of the Overhead Annunciator (OHA) system.

The operating crew

recognized the failure approximately 30 minutes into the event.

The operators

took action to reset the system in another 30 minutes after discovery of the

failure. The reset was not entirely successful in that certain anomalies were

still observed with the system.

An alert was declared on October 5, 1995, at

1:38 a.m.

Following another OHA system reset and diagnostic testing performed

by the system engineer that indicated proper functioning, the alert was

terminated at 5:22 a.m.

Before, during, and after the event, Unit 1 was shutdown and defueled.

The

OHA systems are identical for Unit 1 and Unit 2.

There were no turnovers in

the operating crew during the event.

Following the event, NRC managers determined that a special inspection was

warranted to gather event-related information. A charter was formulated for

the special inspection (Attachment 1). The inspectors began their inspection

on October 5, 1995.

The NRC had previously inspected a Salem Unit 2 OHA event during an Augmented

Inspection Team (AIT) inspection in December 1992, Report Number 50-272/92-81,

50-311/92-81.

The inspection methodology involved independent oversight of licensee

activities associated with this event. The inspectors conducted a detailed

walkdown of the control room and overhead annunciator equipment.

Independent

interviews were conducted with some of the personnel involved in the event, as

well as training and procedure-writing personnel.

The inspectors used

techniques set forth in NUREG/CR-5455, "Human Performance Investigation

Process," for guidance in causal factors identification. The assessments of

the licensee's reports by the inspectors were based upon the findings

described in the SERT and root cause investigation team reports.

The

inspectors supplemented the document review with observation of the activities

of the various teams and interviews of team members.

The licensee reports had

not gone through licensee corrective action review procedures at the time the

inspectors reviewed the reports, so the recommended corrective actions of the

SERT are not covered in this inspection report.

The inspectors compared the

final versions of the reports to the versions used during the inspection and

determined that there were no substantive revisions that-would change the- -

assessments.

1.2

Significant Event Response Team (SERT)

The General Manager-Salem Operations chartered a SERT because the loss of OHA

event involved several significant hardware and personnel performance issues.

The SERT charter required investigation of the.following issues: the failure

mode of the OHA system; the appropriateness of the alert classification; and

the performance of the emergency response organization.

~-

') .

. *.~ . -.:.,;

.

2

The SERT was tasked to investigate the event in accordance with Salem Station

Procedure NC.NA-AP.ZZ-006l(Q), Rev. 6.

The team was composed of

representatives from operations, maintenance, system engineering, emergency

preparedness, independent oversight, and nuclear safety review.

The SERT had

been tasked to determine the sequence of events, determine the root causes,

and provide recommendations for corrective actions.

The SERT consisted of two subteams.

One subteam examined the technical root

cause, while the other subteam retained the SERT title and examined the human

performance and emergency response issues. The final SERT report contained

the results of the two subteam findings.

1.3

Overhead Annunciator System

The OHA system is a high speed distributed data acquisition and display

system.

The purpose of the system is to detect alarm conditions and indicate

these conditions to control room operators. The alarm conditions

are provided to the operator through overhead window lights with audible

alarms, a CRT screen, and a printer.

The system has two sequential event recorders (SER) that receive alarm

information, process .the information, and send it to distributed logic

controllers. Outputs from the distributed logic controllers drive the

overhead windows and alarms.

The two SERs are designed to act as a primary

and hot standby failover pair. The design intent was that when the primary

SER, fails, the hot standby SER takes over control. This failover mechanism

is not active as long as SER-A periodically resets a watchdog timer circuit.

If SER-A fails to reset the watchdog timer, the circuitry is designed to

automatically switches data flow through SER-B.

The CRT is located on the control console and displays all alarm and reset

events along with the current time.

The control console also contains push

buttons that the operator uses to silence, acknowledge and reset the overhead

alarms.

Various internal failures are identified and annunciated by the OHA system.

These alarms are combined into an independent control console group alarm.

The operator panel is installed in the equipment cabinets adjacent to the -

control room.

The panel consists of pushbuttons for obtaining printed reports

containing information related to system status.

The Remote Configuration Workstation (RCWS) is a personal computer that

provides a means to configure the system and is located -in the same area -as -

the operator panel. Access to the keyboard for the RCWS is by administrative

keylock control .

...~ ... -... -

-

~**

., -, . ~-

.:,.: - ~ .. - ; .... *-*;;.;

..

3

The licensee took corrective actions after the 1992 Salem Unit 2 OHA event as

a result of their root cause investigation. These equipment corrective

actions were installed in the OHA of both units and concerned mainly key

access to the computer keyboard, electrical noise susceptibility reduction,

operator initiated independent built-in test of function, and indication of

test failure to operators.

1.4

Event Sequence

Some details of the event are described here to aid in the correlation of the

other sections of the report.

On October 4, 1995, at 10:22 p.m., the breaker for service water bay sump pump

.. "11" was opened as part of a routine surveillance (typically performed

nightly).

OHA system window A-41, "auxiliary alarm system printer," and the

auxiliary alarm system typewriter actuated as expected.

At 10:35 p.m., OHA

window A-21 actuated; this was the last known alarm to have been properly

processed and displayed by the OHA system.

At 11:03 p.m., the "11" service water sump was reported to the control room as

being full, but OHA window B-29, "11-13 service water pump sump area level

high," had not actuated. This was the first indication to the operators that

there was a failure of the OHA system.

The sump pump breaker was closed with

proper acknowledgment from the auxiliary alarm typewriter.

However, OHA

window A-41 failed to annunciate.

At this time, the Nuclear Control Operators

(NCOs) recognized that there was a problem with the OHA system.

At 11:05 p.m., the NCOs initiated an operability check by implementing OHA

system normal Operating Procedure Sl.OP-SO.ANN-OOOl(Q).

Reactor coolant

system valve 1SS104 was cycled with proper acknowledgment from the auxiliary

alarm typewriter.

Once again, OHA window A-41 failed to annunciate. Also, a

lamp check was performed locally on the 104 panel by a Nuclear Equipment

Operator (NEO) with no response from OHA window C-9.

At 11:12 p.m., an OHA lamp and hourly test was performed.

All OHA windows

illuminated and the group alarms worked.

However, no messages of successful

tests were received on the OHA cathode ray tube (CRT) screen.

OHA window A-9,

"annunciator trouble alarm," did not alarm and clear. Also, the "OHA Trouble"

console alarm was not observed to have alarmed.

It was at this point that the

Senior Nuclear Shift Supervisor (SNSS) determined that the OHA system was

inoperable.

At 11:30 p.m., in accordance with "Loss of Overhead Annunciator System,"

Procedure Sl.OP-AB.ANN-0001, the operators reset. SER-A .. After this res.et and

system testing, the operators were confident that the OHA was functioning

properly, but the reset was not entirely successful in that certain anomalies

were still observed with the system.

An alert was declared on

October 5, 1995, at 1:38 a.m.

The system engineer reset the system again and performed diagnostic testing to

show that the OHA system was functioning properly.

The alert was terminated

at 5:22 a.m.

4

2.0

EVALUATION OF EMERGENCY PREPAREDNESS

2.1

Emergency Assessment and Classification

2.1.l Evaluation of Event Recognition and Declaration

The loss of annunciators was not declared as an alert by the SNSS despite

having sufficient information by about 11:12 p.m., to have declared the event.

The failure was identified by licensee management and the alert was

subsequently declared at 1:38 a.m., on October 5, 1995, after management

review of the situation (see Section 2.1.3 for NRC's conclusion on the

appropriateness of the action of declaring an alert at the time it was made).

During the inspectors' interviews with the operating crew, who are senior

reactor operator (SRO) qualified, the inspectors found that both individuals

had reservations about declaring the alert due to its minimal safety

significance. The SROs believed that not declaring the alert was appropriate

because mobilizing large numbers of emergency response personnel would have

posed more of a risk, and that the attention that would have been aroused as a

result of the alert declaration was unwarranted.

The inspectors concluded

that these statements suggest confusion on the part of the on-duty operating

crew with regard to the responsibilities of the licensee and the States

(New Jersey and Delaware).

The inspectors determined that the following factors contributed to the

operating crew's failure to declare an alert in a timely manner.

The Salem Emergency Classification Guide (ECG) was confusing/incomplete

as the SNSS had considered the ECG to be a guide.

The loss of annunciators when in mode 6 (defueled) does not merit an

alert declaration based on the latest guidance by the Nuclear Management

and Resources Council, Inc., NUMARC/NESP-0007, "Methodology for

Development of Emergency Action levels." The NUMARC-NESP/0007 document

has received generic acceptance by NRC in NRC Regulatory Guide 1.101,

Revision 3.

The NUMARC EAL guidance establishes mode specificity to the

EALs which address a loss of annunciator event. Also, per NUMARC/NESP-

0007, loss of annunciators is not an alert unless a transient is in

progress at the same time.

However, this guidance had not yet been

approved for Salem at the time of the event.

The ECG was not used properly because it had not been stressed

adequately that if an EAL initiating criteria is satisfied, the event is

intended to be classified in accordance with the ECG even if the

Emergency Coordinator does not agree with the classification level .

.-

5

The ECG, "Introduction and References" section, Step V.A. requires, in

part, that "The ECG is a guide.

The EALs described in the ECG are not

all inclusive and will not identify each and every cor~'tion, parameter

or event that could lead to an event classification. If the Emergency

Coordinator, using his best judgment, determines an Initiating Condition

has been satisfied but the specific EAL is in question, he/she should

promptly classify the event in accordance with the Initiating Condition.

In any event, if the plant conditions are equivalent to one of the four

emergency classes ... , that classification should be declared."

The ECG was not used because the operating crew had been trained on a

complete revision of the Emergency Action Level (EAL) scheme in which

loss of annunciators when-in mode 6 was not an alert.

The training for the ECG was less than adequate as there were no

terminal learning objectives for the SNSS, Nuclear Shift Supervisor

(NSS), and the Nuclear Shift Technical Advisor (NSTA) in regards to

demonstrating an understanding of licensee and state capabilities and

responsibilities.

Also, the Emergency Preparedness and Radiological Support Manager

informed the inspectors that the licensee did not use watch-standing

senior reactor operators in activities with personnel from off-site

response organizations. Such activities *such as EAL training or

technical liaisons could foster a more complete understanding of roles

and responsibilities of the various response organizations.

The Salem ECG contains EALs and initiating conditions for event

classifications. The ECG, Section lOB, "Loss of Instrumentation/

Annunciation/Communications," required an alert declaration if "Loss of most

or all (>75%) overhead annunciators (excluding a scheduled test or maintenance

activity for which pre-planned compensatory measures have been implemented)

AND 15 minutes have elapsed since the loss of annunciators."

The ECG, "Introduction and References" section, Step V.A. requires, in part,

that "If the Emergency Coordinator, using his best judgment, determines an

Initiating Condition has been satisfied but the specific EAL is in question,

he/she should promptly classify the event in accordance with the Initiating

Condition.

In any event, if the plant conditions are equivalent to one of the

four emergency classes ... , that classification should be declared."

As noted previously, the loss of annunciators was not promptly declared as an

alert_by the SNSS despite having recognized that the EAL initiating criteria

had been satisfied at about 11:12 *p.m.

The alert was subsequently declared at

1:38 a.m., by the SNSS after management had reviewed and discussed the event

with the SNSS (detailed further in Section 2.1.3) .

6

10 CFR 50.54(q) requires, in part, that "A licensee authorized to possess and

operate a nuclear power reactor shall follow and maintain in effect emergency

plans ~.hi~h meet the standards in 50.47(b) and the requirements in Appendix E

of this part." Therefore, the failure to promptly declare the loss of

annunciator event as an alert, in accordance with the licensee's procedures,

was assessed as a violation of 10 CFR 50.54(q) (VIO 50-272/95-81-01).

2.1.2 Evaluation of the Initial Notification of the Loss of Annunciator Event

After the overhead annunciator system had been reset, the SNSS and NSS had

difficulty in deciding whether it was appropriate to declare an alert (per ECG

Section 108) or to provide a 1-hour notification to the NRC (per ECG Section

17D).

The operating crew debated 1) the appropriateness of EAL Section 17D

due to the portion of the EAL that stated "event was not ongoing at the time

of discovery" as this part of the EAL was not satisfied and 2) the

appropriateness of EAL Section 108 as the reactor was defueled and the

overhead annunciator system appeared to be operable after it was reset at

11:30 p.m.

After the operating crew had consulted with emergency preparedness

staff members and management several times by telephone, the operating crew

decided to enter EAL Section 17D at 1:12 a.m., on October 5, 1995, (about 2

hours after recognition of the loss of all annunciators).

ECG Section 17D, "Emergency Conditions Discovered After-The-Fact," requires

that "If discovery of an event/condition that had previously occurred, (event

was not ongoing at the time of discovery) which exceeded an Emergency Action

Level (EAL) but was not declared as an emergency, AND there are currently NO

adverse consequences in progress as a result of the event THEN refer to

Attachment 12 I-hour Report." The licensee has committed in its emergency

plan that this will be performed within 1 hour1.157407e-5 days <br />2.777778e-4 hours <br />1.653439e-6 weeks <br />3.805e-7 months <br /> (per ECG Attachment 12,

Step 3).

The inspectors assessed that this notification to the NRC was not initiated in

the time specified by the ECG.

The enforcement issue pertaining to this

failure is being addressed by the violation for the failure to promptly

declare an alert.

2.1.3 Evaluation of the Appropriateness of the Alert Declaration

The following information was found in the licensee's SERT report:

The Operations Manager arrived in the control room at 1:00 a.m.

The system engineer for the overhead annunciator system arrived in the

control room at 1:06 a~m.

The General Manager-Salem Operations and the Emergency Preparedness and

Radiological Support Manager arrived in the control room at 1:21 a.m .

  • .-.

~

7

Upon arriving in the control room, the licensee managers identified above

discussed event declaration with the SNSS.

Licensee management concluded that

an alert declaration was appropriate and that the 1-ho**r report that had been

completed was not appropriate for the loss of annunciator event. At this

time, the SNSS was confident that the overhead annunciator system was

operable, but the system had not been formally declared as operable as the

system engineer had just started his diagnostic testing of the overhead

annunciator system.

The SNSS declared an alert per ECG Section 108 at

1:38 a.m.

The system engineer completed his diagnostic tests at 5:15 a.m.,

and the alert was terminated at 5:22 a.m.

The General Manager, Salem Operations provided key input into the decision to

declare the alert and stated to an inspector that he remained concerned with

system operability. The inspectors concluded that the alert declaration was

not inappropriate at the time it was made at 1:38 a.m., on October 5, 1995,

because OHA system operability remained unestablished at the time the alert

declaration was made.

2.2

Emergency Action Level Scheme

2.2.1 Appropriateness of the Loss of Annunciator EAL

As noted in Section 2.1.1, the NUMARC/NESP-0007 guidance document establishes

mode specificity to the EALs which address a loss of annunciator event. Also,

per NUMARC/NESP-0007, loss of annunciators is not an alert unless a transient

is in progress at the same time.

At the time of the October 4-5 loss of

annunciator event, the licensee's EALs covering loss of annunciator events

were not yet revised to conform to the NUMARC/NESP-0007 document.

The EALS

were based upon previous NRC guidance contained in NUREG-0654, "Criteria for

Preparation and Evaluation of Radiological Emergency Response Plans and

Preparedness in Support of Nuclear Power Plants."

After the event, the licensee discussed changing the loss of annunciator EALs

with the NRC.

The licensee informed the NRC that its 10 CFR 50.54(q) review

of the planned changes had determined that there would be no loss of

effectiveness of the emergency plan.

The licensee was specifically reminded

by NRC representatives to discuss these EAL changes with the states prior to

.their implementation ..

On Saturday October 7, 1995, the licensee implemented the revised EALs for

loss of annunciator events.

The revision adopted the NUMARC/NESP-0007

guidance.for loss of annunciator events. The inspectors considered that

~evising the EALs was an appropriate action on the part of the licensee.

Section 2.2.2 provides an assessment of the licensee's efforts in seeking

approval from NRC and the efforts in discussing with the states of Delaware

and New Jersey prior to implementation of the revised EALs.

Additionally, the

licensee has submitted a completely revised EAL scheme based upon the

NUMARC/NESP-0007 guidance for the three stations operated by Public Service

Electric and Gas to the NRC for review and approval in accordance with NRC

requirements .

_ ......

~

8

2.2.2 Discussion of the EAL Scheme With the States of New Jersey and Delaware

The licensee implemented the revised loss of annunciatcr EALs on

October 7, 1995.

The licensee neither discussed nor sought agreement with the

states prior to implementing the revised EALs.

The licensee was specifically

reminded by the NRC of the need to discuss the EAL changes and seek agreement

from the states prior to implementation of the revised EALs covering loss of

annunciator events.

During discussions with the New Jersey Bureau of Nuclear Engineering (BNE)

representatives, the inspectors were informed that the licensee informed

New Jersey BNE of the EAL change on October 11, 1995, at about 3:15 p.m.

The

New Jersey BNE representatives conveyed that they were not satisfied with

being informed of the revision to the EAL scheme after-the-fact.

The inspectors also contacted the State of Delaware Emergency Management

Agency (EMA), which had not been informed of the subject EAL revision until

October 20, 1995.

Its representatives conveyed that they had no additional

concerns over the late notification.

During an interview with the licensee's Emergency Preparedness and

Radiological Support Manager, the inspectors were informed that the licensee

had never before discussed or sought agreement with the states when revising

portions of the EAL scheme .

10 CFR 50.54(q) requires, in part, that "A licensee authorized to possess and

operate a nuclear power reactor shall follow and maintain in effect emergency

plans which meet the standards in 50.47(b) and the requirements in Appendix E

of this part."

10 CFR 50, Appendix E, Section IV. "Content of Emergency Plans," Part B.

"Assessment Actions" requires, in part, "The emergency action levels shall be

based on in-plant conditions and instrumentation in addition to on-site and

off-site monitoring.

These emergency action levels shall be discussed and

agreed on by the applicant and state and local governmental authorities and

approved by NRC.

They shall also be reviewed with the state and local

governmental authorities on an annual basis."

In summary, the failure to discuss the EAL revision with the states of New

Jersey and Delaware and acquire their agreement is assessed as a violation of

10 CFR 50.54(q) and 10 CFR 50 Appendix E. (VIO 50-272/95-81-02)

The Jnspectors determined that the factors listed below contributed to the

licensee's failure to discuss and seek agreement from the states prior to -

implementing the revised EALs.

Licensee administrative procedures did not provide adequate direction

for single EAL revisions.

This practice had not been questioned during licensee annual 10 CFR

50.54(t) reviews (audits) .

. *.

~ -~.

9

Prior to the exit meeting, the licensee provided its view regarding the

failure to discuss the revised loss of annunciator EALs with the states prior

to implementation.

The following points were provided by the licensee to the

inspectors:

1)

The pertinent requirement in 10 CFR 50, Appendix E, requires the

emergency action levels to be discussed and agreed on by the applicant

and state and local governmental authorities and approved by NRC.

The

licensee maintained that it interpreted the requirement as applicable to

an applicant for an original NRC license and not an existing licensee.

The inspectors noted that 10 CFR 50.54(q) requires a "licensee" to

follow the requirements in 10 CFR 50, Appendix E .. Therefore, the

licensee must implement Appendix E regardless of whether "applicant" or

"licensee" is used.

2)

Written agreement from the states had been received by the licensee for

the revision of the Hope Creek EAL scheme to implement the

NUMARC/NESP-0007 guidance. This agreement was in the form of an

internal BNE memo provided to PSE&G.

The licensee viewed this agreement

as providing tacit agreement of the revision made to the Salem EALs

covering loss of annunciator events as both changes simply implemented

the NUMARC guidance for loss of annunciator events.

The inspectors

assessment of this point is as follows .

The inspectors found that the Salem specific EAL changes had

neither been discussed nor agreed to prior to implementation.

-

As noted in Section 2.2.1, the licensee was specifically reminded

by NRC personnel to discuss the EAL revisions with the states

prior to implementation.

As no advance notice had been provided by the licensee, the states

had no opportunity to take internal action on the revised EALs

prior to their implementation.

NOTE:

The inspectors consider

another NRC requirement germane to this issue.

10 CFR 50.47(b)(4)

requires that "A standard emergency classification and action

level scheme, the bases of which include facility system and

effluent parameters, is in use by the nuclear facility licensee,

and State and local plans call for reliance on information

provided by facility licensees for determinations of minimum

initial off-site response measures."

In essence, states and local

governments have a vested interest in EALs developed by NRC

licensees.

The inspectors concluded that the licensee had not provided a sufficient basis

to support their contention that a violation of NRC requirements had not

occurred .

/

10

2.3

Emergency Response Organization (ERO) Augmentation

2.3.1 Emergency Operations Facility (EOF)

The activation of the EOF was not required at the alert declaration (only

required when a Site Area Emergency is declared). Certain positions in the

EOF were manned but the facility was not formally activated. The inspectors

considered EOF manning at the alert level to be within the bounds of

discretion provided by the licensee's emergency plan.

No issues concerning

the EOF were noted by the. inspectors.

2.3.2 Operations Support Center (OSC)

The OSC responders were composed of individuals who were working backshift

hours at the time of the event.

The OSC was activated in a timely manner,

after the alert was declared, at 2:13 a.m.

2.3.3 Technical Support Center (TSC)

The licensee's Emergency Plan states "It is estimated that a TSC facility can

be fully activated within about 1 hour1.157407e-5 days <br />2.777778e-4 hours <br />1.653439e-6 weeks <br />3.805e-7 months <br /> following initial notification of

personnel assigned to the TSC.

This estimate is only a target value and may

vary based on initial notifications, travel, and other conditions." The

inspector's discussions with licensee emergency preparedness staff

representatives indicated that experience had demonstrated that the ERO pager

system could be activated within 15 minutes of an event classification. There

were no abnormal conditions at the time that could have understandably delayed

activation.

The alert was declared at 1:38 a.m.

The TSC was activated at 3:30 a.m.

The

TSC was manned to a minimum staffing level at 4:00 a.m.

As such, the

inspectors concluded that the TSC activation was not timely.

The licensee's Emergency Plan, Section 3, "Organization," Part 10.0, "Staffing

Commitments" provides a commitment for minimum staffing in accordance with

Supplement 1 of NUREG-0737, Table 2.

Tables 3.1 and 3.2 of the Emergency

Plan, Section 3, details licensee staffing by position. The licensee has

committed to maintain the ability to.augment its ERO with at least one

electrical engineer and one mechanical engineer within about 1 hour1.157407e-5 days <br />2.777778e-4 hours <br />1.653439e-6 weeks <br />3.805e-7 months <br />.

It took about 2 hours2.314815e-5 days <br />5.555556e-4 hours <br />3.306878e-6 weeks <br />7.61e-7 months <br /> from the alert declaration to fill the TSC mechanical

and electrical engineering positions with fully qualified ERO members.

Other

-positions were not filled within an hour of the alert declaration but they

were not key ERO positions.

In summary, the licensee failed to meet its emergency plan because a qualified

electrical engineer and a qualified mechanical engineer were not available in

the TSC within 1 hour1.157407e-5 days <br />2.777778e-4 hours <br />1.653439e-6 weeks <br />3.805e-7 months <br /> of the alert declaration. This is considered to be a

violation of 10 CFR 50.54(q). (VIO 50-272/95-81-03)

... .. ii *

. ~,.;*.

-

..

11

2.3.4 Licensee Emergency Response Organization Notification Process

A significant number of ERO members assigned emergency notification devices

(pagers) did not respond when the pagers were activated (call-out). The

reasons for this lack of response were varied. This matter was also

identified by the licensee's SERT.

Some 20 to 30 pager holders improperly

called the station (including the control room) to find out if they really had

to respond.

One of the reasons for the lack of response was that it was

common knowledge that an emergency preparedness drill was scheduled for that

day (October 5, 1995).

The alert was declared at 1:38 a.m.

The Information Technologies Operations

Center (ITOC:

a center staffed around-the-clock responsible for pager system

-activation) received direction from the Salem Control Room at 1:45 a.m., to

initiate pager call-out.

ITOC activated the pager system at 1:54 a.m.

"A"

group pager activation started at 1:55 a.m., and "C" group pager activation

was completed at 2:07 a.m. (29 minutes after the alert declaration). The

inspectors assessed that the time to complete the pager activation was lengthy

considering that the TSC was required to be activated within about one hour of

the alert declaration.

A duty call list (a list of rotating primary responders responsible for

maintaining themselves fit-for-duty and within an appropriate travel distance

to the station) is not maintained by the licensee. The licensee emergency

preparedness staff conducted periodic pager tests with the intent to assess

system operability and to monitor its ability to staff the emergency response

facilities in a timely manner.

However, the inspectors noted that the pager

test results had not been annotated with the time it took each pager holder to

travel to the station.

There was no procedural requirement for a pager holder to respond to a call-

out.

Licensee management provided its expectations on this matter in the form

of an instruction that had been disseminated on an approximate annual basis.

The inspectors concluded that the licensee ERO call-out process was weak

because key ERO members failed to respond, ERO members called the s~~tion to

find out if they really had to respond, the time to complete the pager

activation was lengthy, a duty call list was not maintained, pager system

tests were not annotated with the time it took each pager holder to travel to

the station, and there was no procedural requirement for a pager holder to

respond to a call-out.

2.3.5 Emergency Response Organization (ERO) Augmentation Sunvnary

The OSC activation was appropriate and timely.

The TSC activation was not

timely and some key positions were filled late. Licensee management stated at

the exit meeting that the emergency preparedness issues discussed in this part

of the report would be resolved prior to restart .

-*

12

2.4

Comnunications/Reporting/Notifications

2.4.1 Notification and Reporting of Events

At about 12:20 a.m., the Plant Manager notified the NRC Senior Resident

Inspector that Salem Unit 1 was preparing to declare an alert due to the loss

of annunciators. Another call was made to the NRC Senior Resident Inspector

at 1:07 a.m., that a 1-hour report would be made to the NRC Operations Center.

The licensee's 1-hour report (see Section 2.1.2) was initiated (late) at

1:12 a.m., with the NRC Operations Center.

The alert was declared formally at 1:38 a.m.

The Initial Contact Message Form

was approved by the Emergency Coordinator at 1:41 a.m.

The licensee's

communications log indicated that the Delaware State Police were notified at

1:47 a.m., the New Jersey Office of Emergency Management (OEM) at 1:51 a.m.,

the Lower Alloways Creek Township at 2:03 a.m. (a courtesy call to be provided

within 30 minutes), and the NRC Operations Center at 2:14 a.m.

The inspectors

confirmed that the New Jersey OEM had been informed as required.

In summary,

once the event was declared as an alert, notifications to off-site response

organizations were initiated in the times specified by the emergency plan.

2.4.2 Comnunications with Off-Site Response Organizations

The inspectors found that PSE&G had informed New Jersey BNE that a "five

minute alert" (the event would be declared and terminated after 5 minutes)

would be declared.

The inspectors noted that this terminology ("five minute

alert") was not described in the licensee's emergency plan.

The inspectors

concluded that such terminology should be addressed in the emergency plan and

agreed upon by the state and local governmental authorities so that confusion

will be minimized and response from NRC and/or off-site response organizations

to an event will more likely be appropriate to the situation.

The inspectors reviewed whether PSE&G had difficulty establishing

communication with New Jersey BNE and whether PSE&G had complied with its

emergency plan in regards to communications.

1)

The inspectors verified that the New Jersey BNE received all station

status checklists generated during the emergency.

Those checklists were

sparse on details concerning the OHA system, but provided the essential

parameters concerning plant status, which included that the unit was

defueled with the spent fuel pool temperature steady at 83°F.

2)

The inspectors verified that the licensee had successfully transmitted

Emergency Response Data System (ERDS) data to the NRC .

3)

13

The inspectors reviewed whether the licensee had met its obligation in

regard to staffing the NETS line (a dedicated and controlled telephone

exchange).

A New Jersey BNE representative reported to ~he inspectors

that they had initiated two calls over the NETS line tha~ were not

answered by the licensee. At the time of the inspection, New Jersey BNE

representatives were not able to provide the time they made these calls.

Further discussion of this matter with the licensee's Emergency

Preparedness and Radiological Support Manager indicated that one of the

calls may have been to the NETS line in the TSC NRC office, which was

not occupied at the time.

The inspectors verified that the licensee's communications-related ERO

positions were filled within 1 hour1.157407e-5 days <br />2.777778e-4 hours <br />1.653439e-6 weeks <br />3.805e-7 months <br /> of the alert declaration. A review of

licensee records revealed that the TSC communicators arrived at 2:01 and

2:37 a.m., and that by 2:35 a.m., three communicators had arrived at the EOF.

The Emergency Preparedness liaison position was filled at 2:51 a.m.

TSC

management did not activate the TSC until 3:30 a.m.

More timely TSC

activation may have provided better management oversight and clearer

communications with outside response organizations.

The inspectors concluded that PSE&G use of the phrase "five minute alert"

hindered the understanding of the New Jersey BNE regarding the event. While

the TSC communicators arrived quickly, untimely activation of the TSC may have

impacted the quality of information between the licensee and the New Jersey

BNE because TSC management was not present within the facility in a timely

manner.

More timely TSC activation may have provided better management

oversight and clearer communications with outside response organizations.

The inspectors contacted representatives from the Salem County Office of

Emergency Services, Cumberland County Office of Emergency Management, New

Jersey Office of Emergency Management, and the Delaware Emergency Management

Agency.

In each case, these representatives conveyed that they found the

communications provided by the licensee as commendable.

The licensee's

Pmergency preparedness staff had provided these courtesy calls in addition to

the required communications.

3.0

SIGNIFICANT EVENT RESPONSE TEAM (SERT) PERFORMANCE

3.1

Emergency Preparedness Review

In the emergency preparedness area, the SERT identified the key issues

relating to the alert declaration, TSC activation, and staffing. The

inspectors also agreed with the SERT's contributing factors to these key

issues.

The inspectors identified a key difference with the SERT that pertained to the

issue of the failure to discuss and seek approval from the State of New Jersey

prior to implementing changes to the Salem EAL scheme, which is discussed in

Section 2.2.2 of this report.

The SERT did not identify this matter as an

issue because their interpretation of the pertinent NRC requirements was

fundamentally different .

.. _,

14

r

3.2

Human Performance and Procedural Review

The SER1 ::sus was primarily on emergency response issues, but the SERT

investigated human performance and procedural issues in other areas.

The

inspectors observed instances when SERT investigations could have been more

thorough.

For example, one particular procedural issue identified by the SERT

was that the abnormal OHA procedure for Unit 1 was different than the

procedure for Unit 2.

Upon finding this discrepancy, the SERT identified this

as a broken barrier, but looked no further to determine its cause.

The

inspectors found evidence of a procedural change request associated with both

units. The change request was implemented on Unit 2, but not on Unit 1. This

evidence suggested a revision control problem that could have been explored in

more detail.

The SERT performed a detailed analysis of the corrective actions from the

Salem Unit 2 loss of OHA event in 1992.

The SERT aggressively identified

missed opportunities to prevent the recent event by failure to implement

certain recommended corrective actions.

The SERT performed a broad-based comparison of this event with the Hope Creek

shutdown cooling event and other industry-related annunciator events.

The

inspectors noted that this comparison showed good use of operational

experience to extract technical and organizational insights.

3.3

Integration and Usability of Operational Procedures

The SERT found that the operational procedures for the OHA were inadequate. A

particular weakness observed by the inspectors was the apparent collective

inefficacy of the engineering, procedure writing, and training organizations

to support the plant operations staff with an integrated, consistent, fully

usable tool kit for the determination of OHA availability. The individual

procedures for the OHA by the organizations were generally very good, but the

collective set did not fully meet the usability needs of the operators.

The operators used two procedures to diagnose and correct the loss of OHA

during the event and the Event Classification Guide to determine emergency

response actions. The first procedure, Sl.OP-SO.ANN-001, Rev. 7, was the

Normal Operating Overhead Annunciators Procedure. The**operators primarily

used only two sections from this procedure during the event: (1) Testing the

Operability of the OHA System and (2) Resetting the OHA System.

The second

procedure, Sl.OP-AB.ANN-0001, Rev. 4, was the Abnormal Procedure for Overhead

Annunciator Operation.

Examples of the lack of integration and operator usability.reflected in the

procedures are summarized as follows:

There was inadequate consideration of the 15 minute emergency

classification requirement in the ECG in the development of the Normal

and Abnormal Operating Procedures.

The steps required by these

procedures to determine system operability and restore operability took

longer than 15 minutes for the crew to perform .

,,

15

Engineering and training relied heavily on symptoms of failure from the

1992 event.

Operators were not specifically trained on other

indications of systeLl failure.

One example of th,s was that the

indications of an incomplete failover from SER A to SER B were not

specifically covered.

In this event, the operators did not recognize

that the failover did not occur. Another example is that the operators

expected the clock on the CRT to stop if the system was inoperable.

In

this event the clock continued to update.

Operators were observing

inconsistent indications of system operability.

A trouble annunciator was installed after 1992 event to alert operators

to OHA failure.

When this panel annunciator is actuated, overhead

annunciator window A-9 is illuminated and an audible alarm sounds.

This

window was not actuated during the event.

Engineering intent was not correctly translated into operating

procedures.

Upon loss of OHA, engineering intended the operators to

switch manually from SER-A to SER-B.

However, the way the procedures

were written, the operators were trained to reset the system rather than

to manually switch the SERs.

Procedural tests were too long and cumbersome for operators to quickly

determine system status, so operators sometimes relied on the "Lamp

Test" to determine system status rather than use procedural methods .

The abnormal procedure revision process was not adequately controlled .

Salem Unit 1 Abnormal Procedure directed operators to reset the OHA

system upon any indication of loss of OHA.

The operators were not clear

on when to use the abnormal procedure.

Salem Unit 2 Abnormal Procedure

directed operators to test first for operability and then reset only if

required, which was different than Unit 1.

During the event, SER-A failed but due to the nature of the failure, the

OHA system did not failover to SER-B as designed.

When the system does

failover, there is indication to the operating crew by the extinguishing

of all previously lit window boxes, then a reflash followed by a

restoration of all alarmed windows (repainting). Although this

indication was known by the engineering staff and included in the

discussion section of the normal operating procedure, operators were not

specifically trained to look for this indication. Since the operators

were not trained to look for a failover, they did not realize that the

system had not failed over to SER-B.

Had the operators been trained

more aggressively on system operations and indications, they may have

realized that the system failed to successfully failover and then taken

action to manually switch control from SER-A to SER-B as engineering

intended for this particular situation .

~

16

'

3.4

Overall SERT

The inspectors considered the licensee efforts in highlighting emergency

preparedness problems to be commendable, even though the inspectors did not

reach the same conclusions in all cases.

The inspectors agreed, in general, with the conclusions and recommendations

for human performance and procedural issues.

In some cases, recommendations

were not clearly stated. Problem areas were identified, but specific

corrective actions were not described. A particular weakness observed by the

inspectors was the apparent collective inefficacy of the engineering,

procedure writing, ~nd training organizations to support the plant operations

staff with an integrated, consistent, fully usable tool kit for the

determination of OHA availability.

4.0

TECHNICAL ROOT CAUSE PERFORMANCE

4.1

Causal Factors Analysis

The licensee postulated failure modes that were developed into scenarios,

which were then examined for the most likely failure mechanism.

Causal

factors analyses were performed for four human activity scenarios, thirteen

hardware failure scenarios, and fifteen software failure scenarios.

The human activity scenarios covered such items as operator error, welding

activity in the area, and inadvertent grounding of the field contact power.

The licensee concluded that human activity was not a likely initiator of the

event.

The hardware failure scenarios analyzed areas such as induced electrical

noise, power supply problems, failover switching, and data line switching.

The licensee concluded that there were no likely hardware scenarios analyzed

that would cause the system failure indications. However, induced noise and

some type of input power spike affecting the SER circuits were not ruled out

as likely initiators that could cause the hardware to induce a software fault.

The software failure scenarios covered software faults in all major programs.

The three most likely scenarios centered on SER A in the scanner

acknowledgement, memory, task error, and task failure areas.

The event-expand

task,, which processes the data from the scanners was singled out as the most

likely software fault candidate that matched the lack of failover and other

indications. The vendor analysis indicated that the event-expand task

probably exited or was aborted due to an undetermined cause.

The licensee

conducted independent tests at the vendor facility that confirmed the results

of the event-expand task analyses.

The licensee also conducted a design

review of the vendor software by examining the system architecture and the

software code with the software developers. This review uncovered a design

flaw that concerned the placement of the watchdog timer refresh commands in

the task structure, which could prevent failover under certain conditions .

l

J

.. ..

(~

,.

17

f e

The inspectors' evaluation was that the causal analysis failure scenarios were

accurate, detailed, reasonable and thorough.

Detailed statements of

supporting evidence and refuting evidence supported each conclusion.

4.2

Root Cause

The root cause subteam determined that the failure was most 1ike1 y caused by

the undetected premature exit of a single software task in SER A, which halted

the processing of alarms to the OHA window boxes.

The exact initiating event

was not determined, but some combination of possible degradation of the SER

power supplies and power line instability was implicated.

The determination

of the most likely root cause was technically sound and was the best fit with

the actual indications of failure.

According to 10 CFR 50.47(b)(8), it is a requirement that adequate emergency

facilities and equipment to support the response be provided and maintained.

The OHA system is part of the emergency equipment to support the response and

can alert operators to abnormal conditions, but operators must be able to

determine whether the OHA system is available to implement the ECG.

The root

cause for this event was an OHA software task fault that halted the processing

of alarms with no indication, alarm, or annunciation to the plant operator.

This is the second occurrence of this type of "silent" malfunction without

failover to the backup, which made the equipment unavailable to perform the

OHA function, and did not provide the operators with sufficient information to

determine that unavailability. Therefore, the licensee provided inadequate

emergency equipment to support emergency response, which is a violation of 10

CFR 50.47(b)(8) (VIO 50-272/95-81-04, 50-311/95-81-04).

4.3

Operability Determination and Compensatory Periodic Tests

The inspectors reviewed the document, "Operability of Overhead Annunciator Due

To a Silent Fault," Number SES95-441, dated November 7, 1995.

Salem Unit 1

was defueled and Unit 2 was in mode 5, with plans to transition to mode 6 and

defueling at a later date. The document included design basis requirements,

analysis/assessment, specific operability requirements, and

conclusions/actions.

The licensee conclusion was that the OHA systems were

operable, but in a degraded mode.

The follow-up assessment of operability,

after corrective action, was to be tracked by the licensee as a Unit startup

issue.

Compensatory manual tests were performed, in addition to the 8-hour and weekly

tests, by the operators on a periodic basis. The licensee stated that

compensatory tests should detect the known software failure mechanisms that

could result in non-operability. A test performed every 30 minutes used an

alarmed cabinet door as an input alarm condition.

The licensee stated that

this test should check the ability of OHA to process data through a scanner,

through the SER A, and display the alarm/return to normal state on the

appropriate window, CRT, and printer. Another test performed every 4 hours4.62963e-5 days <br />0.00111 hours <br />6.613757e-6 weeks <br />1.522e-6 months <br />,

called the lamp test, should test all the window lamps and inject 11 known

alarms into three scanners.

The lamp test should verify the same conditions

as the alarmed door test with the additional test of two scanners.

..._ ........ -

...

'

18

The inspectors noted that the operability determination covered the possible

loss of core cooling in Unit 2, and the possible radiological alarms in the

spent fuel pool area of Unit 1.

The finding of the root cause subteam that

there was a known class of software faults that could go undetected was also

factored into the analysis.

The technical analysis of the ability of the

compensatory tests to detect the known class of software faults was very good.

The inspectors concluded that the operability determination, with the

compensatory tests specified, was sufficient for use in the respective plant

modes.

4.4

Comparison With the 1992 Salem Unit 2 OHA Event

The licensee performed a detailed comparison of the 1992 Salem Unit 2 OHA

event with this 1995 Salem Unit 1 OHA event.

The 1992 event was centered

around operator errors with manual keyboard strokes and interface switch

settings, which in conjunction with a software download task, suspended

certain other SER A tasks. The vendor software design was such that with the

unexpected switch positions and keyboard errors, the download task was waiting

for data that never came.

One of the tasks that was not suspended was the

update of the watchdog timer, which in turn did not allow the transfer of

alarm processing to SER B.

The result was that the OHA did not process alarms

for 90 minutes, until discovered by an alert operator.

The OHA system did not

indicate that the alarm processing had stopped.

For the 1995 event, the root cause subteam ruled out operator errors

concerning keyboard strokes and interface switch positions as an initiator.

After the 1992 event, the keyboard was placed in a locked compartment and had

administrative controls for access to the key.

The keys were not checked out

before or during the 1995 event. Also after the 1992 event, the manual

interface switch function was performed by an electrically interlocked switch

that aligned the data paths correctly.

In the 1992 event, the operators did not follow procedures, which allowed some

troubleshooting data to be lost.

In the 1995 event, the operators followed

procedures, which allowed them to reset the SERs; the reset caused some data

to be lost. However, the system capability for storing historical data was

used in the diagnosis of the software fault.

The print-outs provided a record

of missed data items that could be interpreted for correct or incorrect

operation. Special data printers added after the 1992 event to collect a

reduced set of error codes and SER status were not effective in providing

diagnostic data for the class of software faults suspected.

The licensee concluded that a contributing factor to the 1995 event was that

the apparent scope of corrective actions after the 1992 event was too narrow.

Those--actions were aimed at preventing only the set of errors that were

present and detectable. The emphasis was not on finding the adequacy of how

the system detected possible faulted conditions and indicated or took action

in response to those faults to prevent failure of alarm processing.

A

deterrent effect to widening the scope was that the vendor, as determined by

the licensee, had no design specifications for the software and relied on the

collective memory of the software designers through commented code for the

19

architectural details. The licensee determined that the vendor software

design process, procedures, and documentation were less than adequate, which

also hampered the effectiveness of corrective actions from the 1992 event.

The main similarity between the 1992 and 1995 events was in the inability of

the design to detect a certain class of software faults that would cause a

failure of the OHA to process valid alarms.

In the 1992 event, the operator

error sequence initiated the suspension of a task that was not detected and

failover did not occur.

In the 1995 event, a suspected power supply problem

was a strongly implicated initiator for the suspension of a different task

that was not detected and for which no failover occurred.

In both cases, the

particular software fault was not detected or indicated.

4.5

Generic Implications and Notifications

Every vendor annunciator system of the type used in the Salem units may be

vulnerable to the type of silent failure experienced.

Those systems that

incorporate the same system architecture as Salem plants, the redundant

failover pair, may be the most vulnerable to loss of annunciator function.

For those systems that have a redundant train architecture, each train may be

vulnerable to the same type of silent failure, but the vulnerability to

complete loss of annunciator function is less than the redundant failover pair

architecture.

The NRC residents at licensee sites with the basic vendor equipment,

regardless of system architecture, were notified by Region I.

The sites were

Kewaunee, Sequoyah, and Pilgrim.

The engineering staff at PSE&G notified the

industry by issuing Operating Experience Information Notice Number 7575 on

November 15, 1995.

The vendor, Hathaway Process Instrumentation, issued

Quality Alert, Issue Number 116, on November 6, 1995, to all customers who

purchased similar equipment.

5.0

MANAGEMENT OVERSIGHT

The management provided tight oversight of the decision to declare the alert.

The management chartered the SERT team.

The SERT team leader was a ma11ager,

and two managers were overseeing the root cause team.

The SERT process and

report was reviewed by management.

The management provided outside failure

analysis consultants to both the root cause and SERT subteams.

The inspectors

inferred from these actions that management was actively involved in the alert

declaration and overall direction of the failure analysis process.

6.0

OVERALL CONCLUSIONS

- Although the conditions for an alert declaration had been exceeded and

identified by the operating crew, the operating crew decided not to

declare an alert. This was assessed to be a violation of

10 CFR 50.54(q) (VIO 50-272/95-81-01).

20

The licensee's failure to discuss and seek agreement with the States

. prior to the implementation of the revised EALs covering loss of

annunciator events was assessed to be a violation of IO CFR 50.54(q) and

IO CFR 50 Appendix E (VIO 50-272/95-81-02).

TSC activation was not timely.

The licensee did not meet the emergency

plan staffing requirements for the TSC.

This was assessed to be a

violation of IO CFR 50.54(q) (VIO 50-272/95-81-03).

There was no indication to operators of major system problems that would

prevent alarm processing. This type of failure also occurred in the

I992 Salem Unit 2 OHA event. This is the second occurrence of this type

of "silent" malfunction without failover to the backup, which made the

equipment unavailable to perform the OHA function, and did not provide

the operators with sufficient information to determine that

unavailability.

This a violation of IO CFR 50.47(b)(8) because the

licensee provided inadequate emergency equipment to support emergency

response in that the declaration of an alert is based on the

availability of the OHA system (VIO 50-272/95-81-04, 50-311/95-01-04).

The event was initially dispositioned by the operating crew as a I-hour

report. This I-hour report was not initiated in a timely manner.

OSC activation was timely.

EOF manning was a conservative act on the

part of the licensee and was within the level of discretion provided by

the NRC-approved emergency plan.

The licensee's action to change the loss of annunciator EALs to be mode

specific was appropriate based on NRC's generic acceptance of the

NUMARC/NESP-0007 EAL guidance document.

The licensee's ERO ca 11-out process was weak.

The 1-hour report and the alert were not made ir; a timely manner;

however, the subsequent reporting requirements were carried out in a

timely manner.

The informal communications hindered understanding of the event for the

State of New Jersey Bureau of Nuclear Engineering (BNE).

Otherwise,

representatives from other offsite response organizations found the

communications provided by the licensee to be commendable.

The determination of the most likely root cause was technically sound

and was the best fit with the actual indications of failure.

The operability determination and the compensatory tests were sufficient

for use in the respective plant modes.

..

\\_.

..

7.0

21

There was an overall weakness observed in the effectiveness of the

engineering, operations, and training organizations to support the plant

operators with a unified set of OHA knowledge, skills, a~~ abilities to

recognize failure indications, to determine operability, and to take

proper corrective action.

Operator errors in keystrokes and interface switch settings were not a

factor in this event due to effective corrective actions for these areas

taken after the 1992 Salem Unit 2 OHA event.

Management was actively involved in the alert declaration and overall

direction of the failure analysis process.

EXIT MEETING AND TELEPHONE CALLS

An exit meeting was held on November 13, 1995, with the PSE&G personnel as

noted in Attachment 2 to summarize the scope and findings of their inspection

activities. The licensee acknowledged the inspection findings and also had

some comments beforehand as discussed in Section 2.2.2 of this report.

The

inspectors neither received or reviewed any proprietary material during the

inspection.

A telephone conference was held on November 17, 1995, with

Messrs. Munzenmaier, Villar, and Banner to resolve the comments from the exit

meeting.

The Regional State Liaison Officer called the New Jersey BNE on

January 30, 1996, to confirm the accuracy of the references to New Jersey BNE

contained in the report. During that call, the New Jersey BNE representative

stated that the licensee's station status checklists (see Section 2.4.2) did

not contain sufficient information to serve as a basis for de-escalation of

State resources allocated to respond to the loss of OHA event.

Attachments:

1.

Special Reactive Inspection Charter for the Salem 1 Alert on

October 5, 1995

2.

Exit Meeting Attendees

-.

-~

ATTACHMENT 1

CHARTER

S?E2I~l REACTIVE INSPECTION OF THE SALEM 1 ALERT ON OCTOBER 5, 1995

I.

Overhead Annunciator Performance

Independently evaluate licensee's root cause(s) of loss of

annunciation.

Assess any previous opportunities to detect and correct problem.

Evaluate adequacy of licensee's current corrective actions.

Evaluate any commonality with the previous Unit 2 loss of

annunciator event.

Evaluate initial actions with overhead annunciator system to

preserve data for future troubleshooting.

II.

Emergency Preparedness

Evaluate operator actions and decisions relating to implementing

the Salem Emergency Classification Guide (ECG).

Assess adequacy of emergency plan to address the situation.

Evaluate implementation of emergency plan, including TSC staffing.

. 4 * *

ATTACHMENT 2

EXIT MEETING ATTENDEES

public Service Electric and Gas

C. Bakken

M. Bursztein

N. Coni cell a

T. DiGuiseppi

C. Fricker

C. Munzenmaier

P. Noeller

L. Raj kows k i

M. Renchek

L. Storz

E. Villar

C. Waite

C. Warren

Atlantic Enerqv

J. Lazzara

Delmarva Power

P. Duca

Manager, Salem Operations

Manager, Nuclear Electrical Engineering

Manager, Salem Restart

Manager, Radiation Safety

Supervisor, Salem Plant Assessment

General Manager, Nuclear Operations Services

Licensing

Manager, Salem Main~enance, Controls

Manager, Salem System Engineering

Senior Vice President, Nuclear Operations

Licensing Engineer

Supervisor, Digital Systems Group

General Manager, Salem Operations

Site Representative

Site Representative

Philadelphia Electric Company

R. Kankus

Senior Strategic Planning Specialist, Joint Owners

Alliance

Delaware Emergency Management Agency

K. Kil te

Radiological Emergency Preparedness Specialist

New Jersey Department of Environmental Protection

P. Gardner

T. Kolesnik

J. Li pot i

P. Mulligan

Research Scientist

Nuclear Engineer

Assistant Director

Radiation Physicist

U. S. Nuclear Regulatory Commission

C. Marschall

L. Nicholson

W. Ruland

Senior Resident Inspector

Branch Chief, Division of Reactor Projects

Branch Chief, Division of Reactor Safety

--~.