ML18100A540

From kanterella
Jump to navigation Jump to search
Insp Repts 50-272/93-81 & 50-311/93-81 on 930606-28.No Violations Noted.Major Areas Inspected:Areas Necessary to Ascertain Facts & Determine Probable Causes of Numerous Rod Control Sys Electronic Component Failures
ML18100A540
Person / Time
Site: Salem  PSEG icon.png
Issue date: 08/06/1993
From: Durr J, Ruland W
NRC OFFICE OF INSPECTION & ENFORCEMENT (IE REGION I)
To:
Shared Package
ML18100A539 List:
References
50-272-93-81, 50-311-93-81, NUDOCS 9308170041
Download: ML18100A540 (64)


See also: IR 05000272/1993081

Text

U.S. NUCLEAR REGULATORY COMMISSION

REGION I

REPORT/DOCKET NOS.

50-272/93-81

50-311193-81

LICENSE NOS.

LICENSEE:

FACILITY:

INSPECTION DATES:

DPR-70

DPR-75

Public Service Electric and Gas Company

P.O. Box 236

Hancocks Bridge, New Jersey 08038

Salem Nuclear Generating Station

June 6-28, 1993

INSPECTORS:

Stephen Barr, Resident Inspector, Salem, DRP (Asst. Team Leader)

M. Eugene Laz3.rowitz, Reactor Engineer, DRS

TEAM LEADER:

APPROVED BY:

Hukam Garg, Sr. Electrical Engineer, NRR/IDCB

Larry Scholl, Reactor Engineer, DRP

W. H. Ruland, Chief, Electrical Section,

Engineering Branch, DRS

Jacque P. Durr, Chief, Engineering

Branch, Division of Reactor Safety

9308170041 930811

PDR

ADOCK 05000272

G

PD.R

' DAte

I

Date

2

Areas Inspected: An Augmented Inspection Team (AIT), consisting of personnel from

Region I and NRR, inspected those areas necessary to ascertain the facts and determine

probable causes of numerous rod control system electronic component failures that occurred

in May and June 1993. The team assessed the safety significance of potential inadvertent

outward rod motion that occurred as the result of a single component failure. The adequacy

of the licensee's maintenance and troubleshooting practices relative to the rod control system

was reviewed. The decision process concerning attempted plant startups following the

failures and repairs was evaluated as well as the possibility for any potential generic

implications posed by the Salem rod control problems.

Results: See Executive Summary

TABLE OF CONTENTS

EXECUTIVE SUMMARY ...................................... 3

1.0

INTRODUCTION ....................................... 4

1.1

Rod Control System Failure Overview . . . . . . . . . . . . . . . . . . . . . . 4

1.2

Augmented Inspection Team (AIT) Formation ... ~ . . . . . . . . . . . . . . 4

1.3

Rod Control System Description . . . . . . . . . . . . . . . . . . . . . . . . . . 5

2.0

GENERAL SEQUENCE OF EVENTS . . . . . . . . . . . . . . . . . . . . . . . . . . 6

3.0

ROD CONTROL SYSTEM CORRECTIVE

MAINTENANCE/MODIFICATIONS . . . . . . . . . . . . . . . . . . . . . . . . . . . 8

4.0

EVALUATION OF PREVENTIVE MAINTENANCE ACTIVITIES . . . . . . . .

8

4.1

Vendor Work Scope. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8

4.2

Work Accomplished ............................ *. . . . . . 9

4. 3

PSE&G Oversight of Vendor Work . . . . . . . . . . . . . . . . . . . . . . .

10

4.4

Independent NRC Inspections ............... .' . . . . . . . . . . .

11

4.5

Conclusions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

12

5.0

EVALUATION OF TROUBLESHOOTING ACTIVITIES

. . . . . . . . . . . . .

12

5.1

Background . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. . . . . . . .

12

5.2

PSE&G Activities Prior to AIT Arrival . . . . . . . . . . . . . . . . . . . . .

13

5. 3

Actions After AIT Arrival . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

15

5.3.1 Component Failure Analysis . . . . . . . . . . . . . . . . . . . . . . . .

16

5.3.2 PSE&G Conclusions ........... ~ . . . . . . . . . . . . . . . . .

16

5. 3. 3 Corrective Actions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

16

5. 3 .4 AIT Assessment . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

17

5.3.5 General Observations . . . . . . . . . . . . . . . . . . . . . . . . . . . .

18

5 .4

Summary of Component Failures . . . . . . . . . . . . . . . . . . . . . . . . .

18

6.0

RESTART DECISION PROCESSES . . . . . . . . . . . . . . . . . . . . . . . . . . .

18

6.1

Description of PSE&G Actions . . . . . . . . . . . . . . . . . . . . . . . . . .

18

6.2

Assessment . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

20

6.3 * Conclusions ......... '. . . . . . . . . . . . . . . . . . . . . . . . . . . . .

21

7.0

ROOT CAUSE POLICY AND PROCEDURES . . . . . . . . . . . . . . . . . . . .

21

8.0

COMPENSATORY MEASURES/JCO.......................... 22

i

9.0

SAFETY SIGNIFICANCE . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

24

10.0

CAUSES OF ROD CONTROL SYSTEM PROBLEMS . . . . . . . . . . . . . . .

25

10.1

HARDWARE PROBLEMS . . . . . . . . . . . . . . . . . . . . . . . . . . . .

25

10.1.1 Multiple Integrated Circuit and Output Transistor Failures .....

10 .1. 2 Regulation Circuit Board Short Circuits . . . . . . . . . . . . . . . .

10.1.3 I/O AC Amplifier, I/O Receiver and Slave Cycler Logic Card

25

25

Failures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

26

10.2

10.3

10.1.4 Q9 Transistor Failures ................... *. . . . . . . .

26

10.1.5 Failure Detector Card Resistor . . . . . . . . . . . . . . . . . . . . . .

26

ROD CONTROL SYSTEM DESIGN . . . . . . . . . . . . . . . . . . . . . .

27

10.2.1 Single Failure Vulnerability . . . . . . . . . . . . . . . . . . . . . . . .

27

10.2.2 Voltage Suppression Circuit . . . . . . . . . . . . . . . . . . . . . . .

27

10.2.3 Group Step Counter Compatibility

. . . . . . . . . . . . . . . . . . .

28

LICENSEE POLICY AND PERFORMANCE . . . . . . . . . . . . . . . . .

28

11.0

POTENTIAL GENERIC IMPLICATIONS . . . . . . . . . . . . . . . . . . . . . . .

28

12.0

OTHER FINDINGS/OBSERVATIONS . . . . . . . . . . . . . . . . . . . . . . . . .

29

13.0

EXIT MEETING . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

30

ATIACHMENT 1 - AIT Charter

ATTACHMENT 2 - Rod Control System Block Diagram

ATIACHMENT 3 - Sequence of Events

A TI ACHMENT 4 - Circuit Card Locations Experiencing Multiple Failures

ATIACHMENT 5 - List of Failures/Cause

A TI ACHMENT 6 - Exit Meeting Slides

ATIACHMENT 7 - Exit Meeting Attendees

ATIACHMENT 8 - Persons Contacted

ATIACHMENT 9 - Acronyms and Initialisms

ii

EXECUTIVESUMJ.\\fARY

NRC established an Augmented Inspection Team (AIT) on June 5, 1993, after Salem Unit 2

experienced numerous component failures of the rod control system (RCS). The team's

charter required detailed fact-finding, identification of root causes and potential generic

issues, and review of Public Service Electric and Gas (PSE&G) restart decisions.

The team found several causes of the RCS failures. The predominant cause of the failures

was that the solid state components were subjected to high voltage spikes produced by the

group step counters. During the outage, several counters were replaced during preventive

maintenance work. These counters had different electrical characteristics and resulted in the

generation of a higher reverse electromotive force (EMF) than the original counters.

Coincident with the higher reverse EMF, a surge suppression circuit was most likely lost and

permitted high voltage spikes to stress electrically and ultimately fail solid-state components

in the rod control system logic cabinet circuits.

The rod drop event, which occurred on May 28, 1993, was the result of short circuits on a

printed circuit card due to slivers of solder bridging adjacent circuit traces. Additional

component failures were the result of poor work practices during system troubleshooting and

testing that introduced momentary short circuits in the system. One failure was the result of

a spare circuit card that had an incorrect resistor installed.

PSE&G did not provide for the programmatic determination of a root cause for the RCS

problems. Had such a determination been performed, the repetitive startups and shutdowns

of Unit 2 might have been avoided. This weakness contributed to the troubleshooting effort

lacking clear leadership and delegation of responsibilities. The troubleshooting effort focused

too narrowly on identifying failed components and did not put adequate emphasis on finding

and correcting the root cause of the failures. For example, testing conducted by PSE&G,

after the AIT arrived, showed that. one of their original root cause.s was in error. However,

the team found that, for each of the five startups, the licensee satisfied all procedural and

technical specification (TS) requirements prior to initiating a Unit 2 startup, and that

operators conservatively and properly responded to the RCS problems. Licensee

management responded to, and devoted adequate resources to, each individual RCS event,

developed and resolved the apparent causes for each event, and required the testing of the

RCS before authorizing each startup.

The inspection also revealed that the rod control system is vulnerabl.e to a single component

failure that could result in inadvertent outward control rod motion when inward motion is

selected. Although the reactor protection system is independent of the rod control system

logic and, therefore, the scram function is not compromised,. there remains a concern that a

previously unanticipated single-failure mechanism may exist in the control system that can

initiate or aggravate reactivity excursions and result in fuel failure. On June 21, 1993, the

NRC issued Generic Letter 93-04, "Rod Control System Failure and Withdrawal of Rod

Control Cluster Assemblies," to notify all licensees of the problem and to request information

describing their plant specific findings and actions taken related to this issue.

DETAILS

1.0

INTRODUCTION

1.1

Rod Control System Failure Overview

Salem Unit 2 experienced multiple failures of the rod control system (RCS) during startup

following the cycle 7 refueling outage. Five plant startups were attempted with the last one

on June 3 being successful. However, on the first three attempts the RCS failed to move

correctly and maintain the control rods in the desired position. During the fourth startup, the

rod insertion limit computer did not respond as expected. The most serious failure occurred

during the second startup attempt on May 27, 1993, when during an attempt to withdraw

shutdown bank A, the operator observed that the analog rod position indication (ARPI) did

not indicate that the control rods were being withdrawn although the group step counter

showed demanded outward motion. The operator then attempted to insert shutdown bank A

to step 6; however, one control rod (1SA3) withdrew 8 steps while the group step counter

counted down from 20 steps to 6 steps. The operator then inserted the shutdown bank to

zero. At that time the ARPI indication for control rod 1SA3 indicated 15 steps. The plant

operators removed the power from the rod by pulling fuses and rod 1SA3 dropped to the

zero position by ARPI indication.

Preliminary NRC inspection revealed that the aberrant rod motion during the second startup

attempt may have been caused by a previously unrecognized single failure of the rod control

system. In response to NRC questions about the single failure issue and after consultations

with the RCS vendor (Westinghouse), PSE&G, on June 4, 1993, decided to bring Unit 2 to

Mode 3, hot standby. All the rods were inserted in the core and the reactor trip breakers

were opened to prevent any spurious rod withdrawal. Unit 1 continued to operate at 100%

power. The licensee submitted a justification for continued operation (JCO) for Unit 1 that

was accepted by NRC. Unit 1 tripped on June 8, 1993, due to an unrelated problem with

the circulating water system. Unit 1 started up on June 18, 1993, after NRC approval of the

PSE&G safety evaluation.

On June 22, 1993, PSE&G started RCS system retesting after extensive printed circuit card

replacements. During the test more failures occurred in the RCS. These failures were

unrelated to the previous failures and were most likely caused by a technician while he was

installing and removing jumpers (a device used to connect normally-isolated portions of a

circuit). On June 28, 1993, Salem Unit 2 commenced a reactor startup and the main

generator was connected to the electric grid on July 3, 1993.

1.2

Augmented Inspection Team (AIT) Formation

On June 5, 1993, senior NRC managers determined that an AIT was warranted to gather

information on the rod control system failures at Salem Unit 2. The AIT met with PSE&G

management and staff on June 6. PSE&G gave a presentation for the AIT on the facts

known at that time. A charter was formulated for the AIT and transmitted to the team on

June 7, 1993 (Attachment 1).

5

On June 8, 1993, NRC Region I issued a confirmatory action letter (CAL) that documented

the verbal commitments made by the licensee to the NRC regarding the rod control system

problems, the continued operation of Unit 1 and the subsequent restart of Unit 2.

The AIT completed initial inspection activities on June 17, 1993. PSE&G met with the team

and NRC management on June 18 to discuss their findings, corrective actions, and restart

plans. Additional on-site inspection was performed from June 23 through June 27 when

additional component failures occurred. This included the witnessing of final rod control

system testing and resolution of hardware problems. The actions agreed to in the CAL were

completed on June 27, 1993. The AIT charter was completed and the inspection terminated

on June 28, 1993.

1.3

Rod Control System Description

The Westinghouse rod control system (RCS) uses solid state logic and power electronics

packages to provide control and drive signals to the electro-mechanical ("Mag-Jack") control

rod drive mechanisms (CRDMs). (Refer to Attachment 2 for a system diagram). The

system commands are either a manual in or out signal generated by the reactor operator, or

an automatic signal generated by a difference in turbine power and its associated

temperature/pressure requirements, the actual reactor power, and the average temperature of

the reactor coolant (T.vJ*

The system develops a pulse train based on desired control rod direction and speed. The

system fixes the single speed for manual operation. If the signal comes from the automatic

part of the system, the difference between the required T.vg and the actual T.vg , generates the

speed signal. A higher difference in temperature* will cause a higher rod speed. This pulse

train ultimately generates the current pulses to the three coils in each CRDM. Because of

the core pattern, banks of rods consisting of one or two groups move in a predetermined

sequence. The logic cabinet provides the pulse train and the logic of which banks and

groups are to move. The logic cabinet also provides the memory to know where the rods are

in the sequence.

The logic cabinet also provides a signal to the control room group step counters to show

where the RCS has commanded the rods to be within the core. A separate analog or

individual rod position indication (ARPI) system that monitors actual position of each rod

(within a limited accuracy) shows the operator the actual rod location. There are limits on

the allowable mismatch between the RCS commanded position and the ARPI. These limits

require the operator to take corrective action to return the rods to the required position.

'-._

The power cabinets take the logic pulse train and convert it into a set of firing orders for

semiconductor power devices. These devices control the flow of power to the three CRDM

coils associated with each rod in a control rod group. When control rods are not being

moved, the cabinet powers the stationary holding coil to hold the rod at the desired position.

When control rod motion is required, the cabinets supply power in sequence to the movable,

6

lift and the stationary holding coils for each rod in the group. The "in" or "out" motion

command determines the sequence in which power is supplied to each coil, which then

controls which direction the rods move.

2.0

GENERAL SEQUENCE OF EVENTS

On March 16, 1993, Salem Unit 2 experienced an automatic reactor trip when steam

generator water levels reached their low level setpoint following the loss of one steam

generator feedwater pump. The trip occurred four days prior to the scheduled start date for

the Unit 2 seventh refueling outage. Plant management decided to enter the refueling outage

early and the unit was not restarted following the trip.

PSE&G had scheduled the implementation of a design change package (DCP) involving the

RCS power cabinet power supplies and a Westinghouse preventive maintenance service on

the RCS during the outage. The power supply DCP, which involved power supply

replacements and the rearrangement of the grounds on four power supplies within each of the

five power cabinets, was completed without incident. The Westinghouse preventive

maintenance service consisted of an inspection and repair, as necessary, of all of the circuit

cards in the RCS logic and power cabinets. During this work, four of the control rod group

counters in the main control room were replaced. One of the new counters came from

PSE&G's inventory and the other three were acquired from Florida Power and Light's

Turkey Point Station. During the retest of the new counters, several solid state components

were found to have failed on RCS system printed circuit boards. Repairs were made and the

system testing satisfactorily completed.

At the end of the outage, just prior to unit startup, the licensee conducted ARPI calibrations

on May 24 and 25. During these calibrations, additional problems with group counters for

control bank A, group 2; control bank C, group 1; and control bank B, groups 1 and 2 were

identified and repairs made by PSE&G.

On the evening of May 25, PSE&G initiated the startup of Unit 2 and began to withdraw the

control rods prior to diluting the primary coolant boron concentration to achieve criticality.

The licensee suspended the startup when the group counter for control rod control bank C,

group 1, stopped counting out while that group was being withdrawn. All control rods were

reinserted by the operators, and troubleshooting was initiated. The licensee identified and

repaired a number of problems, including shorted diodes on circuit cards in the RCS logic

cabinet. Subsequent to a RCS retest, PSE&G attempted a second startup on May 27.

During this startup, when operators attempted to withdraw shutdown bank A, no rod motion

was indicated on the ARPis and when the operator demanded a rod insertion for the bank,

rod 1SA3 actually stepped out 15 steps. The licensee removed power from the rod to return

it to the bottom of the core. Following this aborted startup, PSE&G identified and repaired

7

two additional failures on RCS circuit cards. After consultation with the RCS vendor,

Westinghouse, the licensee performed additional testing to demonstrate system operability

and, on May 28, undertook a third startup.

During the third startup, all control rods performed satisfactorily as they were withdrawn to

their critical position. As operators began the dilution to criticality, all four rods in control

bank C, group 1, fell completely into the core. Unit 2 operators entered the Abnormal

Procedure for a dropped rod event and subsequently manually tripped the plant, returning all

rods completely into the core. At this point, the Salem General Manager initiated a

Significant Event Response Team (SERT) to investigate the root cause of the RCS problems,

and a round-the-clock troubleshooting effort commenced to repair the RCS. The licensee

undertook an extensive investigation into the RCS and found faults on several more circuit

cards. These cards were repaired, and the licensee concluded that, because the faults were

similar to those that had been found and repaired previously, they had propagated a fault

through the RCS through their earlier troubleshooting efforts. Following this investigation,

PSE&G developed a root cause for the RCS failures and conducted additional testing to,

again, verify RCS operability. Satisfied that the system had been returned to normal,

PSE&G conducted a fourth startup on June 2, 1993.

The control rods behaved normally during the fourth startup; however, operators noted that

the pulse-to-analog converter and the rod insertion limit computer were not registering rod

motion while control banks Band D were being withdrawn. PSE&G management elected to

shut the unit down to investigate. The licensee found a defective component on a circuit

card in the RCS logic cabinet and attributed the problem to incomplete troubleshooting

following the third startup attempt. Once this additional card was repaired, operators started

up Unit 2 on June 3, 1993. This fifth startup took place without incident; however, PSE&G

decided to voluntarily shut down the unit on June 4 when it became evident that the problems

encountered on the second startup (when rod 1SA3 withdrew on an insertion signal) may

have been the result of a single failure, an event that is outside the plant's licensing basis.

Beginning on June 4, PSE&G, with assistance from Westinghouse, began an in-depth

investigation into all the problems that had been encountered with the Unit 2 RCS and an

analysis of the plant licensing basis. Additionally, the NRC dispatched an Augmented

Inspection Team (AIT) to Salem on June 5, 1993, to determine the cause of the RCS

problems that had occurred at Unit 2, to assess PSE&G's performance, and to develop any

generic applicability of those problems.

The AIT completed their initial on-site inspection effort on June 17, and PSE&G presented *

the results of their investigation to NRC Region I management on June 18, 1993. The root

.-

causes of the system failures had been identified, and the system repairs and modifications

were completed on June 21, 1993. System retesting commenced on June 22, 1993, during

which two additional transistor failures were identified. This resulted in additional circuit

cards being replaced. During the testing of the new circuit cards, one circuit board was

found to have an incorrect resistor installed and was then replaced with a spare card. System

8

testing was completed satisfactorily on June 27, 1993. The root cause for the transistor

failures was identified and Unit 2 commenced a reactor startup on June 28, 1993. See

Attachment 3 for a more comprehensive sequence of events.

3.0

ROD CONTROL SYSTEM CORRECTIVE MAINTENANCE/MODIFICATIONS

During the refueling outage, the only modification to the RCS was Design Change No. 2EC-

3136. PSE&G replaced obsolete power supply PS3 and PS4 (Lambda Model No. LM-261)

with new Lambda Model No. LCS-A-24-6795 for all five power supply cabinets. PSE&G

also replaced existing daisy chain neutral wiring between Lambda power supplies PS 1 to PS2

and PS3 to PS4 with an individual neutral wire for each power supply. This change was

done to minimize the risk of tripping the reactor during rework or replacement of any power

supply.

This change is similar to the change that was done at Salem Unit 1 under Design Control

Change No. lEC-3113. Unit 1 has been operating with this change without any problem

noted with the RCS. The team walked down the changes made to the power cabinets under

this design change and did not identify any differences in the installation between Unit 1 and

2. Based on the modification package review and the system walkdown, the team concluded

that the post-modification electrical configuration was equivalent to the original design and

that this design change did not cause the Unit 2 RCS failures.

In addition to the design change, the licensee had performed maintenance on the RCS. The

team reviewed all work orders (WOs) completed on the rod control system during the

outage. In'particular, WOs on an ARPI cabinet relay light (WO 930301114), insulation

resistance check on #2 CRDM (WO 931029004), and the repair/replacement of cracked outer

insulation of CRDM cables P-12 and P-8 for control rod 2SA3 and 1B4 (WO 930330102)

were reviewed. The team found no connections between these activities completed by

May 12, 1993, and the subsequent rod control system failures.

Work order 930421129 replaced four group step counters. The problems with the step

counters are discussed in Section 5.0.

4.0

EVALUATION OF PREVENTIVE MAINTENANCE ACTIVITIES

4.1

Vendor Work Scope

Approximately one month prior to the start of the Unit 2 refueling outage, PSE&G decided

to perform preventive* maintenance on the rod control system to ensure continued system

reliability. The work scope for the current outage was expanded beyond the routine

refueling outage preventive maintenance and testing to include a contract with Westinghouse

to perform a detailed cleaning, inspection, and testing of the rod control system. Minor

repairs and adjustments were to be performed by Westinghouse technicians. The

Westinghouse work scope also included the performance of static functional testing of the

..

9

installed rod control system printed circuit (PC) cards. Westinghouse had performed this

service at numerous reactor plants, including Salem Unit 1 in 1991.

4.2

Work Accomplished

The rod control system maintenance was performed in accordance with Westinghouse

procedure NSID-EIS-85-11, "Full Length Rod Control System Maintenance," which was

revised and approved for use at Salem Unit 2 as station procedure VSC.IC-PT.RCS-0003(Q),

"Full Length Rod Control System Maintenance." A summary of the work accomplished:

Pre-maintenance, as-found checks of the rod speed voltage, the oscillator-pulser

period, and resistance readings from the power supply neutral buses and the zero volts

bus to ground. The as-found readings were satisfactory.

A general precleaning inspection of the rod control cabinets and bus duct was

performed, the cabinets were cleaned, and a more detailed inspection was performed.

The inspection attributes included damaged or overheated components, wires touching

heat sinks, solder joint integrity, terminal tightness and correct lamp and fuse

installation. The identified deficiencies and the corrective actions taken were

documented in the procedure.

Following the cabinet cleaning, the power supply neutral bus and zero bus resistances

to ground were checked. Resistance read~gs were also taken on the current sensing

resistors. All readings were satisfactory.

Power supply checks were performed, and minor adjustments were made to ensure

the as-left voltages were within the procedure specifications. The overvoltage

protectors in the logic and power cabinets were also checked and minor adjustments

made to restore the trip points to the desired value.

The operation of the urgent and non-urgent alarms was checked, and the oscillator

and pulser periods were adjusted.

Various system logic tests were performed, and the operation of the control rod group

step counters was checked. Four group step counters failed the test acceptance

criteria and were replaced and satisfactorily retested; however, several circuit card

failures were also identified and corrected.

Current traces were taken using dummy coils in place of the normal lift, stationary,

and moving coils. All traces were as expected.

10

The Westinghouse work on the PC cards included the following:

Verification that each PC card is the correct configuration for the site. No problems

were identified with card configurations.

Visual inspection of the condition of the components on the PC cards. These

inspections identified cracked or pitted solder joints on 70 of the 184 (174 installed

cards and 10 spare) PC cards. The technicians repaired all deficiencies.

Performance of a static functional test of installed cards and spare cards using factory

test equipment to new card test acceptance criteria. Twenty-two cards failed the test

and were repaired and retested (this included 3 spare cards).

Performance of minor repairs on failed cards on-site using Westinghouse supplied

components and documentation of the failures. The cards that had problems identified

during the visual inspection and/ or functional tests were repaired by the Westinghouse

technicians.

PC cards requiring in-depth troubleshooting were to be returned to Pittsburgh for

repair or be replaced with a site spare. Three spare cards were utilized when

problems were encountered following the replacement of the group counters as

discussed above.

The Westinghouse technicians estimated that the number of repairs required to the circuit

boards was typical based on their experiences at other plants.

A more detailed review by

the SERT determined that the number of card repairs on Unit 2 were significantly higher

than the number of repairs required on Unit 1 .. Seventy Unit 2 cards required solder

connection repairs as compared to 28 on the Unit 1 cards. On both units the static card

testing failure rate was approximately 11 % .

4.3

PSE&G Oversight of Vendor Work

The maintenance procedure provided by Westinghouse was reviewed by PSE&G and revised

to ensure the appropriate station controls were incorporated and the procedure was approved

as a Salem station procedure. The PSE&G Instrumentation and Controls (I&C) department

supervisor ensured that the vendor technicians met the station requirements and were

qualified to work on the rod control system. The work was properly authorized by the shift

supervision, and the I&C supervisor coordinated the processing of the work order controlling

the maintenance activities. The rod control system work was performed primarily by

Westinghouse technicians who reported problems and status of the work to a PSE&G I&C

supervisor. The decision of what minor repairs were to be made was left to the discretion of

the Westinghouse technicians. The technicians documented the test data and deficiencies as

11

well as any repairs and adjustments that were made to resolve out-of-specification data and

hardware deficiencies. The completed work package was reviewed and approved by the I&C

supervisor.

PSE&G did not give Westinghouse specific direction as to the standards to which they

expected the work to be performed. For example,. the licensee did not specify criteria that

would be applied during the visual examination of the circuit cards. PSE&G expectations as

to how work was to be performed--such as soldering, installation of jumpers and working on

energized equipment--were not specified. PSE&G did not review the Westinghouse soldering

procedure to ensure that it met PSE&G standards. PSE&G personnel did not have a clear

understanding of the capabilities of the circuit card test consoles and were not aware that

some individual components on a card could be failed and not be identified during the bench

test.

4.4

Independent NRC Inspections

The NRC inspectors performed an independent inspection of several PC boards that were

installed in the logic cabinet to assess the effectiveness .of the preventive maintenance work

performed during the outage. The inspector also inspected four PC boards that were

removed from the power and logic cabinets and replaced with spare cards following the rod

drop event, which occurred on May 28, 1993. The cards were examined visually using a

JOx magnifying glass. The inspector identified solder connections that did not meet the

quality required by the PSE&G and Westinghouse soldering procedures; however, all of the

connections appeared to be adequate to ensure the circuit operability. On the stationary

gripper regulation circuit card, which was replaced as a result of the control rod drop

problem, the inspector identified degradation of the circuit board solder run. The circuit

traces had areas where the edge of the solder run had slivers of solder partially separated

from the trace. Most of the slivers were attached to the trace on one end and were parallel

to the solder run. However, several of the slivers had been displaced perpendicular to the

solder run to which they were attached and two appeared to bridge the gap between two

solder runs resulting in potential short circuits on the boards. PSE&G evaluated this

condition and determined that short circuits on these portions of the circuit board would have

resulted in the dropped control rods without a coincident failure alarm.

The inspectors reviewed the PSE&G technician training program for performing soldering of

circuit components. The course required the technicians to make high quality solder

connections of various configurations in order to become qualified. The inspectors reviewed

training records for several technicians involved in repair work performed on the rod control

system following the preventive maintenance effort and verified the technicians had

completed the training. An in-progress circuit repair was observed by the inspectors and

found to be properly performed resulting in a good quality repair.

12

4.5

Conclusions

The decision to perform the rod control system preventive maintenance effort was prudent.

Similar preventive maintenance previously performed on Salem Unit 1 had been performed

and subsequent system operation was reliable. Numerous deficiencies, which did not prevent

the system from operating in the past, were identified and corrected to avoid degradation to

an extent where future failures may occur. The technicians performing the work were well-

qualified, and the quality of their work was generally good. The identified deficiencies and

corrective actions were clearly documented in the work package.

The team concluded that PSE&G exerted minimal control over the vendor work effort. To a

large degree, the vendor technicians worked independently and applied their own standards

when evaluating system deficiencies. PSE&G did not specify inspection criteria to be used

during visual inspections of cards and components and for inspection of work performed such

as soldering.

The PSE&G personnel were not aware of the capabilities of the circuit card test consoles and

did not realize that the testers may not locate all failed .components on a circuit card. Cards

installed in the system after a successful bench test were assumed to be 100 % operable when

they could have failed components, such as shorted diodes.

The team concluded that preventive maintenance work was not the direct cause of the

numerous circuit card failures. However, the inspector concluded that a more detailed

circuit card inspection should have identified the degraded solder traces on the regulation

card and thereby avoided the rod drop event. Although the replacement of the group step

counters introduced a different model counter that contributed to the component failures, the

inspectors did not identify any inappropriate actions during the replacement activity.

5.0

EVALUATION OF TROUBLESHOOTING ACTIVITIES

5 .1

Background

As discussed in section 4.0, PSE&G decided to perform extensive preventive maintenance on

the rod control system during the refueling outage. During this work, on May 14, 1993,

Westinghouse identified that two step counters on the c.;ontrol room RCS panel failed an

operability test and two others marginally passed.

On May 16, 1993, a Westinghouse technician replaced the four counters with one unit from

the PSE&G stock and three obtained from the Turkey Point Nuclear Plant. During the

testing of the new counters, the first set of problems with the printed circuit cards occurred.

Westinghouse and PSE&G personnel performed the system troubleshooting, which involved

in-cabinet measurements and swapping of suspected bad printed circuit cards with good

circuit cards. The Westinghouse circuit card testers had been shipped to another site

following the completion _of that phase of the preventive maintenance work and, therefore,

13

were not available for troubleshooting. This effort found several failed components on the

supervisory data logging (SDL) and relay driver cards (RDCs).

During subsequent surveillance testing and \\plant startup attempts, additional failures were

encountered. The test consoles were returned to Salem on May 20, 1993, and PSE&G and

Westinghouse personnel continued to troubleshoot, repair, and replace printed circuit cards.

The problems included the failure of numerous integrated circuits, small signal diodes, and

power transistors. The integrated circuit compon~nts were from the Motorola High

Threshold Logic (HTL) family, a noise-resistant design. The failed logic circuits included

NAND, NOT and AND gates.

This effort culminated with PSE&G informing the NRC on June 2, 1993, that they had found

and repaired all of the failures, and that the reactor was ready for restart.

5.2

PSE&G Activities Prior to AIT Arrival

PSE&G experienced over thirty failures of solid state devices between May 16 and

June 5, 1993. When failures occurred in several areas in the logic cabinet, PSE&G decided

that the failures might be related and commenced a troubleshooting effort to identify the

relationship between the failures. Refer to Attachments 4 and 5 for a list of failures,

locations, and the final AIT determination of probable causes of the failures.

During the troubleshooting effort that began on May 16, 1993, the Westinghouse circuit *

cards testers were not on site, and the troubleshooting was performed by replacing suspected

failed cards with spare cards and the interchanging of similar cards within the logic cabinet.

The replacement of integrated circuit chips on failed cards was also performed. The

. capability to test functionally suspect cards was added to the troubleshooting methods on

May 20, 1993, when the card test units were returned to Salem. The troubleshooting was

performed on all shifts by PSE&G and Westinghouse persc;mnel. The Westinghouse role was

not specifically defined by procedure or contract and the technicians worked with the PSE&G

personnel as peers. The troubleshooting efforts continued through June 3, 1993, and resulted

in the identification and repair of numerous and often repeat component failures. The

troubleshooting for each rod control system malfunction ended with the identification and

repair of the specific component that caused the malfunction, even after repeat failures

continued to occur.

A PSE&G maintenance engineer initially concluded that the root cause of the failures was a

bad blocking diode on a relay driver circuit card. The cause of the initial failed diode was

attributed to random failure. PSE&G also concluded that, during the troubleshooting efforts

as the circuit card with the bad diode was moved from slot to slot, it caused failure of

several supervisory data logging circuit cards. The failed integrated circuit on the SDL

circuit card was then assumed to have caused additional diode failures. PSE&G finally

concluded that the interaction between failed diodes and integrated circuits was the cause of

all of the component failures except for a blown fuse in the 100 VDC power supply and a

14

failed auctioneering diode in the -15 VDC power supply. These failures were attributed to a

short circuit caused by technicians during the installation of test equipment.

Prior to the arrival of the AIT, PSE&G had not performed any safety evaluation to assess the

significance of the rod control system malfunctions and failures. Specifically, when control

rod 1SA3 withdrew during an insert signal, PSE&G did not investigate the potential that a

single component failure could have caused the event, thereby resulting in a condition that

was outside the design basis of the plant.

The plant management held a meeting to discuss the cause of the failures. During this

meeting, no one contradicted the root cause as the diodes, although subsequent interviews

revealed that several people had different theories. PSE&G informed the NRC of their root

cause determination during a June 2, 1993, phone call.

An AIT member went to the site on June 3, 1993, when PSE&G reported that another SDL

printed circuit card had experienced a failure during a plant startup. The inspector reviewed

the results of the troubleshooting and repair efforts and interviewed involved personnel to

determine whether the root causes identified by PSE&G were consistent with the available

information. The station troubleshooting procedure, SC.IC-GP.ZZ-0006(Q), "Controls

Equipment - Troubleshooting" was also reviewed.

The AIT member also reviewed the system design to determine whether a single component

failure could have resulted in the 1SA3 control rod moving out when rod in-motion was

demanded.

Conclusions

The team concluded that, prior to the AIT arrival, the troubleshooting efforts lacked clear

leadership and delegation of responsibilities. Personnel on each shift would isolate and

eliminate the cause of the most recent system malfunction; however, no one had the overall

responsibility to identify and correct the root cause, even after repeat failures continued to

occur. The troubleshooting procedure does not require the identification of the reason for a

component failure. When a root cause was presented, it was accepted by plant management

even though there was no analysis or testing performed to ensure it was a valid explanation

for the failures.

The team also concluded PSE&G had not evaluated the rod 1SA3 withdrawal event for a

single failure design vulnerability since it was assumed that the problem observed had been

the result of two component failures. The team found that the failure of one logic gate on a

slave cycle decoder circuit card would cause the same CRDM current signals as those

observed during the inadvertent outward rod motion on rod 1SA3.

15

5.3

Actions After AIT Arrival

After the arrival of the AIT, PSE&G formed a task force composed of elements of their

Engineering, Operations, Systems, Maintenance, and Training departments, as well as

Westinghouse Engineering and Licensing department personnel. PSE&G tasked this group

with finding the root cause of the failures and investigating the reason why one rod in a bank

of eight moved out and the other seven did not.

PSE&G, with Westinghouse assistance, used the utility's control rod system simulator in the

training center to investigate the failures. They introduced the component failures found in

the plant RCS and were able to reproduce the current traces to the CRDM coils that had

been taken in the plant after the single rod out event. Further, they demonstrated that a

single failure would cause outward rod motion with an inward demand. Again, the current

traces observed during this demonstration matched those taken in the plant as well as the

traces predicted by Westinghouse Engineering.

PSE&G also simulated the diode failure, assumed to be the cause of the component failures,

in the training center. They did not observe any integrated circuit failures similar to the

plant failures. This finding contradicted PSE&G's original root cause of the integrated

circuit failures and validated the AIT conclusion that the original root cause determination

was in error.

PSE&G put in place a troubleshooting team with a single leader and, utilizing PSE&G and

Westinghouse personnel and the Westinghouse card testing equipment, a bench test unit was

set up in an effort to validate suspected root causes.

During this testing, PSE&G learned that the new group step counters that were installed on

May 16, 1993, had different electrical characteristics than the previously installed units.

These different characteristics resulted in higher back electromotive forces (EMFs), when the

counter coils were pulsed, than those seen with the original counters. A surge suppression or

"wheeling" diode is installed to shunt this back EMF around the coil to protect the solid state

components that drives the coil. This diode is on the relay driver circuit (RDC) board and is

not related to the blocking diodes that failed. PSE&G learned that the old counter EMF was

such that, even without the wheeling diode in the circuit, the voltage spike produced

probably would not damage the connected transistor, nor would it propagate to any other

parts of the circuit. Without the wheeling diode, the new counter would exceed the transistor

rating and would sometimes cause the transistor to saturate or not tum off when the signal

was removed from the transistor. PSE&G noted that they had seen evidence of this

condition, in the form of condensation on the group counter face due to coil heatup, during

their troubleshooting during the time prior to the team's arrival. The coil heating resulted

when the transistor saturated causing continuous current flow through the coil instead of the

normal momentary pulses. Initially, PSE&G did not identify any relationship between the

condensation on the group counter face and the component failures. The connection was

made when a counter exhibited the same condensation condition during bench testing.

16

PSE&G also noted that the pulse without the wheeling diode was strong enough to propagate

to the integrated circuit on the SDL card output. They noted voltages of about 26 volts at

this point during the shop simulation. Since the integrated circuit is rated at a maximum of

20 volts for one second, the 26 volt or greater spikes could cause the integrated circuits to

degrade and fail.

5.3.1 Component Failure Analysis

PSE&G sent several failed integrated circuits to the manufacturer, Motorola, for a failure

analysis. Motorola's conclusion was that an over-voltage event caused the damage seen in

the integrated circuits. They found some integrated circuits failed, while others were

degraded and starting into failure. PSE&G also sent one integrated circuit to Westinghouse

and several diodes were sent to Phillips Electronics. The organizations, with one exception,

listed the cause of the failure as voltage overstress. In one case, electrostatic discharge

(ESD) was identified as the likely failure mechanism. Motorola informed PSE&G that,

while they used the term BSD, that did not necessarily mean the failure was caused by

handling, but by some voltage greater than 100 volts. These observations were consistent

with the events seen at the plant. Motorola also noted that voltage spikes can also degrade or

fail integrated circuits that are not in the immediate circuit subjected to the voltage spike.

5.3.2 PSE&G Conclusions

Based on the above, PSE&G concluded that the most likely root cause of the solid-state

component failures was the addition of the new counters, coupled with a bad printed circuit

card connection in the RCS logic cabinet that prevented the suppression diodes from

connecting to the circuit. The counters obtained from Turkey Point function properly in the

system as long as the surge suppression feature is present.

The potentially bad pin connection would have likely been in the connector in the RCS

cabinet. The pin connector on the printed circuit card was visually inspected during the

original Westinghouse maintenance work. PSE&G noted that they had straightened a few

pins during the maintenance and troubleshooting, but were not sure if they had checked or

straightened this particular pin. No pin problems were identified during inspections

performed subsequent to the identification of the voltage spike problem.

5.3.3 Corrective Actions

PSE&G replaced the 74 printed circuit cards in the logic cabinet to ensure that all damaged

or degraded components, as a result of the voltage spikes, were eliminated from the system.

Additional suppression diodes were also added directly at the counter coil terminals to

eliminate the possible loss of the suppression circuit should the pin connection degrade. The

system was then retested using the applicable portions of the test used during the

Westinghouse preventive maintenance work. During the testing, PSE&G noted that an

"Urgent Alarm" that should have annunciated at the shutdown bank C and D (SCD) power

17

cabinet did not illuminate. They tested several printed circuit cards and found that the signal

processing card and the alarm card in the SCD cabinet each had a failed transistor (Q9). The

transistors were replaced and system testing was satisfactorily completed.

All of the circuit cards had been bench tested satisfactorily prior to installation and had

performed normally during the initial system retest. The failed transistors were sent to a test

facility and were both found to have been damaged by an apparent overcurrent event that

resulted in the catastrophic failure of the transistor emitter. PSE&G then reviewed the

circuits involved as well as work that was in progress at the estimated time of failure. They

determined that the transistor failures occurred at about the time that two failure detector

cards were reinstalled. These cards had been removed earlier as directed by the test

procedure. During the time the cards are removed, a "card removed" alarm signal is

disabled by the installation of a jumper into the backplane of the power cabinet card cage.

At the time of the failure both Westinghouse and licensee personnel were involved in the test

activities. A licensee instrumentation and controls (I&C) technician was tasked with the

installation and removal of the two jumpers. The jumper installation and removal was

normally performed by Westinghouse technicians. The jumpers are installed in the card

connection adjacent to the signal processing and alarm cards. In one case, the access space

to the card cage connector is only about two inches wide making the exact placement of the

jumper difficult. The jumpers are small "h"-shaped assemblies that are designed to jumper*

across two card connector pins. The size and shape of the jumpers also contributed to the

difficulty of the jumpering task and increased the probability of installing the jumper across

the wrong set of pins. After reviewing the affected circuitry on the signal processing and

alarm cards, the transistor failure mode, the jumper installation and removal technique and

the proximity of the jumpering activity, PSE&G determined that the most likely cause of the

transistor failures was momentary short circuits caused by the jumpering activity. PSE&G

determined that, in one case, the I&C technician inadvertently shorted two pins in the back

plane of the card cage and, during another step, shorted two adjacent traces on the back of

the printed circuit card. PSE&G determined that these incidents caused a misbiasing of the

transistors and the resultant failures. Due to the low voltage and current levels involved, the

momentary short circuits caused by the jumpers would not have been readily discernible by

the technician.

As a result of the transistor failures, PSE&G replaced 13 circuit cards in the SCD power

cabinet. During the retest of these cards an incorrect resistor was identified on one of the

new cards. See section 10.1.5 for details. This card was again replaced and the system

retested satisfactorily.

5.3.4 AIT Assessment

The team reviewed the PSE&G troubleshooting results, component failure analyses, and

engineering evaluations associated with the RCS. The team concluded that unsuppressed

18

voltage spikes generated by the new group counters were the likely cause of the multiple

solid state component failures. The team also concluded that the corrective actions taken by

PSE&G were appropriate.

The team evaluated the PSE&G root cause evaluation for the Q9 transistor failures and

concluded that it was a reasonable explanation for the failed transistors. The team noted that

the poor jumpering techniques, which PSE&G initially failed to identify and correct, were

eliminated by the use of a modified circuit card in the place of the jumpers.

5.3.5 General Observations

The team occasionally observed the technicians handling the circuit cards by capacitors or

transistors mounted on the cards. This practice can mechanically stress solder connections

and may have caused some of the cracked solder joints observed by the team.

The team also noted that PSE&G technicians and the vendor technicians handled the printed

circuit card such that their hands contacted the traces and components on the board.

Although the component families selected for these boards are resistant in the area of

electrostatic discharge (BSD) damage, the handling could cause damage to the solid state

devices on the board.

PSE&G has since implemented proper handling techniques.

5.4

Summary of Component Failures

Attachment 4 summarizes the repeat failures of printed circuit cards in the logic cabinet slots

Al13, Al14, A713, and A714. Slots A113 and A114 are locations of the supervisory data

logging (SDL) cards, and slots A713 and A714 are locations of the slave cycler decoder

(SCD) cards. Attachment 5 is a list of failures in the RCS that occurred after the completion

of preventive maintenance activities. This attachment includes the corrective actions taken

and the most probable root cause of each failure. The root causes listed were determined by

PSE&G following the arrival of the AIT. The team independently reviewed the available

information and concluded that the root causes assigned by PSE&G were the most likely

causes of the failures. In one case, for the bank overlap card, PSE&G had not determined

the cause of the failure and the card was sent to Westinghouse for a failure analysis. The

team concluded that electrical overstress was the most likely cause of this failure.

6.0

RESTART DECISION PROCESSES

6.1

Description of PSE&G Actions

~.*~*

~-

Following the work performed on the RCS during the Unit 2 seventh refueling outage,

including the Westinghouse preventive maintenance on the circuit cards in the RCS logic and

power cabinets and the successful performance of ARPI calibrations, Salem management had

19

no apparent reason to suspect the operability of the RCS. The initial startup conducted on

May 25, 1993, was performed with I&C engineers present on 12-hour shifts, providing

normal startup coverage. It was these engineers who initiated the troubleshooting subsequent

to the shutdown performed when the control bank C, group 1, counter failed. PSE&G found

a number of component problems on the circuit cards in the RCS, including several shorted

blocking diodes. PSE&G determined the root cause of this first evidenced problem to be the

shorted diodes and that the movement of those failed diodes throughout the RCS during

troubleshooting had caused the large number of other problems. After repairs, PSE&G

verified the proper operation of the RCS and the group counters by stepping the counter for

each control rod group "OUT" for the full range of 228 steps and then cycling the counter

"IN" for 228 steps. Following the successful completion of that test, the Salem General

Manager (GM) authorized a unit restart.

The second Unit 2 startup was abandoned when shutdown bank A did not withdraw after

being commanded 20 "OUT" steps and then rod 1SA3 withdrew 15 steps when the bank of

rods had been given 20 "IN" commands. Additional licensee troubleshooting identified two

slave cycler decoder cards with bad ICs, the effect of which produced irregular current order )

signals to the control rod drive mechanisms (CRDMs) .. PSE&G repaired the defective cards

and consulted with Westinghouse to confirm that the action of shutdown bank A was

expected given the two identified failures. Westinghouse explained to PSE&G that the

current orders produced by the failed circuit cards would not have damaged the CRDMs.

Once the faulty cards were repaired, the licensee exercised the RCS and the CRDMs by

stepping every control rod group "OUT" 20 steps *and then "IN" 20 steps. When all group

counter and ARPI indications agreed and the rods behaved as expected, the Salem GM

authorized a unit startup. The GM later recounted to the inspector that he believed the

problems seen with the RCS were still due the propagation of faults through the movement of

the initial failed logic cards. The GM also related that PSE&G had not considered the 1SA3

rod withdrawal event to be contradictory to the single failure criteria for the RCS because

two failures had occurred.

After the four rods in control bank C, group 1, dropped into the core during the startup

attempt on May 28, and a manual reactor trip was performed, the Salem GM chartered a

Significant Event Response Team (SERT) to investigate the root cause of the RCS problems

and to assess the GM's decisions to restart the unit following each failure. Following the

May 28 event, the licensee also expanded their troubleshooting effort to a three shift round-

the-clock team approach with each shift consisting of a maintenance engineer, I&C

technicians and supervisors, system engineering personnel, PSE&G training center personnel,

and Westinghouse personnel. In accordance with Salem procedure AD-16, "Post-Reactor

Trip/Safety Injection Review Report," a root cause determination was required before restart

could be authorized by the GM following the plant trip. The licensee's troubleshooting effort

found circuit card component failures similar to those identified previously and separate

evidence of a faulty firing card in the RCS power cabinet. All logic cabinet cards were

repaired, and the licensee replaced the identified firing card and three other cards associated

with the group of dropped rods. The root cause determination for the RCS failures

20

submitted to and accepted .by the Salem GM cited the original shorted blocking diode and.

subsequent associated troubleshooting as one root cause, and a random and intermittent

failure of the firing card as another root cause. The GM accepted this root cause

determination and, after additional extensive functional testing of the RCS, authorized

another Unit 2 startup.

The failure encountered during the fourth startup of Unit 2 involved the pulse-to-analog

converter and the rod insertion limit computer input for control banks B and D. PSE&G

identified a failed chip in a location identical to a number of previously failed ICs. Station

personnel believed this failure to be a previously existing failure that was missed during the

extensive troubleshooting efforts of May 28-June 1. The card was repaired, and the rods

were again exercised to assure proper rod behavior and group counter and ARPI agreement.

With the testing satisfactorily completed, PSE&G performed the fifth startup of Unit 2. This

startup was completed successfully, and low power physics testing was underway when

PSE&G decided to voluntarily shut down the unit pending the full investigation into the

single failure and licensing basis issues.

Prior to each Unit 2 startup, the Salem Operations Department performed the requirements of

integrated operating procedure IOP-3, "Hot Standby To Minimum Load." This procedure

requires the senior nuclear shift supervisor to obtain all necessary Department sign-offs

certifying that the unit is prepared for startup and to obtain the Operations Manager's

authorization to change the unit's operating mode.

6.2

Assessment

A team member monitored the licensee's actions from May 25 to June 4, 1993. During that

period, the inspector maintained contact with Salem plant management and observed many of

the licensee's actions. Subsequent to June 4, the team reviewed operators' logs and

procedures used during the five startups and shutdowns, and interviewed the operators, shift

supervisors and plant management that had been involved with the events.

The team determined through direct observation and later review that Salem operators had

complied with all applicable procedures and Technical Specifications (TSs) during the five

startup and shutdown events. The operators displayed good awareness of plant conditions

and quickly identified and responded to the RCS abnormalities during the startups. Salem

management properly supported the shutdown of the unit once the abnormalities were

identified. The inspector verified that the licensee had satisfied all procedural and TS

requirements prior to the restart of the plant subsequent to each RCS event.

The team noted that PSE&G management recognized and responded to each RCS problem as

it occurred. PSE&G had prepared for potential problems by having I&C engineers on 12-

hour shifts for startup coverage and expanded the troubleshooting effort subsequent to each

startup to resolve the RCS problems. Salem management remained involved in those

troubleshooting efforts and challenged the technical staff to find and repair the cause of each

21

event. Licensee management also properly involved the system vendor to assist in problem

investigation and to verify system operability following the unexpected rod 1SA3 withdrawal

event. The inspector determined that Salem management, including the GM, devoted

adequate resources to the investigation of each RCS event and that management did not

initiate any unit restart before they thought that each individual problem had been identified

and resolved.

Despite the Salem management and technical stafrs attention to and resolution of each

individual RCS problem, the team identified a weakness in the licensee's pursuit and

consideration of a root cause for the sustained problems encountered with the RCS. See

section 7. 0 for further discussion.

As the inspection progressed, the team noted a significant improvement in the licensee's

control of troubleshooting and identification of root cause for component failures. This was

particularly evident during the resolution of the Q9 transistor failures of June 23 and the

unexpected urgent alarm that occurred on June 25, 1993. The systematic troubleshooting and

in-depth root cause analyses identified a plausible explanation for the transistor failures and

identified a specific incorrect circuit card component as .the cause for the unexpected alarm.

6.3

Conclusions

The team determined that PSE&G satisfied all procedural and TS requirements prior to

initiating a Unit 2 startup, that operators properly responded to the RCS problems, and

station management responded to and devoted adequate resources to each RCS event.

Proximate causes were developed for each event, and testing of the RCS was performed

before each unit startup. The team also concluded, however, that the PSE&G restart process

did not provide for the programmatic determination of a root cause for the RCS problems

and, had such a determination been performed, that the repetitive startups and shutdowns of

Unit 2 could have been avoided.

7.0

ROOT CAUSE POLICY AND PROCEDURES

The team reviewed the station policy and procedures relative to root cause determination

particularly as it applies to troubleshooting, maintenance and unit restart.

Station procedure SC.IC-GP.ZZ-0006(Q), "Controls Equipment - Troubleshooting," is used

to control troubleshooting of instrumentation and controls systems. A similar procedure

exists for mechanical sys~ems. Sectio~ 5.9 of the procedure require~ the user to "Determine ft.

the cause of the malfunction by recordmg as found data and companng to the expected

performance as derived from the procedures, ICD Cards, valve data cards or vendor

manuals." This is the extent of guidance provided in this procedure relative to determining

root cause of the equipment failures.

The work control system at Salem requires the completion of a cause section per procedure

NC.NA-AP.ZZ-:0009(Q), "Work C<;mtrol System." This procedure lists thirteen alternatives

22

for the cause of an electrical/electronic controls problem; however, no guidance is provided

for the user of the procedure. Absent guidance, the maintenance craft--with supervisory

help--would have to determine cause and classify that cause within a limited set of general

categories.

The AD-16, "Post-Reactor-Trip/Safety Injection Review," report (see section 6.1) specifies

that the root cause be determined for the component failure prior to startup. By this .

procedure, it is left to the General Manager to accept Engineering's determination of root

cause. For failures not resulting in a plant trip or safety injection, there is no guidance in

the station procedures on when a root cause determination should be required and whether a

determination should be completed before considering the system operational.

The team identified instances where the cause was simply listed as componetit failure or

random failure without any explanation for what may have caused the component failure.

When a shorted blocking diode was found during the initial RCS troubleshooting, the root

cause was attributed to random failure. As discussed in section 5.2, the PSE&G

management then attributed integrated circuit failures to the failed diodes. As discussed in

section 5. 3, subsequent testing in the shop and the training center showed this cause was

incorrect. The multiple card and diode failures, which were identified, were assumed to be

the result of moving circuit cards around to various locations during the troubleshooting and

thereby promulgating additional failures. The initial troubleshooting focused primarily on

finding and replacing the failed components with minimal focus on establishing why the

particular component failed. Postulated root causes were not tested to ensure they had a

sound engineering basis. PSE&G eventually concluded that the circuit card with the failed

diode would function normally even with the diode short circuited and that there was no

plausible explanation for why the shorted diode would result in other component failures.

Further, three members of the plant staff involved with the troubleshooting stated to the team

that they did not believe the root cause as described in the post-trip review document for the

May 28, 1993, manual reactor trip. They also reported that they did not air their concerns

to management at the time.

The team concluded that there was no clear station policy or expectations on how, when and

to what extent to perform root cause analysis for component failures. PSE&G management

expected the plant staff to use engineering judgement to determine how far to pursue the root

cause of component failures. There also is no clear policy on whether or not a system can

be considered operable without having determined the root cause of any failures.

8.0

COMPENSATORY l\\1EASURES/JCO

PSE&G decided to shut down Unit 2 on June 4, 1993, to assure the safety of the plant while

the root cause, potential safety-significant effects and licensing basis relevance of the RCS

problems were investigated. At the time, Unit 1 was operating at 100% power, and the

licensee had not experienced any problems with the Unit 1 RCS. On June 8, 1993, the NRC

Region I Office issued a Confirmatory Action Letter (CAL) to PSE&G, which affirmed the

l

23

licensee's commitment to maintain Unit 2 shut down until the above-mentioned investigation

was satisfactorily completed and to prepare a Justification for Continued Operation (JCO) to

assure the continued safe operation of Unit 1.

Between June 4 and June 8, PSE&G prepared the JCO for Unit 1, which was approved by

the Salem Station Operations Review Committee (SORC) on June 8, 1993. The JCO

considered such factors as the current plant status, RCS alarms and indications, operator

training, the unit licensing basis and accident analyses, and demonstrations of RCS

operability. The JCO also considered compensatory actions to be taken by unit operators,

including the placing of the RCS into the MANUAL mode in order to prevent automatic rod

motion without operator cognizance. The SORC approved the JCO for Unit 1 operation in

Mode 1 (Power Operation); however, within an hour and a half of the approval, Unit 1

tripped off-line due to the loss of the main condenser system when river debris clogged the

circulating water system travelling screens. With Unit 1 forced to enter Mode 3 (Hot

Standby), the JCO was no longer valid, because it did not consider the effects of restarting

the plant.

On June 15, 1993, SORC reconvened to consider a new JCO, one that did include an

analysis of unit restart. This second JCO considered all the factors of the first JCO and the

additional accident analyses related to startup events. The new analysis showed that, even

given the possibility of an inadvertent rod withdrawal accident, protection of the reactor core

and fuel would be assured in all cases except an asymmetric rod withdrawal from a

subcritical condition. The licensee committed in the JCO that any startup would be

performed by first withdrawing control rods and then achieving criticality by dilution of the

primary coolant boron concentration, thereby avoiding the possibility of an inadvertent rod

withdrawal from a subcritical condition.

With the SORC approval of the second JCO, and its subsequent approval by NRC Region I

and NRR, PSE&G initiated a Unit 1 startup on June 18, 1993. Prior to the startup, the

licensee implemented a design change to the RCS in which additional suppression diodes

were placed in each group counter circuit to assure the blocking of the EMF pulses back into

the RCS circuitry. This design change, additional RCS surveillance testing and the

compensatory measures of the JCO were completed, and Unit 1 achieved criticality on June

19 without event.

The team monitored licensee performance throughout the period in which the JCOs and

compensatory measures were developed and implemented. The team noted very"good

cooperation between the Salem Operations and Technical Departments and PSE&G

Engineering and Plant Betterment in the development of the additional accident analyses and

compensatory measures. A team member attended all SORC meetings associated with the

JCO preparation and was present in the control room for Unit 1 startup.

24

The team concluded that the licensee's approach to the continued operation of Unit 1 was

performed in a deliberate and conservative manner.

9.0

SAFETY SIGNIFICANCE

The identified failure in the Salem RCS could have potentially resulted in the operation of

either of the Salem units outside their design basis. Licensee evaluation of the failure, in

accordance with 10 CFR 50.59, concluded that the potential single failure was an

Unreviewed Safety Question (USQ). PSE&G subsequently performed an engineering

evaluation to ensure the safe restart and operation of both Units 1 and 2.

The applicable portion of the Salem licensing basis, as expressed in the Salem Updated Final

Safety Analysis Report (UFSAR) sections 4.3 and 15.3.5, stated that multiple failures in the

RCS would be required for a single rod withdrawal to occur. The single rod withdrawal

event was, therefore, allowed to be considered as an ANSI N18.2 Condition III event

(Infrequent Faults), for which the acceptance criteria allowed a small percentage ( <5%) of

fuel failure due to the low probability of event occurrence. With the determination that a

single failure could cause an inadvertent rod withdrawal accident, the event had to be

reevaluated against the criteria of a Condition II event (Events of a Moderate Frequency).

These criteria required that the Departure From Nucleate Boiling Ratio (DNBR) limit is not

exceeded, and, therefore, fuel design limits are maintained. The licensee was also required

to demonstrate that the plant still met the intent of the General Design Criteria (GDC) 25 of

10 CFR 50 Appendix A, which requires the protection system be designed to assure that fuel

design limits not be exceeded for any single malfunction of the reactivity control systems

such as accidental withdrawal of control rods.

The PSE&G safety evaluation demonstrated that no fuel design limits would be exceeded for

any of the affected operating transients other than the asymmetrical rod withdrawal from

subcriticality. PSE&G addressed the potential for exceeding fuel design limits as a result of

asymetrical rod withdrawal from subcriticality through administrative controls and, thereby,

resolved the USQ. PSE&G, therefore, concluded that both Salem units complied with the

requirements of Condition II events and GDC 25. PSE&G's evaluation was predicated on

several factors: the relevant failure does not affect the ability of the reactor protection

system to perform its intended safety function; the failure is detectable based on periodic

surveillance testing and operator verification of control rod position; although not taken

credit for in the evaluation, alarms, administrative controls, and compensatory measures

implemented specifically for the event provided further assurance that the discovered failure

would not adversely affect the safety of the plant; and the evaluation bounded all possible rod

motions considered in the evaluation. The licensee recognized the evaluation was an interim

document and could be affected by design changes to the plant (i.e., the installation of digital

electronic group counters) or further industry-wide revelations and developments.

The team reviewed the Salem licensing basis, observed portions of the licensee's deliberation

on the matter, and reviewed the final versions of the PSE&G 10 CFR 50.59 documentation

25

and their engineering safety evaluation. The team verified that this event did not provide a

challenge to the reactor coolant system or the containment boundary and further concluded

that, although the safety significance of the issue is not high, PSE&G acted properly and

prudently in their investigation into and resolution of all licensing basis concerns. The team

concluded that PSE&G's safety determination was accurate and conservative.

10.0

CAUSES OF ROD CONTROL SYSTEM PROBLEMS

The team reviewed available information and the sequence of events to arrive at the root

cause of the rod control system failures. Where warranted, the team's determination is

qualified based on the available information.

10.1

HARDWARE PROBLEMS

10.1.1 Multiple Integrated Circuit and Output Transistor Failures

As discussed in section 5.3, the IC failures began to occur when three of the group counters

were replaced with models that had different electrical characteristics and generated higher

reverse EMFs than the original counters. The higher voltage spikes coincident with the loss

of the surge suppression capability resulted in the circuit components being subjected to

voltages significantly in excess of their design ratings. The surge suppression circuitry was

suspected to have been lost as the result of an open circuit caused by spread pins on the card

cage connector. These voltage spikes also exceeded the voltage rating of the output

transistors on the relay driver cards. While a bad pin connection could not be definitely

identified, symptoms of the loss of the suppression circuit were identified during the

troubleshooting and operation. For example, on one occasion condensation was seen on a

group counter face in the main control room.

This same condition was observed during *

bench testing when the suppression circuit was disconnected and the circuit card output

transistor went into saturation resulting in continuous energization and subsequent heatup of

the counter coil.

The team concluded the unsuppressed voltage spikes were the most probable cause of the IC

and transistor failures. Attachment 5 contains a list of failed components including those

likely to have failed as a result of these voltage spikes (electrical overstress).

10.1.2 Regulation Circuit Board Short Circuits

As discussed in section 4.4, the stationary gripper regulation circuit card was found to have

short circuits on the back of the card that was caused by slivers of the solder trace crossing

between parallel solder runs. The slivers appeared to have been present as a result of

problems during the manufacturing process and may have been disturbed during card

cleaning as part of the preventive maintenance work. PSE&G evaluated potential effects of

the short circuits and determined that they would have resulted in reduced current to the

stationary gripper coils and at the same time would have prevented an urgent alarm. The

26

reduced stationary gripper coil current could result in dropped control rods. The slivers

causing the short circuits were very fine and, therefore, susceptible to movement due to

expansion 'effects of temperature and physical orientation of the card, thereby maldng the

possibility of an intermittent fault more likely.

The team concluded that the circuit card short circuits were the most probable cause of the

dropped rods on May 28, 1993.

10.1.3 1/0 AC Amplifier, 1/0 Receiver and Slave Cycler Logic Card Failures

May 29, 1993, troubleshooting was being performed in an effort to determine the cause of

the dropped control rods that occurred on May 28. During troubleshooting, the 100 VDC

power supply fuses were found to have blown and a -15 VDC power supply auctioneering

diode was found to have failed. It is suspected that technicians installing test equipment to

support this troubleshooting caused a short circuit when, at one point, they were routing

leads through the power cabinet while having one end already connected. The short may

have resulted in 100 volts being applied to the -15 volt bus and would explain the failure of

components on several I/O AC amplifier circuit boards, the I/O Receiver card, and the slave

cycler logic card.

Based on the proximity of these failed components, relative to the counter and their

associated voltage spikes, the team concluded that a short circuit of the power supply was the

most likely cause of these failures. However, the effects of the unsuppressed voltage spikes

could not be absolutely eliminated.

10.1.4 Q9 Transistor Failures

As discussed in section 5.3.3, during testing of the rod control system following the

replacement of the logic cabinet circuit cards, a failed transistor was identified on both the

signal processing and the alarm cards in the SCD power cabinet. The test procedure directs

the removal of two failure detector cards and the installation of jumpers in their associated

pin connectors. The location of these cards and the associated jumpering is immediately

adjacent to the cards that experienced the failures. The jumper installation and removal is

performed with the cabinet energized and the licensee has identified locations where

momentary mislocation of the jumper or contacting the circuit cards with the jumper would

result in high currents through the transistors resulting in their failure.

The team concluded that high transistor currents, as the result of jumpering activities, was

the likely cause of the Q9 transistor failures.

10.1.5 Failure Detector Card Resistor

Due to the failure of the Q9 transistors, (discussed in sections 5.3 and 10.1.4), PSE&G

replaced thirteen circuit cards in the SCD power cabinet, including the failure detector cards.

27

On July, 25, 1993, during the system retest, an urgent alarm was received when attempting

to obtain current traces with dummy coils installed in place of the control rod drive coils.

Troubleshooting efforts isolated the problem to the failure detector card, one of the

replacement cards, which was then removed for inspection and bench testing. The followup

investigation by PSE&G identified that the value of the Rl 8 resistor was 50 kn instead of the

intended design value of 511 kn. With the incorrect resistor installed, the failure detector

card permitted the urgent alarm to actuate prematurely. The card had been obtained from

the plant spare parts inventory and apparently had_ never been installed in the RCS since with

the wrong resistor installed it would have caused urgent alarms during normal system

operation.

The team concluded that the most probable cause of this failure was a manufacturing error,

although the possibility that PSE&G replaced the resistor at some previous date could not be

eliminated.

10.2

ROD CONTROL SYSTEM DESIGN

The team reviewed the design of the RCS and concluded that the following design features

were contributing factors to the 1SA3 rod withdrawal problem and the hardware failure root

causes discussed in section 10.1.

10.2.1 Single Failure Vulnerability

On May 27, 1993, one control rod withdrew while control rod insertion was being

attempted. This unexpected out motion was the result of a logic circuit component failure

that resulted in the movable gripper coil, the lift coil, and the stationary coil being energized

simultaneously contrary to what occurs during normal control rod motion. PSE&G

subsequently confirmed that the failure of a single gate on an integrated circuit could result in

the current signals experienced on May 27 and, therefore, could result in rod out motion

when in motion is desired. This postulated failure was installed in the training center rod

control system; and, during testing, the single failure was verified to cause outward rod

motion with insert demand. The potential for a single failure to challenge the fuel design

limits may be contrary to the requirements of General Design Criteria 25 of 10 CFR 50,

Appendix A. The safety significance of this vulnerability is discussed in section 9.0, and the

potential generic implications are discussed in section 11. 0 of this report.

10.2.2 Voltage Suppression Circuit

As discussed in section 5.3, the surge suppression diodes, which are used to protect the solid

state components from voltage spikes that are generated by the group step counters during

normal operation are mounted on the relay driver cards. Several diodes have a common

connection that utilizes pin 4 on the relay driver circuit card as the connection point to the

circuit. In the event a poor connection develops between the card pins and the pins in the

card cage connector, the suppression diode is prevented from performing its function and

28

high voltage spikes are seen by the solid state components. These voltage spikes will result

in the degradation and failure of the components. The effects of the loss of the pin 4

connection may not impact the system operation and, therefore, be undetectable until solid

state component failures occur and result in RCS malfunctions. The licensee has eliminated

this vulnerability to a poor pin connection by installing additional suppression diodes directly

on the group step counter terminals and is considering the installation of digital counters in

the future. The digital counters do not contain coils and, therefore, would remove the source

of the high voltage spikes and eliminate the need for a suppression circuit.

10.2.3 Group Step Counter Compatibility

As discussed in section 5.3, during rod control system preventive maintenance and testing

during the refueling outage, four group step counters were replaced. Three of the

replacement counters are a different model than the installed counters. PSE&G reviewed this

difference with the counter manufacturer and Westinghouse and concluded that they were

suitable replacements. The coils in the new counters have the same design operating voltage;

however, the coil resistance and inductance ratings of the coils are different. The difference

in the electrical characteristics resulted in higher reverse EMFs being generated than those

developed by the original counters. These higher reverse EMFs, coincident with the loss of

the surge suppression circuit, result in damage to the solid state components. While under

normal conditions the new model counter is compatible with the existing rod control system,

the new counters make the surge suppression circuitry essential.

10.3

LICENSEE POLICY AND PERFORMANCE

The Salem station does not have well-defined policies and procedures governing the

determination of root causes of equipment failures. The lack of such a policy and associated

procedures contributed to the faulty root cause determination and management decisions to

attempt startups without the problem being resolved. Refer to section 7 .0 of this report for

additional discussion.

11.0

POTENTIAL GENERIC IMPLICATIONS

The inadvertent withdrawal of a single rod (1SA3) during the May 27, 1993, startup of

Salem 2, could have been caused by a single failure. This is in conflict with the Salem

Updated Final Safety Analysis Report Section 15.3.5.1, which states that no single failure

could cause a single Rod Control Cluster Assembly (RCCA) withdrawal. Also 10 CFR 50,

Appendix A, "General Design Criterion for Nuclear Power Plants," Criterion 25 requires

that fuel design limits should not be exceeded as the result of a single malfunction of a

reactivity control system.

The rod control system installed at Salem 2 is used at all Westinghouse designed plants

except the Haddam Neck plant. Hence, the failure that occurred at Salem 2 has generic

implications. The NRC issued Information Notice 93-46 on June 10, 1993, to inform

29

licensees about the potential problem with Westinghouse designed rod control systems. The

NRC has also activated the Westinghouse Owners Group (WOG) Regulatory Response Group

(RRG) to address the concerns. On June 11, 1993, Westinghouse issued a Nuclear Safety

Advisory Letter (NSAL)93-007, to all domestic Westinghouse nuclear power plants. A

meeting was also held with the WOG at NRC headquarters to discuss their preliminary

findings.

According to the WOG, the failure of the RCS at Salem 2 is a Condition III event and not a

Condition II event, as defined by ASNI Standard Nl8.2, based on the fact that a small

fraction of failed fuel is an acceptable consequence for this event. Based on the analysis

done by Westinghouse, the WOG considers that the RCS meets the ANS condition III

acceptance criteria. The NRR staff has issued Generic Letter 93-04 on June 21, 1993, to all

Westinghouse plants for action and B&W and C-E plants for information. The staff is

requesting licensees who operate Westinghouse designed plants to ensure that the licensing

basis for their plant is satisfied and provide a supporting discussion for that assessment. If

the licensing basis is not satisfied, then the licensees are requested to provide the short-term

and long-term, corrective actions that will be taken. Thus, the generic aspects of this event

will be reviewed by NRR for all Westinghouse plants. NRR is also evaluating if this event

could also occur at B &Wand C-E plants.

12.0

OTHER FINDINGS/OBSERVATIONS

In addition to the findings discussed in previous sections, the AIT had the following

observations:

Work orders did not always contain the necessary engineering evaluations for closure.

Specifically, during work order 931029004, a question was raised about the insulation

resistance value between the cable shield and ground and the completed work package .

did not document the resolution of this question. Also, during work on CRDM cable

P-12 (WO 930330102), the cable was repaired using fiberglass tape, when the work

package specified Raychem heat shrink, and an engineering disposition for the

alternate repair method was not documented.

Printed circuit cards were handled in a manner that could result in stressing and

potentially damaging solder connections and/or components. (Section 5.3.5)

Precautions were not taken to avoid damage to circuit card components that could

result from electrostatic discharge. (Section 5.3.5)

The team had the following positive observations:

Plant housekeeping and the material condition of the plant was excellent, in particular,

the relay room and rod control cabinets.

30

The PSE&G technical training center was a valuable resource for the investigation

and confirmation of the rod control system problems. The center was well-equipped,

staffed and managed.

The control room operators performed well when responding to the numerous rod

control system (RCS) problems encountered during the attempted startups. The

operators displayed good plant awareness during the May 25 and May 27 startups by

promptly recognizing the abnormal RCS behavior and properly implementing the

abnormal and emergency procedures for the dropped control rod event of May 28.

13.0

EXIT MEETING

A formal, public exit meeting was held with PSE&G management on July 7, 1993, at the

Artificial Island Processing Center. The slides used during the meeting are included as

Attachment 6. Exit meeting attendees are included as Attachment 7. People contacted

during the inspection are listed in Attachment 8. The team submitted written questions to

PSE&G during the inspection. The.se questions will be placed in the Public Document Room

under a separate cover.

Attachments:

1. Memo dated June 7, 1993, from Thomas T. Martin to Wayne W. Hodges

2. Logic Cabinet Automatic and Manual

3. Sequence of Events

4. Circuit Card Locations Experiencing Multiple Failures

5. List of Failures/Cause

6. NRC Augmented Inspection Team Exit Meeting

7. Exit Meeting Attendees

8. Persons Contacted

9. Acronyms and Initialisms

Docket Nos. 50-272

50-311

ATTACHMENT 1

UNITED STATES

NUCLEAR REGULATORY COMMISSION

REGION I

475 ALLENDALE ROAD

KING OF PRUSSIA, PENNSYLVANIA 19406-1415

MEMORANDUM FOR:

Wayne M. Hodges, Director, Division of Reactor Safety

FROM:

Thomas T. Martin, Region Administrator

SUBJECT:

AUGMENTED INSPECITON TEAM CHARTER FOR REVIEW

OF ROD CONTROL SYSTEM FAILURES AT SALEM 2

Due to control rod system failures at Salem Unit 2, NRR and AEOD senior management and

I have determined that an Augmented Inspection Team (Afl) inspection should be conducted to

review the causes, safety implications, and associated licensee actions which led to or resulted

in the failure of the control rod system to function as expected to support the startup of Salem

Unit 2; and verify the circumstances and evaluate the significance of this event.

The Division of Reactor Safety (DRS) is assigned the responsibility for the overall conduct of

this Augmented Inspection. William Ruland, Chief, Electrical Section, DRS, is appointed as

Augmented Inspection Team Leader (Other AIT members are identified in Enclosure 2). The

Division of Reactor Projects (DRP) is assigned the responsibility for resident and clerical

support, as necessary; and the coordination with other NRC offices, as appropriate. Further,

the Division of Reactor Safety, in coordination with DRP, is responsible for the timely issuance

of the inspection report, the identif1cation and processing of potentially generic issues, and the

identification and completion of any enforcement action warranted as a result of the team's

review.

Enclosure 1 represents the charter for the Augmented Inspection Team and details the scope of

the inspection.

The inspection shall be conducted in accordance with NRC Management

Directive (MD) 8.3, NRC Inspection Manual 0325, Inspection Procedure 93800, Regional Office

Instruction 1010.1, and this memorandum.

This Augmented Inspection Team is being established and chartered in response to repeated

failures of the Rod Control System to function as expected and required during the startup of

Salem Unit 2 between May 25 and June 4, 1993.

Based on available information and

preliminary review by the licensee and their vendor, Westinghouse, it is not evident that the root

cause and circumstances leading to these repeated malfunctions have been sufficiently understood

or analyzed. Further, there is the potential that the licensee's own troubleshooting activities may

have caused or exacerbated the malfunctions. Additionally, there is also a possibility, as

demonstrated by one of the failures, that a single failure could result in the inadvertent and

unexpected withdrawal of a single control rod cluster assembly when an insert demand signal

is present. This conflicts with the design basis of the specific facility and may have safety

implications involving other Westinghouse designed facilities that use similar rod control

systems.

2

An AIT to review this matter is appropriate since the event involves possible adverse generic

implications; and is complicated, difficult to understand, and has an unknown probable cause.

Enclosures:

1.

Augmented Inspection Team Charter

2.

Team Membership

cc w/encls:

J. Taylor, EDO

J. Sniezek, OEDO

T. Murley, NRR

J. Partlow, NRR

W. Russell, NRR

J. Calvo, NRR

C. Miller, PD 1-2, NRR

F. Miraglia, NRR

C. Berlinger, NRR

J. Wermeil, NRR

J. Richardson, NRR

A. Thadani, NRR

B. Grimes, NRR

B. Boger, NRR

E. Jordan, AEOD

D. Ross, AEOD

L. Wheeler, OEDO

J. Larkins, ACRS

J. Lieberman, OE

W. Kane, DRA, RI

C. Hehl, DRP, RI

J. Wiggins, DRP, RI

J. White, DRP, RI

. R. Cooper, DRSS, RI

S. Shankman, DRSS, RI

T. Johnson, SRI, Salem/Hope Creek

J. Stone, PD 1-2, NRR

W. Ruland, DRS, RI

J. Durr, DRS, RI

W. Hodges, DRS, RI

L. Bettenhausen, DRS, RI

E. Wenzinger, DRP, RI

K. Abraham, PAO, RI

M. Miller, SW, RI

~~?

Thomas T. Martin

Regional Administrator

ENCWSUREl

Salem Generatin~ Station. Unit 2

Rcxl Control System Failures

Au~mented InS£>eci:ion Te.am <Am Charter

The general objectives of this AIT are to:

1.

Determine the specific circumstances and causes of e.ach individual rod control system

component failure.

2.

Based on the evaluations of the specific component failures, determine the root causes

of the multiple rod control system failures.

3.

Develop a detailed se.quence of events related to the failures.

4.

Determine and evaluate any changes made in the design, maintenance, testing, or

operation of the rod control system prior to the failures.

5.

Assess the safety significance of the rod control system failures, including any postulated

single failures that could result in a single control rod withdrawal when a control rod

insert signal is present.

6.

Determine the adequacy of Public Service Electric & Gas Company's (PSE&G)

maintenance and troubleshooting practices relative to the rod control system, including

vendor interface and control.

7.

Evaluate PSE&G's decision-making concerning attempted plant restarts after the failures

and repairs.

8.

Identify any potential generic implications posed by the rod control problems experienced

at Salem.

9.

Prepare a report documenting the results of this review for signature by the Regional

Administrator within 30 days of the completion of the inspection.

ENCWSURE2

Salem AIT Membership

William Ruland, AIT Leader, Section Chief, Division of Reactor Safety (DRS), Region I (RI)

M. Eugene Lu.arowitz, Reactor Engineer, DRS, RI

Steve Barr, Resident Inspector, Salem, DRP, RI

Hukam C. Garg, NRR

Qbserver

Dave Vann, New Jersey Bureau of Nuclear Engineering

Other NRC personnel, consultants, or contractors will be engaged in this AIT, as needed.

)

ATTACHMEN,J 2

)

'

LOGIC CABINET AUTOMATIC AND MANUAL

Rod Barile

S1l..ctor Swilch

rf

Min..

~.CSA

seo..

...cge

SBC~ Q ,.tee

sea*

  • C8D

SBA*

' I

I

Ccwilro4 &

--

SDRCICIBril

~

-

Alarm -

Cktl

n

hn Urgtn*I

Flitufe

~

  • ...J

I Alarm I

~ ' , ,

,,..,.,

~

sco

  • r

Logic ~

I low~ I

!mi

Bank

Sel*ctor

CONTROL RODS

-

SID Bank - Al + A2

Bl + B2

C + D

Control Bank - Al + A2

Bl + B2

Cl + C2

e91 + D2

'

In

--

Oul

~ .

In Out

.

, t

I *

Supemaor

Circuit

  • ~ i*

Pulaer *

Slave

'

Pow*r

Cab

2SCD

SID C + D

Speed

Supervisor

I

Ci'cult In/Ou! I

-

Pulser - 48-432

1

' Pulstt

Slave

In/Out

I '

Select

.

MHltr

-

-

C~cJer

~

In/Ou'

I

coo

Oo

-

Counter

Mulliple1e

~c

--

Pulles

I

In/Out

'

, ' L. Slave

Stave

Cycler

Cyder

TAC

1BD

Cooent

Orders

1 *

1 Ir

Power t.J1ower

Cab

2.JAC

Cont Al

Cont Cl

SID Al

Cab

2.IBO

Cont Bl

Cont Dl

SID Bl

-

......

-

L. Sl~;n.

~ .

Stave

Cycler

Cycler

2AC

I t

Power

Cab

2.2AC

Cont A2

Cont C2

SID A2

2BD

Multipfeic

I t

-

~ I

Power

Cab

2280

Cont B2

Cont D2

SID B2

'

1 . .

Bank

Overtap

Unit

-

C31oup

Select

. ,

LOGIC

CABINET

'

t

POWER

CABINET

ATTACHMENT 3

Seguence of Events

Note: This sequence of events was developed from information collected during the course of

the inspection through interviews with PSE&G and contractor personnel, reviews of

documents and information provided by the SERT.

Mq,rch 1~. 1993

Unit 2 reactor trip occurred and the refueling outage commenced.

March 18, 1993

Westinghouse letter sent to PSE&G specifying terms of offer for rod control system

preventive maintenance work. PSE&G subsequently contracted Westinghouse to

perform work.

March 26, 1993

Westinghouse commenced circuit board inspection as part of the preventive

maintenance work on the rod control system.

April 6, 1993

Inspection, repair and bench test of logic cabinet and power cabinet circuit cards was

completed. System logic tested satisfactorily. Rod control system motor-generator

set (MG) preventive maintenance commenced.

April 10, 1993

Westinghouse technicians left Salem site when their work was delayed due to power

availability problems for the rod control systems. Printed circuit card testers were

removed from site for use at another facility.

May JO, 1993

Westinghouse _technicians returned to the Salem site t.o resume rod control system

functional testing.

May 14, 1993

Group step counters SDA-1 and CBA-1 failed their functional test. Counters CBB-1

and CBC-1 marginally passed the test.

May 16, 1993

Group step counters SDA-1, CBA-1, CBB-1 and CBC-1 were replaced with one

counter obtained from Salem spare parts inventory and three counters obtained from

the Turkey Point plant spares. During the retest of the counters several circuit card

components were found to be failed and were replaced.

May 20, 1993

Westinghouse circuit card tester returned to Salem site.

May 24, 1993

MG set testing was completed and individual rod position indicator (ARPI)

calibrations and rod drop testing commenced. CBA Group 1 did not move and an

Urgent Failure alarm was reeeived. CBC Group 1 counter responded to in rod

motion but not out. A short circuit at an indicating light was repaired and a circuit

card required replacement.

May 25, 1993

During ARPI calibrations, CBB Group 1 and 2 counters did not operate properly and

were replaced with two of the old counters which had been removed on

May 16, 1993. ARPI calibrations and rod drop testing was completed and a reactor

startup commenced.

May 26, 1993

Salem operators completed withdrawal _of all shutdown bank control rods to 225 steps

and commenced control bank rod withdrawal. CBC Group 1 step counter stopped

advancing at 31 steps. All control rods were inserted. Several failed integrated

circuits (IC) and diodes were identified. Numerous card repairs and replacements

were performed to correct problems.

May 27, 1993

Following repairs to circuit cards, the control rod group counters were tested by

performing a 228 step withdrawal and insertion without latching the control rods.

During a subsequent startup attempt, SBA was withdrawn to 20 steps as indicated on

the group counter with no corresponding response observed on the ARPI indicators.

During rod insertion, the ARPI indication for control rod 1SA3 indicated the rod had

withdrawn to 15 steps. The fuses for the l_SA3 control rod stationary gripper coil

were removed and the ARPI position indication went to zero.

May 28, 1993

Troubleshooting determined that failures on the circuit cards resulted in the movable

gripper coil and lift coils being energized simultaneously. Additional circuit board

problems were identified and repaired.

PSE&G personnel conferred with Westinghouse representatives via phone and were

advised that control rod 1SA3 *would not have been damaged during the improper

operation on May 27, 1993.

Rod control system testing was performed satisfactorily by performing a 20 step

withdrawal and insertion for each rod control bank, and a reactor startup commenced.

The control rods were withdrawn without incident.

While diluting the reactor coolant boron concentration to achieve criticality, all four

CBC Group 1 control rods dropped to the fully inserted position. The remaining

control rods were then manually tripped by the reactor operator as required by plant

procedures.

May 29-30, 1993

Troubleshooting was performed, but the cause of the dropped rods could not be

determined. Four circuit cards whose failure could have caused the dropped rods

were replaced.

During troubleshooting, the 100 VDC and -15 VDC power supplies were apparently

short circuited, resulting in blown fuses in the 100 VDC supply and a failed

auctioneering diode in the -15 VDC supply. The fuses and failed diode were

replaced. A transistor was found failed and replaced as well as failed components on

several other circuit cards.

May 31 and June 1, 1993

Additional group step counter problems continued to occur during troubleshooting and

testing. Numerous circuit card problems were identified and repaired.

June 2, 1993

Rod control system testing was performed, including counter operation, current order

and ARPI checks. Testing was completed satisfactorily.

Reactor startup commenced. Pulse-to-analog converter discovered to be failed when

Rod Insertion Limit annunciator failed to clear during CBD rod withdrawal.

June 3, 1993

The shift supervisor directed all rods to be inserted while troubleshooting was

performed. Faulty IC chips on data logging card were identified and replaced. The

system was retested satisfactorily and a reactor startup commenced. Criticality was

achieved at 3:05 p.m.

NRC Region I inspector arrives on site.

  • June 4, 1993

Plant management decided to shut the plant down when it became evident that the

improper rod motion on rod 1SA3, encountered on May 27, 1993, may have been the

result of a single failure, resulting in operations outside the plant design basis.

June 5, 1993

NRC Augmented Inspection Team (AIT) arrived on site to investigate rod control

system problems.

June 8, 1993

The NRC issued a Confirmatory Action Letter (CAL) to PSE&G which affirmed the

licensee's commitment to 1) maintain Unit 2 shut down until the rod control system .

problems were investigated and satisfactorily resolved and 2) to prepare a Justification

for Continued Operation (JCO) for Unit 1 to documel)t their technical basis for

assuring that Unit 1 will continue to operate safely.

Unit 1 tripped due to loss of circ.ulating water and was maintained in hot standby

while the Unit 2 rod control system problems continued to be investigated.

June 17, 1993

AIT completed its initial on site inspection.

June 18, 1993

PSE&G met with NRC Region I management to present the results of their

investigation of the rod control system failures and planned corrective actions.

Unit 1 startup commenced following the addition of surge suppression diodes on the

group step counters.

June 19, 1993

Unit 1 achieved criticality. Rod control system operation normal.

June 20 and 21, 1993

PSE&G installed new circuit cards in logic cabinet, installed surge suppression diodes

on the group step counters, inspected and repaired all power cabinet circuit cards.

June 22, 1993

Rod control system testing commenced which required jumper installation and

removal in the power cabinets.

June 23, 1993

Transistor failures discovered on the signal processing and alarms cards in the SCD

power cabinet. Transistors replaced and system testing completed.

Region I NRC inspector arrived on site to review problem with failed transistors and

to observe current trace tests.

June 24, 1993

Current traces completed satisfactorily. Root cause evaluation for failed transistors

continued.

June 25, 1993

Thirteen circuit cards were replaced in the SCD power cabinet and system testing

commenced. An Urgent Failure alarm occurred while attempting to record current

traces with dummy coils installed. Testing stopped and troubleshooting deferred until

the following morning.

June 26, 1993

Troubleshooting was performed and the cause of the Urgent Alarm found to be an

incorrect resistor installed on one of the new sen cabinet circuit cards.

June 27, 1993

System testing and current traces were completed satisfactorily.

CAL actions complete.

June 28, 1993

AIT inspection completed.

Unit 2 reactor startup commenced, control rods functioned normally.

-.

May 16

May24

May 26

May 27

May 31

June 1

ATIACHMENT 4

CIRCUIT CARD LOCATIONS EXPERIENCING MQLTIPLE FAILURES

SUPERVISORY DATA LOGGING CARD CA113 Location)

Replaced card Westinghouse Serial Number (WSN) 1S3 with WSN 217

Repaired card WSN 217 by replacing integrated circuit chips ZS, Z9

and Z12

Replaced card WSN 217 with WSN 6014

Replaced card WSN 6014 with WSN 039

Reinstalled card WSN 6014 after replacing Z3 and Z6

Replaced Z2, ZS and ZS on card WSN 6014

Installed card WSN 039 after replacing Z2, Z3, and Z8

Repaired card. WSN 0039 -- replaced Z2

Card WSN 039 failed again -- replaced Z2 and Z3

Repaired card WSN 6014 (replaced Z2, Z5 and Z6) and then tested

WSN6014 and WSN 039 at the training center. Replaced one chip on

WSN6014

Installed card WSN 0039

Replaced the card with WSN 6014 (After replacing Z3 and Z6)

Repaired WSN ~14 -- replaced Z3 and Z5

Repaired WSN 6014 -- replaced Z3

Repaired WSN 6014 -- replaced Z3

Repaired WSN 6014 -- replaced Z3

SUPERVISORY DATA LOGGING CARD (Al 14 Location)

May 16

Repaired card WSN 216 -- replaced Z3

May 27

Replaced card WSN 216 with WSN 0039

Repaired and reinstalled card WSN 0039 -- replaced Z3

Replaced the card with WSN 216

May 31

Replaced the card with WSN 0183 (Z3 failed)

Repaired WSN 0183 -- replaced Z3

June 1

Repaired WSN 0183 -- replaced Z3

June 3

Repaired WSN 0183 -- replaced Z3

May 16

May26

May27

May 16

May26

May 27

  • * *

IN/OUT RELAY DRIVER CARD (A 713 Location)

Replaced card WSN 132 with WSN 120

Swapped with card WSN 0133 from the A714 location

Replaced the card with WSN 701

Installed card WSN 0120

Installed card WSN 0345

Swapped with the card in A714 location

Cage pin 34 spread open -- closed pin

IN/OUT RELAY DRIVER CARD (A714 Location)

Replaced card WSN 139 with WSN 133

Swapped with card WSN 120 from A713 location

Replaced the card with WSN 695

Replaced the card with WSN 681

Replaced the card with WSN 695

Restored with card WSN 681

Installed card WSN 0342

Swapped with the card in A713 location

Reinstalled card WSN 0342

ATTACHMENT 5

LIST OF FAILURES/CAUSE

DATE/TIME/EVENT

FAILURE

CORRECTIVE ACTION

CAUSE

May 14

Group counters:

Replaced with one with

Normal wear

spare and three from

Preventive Maintenance

CBA-GRl

Turkey Point

CBB-GRl

CBC-GRl

SBA-GR2

May 16

Al 13 (Note l)

Replaced card

Electrical overstress (The

new counters and loss of

Counter Testing

A114

Replaced Z3 (Note 2)

suppression circuitry resulted

in solid state components

A113

Replaced Z8, Z9 & Zl2

being subjected to voltage

spikes in excess of their

A713

Replaced card

design rating)

A714

Replaced card

May24

Resistor in 22AC

Replaced resistor and

Short circuit caused by

Power cabinet

light socket

installation of the wrong type

Operations Surveillance

of light socket during

preventive maintenance.

A113

Replaced card

Electrical overstress

May 25

Group counters

Replaced with counters

Transistor saturation due to

CBB-GRl and

previously removed on

loss of suppression circuit

ARPI Calibration

CBB-GR2

May 16

May 26

A714

Replaced A714 card with

Electrical overstress

spare

2:00 a.m.

Plant Startup

DATE/TIME/EVENT

FAILURE

CORRECTIVE ACTION

CAUSE

May 26

Al13

Replaced card

Electrical overstress

8:00 a.m.

A713

Replaced the cards

A714

Troubleshooting

A113

Replaced the card

A113

Replaced Z2,Z5,Z8

A113

Replaced Z2

Al13

Replaced Z2,Z3

May 26

Al13

Replaced Z2,Z3

Electrical overstress

11:50 p.m.

Troubleshooting

May 27

A713

Swapped A713 and

Spread pin in A 713 card

A 714, corrected spread

cage

5:00 a.m.

pin

Troubleshooting

A113

Replaced with spare

Electrical overstress

May 27

Al14

Replaced with spare card

Electrical overstress

then replaced Z3 when

8:40 a.m.

problem occurred again

Troubleshooting

A113

Replaced Z3, Z6

May 27

A710

Replaced diode

Electrical overstress

1:05 p.m.

Troubleshooting

May 27

A114

Replaced with Spare

Electrical overstress

14:30

Troubleshooting

DATE/TIME/EVENT

FAILURE

CORRECTIVE ACTION

CAUSE

May 27

Rod 1SA3 moved

Replaced with new cards

Electrical overstress

15 steps up while

18:44

other stayed in

bottom when A511

Reactor Startup

and A501 failed

May 28

Control bank C -

Replaced firing, phase,

Short circuits on regulation

Group 1 dropped

regulation and 1/0 ac amp

card caused by slivers of

6:12 p.m.

cards associated with this

solder run

rod group.

Reactor Startup

May 30

Fuses FUll and

Replaced fuses

Technicians caused a short

FU6 blown on 100

circuit while installing test

3:00 a.m.

VDC power supply

equipment

Troubleshooting

-15 VDC

Replaced diode

Auctioneering diode

May 30

A803

Swapped card from A805

Technicians shorted 100

slot

VDC power to -15 VDC

. 4:30 p.m.

during test equipment

installation (Note 3)

Troubleshooting

May 30

Failed bench test:

Cards repaired

100 VDC to -15 VDC short

circuit (Note 3)

8:40 p.m.

A808

A812

Troubleshooting

A813

A814

May 31

Failed bench test:

Replaced transistor Q12

100 VDC to -15 VDC short

circuit (Note 3)

2:45 a.m.

A809

Troubleshooting

May 31

MXR2 Relay

Found and corrected

Spread pin in 22BD cabinet

spread pin

8:40 a.m.

Troubleshooting

DATE/TIME/EVENT

FAILURE

CORRECTIVE ACTION

CAUSE

May 31

Failed bench test:

10:45 a.m.

A114

Replaced card (Z3 failed

Electrical overstress

however card not

Troubleshooting

repaired due to lifted

trace)

Al13

Replaced Z3 and Z5

May 31

A113

Replaced Z3

Electrical overstress

1:00 p.m.

Al14

Replaced Z3

Troubleshooting

May 31

Al13

Replaced Z3

Electrical overstress

3:10 p.m.

Troubleshooting

May 31

A514

Replaced with spare

100 VDC to -15 VDC short

circuit (Note 3)

9:35 p.m.

Troubleshooting

June 1

A113

Replaced Z3

Electrical overstress

6:05 a.m.

Al14

Replaced Z3

Troubleshooting

June 1

A207

Replaced with spare

Electrical overstress

7:15 p.m.

Troubleshooting

June 3

A114

Replaced Z3

Electrical Overstress

Reactor Startup

DATE/TIME/EVENT

FAILURE

CORRECTIVE ACTION

CAUSE

June 23

SCD cabinet signal

Replaced Q9 Transistor

Technician caused short

processing and

on each card

circuit during installation of

12:30 a.m.

alarm cards

jumper

System Retest

June 25

Failure detector

Replaced card

Wrong value Rl8 resistor

card in power

installed at time of

10:40 p.m.

cabinet SCD

manufacture (50 kn versus

511 kn)

SCD Cabinet Retest

NOTE 1:

NOTE2:

NOTE 3:

Failures designated AXXX refer to the failure of a printed circuit card in associated with a

specific logic cabinet card location:

Al 13 - Supervisory Data Logging

Al14 - Supervisory Data Logging

A207 - Bank Overlap Logic

A501 - Slave Cycler Decoder

A511 - Slave Cycler Decoder

A514 - Slave Cycler Logic

A 710 - In/Out Relay Driver

A 713 - In/Out Relay Driver

A803 - In/Out AC Amplifier

A808 - In/Out AC Amplifier

A809 - In/Out Receiver

A812 - In/Out AC Amplifier

A813 - In/Out AC Amplifier

A814 - In/Out AC Amplifier

Replacement of components designated ZX refer to the replacement of a particular type of

integrated circuit on a printed circuit card.

While this is the most likely cause for these failures electrical overstress could not be

completely eliminated as a possible cause.

  • --

ATTACHMENT 6

NRC AUGMENTED INSPECTION TEAM

EXIT MEETING

SALEM UNIT 2 ROD CONTROL SYSTEM PROBLEMS

ARTIFICIAL ISLAND PROCESSING CENTER

JULY 7, 1993, 10:00 A.M.

AGENDA

I.

INTRODUCTIONS

M. WAYNE HODGES

II.

PURPOSE OF INSPECTION

M. WAYNE HODGES

III. SEQUENCE OF EVENTS

WILLIAM H. RULAND

IV.

SCOPE OF REVIEW

V.

MOST PROBABLE CAUSES

VI.

PRELIMINARY FINDINGS

VII. SAFETY SIGNIFICANCE

VIII. CONCLUDING NRC REMARKS

M. WAYNE HODGES

IX.

PSE&G REMARKS

STEVEN MILTENBERGER

III. SEQUENCE OF EVENTS

UNIT 2 OUTAGE STARTED MARCH 16. WESTINGHOUSE

CONTRACTED FOR ROD CONTROL SYSTEM PREVENTIVE

MAINTENANCE.

PROBLEMS ENCOUNTERED WITH ROD GROUP POSITION

INDICATORS - REPAIRS. MADE.

IIL SEQUENCE OF EVENTS (CON'T)

MAY 25 - GROUP COUNTER STOPPED COUNTING DURING

ROD MOTION FOR START-UP. REPAIRS MADE.

MAY 27 - ONE ROD NOTED MOVING OUT WITH IN MOTION

REQUESTED. REPAIRS MADE.

MAY 28 - ALL FOUR BANK C, GROUP 1, RODS DROPPED

INTO THE CORE. PSE&G INVESTIGATION STARTED.

REPAIRS MADE, ROOT CAUSE(S) IDENTIFIED BY PSE&G,

TESTING COMPLETED.

JUNE 1 - ROD INSERTION LIMITS NOT RECORDING.

ATTRIBUTED TO INCOMPLETE

TESTING/TROUBLESHOOTING. REPAIRS MADE.

JUNE 3 - STARTUP STOPPED DUE TO SINGLE FAILURE

QUESTIONS.

III.

SEQUENCE OF EVENTS (CON'T)

AIT SENT ON JUNE 5 TO REVIEW EVENTS.

CONFIRMATORY ACTION LETTER ISSUED TO PSE&G ON

JUNE 8.

PSE&G/NRC MANAGEMENT MTG ON JUNE 18.

JUNE 23 A.M. - AFfER CARD REPLACEMENT, FOUND TWO

CIRCIDT CARDS WITH DAMAGED TRANSISTORS. REPAIRED

AND TESTED.

JUNE 25 - URGENT FAILURE ALARM. FOUND CARD WITH

INCORRECT COMPONENT (RESISTOR).

CLOSEOUT OF CONFIRMATORY ACTION LETTER ON JULY 1.

-

JULY 3 - GENERATOR BREAKER CLOSED, OUTAGF.; ENpS.

IV.

SCOPE OF REVIEW

TEAM COMPOSITION - s*MEMBERS - IN EXCESS OF 400

HOURS DIRECT INSPECTION EFFORT

INTERVIEWED MAINTENANCE PERSONNEL,

ENGINEERS, EQUIPMENT VENDOR, AND PSE&G

MANAGEMENT

OBSERVED TROUBLESHOOTING ACTIVITIES, ROD

CONTROL SYSTEM TESTING, AND IMPLEl\\IBNTATION

OF COMPENSATORY l\\IBASURES

OBSERVED OPERATION OF THE ROD CONTROL

SYSTEM SIMULATOR IN THE PSE&G TRAINING CENTER

REVIEWED CHANGES MADE TO THE ROD CONTROL

SYSTEM-

REVIEWED PSE&G RECORDS MADE DURING AND

AFTER THE EVENTS, INCLUDING DETAILED

DOCUMENTATION OF COMPONENT FAILURES

V.

MOST PROBABLE CAUSES

  • voLTAGE SPIKES FROM REPLACEMENT COUNTERS WITH

AN OPEN CIRCUIT TO A SUPPRESSION DIODE REPEATEDLY

DAMAGED SEMICONDUCTOR DEVICES ON PRINTED

CIRCUIT (PC) CARDS.

DAMAGED OVER 30 COMPONENTS.

LED TO THE ABORTED STARTUPS ON MAY 25, MAY 27

ANDJUNE2.

LED TO THE DISCOVERY OF A POTENTIAL SINGLE

FAILURE VULNERABILITY AND POSSIBLE FAILURE TO

MEET GENERAL DESIGN CRITERION 25 (PROTECTION

SYSTEM REQUIREMENTS FOR REACTIVITY CONTROL

SYSTEMS).

  • ~

-: .. ,

,,

V.

MOST PROBABLE CAUSES (CON'T)

CONDUCTIVE WIIlSKER SHORTED PC CARD TRACES

TOGETHER, LEADING TO DROPPED RODS*(MAY 28).

JUMPER REMOVAL SHORTED* CONNECTOR PINS AND PC

CARD TRACES TOGETHER, DAMAGING TRANSISTORS

(JUNE 23).

WRONG VALUE RESISTOR FOUND ON PC CARD (JUNE 25).

VI. PRELIMINARY FINDINGS

  • *

DESIGN WEAKNESS - DIODE TO * -*

PROTECT CIRCUITS SUBJECT TO A

MECHANICAL FAILURE OF

CONNECTOR.

APPLICATION OF ROD CONTROL

SYSTEM DESIGN BASIS KNOWLEDGE

INADEQUATE.

CAUSE DETERMINATION POLICY AND

EXPECTATIONS NOT ESTABLISHED.

-. ~-

.

NO CLEAR CHAIN OF COMMAND AND

RESPONSIBILITIES DURING

TROUBLESHOOTING ACTIVITIES.

VII.

SAFETY SIGNIFICANCE

SINGLE FAILURE - HAD NO ANALYSIS

TO SUPPORT COMPLIANCE WITH THE

GENERAL DESIGN CRITERIA FOR ROD

CONTROL SYSTEMS. ISSUE GENERIC

TO MOST WESTINGHOUSE NUCLEAR

PLANTS.

ROD CONTROL SYSTEM FAILURES

COULD INITIATE TRANSIENTS.

ACTUAL EVENT DAMAGED NON-

SAFETY EQUIPMENT ONLY. REACTOR

PROTECTION SYSTEM UNAFFECTED.

-- ~-

~

NRC ISSUED GENERIC LETTER 93-04 TO*

ALERT LICENSEES AND REQUIRE

REPORTS OF APPLICABILITY AND

ACTIONS PLANNED.

PSE&G

Alfieri, J.

Bachman, M.

Burricelli, R.

Burzstein, M.

Butz, M.

Camp, M.

Chranoski, R.

Cohen, S.

Fogelson, S.

Hagan, J.

LaBruna, S.

Lambert, C.

LaFevre, M.

Mannon, S.

McTigue, W.

Miller, L.

Miltenberger, S.

O'Donnell, P.

Rajkowski, L.

. Shedlock, M.

Thomson, F.

Vondra, C.

Wray, J.

ATTACHMENT 7

Exit Meeting Attendees

USNRC

Barr, s.

Garg, H.

Hodges, W.

Johnson, T.

Lazarowitz, M.

Ruland, W.

Scholl, L.

Schoppy, J.

Stone, J.

White, J.

Zimmerman, J.

Westinghouse

Baird, T.

Baker, T.'

Huckabee, J.

Others

Crowe, P. (Salem Today's Sumbeam)

Duca, P. (Delmarva Power)

Lang, M. (Salem Today's Sunbeam)

Milford, P. (Wilmington News Journal)

Oakes, R. (Atlantic Electric)

Robb, J. (Philadelphia Electric)

Ryan, G. (Radio Station WilC)

Vann, D. (State of New Jersey DEPE/BNE)

Yount, M. (Radio Station WILM)

r

ATTACHMENT 8

Persons Contacted

Public Service Electric and Gas Company

Amicucci, J. I&C Supervisor

Bailey, J., Nuclear Engineering Sciences Manager

Best, D., System Engineer

Budzik, D., Maintenance Engineer

Bursztein, M., Nuclear Electrical Engineering Manager

Carrier, T., Maintenance Engineer

Chranowsk:i, R., Technical Engineer

Cianfrani, W., Onsite Safety Review Engineer

Gallager, R., Operations Engineer

Hagan, J., Vice President - Nuclear Operations

Rambert, C., Manager - Nuclear Engineering Design

Heaton, R., I&C Supervisor

Karimian, S., Technical Consultant

Kent, R., Nuclear Fuels Engineer

LaBruna, S., Vice President - Nuclear Engineering

Lowry, W. D., System Engineer

Miller, L., SERT Manager

Miltenberger, S., Vice President and Chief Nuclear Officer

Panko, M., Maintenance Engineer

Pastva, J., LER Coordinator

Pike, K., Technical Manager (Acting)

Rajkowski, L., I&C Supervisor (Acting) E&PB

Robinson, E., Nuclear Training Department

Ronafalny, J., SERT Member

Shedlock, M., Maintenance Manager

Thomas, B., Licensing Engineer

Thompson, F., Manager - Licensing and Regulation

Villar, E., Licensing Engineer

Vondra, C., General Manager - Salem Operations

Westinghouse

Baker, T., Nuclear Safety Licensing Engineer

King, T., Field Service Technician

Kisic, J., Field Service Technician

Huckabee, J., Salem Site Representative

Pysnick, J., Rod Control System Engineer

Katz, D., Design Engineer

AIT

CAL

CPR

CRDM

DCP

DNBR

EMF

GDC

GM-SO

I&C

IRPI

JCO

LER

NSS

OHA

P-250

PM

PC

PSE&G

RCCA

RCS

RD

RP

RPS

SCD

SDL

SE

SER

SERT

SNSS

SORC

SOER

STA

TS

UFSAR

USQ

WO

ATI'ACHMENT 9

Acronyms and Initialisms

. Augmented Inspection Team

Confirmatory Action Letter

Code of Federal Regulations

Control Rod Drive Mechanism

Design Change Package

Departure From Nucleate Boiling Ratio

Electromotive Force

General Design Criteria

General Manager - Salem Operator

Instrumentation and Control

Individual Rod Position Indication

Justification for Continued Operation

Licensee Event Report

Nuclear Shift Supervisor

Overhead Annunciator System

Plant Process Computer

Preventive Maintenance

Printed Circuit

Public Service Electric and Gas

Rod Control Cluster Assembly

Rod Control System

Relay Driver

Reactor Protection

Reactor Protection System

Slave Cycle Decoder

Supervisory Data Logger

Safety Evaluation

Sequential Events Recorder

Significant Event Response Team

Senior Nuclear Shift Supel'Visor

Site Operating Review* Committee

Significant Operating Event Report

Shift Technical Advisor

Technical Specification

Updated Final Safety Analysis Report

Unreviewed Safety Question

Work Order