ML073400424
ML073400424 | |
Person / Time | |
---|---|
Site: | Indian Point |
Issue date: | 05/31/1993 |
From: | Stone & Webster Engineering Corp |
To: | Office of Nuclear Reactor Regulation |
References | |
Download: ML073400424 (580) | |
Text
{{#Wiki_filter:_ _ _ _ ~",,,,.,,:,~~"", . .,~ :_._, ",IC">~
.'.' ~,,:~ ,1 I
JI'R t..i brr..f~iljsz1qO ~;" I ALTERNATIVE MITIGATION MEASURES I CWSED COOLING SYSTEMS ALTERNATIVE I I BOWUNE ELECfRIC GENERATING STATION ORANGE & ROCKLAND UTIUTIES, INC. I AND ROSETON GENERATING STATION CENTRAL HUDSON GAS & ELECfRIC CORP. I AND INDIAN POINT 2 NUCLEAR GENERATING STATION CONSOUDATED EDISON COMPANY OF NEW YORK I AND INDIAN POINT 3 NUCLEAR GENERATING STATION I NEW YORK POWER AUTHORITY I I I MAY 1993 I I I I STONE & WEBSTER ENGINEERING CORPORATION I I I
I I T ABLE OF CONTENTS I SECDON PAGE I 1. Introduction 1 Objectives 1 1.1 I 1.2 Current Cooling System Technologies 1 1.3 Environmental Aspects - An Overview 5 I 1.4 Study Approach 8
- 2. Indian Point 2 Alternative Closed-Cycle Cooling Systems 14 I 2.1 Summary Description of Existing Plant 14 2.2 Natural Draft Cooling Towers 15 I 2.2.1 Engineering Aspects 15 2.2.2 Cost Estimates 15 I 2.2.3 Environmental Aspects 17 Mechanical Draft Cooling Towers 18 I 2.3 2.3.1 Engineering Aspects 18 I 2.3.2 Cost Estimates 2.3.3 Environmental Aspects 19 20 I 3. Indian Point 3 Alternative Closed-Cycle Cooling Systems 3.1 Summary Description of Existing Plant 22 22 I 3.2 Natural Draft Cooling Towers 3.2.1 Engineering Aspects 23 23 I 3.2.2 Cost Estimates 3.2.3 Environmental Aspects 23 24 I 3.3 Mechanical Draft Cooling Towers 3.3.1 Engineering Aspects 26 26 I 3.3.2 Cost Estimates 3.3.3 Environmental Aspects 26 27 I
0377700.002 I I
I I 4. Bowline Alternative Closed-Cycle Cooling Systems 30 30 4.1 Summary Description of Existing Plant I 4.2 Natural Draft Cooling Towers 31 31 4.2.1 Engineering Aspects I 4.2.2 Cost Estimates 32 4.2.3 Environmental Aspects 33 I 4.3 Mechanical Draft Cooling Towers 34 4.3.1 Engineering Aspects 34 I 4.3.2 Cost Estimates 35 4.3.3 Environmental Aspects 36 I 5. Roseton Alternative Closed-Cycle Cooling Systems 38 5.1 Summary Description of Existing Plant 38 I 5.2 Natural Draft Cooling Towers 39 5.2.1 Engineering Aspects 39 I 5.2.2 Cost Estimates 40 5.2.3 Environmental Aspects 41 I 5.3 Mechanical Draft Cooling Towers 43 5.3.1 Engineering Aspects 43 I 5.3.2 Cost Estimates 44 45 I 6. 5.3.3 Environmental Aspects All Plants: Alternative Closed-Cycle Systems that do not require 46 I an SPDES Permit 6.1 Description Summaries 46 I 6.2 6.3 Commercially Available Dry Cooling Towers Engineering Aspects 47 48 I 6.4 6.5 Cost Estimates Environmental Aspects 57 58 I 7. 6.6 Summary and Conclusions References 58 60 I 0377700.002 ii i I I
I I liST OF FIGURES I fiiure No, I 1 View Inside "Fabric" Dry Cooling Tower I 2 Predicted Condenser Cooling Water Flow Rate Schedules to Achieve Efficient Operation of Indian Point Unit 2 I 3 Indian Point 2 Site Plan - Natural Draft Tower I 4 Predicted Condenser Cooling Water Flow Rate Schedules to Achieve Efficient Operation of Indian Point Unit 3 I 5 6 Indian Point 3 Site Plan - Natural Draft Tower Bowline Site Plan - Natural Draft Tower I 7 General Arrangement for the Two Natural Draft Cooling Tower Configuration at Roseton I 8 Direct and Indirect Dry Cooling Schematics I I I I I I I 0377700.002 iii I I
I I I I I I I I I I I I I I I I I I I
I I 1.0 IN1RODUcnON 1.1 Objectives I Plant cooling requirements at Indian Point, Bowline and Roseton are currently I supplied by large quantities of Hudson River water. The purpose of this study is to discuss the engineering, environmental and cost aspects of all commercially viable I alternative closed-cycle systems that employ significantly less quantities of river water. The report also updates the information and conclusions of the various previous I studies (Reference 1 to 5) as is necessary and appropriate. I 1.2 Current Coolini System Technoloiies I The boilers of the Bowline and Roseton Stations and the steam generators at Indian Point produce steam at high pressure and temperature. That energy is converted to I mechanical energy in the steam turbines which tum the electrical generators of the respective plants. The steam exhausting from the last stages of the low pressure I turbine is indirectly cooled by water in a heat exchanger termed a condenser. The condenser recovers the pure condensate which is recycled back to the boiler or steam I generator for continued power production. The quantity of water used by the I condenser in this cooling process is immense and in the existing once-through design it releases its heat to the environment by mixing with the Hudson River. I This study evaluated wet, dry and wetI dry alternative cooling systems for the Hudson I River plants. These alternatives are briefly described in the following paragraphs. I Wet cooling towers cause the warm water from the condenser to reject its heat to the surrounding air directly. The cooling effect is enhanced by the concurrent I evaporation of a small fraction of that water. By breaking the flow of warm water from the condenser into small droplets and causing ambient air to contact the I resulting exposed surfaces, efficient cooling occurs. To be more specific, in a modem 0377700.002 1 I I
I I counterflow tower of the type likely to be utilized in New York the warm water from the condenser is distributed across the top of the cooling sections, i.e., the fill, by I spray nozzles. (Note that only the mechanics of counterflow towers are described. These towers have been exclusively utilized during the past decade by new utility I projects because they are energy efficient and can achieve a close approach. Only in special circumstances of significant silt or macro-fouling would the open crossflow I tower be specified and installed. Thus, the discussion of towers in the case of the New York utilities can be limited to counterflow designs; all other discussions are I really irrelevant.) As the warm water falls through the fill in sheets and droplets, it is constrained to contact ambient air flowing upward that has entered the tower I through wide inlets below the fill. A small portion of the water evaporates in the process. The cooled water rains into and collects in the basin directly below the fill. I At the same time, the heated ambient air which has been totally saturated with water vapor, exhausts from the top of the tower and is termed the plume. When this I process is accomplished and the fresh air flow is created by a multiplicity of fans it is called a mechanical draft design. When the air flow is generated by the chimney I effect of the huge hyperbolic shells, it is termed a natural draft design. In either case, the cooled water is then recirculated back to the condenser with a relatively I small quantity of water added from the river to make up for the evaporative losses associated with the cooling process and for the loss due to blowdown which is I required to prevent any heavy concentration of the salts that were originally in the I Hudson river. I The heat transfer process of wet cooling towers, as indicated, involves significant evaporation. This causes the cooled water temperature achieved by the tower to be I higher than the ambient wet bulb temperature. The latter, an atmospheric temperature equivalent to the dew point at 100% relative humidity, is almost always I higher than the river temperature at the same season of the year. The difference between the cooled water temperature and the wet bulb temperature is termed the I approach. The approach will be at least lOOP during warm weather to as much as 0377700.002 2 I I
I I 400F during cold weather seasons. That inherent approach combined with the lesser quantity of circulating water used when retrofitting a cooling tower, causes a higher I turbine exhaust pressure. The higher pressure reduces turbine generation. Further, extra auxiliary power is needed to operate all the numerous, giant fans (if applicable) I and to convey the recirculated water through the condenser to the top of the fill section of the tower, and back again. This total loss of station capability is called I derate. I The mechanism of heat release to the air from the recirculated warm condenser cooling water stream, as outlined in the above paragraphs, is essentially descriptive I of other methods of evaporative cooling such as spray ponds, spray cooling canals or cooling lakes. However these latter methods are much less effective at producing the I large water surface required for the evaporation and/or in providing the enormous fresh air quantities necessary to cool those surfaces. Since their application requires I the dedication of land areas that exceed site availability at Indian Point, Bowline and Roseton, they were not considered further. I In another closed-cycle cooling alternative, the water which cools the turbine exhaust I steam in the condenser releases its heat directly to the environment by passing I through a heat exchanger called a dry cooling tower. It is comprised of finned tubes and components similar to that of an automobile radiator but enormous in all I respects. Heat transferred to the surrounding air occurs by the difference in temperature between the warm condenser water and the relatively cooler ambient I air which is caused to flow over the fins. Because this process does not employ any evaporative cooling, river water makeup is not required and a SPDES permit is not I necessary. While many environmental effects normally associated with wet cooling towers are not present, these dry towers are significantly larger and cannot achieve I the same degree of cooling as wet systems. This causes important annual reductions in plant generation and increases operating and capital costs. I 0377700.002 3 I I
I I The closed-cycle dry cooling tower option for Indian Point, Roseton and Bowline stations is discussed more completely in Section 6. I One other closed cycle-cooling option which was considered in the referenced reports I was the hybrid wet-dry cooling system. As its name indicates, it combines the features of wet and dry cooling towers. It was evaluated in this 1993 report update I for the Indian Point, Bowline and Roseton sites with the following considerations in mind: I
- It typically withdraws river makeup and rejects blowdown quantities that are I only slightly lower than those of entirely wet systems.
I
- The plume incidence from these towers is only marginally less than that for entirely wet cooling system.
I
- The size is much larger than a comparable wet cooling system and it requires I more auxiliary power to operate the fans and pump cooling water so plant I generation is reduced.
I
- The maintenance and capital costs are greater than a wet cooling system.
There are more fans and the fin tube dry section tends to foul and corrode. I
- The design reduces station output during portions of the year due to the I existence of the dry section which reduces the effectiveness of the cooling process.
I
- The experience in the industry is very limited with wet-dry systems. in the I late 70's there were expectations this equipment would be commonplace for application to plants like Indian Point, Bowline or Roseton. However, that I has not occurred in the. U.S. Only one wet-dry tower of comparable size has 0377700.002 4 I
I
I I been applied and that to a plant in the southwest with markedly different weather and application circumstances. I Since it has no significant actual advantages, limited actual usage, and since it I exhibits all the worst characteristics of the wet and dry cooling systems, it was not evaluated further as an alternative closed-cycle cooling system in this study. I To summarize, only wet natural draft and wet mechanical draft towers were I considered for those cooling alternatives that requires a SPDES permit since they represent mature and commercially, reliable large-scale designs. I I 1.3 Environmental &peets - An Overview I Wet cooling towers will reduce the usage of Hudson River water by the plants to 2 I or 3% of the current values and thereby reduce aquatic entrainment by a similar amount. However, towers do have strong inherent environmental impacts. Some of I these effects can be generalized or quantitatively characterized; others are related to the specific kind of cooling tower and the specific local site conditions and I environs. The former will be addressed in this section while the latter will be reserved for the particular plant discussions. I One general effect is the aggregate vaporization of the fresh water component of the I Hudson River. On an annual basis from all the towers it amounts to over 13 billion gallons, a value also equal to 40,000 acre-feet. Though this quantity of water is I immense, it is not likely to change the weather or move the Hudson River salt front measurably. To provide a perspective however, the above evaporative consumption I is as much as the supply needs for a city of 300,000 people and it also equals covering the entire state of Rhode Island with one-half inch of water. I s I 0377700.002 I
I I In addition drift, small airborne droplets of cooling tower basin water, is continually emitted in the wet cooling tower exit air plumes. The drift droplets are comprised I of concentrated chemicals, salts and solids that are constituents in the original makeup water. Potentially, PCB's and other pollutants may also be present if they I were present in the makeup water at the time it is pumped into the plant. These drift particles eventually reach the ground and also contribute to airborne I concentrations at ground level. Natural draft towers tend to disperse this drift widely because of their elevated exit while mechanical draft towers have more intense local I effects. In some instances, it can cause harm to certain sensitive trees and foliage. The drift is a small value, a total of less than 50 GPM for all the plants, but it is I persistent and not likely to be eliminated by future technical improvements. Its long term consideration is an important matter. I Icing and fogging from the plume often recur downwind of towers during certain I atmospheric conditions that are conducive to these formations. This occurs frequently during cool or cold weather seasons. Natural draft towers seldom are I responsible for ground level incidents; but mechanical drift towers with their lower, extensively spaced discharges have significantly more troublesome visibility, icing and I fogging events depending on the wind direction and location of the tower. I Noise is another effect that is a pervasive and persistent emittant from both types of I towers. At property lines the sound equivalent of natural and mechanical draft towers may be 60 dBA, the sound equivalent to that of working inside the average I factory or hearing an approaching subway train, 24 hours a day, 7 days a week. It is slightly higher from mechanical draft towers because of their fan noise. Though I much discussion is made of attenuating that noise most reduction techniques involve some form of large scale blockage barriers. This however runs counter to the tower I design performance requirements of providing an easy entry and exit for the admission of cooling air. Hence real costs are very high for noise treatments because I they must account for both the hardware and the compromised cooling performance. 0377700.00z 6 I I
I I Blowdown discharge contains concentrated levels of salts and chemicals present in the makeup water. Its temperature will generally be elevated above the river I temperature by 10 to 300F reflecting that of the cooling tower basin at the particular moment. In terms of quantity it will be about 1 to 2% of the flow circulated in the I plant. I Sludge develops in the basin from silt and heavier suspended solids in the makeup. It will reflect the character of the Hudson River at the location and particular time I of the makeup. Silting will vary seasonally and is generally very site specific. If - severe, it could interfere with operations and cause reduced thermal performance. I As had been indicated for drift, it potentially could contain PCB's and other pollutants. The sludge must be properly managed and in the future, the costs for I testing, removal and proper landfill disposal could be substantial. I Visually, the natural draft towers will be taller than other current station structures. Along with its associated plume it will be very evident. The mechanical draft towers I have a lower profile, similar to that of other plant bUildings, but are much more I extensive than the natural draft designs. The visual impact of the mechanical draft tower however should be evaluated with their frequent lengthy plumes when I comparing them with the effects of the natural draft design. I Both types of towers use significant amounts of land and thus preclude any future necessary alternate plans. I To compensate for the reduced performance, resulting from retrofitting cooling I towers, replacement generation must be created. The replacement plant generation will likely utilize fossil fuels and thereby cause added airborne emissions of CO2, S02 I and NOr This would also produce environmental impacts similar to those associated now with the fossil Hudson River plants. I I 0317700.002 7 I
I I 1.4 Study Approach I Wet cooling tower engineering design was originally defined early in this century by mathematical formulations of the physics of the simultaneous sensible and evaporative heat I transfer processes which occur. Those theories remain valid to this day. Though there have been some technology improvements - notably to fill, fans, drift eliminators, and materials I of construction - a large cooling tower purchased today would be very similar to one of 20 years ago. I Thus this study was able to take advantage of the extensive past work efforts on previous I conceptual closed cycle plants, their engineering and estimates. In fact this allowed the study to effectively focus on past results, planned schedule and cost estimates and only revise I and update to 1993 any outmoded data. These past costs were then escalated using accepted methods that are appropriate for the industry. They are listed in the appropriate sections I for each site. I In addition, though there have been advances in materials, wet natural draft cooling towers still rely on concrete shell construction. That material is strong, permanent, relative easy to I form, and impervious to the corrosive effects of the warm, and sometimes saline, circulated I water it contains. Fabric cooling tower shell materials can reduce the costs of this expensive component but are only applicable and have only been used for dry cooling towers since I they do not operate in the same potentially corrosive environment. One application shown in Figure 1 is a photograph of the inside of a fabric tower in construction at the I Schmehauser Site, a 300 MW fossil plant in Germany. The dry tower "A" frame finned tube headers are clearly shown in the photograph. As indicated, the fabric cooling tower is not I a design that is relevant or applicable to the wet cooling towers of the Hudson River plants. I Retrofitting closed-cycle cooling system may appear to be a simple task, in actuality, the fundamental differences between the once-through and closed-cycle cooling system, the I permanence of the construction of the existing massive structures, the optimum economic 037T70(UlO2 8 I I
I I I I I I I I I I I I I I I I I VIEW INSIDE 'FABRIC' DRY COOLING TOWER I Figure 1 300 Mega Watt Fossil Plant-Schmehausen Site I
I I balances originally struck between the cooling system requirements and the overall station generation, make retrofitting a closed-cycle cooling system a major engineering, design and I construction task. I It should be recognized that since a cooling tower recirculates less water and also produces warmer water than the current once-through cooling, the steam turbine exhaust pressure I increases, lowering plant output. In addition, the auxiliary load to operate the station increases: first, due to the higher pumping power necessary to convey the closed-cycle water I a greater distance to the towers, to the top of the fill and back to the station; and second, in the case of mechanical draft systems, to operate the many fans that are needed to provide I the cooling air supply of the tower. So besides the planning, design and construction of the tower and the permitting process (which could be a very long and costly process when trying I to permit a cooling tower along the Hudson River, especially for a nuclear power plant with its own extensive licensing requirements) the major items that complicate and make I retrofitting costly which are addressed by this study include: I 1. Station performance penalties. I 2. Demolition of existing circulating water intakes, discharges and obsolete runs of piping. I
- 3. Replacement of circulating water pumps and construction of new pump houses and I intake bays to accommodate the higher energy demands.
I 4. Condenser modifications including on-line cleaning systems and possible I reinforcement to withstand higher hydraulic pressures and reduced flows. I 5. Fabrication of new high pressure piping to the cooling tower. I 0377100.002 9 I I
I I 6. Installation of new electrical services, motor control facilities, chemical treatment, new makeup and blowdown systems. I
- 7. Scheduling cut-in construction, testing and start-up of the closed-cycle system to have I only a relatively short term disruption of station generation.
I The last paragraph noted that retrofitting a cooling tower will cause a loss of net station generation. There will be a reduction in turbine pedormance due to the warmer level of I cooling water (compared to Hudson River water temperatures at the same time) and due to the extra power needed (for fans and/or conveying cooling water a larger distance and I up to the hot water distributor of the tower). The total loss is termed a derate. The following paragraphs discuss specifically how the derate was estimated in this study. I First, the results of the original studies were reviewed to ensure that estimates of the extra I auxiliary power identified above were reasonable and still appropriate for 1993. Since the cooling towers, quantity of water and locations of equipment were essentially the same for I the original and present study; and because the state-of-the-art has not improved measurably with regard to auxiliary power requirements in the intervening years, these former values I were determined to be still relevant. I Second, the original studies indicated the wet towers use of asbestos fill had lifted the I pedormance levels and included several depictions of the cooling towers. Based on that information, and a recollection of cooling tower manufacturers offerings for large utility I applications conclusively ascertained that these early studies employed data for film fill, counterflow, natural draft cooling towers and crossflow mechanical draft towers. In fact, I today's natural draft tower study candidates would be basically the same as those of the mid 70's. The mechanical draft tower for utility application today would however more typically I be counterflow using a PVC film fill that had provisions to prevent fouling of the natural sudaces which causes the fill to be deeper. I 0377700.002 10 I I
I I Because the total pumping head for the system is comprised of that through the condenser, valving, lines to and from the tower, over the air inlets up above the fill and across the low I head distributor nozzles, a change in auxiliary pump power between the old and present studies due to an improved fill is minimal. Thus, between the old and present studies the I pump power estimate is within the accuracy of this study. Accordingly, the past study pump power was used and when applicable, the significant fan power requirement, which has not I changed since the mid 70's, was added. I Due basically to a shift to a counterflow design, the performance of a mechanical draft tower has improved since the original studies. Though the size of the tower has remained I about the same, the close up achievable approach of the tower to the design wet bulb temperature has been importantly reduced. This effect was recognized in the current study. I With the above information, the performance loss of the turbine was determined by I computing the corresponding changes in turbine exhaust saturation temperature and then I the pressure. The incremental heat rate curves estimated for that turbine were then examined to determine the impact on station generation. The original capability loss figure I was then modified accordingly. As stated, since the natural drift tower performance is expected to be the same, no change in capability losses was necessary. If a power level I increase had recurred at the station, this loss was included. The auxiliary power requirements due to the retrofit were then added to obtain the total station derate. I It is also important to recognize that although the construction of the cooling tower will be I basically independent of the plant operations, timing the tie-in to coincide with a moderate load period, weather, etc. will require a complex, sensitive plan. The tie-in activities are I many and will take an appreciable time to complete. These activities include, excavations, installation of a security fence, disconnection and hook-in of buried and overhead electrical I services, installation of control and monitoring systems, demolition of affected existing circulating water lines, construction of thrust blocks, construction of foundations in the area I of tie-ins, modification of the inlet for makeup, modifications for a blowdown facility, tests I 0377700.002 11 I
I I and modifications to the condenser for a tube ball cleaning system and to accommodate the potential for higher hydraulic pressures, innumerable tests and start-ups for valves, pumps I and components of the circulating water system, pump performance capacity tests, and finally tests of the detailed components of the cooling tower and its physical and hydraulic I compatibility with the entire system. I Two closed-cycle cooling system designs not considered by this study are the round wet mechanical draft design and the dry natural draft cooling tower. The round wet tower is I significantly more expensive than the wood design and has a limited size range. While this tower has some minor engineering advantages, the rectangular tower sufficiently I characterizes the wet mechanical design. The dry natural draft tower is not a commercially viable, tested design in the U.S. It has also not been considered due to its immense size I such that it has no comparable construction history in this country. I To put this study in perspective, it should be understood that retrofitting closed cycle cooling towers has only happened a few times. The plants where this occurred were much smaller. I A project of this magnitude at anyone of the large plants of this study is unprecedented. To retrofit all six Hudson River plants will be an enormous and perhaps chaotic construction I effort. I Modem day cooling tower technology compared to the 1970's cannot be portrayed as one which has significantly reduced sizing, costs and energy usage. That simply hasn't happened. I Budget estimates of costs and sizing received from three cooling tower manufacturers for I the Hudson River plants were utilized in this study. These costs were similar to ones which could have been developed by escalating, the original mid 70's costs at 5.5% inflation. This I indicates there was no decrease in basic pricing. Further, the tower plot size was about the same as that of the early studies. I As far as energy is concerned, a comparison of the fill heights of difficult (close approach) I thermal duty crossflow towers of the 70's would show them to be much higher than the 0377700.002 12 I I
I I equivalent approach modem counterflow tower. But the originall7"F approach towers of the earlier study are not difficult duties. Besides, the fill height is only one component of I the total hydraulic pumping head. Sufficient pump power is needed to force the circulating water through the condenser tubes, the valves, lengthy piping to and from the tower, up over I the counter flow air inlets, above the required modem non-fouling film fill and through the low head distribution nozzles. Thus, the pump power can essentially be considered to be I the same. In addition, the fan power in the experience of those in the cooling system field still is about the same. Improved heat transfer from modem film fills occurs due to better I air-water interaction. This same mechanism actually increases unit pressure drop that the cooling tower fans must overcome. I With regard to other types of tower designs, the former USSR research idea of a trifolium I tower design and fabric shells are not likely to be of use for the Hudson River plants. This tower is still only a concept and its advantages and disadvantages are likely still under I debate in Russia. It is a long distance between that and a viable, commercial U.S. design. The fabric tower has been used for some Eastern Europe plants but its cable support system I is subject to the long term corrosion effects of the salt and chemicals in wet cooling plumes and drift. Thus, the application of fabric shells for some time will be confined - as it has I been - to dry natural draft cooling towers where no extraordinary drift problems recur. I The use of wetldry towers is rare. In fact, the latter are so rare that there is only one such tower at a plant that is slightly smaller in size to the smallest of the Hudson River plants. I No manufacturers have made any recently. It is installed in the southwest in a climate much I different than New York. Due to this rarity it is unlikely that the capital or operating costs could be even remotely estimated. The degree of plume avoidance ability is proportional I to the dry finned tube surface. This could significantly impact the plant performance, increase cost and be a costly maintenance item due to the corrosion and side effects of the I wet plume on the fins. I 0371700.002 13 I I
I I I I I I I I I I
I I 2.0 INDIAN POINT 2 ALTERNATIVE CWSED-CYCLE COOLING SYSTEM I 2.1 SummaD' Description of Existin& Plant I The Indian Point Power Station contains two units - Unit No.2 is a 1000 MW commercially operating nuclear power plant (pressurized water reactor) owned and operated by I Consolidated Edison Company of New York; Unit No.3 is a 1000 MW commercially operating nuclear power plant (pressurized center reactor) owned and operated by the New I York Power Authority. The site is located in Buchanan, New York, approximately 35 miles north of New York City on the east side of the Hudson River. Its features are described I in Reference (2) and it began commercial operation in 1975. I The existing cooling system is classified as a once-through system. The Indian Point No.2 I turbine generator converts the thermal energy of steam to electric energy. After all recoverable energy has been extracted from the steam, the residual thermal energy is I transferred to the condensers and discharged via the condenser cooling water to the discharge canal. I The discharge structure at the end of the canal is designed to create mixing of the warm I condenser water in such a way as to minimize water temperature differences in the Hudson River. It will accommodate the combined cooling water flow from both active Indian Point I Units and at full flow amounts to about 850,000 gpm including service water from each station. This outfall structure is 270 ft. long and contains a series of submerged ports. I Both intake and discharge structures are located essentially on the river edge. Large pumps I within the intake structure pump the water to the condenser in the main plant area via buried pipes and that water is conveyed back to the river edge discharge structure in a I below-ground canal. I I 0371700.002 14 I
I I The current Consent Order from the Supreme Court of the State of New York mandates the station to operate at the minimum condenser cooling water volume required for I efficient operation taking into account ambient river water temperatures, operating status and the need to meet all water quality and permitting conditions. These flows*are defined I in Figure 2, taken from Table B-1 of the Consent Order. Other stipulations of this consent order can be found in Reference (6), the Order itself. I 2.2 Natural-Draft Coolin: Towers I 2.2.1 Engineering Aspects I The natural draft tower design for the Indian Point 2 Station is the same as that determined in Reference 2. One tower is provided (Figure 3). It has a reinforced concrete shell of I hyperbolic shape. The base diameter is 462 feet and the overall height 565 feet. The tower design point is for a circulating water flow rate of 600,000 gpm, a range of 25
- F and an I approach of 18* F at a 79* wet bulb and 50% relative humidity.
I To maintain the concentration of the circulated station water at reasonable levels, approximately 8,000 gpm of blowdown water per unit would be discharged to the Hudson I River. At the design point, about 12,000 gpm is lost through evaporation, as the thermally induced air flow comes in contact with the warm circulating water sprayed in the fill. I Approximately 12 gpm of water would be lost through drift. The latter is the classification I of water droplets which become suspended in the air flowing through the tower and which are deposited on the surrounding land. All categories of water usage are replaced by I makeup water drawn from the river. I 2.2.2 Cost Estimates I Section 1.4 had indicated wby the cost of the cooling towers is only a small fraction of the overall cost of retrofitting. The capital costs of the system using the wet natural draft cooling I tower are listed in 1993 dollars as follows: 037T700.002 15 I I
~-~~~~-~-~--~~-
-,:)
Figure Z PREDICTED CONDENSER COOLING WATER FLOW RATE SCHEDULES TO ACHIEVE ,., l>
-,:) ..,o'"
EFFICIENT OPERATION OF INDIAN POINT UNIT Z
'"t::1 -.--r- .--.- r-'-'-' .-....-.-r-.-.-.-t-........ -. -r-~-'-'- r---- ** ... *..,,..-.-r l>
900 090 Z
;0 z
UI UI
'"w *80 800 ...., t::1 r-I ,~~ .'- " Vl ~ '~ ~
700 00 0,
- 70 ~
r-'
~ '.,, ~
iJ.:' UJ 0: 2 Q.. eJ .-
~ ,r' "*,. I'"""
60 600
~
M o , I g
~
I *, -",
~
LU Q. I1l (lJ
~
M
"- .n ~ l * ,, ~
III I-< o Li 500 ,.' 0 ., - 50 J-- LU
,I -J ,I ,, " ~,
400
, ,* I EFFICIENT FLOW 40 ,.. INLET ". , , ,,,- I~ ------- TEMPERATURE 300 rr-' r-,-,- r-r'r-- -,.- o.-T_I_.-.*-,-I_._,_*,_I_.o_ ,0-".'_,_,0_.-1-. 1.
030 JAN I FEB I MAR I APR I MAY I JUN I JUL I AUG I SEP I OCT Nnv PRIVilEGED AND CONFIDENTIAL. FOR SETTlEMFNT PURPOSES ONLY
I I I I - I . rOIil
~
l' i-* 1_. 1,_, 0po.. I i __ r I .= t: I .
~ ~
po.. z I z I ~
~
I - rOIil po.. tI.l N po.. I-Ii
-~ ~
I -z z 0 11ii I
~ . 1"_ ~ ...... ~ l I
0 , I", " Ii
-z OJ.,
lL)-t-\I_ _
< "I Z! ' =
I - ~ I ' I~, I f"')
~
II
- E all i .. ,.-
~
I ~ IIII ! ~.. I I I I L I: '
"~11 I .... , oj I
I I
I I Cooling tower - erection, labor and materials $ 26,100,000 Amertap $ 8,651,000 I Piping 20,036,000 Structural 36,174,000 I Electrical 5,683,000 Project Management 2,979,000 I Indirects and Professional Services 11.900.000 TOTAL $111,500,000 I These costs are consistent with previous costs of Table 5-1, Ret 2. Cooling tower costs I shown however are based on 1993 data supplied by cooling tower vendors. All other inherent construction and installation costs of the 1977 studies and including current site I drawings were reviewed and found to be appropriate. They were escalated and updated I from the 1977 studies to reflect the 1993 material and labor costs. An average escalation rate of 55% was assumed from 1977 through 1993. The construction schedule will be I approximately 31/2 years, as indicated in Ref. 2. I Section 1.4 also discusses the kinds of new added operating costs and the related loss in station efficiency when retrofitting cooling towers. The capability losses of this alternative I for Indian Point 2 follow. They are based on Table 3-3 and Table 5-5 of Ref. 2. I Plant Capability Losses I Loss of turbine capability - MW Cooling System Auxiliaries - MW 15 12 I Total Derating - MW 27 Max. Peak Temperature Derate - MW 68 I I I 0377700.002 16 I
I I 2.2.3 Environmental Aspects I A natural draft cooling tower has effects on air quality, ambient noise, terrestrial ecology, water quality, aquatic ecology and aesthetics. The major effects are discussed in Reference I 2 and summarized below. I A) Ground fog incidence is only one event in 11 months; this is inconsequential in comparison to the natural fog incidence of 2%. No icing incidence will occur. I B) Drift dispersion from a natural draft tower is very wide spread. It results in salt I deposition and saline air concentrations that are relatively small. Those studies also indicate most plants in the Indian Point area will not be injured with the possible I exception of hemlock, flowering dogwood and white ash. As reported in these very extensive studies, the potential for biological damage is improbable unless the I ambient conditions described in Reference 2 persist. I C) Blowdown chemistry could be diluted in the discharge canal before discharge to the river. All discharge water chemistry, including intermittent chlorine, will be within I EPA mandated guidelines. Extensive bioassay data on the aquatic populations is I included in the referenced 1977 studies. I D) Noise input studies during construction and operation is expected to be lower for natural draft towers and less localized. I E) Due to its size, a natural draft tower will have a major visual and aesthetic impact I on surrounding areas. It will render ineffective Indian Point efforts at maintaining a low profile. I I 037T700.002 17 I I
I I F) The makeup of a closed cooling system is about 2% of the existing once-through flow. This reduces the entrainment of aquatic life appreciably as Reference 2 shows. I Thermal heat to the Hudson River is also reduced proportionally. I G) Installation of a closed-cycle cooling system necessitates replacement of the fossil power generation due to the amount of the Indian Point 2 derate and the plant I downtime during the cut-in of the cooling tower. This will result in added atmospheric discharges of fuel based oxides and particulates. I 2.3 Mechanical Draft CooUna Towen I 2.3.1 Engineering Aspects I The mechanical draft tower design for the Indian Point 2 Station is the same as that I determined in Reference 2. Four towers are provided. It is a concrete rectangular cooling tower. The size of each tower is 54' x 486' and it is 33 feet to the fan deck. The tower I design point is for a circulating water flow rate of 600,000 gpm, a range of 25
- F and an approach of 10°F at a 79°F wet bulb and 50% relative humidity.
I To maintain the concentration of the circulated station water at reasonable levels, I approximately 8,000 gpm of blowdown water per unit would be discharged to the Hudson I River. At the design point, about 12,000 gpm is lost through evaporation, as the thermally induced air flow comes in contact with the warm circulating water sprayed in the fill. I Approximately' 12 gpm of water would be lost through drift. The latter is the classification of water droplets which become suspended in the air flowing through the tower and which I are deposited on the surrounding land. All categories of water usage are replaced by makeup water drawn from the river. I I 0377700.002 18 I I
I I 2.3.2 Cost Estimates I Section 1.4 bad indicated why the cost of the cooling towers is only a small fraction of the overall cost of retrofitting. The capital costs of the system using the wet mechanical draft I cooling tower are listed in 1993 dollars as follows: I Cooling tower - erection, labor and materials $ 18,000,000
$ 8,651,000 Amertap I Piping 20,036,000 Structural 39,490,000 I Electrical 7,602,000 Project Management 2,556,000 I Indirects and Professional Services 11.560.000 TOTAL $107,900,000 I
These costs are consistent with previous costs of Table 5-2, Ref. 2. The cooling tower costs I shown are based on 1993 data supplied by cooling tower vendors. I All other inherent construction and installation costs of the 1977 studies were reviewed and including current site drawings, found to be appropriate. They are escalated and updated I from the 1977 studies to reflect the 1993 material and labor costs based on an annual rate I of 5.5% I Section 1.4 also discusses the kinds of new added operating costs and the related loss in station efficiency when retrofitting cooling towers. The capability losses of this alternative I for Indian Point 2 follow. They are based on Table 3-2 and Table 5-6 of Ref. 2. I I 0377700.002 19 I I
I I Plant Capability Losses Loss of turbine capability - MW 22 I Cooling System Auxiliaries - MW 14 Total Derating - MW 36 I Max. Peak Temperature Derate - MW SS I 2.3.3 Environmental Aspects I A mechanical draft cooling tower has effects on air quality, ambient noise, terrestrial ecology, water quality, aquatic ecology and aesthetics. The major effects are discussed in I Reference 2 and summarized below. I A) Ground fog (plume) incidence are more frequent than in natural draft cooling towers. Icing incidence will occur. I B) Drift dispersion from a mechanical draft tower is very localized. It results in salt I deposition and saline air concentrations that are higher than natural draft. Those studies also indicate most plants in the Indian Point area will not be injured with the I possible exception of hemlock, flowering dogwood and white ash. As reported in I these very extensive studies, the potential for biological damage is improbable unless the ambient conditions described in Reference 2 persist. I C) Blowdown chemistry could be diluted in the discharge canal before discharge to the I river. All discharge water chemistry, including intermittent chlorine, will be within EPA mandated guidelines. Extensive bioassay data on the aquatic populations is I included in the referenced 1977 studies. I D) Noise during construction and operation is expected to be lower for natural draft towers and less localized. I 0377700.002 20 I I
I I E) The makeup of a closed cooling system is about 2% of the existing once-through flow. This reduces the entrainment of aquatic life appreciably as Reference 2 shows. I Thermal heat to the Hudson River is also reduced proportionally. I F) Installation of a c1osed-cyc1e cooling system necessitates replacement of the fossil power generation due to the amount of the Indian Point 2 derate and the plant I downtime during the cut-in of the cooling tower. This will result in added atmospheric discharges of fuel based oxides and particulates. I It should also be noted that the mechanical draft towers have a lower profile, similar to that I of other plant buildings, but are much more extensive than the natural draft designs. The visual impact of the mechanical draft tower should be evaluated with their frequent lengthy I plumes when comparing them with the effects of the natural draft design. I I I I I I I I I 0377700JlO2 21 I I
I I I I I I I I I I I I I I I I I I I
I I 3.0 INDIAN POINT 3 ALTERNATIVE CWSED-CYCLE COOLING SYSTEMS 3.1 Summaa' Description of Existin& Plant I The Indian Point Power Station - Unit No.3 is a lOOO MW commercially operating nuclear I power plant (pressurized water reactor). The site is located in Buchanan, New York, approximately 35 miles north of New York City on the east side of the Hudson River. Its I features are described in Reference (3) and it began commercial operation in 1977. I The existing cooling system is classified as a once-through system. The Indian Point No.3 turbine generator converts the thermal energy of steam to electric energy. After all I recoverable energy has been extracted from the steam, the residual thermal energy is I transferred to the condensers and discharged via the condenser cooling water to the discharge canal. I The discharge structure at the end of the canal is designed to create mixing of the warm I condenser water in such a way as to minimize water temperature differences in the Hudson River. It will accommodate the combined cooling water flow from both active Indian Point I Units and at full flow amounts to about 850,000 gpm including service water from each station. This outfall structure is 270 ft. long and contains a series of submerged ports. I Both intake and discharge structures are located essentially on the river edge. Large pumps I within the intake structure pump to the condenser in the main plant area via buried pipes and that water is conveyed back to the river edge discharge structure in a below-ground I canal. I The current Consent Order from the Supreme Court of the State of New York mandates the station to operate at the minimum condenser cooling water volume required for I efficient operation with due regard to ambient river water temperatures, operating status and the need to meet all water quality and permitting conditions. These flows are defined I I 0377700.002 22 I
I I in Figure 4, taken from Table B-2 of the Consent Order. Other stipulations of this consent order can be found in Reference (6), the Order itself. I 3.2 Natural-Draft Coolinl Towers I 3.2.1 Engineering Aspects I The natural draft tower design for the Indian Point 3 Station is the same as that determined in Reference 3. One tower is provided (Figure 5). It has a reinforced concrete shell of I hyperbolic shape. The base diameter is 462 feet and the overall height 565 feet. The tower design point is for a circulating water flow rate of 600,000 gpm, a range of 25
- F and an I approach of 18* F at a 79* F wet bulb and 50% relative humidity. A current site drawing was reviewed and it was found that the site would be able to accommodate cooling towers, I though a future plan for an administrative building may require a slight alteration.
I To maintain the concentration of the circulated station water at reasonable levels, I approximately 8,000 gpm of blowdown water per unit would be discharged to the Hudson River. At the design point, about 12,000 gpm is lost through evaporation, as the thermally I induced air flow comes in contact with the warm circulating water sprayed in the fill. Approximately 12 gpm of water would be lost through drift. The latter is the classification I of water droplets which become suspended in the air flowing through the tower and which are deposited on the surrounding land. All categories of water usage are replaced by I makeup water drawn from the river. I Construction times for the overall project are estimated to be the same as in 1977,3112 years. I 3.2.2 Cost Estimates I Section 1.4 indicated why the cost of the cooling towers is only a small fraction of the overall cost of retrofitting. An escalation rate of 5.5% was utilized to adjust costs to 1993. I I 03n700.002 23 I
~---
1J J> Figure 4 PREDICTED CONDENSER COOLING WATER FLOW RATE SCHEDULES TO ACHIEVE 1J f'1
>J EFFICIENT OPERATION OF INDIAN POINT UNIT 3 Cl >J 900 - --.. -. -- '--r-' 1--- --- -r---I --- r- - i 90 \::j J>
Z
- u z
U1 n--r I i I ~ "- ru U1 800,---*--1-----1---+--**-**** I**' -****** --'--1---- .-- !... ~~--I-**
-,,, ~'" . -- -----1*' . - . -.. *!*-----***-*I- 80 <J) w t:j VI . '.., I" ' \ : ; .I 700+----1----*1-*-*1----*-*-,,-*-----\------1 ', ' ,, ;I I . \:I -.. ---1---,\--,1,,,-, 70 ~ ,-' I:" , ~
lU
~ ~- ,,. I \,
Q... a:: (!) o o 600 -J*---I---I---I-F ._+- ",. ------- --'---" .......-.. -.-.- *---r--,~. I--***--*I\:--*-*-*I- 60 2~ 7m o
~
II " I I'
,/ II \ .. ",
lLJ Q...
~
OJ
...t ~ ~
o ---.-.-,.' 1:-- -'- -----.- .--- . .-.. -.--- .-.-- ~,
---.'-,-1---*_ 50 !-tJ J-,:
f-E-' ti lU
,, I ' \.
400 1
-J----I--**--- --,' ---"'1-----'-'
I EFFICIENT FLOW
~
I" ,
- ---'--... ,- ",~'- --I- 40 ~
300 ~~~~~*!=--~l~__ -J-- -- -I~=~=~--{~~~~~:~~~---- .---1-- - ~:+ Iii
- Iii' Iii iii APR I , I ' iii iii Ii'
- Iii, Iii
- I
- iii ii' I '
- i I 30 PRIVILEGED AND CONFIDENTIAL FOR SETTLEMENT PURPOSES ONLY
I ~ I I I I I I I I I __ i
- I I ~ ..
'S I " "\ ' .. - ". ... --~ "'\ \
I z o i I \: I I I I
I I The capital costs of the system using the wet natural draft cooling tower are listed in 1993 dollars as follows: I Cooling tower - erection, labor and materials $ 26,100,000 I Amertap $ 8,651,000 Piping 16,743,000 I Structural 46,237,000 Electrical 6,947,000 I Project Management 3,528,000 Indirects and Professional Services 2.985.000 I TOTAL $121,190,000 I Section 1.4 also discusses the kinds of new added operating costs and the related loss in I station efficiency when retrofitting cooling towers. The operating expenses of this alternative for Indian Point 3 follow. They are based on Table 3-3 and Table 5-5 of Ref. 3. I Plant Capability Losses I Loss of turbine capability - MW 16 I Cooling System Auxiliaries - MW Total Derating - MW 17 33 I Max. Peak Temperature Derate - MW 77 I 3.2.3 Environmental Aspects I A natural draft cooling tower has effects on air quality, ambient noise, terrestrial ecology, water quality, aquatic ecology and aesthetics. The major effects are discussed in Reference I 3 and summarized below. I 037770IUlO2 I I
I I A) Ground fog incidence is only one event in 11 months; this is inconsequential in comparison to the natural fog incidence of 2%. No icing incidence will occur. I B) Drift dispersion from a natural draft tower is very wide spread. It results in salt I deposition and saline air concentrations that are relatively small. Those studies also indicate most plants in the Indian Point area will not be injured with the possible I exception of hemlock, flowering dogwood and white ash. As reported in these very extensive studies, the potential for biological damage is improbable unless the I ambient conditions described in Reference 3 persist. I C) Blowdown chemistry could be diluted in the discharge canal before discharge to the river. All discharge water chemistry, including intermittent chlorine, will be within I EPA mandated guidelines. Extensive bioassay data on the aquatic populations is included in the referenced 1977 studies. I D) Noise during construction and operation is expected to be lower for natural draft I towers and less localized. I E) Due to its size, a natural draft tower will have a major visual and aesthetic impact on surrounding areas. It will render ineffective Indian Point efforts at maintaining I a low profile. I F) The makeup of a closed cooling system is about 2% of the existing once-through flow. This reduces the entrainment of aquatic life appreciably. Thermal heat to the I Hudson River is also reduced proportionally. I G) Installation of a closed-cycle cooling system necessitates replacement of the fossil I power generation due to the amount of the Indian Point 3 derate and the plant downtime during the cut-in of the cooling tower. This will result in added I atmospheric discharges of fuel based oxides and particulates. 0317700.002 2S I I
I I 3.3 Mechanical Draft Coolinl Towers 3.3.1 Engineering Aspects I The mechanical draft tower design for the Indian Point 3 Station is the same as that I determined in Reference 3. Four towers are provided. It is concrete construction. The base diameter is 462 feet and size of each tower is 54' x 486' and it is 33 ft to the fan deck. The I tower design point is for a circulating water flow rate of 600,000 gpm, a range of 25
- F and an approach of 10-F at a 79-F wet bulb and 50% relative humidity. The review of a I current site drawing indicates the site will accommodate mechanical draft towers.
I To maintain the concentration of the circulated station water at reasonable levels, approximately 8,000 gpm of blowdown water per unit would be discharged to the Hudson I River. At the .design point, about 12,000 gpm is lost through evaporation, as the thermally induced air flow comes in contact with the warm circulating water sprayed in the fill. I Approximately 12 gpm of water would be lost through drift. The latter is the classification of water droplets which become suspended in the air flowing through the tower and which I are deposited on the surrounding land. All categories of water usage are replaced by makeup water drawn from the river. I I 3.3.2 Cost Estimates I Section 1.4 had indicated why the cost of the cooling towers is only a small fraction of the overall cost of retrofitting. The capital costs of the system using the wet mechanical draft I cooling tower are listed in 1993 dollars as follows: I Cooling tower - erection, labor and materials Amertap
$ 18,000,000 $ 8,651,000 I Piping Structural 22,761,000 84,013,000 I Electrical 20,180,000 037T700.002 26 I
I
I I Project Management Indirects and Professional Services 5,112,000 19,000,000 I TOTAL $177,700,000 I These costs should be consistent with previous costs of Table 5-2, Ref, 2. Cooling tower costs shown are based on 1993 data supplied by cooling tower vendors. All other inherent I construction and installation costs of 1977 studies were escalated as stated in Sub-section 2.2.2. I All other inherent construction and installation costs of the 1977 studies were reviewed and I found to be appropriate. They were escalated and updated from the 1977 studies to reflect the 1993 material and labor costs at 5.5%. I Section 1.4 also discusses the kinds of new added operating costs and the related loss in I station efficiency when retrofitting cooling towers. The capability losses of this alternative for Indian Point 3 follow. They are based on Table 3-2 and Table 5-6 of Ref. 3. I I Plant Capability Losses Loss of turbine capability - MW 24 I Cooling System Auxiliaries - MW 21 Total Derating - MW 45 I Max. Peak Temperature Derate - MW 73 I 3.3.3 Environmental Aspects I A mechanical draft cooling tower has effects on air quality, ambient noise, terrestrial I ecology, water quality, aquatic ecology and aesthetics. The major effects are discussed in Reference 2 and summarized below. I O37nOO.00l 27 I I
I I A) Ground fog (plume) incidence are much more frequent than in natural draft cooling towers. Icing incidence will occur. I B) Drift dispersion from a mechanical draft tower is very localized. It results in salt I deposition and saline air concentrations that are higher than mechanical draft. Those studies also indicate most plants in the Indian Point area will not be injured with the I possible exception of hemlock, flowering dogwood and white ash. As reported in these very extensive studies, the potential for biological damage is improbable unless I the ambient conditions described in Reference 2 persist. I C) Blowdown chemistry could be diluted in the discharge canal before discharge to the river. All discharge water chemistry, including intermittent chlorine, will be within I EPA mandated guidelines. Extensive bioassay data on the aquatic populations is included in the referenced 1977 studies. I D) Noise during construction and operation is expected to be lower for mechanical draft I towers and less localized. I E) The makeup of a closed cooling system is about 2% of the existing once-through I flow. This reduces the entrainment of aquatic life appreciably as Reference 2 shows. Thermal heat to the Hudson River is also reduced proportionally. I F) Installation of a closed-cycle cooling system necessitates replacement of the fossil I power generation due to the amount of the Indian Point 3 derate and the plant downtime during the cut-in of the cooling tower. This will result in added I atmospheric discharges of fuel based oxides and particulates. I I O3moo.OO2 28 I I
I I It should also be noted that the mechanical draft towers have a lower profile, similar to that of other plant buildings, but are much more extensive than the natural draft designs. The I visual impact of the mechanical draft tower should be evaluated with their frequent lengthy I plumes when comparing them with the effects of the natural draft design. I I I I I I I I I I I I I I 037770(UI02 29 I
I I I I I I I I I I I I I (J t~ (~
~.~
(~.~ I :2
,.f:,
I I I I I
I I 4.0 BOWLINE ALTERNATIVE CLOSED-CYCLE COOLING SYSTEMS 4.1 SummaD' Descriptiye of Existjo& Plant I Bowline Point Station consists of two 600-MW oil-fired and natural gas-fired units, located I in the town of Haverstraw, NY, on the west bank of the Hudson River at river mile 38. The present cooling system is a once-through design. The brackish circulating water is pumped I from the Hudson River, passed through the condenser where it is heated, and released again to the river. I The design circulating water flow rate is 375,620 gpm per unit. However, the flow may I occasionally be restricted as low as 257,000 gpm per unit under the terms of the Consent Order of 13 March 1992 (reference 6) I As stated in Section 1, above, a number of alternative cooling systems have been considered I as replacements for the once-through system. Three offer mature designs and are evaluated I carefully: I
- Wet natural-draft cooling tower Wet mechanical-draft cooling tower I
- Dry cooling tower I A 1977 study (reference 4) evaluated both types of wet cooling towers. The optimum natural-draft ~e considered circulating water flow rates, condenser performance, capital I and operating costs. The optimum mechanical-draft case considered the same costs. Both cases included the effect of changed circulating water flow rate on costs of pumps, piping, I and water-treatment systems.
I The present study obtained new costs of the cooling tower for these optimum cases. The natural-draft design is quite similar to the 1977 study. However, the present study uses a I wet mechanical-draft tower rather than the previous wet-dry design, as wet-dry tower I 0377700.002 30 I
I I technology since 1977 has not matured into wide spread use, as indicated in Section 1. All wet cooling tower discharge a small stream called blowdown, requiring an SPDES permit. I They are evaluated in this Section. The dry cooling tower discharges no water, requiring no SPDES permit, but presents special engineering challenges. Therefore, it is separately I addressed in Section 6. I 4.2 Natural Draft Coolin& Tower 4.2.1 Engineering Aspects I Each of the two units at Bowline Station would have one natural-draft tower. The tower I would have a reinforced concrete shell of hyperbolic cylindrical shape. The base diameter would be 331 feet and the overall height 454 feet. A review of the current site drawing I indicates that the 1977 tower location (Figure 6) is still appropriate. The structure would require about 2.0 acres of land area and there is adequate ground space. The tower would I have a circulating water flow rate of 295,130 gpm to disperse a heat load of 2.795 x 109 BTU/hr, the same as in the 1977 study. I I Construction of the cooling tower would require two construction seasons. Therefore, the tower could not be commissioned until at least two years after the decision to install it. The I overall construction of the retrofit project is however estimated to require about 3 years. I Approxirr River. ) t 1 l, would be discharged to the Hudson i be lost through evaporation, as the I thermally w17i
- le circulating water sprayed on the fill.
Approxim ~ ~r to (" -iov.k::r) Iting water flow) would be lost through I drift, as v deposited
, ~ r 10 c.o ...r-t:/I- fl¥'r air flowing through the tower and are s (8,000 gpm) are replaced by makeup Do -1<<>+/ A . "
I I water dra'
-r :r 7o~( ~:if-.
(-uJ~ v~I~) ) 0377700.002 31 I I
Ficure (; BOWLINE SITE PLAN - NATURAL DRAFT TOWER I "'tooo
~ "v. U
- c".,fD G80IMD .0'
...-?=~;:::==- -=--=:::=-_-::'L=...:.~~..:;;t~'
rr-:~J "'"i.';J. _..at_ IL-'.S* __ _ --r+1L~ I ~--::-:~.. (C._E. _ .
.. -I"'" ""..,
(IIte.M" DIX. . . .* - ....., (UMT ... 1) ( ... w"! .....,.. SECTION ** J. .. " .. !:l:,,~'.~~:.'~l
\1 _.____ I~I til -':.i:~*II..-J1 \\ .I!l::_~k l: \ "- KAl' - ,e . to '.0-HUDSON RIVER ~---- ....U'K(~
I ::t 1 1
. ,::.._ :::.. _ .... .' ...,.....I !:~ ., rIC Jl , *. " :Ii .,jf.
_I'.dO' -- BECHTEL ORANGE AND ROO<LAND UTILITIES. I,..C_ po,,,, .1.' _ ;-1-,
~ IO .....U UNitS.., "Avr.'Ta_ ** r. '0 **
NATURAL DRAFT CLOSED CYCLE COOLING svsnM SITE LAYOUT -~
~._-_.
______ .____ I .. ~_. ____ . ----..::"41----- ~\IO;;; \ SK-M-7 .
I I 4.2.2 Cost Estimates I Cooling tower costs are based on data supplied I All other costs are based on the 1977 study 3 equipment and labor costs. ~ .s-; j % C?j.clIYl.,~. ~,." I Section 1 had indicated why the cost of the c( e I overall retrofit costs. The direct, estimated ca e units using the wet natural-draft cooling towel n I escalation rate of 55% is used to project the e: I Cooling Tower $44,~~ --------~->- -------- On-Line/Cleaning 8,480,000 I Piping 15,072,000 Structural 14,602,000 I Electrical 6,550,000 Indirects and Professional Services 25.890.000 I TOTAlS $115,200,000 I The annual capability loss of the tower is determined in the same manner as in the 1977 study. The duty and circulating water flow would be the same, so the condenser rise and I 'lTD would be unchanged. The effects on the unit heat rate and maximum capacity are estimated from Reference 4. I The circulating water elevation head of the cooling tower in the present study is 34.2 feet, I rather than 37.2 feet in the 1977 study (reference 4). As elevation head is only a portion I of the circulating water total pump head, the power required by the circulating water pumps drops by a negligible amount. There are no fans. The effect on heat rate and unit capacity I is the same as in the 1977 study. I 0377700.002 32 I
I I In addition, the replacement capacity required while the new cooling system is cut-in would I be the same as in the 1977 study. The plant capability losses are as follows: I Plant Capability Losses (per unit) Loss of turbine capability - MW 2.5 I Cooling System Auxiliaries - MW Total Derating - MW 1.7 4.2 I Max. Peak Temperature Derate - MW 22.1 I The present average heat rate of 9500 BTU/net Kwh (VWO, 5% OP) would be 9566 BTU/net Kwh after a natural draft cooling tower installation. I 4.2.3 Environmental Aspects I The environmental impact of the natural draft cooling tower is described in detail in the I 1977 cost-benefit report (Reference 1). Some aspects are described briefly here. I Natural draft cooling towers at Bowline would have effects on air quality, ambient noise, terrestrial ecology, water quality, aquatic ecology, and aesthetics. Air quality impacts result I from the evaporation and drift. A potential concern associated with these emissions is fogging and icing, which are already common in the Hudson River valley. Noise effects I would result from the movement of water droplets and air within the tower. Another concern is the potential of drift to deposit small amounts of salt on downwind areas; this is I expected to have negligible effect on terrestrial ecology. The blowdown water discharge would be of quality high enough and temperature low enough to have negligible effect on I aquatic ecology. The reduction of flow as a result of retrofitting a tower is significant at about 97% of the existing once-through flow. I I The visual impact of the vapor plume and concrete structure of the cooling tower is a major effect. I 0377700.002 33 I
I I 4.3 Mechanical-Draft Coolioe Tower 4.3.1 Engineering Aspects I A mechanical-draft cooling tower may be built of either wood or reinforced concrete, and I may be in counter-flow or cross-flow configuration. A wood-built, counter-flow configuration was assumed for this study because of its compactness, economy, efficiency, low pumping I cost, and demonstrated reliability over a 30-year plant life. I Each of the two units at Bowline Station would have one mechanical-draft tower. The I tower would consist of approximately 18 rectangular cells arranged side-by-side in two horizontal structures. A review of the current site drawing indicates the 1977 tower location I plan is still appropriate. Each cell would have a width of 54 feet, a length of 60 feet and a height of 33 feet, including the fan stack height. The structure would be constructed of I wood supports. The structures would require about 1.4 acres of land area per unit. This tower is larger than the one evaluated in 1977, as the present study uses a design approach I of l00F rather than 15C1f". There is adequate ground space, although the orientation to the prevailing wind may be unfavorable (reference 10, figure 2-1). The circulating water flow I rate would be approximately 321,960 gpm and the pumping head would be approximately 26.5 feet The fan power consumption would be approximately 200 hp per cell. I Construction of the cooling tower itself would require two construction seasons. Therefore, I the tower could not be commissioned until at least two years after the decision to install it. The overall construction of the retrofit project however is estimated to require about 3 I years. I Approximately 2,889 gpm of water, the blowdown, would be discharged to the Hudson River. Approximately 4,719 gpm of water would be lost through evaporation, as the I thermally induced air flow comes in contact with the circulating water sprayed on the fill. Approximately 7 gpm of water (.002% of the circulating water flow) would be lost through I drift, as water droplets become suspended in the air flowing through the tower and are I O3moo.OO2 34 I
I I deposited on the surrounding land. All three losses (8,000 gpm) are replaced by makeup water drawn from the river. However, the drift would be deposited over a smaller area than I for the natural-draft cooling tower. I 4.3.2 Cost Estimates I Cooling tower costs are based on data supplied in January 1993 by cooling tower vendors. I All other costs are based on the 1977 study (Reference 4), escalated to reflect 1993 equipment and labor costs. I The capital costs of the system using the wet mechanical-draft cooling tower are tabulated I below. I The annual capability loss of the tower is determined in the same manner as in the 1977 study. The duty and circulating water flow would be the same, so the condenser rise and I lTD would be unchanged. The effects of the lower water temperature on the unit heat rate and maximum capacity are estimated from reference 4. I The circulating water elevation head of the cooling tower in the present study is 26.5 feet, I rather than 37.2 feet in the 1977 study (reference 4). As elevation head is only a portion of the circulating water total pump head, the power required by the circulating water pumps I drops by about 10%. The tower in the present study has 18 fans of 200 HP each, instead of 20 fans of 200 HP each. Though likely within the accuracy of the estimate this also I reduces fan power by 10%. I In addition, the replacement capacity required while the new cooling system is cut-in would be the same as in the 1977 study. I Section 1 had indicated why the cost of the cooling tower is only a small portion of the I overall retrofit costs. The direct estimated capital cost of both Bowline units using the wet I 0377700.001 3S I
I I natural draft cooling tower are listed in 1993 dollars as follows. Note that an escalation rate of 5.5% is used to project the earlier costs to 1993. I $15,430,000 Cooling Tower I On-line/Cleaning 9,846,000 Piping 16,486,000 I Structural 13,426,000 Electrical 10,100,000 I Indirects and Professional Services 18,936,000 TOTALS $84,200,000 I The plant capability losses are as follows: I Plant Capability Losses (each unit) I Loss of tur1~ine capability - MW 0.4 Cooling System Auxiliaries - MW 4.8 I Total Derating - MW 5.2 Max. Peak Temperature Derate - MW 13.0 I The present average heat rate of 9500 BTU/net Kwh (YWO, 5% OP) would be 9583 I BTU/net Kwh after a mechanical draft cooling tower installation. I 4.3.3 Environmental Aspects I The 1977 cost-benefit study does not discuss environmental impact of mechanical-draft I towers (Reference 1). However, a mechanical-draft cooling tower would have effects on ambient noise, aesthetics, terrestrial ecology and aquatic ecology. I I 0377700.002 36 I I
I I The most severe effect is from tower noise. Measures can be taken to attenuate the fan and inherent water noise. However, it should be noted that the measures are very costly, I increase fan horsepower and are not completely effective. I The effects on terrestrial ecology and aquatic ecology are similar to those for a wet natural-draft cooling tower. I The mechanica1-draft tower is a low, rectangular structure, reducing the area of potential I visibility as a passive structure. However, in operation the mechanical draft tower will emit a significant visual plume for most of the year. It should be compared on this latter visual I basis to a natural draft tower and not simply as a passive structure. I I I I I I I I I I 0377700.002 37 I I
I I I I I I I I I I I I I I I I I
I I 5.0 ROSETON ALTERNATIVE CLOSED CYCLE COOLING SYSTEMS 5.1 Summaa' Description of ExistinK Plant I The Roseton Generating Station, jointly owned by Central Hudson Gas and Electric I Corporation, Consolidated Edison Company of New York, Inc. and Niagara Mohawk Power Corporation is located on the west shore of the Hudson River, approximately 66 miles I upriver of the Battery, near the town of Newburgh, Orange County, New York. The Roseton facility consists of two 600 MWe (net) steam electric units capable of burning both I residual No. 6 oil and natural gas. I The Roseton Plant was constructed in the period 1969-1974. Unit 2 began commercial operation on September 14, 1974 and Unit 1 began commercial operation on December 31, I 1974. The main plant building and stacks are located approximately 1,700 ft from the shoreline and, except for an oil tank area located within an earthen embankment and I miscellaneous service buildings and structures, the ground is level to the river edge. The station is visible from the river itself, from its east shore, and to anyone crossing the river I on the Newburgh-Beacon Bridge, which is situated south of the plant. I The existing condenser cooling system of the Roseton Station is an open-cycle system. An I intake structure is located directly on the river edge. Four pumps located in the intake structure withdraw water (about 641,000 gpm at maximum pumping) from a depth of I approximately 26 feet. Water discharged from the four pumps flows through a single buried, 12-foot diameter, pipe to the steam condensers in each unit. The heated water from both I units is returned to the Hudson River through another buried, 12-foot diameter pipe. Heated effluent is discharged into the Hudson River through a submerged, multi-port I diffuser located approximately 500-feet downstream from the intake. The diffuser is oriented parallel to the longitudinal river axis, with 14 ports discharging essentially I perpendicular to the river flow. Locations of the main plant building, and the intake and discharge structures are shown in Figure 7. I 0377700.002 38 I I
I I PAPER FOR DAN (RN 5/25/93 DS> I suas I o 0 I ~,
~,.., '
I "" II I I I ) I I I I
\ \
I I I I Figure 7 GENERAL ARRANGEMENT FOR THE TWO NATURAL DRAFT COOLING TOWER CONFIGURATION AT ROSETON I I
I I Under the current Consent Order (Reference 6), the the volume of river water drawn into the Roseton pI IhJ. ~ eo~
~ 31 0....40 j ()-rQ .u+ ~
I efficient operation. These volumes are approximated below. Because the number of pumps which mus ~~ d . I dependent upon the number of units in operation and ~ rates for precise periods in the future cannot be spec I from those set forth in the chart because of the neel other permit requirements. I Expected Maximum
---~be(--
of Pumps Approximate River Temperature I Awroximate Period Jan. 1 - May 14 Durin& Period (F) 60 Operatin& 2 Flow. wm 418,000 I May 15 - June 14 June 15 - Sept. 24 71 82 3 4 561,000 641,000 I Sepl 25 - Oct. 16 72 64 3 2 561,000 418,000 Oct. 17 - Dec. 31 I I 5.2 Natural-Draft Cooline Tower 5.2.1 Engineering Aspects I The natural draft tower design for the Roseton Station is the same as that determined in I Reference 5. A recent site drawing was reviewed to ensure the design was still properly located. One tower is provided for each unit. Each has a reinforced concrete shell of I hyperbolic cylindrical shape. the base diameter of each would be 310 feet and the overall height 390 feel The tower would have a circulating water flow rate of 254,400 gpm, a range I of 19.1OP and an approach of 18°F with an ambient wet bulb temperature of 75°F. I At the design point, about 4,600 gpm of water per unit would be lost through evaporation, as the thermally induced air flow comes in contact with the warm circulating water sprayed I 0377700.002 39 I I
I I on the fill. To maintain the concentrations of the circulated water at reasonable levels, approxiinately 3,100 gpm of blowdown water per unit would be discharged to the Hudson I River. Approximately 5 gpm of water would be lost through drift. Drift is the term used for water droplets which become suspended in the air flowing through the tower and are I deposited on the surrounding land. All three categories of water usage are replaced by makeup water drawn from the river. As reported in Reference 7, annual water consumption I .by the two unit station would be about 3.5 billion gallons per year. I 5.2.2 Cost Estimates I Section 1.4 had indicated ",1-- - I. ! ('r/c.1. ly a small portion of the overall retrofit costs. 11 tp>',)f,A-'p", n for both Roseton units I using the wet natural-dr. -1??,'$ft';)--- 1 J) as follows. c!"/~ ~e:. II' ,J $35,900,000 I Cooling tower Allowance-On-Lint J I'~ o';7 1. .. J~4* L $ 9,800,000
" ~ t ;;~ $ 5,800,000 .
I Piping Structural 4v'-
/ #ftl~) v ...,if e~
(., ~ 4~""
/I /- -~)..) 'fCl"" $18,500,000 to (0"-- $ 1,300,000 I Electrical Indirect and Professio; 52Q,700.ooo $92,000,000 I Totals I Cooling tower costs are based on 1993 data supplied by cooling tower vendors. All other inherent construction and installation costs of the 1977 studies were reviewed and found to I be appropriate. They are escalated and updated from the 1977 studies to reflect 1993 material and labor cost. They are based on an average escalation rate of 5.5%. Piping, and I pump replacement costs are also based on the 1977 study but updated to 1993.
I Section 1.4 also discusses the kinds of new added operating costs and the related loss in station efficiency which occurs when retrofitting. I I 0377700.002 I
I I The plant capability losses of this alternative are as follows: I Plant Capability Losses (each unit) Loss of turbine capability - MW 4.6 I Cooling System Auxiliaries - MW Total Derating - MW 2.1 6.7 I Max. Peak Temperature Derate - MW 22.7 I The present average beat rate of 9393 BTU Inet Kwh (VWO, 5% OP) would be 9503 BTU Inet Kwh after a natural draft cooling tower installation. I 5.2.3 Environmental Aspects I A natural draft cooling tower has effects on ambient noise, terrestrial ecology, aquatic I ecology, and aesthetics. Reference 7 examined the environmental effects of a natural draft cooling tower with respect to fogging, icing, saline drift deposits and noise characteristics. I The results from those studies are still relevant today and are summarized below. I The cooling towers would induce 85 hours of fog and 45 hours of icing (concurrent with fogging hours) in a representative year. Principal effects would be: I (a) Restricted visibility on the West Shore and East Shore Railroads, on the I Hudson River (navigation), in the hamlets of New Hamburg and Chelsea on the east shore of the River and in the residential areas north and south of I Roseton on the west shore of the River. I (b) Icing on the West Shore and East Shore Railroads, in the hamlets of New Hamburg and Chelsea and in the residential areas north and south of Roseton I on the west shore of the River. I 41 03moo.OO2 I I
I I Maximum Deposition Rate Maximum K&/Km2 /Mo. I June July 19 50 August 70 I September October 26 70 I The principal increase in community noise was predicted to occur at those residences I located within the industrially zoned area south-southwest of the Roseton Plant property. I The number of stems of tree species which will be affected by salt aerosol drift over the life span of the proposed cooling towers at Roseton was estimated as follows: I Species Low Best High I Estimate Estimate Estimate I Flowering Dogwood 0 0 12, 137l+/-.1,228) I Canadian Hemlock 0 0 7 I White Ash 0 0 7,442 l+/-.1,371) Cherry 0 0 2,370 l+/-.536) I It is concluded that the reaction of the public to the scenic intrusion of a natural draft I cooling tower will be highly variable ranging from approval by a few to strong objection by many who regard the scenic and historic Hudson Valley as a major regional or even national I treasure. It is expected that the strong objections by many in the 1970's is still the prevailing thought in the 1990's. I I 037T1OO.002 42 I I
I I 5.3 Mechanical-Draft CooJinl Tower Not;.. : 5.3.1 Engineering Aspects 30' ~eo~ I I A mechanical-draft cooling tower may be built of e may be in counter-flow or cross-flow configuration. J. I r,~,,"..f 1, was assumed for this study because of its compactl I cost, and demonstrated reliability over a 30-year pJ I The tower for each unit would consist of approximi __"_" ___~_" ____ by-side in two horizontal structures. Each structure would be approximately 400 feet long I and 55 feet wide. The circulating water flow rate would be approximately 254,400 gpm, a range of 19.1°F and an approach of l00Fwith an ambient wet bulb temperature of 75°F. The I pumping head would be approximately 30 feet. The fan power consumption would be approximately 200 hp per cell. I The range and approach selected in this study for a wood rectangular tower are lower than I those selected in Reference 5 for a concrete circular tower. This is due in part to improvements in technology since 1977 and in part to the lower cost of the wood I construction. A recent site drawing was reviewed to ensure the tower locations of reference 5 are still applicable. I I Evaporation, drift, blowdown, and makeup quantities would be about the same as the wet natural-draft cooling tower. However, the drift would be deposited over a smaller area. I 5.3.2 Cost Estimates I The direct estimated capital costs of the system for both Roseton units using the wet mechanical-draft cooling tower are as follows: (1993 dollars) I I 0377100.002 43 I I
I I Cooling tower $ 9,500,000 Allowance-Oll $ 9,800,000 I Piping $ 7,700,000 Structural $21,100,000 I Electrical $ 7,100,000 Indirect and 516.100.000 I Totals $71,200,000 I Cooling tower cost! ing tower vendors. All other inherent constructi~_________~ _____ ~ I> ~ ,s were reviewed and found to I be appropriate. They are escalated and updated from-me 1977 studies to reflect 1993 I material and labor cost. They are based on an average escalation rate of 5.5%. Piping, and pump replacement costs are also based on the 1977 study but updated to 1993. I Section 1.4 also discusses the kinds of new added operating costs and the related loss in I station efficiency which occurs when retrofitting. I The plant capability losses of this alternative are as follows: I Plant Capability Losses (each unit) Loss of turbine capability* MW 1.8 I Cooling System Auxiliaries* MW 3.4 Total Derating* MW 5.2 I Max. Peak Temperature Derate* MW 13.4 I The present average heat rate of 9393 BTU Inet Kwh (YWO, 5% OP) would be 9479 BTUInet Kwh after a mechanical draft cooling tower installation. I I I 0377700.002 44 I
I I 5.3.3 Environmental Aspects. I A mechanical-draft cooling tower would have effects on ambient noise, aesthetics, terrestrial ecology and aquatic ecology. I The most severe effect is from fan noise, produced by motors, gearboxes, and fan blades. I Fan motor noise could be reduced by fitting noise attenuators around the motors. The level of noise at any given location depends on the distance from the noise source. Reference 5 I excluded rectangular mechanical draft towers on the basis that the impact of noise would I be unacceptable. Fan gearbox noise could be reduced by using low-noise gearboxes and optimizing the fan blade. Fan blade noise, particularly pure tone noise caused by blade-strut I interactions, could be reduced by specifying an odd number of struts in relation to the fan blades. I Fogging, icing and drift deposits due to the plume from a mechanical draft tower would I affect a smaller area than that of a natural draft tower, but the magnitude of those effects would be greater. .1 The effects on terrestrial ecology and aquatic ecology are similar to those for a wet natural-I draft cooling tower. I It should also be noted that the mechanical draft towers have a lower profile, similar to that of other plant buildings, but are much more extensive than the natural draft designs. The I visual impact of the mechanical draft tower should be evaluated with their frequent lengthy plumes when comparing them with the effects of the natural draft design. I I I I 03n700.002 4S I
I I I I I I I I I I
I I '.0 ALL PLANTS: ALTERNATIVE CLOSED*CYCLE SYSTEMS TIlAT DO NOT REQUIRE AN SPDES PERMIT I
'-1 Description Summaries I
Report Sections 2 through 5 present summary descriptions of the existing plant cooling I systems at the Indian Point 2 and 3 and the Bowline and Roseton Stations and the evaluation of alternative wet closed cycle cooling systems for each of these plants. The I alternative wet closed cycle cooling systems included the wet mechanical-draft cooling tower and the wet natural-draft cooling tower. The wet closed-cycle cooling schemes which were I determined to be most suitable for these plants, if an alternative to the present once-through cooling system is required. I The selection of a particular wet tower system design depend on the associated economics, I available site area, and environmental considerations such as drift dispersion ability, height limits, plus other considerations discussed in the report. Since these are retrofit I applications, environmental, engineering, and economic factors impose more restrictions than when evaluating wet cycle tower systems at a new site. These selections are not I unusual because wet towers are generally utilized if sufficient water is available to I accommodate the evaporation and blowdown requirements. Also available are "wet/dry cooling towers," which combine the features of the wet and dry cooling towers. However, I the wet/dry technology was eliminated from consideration for the Hudson River plants for the reasons cited in Section 1.2. I All of the above alternative cooling configurations, however, convey the heat load to the I atmosphere mainly by the process of evaporation. Based upon current SPDES regulations, it appears that all cooling devices other than air cooling requires an SPDES permit. I Therefore, an additional cooling device which will be evaluated with respect to these sites is the "dry" cooling tower. The dry cooling tower involves only sensible heat exchange and I 0377700.002 46 I I
I I consists of fixed tubes that totally enclose the medium to be cooled, making mass transfer through evaporation impossible. I Compared to a wet tower, the dry cooling tower requires considerably more land area and I cools at a considerably higher capital and operating cost. Operating costs are higher because the plant operates less efficiently but the dry tower can provide a technically viable I and environmentally attractive option in certain instances where water is not available provided operating or economic restrictions are not so extreme as to preclude its use. I 6.2 Commercially Available DO' Coolin2 Towers I Dry cooled condensers may utilize either direct or indirect condensing. (Ref. 8) In the I direct type, turbine exhaust steam is condensed within tubes which contain external fins. The huge quantities of tubes which are required for large scale power plants, such as the I plants being evaluated at these Hudson River sites, are required to be grouped together in sections which are then generally configured as A-frames for efficient utilization of space. I (Please see Fig. 8). Mechanical fans are installed in the horizontal deck below the A-frames spa~e I and discharge air upwards past the surfaces of the tubes. A large open is required to be provided below the fan deck to admit the enormous air volumes required for cooling. In the indirect system, turbine exhaust steam is directed to the main steam surface condenser. The condenser cooling water is circulated through the dry cooling tower. Because the indirect system circulates a coolant to the air cooler and then back to the surface condenser, it requires all of the attendant equipment and pumps which are associated with a station circulating water system. The indirect air cooler is nominally larger and more expensive than a direct air cooler for a given duty because of the use of an intermediate fluid. The terminal temperature difference between the turbine exhaust steam and the outlet water temperature of the condenser on the indirect system also creates a higher turbine back pressure. As a result, for a new station design a direct dry cooling system generally presents significant advantages. When considering a retrofit application for either the Bowline, Roseton, or Indian Point Stations, however, it is not a realistic design 0377700.D02 47
I I PAPER FOR DAN (RN 5/25/93 DS) I DIRECT CONDENSING I I Steam turbine I I I Pump. Condensate storage tank I INDIRECT CONDENSING I Dry cooling tower Steam turbine I I I I I Pump Pump I Two dry-cooling designs are the direct type (top) and indirect type (bottom) I Figure 8 DIRECT AND INDIRECT DRY COOLING SCHEMATICS I I
I I option. The complex geometry and expense associated with removing the existing main steam surface condensers and circulating water systems and replacing them with air-cooled I condensing sections and an efficient ducting system to an air-cooled tower system positioned in proximity to the turbine building to minimize duct pressure losses and at the same time I utilize available site area for the towers is not practical. I Therefore, although the indirect system requires extra circulating water system type components and provides relatively lower inherent performance potential as compared to I the direct system, limitations imposed by the retrofit applications at the Hudson River plants precludes direct dry-cooling from further consideration. I 6.3 Endneerina Aspects I As a result of the required plant capacity requirements and in accordance with the I established study objectives, only dry-cooling designs which allow the plants to produce full load will be evaluated. Mitigation alternatives or strategies which involve reductions in load I are not considered to be viable alternatives and will not be evaluated. Indirect dry-cooling systems will be evaluated assuming the plants will be required to have the capability to I generate close to their existing maximum output. I The engineering investigations will focus on those plant components or systems where the I relative cost impact or component design limitations associated with dry cooling are greatest. Based upon the requirement to maintain plant full load capability, the impact of dry cooling I on the steam generators, boilers and associated equipment is minimized and will not be addressed here. The infrastructure, civil works, and auxiliary plant components represent I approximately 15 to 25 percent of the overall plant costs. The plants turbine/generators and cooling systems represent approximately 25 percent and 15 percent of overall plant costs, I respectively. An exact assessment of the impact of retrofitting dry cooling on plant infrastructure is difficult to determine in advance of developing preliminary system designs I which are beyond the scope of this report, but representative order of magnitude estimates 0377700.002 48 I I
I I of the physical and economic impacts will be made. The engineering and economic evaluations thus will be focused on the turbine and cooling system areas of the plant which, I as indicated, represent approximately 40 percent of the overall plant cost and which are also highly deterministic of the net cost associated with annual energy production. I 6.3.1 Ovemew of the Mechanism of Indirect Dry-Cooling at the Hudson River Plants I (Ref.8) I For dry heat transfer, surfaces with external fins are typically utilized. Finned tubes are the most expensive component in the dry-cooling system and their proper selection is critical to I the long-term success of the specific plant cooling design. Numerous types of finned tubes are commercially available - integral, extruded bimetal, helically wound, galvanized types, I and others. Two common tube designs are elliptical steel tubes with steel fins, dip galvanized, and round steel tubes with aluminum fins. This elliptical design provides good I service life. This round design provides good heat transfer properties at a given cost. However, the service life may not be as long as the elliptical tubes with steel fins. I I For either selection of tube design, the heat of the ambient air passing over the air-cooled condenser tubes is increased exactly by the heat of the condensing steam from the turbine I exhaust. For dry finned tubes, the temperature of the air increases. For an indirect system, the heat transfer gradient is provided by the difference in temperature between the cooling I air and the coolant for the condensing steam. In most air-cooled condenser designs, usually not more than three to six rows of tubes are crossed in succession by the air which passes I over them. The successive increases in air temperature produces a successively higher steam condensing pressure in each row. A large air volume is required to be delivered to cool the I finned tube sections. A corresponding area of finned tubing is necessary. In order to develop minimal estimates of the required site plot areas for the retrofit applications at the I Hudson River sites, forced mechanical draft air cooled exchangers with the bundles placed in typical A-frame type configuration will be used as the basis for the economic evaluations I rather than other developmental designs or natural draft tower concepts. 037T700.002 49 I I
I I The terminal temperature difference between the turbine exhaust steam and the outlet water temperature of the condenser on the indirect system is one of the governing I parameters in the Hudson River site evaluations since this will create a higher back pressure on the existing low-pressure turbines than was originally designed for. The situation is I particularly exacerbated during warm summer weather when these lower Hudson River I plants are needed to deliver peak electrical loads to the entire service area. I The engineering and operational impact on the turbine-generators due to the higher back pressures associated with dry cooling is a critical aspect of the dry cooling feasibility I determination. I 6.3.2 Turbine Back-Pressure at the Hudson River Plants Using Indirect Dry-Cooling I While the thermal performance of wet cooling towers is dependent on the existing ambient wet-bulb temperature, the performance of dry cooling towers is dependent on the local dry-I bulb temperature patterns. The dry bulb temperature can equal or be greater than the wet bulb temperature, but it is usually 10 to 300F higher during heat waves. I The following definitions are provided for reference. I DIY-bulb temperature: Temperature of the atmosphere as measured by a mercury I thermometer or other device. The 1 percent dry-bulb temperature is the temperature which exists for 1 percent of the time during the warm weather months based upon historical I records for the site vicinity. I Approach: In a dry-tower system, the approach is generally the difference between the initial temperature of the process gas or fluid which requires cooling minus the ambient dry-I ~ulb temperature. I Ranee: Refers to the temperature range between the initial and final-cooled temperature. I 0377100.002 so I
I I Turbine backpressure corresponding to the Hudson River plants 1 percent dry bulb temperature may be estimated as follows: I Variable value I 1% dry-bulb temp., Of' 94. Ref. 9 Approach to dry-bulb, of 30. Engineering estimate I Indirect range, of 35. Engineering estimate Condenser approach, of ...lil. Engineering estimate I Total temperature = 169°F I Therefore, the turbine maximum exhaust pressure corresponding to the 1 percent dry bulb I temperature will be the exhaust pressure corresponding to a 169°F saturation temperature, or 12 in. HgA. Similarly, the turbine exhaust pressure corresponding to approximately the I Hudson River 5 percent dry bulb temperature is estimated to be in the range of 10 in. HgA. I 6.3.3 EtTect or High Turbine Backpressures at the Indian Point 2 and 3 and the Bowline I and Roseton Stations I Large steam turbine-generators are critical components in the Indian Point 2 and 3 and the I Bowline and Roseton Stations. Their continued reliability and operating efficiencies are a I key consideration when making decisions with respect to plant modifications which could I in any way effect their design service conditions. For these existing stations, any plant modifications which negatively influence the generating capability or the reliability of the I turbine-generators will result in substantiallifecycle cost increases for the turbine-generators I and, ultimately, for the station. These costs would include the cost of additional electrical I 0377700.002 SI I
I I system generating capacity needed to compensate for expected turbine-generator unavailability or reductions in efficiency, as well as the cost of replacement energy which I would need to be supplied from other (possibly more expensive) sources when the turbine-I generator is unexpectedly out of service. These indirect costs could become substantial over the life of these plants. Small changes in the plants availability, reliability and operating I efficiency have a substantial impact on the overall costs associated with producing electricity. I I The plants involved are: PlANT BOWLINE ROSETON INDIAN P2 INDIAN P3 I TYPE Fossil Fossil Nuclear Nuclear Location-NY Haverstraw Newburgh Peekskill Peekskill I No. Units 2 2 1 1 I NetMW 600 600 941 965 The Bowline and Roseton plant's turbine-generators are 3,600 rpm tandem-compound four-flow reheat machines which contain 3D-inch last stage blades. The Indian Point 2 and 3 plant's turbine-generators are 1800 rpm tandem-compound six-flow machines which contain 44-inch last stage blades. In the case of fossil and nuclear condensing steam turbine-generators of the large sizes represented here, the turbines are carefully matched to the overall cycle and assigned site conditions. The selection of the optimum low pressure steam turbine exhaust annulus area and the cooling system design are governing parameters in the design of the. plants. Based upon the original design of these plants being cooling using Hudson River water, their low 0317700.002 52
I I pressure turbines have been designed for backpressures in the range of 1 1/2-inches to 3 1/2-inches HgA. As derived in Section 6.3.2, utilizing indirect dry-cooling at these plants I would require the capability to operate with condenser backpressures of up to 10-inches to I 12-inches HgA. I Therefore, the capability of the existing low-pressure turbines to operate efficiently and I reliably at the higher back-pressure limits must be determined. Steam turbine blade failures are the leading causes of large steam turbine unavailability. Based upon industry I experience, most reported blade failures occur in low-pressure turbines and most of these I failures are found in the last two rows. These are the rows which would be most I significantly influenced by operating with higher exhaust end back-pressures. Last stage blading design is very challenging for the following reasons: I I
- The volume flow through the last stage is greater than for other turbine stages. Therefore, the last stage blades are the longest and have the highest I mechanical stresses.
I
- The last stage produces substantially more energy than any other stage in the I turbine - up to 10 percent of the overall unit output. Therefore, maintaining I last stage efficiency is critical.
I I 0377700.002 53 I I
I I
- Long last stage blades require tuning to assure that there are no resonant modes coincident with the low multiples of running speed.
I I
- Last stage blades operate in a wet steam environment.
I
- Last stage blades operate under variable exhaust volume flow, and I thermodynamic conditions with changes in load and back pressure. Some of these conditions can result in flow-excited vibrations, such as stall flutter or I buffeting.
,I I Therefore, as a result of the highly complex design aspects of the low pressure turbines last stages of blading, assessments of high back-pressure operating capabilities are best addressed I by the turbine suppliers. Several turbine-generator suppliers have been contacted and were I requested to address operating these Hudson River plants at the higher back pressures which are associated with dry cooling. They were requested to describe some of the I technical problems associated with operation at the higher backpressures, to describe I backpressure limits being provided for their newer designs (since the existing designs are approximately 20 years old), and to assess the feasibility to modify the existing designs to I suit the new operating limits being proposed. I With respect to the technical concerns associated with high backpressure operation, the I respondents indicated several concerns. High exhaust pressure may cause overheating of I 0377700.002 54 I I
I I the low pressure turbine blading and possible extreme thermal expansion. During low steam flows through the low pressure turbines associated with low loads, the blades do work on I the steam. This is called "negative work" and heats the steam. High exhaust temperatures I can cause damage to the low pressure turbine blading. Distortion of the exhaust areas can affect bearing loading, gland clearances, and blade path seal clearances. Vibration, seal I rubs, and steam leaks can result. These can cause considerable equipment damage and can I result in a forced shutdown of the unit. The basic concern associated with off-design, high I backpressure operation at higher loads is one of high dynamic buffeting loads that can cause high-cycle fatigue damage and ultimate failure of the blading. I I The manufacturers generally indicated that specialized high backpressure (in the range of up to 1S-inches HgA) turbines can and have been provided. This generally means a unit I which has been specifically designed to suit the given application and which contain shorter I (20 to 25-inch vs. the 30 to 44-inch last stage blade lengths which have been used on the Hudson River machines), more rugged last stage designs and associated limits on the flow and loading capability of a given turbine element. With respect to the backpressure limits for current machine designs of the same general end size as the Hudson River plants, the manufacturers generally recommend alarming in the range of S.O-inches HgA and tripping in the range of 71h-inches to 8.O-inches HgA backpressure. However, operation at backpressures approaching 8.O-inches HgA, where permitted, generally requires that specific limits with respect to load and time durations at 0377700.002 ss
I I these higher backpressure limits be observed. Based upon the responses received [to date], I even the current nuclear and fossil turbine designs do not provide the operational flexibility for the lO-inch to l2-inch HgA backpressure limits desired for dry-cooling at these Hudson I River plants. I With the respect to the capability to modify the existing machines, one manufacturer stated I that the option to operate at up to lO-inches HgA would probably mean operation with the I last one or two low pressure turbine stages removed, with an associated substantial reduction in unit capacity. Much more detailed engineering studies would be required to determine I if this option is feasible. However, the need to maintain close to full load capability at these I Hudson River plants does not make this a viable option, assuming that it could be accomplished. I Other manufacturers stated that it might be possible to design new back ends to retrofit into these machines; however, this would represent major modifications and substantial engineering studies to evaluate this further. None of the manufacturers who have responded to date have indicated that they currently have any development programs in-place associated with high backpressure operation. Therefore, at this time, it does not appear that this type of a development program represents a realistic near term option for these Hudson River plants. However, not all manufacturers have responded to date on this issue. 0377700,(102 56
I I 6.4 Cost Estimates I Section 1.4 discusses the cost of the cooling towers as compared to the overall costs I associated with retrofitting a closed cycle cooling system. The capital costs of the cooling I tower fabrication and materials for a system using an indirect dry mechanical draft tower listed in 1993 dollars, are as follows: I I PLANT Cost-CooIini Tower Fabrication and Materials Bowline (each unit) $19,000,000 I Roseton (each unit) 16,000,000 I Indian Point 2 51,000,000 Indian Point 3 51,000,000 I In addition to those high cooling tower material costs for the indirect dry system, substantial costs would also be incurred associated with retrofitting the dry towers. These costs would be associated with the significant land usage requirements and the costs associated with the balance of plant equipment. Based upon the development nature of the modification required for the turbine-generators, plus significant other balance of plant equipment costs, the costs associated with the dry-cooling retrofit are considered to be prohibitive. 0371700.002 57
I I 6.5 Environmental Aspects I An indirect dry mechanical-shaft cooling tower would have effects on ambient noise, I terrestrial ecology, and aesthetics. The most severe effect would likely be associated with fan noise produced by motors, gearboxes, and fan blades. The major advantage associated I with the dry tower is the elimination of the need for an SPDES permit due to elimination I of water usage for cooling. I 6.6 Summan and Conclusions I I The use of dry-cooling systems in large scale (greater than 100 MWe) power plants is a relatively new development which has only recently begun to experience growth worldwide. I In general, plants which have utilized dry cooling have been located at sites where wet I cooling systems could not be used economically due to a shortage of water. New, large dry-cooled plants have been successfully built following highly detailed design optimizations I which determine the optimum location of the cooling cells with respect to the balance of plant, and which carefully match the turbine design and operating characteristics to the dry-cooling system design from the outset of the plant design. This involves matching the main power cycle design characteristics with annual ambient air temperature and required operating loads during periods of peak temperature conditions, considering auxiliary loads and fan power requirements associated with dry cooling. 0377700.002 58
I I At a new site, given the flexibility to optimize these design variables, large-scale dry-cooling has been shown to be a viable option where alternative cooling system designs could not be I utilized. However, for retrofit applications such as at these Hudson River plantS, the design I is necessarily compromised. As discussed in the above sections, the higher turbine back-pressures associated with an indirect dry-cooling system are not compatible with the existing I turbine generator units. Modifications to these machines which involve eliminating the last I row of blading with the attendant reductions in load carrying capability are not considered I to be viable options. Modifications to these units to allow full load carrying capability with the higher backpressures would require significant new and costly development programs I and equipment modifications which are not considered to be viable near term options for I these plants. I Therefore, since other more viable closed loop cooling system designs have been I demonstrated to be available if alternative cooling measures are required, it is not believed that dry-cooling should be investigated further at these plants. I I I I I I 0377700.002 S9 I I
I I I I I I I I I I I I I I
I 7.0 (1) REFERENCES Report on Cost-Benefit Analysis of Operation of Hudson River Steam Electric Units I With Once-thru and Closed Cycle Cooling Systems, July 1, 1977. I (2) Economic and Environmental Impacts of Alternative Closed-Cycle systems for Indian Point Unit No.2, December 1, 1974. I (3) Economic and Environmental Impacts of Alternative Closed-cycle Cooling Systems I for Indian Point Unit No.3, January 1976. (4) Cooling Tower Study, Bowline Point Station, Units 1 and 2, June 1977. I (5) Central Hudson Gas & Electric Corporation, Roseton Steam Electric Station Cooling I Tower Installation Study, February, 1978. (6) Consent Order from the Supreme Court of the State of New York, dated March 13, I 1992. I (7) Roseton Generating Station Engineering, Environmental (Non-biological), and Economic Aspects of a Closed-Cycle Cooling System, July 1977. I (8) "Air Cooled Condensers", Power, January 1990. I (9) USA FETAC Air Weather Service Data, Newburgh, New York, 1942-1969. (10) Environmental Analysis of Natural Draft Cooling Towers for Bowline Generating I Station, January 1977 I I I I 0377100.002 60 I I
I I I I I I I I I I
.-I f!? "tb~ /3'1 80 EnV lre. l( Inc*
PhOne: 414/547 -0141 Fox: Po. Box 1604 414/547 -4120 Waukesha, W/53187-1604 Telex: 201312 ENWA UR 1901 S. Prairie Avenue January 19, 1993 Waukesha, WI 53186-7360 Ston e" Webster Engineering Corporation 245 Summer Street Boston, MA 02210 Attention: Mr. Tom Adams
Subject:
Flne Screen Evaluation
Dear Tom:
The attached report evaluates the impact of retro-fitting the existing intake s at Indian Point, Roseton and Bowline Stations. The chart breaks the project into five (5) major considerations and determ ines the degree of impact to each of the screen modules. The write up discusses the impac t to each module and provides a course of action. The sketches have been provided to supple ment and clarify the written discussion. Please review this information and then give me a call and we can discuss site by site. Michael Quick Product Sales Manager Traveling Water Screens MQlmmf c;m"I and em,II water and wastew ater treatme nt equipm ent
Project: Backfitting existing thru-flow traveling water screens with 2mm - 6mm opening screening media. Backfit being explored as a means to prevent marine life larvae from being entrained into cooling water system. Current screening media is equivalent to \Va square opening. Project Sco.pe: Address operational and environmental considerations that are brought about by the proposed backfit and anticipated increased velocities and differentials. Provide potential mechanical and material enhancements that will ensure adequate performance and reliability. Identify areas that will require either laboratory testing/demonstration or field testing/demonstration or both. Equipment should provide service life and reliability at least equivalent to the current designs employed. (1) ~ Consideration No.1, Increased Velocities: Potentially Higher Differentials Higher Traveling Speeds wm.t- ~ - Fatigue The allached charts indicate the increa:~:;;ties, resulting from using the smaller opening mesh, will generate higher differentials at§ual percent clean figures]for the current mesh. The rate of blinding is dramatically increased, thus the potential for a catastrophic failure is increased unless the following changes are made to the drive system. These items can all be~umerically modelin'} no field or lab work. ] "cde 'It-&
- Possibly the use of a 4-speed or variable speed (preferred) drive to more accurate respond to the rapidly changing conditions.
- Possibly increased horsepower to pull out of potentially higher differentials.
- Higher operating speeds to respond to increased loading and plugging rates. Recommend 5 15-30 FPM (see charts).
- Components of the drive system need to be evaluated in light of the increased cycles expected.
Bearings, seals, etc will need to be of the heaviest duty possible. (2) framin& Consideration No.1, Increased Velocities: Higher Differential Fatigue failures The framing sections will need to be evaluated for structural strength in light of the potentially higher differentials. This can be done mathematically. The fatigue factor, due to the higher cycle rates, can cause a premature failure. This will ~so need to be analyzed carefully. :;-
-llle fatigue _tor
- aDalyr.ed carefully. due to higher eycle rates can cause a premature failure. 11ris-wi1l also need to he Consideration No.2, Spray Wash: - Front Discharge Spray Shower At Water Level The impact from the above is minor but needs to be recognized. Brackets etc. will need to be added. Consideration No.3, Sealing: Frame to Baskets Frame to Intake Typical running clearances in a traveling water screen are in the ~. to :Va* range. In order to seal the screen, the following areas must be addressed.
- Z-frame to basket will be sealed by use of a neoprene strip, bolted to the face of the frame, extending to the curved end plate of the basket.
- In the area of the boot section the clearance between the basket lips and the frame will be sealed by use of a stretched sheet of neoprene, bolted at the leading and trailing edges. The seal contacts the basket lips as they articulate around the foot sprocket always maintaining contact with the basket lips.
In order to maximize the efficiency of the screen and to reduce the entrapment of debris and organisms, the following areas are being addressed. Flow into the screen will be directed by wall mounted flow deflectors which will guide debris organisms onto the face of the basket mesh. Side frame of the structure will be sealed to the walls and floor of the intake. The frame will be sealed to the intake wall by both the standard frame guide and a UHMW strip mounted to the face of the Z-Frame extending to the wall. Evaluation of the above seals should be performed in our lab. (3) Baskets Consideration No.1, Increased Velocities: Higher Differen . Basket Vortexin Fatigue Failure Structural integrity The basket will see the highest level of impact as a result of the proposed change in screening media. The higher differentials and increased travel speeds will place a great deal of stress on the baskets. A numerical model of the loading needs to be made along with laboratory tests. The mesh, in particular, warrants close scrutiny. A review of literature on past attempts to utilize fine screening medias in a bacldit mode have shown a high incidence of failures. These failures occur from two (2) primary sources.
- Structural damage from large sharp, heavy debris
- Fatigue failures in the areas of clamping the mesh to the basket frame.
For a screening media to be successful, the following attributes need to be present in the design.
- Mesh to be constantly and evenly tensional so as to maintain as strong a surface area as possible both in ascending and descending sides.
- Tensioning to be accomplished in a manner that will eliminate load concentrations and -flexing-of mesh over a stationary surface, such as clamp bias etc.
- The screening media needs to be supported in the case of substantial differentials.
It has already been determined that the finer screening media radically impacts the fish bucket. The current configuration in use at Indian Point with larger opening media has proven itself to be highly successful in providing a static environmental for captured marine life. In th~ratory, when the finer screening media was supplanted for the larger media, violent vortexin to place. It will be necessary to reconfigure the basket profile to accommodate the finer screening m . This is presently being done in our laboratory. (
-+CD~ ,
It is strongly recommended that non-metallic materials be used on all basket components, where possible, to achieve performance requirements, both operationally and environmentally. Consideration No.2, Spray Wash: - Outside sprays Higher GPM's, lower PSI's Longitudinal sprays Front discharge The basket again is of a main concern in this area~ It is anticipated that the inside spray will be greatly impacted by the finer screening media. Inside sprays will be highly diffuse d and more reliance will be placed on outside sprays. The debris will "mat" on the surface of the finer screening and tend to "sluff ' or slide off the screening surface. A large amount of it will end up in the fish bucket.. The possibility of a front cleaned bucket should be explored, which will necessitate redesign to prevent vortexing. Survival in the debris laden bucket also needs to be exami ned. Front discharge will also require the use of longitudinal sprays to clean the comers of the bucke
- t. This was demonstrated on previous designs. A front clean fish bucket will require a totally new design and will require a complete new work up.
Consideration No.3 , Sealing: Basket to Basket
- The running clearance between baskets is typically approximately 1/8".
The solution to sealing between baskets will vary with the basket design.
- The metallic fish basket currently in use at Indian Point Station will utilize a sealing strip bolted to and extending upward from the face of the upper lip of the basket
. This seal would bridge the gap between adjoining baskets and maintain constant contact with the curved surface of the bucket forming lower lip.
- The 1/4" die formed basket currently in use at Roseton Station will utilize a sealing strip bolted to and extending downward. This seal would bridge the gap betwe en adjoining baskets and maintain constant contact with the curved outer surface of the upper lip.
The angle lip style basket used at Bowline Station will require total replacement. Additional "considerations" may render current basket designs unacce ptable. Initial evaluation indicates the use of the non-metallic basket with front removal marine life capabi lity may be the preferred design.
- Non-metallic baskets will utilize a large "P" seal bolted to the outsid e edge of the flange to which the basket wire is bolted. The "P" seal maintains constant contac t with the curve face of the upper "D" shaped basket lip. Additionally a seal will be bonded to the face of the lower lip extending to a point overlapping and contacting the upper lip.
(4) ~ Consideration No.1 , Increased Velocities: Fatigu e Failures The current material and design of the carrying chain will need to be reviewed. Due to increased speeds and potentially higher loadings, the fatigue factors and cyclic loadings are going to be dramatically increased. The wearability and strengths need to be analyz ed numerically and both in laboratory and field. There is a very good chance new chain will be required to meet the service factors will provide satisfactory life. (5) Shaft ASsemblies Consideration No.1, Increased Velocities: Higher Differentials Fatigue Failures The shaft assemblies will need to be analyzed mathematically to insure that they can handle the higher pull out loads generated by the higher differentials. Fatigue from a higher rate of cycles will also need to be reviewed. Considerations No.4 & 5: - Debris & Marine Life Handling The interface of the basket with all the other components, ie chain, troughs, etc, will require complete review when considering the marine life and debris which will be captured that presently is being entrained. The wmatted wdebris and much younger/more delicate organisms are going to require a much higher level of interface. wror~.s th i~ m ~
~
All the above should be numerically/software modeled than moved to a laboratory environment for demonstration and then full scale field verification. (6) Spray System Consideration No.1, Increased Velocities: Higher Traveling Speeds The higher speeds that may be employed may impact the spray system. This needs to be modeled in a laboratory, then verified in the field. Consideration No.2, Spray Wash: - Outside Spray Higher GPM, Lower PSI Longitudinal Sprays Front Discharge Spray Shower at Water Line The application of the fine screening media will cause a complete redesign of the spray system. The finer screen will greatly diffuse the existing sprays - particularly the high pressures. Experience has shown that a spray wash system utilizing a greater GPM than PSI ratio than currently employed will be more effective. This ratio will need to be estimated, modeled in the laboratory, then verified full scale in the field. It has been demonstrated in some prior work that an outside spray system is more effective than an inside. This is due primarily to the way the debris behaves when it interacts with the mesh and the high rate of diffusion mentioned above. Longitudinal sprays, front discharge and the spray shower at water level are all new and will need laboratory and field work. Consideration No.4 & 5: Debris Behavior Debris Removal Marine Life Behavior Marine Life Discharge Marine Life Transport The anticipated impact from the fine screen media will cause a total re-think on the interaction of debris and marine life. A complete evaluation, design, laboratory and field should be conducted. (1) Housineffrouehine Consideration No.1, Increased Velocities: Potentially Higher Differentials Higher Travel Speeds The possible change in drive assembly may result in the requirement for a new chain guard. Consideration No.2, Spray Wash: - Front Discharge Higher GPM Longitudinal Sprays Any change in the spray configuration will directly impact the design of housings and possibly the troughinglcollection system. Testing of the basket design will ultimately dictate the housing configuration. Consideration No.3, Sealing: Trough to Basket With the added importance of removal of very small organisms, refinement of the basket to trough transfer area will be required. This minimizes reintroduction of screening into the intake channel. Considerations 4 & 5, Debris & Marine Life Handling: Evaluation of the transport capability of the housings and trough will be required once the basket and spray wash configuration has been finaJized. (8) Controls Consideration No.1, Increased Velocities: Higher Differentials Higher Travel Speeds The requirement for increased travel speeds, as discussed in (1) Drives, will necessitate a matching control package capable of reacting to the conditions as shown on the charts. Controls, in response to changes in differential head, will adjust travel speed proportionally. This item can be numerically modelled - no field or lab work. Consideration No.2 , Spray Wash: - .Usage of Higher GPM Higher travel speeds will require evaluation of spray vs travel, this may require the interaction of controls and auxiliary spray system valving. ANALYSIS, TESTING VERIFICATION The general approach to each consideration listed on accompanying matrices and their related anticipated impacts is as headlined above.
- Analysis-numerical, mathematical and historical to identify areas of deficiency, and also prior applications which may resolve the considerations.
- Testing-physical tests - full scale and modeled were necessary in a laboratory environment to demonstrate the problems and remedial actions.
- Verification - this means full scale application of the remedial actions in the field, under real life conditions.
Consideration No.1, Increased Velocities The velocities can be calculated using accepted formulae. The resulting differentials can be calculated the same way. Analysis of the increased loadings from this new differentials, increased cycles can be performed using several accepted processes. Certain critical, easily tested, components need to set up a laboratory.
- Basket/mesh flexures and cycles under load
- Chain component reevaluations
- Hydrodynamics of fish bucket to minimize vortexing intensity Field verification of all anticipated impacts and their remedial actions needs to be performed.
Consideration No.2, Spray Wash Diffusion It will be necessary to gather the data on this consideration empirically. There is some historical information available from prior installations but it will be necessary to perform specific test for each installation. These must be done in conjunction with Consideration No.4 & 5 (debris and marine life).
- Baseline current spray system with fine screening media at velocities and speeds.
- Test various remedial configurations proposed to determine most effective one.
These remedial changes will then need a full scale, real time verification. Consideration No.3, Sealing I This consideration is extremely important to environmental performance. Analysis can help determine materials and applications but laboratory testing is required to demonstrate effectiveness. Observation of the action of the many seals required is only possible in a laboratory enviro nment. The tests should be done in conjunction with Consideration ~o. 4 & 5 (Debris & Marin e Life).
- Baseline current screen sealing effectiveness by debris, marine life runs on model screen.
The need may arise for synthetic replica of the marine life for observation purposes.
- Quantify improvement in performance with debris and marine life runs in model screen.
- Separate testing on mesh-basket frame interface to minimize entrapment and damage to marine life during capture and release.
The remedial changes will have to be verified in a full size, real time field runs. Considerations No. 4 & 5 - Debris and Marine Life Handling The majority of the work will be done in conjunction with No. 2 and 3 above
. The transport systems will need to be evaluated on a full scale basis by field verification.
_1<0 Envlrextx. ~ .... DAti ENGI NEER ING DA TA CHIOCEDI.
.AI.J.~III!.-"
DAti OIIDEA 1<0 III!f9IENCI Of BASKET CONFIGURATION WATER FLOW II SEA L--- o 0 BASKETFRAMES--~ ____~
\ ,I/'
FISH BASKET AT CON. ED & NYPA
HEADSHAFr-ASSEMBLY ITEM 1
~/ , -...-- I ITEM 2 I
( I
'" \ ,I
\
\ \ \
I
,/ \ \ \ I \ \ ,,
DRIVE CHAIN & SPROCKETS
------------------~~~------------~-------
i I I I Ii Ii I i Ii i I FISH BASKET
$[ Itl !lUll IF I Slt- P"';
I ~ I
~ ----------~----------- f'\
01 01 I i~ I li I ...'"
~ '"c: ~
II)
~ ~
tl t:l VI
~
t
~ '? ~
o 0
~: .-... ',-",._ ';'- - ----._ ~~
r - - - - -....- - - - - - . - . -.............----...........- .........- TAKE UP SCRE SUSPENSION SYSTEM TAKE UP CAPSTA SPLASH HOUSING DRIVEN SPROCKET ASKET & CARRIER CHAIN ASSEMBLY FOOT SPROCKET
$Esti mate s/577 00-1
l . II ii i* it I II
-I I.
Ii I! I I. II a" a I
;1 IJ ~
I II
!. J!I I I I
II
.1 e.*
I I
"~ ~ i I I ~
I I
~
I I I J
~ ~
i ~
~ ~ ~ ~
co Q.
.~
I II Cl' II)
< < ~
II
~
I
! f *I .!.i .'!1 !
I 1. II
*
- D
===--- ~--------~"~"""~-
Envi
., ....... x Inc.
SCRUM OEIAIS TROUGH DEM.S lAOUG11
. .~z I!I HIGH W41£1I UnL £lEY. 100.0' a
l:I a
-=-REAR I ...i'" ~ Lilli "'TEII LIYfL a" {LI" ,,0' I!I oJ t--t-ti-I 2' 'Il' IlASll£T CLIAIIAIICI ...I"
- II
":t::w._c~
_ i:. ---1l-~1Ea _ raaT _ ,
, r ll1ACK CUlT_II '0 _IDE 4 LIWlL lAIr IIIfICH , .. IllYllln SCMIII WILL IIlST, CUlT_" '0 _IDE PIIOTleTlWl MILUDE ., _ ' " rI III' AlII, I' 1IllUIIIl0, January 14. 1993 11:]2:]8 AN
II HI u.
~!f I . ..
II j
*= ** !l i
I I
~
c..
~ ~
c:: Ja a
~
I-
- f-f-
*i f-. !fer ~
I
-- ~
i J - f-! IS! II I I I I I I I I I I I l-I c II c L I: i-II II II I II i.i, I-
-- ~!
I Env
.. ,_lrex........
___** Ut .. Inc. COIIT.IIU IIIIIT SICMII(!)
<:- I 10_
i' lOTION lOOT PUlE aulDl January 18. 1993 01:56:4 4 p",
. I
- D D .. ...
11 I! I .. Ii I -
~ C~L)b
- L
\II ~ ~." ~ ~ ~.." '"-""T'rl
- I II f")
~'"-""" ~ ~ ~1:./.-a-....:1"IT1 ~ ~ ~ ,('
I I I
- ~ 'i a..-_ _ -.
.~
I.JI il
\Q 1:1 ~
I I i l:
Date: Janua ry 13, 1993
SUMMARY
MATRIX FOR OPERATIONAUENVIRONMENTAL IMPACT ON SCREENS Y = Yes Screen Module Impacted Y = Yes N =No E = Evaluate N = New N = No M = Modify X = Okay Consideration Anticipated Impact 1
'3 5
e - -- -- -j - - ~ g-- N u ('f")
~ ~ ~
(Q'
..c::
00
- s 00 en en
~ ~
0C=
~
u i c:: u
£I ~ .~ ~ .~ ~ u en ~Ul i *a ~
- s 8 U
~ ~ ~
0..,.
~
1 Increased velocities
- Potentially higher Y Y N E N E X X X X Y Y from reduced open differentials area of screening media
- Basket vortexing N Y X X N X X X X X Y Y
- Higher traveling Y N N E E E E E X N N Y speeds to maintain differential control
- Fatigue failures of Y N E E E E E E E E Y Y components due to accelerated cycles
- Structural integrity of Y N X X N X X X X X Y Y finer mesh
Date: Janua ry 13, 1993
SUMMARY
MATRIX FOR OPERATIONAUENVIRONMENTAL IMPACT ON SCREENS Y = Yes Screen Module Impacted Y = Yes N =No E = Evaluate N = New N = No M = Modify X = Okay Consideration Anticipated Impact c; a 1! 5
-- -- -- - t - ~ --- ~ ~ .c:
til)
- 1 co C
.E:
G' ~ N M ec: ~ fIJ u ~ J ~ I :i a
.~. &i Q ~ .t: .~ .c:
u fIJ CIl
~ .5 ~ ~
fIJ
- 1 E
U
~ ~ "0 ...."i) ~
2 Spray wash design
- Usage of outside Y Y X X E X X M E E Y Y review due to fme spray in lieu of mesh diffusing inside spray inside spray
- Usage of higher Y Y X X E X X M E E Y Y GPM's and relatively lower PSI's
- Usage longitudinal Y Y X X E X X M E E Y Y sprays
- Usage of front N Y X E N X N M E E Y Y discharge of marine life
- Spray shower at or N Y X E X X X M X E Y Y near water level to flush mesh I
Date: January 13, 1993
SUMMARY
MATRIX FOR OPERATIONAUENVIRONMENTAL IMPACT ON SCREENS Y = Yes Screen Module Impacted Y = Yes ! N = No E = Evaluate N = New N = No M = Modify X = Okay Consideration Anticipated Impact VI
....... ~ -j 1 ..c:::: ~ '3 5
e - u N
~
B
~ ~ (0' co =' ~ -g 00 rn ~ 'r::: ~ ~
J ! i ~ c:::
.~ ~ Ji u < *a 6 ~ :9 ~ ....u U
Ifi ~ en ~ 3 Sealing between
- Seal frame to basket Y Y X M E E X X X X Y Y member/components inclusive of boot to provide consistent 2mm-6mm opening
- Seal basket to basket Y Y X M E E X X X X Y Y
- Seal structure to Y Y X M E E X X X X Y Y intake (Flow orientation basket)
- Seal between Basket Y Y X M E X X X E X Y Y to trough
Date: Janua ry 13, 1993
SUMMARY
MATR IX FOR OPER ATIO NAUE NVIR ONM ENTA L IMPA CT ON SCRE ENS Y = Yes Screen Module Impacted Y = Yes N = No E = Evaluate N = New N = No M = Modify X = Okay Consideration Anticipated Impact
~ ~ 'S 5
e -- - --
.-4 C!.. ~
1 (0'
~ ~ .c=
bO
- s 00 rn
~ ~ ~ *5 J
u
.~ &i .~
Q Jj ~ <
~ ~ .5rn
- s 0
II:
'0 b
U S ~
~ ~
ii: CIl 4 Debris handling and
- Debris behaviorl Y N X X E X X E X X Y Y transport interactive with screening surface (orientation) mattingus stapling
- Debris removal from Y Y X X E X X E E X Y Y screening surface
- Large debris impact Y Y X X E X X X X X Y Y in fine mesh
- Debris transport Y N X X E X X X E X Y Y j
Date: Janua ry 13, 1993
SUMMARY
MATR IX FOR OPERATIONAUENVIR.ONMENTA L IMPA CT ON SCREENS y = Yes Screen Module Impacted Y = Yes N = No E = Evaluate N = New N = No M = Modify X = Okay Consideration Anticipated Impact
'b" ~ - - ~ .d "3
til)
'0 ~ =:s N' (;) 00 ~ -e ~ g
('f')
~ '.:::2 = U (II ~ ~
j 0 U
.~ ~ ~ .5 ! .~ &l .~
Q I <
~
{Il
~ ~
(II
=:s g
U
~ E-t ....:9u ~
5 Marine life handling
- Behavior/interaction N Y X X E X X E X X Y Y and transport with screening surface
- Removal from surface N Y X X E X X E X X Y Y surface
- Marine life transport N Y X X E X X E E X Y Y I
PIIOI'06AL ~o Envlrexhc. ENGI NEER ING DA TA OAOER NO
~AGE Of . AI. RIGHTlI A£SlIM" "'~NlEDIY DATt CHOCED By I DATt AEFlIIEHa I
MAIN FRAME TO INTAK E SEAL INTAKE STRUCTURE
~MAIN FRAME " Z II SECT ION .&..-W ALL MOUNTED FLOW DEFLECTOR 9820 . EN REV 4/89
Envirex-. ENGINEERING DA TA ORDER NO
~Tt ~D" BASKET LIP CONFIGURATION &-"--SEAL WATER FLOW WIRE MESH LOWER LIP
Enviraxk. ENGI NEER ING DATA _NO
~II ~Tl BOOT SECT ION SEAL WATER FLOW W
cC a: I-F~AME SEAL
- I CON ED IP NO.2 NYPA IP NO.3 40.00 35.00 30.00 /
25.00 / 5 en w 20.00
/ ~
15.00 I 10.00 / 5~00
~
0.00 100 50 75 25
% CLEAN X 4Y \o.btb vt"oClecif 1-;7 Water-~~J 1 [ -II=- 2MM FINE MESH ,I
CON ED IP NO .2 NYPA IP NO .3 25.00~1------------------------- 20.00 I 7I 15.00 I 7/ en w J: o ~ 10.00 I 7I 5.00 I : : : ; :-""" .. 0.00 I I 100 50 75 25
% CLEAN 27' Water Depth r=-- 1/4"X1/2'SMoorH rEX J
CENTRAL HUDSON ROSETON STATION 14.00 12.00 ~ 10.00 I
/
I I I o 8.00
/
- I w
J: I o Z 6.00 4.00 / 2.00
~ ~
0.00 100 50 75 20
% CLEAN ~------~ I+~DIA. MESH 26' WATER DEP~
CENTRAL HUDSON ROSETON STATION 25.00~1----------------------------------------------------------------~ 20.00 I j f 15.00 I / en w J: o ~ 10.00 I j I 5.00 I ::::>"~ o.ooll'_ _ _~_------r-------r:-------'-------...I 100 50 75 25
% CLEAN 126' WATER DEPTH-J
[ - - 2MM MESH I
ORANGE & ROCKLAND BOWLINE STATI'ON 10.00 9.00 8.00 /
/
I 7.00 '1
- 6.00 / I f/)
w J: 0 5.00 / I I z 4.00 / 3.00 / 2.00 ~ 1.00 0.00 100 50 75 25
% CLEAN r--- ~---~
28.5' WATER DEPTH
-, I ._-3/8" sa. --- I ~ .~-
MES~
ORGANGE & ROCKLAND BOWLINE STATION 18.00 16.00 14.00 / 12.00 / CIJ 10.00 /
/
w J: o ~ 8.00 6.00 / 4.00 ~ 2.00 0.00
.-... ---- -------- I i
100 50 75 25
% CLEAN
[28.5' WATER DEPTH I r-- 2MM MESH I *1
ORANGE & ROCKLAND BOWLINE STATION 5.00 4.50
/
4.00 / _ 3.50 /
/
C/J -~ 3.00 Q. u.. o g w > 2.50 / 2.00 ~
~
1.50 1.00 100 50 75 25
% CLEAN
[2i5:wATER DEPTH] l+3/8" SQ. MESH
ORANGE & ROCKLAND BOWLINE STATION 5.00 4.50 / 4.00 / en 3.50
/
a.. -~ 3.00 u.. o
/
9w
> 2.50 /
2.00 ~
~
1.50 1.00 0.42 2.292 0.905 9.78 INCHES '\ l28.5' WATER DEPTHj 1--- 3/8" sa. MESH
ORANGE & ROCKLAND BOWLINE STATION 7.00.....-,--------------------------, 6.00 I / 5.00 I 7' Ci) -~ a. u. o 4.0011------------------7/ L -------i 9w 3.00 I :::;;>' <" 2.00 I ::;:::::oao"""""=-: 1.00~------~--------------~----------------~--------------~------~ 0.9 4.206 1.757 17.43 INCHES 128.5 : WATER DEPTH J 1---=-2MM MESH I .
ORANGE & ROCKLAND BOWLINE STATION 7.00~i------------------------------------------------------------~--~ 6.00 I j £ 5.0011----------------------:;j/-£~-----J c;; Q. u.. i; 4.0011--------------~17,~------l o 9 ~ 3.00 I :;;;>>"" 2 .00 I. :::::;:::;::>"" ............--- 1.00-'-----=-r:~------+--------r----------,r------.J 100 50 75 25
% CLEAN ---------1 r28.5' WATER DEPTH [ --- 2MM MESH]
CENTRAL HUDSON ROSETON STATION 6.00 5.50 ,. 5.00 / 4.50 / en a.. 4.00 -~ 3.50
/ /
LL o 9 ~ 3.00 / 2.50 /
~
I 2.00 1.50 I I 1.00 I 0.54 2.96 1.164 12.62 4 INCHES I . -- -~---l 26' WATER DEPTH [ - - . 3/8" DIA. MESH
CENTRAL HUDSON ROSETON STATION 6.00 5.50 ~ 5.00 / 4.50 /
/
I en 4.00 -~ Q, LL o 3.50
/ I I /
I 9
~ 3.00 I 2.50 / ~
2.00 1.50 .- ---------- 1.00 100 50 75 25
% CLEAN ,------PTHl ~. WATER DEPTH [ - - 3/8" DIA. MESH
CENTRAL HUDSON ROSETON STATION 8.00 1 7.00 I l' 6.00 I ,. en e:
~
0 5.00 9w 4.00 3.00 2.00 I - - = - - = - - = 1.00~------~--------------~
--------------~------------ --~------~
100 50 75 25
% CLEAN I-------~
26' WATER DEPT~ l+2M MM ESH I .
0 C\l It)
.('1) J:
o* zo Z UJ W o..Z w
- e w
z -a. ....J U u: 0- :e we:( '#. :e zo.. C\I 0>- U Z It)
...... +
o o s:: a
~
0 0 0 0 0 0 0 0 0 ~ CD 0 0 0 0 0 0 0 0 0 0 oi co ,...: 1ii
..... cO &ri ~ cw) C\I ~
f... (Sd:l) AJJOO13A N
CON ED IP NO.2 NYPA IP NO.3 8.00-r,----------------------------------. 7.00 I 7I 6.00 I ~ ten 5.00 I /
~ 7 o
~~ool > ;,,]I 3.00 I :...,7" .c= 2.00 I ......... -====-===== I I 1.00 100 50 75 25
% CLEAN 27' Water Depth 1-- 1/4"X1/2"SMOOTH TEX
CON ED IP NO .2 NYPA IP NO .3 -
- 8. 00 -r ,- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -. .
7.001 7/ 6.00 I 7 / e:en_ 5.00 I 7 /
~
o 9>w 4.00 I /
~
3.00 I :7' L' 2.00 1 ..,... -===== 1.00~1--------~--------------~--------------
---r---------------~~----~
1.07 5.22 2.184 22.65 6 INCHES l~1 W~t;Oe-;;;J [+ 1/4"X1/2"SMOOTH TEX
CON ED IP NO.2 NYPA IP NO.3 10.00~1---------------------------------------------------------------' 9.00 I f 8.00 I f UJ 7.00 -~ Q. 6.00 0 9w
> 5.00 4.00 I 7" /'
3.00 I ::::::::::::-=- ~ I I 2.00 1.99 9.079 3.828 37.416 INCHES 27' Water Depth [___ 2MM-FiNE MESH I.
I ~~ Cb(~((j,f Iteo I BVALUATIOB O. BNTRAIKMENT ABUNDABCB SAMPLING DESIGNS I Prepared by George Casella, Ph.D. Douglas S. Robson, Ph.D. I Steven J. Schwager, Ph.D. William D. Youngs, Ph.D. I Ithaca, New York 14850 July 1986 I I ADDDJDOK I to BVALUATIOB O. BBTRAIHKBNT ABUBDANCB SAMPLING, DESIGNS I prepared by I steven J. schwager, Ph.D. George Casella, Ph.D. Douglas S. Robson, Ph.D. William D. Youngs, Ph.D. I Ithaca, New York 14850 I February 1990 I Prepared Under contract with I Consolidated Edison Company of New York, Inc. 4 Irving Place I New York, New York 10003 I Jointly Funded By Central Hudson Gas and Electric Corporation I Consolidated Edison company of New York, Inc. New York Power Authority Niagara Mohawk Power corporation I Orange and Rockland Utilities, Inc. I
. -=- .-2. '. '1::
i'
,,' -;'1 -~ - I~
- ~. i~ !
'~ ";t: i~ ~! .t* ,. ":"-'<i.
1-, '. ~~.1. fT"
, 'r;, '1""'" ,- } ,~ , '~u'- ~ ir '=<:~ ~- ..' 'l\I .1' ~:' ~-- - ': ~Ij I~
1 - '~-r:' -~~ ~ ,-
.~ ~<:--- * ,,> ~ 7.,,:,;: .... *.L~_>. -. ',1\~." '-11 . "'. '.-, '~I-" >1; ~' ~ ~ . ;~.~_ -". , ~.-
tb
..,. _--'J ,- ~ -='."
11-i :i
.": '~ ~I .l" -. -,'~ 1,1 ~ :; -- _-..,. ~ _ t '1":-: -- - ""':~ - .* ~~; ... ~--<- ~j~ ~ -.
1,
.~ tl -I ,1..-:.:=_ .,:.~ ;;
i..~ * --i ""',-
;L "tf' =. 'r.,;..i _. _~ _::> -~" t:" .,,': r,l - .' "J'I
_*~i;' Ie ' ** &1:
" , .~~: 'I ,i.
tit,
!'1:.:;< ~~ ;. ':1 ii' ;1- -,- -,~ ?"';: ':~L ~ , ;_ J~r- ~ ;,1: *,il' , ~ -. . J. : I~' "l ~ ':--;0 ; - .f .~ ~ , "U'-"
- 1: .-.
~ .-::' ' .. - t:.- ",I
- 1--
,I jL:"' -; 1 , r. '1
- i'
'.-;;~.~" -':~*I' il- ~=:;;---" .~ ,
- -'-~.
- "
. 'ten:, ., "'~I '14 .,.-.~- **11;
I I BVALUATIOB O~ BNTRAINMENT ABUNDANCB SAMPLING DESIGNS I Prepared by George Casella, Ph.D. I Douglas S. Robson, Ph.D. Steven J. Schwager, Ph.D. William D. Youngs, Ph.D. I Ithaca, New York 14850 July 1986 I I ADDBBDUH I EVALUATIOB O~ to BNTRAINMENT ABUNDANCB SAMPLING DESIGNS I Prepared by I Steven J. Schwager, Ph.D. George Casella, Ph.D. Douglas S. Robson, Ph.D. William D. Youngs, Ph.D. I Ithaca, New York 14850 I February 1990 I Prepared Under Contract with I Consolidated Edison Company of New York, Inc. 4 Irving Place I New York, New York 10003 I Jointly Funded By Central Hudson Gas and Electric Corporation I Consolidated Edison Company of New York, Inc. New York Power Authority Niagara Mohawk Power Corporation I Orange and Rockland Utilities, Inc. I
I I I I I I I I I I I I I I I I I I I
---- - - ---- - -- ----~--
I I I EVALUATION O~ BBTRAIHKBHT ABUNDAHCB SAMPLING DBSIGNS I I I Prepared Under contract with I Consolidated Edison Company of New York, Inc. 4 Irving Place New York, New York 10003 I Jointly FUnded By . I Central Hudson Gas and Electric Corporation Consolidated Edison Company of New York, Inc. I New York Power Authority Niagara Mohawk Power Corporation Orange and Rockland utilities, Inc. I I I Prepared by I George Casella, Ph.D. Douglas s. Robson, Ph.D. steven J. schwager, Ph.D. I William D. Youngs, Ph.D. I Ithaca, New York 14850 I July 1986 I I
I I CONTENTS Page I 1. Study Overview and Background. 1.1 1.1 Study Overview *** 1.1 I 1.2 Plant Description ** 1.3 I 1.2.1 Bowline Point ** ...... 1.3 1.2.2 Indian Point
- 1.6 I 1.2.3 Roseton .*** 1.9 1.3 Field Collection Procedures. 1.11 I 2. Summary and Conclusions ****** 2.1 I 3. Sampling Designs Under Consideration 3.1 3.0 Introduction * * * * * * * *
- 3.1 I 3.1 Two-Stage Sampling Designs ** 3.3 I 3.2 *Three-Stage Sampling *****
3.3 Assumptions Required for Analysis *** 3.9 3.13 I Assumption 3.1. ...... *3.13 I Assumption 3.1' Assumption 3.2. 3.15 3.16 I Assumption 3.3 ** a * .. 3.16 Assumption 3.4. 3.17 I 4. Assessment of Distributional Information
- 4.1 I 4.0 Introduction ***
4.1 The Poisson Model 4.1 4.1 I 4.1.1 Derivation of the Poisson Test Statistic. 4.3 4.2 The Negative Binomial Model. * * . * * * . *
- 4.10 I 4.2.1 The Rationale for the Negative Binomial Model. 4.10 I 4.2.2 Testing for the Negative Binomial Model. . * . 4.18 I
I CONTENTS/2 I 4.3 The Indian Point Replicate Sampling. 4.27 4.4 Sample Duration . . 4.32 I 4.5 An Alternate Model for the Estimation Process . . 4.45 4.5.1 The Classification Model . . 4.46 I 4.5.2 Interpreting the Parameters . 4.48 I 4.5.3 Evaluating the Classification Model 4.53 4.5.4 The Parameters K and c2 . . . 4.59 I 4.6 Model Choice for Entrainment of Juveniles . . 4.60 4.7 Modeling Entrainment Abundance I Using Plant Operating Factors 4.62
- 5. Estimation of Entrainment Abundance I and Entrainment Reductiqn 5.1 5.0 Introduction . . . . 5.1 I 5.1 Performance Criteria Considered. 5.1 5.2 Performance Criteria Selected. 5.5 I
5.3 Some Aspects of the Estimation Problem 5.6 I 5.4 Confidence Intervals . . . 5.7 5.4.1 Confidence Interval 1 . 5.9 I 5.4.2 Confidence Interval 2 . 5.10 5.5 Evaluation of Confidence Intervals 5.11 I 5.6 Confidence Interval Length as a Function of v. 5.29 I 5.7 The Effect of Sample Volume on the Precision of Estimation 5.8 The Estimation of Entrainment Reduction. 5.30 5.32 I 5.9 The Case of Unequal Sampling Fractions v./V. 1 1 5.36 I
- 6. A Simulation Study of Entrainment . . 6.1 6.0 Intj'oduction . . 6.1 I 6.1 Structure of the Simulation Study of Daily Entrainment. 6.1 I I
I CONTENTSj3 I 6.2 Formulas for the Simulation I Study of Daily Entrainment.
************** 6.6 6.3 Results of the Simulation I Study of Daily Entrainment. ************** 6.58 6.4 Simulation Results 2 I
Concerning Estimation of c ************** 6.60 6.5 Structure of the Simulation Study of Seasonal Entrainment *
***** 6.146 I 6.6 Conclusions from the Simulation Study * ****** 6.150
- 7. Sum-Quota Sampling in Entrainment Estimation
- 7.1 I 7.0 Introduction. .......************** 7.1 I 7.1 Sample Splitting and Sum-Quota Sampling:
Estimation for a Single Sample.
** *** **** 7.3 I 7.2 Variance and Bias in Subsampling Estimation **** 7.9 7.2.1 Outline of Sum-Quota Estimation Simulation Study.
I **** **** 7.2.1.1 Poisson(A) distribution . 7.9
**** 7.9 I 7.2.1.2 Poisson sample. * * * ** 7. 10 I
7.2.1.3 Finite population sampling.
**** 7. 10 7.2.1.4 Replication. ****** 7. 11 7.2.2 Results of the Simulation Study.
I ****** 7. 12 7.3 Variance of the Estimate of Entrainment Abundance
- 7. 17 I 7.4 The Estimation of Entrainment Reduction Under Sum-Quota Sampling. 7. 21 I 7.5 Optimization of Sum-Quota Sampling.
***** 7. 21
- 8. Entrainment Estimation Under Noncontinuous Sampling Plans
- I 8.0 Introduction. ..
8.1
**** 8.1 I
I I
I CONTENTS/4 I 8.1 Preliminary Considerations and an Example. 8.1 Assumption 8.1 . . .
- 8.2 I 8.2 Entrainment Estimation Under Noncontinuous Sampling. 8.5 I Assumption 8.2 * . . * . 8.7 8.3 Noncontinuous Sampling at Roseton. 8.14 I
- 9. Sampling Design Recommendations for Future Entrainment Abundance Estimation * . 9.1 I 9.0 Introduction .
9.1 Conclusions on Sampling Designs. 9.1 9.1 I 9.2 Recommendations for Entrainment Abundance Sampling Designs **. . . . . . . . . 0 * * *
- 9.5 I
9.3 Recommendations for Additional Studies Related to Entrainment Estimation. 9.6 I 9.3.1 A Study of Replicate Sampling . . 9.3.2 A Study of Entrainment Abundance 9.6 I and Plant Operating Factors . 9.7 9.3.3 Use of Historical Entrainment Estimates . 9.8 I 9.3.4 A Study of Plant Flow Volume &Sample Volumes 9.9 I References Appendix I Derivation of a Theoretical Model for Poisson I Variation in the Discharge Channel Appendix II Derivation of a Theoretical Model for Negative Binomial Model Variation in the Discharge Channel I Appendix III Derivation of the Beta-Binomial Distribution I Appendix IV The Limit of Negative Binomial Sums Appendix V Calculation of Bias in Subsampling I Appendix VI Testing for a Quadratic Relationship Between Variance and Mean Using Paired Observations I I I
I I CONTENTS/5 I Appendix VII A Time Series Approach to Estimating Entrainment Abundance I Appendix VIII Computer Code (In Basic) for Simulation Study of Chapter 6: One Day (8 p.s.u.'s), Diel Variation I of Density Appendix IX Instructions for Implementing Procedures Given in Evaluation of Entrainment Abundance Sampling Designs I I I I I I I I I I I I I I
I LIST OF FIGURES Number Title I 1.1 Location of the Bowline Point, Indian Point, and Roseton Generating Stations on the Hudson River. (Reproduced from EA, 1984a) . . . . .
- 1.4 I
1.2 Diagram of Bowline Point plant site. (Reproduced from EA, 1984a) * . * . . . 1.5 I 1.3 Diagram of the Indian Point Generating Station circulating water system. (Reproduced from EA, 1984b) **. 1.7 I 1.4 General Overview of the Roseton Generating Station. (Reproduced from EA, 1984c) 1.10 I 1.5 Schematic diagram of sampling components of EA's automated abundance sampling system - I AUTOSAM (U.S. Patent No.4, 145, 925) . * . 1.12 1.6 Schematic diagram of entrainment abundance collection gear, Roseton, 1983 . . . . . . . 1.14 I 1.7 Generalized view of manual pump/net abundance and AUTOSAM collection systems used at D2 at the I Indian Point Generating Station in 1983 . . . . . 1.16 1.8 Schematic diagram of net/barrel collection I system used during entrainment abundance study, 198~~ . . . . . . 0 * * * * * * * * * * * *
- 1.17 I
1.9 Net sampling apparatus at discharge canal 4.1 Station D1, Indian Point Generating Station. Plots of the observed data, Bowline Point 1983 1.19 I Table B2, together with plots of the bootstrap samples . . 0 * * *
- 0 * * * * * * * * * * *
- 4.22 I 4.2 Histograms of the first observation from each pair in the Bowline Point 1983 Table B2 sample, and the first observation from each pair in three bootstrap m samples. . . . . . . . . . . . . ..... 4.24 4.3 Plots of observed data, Indian Point 1983 pumps vs. Autosam . . . . 4.33 I
4.4 Plots of observed lumped data, Indian Point 1983 pumps vs. Autosam. . . . * ..*.*... 4.55 I 4.5 Plots of observed lumped data, Roseton 1982 . . . 4.57 I 7.1 Operating characteristics of sum-quota sampling in sorting to estimate the proportion(s) of organisms of a given type 7.23 I I
I LIST OF TABLES I Number Title I 4.1 Summary of x2 Tests to Evaluate the Poisson and Negative Binomial Models ** *...... 4.4 Contributions to Model x21 s From Each I 4.2 Sample Bowline Point 1983 Table B2 Data 4.6 4.3 Bowline Point 1983, Table B2 Data and I Bootstrap Samples * * * * . * * * * * * *...... 4.26 4.4 Indian Point 1983 x2 Tests to Evaluate I the Poisson and Negative Binomial Models: Pump 1 vs. Pump 2 * * * * . . . *...... 4.28 I 4.5 Indian Point 1983 x2 Tests to Evaluate the Poisson and Negative Binomial Models: Pump 1 vs. AUTOSAM ******* *...... 4.29 I 4.6 Indian Point 1983 x2 Tests to Evaluate the Poisson and Negative Binomial Models: Pump 2 vs. AUTOSAM * * * * . * * *...... I 4.7 Estimated K+ Values From Indian Point 4.30 1983 Paired Pump Data and 10 Bootstrap I Samples * * . * * . * * * * . * . * * .
"+ *..... . 4.37 4.8 Estimated K Values From Roseton 1982 I 4.9 Data and 10 Bootstrap Samples * * * * * * * . . 2 Indian Point 1983 and Roseton 1982 x Tests 4.39 to Evaluate the Poisson and Negative Binomial I Mode 1s* * * * * * * * * * * * * * * * *
- 4.54 I 4. 10 Summary of Data Collected to Evaluate the Assumption of Random Sampling at the Bowline Point Plant, 1983 (Reproduced from EA, 1984a: Table B-2) * * * * * * * . * . . . 4.61 I 5.1 Simulation Study of Confidence Intervals N
I for L ~i: Observed Coverage Probabilities i=1 and Average Lengths When ~i's are Uniform (1,10). 5.13 I 5.2 Simulation Study of Confidence Intervals N I . for L ~i: Observed Coverage Probabilities i=1 and Average Lengths When ~i's are Uniform (1,100) 5.14 I I
I TABLES/2 I Number Title 5.3 Simulation Study of Confidence Intervals I N for I
,=. 1 ~.:
Observed Coverage Probabilities I and Average Lengths When ~.'s are Uniform (1,200) 5.15 5.4 Estimate of Total Entrainment of Striped Bass 5.22 I Yolk-Sac Larvae From Bowline Point 1983 Data. 5.5 Estimate of Total Entrainment of Striped Bass Juveniles From Bowline Point 1983 Data . . . . 5.23 I 5.6 Estimate of Total Entrainment of Striped Bass Yolk-Sac Larvae and Juveniles, Bowline Point I 1983: Coefficient of Variation for Selected 5.7 Sampling Ratios . * . * * . . . . . . . . . . Estimate of Total Entrainment of Striped Bass 5.24 I Yolk-Sac Larvae From Indian Point 1983 Data. 5.25 5.8 Estimate of Total Entrainment of Striped Bass I Juveniles From Indian Point 1983 Data . . . . 5.26 5.9 Estimate of Total Entrainment of Striped Bass I Yolk-Sac Larvae and Juveniles, Indian Point 1983: Coefficient of Variation for Selected Sampling Ra1~ios ****.********.* a 0 * *
- 5.27 I 6.1 Simulation of Sampling for One Day with Uni-form Variation of Density About Level of I 10 Organisms/1000 m3. . . . * * . . . . . 6.7 6.2 Simulation of Sampling for One Day with Uniform Variation of Density About Level I
of 50 Organisms/1000 m3 . . . . . . . . . 6.3 Simulation of Sampling for One Day with 6.13 I Uniform Variation of Density About Level of 100 Organisms/lOOO m3. . * . . . . . . 6.19 I 6.4 Simulation of Sampling for One Day with Uniform Variation of Density About Level I of 250 Organisms/1000 m3. . . * . . . . . 6.25 6.5 Simulation of Sampling for One Day with Diel Variation of Density About Level I of 10 Organi sms/ 1000 m3 . . . . . . . . I I
I I TABLES/3 Number Title I 6.6 Simulation of Sampling for One Day with Diel Variation of Density About Level I of 50 Organisms/lOOO m3 * * * . * . * . 6.37 6.7 Simulation of Sampling for One Day with I Diel Variation of Density About Level of 100 Organisms/lOOO m3* * * * * . * * . . . . . . . " 6.43 I 6.8 Simulation of Sampling for One Day with Diel Variation of Density About Level of 250 Organisms/lOOO m3* * * * * * * . I 6.49 6.9 Simulation of Sampling for One Week with Uniform Variation of Density About Level of 10 Organisms/ I 1000 m3 * * * * * * * * * * * . * * * * . * * * " 6.62 I 6.10 Simulation of Sampling for One Week with Uniform Variation of Density About Level of 50 Organisms/ 1000 m3 * * * * * * * * * * * . * . * * * * . * . 6.68 -I 6.11 Simulation of Sampling for One Week with Uniform Variation of Density About Level of 100 Organisms/ I 1000 m3 * * * * * * * * * * * * * * * * * * * * * *
- 6.74 6.12 Simulation of Sampling for One Week with Uniform I Variation of Density About Level of 250 Organisms/
3 1000 m * * . . * * * * . * * * * . . . . * * . * *
- 6.80 I 6.13 Simulation of Sampling for Five Days with Uniform Variation of Density About Level of 10 Organisms/IOOO m3.......... 6.86 I 6.14 Simulation of Sampling for Five Days with Uniform Variation of Density About Level I of 50 Organisms/IOOO m~ * * * * * * * * * ...... 6.92 6.15 Simulation of Sampling for Five Days with Uniform Variation of Density About Level I of 100 Organisms/IOOO m3 * * * * * * *** ...... 6.98 6.16 Simulation of Sampling for Five Days with I Uniform Variation of Density About Level of 250 Organisms/lOOO m3 * * * * * * * *
- 6.104 I
I
I TABLES/4 Number Title I 6.17 Simulation of Sampling for Three Days with Uniform Variation of Density About Level I of 10 Organisms/1000 m3. * . . . . . . . . 6.110 6.18 Simulation of Sampling for Three Days with Uniform Variation of Density About Level of 50 Organisms/1000 m3 . . . . . . . . . . 6.19 Simulation of Sampling for Three Days with 6.116 I Uniform Variation of Density About Level of 100 Organisms/1000 m3 . . . . . . . . . 6.122 I 6.20 Simulation of Sampling for Three Days with Uniform Variation of Density About Level of 250 Organi sms/lOOO m3 . . . 0 0 0 *
- 6.128 o 6.21 6.22 Simulation of 18-Day Entrainment Season . .
Simulation of 34-Day Entrainment Season . . 6.134 6.140 I 7.1 Example of Output From Simulation Study . . 7.13 I 8.1 Variance of t(r,g) when m = 80 . 8.4 I 0
- 0
- 8.2 Estimates of Daily Entrainment of Striped Bass Yolk-Sac Larvae and Juveniles, Roseton 1983 . . 8.10 8.3 Estimates of Total Entrainment of Striped Bass Yolk-Sac Larvae and Juveniles, Roseton 1983:
I Variance and Standard Deviation for Selected Sampling Ratios . . . . 0 * * * * * * * * * *
- 8.16 I I
I I I I I I
- T---
I I I 1. STUDY OVERVIEW AND BACKGROUND I 1.1 STUDY OVERVIEW I The Indian Point, Roseton, and Bowline Point Generating Stations employ once-through cooling systems to dissipate waste heat resulting from the process of electric generation. Cooling water from the Hudson River is I pumped through condensers where heat is transferred to the water from the plant's exhaust-ste-am'. The warmed circulating water is then returned to the river. This cooling water may contain aquatic organisms small enough to pass through the intake screens. Eggs, larvae, and juveniles I of various taxa are entrained, resulting in some mortality. Concern for the young stages of fish species of recreational and commercial importance has resulted in regulatory requirements. A schedule of unit outages and I flow reductions to lessen the impact of entrainment is directed by the Settlement Agreement of December, 1980 among the Hudson River Utilities, the Environmental Protection Agency, the New York State Department of I Environmental Conservation, and other parties. The purpose of this study is to determine how efficient various sampling deSigns are in evaluating the effectiveness of plant outages in reducing I entrainment. EXisting entrainment data from the Indian Poi~t (Units 2 and 3), Bowline Point, and Roseton Generating Stations are analyzed to accomplish this task. The entrainment sampling programs seek to estimate I as accurately as possible the numbers of ichthyoplankton entrained under various plant operating conditions, including those low and no-flow periods that qualify for outage credit. Total ichthyoplankton counts and I counts of several specified taxa are analyzed, with primary emphasis on striped bass, bay anchovy, white perch, and American shad. Estimates of the number of organisms entrained and the reduction in the I number entrained due to outages are developed. The reduction in entrain-ment attributable to the outages is the difference between the number of ichthyoplankton entrained during the outage periods and the number that I would have been entrained during these periods under more normal opera-ting conditions. Thus estimating these entrainment abundance levels pro-vides a method of estimating the reduction in entrainment. I A common approach to sampling deSign, laboratory procedures, and data analysis is employed for all three generating stations as far as possible, in order to facilitate drawing overall conclusions. The work conducted in I the course of this study can be divided into the following broad tasks. The first task is to specify the class of.sampling deSigns under con-I sideration and their attributes. This class of designs will include those deSigns presently in use. Sampling models to be compared include various two-stage and three-stage sampling designs. In two-stage sam-I pling, the total volume of water passing through a plant is divided I
I 1.2 into primary sampling units (p.s.u.ls) and samples of these p.s.u. IS are drawn for laboratory analysis. In three-stage sampling, the samples I are divided further by drawing splits for analysis. In either two-stage or three-stage sampling, the p.s.u. IS selected for sampling constitute a sample from the overall population of interest, the total volume of I water passing through the plant; all estimates and other inferences will be directed toward this overall population. I The accuracy of estimates of the ichthyoplankton counts in the total volume of water (not just the sample volume drawn for analysis) will be affected greatly by both the type of sampling (two-stage or three-stage) and the design parameters. Two designs of particular interest are the I sampling procedures used at the plants during recent years. In the first design, a p.s.u. corresponds to a three-hour period, and sampling is continuous, i.e., sampling occurs over 8 p.s.u. IS per day, 7 days I per week. In the second design, a p.s.u. corresponds to a one-hour period, and a full day of 24 p.s.u.ls is sampled on days selected to focus on entrainment abundance sampling during the outage period. I The second task is to specify formal criteria by which the performance of sampling designs will be measured. The main objective of the study is estimation of the entrainment abundance level and the effect of I outages in reducing entrainment. Thus, standard statistical properties of estimators can be used in making comparisons among the estimates resulting from different sampling designs. A desirable sampling design I will be associated with a point estimate of entrainment reduction characterized by little or no bias and low variance compared to other estimates, under widely varying conditions. The ability to deal with I relatively rapid changes in the level of entrainment over a short period of time is an important feature of a good sampling design. Because the behavior of sampling designs must be investigated under a broad range of underlying river conditions, the choice of criteria must reflect I this range. The third task is to assess the distributional information available I from the data collected at the three generating stations over the last few years. This information provides guidance concerning the type of conditions likely to be observed in future years. One aspect of this I task is the development of a model relating the entrainment of early life stages to plant operating factors, especially decreased flow and outages. This leads to a unified approach to the estimation of entrain-ment abundance under full flow and under decreased flow, all other I factors being equal. The difference between these estimates of normal flow entrainment and decreased flow entrainment provides an estimate of the reduction in entrainment due to decreased flow. I Another aspect of this task is the determination of whether distribu-tional assumptions that have been proposed in previous entrainment studies I are consistent with recent data from the three generating stations. One commonly proposed assumption is that sample or split ichthyoplankton counts within each p.s.u. may be independent, identically distributed Poisson random variables. This report contains evidence that the Poi-I sson model does not provide an adequate representation of the entrainment I I
I 1.3 I process. The Poisson model often leads to systematic underestimation of standard deviations and coefficients of variation, making entrainment I abundance estimates seem more precise than they actually are. The investi-gation of the distribution of ichthyoplankton counts leads to more effec-tive sampling designs and estimation procedures. These sampling designs include replicate observations, or counts, taken during several p.s.u.1s. I Information from these designs may be incorporated into the analysis of data from years in which no replicate sampling was performed, such as the years prior to 1982. The result will be more accurate estimation for I those years. The fourth task is to analyze the efficiency of estimation procedures I under the sampling designs described above, using the criteria chosen to compare designs. This evaluation hinges on the estimation of entrain-ment reduction due to outages. Comparisons between the two sampling designs used recently and competing designs are of special interest. I The fifth and final task is to recommend sampling designs for future entrainment abundance sampling at the three generating stat'ions. The I efficiency of the two sampling designs used in recent years, relative to other designs that could be employed, is judged by comparing the variance of the estimate of entrainment reduction due to outages, which differs I among these designs. 1.2 PLANT DESCRIPTION I Figure 1.1 depicts the geographic location of the Bowline Point, Indian Point, and Roseton power plants. The following general description of I the physical system at each of these plants has been taken verbatim from the 1983 Annual Reports for Bowline Point, Indian Point, and Roseton (EA, 1984a, 1984b, 1984c). I 1.2.1 BOWLINE POINT I The Bowline Point plant consists of two oil- or gas-fired steam-electric units, each having a net generating capability rating of 600 MWe and a gross capability of 622 MWe. Unit 1 began commercial operation in I September 1972 and Unit 2 in May 1974. Each unit has a separate once-through cooling water system. The cooling I water is pumped from an intake structure located on Bowline Pond (Figure 1.2). Each of the three intake bays for each unit is approxi-mately 5-m (16 ft.) wide and equipped with a bar trash rack, a 9.5-mm (0.374 in.) mesh vertical traveling screen, and a 700-m 3/min (185,000-I gpm) circulating water pump. The circulating water pumps for each unit I I I
1.4 I I NEW YORK I I Oanskammer Point Plant (480 MW(e)) km 107 I POU9hk~psie
.-=>
(,) I i= (,) w Z Z I ou Newburgh I Indian Point Plant I (1,900 MW (~) km 69 I
"Bowline Point Plant (l,200 MW(e)) km 60 I
I NI:W JERSEY I New York City Long Island I I I I Figure 1.1. Location of the Bowline Point, Indian Point, and Roseton I Generating Stations on the Hudson River. (Reproduced from EA, 1984a) I
I 1.5 I I LOCATION OF I AUTOMA TID AlUNOANCI
~;;;.._ _....;~;;;.... URS HUDSON " IR I O'SCHARGE STRUCTURE I
I 'UVER
".. SHORELINE I
I I I IOWUNE
,gND I
I I r 120 I I Figure 1.2. Oiagram of Bowline Point plant site. (Reproduced from EA, 1984a) I* I I
I 1.6 I can be operated individually or in combination. Circulating water flow and approach velocities at the bar racks vary with pumping mode: I Total Flow at Mean Intake Approach Number of Pumps Operat.i.!!g Water Elevation m3/sec (gpm) Velocity m/sec (fps) I 3 2 24.2 (384,000) 20.0 (316,000) 0023 (0.77) 0018 (0.59) I 2 (throttled) 16.2 (257,000) 0.15 (0.49) U During the 1983 sampling season, the Bowline Point plant operated in the two-pump throttled mode (140 x 104m3/day) at each operating unit through late May and operated with two pumps full flow (172 x 104m3/day) for the duration of the sampling period. o The circulating water is pumped through the condensers where the excess heat of the system is transferred to the cooling water. The maximum I condenser cooling ~iater flow is 24.2 m3/sec (384,000 gpm). The circula-ting water is retur'ned to the Hudson River about 400 m from the river shoreline where dispersion of the heated waters is effected by discharge I perpendicular to the river flow through submerged, multipart, high velocity diffusers. I 1.2.2 INDIAN POINT The Indian Point Generating Station consists of three nuclear-fueled I electric generating units. Unit 1, owned by Con Edison, has not been operated for commercial production since October 1974, although its circulating water and service water pumps are operated occasionally. I Unit 2, owned and operated by Con Edison, has been in operation since 28 September 1973 and has a net rated capacity of 873 MWe. Unit 3, owned and operated by the New York Power Authority, has been in opera- I tion since 30 August 1976 and has a net rated capacity of 965 MWe. All three units use Hudson River water for once-through cooling. Each unit has a separate shoreline intake structure for the withdrawal I of water from the Hudson River (Figure 1.3). The intake structure for Unit 1 has four rectangular openings that extend 8.0 m (26 feet) below mean low water. The intake structures for Units 2 and 3 each have six intake I openings extending B.2 m (27 feet) below mean low water. The intakes to Units 1 and 2 are equipped with fixed screens at the entrance to the intake bays and vertical traveling screens located behind the fixed I screens, whereas thE! intake to Unit 3 has only vertical traveling screens at the entrance to the intake bays. All screens are 9.5-mm (O.375-in.) square mesh. I I I
/
Unit 2 Unll :J Condensors Condensers 2 1 3 2
- Unit 1 N
I, -,- . Condollse. T J
- UJ
-*ii".ke .t ~-r~ . r-w 'u ~ . ..1l ".: ..."\~Wc; WI *** , -- UII,I,**"
IIUDSON nlVEn Figure 1.3. Diagram of the Indian Point Generating Station circulating water system. (Reproduced from EA, 1984b)
I 1.8 I Circulating water pumps with rated capacities of 530 m3 per minute (140,000 gpm) are used to pump Hudson River water through the condenser cooling system of each unit. Unit 1 has two circulating water ~umps with a total rated capacity of 1,060 m3 per minute (280,000 gpm1' and I two service water pumps with a combined rated capacity of 144 m per minute (38,000 gpm). Units 2 and 3 each have six circulating water pumps that withdraw water from separate intake bays (Figure 1.3). The I circulating water systems for Units 2 and 3 are designed to operate at either 100 or 60 percent flow. When the ambient water temperature is above 4.4 C (40 F) (spring through fall), the circulating pumps are operated so that the rated maximum cooling water flow for each unit is o 3,200 m3 per minute (840,000 gpm). During the winter, 40 percent of the cooling water is returned to the circulating pumps without passing I through the condenser, reducing the rated maximum intake flow for each unit to 1,900 m3 per minute (504,000 gpm). Service water for Units 2 and 3 is drawn through a separate intake forebay located at the center of each intakei the rated maximum total service water flow for Units 2 I and 3 is 114 m~ per minute (30,000 gpm). The cooling water and service water from all three units flow into a I common discharge channel. The combined discharge is returned to the Hudson River via (In outfall structure located downstream of Unit 3. The outfall structure consists of 12 ports submerged 3.6 m below the surface I (12 feet from center of port to water surface) at mean low water. Calculated transit times of cooling water traveling from intake to river outfall for Units 2 and 3, when operating at full pumping capacity, are 9.7 minutes for Unit 2 and 5.6 minutes for Unit 3. The calculated transit time from the intake to the condensers is about 1.5 minutes, and the calculated transit times through the condensers are 0.14 minutes a for both units. Thus, much of the total transit time through the cooling water systems of Unit 2 and Unit 3 occurs in the discharge canal. Because the discharge canal receives cooling water from all three units, transit I times through the canal are dependent upon the total circulating water flow through all units combined. I Estimated velocities of the cooling water vary as a function of cross-sectional area at different locations within the Indian Point plant condenser cooling water system. Estimated flow velocities are lowest at the intakes, ranging from 0.57 ft/sec at Unit 1 (when circulating I water pumps are operating) to 0.84 ft/sec at the intakes of Units 2 and
- 3. Estimated water velocities in the common discharge canal are con-Sistently higher than velocities at the intakes, and vary according to the I
number of circulating pumps operating at each unit. At the point of sampling in the common Unit 1 and Unit 2 discharge canal (Figure 1.3), estimated maximum .average velocity would be 3.85 ft/sec with all pumps I operating at 100 percent at both units and 2.82 ft/sec with only Unit 2 pumps operational. At the sampling station in the canal serving all three units, maximum average velocity would be 5.14 ft/sec with all three I units operating, 4.35 ft/sec with Units 2 and 3 operating, and 2.17 ft/sec I I
I 1.9 I with only Unit 2 or Unit 3 operating. At the discharge structure, how-I ever, gates to the submerged ports are opened or closed to maintain an estimated velocity of 10.0 ft/sec through the ports, regardless of the number of units operating. I The temperature rise (delta-T) encountered by organisms passing through the condenser cooling systems of the Indian Point plant depends on the cooling water flow rates and levels of power output (Can Edison 1977, I Tables 1-13 and 1-14). At Unit 2, with six pumps operating at full flow and the unit at 100 percent capacity, the predicted condenser tem-perature rise ranges from 8.8 to 8.9 C (15.8 to 16.1 F), depending upon I river temperature. During full capacity winter operation, with Unit 2 circulating pumps operating at 60 percent flow capacity (i,e., 40 per-cent recirculation), the predicted condenser temperature rise is approxi-I mately 14.7 C (26.5 F). At Unit 3, the predicted condenser temperature rise ranges from 9.5 to 9.7 C (17.1 to 17.4 F) for 100 percent capacity with six pumps operating at full flow; during winter operation, the predicted temperature rise is approximately 16.1 C (29 F). The higher I predicted condenser temperature rise for Unit 3, compared to that for Unit 2, is due to the fact that Unit 3 has a higher generating capacity than Unit 2, but uses the same volume of cooling water. I 1.2.3 ROSETON I The Roseton plant consists of two 600-MWe oil-fired units. Unit 2 began commercial operation on 14 September 1974, and Unit 1 on 31 December 1974. The two units operate independently but share a common intake I and discharge cooling system. Water for the once-through cooling system is withdrawn from the Hudson I River at a shoreline intake structure (Figure 1.4). The intake contains four cooling-water pumps and eight 9.S-mm mesh vertical traveling screens. After passing through the condensers, cooling water ;s dis-I charged into the river through a submerged jet diffuser. The area immediately in front of the intake and discharge structure has been dredged to a depth of 8.2 meters for a distance of 61 meters into the I river (CHG&E 1978). Various modes of pump operation are possible at the Roseton plant, depending on the number of units in operation. During 1983, two cooling-I water pumps were usually in operation from January through June and from mid-September through December. During this mode of operation, peak cooling-water flow was approximately 26.4 m3/sec. Cooling-water I pump operation was increased to three pumps in July, August, and early September with peak pumping rates of 35.4 m3/sec. I I I
I 1.10 I I I I
/
II I'I I
/ '1" . / Ii /fl CirC\Jlating Water Tunnel I
Di!Z:harge 1: I, I,
,I,r I I
v~k' -I
'I" 'I 'I I I,
Jr I
,I Entrainment Sampl.ing ___
Seal Well " I I
~ Intake StruC't\Jre I I~'
I, Location
.---- I ~--."'"
Diffuse!r
,,'7 ",
I~ HUDSON RIVER I
- : -*""~"" Scr~nwasi'l Dis.:harge I
I Fi gure 1. 4. General overview of the Roseton Generating Station. (Reproduced from EA, 1984c) I I
I 1.11 I 1.3 FIELD COLLECTION PROCEDURES I The following general description of field sampling procedut'es has been taken from the 1983 Annual Reports for Bowline Point, Indian POint, and Roseton (EA, 1984a, 1984b, 1984c). Many passages have been quoted ver-batim from these sources. I Ichthyoplankton samples we*re collected at Bowl ine Point Generating Station from 1 May through 13 August 1983, at Indian Point Generating Station from I 3 May through 13 August 1983, and at Roseton Generating Station from May through July 1983. Sampling was scheduled to occur continuously through-out the period at Bowline and Indian Point. Some gear malfunctions I resulted in some variation from the schedule at these two stations. Each 24-hour continuous collection consisted of eight discrete 3-hour samples. Sampling at Roseton was conducted weekly during early May, daily from 17 I May through 27 May and 31 May through 22 June, and weekly during late June and July. Roseton ichthyoplankton samples were collected and analyzed as 24 distinct 1-hour intervals on each day of sampling. Locations of sampl-ing points are shown in Figures 1.2, 1.3, and 1.4 for Bowline, Indian I Point, and Roseton respectively. Samples at Bowline and Indian Point were collected using the AUTOSAM I sampler (Figure 1.5). The basic components of the sampler include a 7.6-cm electric pump, a cylindrical collection tank (1 m in diameter and 1.2 m in height) containing a cylindrical SOO-um mesh plankton net, and a I microcomputer control module. All components are housed in an enclosed trailer. The intake hose had a diameter of 10.1 cm. At Bowline Point, samples were collected from the plant cooling water discharge pipe; the I intake hose was inserted into the discharge standpipe to a depth of about 1 m into the flow of water. At Indian Point, samples were collected from the discharge canal at Station 02; the intake hose inlet consisted of a 90 degree elbow fitting positioned at middepth (approximately 3 m below the I water surface of the canal) and oriented into the discharge water flow. The 10.1-cm diameter hose was reduced to connect to the 7.6-cm diameter sampling pump. The pumping rate ranged from approximately 0.7 to 1 m3/min; I however, the rate for most samples was 0.8 to 0.9 m3/min. Variation in rate occurred due to tidally induced changes in water level in the dis-charge canal. Average cross-sectional velocity at the mouth of the I intake hose was approximately 2 m/sec. Operational sequences of the AUTOSAM are controlled by the microcomputer module. During sampling, water is pumped into the net in the collection I tank where primary concentration of the sampled organisms and detritus occurs. Filtered water passes out of the collection tank through a dis-charge drain pipe. Flow rate and volume are measured by an inline flow-I meter mounted to the pipe that transports water from the AUTOSAM sampling pump to the collection tank. At the end of the programmed sampling interval, the following automated operations occur: (1) the pump shuts I off and the collection tank drains; (2) the collection net is rinsed, con-centrating the sample into the bottom of the collection net; (3) the sample is washed into the secondary concentrator and then *jnto a collection I container using chilled water (4.4 C) to reduce organism decomposition; and (4) formalin is automatically injected into the collection container I
From sampling pUmlJ rr=u 1)1 500*"m I~[ plankton' ."
.r Collection =0 tank N ,.,tu. anlL l 3*lnch sam filing rUlnl' tT.nkd",n -IJ ~I, (f Sample I
--') . TV'-.... I"=-p I11JJ ~ ChI1.,,'on con"'n... 181 Formalin InJection
+ DlschargrJ . "'"\....
Turn I a blu and molor syslem Figure 1.5. Schematic diagram of sampling components of EA's automated abundance sampling system - AUTOSAM (U.S. Patent No.4, 145, 925).
I 1.13 I to achieve a 10 percent formalin to water solution. After each sample I is collected, the turntable holding the collection containers rotates, and the sampling sequence automatically begins again. I Each sample can be comprised of many cycles; a cycle being defined as a sequence of pumping, net washdown, and transfer of sample contents to col-lection container. For the 1983 Entrainment Monitoring Prclgram at the Bowline Point Generating Station, each sample was a 3-hour composite of I 1-hour cycles (specifically, each cycle consists of 57 minutes of sample collection and three minutes of net wash and sample transfer). Because of the relatively long sampling period for each sample, thE! normal opera-I tional sequence was modified to inject formalin into the collection con-tainer after each 1-hour cycle to avoid potential predation and decomposi-tion. In addition, the collection of samples in 1-hour cycles resulted I in less damage to ichthyop1ankton by reducing the times these organisms would be in the primary collection net. I At Indian Point from 3 May throu9h 18 May each sample was a 3-hour com-posite, combining 1-hour cycles (specifically, each cycle consisted of 57 minutes of sample collection and 3 minutes of net wash and sample I transfer). To reduce clogging as a result of heavy detritus, samples col-lected after 19 May were 3-hour composites consisting of six 27-minute collections with a 3-minute wash. I Samples at Roseton were collected with pump and net/barrel collection systems and were taken from the cooling water discharge seal-well. The location of the intake hose opening was fixed so as to collect samples I from a constant depth throughout the season. At Roseton, this point was at the approximate midpoint of the discharge pipe, 1.8 meters off the bottom. The pump and net/barrel collection system is depicted in Figure I 1.6. A Homelite 4-in. pump (Model 610 GPM) with 4-in. intake and dis-charge' hose was used to direct the sample through an inline 4-in. Sparling flowmeter (Master Flo, Liters x 1000 Model) which measured total volume I and flow rate. Prior to the sampling season, Sparling flowmeters used at Roseton were shipped to the Sparling Division of Envirotech where the meters were calibrated for use with 4-in. pipe. Two 4-in. ball valves I enabled the operator to direct flow into one of two net/barrel systems so that sample collection and net-washdown/sample recovery could occur simultaneously. Each tank contained a 500-~ cylindrical collection net I with a submerged area of approximately 15 square feet (Figure 1.6). The bottom of the net was attached to a polyethelene funnel to simplify transfer of the sample to a container. I Approximately 15 minutes prior to the initiation of sampling (scheduled for 0900 hours), the sampling pump was started and adjusted to provide a constant rate of 275 gallons per minute (1,040 liters/minute). When I the start time was reached, sampling was initiated by recording the flow-meter value and by adjusting the ball valves to divert water to one barrel. The contents of the barrel filled during flow calibration were discarded. I Sampling continued for 30 minutes with numerous checks conducted to assure a constant flow rate. I
4-ln Intako Hose 4-ln Discharge Hose I t 4-ln Sparling Flowmeter r-o 4-ln Homelite Pump r-o
+:> '~ Identical Net/Barrel Collection System Figure 1.6. Schematic diagram of entrainment abundance collection gear, Roseton, 1983.
I 1.15 I After 30 mi nutes, the total vo 1ume sampled was recorded and f'low was I diverted to the other barrel for the next 30 minutes of sampling. During this time, the net was washed down and the sample recovered f~om the barrel. The procedure was repeated until the entire 24-hour sampling event was completed. Successive 30-minute samples were consolidated I into twenty-four I-hour (2 x 30-minute) composite samples. Field collection procedures required strict adherence to specific samp1-I ling volumes. Failure to maintain a flow of 275!25 gallons per minute and 8,250!750 gallons per 30 minutes for three consecutive 30-minute collec-tions or during any four of the forty-eight 30-minute collections in a I 24-hour period required the invalidation of all samples collected to that point and the reinitiation of a 24-hour sampling series at the nearest l-hour interval. I All samples were preserved with 10 percent formalin and stored in jars labeled inside and out. All collection information was recorded by field crews,onto standard data sheets. I , During the 1983 sampling season, a study was conducted at two generating stations in order to examine the collection efficiency of the sampling gear. The tests were conducted at Roseton and Danskammer Point on 5 August and 8 August 1983, respectively. Because of the unavailability of hatchery-reared striped bass larvae in 1983, neutrally buoyant beads I were used as the release object. The beads were introduced at the pump intake pipe and recovered after 30 minutes of sample collection. A total of 5 batches of 50 beads per batch were released into each net/barrel I collection system at each of the two plants. Recovery rates \lIere very high: at Roseton, 499 beads were recovered out of the total of 500 released; at Danskammer Point, 496 beads out of 500 were recovered (EA, 1984c, Section 5.6). I The sampling program at Roseton was supplemented by concurrent and additional sampling that was done at Danskammer Point. Sampling data from Danskammer Point can be used in the estimation of entrainment at Roseton during inter-I vals when sampling was not done at Roseton, but this approach will not be considered ~n this report~ I Replicate ichthyoplankton samples at Indian Point were also collected using the manual pump/net abundance sampler from 17 June through 8 ~uly 1983. Sampling was scheduled to occur Monday, Wednesday, and Friday throughout I that period. Similar to AUTOSAM collections, each sampling event was a 24-hour continuous collection effort separated into eight discrete 3-hour samples I Two manual pump/net samplers were used to collect samples from the discharge canal at Station 02 (Figure 1.7). The pumps were 4-in. gas-powered Home-lite trash pumps (Model l60-TP4-l). Inlets for the two lO.l-em diameter I intake hoses were located in the discharge canal, adjacent to the I I
I 1.16 I I I I I I ____" ' l 4 - CalBarrel ibration I I I I I I I I Figure 1.7. Generalized view of manual pump/net abundance and AUTOSAM collection systems used at 02 at the Indian Point Generating Station in 1983. I I I
~----~------
---~
From Sampling Pump _
--- I ,. . .
I H ,I...... -- 1 .......-.. -r---I- H2 0 level Ourlng Operation I 4*I"PVC I I I I Net I I I I I I
~ .... )- To Sample CollectIon Cont.lner Figure 1.B. Schematic diagram of net/barrel collection system used during entrainment abundance study. 1983.
I 1.18 I inlet of the AUTOSAM intake line. Inlets were atmiddepth and oriented into the discharge \~ater flow. Each pump sampled a measured volume of water that was recorded by an in1ine 4-in. Sparling (Master Flo) flowmeter. The samp*les were pumped into a modified SOO-um mesh collection net (0.6-m dia-I meter, 1.2-m long) suspended in a collection barrel (Figure 1.8). (Samples collected prior to 1 July were collected with a 333-um mesh net.) At the end of the sampling interval, the pump was shut off, the collection barrel I allowed to drain, and the ichthyoplankton and detritus were concentrated in a codend. I Similar to AUTOSAM collections, each sample was a 3-hour composite com-bining one-hour collections (for the manual pump/net system, each sample collection was appr'oximately 50-55 minutes in duration with 5-10 minutes I of net washdown). After each I-hour collection, the sample was transferred to the sample jar having both inside and outside labels, and was preserved with 10 percent fOl"TT1alin. All collection infor"TTlation was recorded by I field crews onto standard data sheets. Juvenile fish were collected from 15 June through 9 August 1983 using the net sampler apparatus (Figures 1.8 and 1.9) located at Station 01 in the discharge canal at Indian Point (Figure 1.3). Throughout this period~ net collections were stcheduled three days per week on a random basis. Net I collections also occurred continuously over a 24-hour period, and con-sisted of eight 3-hour samples. I Net samples were collected using a O.S-m diameter conical plankton net (57I-um mesh, 1.8-m long) mounted to a vertical frame which could be raised above or lowered beneath the water surface by an electric winch. During I sampling the net was lowered to within 0.5 m of the bottom, then raised in a controlled step-wise manner to provide samples that were evenly distri-buted throughout the water column. Flow was recorded from two General I Oceanics (GO) flowmeters (Model 2030), one positioned in the center of the net's mouth and the other located outside, but close to, the net. Flowmeters were changed between each sample in a 24-hour period. I
'I J
I
.1 I
I
I 1.19 I I
~
Motorized Pulley Net Frame I I I I I
~V~~
Bottom Net GO Flowmeters_
..,1-4 I II I \
I , I (f f I II I I I ~'Platform I I I I Currant I I I Figure 1.9. Net sampling apparatus at discharge canal Station 01, Indian Point Generating Station. I I
- - - - -- ----- ----~--------
I I I 2.
SUMMARY
AND CONCLUSIONS I Entrainment abundance sampling designs for Bowline Point, Indian Point, and Roseton Generating Stations are examined in this report. The primary goal is to determine how efficient various sampling designs are in evalua-I ting the effectiveness of plant outages in reducing entrainment impact. Total ichthyoplankton counts and counts of several specified taxa are treated. Estimates of entrainment abundance and the reduction in entrain-I ment due to outages are developed for sampling designs of interest, including those designs currently in use at the generating stations. The class of sampling designs under consideration is described. It includes I two-stage and three-stage sampling designs. In two-stage sampling, the total volume of water passing through a plant is divided into primary sampling units or p.s.u.1s, which correspond to time intervals during the I entrainment season. Samples are drawn from some or all of these p.s.u.1s and are analyzed for the presence of entrained ichthyoplankton. In three-stage sampling, these samples are subdivided into splits, which are I analyzed for entrainment. The number of splits that are analyzed from a given sample may be either fixed or adaptive. With adaptive sampling, the number of splits analyzed from any sample can depend on the entrainment pattern observed in that sample. I The assumptions needed for a comparative analysis of these sampling designs are carefully stated. Formal statistical criteria by which I sampling design performance will be measured are discussed. One of these is the absence or near absence of bias in entrainment estimates associated with a sampling design. Another is low variances for these estimates. A I good sampling design will produce point estimates of entrainment having little or no bias and low variance, and it will be able to deal with rapid changes in the level of entrainment. I The distribution of ichthyoplankton in the discharge channel is a central aspect of the estimation problem. Bowline Point and Roseton discharge through a pipe; Indian Point discharges through an open channel. The I distribution of organisms in the discharge channel is closely related to the sample counts observed; information from the investigation of distri-butions, through analysis of sample counts, is applied to the estimation I of entrainment. In work to date on the estimation of entrainment abundance, the vari-ability of the estimates has not been properly addressed. This is a I critical issue, because the estimated variance or standard error of any estimator provides essential information about the precision of that estimator. Estimated variances and standard errors of entrainm~nt abundance I and entrainment reduction estimates are not reported in the Indian Point and Roseton 1983 Annual Reports. These quantities are reported in the I I
I 2.2 Bowline Point 1983 Annual Report; however, they are computed using the I assumption of a Poisson model for variation in the discharge channel. This model does not agree with data obtained in replicate sampling, as detailed in Chapter 4 below. Consequently, estimated variances and I standard deviations of entrainment abundance estimates based on the highly specific properties of the Poisson model are only valid under special circumstances. I A different model that is consistent with the replicate sampling data has been determined. We show in Chapter 4 that the negative binomial model agrees very well with these data. This class of distributions is more B general and flexible than the Poisson family. We have developed a method for estimating the variances of entrainment abundance estimates from the data under the negative binomial model. Thus, the variance and standard I error of entrainment abundance estimates can now be estimated in a valid manner. This is a considerable advance over previous methodology; a point estimate of entrainment is of very limited value unless accompanied by an indication of its variability or margin of error. I The shift from the Poisson model to the negative binomial model is a logical evolutionary change. The Poisson model is the simplest model for I random variation in the discharge channel. It was a biologically plaus-ible and analytically convenient starting point. Only with the collection of replicate sampling data in 1982 and 1983 did it become possible to test I whether the Poisson model provides an adequate explanation of variation in the discharge channel. It does not, but the negative binomial model does. The negative binomial family includes all Poisson distributions, I so moving from Poisson-based to negative binomial-based analyses is a natural progression. The effect of sample duration on the precision of entrainment estimates is treated. Shorter p.s.u.'s increase the preciSion of estimation, leading to a recommendation of one-hour to three-hour p.s.u.'s. A model is investigated that would explain the occurrence of sample counts fitting I the negative binomial distribution if the true behavior of the organisms were Poisson. In this model, errors in classifying organisms having a Poisson distribution lead to observations that are not Poisson but are I negative binomial. Examination of data in which classification error is virtually absent ll~ads to the conclusion that this classification error model cannot be used to explain the negative binomial nature of the entrainment data. In other words, the hypothesis that actual Poisson I counts are perturbed by classification errors, resulting in negative binomial data, is not supported by our analysis of the data. I The special case of juvenile data is discussed. The limited availability of replicate observations for juveniles makes it difficult to determine whether the negative binomial model is needed, that is, whether the Pois- I son model must be replaced by a more general distribution in order to obtain a satisfactory fit to the available juvenile data. Based on the small amount of data gathered, it appears that the Poisson distribution may tentatively be regarded as satisfactory. If this should be confirmed I by future analYSis of further replicate data, there will be little difference between the Poisson and the negative binomial approaches. I I
I I 2.3 The estimation of entrainment abundance and entrainment reduction is then considered. Formulas for confidence intervals are derived based on the I negative binomial distribution of sample ichthyoplankton counts. The effect of sample volume on the precision of estimation ;s examined. Methods are presented for both entrainment abundance and the effect of I outages in mitigating entrainment. The standard assumption that ichthyo-plankton entrainment densities are independent of the cooling water flow rate allows our method to be applied to projected as well as actual cool-I ing water flows. A large simulation study of the entrainment process is presented" to show the I behavior of entrainment estimators under a variety of conditions. Entrain-ment sampling is first examined over the course of a single day, for a wide range of organism densities and sample volumes and for uniform and diel daily variation. Entrainment over longer periods is then treated: 3 day, 5 I day, one week, 18 day, and 34 day periods are simulated. An important con-clusion is that sample volumes of 150 cubic meters per 3-hour period, or 1200 cubic meters per 24-hour day, are sufficient for purposes of entrainment I estimation over the range of parameters considered; larger sample volumes " produce only slight gains in accuracy of estimation. I A potentially valuable technique for laboratory analysis of field col-lections has been presented. This approach, known as sum-quota sampling, is a more sophisticated version of the splitting of field samples that "was done on some Bowline Point 1982 data. Sum-quota sampling, an adaptive I form of three-stage sampling design, is a more efficient sampling proce-dure than two-stage or fixed-number-of-splits three-stage sampling designs requiring comparable sampling effort. It would reduce I the cost of laboratory processing of entrainment samples at the three generating stations, requiring less sampling effort to achieve the same accuracy in estimation as competing fixed-sample-size sampling designs. I At present, sum-quota sampling cannot be implemented for two major reasons: first, the laboratory equipment currently available for sample splitting does not provide the necessary randomness in dividing the ichthyoplankton in the sample among the splits of subsamples; and, second, use of the data I for other purposes in addition to estimation of entrainment abundance "and reduction may require that sum-quota sampling not be used. I Noncontinuous sampling plans, in which sampling of the discharge flow is not scheduled during preselected periods within the entrainment season, present more difficulties than continuous sampling plans. The omission I of these periods, ranging in length from one to six days, increases the variance of entrainment estimates. Even more important, the systematic nature of noncontinuous sampling results in estimates that are biased except under the assumption that certain special conditions hold. Neither I the direction nor the magnitude of the bias can be inferred from the data. Negative bias is a clear possibility, stemming from the chance that a peak level of entrainment will occur during a sampling gap of one or several I days; if this happens, entrainment will be underestimated. Entrainment estimators will be unbiased, however, if there is a linear trend in the daily entrainment during each sampling gap. I I
I 2.4 The specific features recommended for an entrainment abundance sampling I design are: continuous sampling (or as nearly continuous sampling as possible); sampling fractions (that is, volume sampled in p.s.u. divided by discharge flow volume in p.s.u.) that are as nearly equal as possible I for all p.s.u. IS; adequate total volume sampled to yield acceptable con-fidence intervals for the entrainment quantities of interest; sufficient replicate sampling to allow precise estimation of the quadratic variance I parameter c 2 , leading to shorter confidence intervals; and p.s.u. IS of one to three hours. Additional studies related to entrainment estimation are proposed to examine I the properties of replicate sampling, the relationship between entrainment and plant operating factors, and the use of entrainment estimates from previous years. I I I I
"I I
I I I I I I
I I I 3. SAMPLING DESIGNS UNDER CONSIDERATION I
3.0 INTRODUCTION
I The sampling design encompasses several phases of an entrainment abundance study that are often viewed as separate activities: the collection of field samples, the rules for determining what types of laboratory analy-I sis are to be performed on which samples, and the method of deriving from laboratory data estimates of entrainment abundance anc reduction. These aspects of the study must be viewed as a cohesive whole; a poor choice at any of these stages introduces inefficiency that cannot be I removed from the estimation process, no matter how well the remaining stages are executed. Data collected from a field plan that is flawed cannot be redeemed by superior laboratory work and statistical analysis. I Perhaps more surprisingly, a sound field collection plan can be offset by a pattern of laboratory analysis that fails to extract as much informa-tion as possible from the field samples. Finally, statistical estimation I procedures that fail to make full use of the information emerging from the laboratory will procuce inefficient estimates even from data of the highest quality. I The field sampling rules must provide full information on when sampling is to take place, how many samples (one or more) are to be drawn con-currently, where the discharge channel is to be sampled, and so* on. The I laboratory rules must provide full information on how the sample is to be processed, including any splitting of the sample into smaller parts and decisions on which of these parts to analyze. The statistical estimation I procedures must provide appropriate, efficient point estimates and confi-dence intervals for entrainment abundance and reduction. I We shall continue to refer to the overall process of field collection, laboratory analysis, and statistical estimation as "sampling deSign," largely for reasons of historical precedent in previous entrainment studies. This process is more commonly referred to in the statistical literature I as "experiment design," a phrase that reflects more clearly the concern with all aspects of the investigation. I Any sampling design must be considered as a complete process that includes field collection, laboratory analysis, and statistical estimation. Field collection plans cannot be rationally compared without asseSSing how I laboratory and statistical analyses will be applied and what the conse-quences for estimation will be. Similarly, neither laboratory analyses nor statistical estimation procedures can be compared without consider-I ing the other aspects of sampling design. I I
._1.,..'....'_-
B 3.2 I This chapter descr'ibes the two-stage and three-stage sampling deSigns being considered for entrainment abundance studies in the future, includ-ing those currently in use. It also specifies the assumptions required I for the derivation of entrainment abundance estimates appropriate for each of these designs. I In i3. three-stage sampling design, each sample is divided in the laboratory into several portions called splits, only some of which are processed. This allows us to allocate a specified level of sampling effort by dividing I each sample into splits and analyzing a greater number of samples partially instead of analyzing fewer samples completely. This procedure gives some-what 1ess informat'i on about a parti cul ar partially-processed sampl ethan I would be available if processin9 were complete, but it allows more or larger samples to be (partially) processed for a given total cost. To determine whether splitting is beneficial, we must ascertain whether the process of field collection, laboratory analysi~ and statistical estimation I gives improved resLilts--more accu.rate,estimates--with or without splitting. A fact emerging clearly from this chapter is the critical need for a model I that accurately reflects the distribution of ichthyoplankton in the discharge channel. One reason for this is that replicate sampling, taking more than one sample in a discharge channel simultaneously, has been done very infre- I quently. The standard two-stage and three-stage sampling formulas for the variability of estimates contain terms that do not exist when replicate sampling is absent.. Thus, estimated variances must be inferred from unrep- m licated samples by other methods, and the model approach is the natural choice. A second reason supporting the model approach is that even if replicate sampling has been performed, modeling the distribution is likely to provide more accurate estimation of entrainment abundance. Consider I estimating entrainment abundance by using these two contrasting strategies: I (1) Make minimal assumptions about the process, or no assumptions
,at a11 . Develop estimati on formul as that are val i d under the extremely general conditions that these minimal assumptions hold.
I (2) Make moderate or strong assumptions about the process, limiting the range of behavior the process is likely to exhibit. Develop estimation formulas that seek to perform with the greatest possible efficiency when the assumed conditions hold.
-I Making the assumptions referred to in the second strategy specifies a model for the process. This is the definition of a statistical model.
When the assumptions do in fact hold, the estimation procedures produced I by the second strategy can be expected to outperform the competing pro-cedures produced b:v the first strategy. This is because the second strategy procedures have been selected to do better than any other I I I
I 3.3 I procedures under these conditions. If the entrainment data are consistent with a specified model, then estimation procedures derived from this I model IS properties should give more accurate results than more general, non-model-based estimation procedures. The extrinsic information supplied in the form of a model thus enhances the intrinsic information supplied by the data. I To state this another way, if an estimation formula is valid under extremely general conditions, this generality often entails a loss of I efficiency for a specific situation. An example, which is not connected to entrainment estimation but which illustrates the point simply, is the following: it is desired to estimate the variance a2 of the distribution I from which the random sample xl, *** ~xn has been drawn. For any distribu-tion with finite variance of a2, an unbiased estimate of a2 is given by I . L~=1(Xi - x)2/(n-1). This estimator is clearly valid under extremely I broad conditions. However, if the distribution is known to be Poisson, then a second unbiased estimator of aZ based on the Poisson model is given by x. This estimator x has lower variance than the first unbiased I estimator when the distribution is actually Poisson, which makes it more efficient under the Poisson model. Thus 2 if the data x1, **. ,x n are consistent with the Poisson model, then x is a better choice as an esti-mator of a2* I 3.1 TWO-STAGE SAMPLING DESIGNS I In a two-stage sampling design, the entire volume of water passing through a generating station is divided into units called primary sampl-I ing units or p.s.u.ls. A set of these is selected for examination. It is logical to let p.s.u.ls correspond to disjoint, exhaustive time intervals, so a given p.s.u. is the cooling water that passes through I the plant in a particular time period. For example, if the p.s.u.1s correspond to three-hour periods over a ten-day interval beginning at midnight, then there are 80 p.s.u.ls: Day 1 0000-0300, Day 1 0300-0600, Day 1 0600-0900, and .so on up to Day 10 2100-2400. Some, and perhaps I all, of these will be selected for subsequent examination. Each p.s.u. is conceptually divided into subvolumes called subunits, some I of which are selected at random for laboratory analysis. The need for subunits is clear, as a p.s.u. is much too large for laboratory analysis. Any subunit selected for laboratory analysis will be called a sample. I The number of p.s.u. 's available, the number selected, the numbers of subunits available and selected within each p.s.u., and the counts of organisms entrained in all chosen subunits are known. Under very broad I assumptions, this information yields an estimate of the total count of organisms entrained in the entire volume of water passing through the plant. (Although the exposition here treats the total overall count of organisms, it applies without change to any particular taxon and life I stage. ) I
I 3.4 I The usual estimation process for two-stage sampling is now outlined. Difficulties inherent in using two-stage sampling will then be discussed. The treatment here follows the notation of Cochran (1977), a standard reference on this material. For convenience, only the case of equal-I sized p.s.u.'s is considered, as the complexities of the unequal-sized p.s.u. case do not alter the main features of this discussion. I The population quantities in two-stage sampling are I N = total number of possible p.s.u.'s M = number of possible subunits in each p.s.u. I
' d'1n the J.th subun1't of the l* th = num ber o'1" organ1sms . en tralne p.s.u., i=l, ..* ,N, j=I, **. ,M I Y.1 = mean number entrained in the ith p.s.u. (over the M subunits in this p.s.u.~ i=l, ... ,N I = overall mean number of organisms ,entrained (over all NM pos-sible subunits) I y = NMY = total overall count of organisms entrained (over all NM possible subunits) I N = L (Y. - ; =1 1 y)2/(N_l) = variance among the N p.s.u. means V.
1 I
= L N M L~"
i=l j=1 1J
- Y.)
2
/N(M-l) = variance among subunits within p.s.u.'s I The sample statistics are I
n = number of p.s.u.'s selected I m = number of subunits or samples selected in each p.s.uc
= n/N = fY'action of p.s.u.'s selected I = m/M = fY'action of subunits or samples selected = numb~£hof organisms entrai~ed in the jth (selected) subunit of I
the 1 (selected) p.s.u., l=I, ... ,n, j=I, ... ,m -y. m I
= 2 y .. /m = mean number of organi sms in the m selected sub-1 j=l 1J units of the ith (selected) p.s.u., i=I, ... ,n I I
I 3.5 I - n
= L n m .r YiJ*/ nm = mean number of organisms y = L Yi/ n in the nm ;=1 i=1 J=1 I observed subunits I y = nmy- = total count of organisms entrained in the nm observed subunits I
A Y = NMy = (NM/nm)y ~ product of the number of entrained organisms counted in the entire sample and inflation factors N/n = 1/f1 and M/m = 1/f2 I n
= L (y.- . 1 y)2/(n_1) = observed variance among the n observed p.s.u.
I 1= 1 means -y; I = n r r m (Yij - Yi)2/ n{m-1) = observed variance among subunits i=1 j=l I within p.s.u.'s I This approach to estimation in two-stage sampling divides the overall variation in the set of NM subunits into two components: Sf is the vari-ance among the N means of t~e p.s.u.'s, and S~ is the variance among sub-I units within p.s.u.'s, that is, the variance among subunits within a A single p.s.u. averaged over the N p.s.u.'s. The estimate Y of the total I entrainment Y is obtained by randomly selecting n p.s.u. IS, taking m randomly chosen subunits within each of these p.s.u.'s, calculating the I mean number y of organisms found in the nm observed subunits, and multi-plying this mean by the total number of subunits in the entire volume of water, NM. This produces the estimator I Y= NMY. I The variance of this estimator is affected by si, s~, and the choices of nand m. It is given by the formula I 2 Var{Y) = N M2[{1 - f1)Si/n + (1 ~ f2)S~/nmJ. I This variance increases as the variance among the N p.s.u. means, si, or th~ variance among subunits within p.s.u.'s, s~, increases, or as I both of these variances increase simultaneously. As the number of p.s.u.'s selected, which is. n, approaches and finally equals N, the term 1 - fl .1
I 3.6 decreases to 0, so the term in the variance of Y involving the variance among the N p.s.u. means diminishes in importance as more 2P.s.u. 's are I sampled. The corresponding decrease takes place in the S2 term as more subunits within each selected p.s.u. are selected and analyzed . I An unbiased estimator of Y is given by Y, and an unbiased estimator of Var(Y) is given by I I (The estimator Var(Y) is derived by showing that s~ is an unbiased estima-tor of s~ and that sf - (1 - f2)s~/m is an unbiased estimator of IS f.) I Two important iSSUE!s must be dealt with in attempting to use these two-stage sampling des*ign formulas to estimate entrainment abundance. The I first of these, a major analytical problem with the current sampling programs, is that a single subunit is selected from each p.s.u., making m equal to 1. In other words, only one sample is drawn from the cooling I wate*r passing through the plant in each time period. This makes it impossible to calculate s~ and therefore ~ar(~). Intuitively, the reason I that this leads to technical difficulties is clear: when only one subunit is observed per p.s.u., there is no way to estimate the variability among
- subunits within a p.s.u. It is possible to use an externally derived esti-I mator s~, that is, an estimator of s~ obtained from sources other than the entrainment samplingAcon~idered here. Such an external estimator s~ would I
allow us to compute Var(Y) when m = 1; however, this approach will not be pursued in this study. I The other issue is that these two-stage sampling design formulas make no use of any information except the nm observed subunit counts y;j' The I formulas are thus valid under extremely general conditions: when n>1 and I
... ... A m>l, Y is an unbiased estimator of Y whose estimated variance is Var(Y).
However, such generality often entails a loss of efficiency for a speci-fic situation, as the example at the end of Section 3.0 illustrated. A ... The generality of Var(Y) suggests that if information about the process that produced the Yij'S w:re available, it could be used to derive an I improved estimate of Var(Y) whose variance is smaller than the variance of Va.r(Y). I As a further illustration that distributional information about a process can lead to improved estimation, we note that under certain circumstances, I it is possible to find an unbiased estimator of Y that has smaller vari-ance than Y.Specifically, if the subunit counts Yij were independent, I I
I 3.7 I identically distributed Cauchy random variables (or equivalently, followed a t distribution with 1 degree of freedom), multiplying NM by the median of the observed subunit counts would give an unbiased estimator of Y. In I A A this situation, neither the mean nor the variance of Y exists; thus Y is not an unbiased estimator of Y, and its variance can be* interpreted as infinite. (See Mood, Graybill, and Boes, 1974, pp. 117,540.) We mention I this example only in passing, because the circumstances do not perta~n directly to the entrainment sampling situation; however, they dramatlze the relevant point that knowledge of the model followed by a process can have enormous impact on the choice of estimation methods. I Information about the model followed by the entrainment process may be able both to overcome the lack of replicate sampling when m=1 and to I improve on the general estimation procedures when m exceeds 1. For example, if subunit counts within each p.s.u. are independent, identi-cally distributed Poisson random variables, then S~ can be estimated by I y when mal as well as for larger values of m {EA, 1983~ pp. E-13 to E-l5}. For this reason and because the turbulence within the discharge I flow suggests a spatially random distribution of organisms in the flow cross-section, it is tempting to make the assumption that independently drawn samples from a p.s.u. will be a set of randomly drawn observations I from a Poisson distribution. (The exact meaning of a spatially random distribution is discussed in Appendix I.) However, Chapter 4 of this report will show that the Poisson model does not fit replicate sampling I data from the generating stations, but that a related model does . Thus, estimates of the variability of Y based on the assumption of Poisson I structure among samples within each p.s.u. are not valid for these data. In the past and present sampling deSigns used at the three generating I stations, m has been restricted to the value 1 except in relatively infrequent replicate sampling experiments. For this reason, the estima-tion of entrainment abundance depends critically on the development of a model for the entrainment process. This provides not merely an opportunity I A ... to improve the estimator Var(Y), but a technique for deriving an estimator
.. ... A of Var(Y} under circumstances (m=l) when Var{Y} does not even exist.
I The parameters of a two-stage sampling deSign are I (1) Nt the total number of p.s.u. 's in the overall time period of interest, the entire entrainment season. Equivalently, the time interval corresponding to a p.s.u. can be used as the I parameter, since N can be found easily from this time interval and the length of the season. I (2) n, the number of p.s.u. 's selected for examination. Equiva-lently, the proportion fl = nlN of p.s.u.'s selected can be used. (3) M, the number of possible samples or subunits contained in an I entire p.s.u. I
I 3.8 (4) m, the number of samples or subunits selected per p.s.u. Equi-valently, the proportion of f2 = m/M of available subunits I chosen can be used. The first two of these parameters can vary over a wide range. The time o interval corresponding to a p.s.u., which is the total number of hours in the entrainment season divided by N, can be very short (one hour or D less), very long (24 hours or longer), or somewhere between these extremes. Intervals shorter than one-half hour seem undesirable, as do intervals longer than 24 hour's. Very short intervals result in samples containing a great deal of random variability or "noise" relative to the amount of I information or "signal" present. They also involve more administra-tive work to labell' transport, process in the laboratory, enter in com-puter data bases, and analyze statistically. There appears to be no I benefit in improved estimation derived from this extra effort. On the othe:r hand, when the time i nterva 1 spanned by each p. s. u. is very long, the estimati on of ()vera 11 seasonal entrainment is based on re 1ati ve ly few I p.s.u.'s rather theln on relatively many shorter p.s.u.1s. Consequently, the accuracy of th4! seasonal entrainment estimate decreases as the time iriterval of a p.s.u. increases. The length of time chosen to serve as a I p.s"u. is treated fully in Section 4.3. The number of p.s.u.'s selected for sampling can range from relatively I few of the N p.s.u.'s to all N of them. The proportion of f1 = nlN can range from near zero to one. Continuous sampling, in which every p.s.u. is sampled, was in effect at Bowline Point (EA, 1984a) and Indian Point I (EA, 1984b) generating stations during the 1983 entrainment season. At Roseton, however, sampling was not continuous during the 1983 entrain-ment season (EA, 1984c). Between May 10 and June 30, sampling was per- I formed on 31 out of 52 days, with gaps occurring on May 11-16, 28, 30, and June 4-6, 11-13, 23-27, and 29-30. As might be expected, continuous sampling produces much more accurate estimates of entrainment than sampl- I ing with many p.s.u.'s omitted, especially if the gaps between sampling periods are long. Continuous and noncontinuous sampling designs are com-pared in Chapter B. I The number of possible samples *contained in an entire p.s.u. is given by I
= volume of cooling water flow past sampler during p.s.u.
M volume of cooling water drawn for (one) sample dur1ng p.s.u. I If replicate sampling is performed, the denominator ;s the volume of a single sample, not the total volume of all samples taken simultaneously. I M is determined by the volumes of cooling water flowing past the sampl-ing gear and into it during the p.s.u.; 11M is the fraction of flow in the discharge channel constituting a sample. Changing the value of M I would entail alte!ring the physical sampling apparatus, e.g., changing the 4-inch diameter intake hose on the AUTOSAM to another size and making corresponding adjustments in the sample pumping rate. There does not I appear to be any reason why this should be considered. I
I I 3.9 The number of samples drawn per p.s.u. has generally been m = 1. There I have been some replicate sampling data collected recently, though, with simultaneous sampling occurring at more than one location within the discharge channel (EA 1984a, 1984b). I 3.2 THREE-STAGE SAMPLING I In a three-stage sampling design, the entire volume of cooling water pas-sing through a generating station is divided into p.s.u.ls corresponding to disjoint, exhaustive time intervals. A set of these is selected for I examination, and a random sample of subunits is drawn from within each chosen p.s.u. The chosen subunits are also referred to as samples. Each of these subunits, or samples, is then further subdivided into several I portions called splits or aliquots, some of which are selected in random order for laboratory analysis. When certain conditions are met, the rules of the sampling procedure may specify that no further splits from I a given sample are to be processed. Should this occur, laboratory analysis continues by moving to another sample, dividing it into splits, and selecting these in random order for processing. I The number of p.s.u.ls available, the number selected, the numbers of samples available and selected within each p.s.u., the numbers of splits available and selected within each sample, and the counts of organisms I entrained in all chosen splits are known. Under very broad assumptions, this information yields an estimate of the total count of organisms entrained in the entire volume of cooling. water passing through the plant. I (The treatment here of the overall count of organisms applies without change to any particular taxon and life stage.) I The usual estimation process for three-stage sampling is now outlined. Difficulties in utilizing these standard three-stage sampling estimation methods will then be discussed. As with two-stage sampling, we follow the notation of Cochran (1977), where formulas omitted here are found, I and treat the case of equal-sized p.s.u. IS. The population quantities in three-stage sampling are I N = total number of possible p.s.u. IS I M = number of possible samples (subunits) in each p.s.u. K = number of possible splits in each sample I = number of organisms entrained in the hth split of the jth sample within the ith p.s.u., i=l, .*. ,N, j=l, ..* ,M, I h=l, ... ,K I I
I 3.10 I Y.. = mean number entrained in the jth sample from the ith p.s.u. 1J (over the K splits in this sample) I
= mean number entrained in the ith p.s.u. (over the MK splits y
in this p.s.u.) I
= overall mean number of organisms entrained per split (over y
all NMK possible splits)
= NMKY = total overall count of organisms entrained (over all I
NMK spl its)
= variance among the N p.s.u. means Yi I = variance among sample means within p.s.u.'s I = variance among splits within samples I The sample statistics are n = number of p.s.u.'s selected I
m = number of samples selected in each p.s.u. I k = number of splits selected in each sample Y'ijh = number of organisms entrained in the hth (selected) split of I the jth (chosen) sample within the ith {selected} p.s.u., i=l, **. ,N, j=l, *.* ,M, h=l, *** ,K I
- = mean number of organisms in the k (selected) splits in the Y'ij jth (chosen) sample within the ith (selected) p.s.u. I V.'I = mean number of organisms per split in the mk selected splits . th e 1.th p.s.u.
1n I
= mean number of organisms per split in the nmk selected splits Y
Y = nmk~ = total count of organisms entrained in the nmk selected I spl its Y = NMKy = (N~'K/nmk}y = product of the number of entrained organisms in the entire sample and inflation factors N/n, M/m, and K/k I n _ I
~, = r (~._y)L/{n_1) i=l 1 = observed variance among the n observed p.s.u.
means Yi I I
I 3.11 I
- ~4(Yij-Yi)2/n(m-l):: observed variance among sample means I lJ within p.s.u.'s I :: ~4Lh(Yijh-.Yij)2/nm(k-1):: observed variance among splits lJ I within samples I
I . An unbiased estimator of Y is given by Y, and an unbiased estimator of I .. Var(Y) is given by I var(Y)
- 2 N M2K2 [(* -~) s~ k (~- ~H ~M(t *)s~l*
+ + -
I The same issues that arose with two-stage sampling design formulas must be dealt with in estimating entrainment abundance from the three-stage sampl-I ing design formulas. Because m :: 1, reflecting the selection of only a single sample (subunit) from each p.s.u., s~ and therefore Var(Y) cannot be calculated. The component of variance due to variation.among sample I means within p.s.u.'s cannot be estimate~. furthermore, even if it were true that m > 1, allowing us to compute Var(Y), the opportunity to find a I better estimator of Var<Y) than Var(Y) would be present if we could model the process that generated the Y;jh's. I In deciding how many splits to analyze from each sample, we may elect either of two different types of strategy, fixed or adaptive. If we choose to analyze a fixed number of splits per sample, as in the estimation pro-I cess just described, the methods are an extension of the two-stage sampling design techniques. If instead we choose to analyze a number of splits that may vary from one sample to another, the processing of splits can be terminated adaptively; that is, the laboratory instructions can specify I that when certain conditions occur, analysis of splits from a given sample will terminate, any remaining splits from this sample will be discarded, and analysis of splits from a different sample will commence. Analyzing I a fixed number of splits per sample is, in fact, a special case of adap-tive sampling design: laboratory instructions would be to select splits from a sample at random, stopping when the condition occurs that k splits have been analyzed. Other possible adaptive rules include sum-quota sampling, to be discussed in Section 7.1, in which instructions would be:
I 3.12 select splits from a sample at random, terminating whenever some split I raises the total number of ichthyoplankton counted in the sample above a specified threshold (e.g., to 200 or"more); if the total sample count remains below the threshold, then count all splits. The class of all I adaptive subsampling rules includes all designs with constant k, in which the same number of splits ;s processed from every sample, as well as a variE~ty of more flexible designs; optimal or nearly optimal adaptive I sampling designs arE! therefore likely to outperform optimal deSigns with a fixed number of splits by a wide margin. I The parameters of a three-stage sampling design are (1) N, the total number of p.s.u.1s in the entrainment season. I Equivalent.ly, the time interval corresponding to a p.s.u. can be used. I (2) n~ the number of p.s.u.1s selected for examination. Equivalently, f1 = nlN of p.s.u.'s selected can be used. the proportion (3) M, the number of possible samples or subunits contained in an I entire p.s.u. (4) m, the number of samples selected per p.s.u. Equivalently, ~he I proportion f2 = m/M of available samples chosen can be used. (5) K, the number of possible splits contained in a sample. I (6) k, the number of splits selected from each sample. If an adaptive sampling design is used, k will vary among samples; here, instead I of a fixed value k, a rule must be provided that specifies the conditions under which analysis of splits from a particular sample terminates. I The ranges of N, n, M,and m to be considered are the same as for two-stage sampling designs. Full details are given in Section 3.1. I The number K of splits into which a sample is divided is limited by the capabilities of the splitting device and by the practical considerations I involved in laboratory proceSSing. These matters, especially the cap-abilities of the splitting device, have a major impact on the feasibility I of three-stage sampling. They are discussed in Section 7.0.
- T~e number k of splits per sample selected for analysis in a design with flxed k can range from k = 1 to k = K. The choice k = 1, however, has drawbacks analogous to those mentioned in the discussion of m = 1 in Sec-tion 3.1: this choice makes it impossible to calculate s~. Because we then cannot use s~ to estimate the variance among splits within samples, we must estimate this variance by making some assumption about the behavior of split counts within subunits. This issue is treated in Section 7.1.
I 3.13 I For adaptive sampling designs, k varies among samples. An almost unlimited variety of, rules for terminating the analysis of splits from any sample could be proposed. Consideration here will be limited to the sum-quota type sampling rules defined in Section 7.1. This class of rules will be I seen to improve greatly on the performance of sampling designs with a fixed value of k. I For sampling designs in which k is fixed, optimal designs are those for which the choices of N, n, M, m, K,and k reduce the variances of entrain-ment estimates as much as possible, consistent with a fixed level of sampl-I ing effort and practical limitations on sampling patterns. The general practice, except in relatively infrequent investigations of replicate sampl-ing, has been to make m = 1; a model for the distribution of ichthyoplankton I in samples from the discharge channel is of crucial importance in this case, as it provides the only way to make inferences from the observed counts to the entrainment abundance in the discharge channel. I For adaptive sampling designs, optimal designs involve choices among N, n, M, m, K,and rules for termination of split analysis. In sum-quota sampl-ing designs, a major part of this choice hinges on the tradeoff among the I time period of a p.s.u. (that is, the time interval spanned by a sample), the number K of split~ into which each sample is divided, and the quota Q that tells how many organisms must be counted before split analysis ceases. I The same level of sampling effort is required by many different combina-tions of these. Optimal designs produce the lowest possible variances for entrainment estimates at a fixed level of sampling effort. Good choices of design parameters can be made on the basis of information. currently I available. Optimization requires developing a formula for estimator vari-ance as a function of N, K, and Q and finding the best combination of N, K, and Q for a given level of sampling effort. I 3.3 ASSUMPTIONS REQUIRED FOR ANALYSIS I The analyses in this report are valid only if certain assumptions about the entrainment process and the sampling of that process are correct. It I is desirable to assume as little as possible so estimates of entrainment abundance and entrainment reduction will be valid under widely varying conditions. The assumptions on which any statistical sampling design depends should be stated explicitly, to allow them to be scrutinized and I perhaps tested by sampling experiments specifically designed to examine their validity. We now list and discuss the assumptions made in this report. I Assumption 3.1. The relationship between an entrainment sample and the contents of the entire discharge channel can be modeled in some way. I This is necessary to ensure that the contents of entrainment samples are related to the contents of the whole discharge channel in a nanner that I is understood to some- degree. In other words, the samples must be, in some sense, representative of the cooling water flow in the E!ntire channel cross-section at the sampling location. Our knowledge of the relation-ship of the sample to the entire cross-section flow can be vague, speci-I fic, or somewhere between these extremes. I
I 3.14 When our knowledge of a process is vague, we can describe the process and I its behavior only at a general level. Our understanding of the process is not complete, but partial. Consequently, this understanding is embodied in mild, general assumptions. For instance, when considering a collection of observations (o'F entrainment abundance or any other phenomenon) from a I process about which little or nothing is known, we may decide to assume only that the observations are independent and identically distributed. We may also decide to assume that the common distribution of the observa-I tions has finite mf~an and variance. These assumptions are very general: they are satisfied by a wide range of distributions. I In contrast, specific knowledge of a process is a detailed description of the mechanisms and relationships involved. This is usually stated as a parametric model. For instance, we may decide to assume that organism counts in an entra'inment process behave as independent observations from u a Poisson distribution whose intenSity parameter or mean is given by a specified function of plant and river conditions. I An example of the IJse of vague knowledge is the treatment of two-stage and three-stage sampling in Cochran (1977, Ch. 10). The only assumptions are that the p.s.u.'s selected for sampling, the samples selected within each o p.s.u., and the sp'lits selected in three-stage sampling within each sample are all chosen at 1"'andom from the set of all available units of each type. No parametric assumptions are made about the distributions of the counts I in these units, so the results of this analysis are broadly applicable. Nevertheless, this method has drawbacks that were pointed out in Sections 3.1 and 3.2, including its inability to cope with situations where m = 1. The importance of the assumption of randomness in this method must be o emphasized - if it is false because of systematic spatial nonrandomness in the discharge channel, then this approach will not perform well in esti-mating entrainment abundance. Systematic spatial nonrandomness occurs I when a departure from spatial randomness is present: an organism is not equally likely to pass the location of the sampling gear at any point in the discharge channel cross-section. Spatial randomness is described in I Appendix I. For example, systematic spatial nonrandomness occurs when the ichthyoplankton count of a sample is influenced by the depth at which the sample is drawn. INhen spatial nonrandomness is present, the sample(s) I selected within a p.s.u. cannot be viewed as chosen at random from all potential samples, unless the location of the sampling gear intake pipe is chosen at random from the discharge channel cross-section. With fixed intake pipe location, the question of whether the selected sample(s) con-o stitute a random s,ample from among all potential samples must be raised. Past studies investigating whether the distribution of organisms in the I discharge channel exhibits spatial randomness have come to different conclusions. A study of net and AUTOSAM densities for various striped bass life stages at Indian Point concluded that no significant depth effect was present for any life stage (EA, 1981, p. 5-67). However, a o gear comparability study of entrainment densities of juveniles at Indian Point two years later concluded that sampling site (that is, depth) had a Significant effect for five of the seven taxa examined (Normandeau and o Con Edison, 1982, p. 30). I o
I 3.15 I Neither of these two studies addressed the question of whether horizontal differences exist in the distribution of ichthyoplankton in the cross-sectional space of the discharge canal. The study of juvenile entrainment I raised this issue and stated that the hypothesis of no horizontal differ-ences would be assumed in its analysis (Normandeau and Con Edison, 1982,
- p. 18-19).
I The assumption of spatial randomness in the discharge channel organism distribution cannot be made without caution; as a recent Indian Point I entrainment report noted in discussing differences among sampling gear (EA, 1984b, p. 4-26), I The cause . .
- could possibly be related to flow patterns within the discharge canal of 02 which resulted in dif-ferences in water velocities at the intakes of the sampling hoses. Flow patterns at 02 are quite variable and change I substantially with changes in pump operation at Units 2 and 3. Thus, the difference may be due to location rather than type of sample.
I An example of specific knowledge is a parametric model for the distribution I of ichthyoplankton in the discharge channel. For instance, the Poisson model for sample counts in the discharge flow has often been adopted. The assumptions of such a model are more narrow than those of the general approach; however, when these assumptions are satisfied, the model-based I analysis will give more accurate results than the general approach. Cor-respondingly, inferences based on a model that does not actually hold can be extremely inaccurate, so checking a model's consistency with observed I data is a critical necessity. The failure of the Poisson model to agree with generating station data will be treated and a more suitable model will be introduced in Chapter 4. I Assumption 3.1'. In this study, random sampling will be assumed: one or more simultaneous samples from different locations in the discharge channel I will be considered to be a random sample from the entire set of pump or net volumes of cooling water passing by the sampling gear during the p.s.u. In other words, the total volume of cooling water that flows by the sampling gear during a p.s.u. can be viewed as a collection of smaller I volumes called pump or net volumes. Each of these is equal in volume to the amount of water that is withdrawn from the discharge channel to become a sample, passing through the sampling gear. The total flow of cooling I water during the p.s.u. is divisible into a set of nonoverlapping (mutually exclusive) pump or net volumes, one or a few of which are v/ithdrawn to become replicate samples; the remaining pump or net volumes are potential I samples that turned out to be unrealized, because they were not withdrawn from the discharge channel by the sampling gear. It will be assumed that the sample or replicate samples actually obtained in any p.s.u. are a random sample from the conceptual set of all pump or net volumes that are I present in the discharge during the p.s.u. I I
I 3.16 I Three reasons can be given for making this assumption of random sampling. First, it is the simplest assumption possible, so it is preferable to more complex assumptions if it can be supported. Second, the available I data do not allow us to propose and support any alternative assumption. Third, it will be seen in Chapter 4 that the ichthyoplankton counts of replicate samples within a p.s.u. are consistent with the hypothesis I that these samples are independent observations from a distribution in a particular parametric family, the negative binomial family, to be dis-cussed in Section 4.2. Consequently, Assumption 3.11 will be made rather than Assumption 3.1. Note that Assumption 3.1 is so broad that it I is unworkable; it is stated and discussed here to emphasize the fact that different approaches to entrainment modeling exist, but some restriction to a more specific formulation must occur before progress can be made. I Assumption 3.2. At any instant of time, the number of organisms entrained is directly proportional to the volume of cooling water flow through the I plant. In other wO"ds, the density of organisms entrained at a plant is identical under both the actual and full plant operating conditions. This relationship holds for each life stage of each taxon at each instant of t<ime. Of course" densities may differ across time and among different I taxa and life stages. This assumption is currently made in entrainment estimation at Bowline I Point, Indian Point, and Roseton generating stations. Recently gathered data from Bowline Point and Roseton, to be discussed in Section 4.7, raise questions about this assumption, but the evidence is not strong enough to I refute it. Should suc~ a refutation occur in the future, a different model relating ichthyoplankton density to flow volume will be required. I Assumption 3.3. The ratio R of the volume of cooling water passing the sampling gear to the volume of water sampled is known for each p.s.u. This ratio is the expansion factor that is multipl ied by the sample I organism count to obtain estimated plant entrainment counts for each p.s.u. In practice, when cooling water flow is known only on a daily (24-hour period) basis, the volume passing through the discharge channel during a I given p.s.u. can be approximated by daily volume / number of p.s.u. 's in a day (3.1) I This approximation depends on the fatt that the plant cooling water flow rate is nearly constant across a single day. I The accurate measurement of the volume of discharge channel flow in each p.s.u. would improve the accuracy of entrainment estimates. Without this information, th(~ approximation (3.1) or another similar approximation I must be used in place of the ratio R for each p.s.u. The critical nature of R in entrainment estimation makes it clear that errors induced by this approximation can have an adverse impact on the accuracy of estimation. I I I
I 3.17 I Assumption 3.4. The relationship in three-stage sampling between split counts and the count of the sample from which these splits are drawn I can be modeled as a random process. More specifically, when a sample is divided into splits, each organism in the sample is distributed at random among the splits, and all organisms are assigned to splits independently. I The evidence pertaining to this assumption and its consequences are dis-cussed in Section 7.0. If the splitting device is not capable of divid-ing a sample into splits in a way that satisfies Assumption 3.4, the I efficiency of three-stage sampling can be seriously impaired. There is a need for further experimental investigation of the validity I of these assumptions. Based on currently available data and analyses, Assumptions 3.1', 3.2, and 3.3 appear to be necessary and warranted. Assumption 3.4 or some other assumption about the variance of counts among splits within a sample is necessary for three-stage sampling plans. Such I an assumption does not seem warranted for splitting devices currently available. I I I I I I I I I I I I
I I I 4. ASSESSMENT OF DISTRIBUTIONAL INFORMATION I 4:0 INTRODUCTION I In order to assess the efficacy of a sampling design, we m~st first obtain the form of the variance of the proposed estimators. The sampling design will affect this variance, and, by understanding the relationship between them, an optimal sampling de~ign (in terms of getting the smallest variance I for the least effort) can be obtained. Estimates of entrainment abundance in the discharge channel are obtained I by either a one-step or a two-step approach within each time period or p.s.u., depending upon whether subsampling is used. In the one-step case, one or more samples are obtained from the discharge channel during each I selected p.s.u., and each sample is sorted and identified in its entirety. In the two-step case, one or more samples are obtained from the discharge channel during each selected p.s.u. Each sample is divided into a number I of splits (or aliquots), and these splits are then sampled according to a sum-quota sampling rule: individual splits are randomly selected and their contents are identified, until the sum-quota rule being used specifies I that no further splits must be identified. Sum-quota procedures are treated in Chapter 7 of this report. The variation in the entrainment abundance estimates ;s composed of many I parts, in particular (i) variation in the discharge channel, I (ii) variation in the subsampling process, and I (iii) variation in the classification process. In the one-step case only (i) and (iii) apply, while (i), (ii), and (iii) I all apply in the two-step case. In this chapter we are mainly concerned with variation in the discharge channel, which leads us to consider both (i) and (iii). Subsampling variation will be treated in Chapter 7. I 4.1 THE POISSON MODEL I If it is assumed that the ichthyoplankton have a spatially random distribu-tion in the discharge channel, then with a few additional assumptions, a theoretical model can be developed which shows that the variation in the I organism counts of the samples from the discharge channel should follow a Poisson distribution. (See Appendix I for details, including a discus-sion of spatial randomness.) Specifically, if we define for a given p.s.u. I I
I 4.2 I A = true mean density of ichthyoplanktonduring the p.s.u., v = volume of water sampled during the p.s.u., X = observed number of ichthyoplankton in the sample, I then X has a Poisson distribution with intensity or mean Av: I for x=O,1,2, ... , ( 4. 1 ) I that is, X - Poisson(Av). This process may be observed over n p.s.u. IS, with different organism den-I sity A, sample volume v, and observed ichthyoplankton count X occurring in each of these p.s.u.'s. It may also happen that two replicate samples, each of volume v, Wl:re drawn independently in each p.s.u. The Poisson I model is easy to extend to this situation;. the notation would be Xi' Y - indepE:ndent Poisson(Aiv ) for i = 1,2, ... ,n, i i (4.2) I where the n pairs X.J. ., 1 are independent of each other and A. = true mean density of ;chthyoplankton during D. s. u. i , I 1 V., = volume of water constituting a sample taken during p. s. u. i , I X. = observed number of organisms in the first sample taken during p.s.u. i , I Y. = observed number of organisms in the second sample taken during 1 p.s.u. i. I In order to test the assumption of Poissonness, data that were obtained through replicate s2:mpling were analyzed. For example, Table B-2 in the I Bowline Point 1983 Annual Report (EA, 1984a) gives data that were simul-taneously sampled. These data consist of 49 pairs of observations; in each pair the two replicate observations were obtained simultaneously. We there-fore have 49 pairs (Xi' Vi)' i=l, ... ,49, each of which is independent of the I remaining 48 pairs. In addition, for each i the two components X.1 and Y.1 of the ith pair are independent, and under the Poisson model they satisfy I (4.2) . To test the Poisson model (4.2) for n independent pairs of replicate I observations Xi,Y i , we can use the test statistic n
~
2 (X.-Y.) j(X.+Y.) (4.3) I i=l 1 1 1 1 I I
I I 4.3 When the Poisson model holds, Sp ~ x~. Large values of Sp lead to rejec-I tion of the null hypothesis HO: the data are from the Poisson model (4.2). The derivation of S is given in Section 4.1.1. Sp is discussed I further in Section 4.2.1. p For the 1983 Bowline Point data on bay anchovy larvae, we obtain I Sp=431.692 (p<lO-7), which is extremely strong evidence against the vali-dity of the Poisson model. I Similar simultaneous data were available from the Roseton 1982 Annual Report (EA, 1983). Since these data were obtained in half-hour samples, I it was possible to composite them in order to test the Poisson assumption for different sample durations. Data were composited into two sets of hourly data, one compositing on the hour, and one on the half-hour. From I these two composites, denoted Composite I and Composite II, two- and three-hour samples were obtained. In all cases, the Poisson model was rejected by the data. A summary of all chi-square tests for Poissonness is given I in Table 4.1. A detailed reporting of the Bowline Point analysis is given in Table 4.2, including the values of the chi-square statistic fGr each data pair. Each I of these values should be approximately equal to 1 if the null hypothesis that the data are from a Poisson distribution is true. The fact that many of these pairs are quite large is important: it. shows that the Poisson
- 1 model is not being rejected because of only a few influent-ial IIbad data ll points. (One extreme'ly large pairwise value could induce rejection, which would make us suspect that rejection may be due to one aberrant data value I rather than to non-poissonness. This is not the case here.) Details of the
*Roseton 1982 analysis are quite similar to those of the Bowline Point 1983 analysis. Because of their lengthiness, they are not reported in full here.
I 4.1.1 DERIVATION OF THE POISSON TEST STATISTIC I Given n pairs of replicate observations Xi,Y i , i=1, ... ,n, the Poisson model (4.2) specifies that each pair consists of two random observations inde-I pendently drawn from a Poisson distribution. The distribution's underlying mean is the same for the two observations in a pair, but may vary arbi-trarily from one pair to another, and every pair of observations is inde-I pendent of all other pairs.
,The test statistic Sp of (4.3) will now be derived as a test statistic of I the null hypothesis HO: the paired data Xi ,Y i ,i=1, ... ,n, are from the Poisson model just described. This derivation is inevitably technical in nature.
I I
4.4 Table 4.1 I S~ of X2 tests to evaluate the I Poisson and ~ative binanial m:x:1els 30wline Point 1983 I Life 2 Significance
- Significance
- Soecies Stase Poisson '!:. Level
<10- 7 Neg.bin X 2
Level I Bay Anc.."1Dvy LirV'ae 431.69 46.86 .51 Roseton 1982, Cancosite I I hour samoIes {;s pcurs~ I
<10- 7 C11.lpeids Alosa spp.
LcirV'ae LcIIVae 273.76 205.25 <10- 7 65.98 93.02
.41 .01 I White perc.'l Eqgs 108.39 .00059 68.70 .32 White perch Yolk-sac 1azvae White perch LIIVae 188.13 ill. 64 <10- 7 .00029 72.30 78.36 .22 .11 I
Striped bass Yolk-sac lazvae 144.80 <10- 7 73.73 .19 Striped bass Lu:vae
- -Drone spp. Lu:vae 132.08 299.11
<10- 7 <10- 7 65.95 75.17 .41 .16 I
C1upeids L:u:vae 2 hour samples (32 188.57 oairs)
<10- 7 40.79 .11 I <10 -7 Alosa spp.
White perch LCLl:vae
~rgs 115.19 41.52 .U 66.83 28.68 .00020 .58 I White perch Yc)lk-sac larvae UO.38 <10- 7 34.85 .29 White perch Striped bass L2LrVae Yc)lk-sac lazvae 67.14 n.E3 .00027 .OOOOU 43.30 31.61 .070 .44 I
Striped bass L2u:vae 68.74 .00017 41.83 .093
- -Drone spp. ULrVae 262.95 <10- 7 34.35 .31 I 3 hour samples (20 pairs)
Clupeids L2LrVae 170.37 64.86
<10- 7 .OOOOOU 29.15 40.85 .064 .0025 I
Alosa spp. ULrVae White perch ~rgs White perch Yc)lk-sac lazvae 23.63 U1.08
.26 <10- 7 16.79 27.70 .60 .089 I
White perch UIrVae 32.02 .043 22.85 .24 Striped bass Yolk-sac lazvae Striped bass ULrVae 78.22 52.12
<10- 7 .00011 23.95 24.21 .20 .19 I <10- 7 19.84 .40 M;:)rone spp. Lu:vae 255.98 I
- The significance level (also known as the p-value) indicates the amount of evidence against the null hypothesis. In the cases considered here, small values of the significance level are evidence against the fit of I
the model: the smaller the value, the more evidence against the fit of the model. I I
I 4.5 I Table 4. 1 continued I ~seton 1982! Comcosite II 1 hour sampIes (61 pairs) Life
- Significance*
2 Significance 2 I Soecies C1upeids Staie Larvae Poisson 229.61
~ Isvel <10-7 Neg.bin ~
61.45
!eve1 .42 AJ.osa spp. 282.62 <10- 7 88.86 .Oll I
Larvae White perch Eggs 78.06 .069 56.12 .62
<10- 7 67.92 .23 White perch Yolk-sac larvae 159.75 5
I tVhite perch Striped bass Larvae Yolk-sac larvae 126.00 138.39
<10- <10- 7 66.49 75.03 .26 .092 Striped bass Larvae ill. 55 <10- 4 69.10 .20 I Morene spp. Larvae 286.38 <10-7 70.32 .17 I Clupeids Larvae 2 hour 161.93 ~les (28 pairs) <10-7 30.14 .31 AJ.osa spp. Larvae 140.62 <10-7 30.55 .29 I White perch White perch Eggs Yolk-sac larvae 39.71 ll4.78 .071 <10- 7 31.59 32.05 .25 .32 I White perch Striped bass Larvae Yolk-sac larva.e 50.31 86.71 .0059 <10- 7 32.62 28.80 .21 .37 Striped bass Larvae 60.17 .00038 33.86 .17 I Mc%One spp. Larvae 255.58 <10-7 30.59 .29 3 hour sameles (18 pairs)
I Clupeids Larvae 132.21 <10-7 21.33 .21 Alosa spp. Larvae 83.95 <10- 7 30.71 .022 I White perch White perch Eggs Yolk-sac larvae 25.91 llS.50
.10 <10- 7 16.22 20.38 .51 .26 White perch Larvae 47.84 .00016 24.98 .095 I Striped bass Striped bass Yolk-sac larvae Larvae 69.61 48.20 <10- 7 .00014 21.67 24.65 .20 .10 <10- 7 I Mcmne spp. Larvae 232.64 18.05 .39 I *The significance level (al~o known as the p-value) indicates the ~~ount of evidence against the null hypothesis. In the cases considered here, small values of the significance level are evidence against the fit of the model: the smaller the value. the more evidence against the fit of I the model.
I I
4.6 I I Table 4.2 I Contributions to M:ldel X2 , s fran each Sarrp1e Bowline Point 1983 Table B2 Data Contribution Contribution I Larvae Count to Poisson to Negative Sample Cbs 1 Clbs 2 Mean Variance X 2 B.inamial X 2 I 1 158 147 152.5 60.5 .39 .01 2 3 4 143 106 57 72 71 74 107.5 88.5 65.5 2520.5 612.5 144.5 23.44 6.92 2.20 1.49
.52 .22 I
5 97 141 119.0 96800 8.13 .47 6 7 8 80 109 109 85 119 96 82.5 114.0 102.5 12.5 50.0 84.5
.15 .43 .82 .01 .02 .05 I
9 350 142 246.0 21632.0 87.93 2.53 10 11 12 224 109 214 175 137 80 199.5 123.0 147.0 1200.5 392.0 8978.0 6.01 3.18 61.07
.21 .17 2.89 I
13 84 45 64.5 760.5 11. 79 1.20 14 15 16 37 67 46 9 24 24 23.0 45.5 35.0 392.0 924.5 242.0 17.04 20.31 6.91 4.17 2.83 1.20 I 17 44 38 41.0 18.0 .43 .06 18 19 20 118 30 116 142 48 93 130.0 39.0 104.5 288.0 162.0 264.5 2.21 4.15 2.53
.11 .66 .16 I
21 137 59 98.0 3042.0 31.04 2.15 22 23 24 61 49 37 50 76 9 55.5 62.5 23.0 60.5 364.5 392.0 1.09 5.83 17.04
.12 .61 4.17 I
25 91 68 79.5 264.5 3.32 .28 26 27 28 55 91 50 99 63 38 77.0 77.0 44.0 968.0 392.0 72.0 12.57 5.09 1.63 1.09
.44 .23 I
29 63 69 66.0 18.0 .27 .02 30 31 32 28 41 44 34 38 26 31.0 39.5 35.0 18.0 . 4.5 162.0
.58 .11 4.62 .11 .01 .80 I
33 53 54 53.5 .5 .00 .00 34 35 36 46 72 24 13 100 36 29.5 86.0 30.0 544.5 392.0 72.0 18.45 4.55 2.40 3.70
.35 .47 I
37 19 8 13.5 60.5 4.48 1.61 38 39 40 32 29 13 25 34 31 28.5 31.5 22.0 24.5 12.5 162.0
.85 .39 7.36 .17 .07 1.86 I
41 8 20 14.0 72.0 5.14 1.80 42 43 44 12 16 20 43 24 17 27.5 20.0 18.5 480.5 32.0 4.5 17.47 1.60
.24 3.71 .43 .07 I 45 14 13 13.5 .5 .03 .01 46 47 ~tj 49 17 22 113 23 10 28 15 60 13.5 25.0 16.5 41.5 24.5 18.0 4.5 684.5 1.81 .72 .27 16.49 .65 .16 .Ois 2.48 I*
I I
I 4.7 I In the case of paired samples, the Poisson assumption may be tested either by an exact binomial test or by the chi-square approximation to the exact I distribution of the Poisson variance test statistic. Unequal measured sample volumes, say vi for the count Xi and vi for the count Y;, with I v; F vi, can also be accommodated by these test procedures. For a given total t; = Xi + Yi the conditional probability distribution of (X;,Y;) is I binomial, P( X1. :ax,1Y. =y1 IX1. +Y .1=t .) = (t.! / x . ly . 1) [v. / (v . +v ~ )] x [v 1 1 v . *V ~ )] y , (4.4) I 111111 I. / ( 1 I which reduces to the simple form P(X 1.=x, Y.=t.-xlx.+Y.:at.) =(ti)(1/2)ti (4.5) I 1 1 1 1 1 X I when v.1 = V;I. Letting I Pi = v1.1 (v.1 +v 1~) (4.6) I ... P.1 = X./(X.+Y.) 1 1 1
= X,../t.1 ,
I we may then test for a significant discrepancy between the observed p.1 and the expected Pi' I The exact test could be implemented by entering a table of binomial confi-dence limits or Clopper-Pearson charts (tloppe~ and Pearson, 1934) to check I whether the expected ,., p.1 is contained within the confidence limits determined by the observed Pi and the sample size t i
- Alternatively, it is a straight-I forward matter to compute the exact probability of obtaining a binomial sample as discrepant as the one in hand. Measuring this discrepancy by the absolute difference between observed and expected values, I
di = discrepancy = Ix; - tiP;1 = Iy.1 - t.(l 1
- p*)I, 1
(4.7) I and letting I I
I 4.8 I x* = t.p. - d.1 (rounded downward to an integer) x** = t.p. 1 1
+ d.1 (rounded upward to an integer),
(4.8) I 1 1 I then the significance level a*1 of this discrepancy is x* t.1 I Ct* 1
= L Bin(x;t.,p.) + L Bin(x;t.,p.) (4.9)
I x=O 1 1 1 1 x=x** where t.-x I Bin(x;t.,p.) = [t.!/x!(t.-x)!Jp~(l-p.) 1 (4.10) 1 1 1 1 1 1 I In the special case of equal volumes (where v. = v~, so p. = 1/2) no raund-1 1
- 1 **
ing is needed to produce the lower and upper integers x and x ,and the I lower and upper tail probabilities in (4.9) are equal, so (4.9) reduces to the simpler expression for Pi = 1/2 I (4.11) I As an example of this latter calculation, consider paired sample 37 in Table 4.2 where X37 = 19, Y37 = 8, so t37 = 27. Assuming equal sample I volumes (P37 = 1/2)1 I d37 = 119 - 13,,5 I = 18 - 13.5 1= 5.5 I
~* = 13.5 5.5 = 8 I x** = 13.5 + 5.5 = 19 I I
I I WWJi_"'. _ _. _ ,.a
I 4.9 I
= (1 + 27 + 351 + 2925 + ..* + 2,220,075)/2 26 I = 3,505,69912 26 I = .052239 I Such calculations become subject to computational inaccuracies as Xi + Yi = t; becomes large, but the normal approximation to the binomial I then becomes applicable (see Snedecor and Cochran, 1983, p. 120), giving the normal test statistic zc' corrected for continuity, I zc = (d.1 - .s)/!t.p.(l-p.)1 5 " , Pi='
I = (I x.1 - Y.1 1 - 1) IJ x. + Y.1 * (4.12) 1 I The square of Zc is approximately distributed under the nu1l hypothesis I as chi-square with one degree of freedom. In the above numerical example with Pi = .5, this gives I Z :8 (119 - 81- 1)/j2i= 1.9245, c I which corresponds to a P-value of .054292 for a two-tailed normal test, as compared to the exact two-tailed probabil ;.ty .052239 calculated above. I For large Xi + Yi the continuity correction has a negligible effect; the square of the uncorrected normal test statistic, I z2 = (X.1 - t.p.)2 /t . p *(l-p*)1 - 5 I 1 1 1 1 1 Pi-' I = (X. _ y.)2 / (x. + Y.), 1 1 1 1 (4.13) I is approximately distributed as a chi-square with one degree of freedom. This chi-square analysis can be done independently for each of the n paired samples Xi,Y i , and summing the n resulting test statistics gives I I
I 4.10 n I Sp = L (X. i =1 1 I the statistic of (4.3), whose distribution under the null hypothesis is chi-square with n degrees of freedom. I For small X.1 + Y.1 thE~ exact test given by (4.9) involves a small number of atoms of probability,. each of relatively large mass. As a result, the test is not very powE~rful; it is capable of detecting only extreme depar-I tures from the null hypothesis. The derivation of Sp is supported by the following intuitive view. If the I pairs (Xi,Y i ) satisfy the Poisson model null hypothesis, then I E[(X.1 + Y.)/2] ~ (A.V. + A.v.)/2 = A.v. 1 11 11 11 I I
. 2 Thus, under the null hypothesis, we expect (Xi - Yi ) /(X i + Yi ) to be approximately equal to 1, which is the mean of the chi-square distribution I with one degree of freedom. This is based on the approximation E[f(X,Y)/g(X,Y)] ~ E[f(X,Y)]/E[g(X,Y)] for functions f and g of the pair (X,V),
which comes from the first-order Taylor series expansion of fIg about the I point (Ef,Eg). This expansion is sufficient for the informal nature of the present discussion. I 4.2 THE NEGATIVE BINOMIAL MODEL The rejection of the Poisson model in the discharge channel leads us to search for an alternative model. A richer class of models, which includes I the Poisson model as a special case, is that based on the negative binom-ial distribution. This is a reasonable place to look for alternatives since it incorporates a set of assumptions similar to those of the Pois-son, but allowing for a wider range of variability. I 4.2.1 THE RATIONALE FOR THE NEGATIVE BINOMIAL MODEL I Cassie points out that samples from natural populations often exhibit over-dispersion, which occurs when the variance in samples from a population exceeds the mean (0'2 > ]..I): "Overdispersion is so common in natural popula-I tions that it will ineVitably be one of the most important features to be observed in the planning and execution of any sampling program." (Edmondson and Winberg, 1971, p. 185). Noting that various mathematical models for I overdispersion have been proposed, Cassie says The general features of most models can be described approximately as follows: If a number of different I I
I 4.11 I sets of samples are taken, each set representing the same natural population, but employing a different sample size, the relationship between I the means and variances of the various sets will approximate to: I 2 0. 1
= ~.1 + 2 c~.
1 (*) I where c is a constant characteristic of the pop-ulation and the subscript i refers to the ith set. I The right-hand side of the equation may be regarded as having two components, the Poisson variance, ~, ariSing from random variation, and the overdisper-I sion variance, c~2. It is difficult to generalize about the likely value of c. In natural plankton populations, probably the lowest value which can be hoped for is about 0.05, while the modal value I may be 0.1 to 0.2 (Cassie, 1963). However, con-siderably higher values are not uncommon ***. If the mean is small (say <1) we find that the I second-order term, c~2, in (*) is negligible and that the distribution approximates to Poisson. For this reason we sbould not be surprised if rare I species show no departure from random distribution. On the other hand, when the mean is large (say >40) the first-order term, ~, is negligible and: I I Cassie's advocacy of the quadratic variance model (*) does not specifical' mention the negative binomial distribution, but several considerations I make this a logical choice for a model of overdispersion: (1) The nega-tive binomial model is ageneralization of the Poisson model, which is the standard model for random distribution; that is, the Poisson family is a special case of the negative binomial family of distributions. (2) The I negative binomial model is the most commonly used model for overdisper-sion. It is a very broad, general class of distributions. (3) The nega-tive binomial model satisfies the quadratic variance relationship of equa, I tion (*). It is one of the few distributions to do so. (4) The negative binomial model is analytically tractable, allowing a wide range of theo-retical and applied calculations. (5) The negative binomial model can be I viewed as a mixture of Poisson distributions, in which a Poisson observa-tion is drawn after the Poisson intensity parameter, the mean density A of ichthyoplankton, is determined as a random observation from a gamma distribution; it is not necessary to view the negative binomial model as I originating in this fashion, but this approach may be helpful to some, so we include it for completeness. We will discuss these points in greater detail after presenting an alternative argument, based on the nature of I the entrainment process, in support of the quadratic variance model. Thi( argument is directly related to the structure of entrainment sampling. I
I 4.12 I Ichthyoplankton organisms of a given species and life stage exhibit tem-poral peaks and troughs in their abundance. Furthermore, at a given point in time, these organisms are not, in general, uniformly distributed through the river's cross-sE!ction at the intake location: the abundance differs I at various points in the water column (that is, at different depths) and at various cross-channel (that is, horizontal) positions in the river. In this sense, the ichthyoplankton entering the intake are spatially clumped, I i.e., they exhibit spatial gradients. As the volume of water that on dis-charge will constitute a p.s.u. passes through the cooling system, the spatial pattern of ;chthyoplankton densities is disrupted and fragmented; I, during the water's passage through the plant, it is divided into sub-volumes that may be described as long, relatively thin threads, which cor-respond to the parallel tubes through which the subvolumes pass. Each thread originates from a specific cross-sectional location in the spatial I dens"ity pattern that enters the intake; consequently, the density of organisms will differ among simultaneously formed threads. After the fragmentation of the intake spatial pattern of densities into threads, the I plant cooling is performed, and then a tumultuous reunion of the threads takes place. This reunion might produce a cross-sectional mixture in the discharge channel that largely preserves the temporal distribution but I reconstitutes the spatial distribution as a mixture of thread fragments. Aftel~ passage through the plant, the density of organisms within each thread fragment may be expected to be virtually random; however, when the threads recombine in the discharge channel, each of them is located near I new neighbors from different sectors of the original spatial pattern. Only complete cross-sectional mixing, of the kind that occurs when there is I total homogenization, would induce Poisson variation between replicate (simultaneous) samples from a p.s.u. in the discharge flow. The cross-sectional mixing resulting from the division into the threads just I described is only incomplete, though: each thread, formed by a segment of the original spatial pattern in the intake, retains its integrity as it passes through the cooling system, because mixing takes place within each thread but not among different threads. This incomplete mixing leads I to partial preservation of the incoming spatial heterogeneity. Such cross-sectional heterogeneity in the discharge would necessarily result in greater than Poisson variation among simultaneous replicate samples from the same I p.s.u. The physical mlxlng process of separation into threads followed by reunion I in the discharge is ii steady state stochastic process acting in common on all ichthyoplankton. The effect on individual species and life stages, though, would depend on the nature of their spatial pattern of density in the intake. Juveniles, for example, might be highly dispersed relative to I the size of the thread fragments of water entering the discharge channel, because juvenile densities are low. Consequently, the tumultuous mixing in the discharge wou"ld effectively constitute homogenization, resulting I in a Poisson distribution in spatial cross-section, i.e., spatial random-ness in distribution.. For the smaller and more abundant life forms, how-ever, the mixing might be incomplete, since neighbors within a small frag- I ment of water mass remain neighbors in the discharge. This clumping or incomplete mixing generates a cross-sectional spatial distribution that must be more variablE! than a random (Poisson) spatial distribution. I I
I 4.13 I Within a small thread fragment of a p.s.u., the spatial distribution of an organi sm shoul d be essenti ally random, but the density of organi sms I within the thread depends upon where the thread originated in the intake pattern. A collected sample is a cluster of such thread fragments yield-ing a sum of different Poisson counts, each of which is subject to Poisson variance. The variance between replicate samples, however, also depends I on the between-cluster component of variance among such composites of Pois-son density parameters (these parameters are the true underlying Poisson mean densities that the sample counts are used to estimate). This second, I between-cluster component of variance depends on the magnitude of the den-sity gradients, or differences in density from one location to another, in the intak~ spatial pattern for the organism in question. The component I is also affected by the physical nature of the mixing process that frag-mentizes and mixes the threads of cooling water independently of the organisms' presence. I If this process does produce a random mixture of fragments of the original spatial pattern (a necessary condition for unbiasedness of Cl fixed-position sampler), then this second component of variance constitutes a quadratic I term in the variance between replicate samples. This component can be expressed as the product of the squared mean and the relative variance among randomly formed composites of fragments of combined volume v (the I sample volume). The coefficient of the quadratic term in the variance corresponds to the parameter K of a negative binomial distribution. This quadratic coefficient would now be representing a sum of three terms I expressing the variance of a quota-sum of products of two independent ran-dom variables. One variable is the volume of a fragment, representing the sum-quota variable itself, and the other is the Poisson density parameter I of that fragment. The three terms in such a sum-quota (sum-quota volume equals v) relative variance comprise, first, the relative variance frag-ment volume; second, the spatial relative variance of density; and third, I the product of these two relative variances. At a constant flow rate the relative variance of fragment size would remain constant through time. If spatial gradients in density are scaled by the overall density (mean den-sity) of the organism at a given time, then spatial relative variance I would likewise remain constant over time. Each kind of organism would then exhibit a quadratic variance function with a quadratic coefficient indexing the degree of clumping in the spatial distribution of that I organism, as modified by the plant mixing process. Either Cassie's general approach to sampling natural populations or the I entrainment-based approach just given leads us to consider a model with a quadratic variance function. The negative binomial model is an obvious candidate for the reasons listed earlier in this section. We now discuss these points. I One way in which the negative binomial can be thought of as a generaliza-tion of the Poisson is through the variance function. If a random vari-I able X is observed with mean ~, then we would have I I
I 4.14 I Var(X) = ~ if X is Poisson I Var(X) = ~+(l/K)~2 if X ;s negative binomial, I where K is an additional free parameter. The value of K in this model, which must be positive, is constant for each species-life stage combina-m tion; it is not altered by changes in sample volume, organism density, etc. from one p.s.u. to another. These equations il'lustrate that the nega,t;ve binomial model contains the Poisson (by taking K=oo), but also I allows a much wider range of alternatives. In fact, the negative binomial model approaches the Poisson model as K+oo, not merely in the value of Var(X) but in the probability of any specified outcome. In other words, I the negative binomial model approaches the Poisson model in distribution as K+oo. Thus the Poisson model is the subfamily of the negative binomial mode'l obta i ned by setti ng K=oo. I There are several probability models that have greater variance than the Poisson model, but the class of negative binomial models is the most com-monly used one. A full treatment of the Poisson, the negative binomial, I and related distributions can be found in Johnson and Kotz (1969, Chapters 4, 5, 8, 9); only a brief discussion will be given here. Historically, the negative binomial has long been used as a generalization of the Pois-I son (Anscombe, 1949; Bliss, 1953; Fisher, 1953) in situations character-ized by dispersion or variability too great to be compatible with the Poisson model. Johnson and Kotz (1969, p. 125) point out the desirability I of the negative binomial model "as alternative when it is felt that a Poisson distribution might be inadequate." The class of negative binomial distributions is a 'Very rich class, that is, the distributions in this class cover a broad range of alternatives, so its use is quite nonrestric-I tive. An interesting and Ire levant recent study of distributions whose variance I is at most a quadratic function of the mean (Morris, 1982) lends further support to the choice of the negative binomial. Morris showed that if we limit considerat'ion to distributions in the exponential class, an I extreme ly broad catl~gory that i ncl udes most standard di stri buti ons and lies at the heart of modern statistical theory, there are only six classes of distributions whose variance is at most a quadratic function of the I mean. Of these, thl~ee are continuous (the normal and gamma classes are among these), and one is bounded (the binomial class), making them unsuit-able for modeling the counts in an entrainment situation. The two remain-ing classes with at most quadratic variance are the Poisson, whose vari-I ance is a linear function of the mean (in fact, the variance equals the mean itself), and the negative binomial. Morris demonstrated th~t the quadratic variance relationship is a pivotal property that greatly sim-I plifies statistical analysis of these six classes of distributions. This analysis, combined ~/ith Cassie's treatment of overdispersion, provides a strong case for the negative binomial distribution. I I
I 4.15 I Another aspect of the negative binomial that weighs in its favor is its analytical tractability. The negative binomial distribution allows a wide range of theoretical and applied calculations. Many of these are I discussed in Johnson and Kotz (1969). I A recent article by Collings and Margolin (1985) examines tests for detecting negative binomial departures from the Poisson model. The authors caution against I the propensity of some investigators to assume Poisson sampl-ing without any attempt at empirical verification of this assumption in situations where such verification with repli-I cate observations is feasible at little or no additional cost. They observe that the *possible consequences of hyper-Poisson variability, I or variation among observations that is greater than would be consistent with the Poisson model, include inefficient estimation and underestimation of the variances of parameter estimates. I Collings and Margoli~ point out that I the negative binomial plays the role of an attractive close alternative to the Poisson. As numerous authors suggest
**** a reasonable test of fit against all close alter-I natives may be obtained by deriving the best test against a particular close al~ernative. The detection of a depart-ure from Poisson sampling is not an end in itself; one is then faced with providing a scientifically credible alter-I native sampling model *.** The negative binomial plays a prominent role here as well, possibly because of its interpretability as a gamma mixture of Poissons. Authors I as early as Student (1907) and Greenwood and Yule (1920) have advocated its use for modeling data that noticeably depart from Poisson sampling.
I Concerning the relationship between the quadratic variance model and the negative binomial distribution, they state I The negative binomial distribution ;s particularly attractive both as a testing alternative to the Poisson I distribution and as a tractable sampling model when there is reason to expect that the variance cr 2(x) and the mean ~(x) functions are related quadratically by I 2 cr (x) = ~(x) + c2~2(x), (*) where x denotes possible covariates and c2 is constant I with respect to x. The evidence for (*) at times may be empirical ** Alternatively, the evidence for (*) at times may be methodological. I I
I 4.16 I Collings and Margolin cite a statistic employed by Gart (1964) to test for departures from the Poisson model when the data constitute a one-way layout. This situation is a more general version of replicate sampling, I in which an arbitra~y number of replicate observations is taken during each of n p.s.u.1s; the number of replicates may differ from one p.s.u. to another. Gart1s statistic is a suitable modification of the standard IIvariance or lIindex of dispersion test for departure from the Poisson li ll I model in the random sample case (independent, identically distributed obser-vations). Our test statistic S in (4.3) turns out to be equivalent to Gartls statistic. p I We nOlI/ give the standard derivation of the negative binomial distribution as a mixture of Poisson distributions. This is included for mathematical I compll~teness. It may be helpful in visualizing the entrainment process; howevE~r, this mixture! model will not be used either to derive or to justify any results in the rest of this report. The negative binomial model fol- I lows from the same assumptions as the Poisson model except that a different supposition is made concerning the Poisson parameter 1. In the Poisson model:. A (the mean de~nsity of ichthyoplankton) is assumed constant. In the nE!gative binomial model, 1 is allowed to vary according to a Gamma I distr;:bution with par'ameters K and e. The GalTDTIa (K, 8) distribution is given by I f(A) for 1>0. (4.14) I The parameters K and S are referred to as the shape parameter and scale I parameter, respectively, of the Gamma distribution. For different values of K and 8, the Gamma distribution can take on a wide variety of forms. The scale parameter, S, can be thought of as a variance, and controls the I dispersion of the distribution. The shape parameter, as its name implies, controls the shape of the distribution, allowing it to change from strictly decreasing to unimodal, with varying degrees of skewness. I Thus, if a volume, v, of water is sampled with mean density A, where 1 varies according to (/~.14), then the observed number of ichthyoplankton fol-lows a negative binomial distribution with parameters K and S/(S+v). That I* is, I for x=O,1,2, ... (4.15) I (See Appendix II for details.) To see that this is, in fact, a generali-zation of the Poisson model, we note that if K~, 8~, and K/8+ constant I in (4.14), then (4.15) becomes the Poisson distribution. I I
I 4.17 I The Poisson-Poisson (or Neyman Type A) distribution has been suggested as an alternative model, which could be used rather than the negative binomial. The main point in this model IS favor is a biological interpre-I tation that seems plausible: the Poisson-Poisson distribution with parameters A and t results from the assumptions: I (1) female fish lay egg clusters that are distributed at random among the subunits within a p.s.U so the number of clusters in a given q subunit is a Poisson variable with mean A; and I (2) the number of eggs surviving (or larvae hatching, etc. for other life stages) from each cluster is also a Poisson variable, w~th mean ~. I The negative binomial distribution can be derived by the exact same line of reasoning, with (1) remaining unchanged but (2) replaced by I (3) the number of eggs surviving (or larvae hatching, etc.) from each cluster is a random variable with the logarithmic distribution (Johnson and Katz, "1969, eh. 7 or Pielou, 1977, p. 119). I This type of distribution, which is called a generalized distribution, "arises if we suppose that groups or clusters of individuals (rather than I single individuals) constitute the entities having a specified pattern, and that the number of individuals per group is a random variate with its own probability distribution." (Pielou, 1977, p. 118). In contrast, I another type of distribution, known as a compound distribut'ion, arises if we assume that the individual organisms are independent of each other (not clustered) and that subunits available to them are not identical (which would lead to a Poisson distribution of organisms) but'dissimilar, I with some having more favorable environments than others, so the para* meter u, the expected number of organisms in a subunit, varies from one subunit to another. This approach, in which u is a random variable, is I used to derive the negative binomial distribution in Appendi)( II. The difference between these two approaches to combining distributions I is more illusory than real, because every compound Poisson distribution corresponds to a generalized Poisson distribution and vice versa; this implies that we cannot reach conclusions about the underlying mechanism I from data on the observed distribution of organisms per subunit (Pielou, 1977, p. 123). For example, the Poisson-Poisson distribution corresponds to a compound Poisson distribution, whose interpretation would be very different, and whose distribution of observed outcomes would be identical I to that of the Poisson-Poisson. The Poisson-Poisson has several unfavorable properties that cast doubt I on its suitability for this entrainment work. For more detail on these, see Johnson and Kotz (1969, Ch. 9). It is extremely multimodal that is, t able to have arbitrarily many modes, making it difficult to deal with I in practice. This excessive f1exibility is a sharp contrast to the moment behavior of the Poisson-Poisson: the third and fourth standardized moments are connected by the near-constant behavior of the expreSsion I (6 2-3)/6 1, which always remains between 1 and 1.21S. As Johnson and Katz I
D 4.18 I note lithe narrow limits of the ratio restrict the field of applicability l of the distribution." (1969, p. 219). I Estimation of the Poisson-Poisson parameters is difficult. Equations for the maximum likelihood estimators can be found, but solving them requires involved iterative computation. Method of moments estimators can be obtained, as can other estimators based on the sample mean, the proportion of OIS, the ratio of observed lis to OIS, and combinations of these approaches. The choice among these methods is complicated by the fact that the relative performances of these estimators are influenced I by the parameter values: as the parameters change, the relative performances of these estimators change. I In applied work, the Poisson-Poisson distribution has been used primarily in situations involving plants and plant distributions. Evans (1953) found. that it did well for plant distributions, but that the negative binomial distribution did better for insect distributions. Generaliza-I tions of the Poisson-Poisson distribution to families with three and four parameters instead of two have been proposed. These generalizations alleviate the "severe restriction on the shape of distribution implied I by the limits on the value of (6 -3)/6 " (Johnson and Katz, 1969, p. 2 1 226) but introduce new difficulties because of the extra one or two para-meters to be estimated. I A final issue is the applicability of the Poisson-Poisson distribution to the replicate sampling situation of the entrainment studies. Data I are collected in the form of two, or at most three, Simultaneous observa-tions. We seek to estimate the parameters of the distribution for each p.s.u. observed, in a way that provides overall information about para- I meters that span the entire entrainment season. This is difficult to accomplish for the negative binomial distribution, it seems outside the realm of possibility to extend these methods, which rely on certain I special properties and the simplicity of structure of the negative binomial, to the Poisson-Poisson. The intractability of the Poisson-Poisson model appears to preclude its use in this context. I 4.2.2 TESTING FOR THE NEGATIVE BINOMIAL MODEL I As w'ith the Poisson model of Section 4.1, replicate data are needed to test whether observations. within a p.s.u. follow the negative binomial distribu-tion. Each observation is a sample ichthyoplankton count. It is hypothe-I sized that observations within p.s.u. i are independent, each satisfying the relationship I 2 a. 1
= ~.1 +
2 (l/K)~. 1 (4.16) I I I
I 4.19 I for each p.s.u. it where K is constant for each species-life stage combina-I tion. P.s.u.'s in which a single sample count is taken will not help in this test, because the mean Pi varies from one p.s.u. to another. To test the appropriateness of the negative binomial model, a chi-square I statistic can again be used. If there are n (Xi,Y i ) pairs, define the statistic I I ( 4.17) I where I "'... K - 2rn x,.Y./ nr [(X.-Y.) 2-(X.+Y.)] + , I i~l ' i-l " , 1 (4.18) I and "+" signifies positive part, i.e., (x) + = max(x,O). The statistic SNB can be used to test the tenability of the negative I binomial model, where large values of 5MB lead to rejection of the nega-tive binomial model. A derivation similar to, though somewhat more com-plicated than, the derivation of Sp (the Poisson test statistic) shows I that SNB has an approximate chi-square distribution with "-1 degrees of freedom. The results of the negative binomial model evaluation are given in Tables 4.1 and 4.2. In all cases, the fit of the negative bi* I nomial model is a substantial improvement over that of the Poisson model, and in al1 but a very few cases it provides a very good fit to the data. I .. + The statistic K is an estimate of the parameter K apoearing in the nega-tive binomial variance function of (4.16), I '1ar(X)
.2 (l/K)p .
I The ~ pa;~ (X1,Y1), ... ,(Xn,Y n) give rise to a single value of the sta~is* I tic K+. The statistic is derived by using a method of moments approach. The positive part is used in the denominator, rather than the absolute value.or somet~ing.simila~; for th~ express pu~pose of elim~nating the I negatlve contr1but10ns. Ine quant,ty ((X.-V.) , 1
-(X.+Y.)],
1 1 or ~hich the I
I 4.20 positive part is taken, is itself an estimate of Variance - Mean, which I satisfies the inequality I Variance - Mean ~ 0 I in all of the models considered. Therefore negative values of this quantity are out.side the allowcible range, and any reasonable estimation procedure will decree that the estimator must be pulled back into the proper, non- I negative range. To illustrate further the effectiveness of the negative binomial model, I a bootstrap technique was employed. A bootstrap analysis addresses the following question: given that the model assumptions are correct, what should samples from the process look like? Does the sample actually I observed appear to be a typical sample from this process? 70 answer these questions, samples are computer-generated according to the hypo-thesized model and the behavior of these samples is compared with the I actual data. Under the negative binomial model, we now ask how the (X.,Y.) pairs I 1 1 should behave. Standard statistical calculations (Appendix III) show that the distribution of Xi' conditional on the observed total of I X;+Y i , follows a beta-binomial (or inverse hypergeometric) distribution. That is, I I P(X.=xlx.+Y.=t) _(K+X-l) (K+t-X-l) 1 1 1 ( x. 2K+~-1) t-x (4.19) I I The Bootstrap technique is a relatively new statistical method (Efron, 1982). !..Je use it here to provide some additional information about the tenability of our model, in the following way. The probability model I believed to be governing the data is specified, along '",ith some summary statistics from the observed data. Using this information, the comouter is used to generate samples, and these generated samples are checked for agreeme~t with the actual sample (usually graphicai methods are used here). I If ther :*;S agre!':ment between the generated samples and the actual samples, 1 this is evidence supporting the assumed probability model. It should be noted that we are using the bootstrap here mostly for illustrative purposes, I, since the chi-square tests have already given us strong evidence for the negative binomial model. I I
I 4.21 I I In this particular instance, the bootstrap was performed assuming that (1) the underlying distribution is negative binomial, and (2) the marginal totals are known, that is,the totals t; = Xi + Yi from the actual data I were used. From this information, we arrive at (419) as the relevant dis-tribution. For a given sample, the parameter K in (4.19) was estimated by I K+. Then, for each value of ti' a value Xi was generated from the distri-bution in (4.~},and the corresponding Yi was calculated using the rela-tionship Yi s ti - Xi. For an actual sample of size n, n new pairs I (X;I,Y;I) were generated in this manner, constituting a bootstrap samplp.. I One can generate as many bootstrap samples as desired, bu: since we are only using the bootstrap as an illustrative procedure, we only present three bootstrap samples. Graphical representations of these bootstrap I samples, together with the actual sample, are given in Figure 4.1 for the Bowline Point 1983 data. It is easy to see that there is close agreement between the bootstrap samples and the actual data, although the bootstrap I samples display less "within variation" than the. actual data. This is, to a certain extent, to be expected, since the bootstrap data are Simulated and, therefore, not subject to any additional sources of variation other I than those specified by the model. It should be noted, however, that the bootstrap sample ;s from a negative binomial distribution and hence has more variability than a sample generated from a Poisson distribution. ,I The fact that the bootstrap sample displays less within variability than the actual data can be interpreted as further evidence against the Poisson assumption. I As mentioned before, we are only using the bootstrap as an illustrative tool, and are not basing any inferences on it. Statistics can be cal-I culated to compare the bootstrap samples to the actual sample, but for our purposes they would add nothing over the included graphs. I In Figure 4.2 the bootstrap samples are presented in histogram form, to allow an alternative method of comparison. The histograms allow compari-son of only one variable at a time, so we have shown only the first vari-I able (Xi) from the pairs. (Histograms of the second variable are com-parable, since the bootstrap pairs and the data pairs have the same sum.) Again, the histograms show close. agreement between the observed data and I the bootstrap sample. Table 4.3 lists the Bowline Point 1983 Table B2 data together with two typical bootstrap samples. I I
I 4.22 I I Figure 4.1: Plots of the observed data, Bowline Point 1983 Table B2, together with plots of the bootstrap sanples BOHLINE TABLE B2 DATR I I I I I observation I (Solid line* 1st observation (Xi)' dotted line
- 2nd observation (V;))
I BOHLINE TABLE B2 DATA I 320 w I crz40 ~ i.l1 I I I I I
I I 4.23 -I I Figure 4
- 1 (continued)
I BOL*jLINE TABLE 82 DATA I N 320
~
I 40 H. I r~16e r't I~! I' ~80'\~
~ 1 ~iJ;;1~ro.~ \" 'V 'V, " '~'
I ~ observation
~
1.0 M N )) I '" - (Solid line - 1st observat i on (X), i d0 tted line.. 2nd observa tioo (V" EOWLINE TAELE E2 DATA I' I I I I I I I
I 4.24 I I Figure 4.2: Histograms of the first observation fran each pair in the Bowline Point 1983 Table B2 sart;lle, and tr.e first observation fran each pair in three !xlotstrap samples I BOWLINE TABLE B2 DRTR I I I I I a~U--J_~~~~~~~~ __~~__~~~~~~__~__~~~~~~__~~ N LARVRE: COUNT (OJ
~ ~ ~
I BOWLINE TRBLE 82 DATR I LS I I 6 I e~U--1,~L-~~~~-L~--~~~~----~~~k---~~;---~--~:~3 at ~_ I BOOTSTRRP 5R~lPLE I I I
I 4.25 I I I Figure 4.2 (continued) I ~~ BOWLINE: TRBLE: 82 DATA I ;-- r..) Z 34 W
- l I ~
L.. Lo.I I!I ~ I-I a:
-I WI a:
L~ ~ I-z -- I Lo.I (,j a: w a.
~
I a ID ID ell l .. CI CII 1 ,I I I ID ID N :i BOOTSTRAP SRMPLE 2 I BO",,"- INE: TRBLE: B2 DATA
~a I ;--
u 0::l 24 I 0
~
L.. I. Lo.I I >.... I-a:
-I Lo.I l~
a: I - I-Lo.I
~ e ~
I a~ >tJ
~ .!l I 9(,("i':, TRAP SRI-1PLE ;!
I I
I 4.26 I I I Table 4.3 Sowline Point 1983, Table B2 Data and Bootstrap 5aITples I Data Bootstrap ~le 1 Bootstrap Sample 2 Obs l:X; 158 Obs 2:Y.J-147 Obs l:X; 166 Obs 2:Y j 139 Obs l:Xj 207 Obs 2:Y j 98 I 143 72 115 100 80 135 106 57 71 74 94 57 83 74 126 62 51 69 I 97 141 138 100 108 130 80 85 82 83 86 79 109 109 ll9 96 96 99 132 106 116 109 228 112 96 264 I 350 142 280 212 224 175 176 223 204 195 109 214 137 80 112 150 134 144 134 104 112 190 I 84 45 60 69 65 64 37 9 21 14 32 67 46 24 24 25 40 31 51 39 44 24 47 46 I 44 38 46 36 35 47 118 30 116 142 48 93 107 48 106 153 30 103 114 47 III 146 31 98 I 137 59 94 102 108 88 61 49 37 50 76 9 50 61 2S 61 64 21 60 80 23 51 45 23 I 91 68 70 89 86 73 55 91 50 99 63 38 89 63 48 65 91 40 85 57 46 69 97 42 I 63 69 63 69 57 75 28 41 44 34 38 26 35 30 36 27 49 34 27 54 27 35 25 43 I 53 54 41 66 46 61 46 72 24 13 100 36 34 88 17 25 84 43 30 82 36 29 90 24 I 19 8 15 12 15 12 32 29 13 25 34 31 27 30 15 30 33 29 29 34 22 28 29 22 I 8 20 10 18 19 9 12 16 20 43 24 17 36 29 23 19 11 14 32 23 18 23 17 19 I 14 13 11 16 10 17 17 22 18 10 28 15 17 27 15 10 23 18 10 31 17 17 19 16 I 23 60 51 32 51 32 I
I '. 4.27 'I 4.3 THE INDIAN POINT REPLICATE SAMPLING I The analysis in Sections 4.1 and 4.2 used data available from Bowline Point and Roseton. Replicate data are also available from Indian Point. I However, because of their different nature, we treat them separately here. Ichthyoplankton samples were collected using two manual pump/net abundance sampler~ and an AUTOSAM at Indian Point Station 02 between June 17 and July 8, 1983. A description of the samplfng is given in the Indian Point I 1983 Annual Report (EA, 1984b). Data from the two pump samplers and the AUTOSAM were analyzed by tha techniques used in Section 4.2 for data from Roseton and Bowline Point. The results of analyses will now be pre-I sented. An explanation of the methodology appears in Sections 4.1 and 4.2. Triplicate observations (Pump 1, Pump 2, and AUTOSAM) were obtained for 47 three-hour periods. Eight species-life stage combinations were I present in numbers suffi ci ent1y great to make ana 1y5 i s worth\"hi 1e. Tables 4.4, 4.5, and 4.6 give the comparisons of the two pumps to each I other and of each pump to the AUTOSAM. Each of these compar'j sons (Pump 1 to Pump 2, AUTOSAM to Pump I, and AUTOSAM to Pump 2) is treated by the methods of Section 4.2. For example, to compare AUTOSAM to Pump 1, the data from these two sampling devices are viewed as replicate observations I from a series of p.s.u.'s and analyzed exactly as 'shown (for different data) in Table 4.1. (The three sampling devices were compared two at a time, rather than all three together, because the extension of the statistical I methods here to triplicate observations is currently not known.) For each of the species-life stage combinations examined, the chi-square statistic testing the goodness of fit of the Poisson model, which was given in Sec-I tion 4.1, is computed. In addition, the constant K is estimated as described in Section 4.2 and the chi-square statistic testing the goodness of fit of the negative binomial distribution is reported. I The assumption that each pair of replicate samples has equal volume is required by the analysis for the negative binomial distribut"ion. Conse-I quently, analysis of the Indian Point data was restricted to those of the 74 p.s.u.'s in which all three sampling devices were operating and all three resulting sample volumes differed by no more than 10 percent,. There were 47 p.s.u.'s satisfying these conditions. I In all cases, the Poisson model was decisively rejected by the data. This is consistent with the results for Bowline Point and Roseton in I Section 4.1. The fit of the negative binomial distribution is vastly better than the fit of the Poisson distribution, but not as good as it was for the data from the other two power plants. This is not unexpected I for the comparisons of Pump 1 versus AUTOSAM and Pump 2 versus AUTOSAM; with two different sampling mechanisms involved, it seems reasonable that their simultaneously taken 6bservations may not constitute indepen-I dent, identically distributed draws from a negative binomial distribution. The difference in sampling mechanisms makes it plausible a priori that the manual pump/net sampler and the AUTOSAM may be generating observatio'ns I from two distributions that are not identical. Conditions that seem neces-sary to assume equality of the sampling mechanisms would be equal area and same direction of the sampling orifice (pipe), equal velocity at the sampl-I
I 4.28 Table 4.4 I Indian Point 1983 X2 Tests To Evaluate The Poisson And Negative Binanial M:xlels: Plm1p 1 vs. PtlIS? 2 I Life (3 Hour Samples, 47 Pairs) Poisson Significance Neg .Bin. Significance I Species " X2 X2 level K level Clupeidae Sta2: Larvae 59.22 70.29 .015 59.79 .083 I Alosa spp. Larvae 6.94 85.38 .00052 62.33 .054 Engraulidae Larvae 44.75 187.82 <10-7
<10-7 67.62 .021 .045 I
Anchoa mi.tchilli Eggs 1.28 184.96 63.49 Anchoa mi.tchilli Larvae MJrone atrericana Larva.e 2.14 3.08 465.08 125.54
<10-7 <10-7 81.00 53.83 .001 .20 I
Morane saxatilis Larvae .65 266.32 <10-7 29.97 .97 Morone spp. Larvae 3.45 112.05 <10-7 45.39 .50 I I I I I I I I I I I
I I 4.29 Table 4.5 I Indian Point 1983 '1:. 2 Tests To Evaluate The Poisson and ~tive s:l.naniaI Moders: Pump vs. AIJroSAM I (3 Hour Samples, 47 Pairs) I ~;e:cies Life Sta2:e K Poisson
,,&2.
Significance Iele1 Neg.Sin.
'/.2-Significance Iele1 C1upeidae La%va.e 24.01 92.00 <10-4 63.53 .044 I Alosa spp.
Engraulidae Larvae La%va.e 2.64 1.63 127.31 1499.51
<10-7 <10-7 58.97 56.00 .095 .15 <10-7 <10- 7 I Anchoa mitchilli Anchoa mitchilli Eggs La%va.e .02 .10 613.27 1694.01 <10-7 808.21 138.48 <10-7 l\k)rone americana LarJae 1.26 204.97 <10-7 58.82 .097 I Morale saxatilis LarJae .85 319.38 <10-7 <10- 7 44.68 .53 Morene spp. larvae 1.52 174.21 55.51 .16 I
I I I I I I I I I I
I 4.30 Table 4.6 I Indian Point 1983 X2 Tests To Evaluate TIle Poisson and Negative Binanial r-*.odels: I Pump 2 vs. AlJIOSlIM (3 Hour Samples, 47 Pairs) I Soec:ies LHe Stage K Poisson X1 Significance Level Neg.Bin. X 2 Significance Level I C1upeidae Laxvae 16.22 114.08 <10-6 69.56 .014 Alosa spp. Larvae 2.74 147.78 <10- 7 68.30 .022 I EngJ:aulidae Larvae 1.71 1565.58 <10-7 58.44 .103 Anchoa rni tchilli Ancl:1Oa mitchilli Eg:Js Larvae
.05 .19 420.81 1688.75 <10- 7 <10-7 320.92 99.75 <10- 7 <10- 5 I <10-7
~r1:me americana Mor'::lI'le saxatilis Larvae Larvae 1.19 2.67 144.37 215.98 <10- 7 37.42 55.39
.81 .16 I M:lrl:)ne spp. Larvae 2.1S 178.16 <10- 7 64.46 .037 I
I I' I I I I I I I I
I 4.31 I ing orifice (intake velocity), and equal duration of pumping. Differences I between sampling systems would cause differences to be expected in the sampling distributions observed, at least in an absolute sens(~. There are slight differences between the AUTOSAM and the pumps in durat'ion of pump-ing and in velocity at the sampling orifice (the AUTOSAM's flow rate is I about .8 to .9 cubic meters per minute, while the pumps' rate is about 1 cubic meter per minute; see E~ 1984b, p. 3-1), but the systems are quite similar in other respects. Note that the pump versus AUTOSAM negative I binomial model is not rejected with overwhelming- force, as the Poisson model is. However, the negative binomial model's goodness of fit can be fairly described as marginal. I It is more surprising that the comparison between the two identical pumps aliso results in a somewhat marginal level of agreement with the negative I binomial model. For two identical pumps located one meter apart hori-zontally in the discharge channel, it is puzzling that the family of negative binomial distributions, which is much broader than the family of Pc)isson distributions, does not provide a better fit. A search of the I observations for clues to what is happening turned up a series of anoma-lous observations. One such set of observations is the following collec-tion of triplicate sample counts of Engraulid larvae: I Pump 1 Pump 2 AUTOSAM I 39 125 44 185 282 0 54 68 203 I 139 240 187 239 14 62 46 102 40 I 835 474 244 858 386 153 266 567 169 178 217 317 I Coefficient 98 155 179 of Variat;on(%) 105.9% 95.9% 87.6% I I Since Engraulids other than bay anchovy are very uncommon in the Hudson River (EA,1984b, p. 4-10), it is appropriate to treat the lumped counts of bay I anchovy and anchovies (i.e., all Engraulids other than bay ar.chovy) com-bined. This js because ichthyoplankton identified as anchovies are vir-tually certain to be bay anchovy. Using lumped Engraulid data has the I effect of replacing counts that are distorted due to incomplete identifica-tion at the species level with undistorted, honest bay anchovy counts. I These counts reveal a great deal of fluctuation, as evidenced by the very large coefficients of variation. The variation observed in these replicate observations ;s greater than negative binomial variation. I
.tot _ _ _ dtnft "_'I'__ ~
4.32 I The fact that the negative binomial model fits so well in all other cases I leads us to retain it and, for the time being, to consider these Indian Point data as outliers or aberrant observations. Since there is some doubt that these datal are true replicates (EA,1984b, pp. 4-24 to 4-26), I there is no reason to go beyond the negative binomial model. The observa-tions listed above ar'e not atypical of the Indian Point data; similar lists can be given for the other species-life stage combinations. It is clear I that these Indian Point data are characterized by a greater degree of inherent variability than the data from the other power plants analyzed previously. Several factors could be responsible for this increased random I fluctuation, including the fact that the discharge channel is an open chan-nel rather than a closed one. Another source of difficulty is the great variation in the volumes of the samples, which introduces substantial com-plexities to the analysis. The extreme disparity among simu.ltaneous counts I indicates that a very broad, flexible class of distributions may be needed in order to obtain a better fit to the observed data. I Two further analyses have been made to supplement the goodness-of-fit tests. The first is a set of plots of the observed data for the eight species-life stage combinations treated. These plots are given in Figure 4.3. They I confirm the pattern of highly discrepant triplicate simultaneous observa-tions illustrated by the counts listed above. The second is a bootstrap analysis of the type described in Section 4.2. This analysis is Table I 4.7. The bootstrap values of ""+ K show a large amount of variability, reiterating I the evidence that these data show variation beyond negative binomial. 4.4 SAMPLE DURATION I The sample duration, or equivalently the length (in time) of a p.s.u., is important because it bears directly on the amount of effort expended in I sampling and the accuracy attainable in estimation. We first focus on the question of whether the negative binomial model is necessary for longer' sample durations. If such samples display reduced variability, I resulting in a large value for K, then the Poisson model may become appli-cab 1e for these longe!r samples. I As sample duration increases, we might expect a "smoothing effect to occur; I l' that is, while ichthyoplankton counts taken on an hourly basis could show relatively large variability, we might expect counts taken over longer periods, such as daily counts, to show less variability. Three reasons can be advanced in support of this expectation .. First, diurnal variation will have less of an effect on counts over longer periods, and no effect on I da i 1y counts. Second, assume that the one-hour count Xi in p.s.u. ance ui + (l/K)~~, where each ~i is positive. Let EX i denote (for con-has mean ~i and vari- I creteness) the sum of 24 one-hour counts. Because it is a sum of indepen-dent random variables, EX.1 has mean E~.1 and variance Eu.1 + (l/K)Eu~1 (where 311 I I
I I 4.33 I I Figure 4.3: Plots Of Observed Data, Indian Point 1983 Pumps vs. Autosam I 1983 PUMP AND RUTOSAM DATA I I I I
.r.:
I Dotted lines = Pump Samples. Solid Lines = AUTQSAM Samples. I IP 1983 PUMP AND RUTOSAM DRTR I w a: 60 I I:J:
..J 40 I
I I ~3At'IPLE I
I 4.34 I I
;:;UI,J r' IP Figure 4.3 (Continued) 1:3:33 PUHP AI'EI AUTIY3Arl DATR I W
7l7\iJ 'L __ I t: I L I ~:,=, '.::JLl (.
- I
-Sl]Or' ~ f* j :.~. i:; I §t co f* .~ -L __ I ~c:uur.
~.
I
';;'" \ iq,.\ if /\'1\
~~J0t* W 1 C10 f* f ~ r~.,!:'j n..
'C 7<
1 Ir1 " if-i' 11 .,. ... 1 4= I
! ~" I
- I~:', j tt'
.... i~l, :1-.'*') ~,:/"\~,\i f;,' - \.i . - : - l L I " l i l l l : ' t I I H I I ..) :
(I) r"- li"') c.*J "J' (r') C) '1" I
- Ar*1PLE Dotted Lines = Pump Samples, Solid Lines = AUTOSAM Samples.
IP 1983 PUMP RND RUTOSRH DRTR I I I I I I
- ,RrlF'LE
I 4.35 I I Figure 4.3 (Continued) I IP 1983 PUMP AND AUTOSAM DATA I I II I II I ::;180 I II t t
~,al'\ '11/1 ~,' ;~\. G',lW\,,\,.,)i ." .11 ~~~~~~~~~~~~:~I.:~\~'~4i~\~!~~~~~_,.\,~~'~'~"~I'!~+_\;~ S~,!I~I'!_~+
I, ":" I I 'r... SAMPLE Uj (\J (., M "<t' I Dotted Lines s Pump Samples, Solid Lines IP 1883 PUMP AND AUTOSAM DATA s AUTOSAM' Samples.
't I
I I a: 28 ,"1; lq! i
!¥ t*}\ \ 'j: ':~At I .310~; ~
(-.
':, #, I ,', fii'; ~~ l~<~ *:~'" :
i,:",:.-h' J,l' b: :. 1\ Ii ':'i,' I', " \
..i:..
IA' ",': I~II I. i I" 'ft' ,
.~II ll.** ~ ,1: I I ....*-;'
171:: "
,k'!! . ' ;, ,
r"_ U') (\J SAHF'LE I I I I
I 4.36 I I Figure 4.3 (Cor. ti;"~ued) I? 1983 FUMP AND RUT0SRM DRTR I I I I I I
'::,m"PLE Dotted Lines = Pump Samples, Solid Lines = AUTOSAM Samples. I IP 1:3:33 PUi"P RiID RUT':6Ar, DATR ~i~ l: I w I
~L., -11 _>,)\:.. i' I
.~ ,
I I I I I I
I 4.37 I Table 4.7 I Estimated Paired i+ Values Pump From Indian Point 1983 Data And 10 BootstraI2 Samp1es* (47 3 Hour Samples) I ...+ K 59.22 6.94 44.75 1.28 2.14 3.08 .65 3.45 I B 81.76 5.95 17.27 1.73 9.85 2.85 .82 2.19 0 I 0 T 100.48 10.60 29.31 1.12 2.59 4.89 .56 2.03 S 59.98 37.82 126.15 1.25 1.68 1.28 .56 2.55 I T R A 21.32 2.83 20.04 496.68 10.50 3.76 .80 3.83 P 81.03 9.36 107.67 .44 2.18 2.57 .45 3.71 I V 55.24 10.88 89.53 1.19 3.77 4.36 .53 1.63 A I L U 117.73 4.32 59.75 25.90 .74 6.13 .87 6.46 E 188.48 4.82 28.19 16.40 1.36 3.91 .52 7.06 I S 93.69 5.49 112.37 45.43 6.54 1.54 .26 2.62 0 I F
.. +
44.55 9.47 41.24 .92 10.84 6.19 .84 2.72 K I Mean 82.13 9.86 61.48 53.85 4.74 3.69 .62 3.48 I Std. 44.61 Dev. 9.65 40.25 147.58 3.94 1.64 .19 1. 77 I *The columns of this table correspond to the eight species-life stage o::mbinations entrained in sufficient numbers for analysis. Each column contains the value of i+ estimated from the data and 10 values calculated I fran bootstrap s~les. COlumn Species Life-Stage I 1 2 C1upeidae Alosa spp. Larvae Larvae 3 Engraulidae Larvae I 4 5 Anchoa mi tchi11i Anchoa mitchilli Eggs Larvae 6 M)rone americana Larvae I 7 8 M)rone saxatilis M)rone spp. Larvae Larvae I
4.38 I sums are over the 24 one-hour p.s.u.ls). However, if we instead view EX.1 I as the count from a single 24-hour p.s.u., the negative binomial model gives the variance of IX i as I~i + (1/K')(I~i)2, where K' corresponds to I 24-hour p.s.u.ls rather than one-hour p.s.u.1s. This implies that (4.20) I I and it then follows from 2 2 I (I~i ) > E~;, (4.21) I which is easily shmm, that K' > Ke Of course, the amount by which K' exceeds K depends on the one-hour p.s.u. means ~i. I Third, under ideal theoretical conditions, in particular constant flow rate in the discharge channel and constant ichthyoplankton mean density, we would expect the variation in the channel to become more like Poisson variation I as the sample duration increases (See Appendix IV). In terms of our para-metets, we would e~:pect K- as sample duration - . However, the empirical evidence, presented below, suggests that this is not the case.
~+
I TablE! 4.8 shows values of K for the Roseton 1982 data, for the eight species-life stage combinations* considered and for sample durations of A ... I 1, 2, and 3 hours. While there is some evidence that K' is increasing as a function of sample duration, it is not increasing quickly enough to support the hypothesis that the di~tributionof the data approaches the I Poisson. In fact, examination of Table 4.1 shows strong evidence that the Poisson distribution does not fit the three-hour samples, while the nega-tive binomial still does quite a good job. I Ten bootstrap samples were generated by the mechanism explained in Section 4.2, in order to get some idea of the variability in the estima- I tion of ~+. Since there are no actual data available to do this (it would require at least two sets of simultaneous paired samples, that is, four Simultaneous samples), the bootstrap was the only available mechanism. The I
. ~+
estlmated K values resulting from these bootstrap samples appear in Table 4.8. We chose to generate 10 samples because it was felt that this would I gi ve enough i nforma ti on about the va ri abi 1ity. of K+. Of course, more samples could have been generated, but then the question of computer time would have become an issue. Furthermore, since these are simulated data, I they are not giving information about the variability of K~ calculated from data in the d"ischarge channel, but rather about K+ calculated from data that follow the assumed model. I I
I I 4.39 Table 4.8 I "'+ Values from Reseton 1982 Data and 10 Bootstr!:E Samples Estimated K Resewn I~S~ Data - I Hour samples; = I K+ 5.04 34.24 25.94 1.77 17.65 8.85 15.97 .83 I B 0 1.86 62.74 19.02 1.25 28.07 13.17 39.04 .34 0 11.16 40.68 16.2 1.87 21.17 9.24 56.83 1. 79 I T S 4.5 87.42 16.31 .9 19.35 19.67 43.83 .58 T I R A 4.74 8.29 104.8 41.88 3.66 15.37 4.76 21.23 .88 P 10.82 37.27 1.03 19.54 4.16 ll.05 1.06 I V
.i\
4.11 105.48 28.16 1.14 20.06 28.58 65.03 .91
- 1:. 6.96 100.15 33.23 1.37 12.14 5.67 15.27 1.81 I TJ E 7.86 99.4 17.27 1.99 19.64 5.75 22.4 1.22 S
I 0 2.1 28.19 27.71 1.38 15.37 8.1 14.26 .33 E" 9.66 31.64 16.02 1.99 26.73 12.92 13.69 .72 I K A+ J Mean 6.03 64.14 25.36 1.67 19.55 10.99 28.96 .95 Std. 3.01 36.17 9.21 .76 4.7 7.42 19.05 .5 Dev.
-.rl'he columns of this table correspond to the eight species-life stage canbinations entrained in sufficient numbers for analysis. Each column A
contains the value of i<+ estimated from the data and 10 values calculated fran bootstrap samples. Column Species Life Stas:e
- 1 Clupeids Larvae 2 Alosa spp. Larvae 3 White perch Eggs 4 White perch Yolk-sac larvae 5 White perch Larvae 6 Striped bass Yolk-sac larvae 7 Striped bass Larvae 8 r-brone spp. Lar-rae
I 4.40 I Table 4.8 continued Roseton 1982 Data - 2 Hour Samples I A+ 6.61 40.33 61.69 L89 49.98 9.01 27.77 .93 K I B 2.62 74.84 28.60 2.09 51.29 5.87 49.90 1.57 0 0 2.45 34.65 46.35 1. 74 36.14 5.85 48.80 1.24 I T S T 5.69 39.05 20.07 1.87 14.01 10.88 23.75 .45 I R 6.12 21.67 34.24 4.08 35.61 16.65 26.74 1.15 A P 2.99 42 .. 09 92.26 1.85 50.16 8.72 41.46 2.37 I V 2.25 34.29 47.25 2.20 27.80 5047 46.68 1.84 A L 7.42 125.93 171.85 3.24 15.54 4.72 76.27 .32 I u E S 16.13 50.17 27.75 1.12 11.02 8.41 29.12 3.36 I 7.79 92.37 26.55 5.89 21.15 21.12 24.75 2.98 0 F 9.14 15.29 148.78 1.24 40.58 13.07 167.53 .56 I A .. p K' MP..an 6.29 51.92 64.13 2.47 32.12 9.98 51.16 1.52
~ ~
Std *. 4.05 33.01 51.96 1.42 15.16 5.16 41.68 1.02 Dev.
I 4.41 I Table 4.8 continued I ~+ Roseton 1982 Data - 3 Hour Samples K 7.86 107.25 52.81 1.55 48.31 10.97 33.08 .91 I B 30.02 57.23 34.19 2.47 27.97 12.67 122.31 .73 I 0 0 31.97 34.88 124.65 4.8 64.59 5.78 28.09 .49 T S 6.82 317 72.75 I T R 10.13 645.96 104.56 1.69
.92 40.82 71.86 21.49 21.34 20.55 123.05 2.21 1.45 A
I P 5.2 770.97 49.71 1.6 34.02 12.86 24.34 .53 V 15.26 109.78 48.09 2.94 17.42 4.79 71.12 3.24 I A L 7.26 145.72 58.31 1.47 41.49 62.45 33.92 .7 U I E S 9.4 36.13 318.78 627.28 17.96 115.15 1.82 1.81 130.13 36.76 31.14 6.91 85.25 47.02
.34 1.42 0
I F A+ 4.87 281.46 28.5 .96 17.26 34.28 67.76 .87 K I Mean 14.99 310.57 64.24 2 48.24 20.43 59.68 1.17 I Std. 11.8 259.76 35.92 1.1 32.07 17.1 37.55 .87 Dev. I I I I I I I - - ----- - - - - - - - - - - - - - - - - ---- ----------~---
I 4.42 I At present, adequate data are not available to test the Poisson vs. nega-tive binomial models for longer sample durations. This is because the compositing process drastic~lly reduces the number of paired replicate I samples available for analysis. For example, the Roseton 3-hour composite analysis contains only 18 observations. Furthermore, compositing must be done over consecutive sampling periods. In most of the cases examined there were gaps in the sampling process, that is, time periods in which I no sampling occurred. This fact also contributes to the small numbers of paired replicate samples entering the analyses of composited data. Com-positing beyond 3-hour periods would reduce the number of paired replicate I samples below the point where analysis would be worthwhile. Investigation of p.s.u. lengths greater than three hours does not seem I profitable for several reasons. These include both statistical and non-statistical considerations pertaining to the estimation of entrainment, as well as the needs of other related analyses. The data collected in I entrainment abundance sampling will be used in mortality and similar studies; the need to provide sufficiently detailed information for these studies makes short p.s.u.ls seem more appropriate than long ones. I The main issue is the effect of p.s.u. length on the standard deviation and the coefficient of variation of the estimate of entrainment over the I entire season. We will now show that when the negative binomial model holds, making the p.s.u. shorter results in a lower CV of seasonal entrain-ment. To avoid unnecessary complications, some simplifying assumptions I wi 11 be made. Assume that the entrainment season consists of d days indexed by i=l, ... ,d, I each of which is divided into r p.s.u. IS indexed by j=l, ... ,r. Define I vlJ.. = the volume of water in p.s.u. j on day i , V ** lJ
= the volume of water sampled from p. s. u. j on day ; ,
I A.. lJ
= the true mean density of ;chthyop1ankton in p. s. u. j on day i , I X..
lJ
= the obser1ed ichthyop1ankton count in p. s. u. j on day i ,
I T1..j = the true number of ;chthyop1ankton in p. s . u. j on day ; I Then A.. = x.1 J-Iv 1..J a,nd the variance of X"J' is given by (4.16), so 1J I d r I
~ 4 ~
T= \' \' (V./v .. )(v .. \ .. ) = \' )' T .. (4.22)
- .. f..
i = Ij =1 lJ lj lJ 1J i j 1J I
I 4.43 I 2 A Var(T) :=I L L (V .. /v .. ) [v .. A. . I i j lJ lJ lJ 1J aLL 2 2 I i j (V. */v . . )(V .. A ** ) + V.. A .* /K lJ lJ lJ lJ lJ lJ (4.23) I Assume that all sample volumes vij have a common value v and that all p.s.u. volumes Vij have a common value V. Then I I I I I (4.24) I (4.25) I Assume that water is sampled from the discharge channel at a constant rate per unit time, regardless of the length of a p.s.u. Consider the A behavior of Var(T) as a function of r, the number aT p.s.u. IS in a 24-I hour day. The annual sample volume, that is, the total amount of cooling water sampled during the entire entrainment season, is given by drv, and the annual discharge volume is given by drY. The overall seasonal mean I density of organisms in the discharge flow ;s T/drV. The quantities drY, drV, T, and T/drV are not affected by the value of r, the number of I p.s.u.ls per day. The only term in the expression for CV(T) affected by r is LLTfj' which decreases as r increases. I A numerical example will illustrate the effect of p.s.u. length, or r
~hich equals 24/p.s.u. length in hours), on the coefficient of variation A
I of T. Let the entrainment season consist of d = 5 days, and let sampling take place at the rate of 150 cubic meters per 3-hour period, or 1,200 cubic meters per 24-hour day. Assume that the total number of organisms I entrained ;s 2,400,000 and for computational convenience assume that they occur at a uniform rate of 20,000 organisms per hour fay' the entire five-day season. Finally, let the cooling water flow through the plant equal 100,000 cubic meters per 3-hour period. Then for any r t I
I 4.44 I v = 1,200/r I v = 800,000/r and I (l/drv)(drV/T) = (1/6,000)(4,000,000/2,400,000) = .0002777, I so the first term on the right hand side of (4.25) ;s negligible. And I Y~T~./T2K = (5)(1)(480,000)2/(2;400,000)2 K = .20(1/K) for r = 1,
.'- 1 J I = (5)(8)(60,000)2/(2,400,OOO)2 K = .025(1/K) for r = 8, I = (5)(24)(20,000)2/(2,400,000)2 K = .00833(1/K) for r = 24, I
so ; f K = 2 CV(T) = [.0002777 12
+ .10J / = .3167 for r = 1, I = [.0002777 + .0125]1/2 = .1130 for r = 8, I = [.0002777 + .004166J 1/ 2 = .0667 for r = 24. I The differences among estimates based on 24-hour p.s.u.ls (r=I), 3-hour p.s.u.ls (r=8), and I-hour p.s.u.ls (r=24) will not be of this magnitude I
when" the entrainment pattern is more complex, but they will show the same direction. In particular, the presence of diel patterns and steep peaks in entrainment density will make these differences smaller. I A further drawback of samples of longer duration is that they introduce other sources of error, such as nonsampling errors due to sorting and I identification. With samples of longer duration, it will be necessary to increase the size of the sample in order to achieve desirable accuracy. Thus. larger batches of material must be sorted and identified. I Although the choice of short p.s.u.ls rather than long ones is clear, the costs of extra processing and data handling must be weighed. More import-antly, it ;s necessary to remember the point, mentioned at the beginning I of this section, that extremely short p.s.u. IS result in a great deal of variability. It can be difficult to use effectively the high-in-noise, 10w-1n-signal information produced by p.s.u. IS that are extremely short. I I
I 4.45 I 4.5 AN ALTERNATE MODEL FOR I THE ESTIMATION PROCESS The model described in Section 4.2, which ascribes negative binomial variation to the discharge channel, does a very good job of explaining I the observed variation in the data. However, based on theoretical con-siderations, such as the derivation given in Appendix I, it could be con-sidered somewhat surprising that the variation observed in the channel I does not conform to the Poisson model. More precisely: I 1. Assume that individual ichthyoplankton organisms have independent spatially random distributions (described in Appendix I) within the discharge channel; this physical characteristic of the I process leads to the conclusion that the data should display Poisson variation, but
- 2. All empirical findings strongly reject the Poisson model as I tenable.
I This situation leads to the question: Is there a reasonable physical model for which the variation in the channel can be Poisson, but the varia-tion in the observed data can be other than Poisson? The answer to this I question has led to an alternative model that not only fits the data adequately, but also contains parameters that have meaningfu"l interpre-tations and show how the observed data may not be Poisson. I It should be noted at this point that we are n21 advocating the model developed in this section as the one model that fits the data. We are I only seeking to explain the observed variation without rejecting Poisson-ness in the discharge channel, and the model developed below does a credible job. The turbulence observed in the discharge channels leads to I the belief that the distribution of larvae is Poisson, i.e., a spatially random distribution. The fact that the Poisson model is not tenable may lead one to believe that there is some sort of clumping in the channel, which can "lead to negative binomial variation (Quenouille, 1949), or else I to believe that additional variation is encountered during the data col-lection process. It is this latter alternative that we consider here. I The model developed in this section should perform well in analyzing the data treated in Section 4.5.3 if the model gives the sole or major cause of the datals failure to conform to the Poisson distribution. The model IS I poor performance in Section 4.5.3 leads us to dismiss it as an explana-tion for the observed negative binomial data; the model is not a credible explanation for this phenomenon. I We also note that with observational data it is impossible to test the adequacy of a model, that is, to find out if there is Significant unex-I plained variation that is not attributable to random error, but can be attributed to inadequacy of the model. This is a very different question than that of whether a model explains a Significant portion of all varia-tion, which we can test with the available data. Therefore it is impos-I sible to justify the following model on data-based arguments; we can
I 4.46 only answer whether or not the model explains a significant portion of I the observed variation. Tests of adequacy (also known as lack of fit) must be based on designed experiments. I The treatment of the remainder of this section is for a single life stage of ichthyoplankton., Separate analyses can be performed for the dif-ferent life stages. I 4.5.1 THE CLASSIFICATION MODEL I Suppose that classification into genus (e.g., Marone spp) can be made correctly, but thclt there is possible misclassification into species I within the genus. Let X* be the number of ichthyop1ankton of a particular genus collected during a specified sampling period. These X~ organisms are later identified according to species. Let X denote the number of these I or~;anisms that ar'e classified as being of a particular species, say species A. Note that the number of organisms of species A collected may differ from X, the number of organisms classified as species A; the two numbers will be equal only if there are no errors, or exactly offsetting I errors, made during the identification process. The classification at this stage is a difficult task, and the assumption of no classification error is probably unrealistic. I Assume therefore that an organism from this genus is classified as species I A with probability p. Then the distribution of X conditional on X* (that is, given the value of X*) is binomial, I X - Binomial(X*,p). (4.26) I (More will be scLid later about p, the probab-ility of classifying a member of the genus as belonging to speCies A.) Assume also that X* has a Poisson distribution with mean 1, ' I X* - Poisson(l). (4.27) I Theoretical arguments of a biological natur~ described in point 1 at the beginning of Section 4.5, support this assumption, which says that the varia-I tion in the channel is Poisson variation. X is the observed quantity in our data, the number of specimens classified I as species A. The unconditional distribution of X, the observed count of s pe c i c sA, i s I X - Poisson(lp). (4.28) I I
- __________=___.. IIIlI
I 4.47 I Now we come to the key point: what is th~ variation among the observed I species counts? In Sections 4.1 and 4.2, we saw that the Poisson model did not fit the observed data, while the negative binomial ~odel did. The motivation leading to the latter was the fact that in the Bowline Point 1983 paired observations from replicate sampling (EA, 1984a, Table I B-2), the within-pair sample variance s2 substantially exceeded the pair rnean on average. This suggested that the equality of the population I variance and mean, 2 I a = IJ, (4.29) which holds for any Poisson distribution, did not hold for these data. A I more promi si ng re 1ationshi p was the possi bil i ty that the popul a ti on vari-ance could be a quadratic function of the mean, I (4.30) I where c is an arbitrary nonnegative const~nt. This characte~istic led us to the negative binomial distribution, which includes the Poisson distribution as a special case (when c = 0). I I The model now being considered is X'* - Poisson(A), I X - Binomial(X'*,p). (4.31) If under this model we observe n replicate counts X1,X 2"",X n of species A from n simultaneously drawn samples (we usually have n = 2), the model becomes X~, - Poisson{).) for i = 1, ... ,n, (4.32) Xi - Binomial{Xi,Pi) for i = l, *.. ,n, where Xi to X; are independent, Xl to Xn are independent, and Pi ;s ~he probability that an organism from the genus being examined ;s classified in the ;th sample as species A. Notice that PI to Pn may differ accord-ing to the level of the sorter1s skill, the working conditions, the time of day at which the sorting is done, and other factors. Define
I 4.48 n I l.! = (lIn) L AP~ = AP, (4.33) I [, i=1 n c2 = .n-1l L (Pi P /p-2 = O'2/-2
- -)2 p p . I i =.1 It can be shown that I
1 n I E n I Xi = E(X) = \.l , (4.34) i=l I 1 n
~ - 2 I
E n-1 I.. (Xi-X) i=l I I This shows that the variance among the simultaneous observed counts is a quadratic function of the mean, which is not the case when these counts are independent, identically distributed Poisson observations. Therefore, under this model, which allows for possible misclassification of species, it is possible to have Poisson variation in the discharge channel and yet not to have Poisson variation in the observed data. The chi-squared t~sts done in Section 4.2, which show that the negative binomial model is a good fit to the data, will also serve to test this model. This is blecause the chi-squared test ;s based on fitting a vari-ance that is a quadratic function of the mean, which is characteristic of this model. This model, however, enjoys many advantages in terms of thE~ interpretation of the parameters. 4.5.2 INTERPRETING THE PARAMETERS The quantity c 2 actually measures how far the observed data are from the Poisson model. That is, if c 2 = 0, then the observed data would be Poisson, and the data move away from the Poisson distribution as c2 increases. If data are classified by n sorters with respective species A classification probabilities Pl""'P n, the parameter c2 measures the level of agreement among the sorters, since c equals the coefficient of variation of the Pi's.
I 4.49 I It is clear from the definition that c2~O. It is also true that c 2 is bounded above, as will now be shown. From (4.33) we have I I c2 =...L n-1 t. n
\ ( 4.35) i=1 I
I Since Pi~O ;=1, *** ,n, ;t follows that ( rP i}2 ~ rpi, because I I {nr Pi} 2 = nLp.2 + r LP*P*J ~ nr p2.* 1 1 1 ( 4.36)
;=1 i=1 irj i=1 I
There~re [Ep~/(Epi)2] ! I, and from (4.35), I c 2-< n...n21 [1 _1] = n. I n (4.37) I This upper bound is attained when one of the n Pi's equals 1 and all of the n-1 other Pi's are zero. We therefore have I o -< c2
-n < ( 4.38)
I for n~2. If n=1 formula (4.33) cannot be used, and an analytic bound cannot be used. The reasonable extrapolation is to take the upper bound I* as 1 if n=l. (The fewer sorters, the less the amount of variation due to classification.) Note that c2 can be identified with 11K in the negative binomial model (Section 4.2), since we have from (4.34) and Appendix II I Classification variance: ~ + c2~2 (4.39) I Negative Binomial variance: ~ + {I/K)~2. I I
I 4.50 It is important to realize that the bounds on c2 can only be derived I within the interpretations of the classification model. Without these interpretations no bounds can be derived on c2. I If we define the II re l ative skill of the ith sorter to be ll I I we can think of pi as the species A classification probability of the ith sorter relative to the other sorters. We can then express c2 as I (4.41) I I whl~re p* = (l/nHpi = l/n. Thus, if all of the sorters classify as species A with thE! same probability, as would occur if one person were to I do all of the sorting under identical conditions, we would have c2 = 0, and the observed data would follow a Poisson distribution. As the rela-tive probabilities pr become more discrepant, c2 will increase to its I maximum value, c 2 = n. I A model with c 2>0 does not imply that the persons identifying the organisms are unskilled, or that there are gross errors in classification. The quantity c2 measures relative, not absolute, consistency in classifica-I tion. It is possible that all classifiers are highly skilled but not equally skilled and not absolutely consistent within and between workdays. Thus, on a relative scale, there can be differences in classification U accuracy among samples, leading to a nonzero c2. There is good reason to believe that there is variation among sorters and identifiers and/or among times and working conditions. Sorting and identification are difficult I tasks, particularly in certain cases where two or more closely related species are hard to differentiate. I During the laboratory analysis of entrainment abundance data, two opera-tions are performed: sorting, or separating the ichthyoplankton; and identification, or counting the number of each species-life stage com-I bination. Several types of technicians are involved: analyzers, who process all samples; I quality control checkers, who process samples determined according to a quality control plan; and I I
I 4.51 I resolvers, who settle any discrepancies between analyzers and I quality control checkers. In sorting, the processing of a sample is considered defective or erroneous if the analyzer misses more than 10% of the ichthyoplankton in the sample. I In identification, the processing of a sample is considered defective or erroneous if either of the following occurs on one or more species: I (a) for the species or one of its life stages, the QC (quality control checker) count is 20 or less and the analyzer and QC counts differ by 3 or more; I (b) the sum of the species-life stage percent errors over all life stages with QC counts above 20 is 10% or more, where the species-I life stage percent error is defined as (analyzer count - QC count)/QC count. I Noticeable differences can exist between analyzer counts and QC counts I without leading to a conclusion of defective processing. For example, consider a sample in which the analyzer counts for a species are 100 eggs, 20 yolk-sac larvae, and 6 post yolk-sac larvae, and the QC counts are I 109 eggs, 18 yolk-sac larvae, and 8 post yolk-sac larvae; the processing of this species in this sample is considered nondefective. The margin of error allowed in sample processing without resulting in a conclusion of I defective processing makes it clear that a degree of disagreement among analyzers processing the same sample is expected. This supports the posi-tion that there are differences in sorting and identification accuracy among samples. I The classification probability, p, of a particular sorter is actually composed of two pieces, and examination of these pieces can be helpful in I coming closer to an optimal sampling scheme. For purposes of illustra-tion, we will concentrate on one particular genus, Marone, and its two dominant species, striped bass (S8) and white perch (WP). I The classification model assumes that organisms can be classified into the correct genus without error. Suppose now that we are interested in the classification of S8. In the model described above, the quantity p I is I p = probability of classifying a Morone spp. as a striped bass. I This probability is composed of two parts: the sorter's skill in identify-ing striped bass, and the abundance of striped bass in the sample. Define I I
I 4.52 I
~SB = proportion of sample of Morone spp. that are striped bass.
I Then I p = Prob[classifying SB as SBJ~SB (4.42)
+ Prob[cla,ssifying WP as SBJ(l-~SB)'
I so we see that the probability p is a function of not only the skill I of the sorter, but also the composition of the sample. If we assume that the skill of the sorter is constant over time, then the only variation in p comes from variation in ~SB. I The interpretation of the classification model leads to the bounds I o< c2 < 1. From (4.34), we see that c 2 is the coefficient of the quad-rat'j c vari ance term di scussed in Secti on 4.2. Whil e the quadrati c vari-anCE~ model does not lead to an analytic bound on this coefficient, I Cassie's comments (quoted in Section 4.2.1) and the empirical results for K = 1/c 2 in this chapter support the view that c2 is generally less than 1, and is often substantially less than 1. Because c 2 < 1 is true I except on rare occasions, it will be useful to take c2 : 1 as an empirical conservative upper bound from time to time. This is not a true bound that I can be established deductively, but only a value that the actual c2 usually falls below. The empirical bound c2 = 1 will be used as a value giving I greater variance in (4.34) than almost all values of c2 observed in prac-tice. Although this empirical conservative upper bound of 1 is a conveni-ent reference point against which to measure an actual c 2 , this use of I c2 = 1 does not mean that an analytic bound for c2 exists. Both here and in later chapters, the empirical bound c2 = 1 will reflect a distribution I of greater-than-usua 1 vari abi 1i ty, but its use wi 11 not mean that values of c2 exceeding 1 are impossible. Setting c 2 = 1 can be described informally as a conservative approach to variability because the true I value of c 2 will most often be less than this. If a lack of replicate sampling data for Cl species-life stage combination leaves us unable to I estimate c 2 , using the empirical conservative bound c 2 = 1 will be a rea-sonable, tentative first step. I I I
I I 4.53 4.5.3 EVALUATING THE CLASSIFICATION MODEL I The classification model hypothesizes that errors in the sorting and identification of ichthyoplankton have a major impact: they are respons-I ible for transforming the sample counts from the Poisson distribution, which would be observed under error-free processing, into the negative binomial distribution. The appropriateness of this model can be examined I by investigating the behavior of counts in situations known to be virtually free of errors in classifying specimens. If counts of this kind appear to be Poisson distributed, then the classification model gains credibility; however, if such counts do not show Poisson behavior, then they strengthen I the case against the classification model. By amalgamating, or lumping, observed counts to the family level, we can I ensure that errors in classification are virtually absent. This is because a specimen from one family will almost never be mistaken for a member of another family. It is more common for a specimen to be misclassified as I belonging to a species that is incorrect but is within the correct family. Another frequent outcome of identification is the labeling of a specimen as a member of a family without specifying its species. For example, a I blueback herring (Alosa aestivalis) could be identified as an American shad (Alosa sapidissima), herring species (Alosa ~.), or herring family (Clupeidae). Such errors occur with some regularity, especially the failure to classify a specimen at the species level; an identifier may I find it difficult to decide on the species of a particular organism, perhaps because of damage to the organism during entrainment. For example, as was noted in Section 4.3, Engraulids other than bay anchovy are I extremely uncommon in the Hudson River, but many specimens are identified as anchovies (Engraulidae) rather than as bay anchovy (Anchog mitch;))j). I Using data amalgamated to the family level, therefore, has the effect of replacing counts that result from incomplete or inaccurate identification at the species level with complete precise counts of organisms in speci-fied families. To evaluate the classification model, three families were I examined: Clupeids, consisting of herring family (Clupeidae), blueback herring I (Alosa aestivalis), alewife (Alosa pseudoharenfus), American shad (Alosa sapidissima), and herring species Alosa ~.); I Engraulids, consisting of anchovies (Engraulidae) and bay anchovy (Anchoa mitchilli); and Morone, consisting of white perch (Morone americana), striped bass I (Morone saxatilis), and temperate bass species (Morone ~.). Replicate lumped counts in these families were examined for the 1982 I data from Roseton and the 1983 data from Indian Point described earlier in this chapter. The Poisson goodness-of-fit statistic S was computed, then R+ was found and SNB for the negative binomial distr~bution was c.om-I puted. The results are shown in Table 4.9. Counts for each p.s.u. are plotted in Figures 4.4 for Indian Point and 4.5 for Roseton. I
I Table 4.9 I Indian Point 1983 and Roseton 1982 12 Tests to Evaluate the Poisson and Negative Binomial Models I Indian Point Punrp 1 vs Pc~
'"K Poisson 'J..2 (64 d. f. )
Neg Bin X2 (63 d.f.) I Clupeid Larvae Engraulid Larvae 14.26 64.52 169.76 205.63 79*51 93.45 I 4.28 60.31 Morone Larvae Pump 1 vs AUTOSAM 398*93 I Clupeid Larvae Engraulid Larvae 8.82 3.66 204.33 1368.57 66.94 85.26 I Morone Larvae 2.96 521. 61 67.15 Pump 2 vs AUTOSAM I 67.43 113.69 90.82 Clupeid Larvae Engraulid L~rvae 2.89 1576.78 83.88 I Morone Larv9.e 7.79 314.43 85.72 Roseton I Poisson X2 Neg Bin X2
'"K ~22 d.f. 2 (21 d.f.) I Clupeid Eggs 21.88 24.81 22.54 Clupeid Larvae Morone Eggs 571. 07 78*73 52.14
- 29. 48 38.46 21.22 I
Morone Yoll,-Sac Larvae Morone Larvae 6.86 12.09 155.50 175*98 31.96 16.25 I I I I I I I
I 4.55 I Figure 4.4 Plots of Observ~ed Data, Indian Point* I 1983 Pumpsvs. AUTOSAM INDIAN POINT LUMPED COUNTS - 1983 I c:: H::.1 . I I I I 34 . I I ..... IJ1 ('\J TIME I Dotted lines = pumps Solid line = AUTOSAM I i:.i (" I!:I . INDIAN POINT LUMPED COUNTS - 1983 I G94 . I Q
~ 518 .
I _.I 0: [E
~ *~4~ .
I L.. -- W
~
I 1Gb* ./ I -1171 ......
\1":1 I
I Figure 4.4 (continued) I I I t-*II) I At,! PC' I t-*JT L_Ui-iF'ED C(:'Ur--*1T:::; - 1'3 :::: J I I I I I I I TIr---1E I I I I I I I I I
4.57 Figure 4.5 Plots of Observed Lumped Data, Roseton 1982 ROSETON LUMPED COUNTS--1982 42 (I) (!l 831 o H W
~ 21 -l u
10 N TIME ROSETON LUMPED COUNTS--1982 c~~~ 1580 w cr:
~1160 -l c
H w a.. 740
- l
-l u
320
-11'11'1 U'l 01 ('I) l"-
N TIME
I Figure 4.5 (continued) I ROSETON LUMPED COUNTs--1982 I I.~U 1112 I oif'
~
w
?4 I z ~ 46 o
L I I TIHE I ROSETON LUMPED COUNTS--1982 I
<~82 It.!
([
&214 I .--J w ~146 0::
I o L 78 I TIME C\I I ROSE TON LUMPED COUNTs--1982 I w ([
~ 158
([ I
.--J ~ 116 (JJ I
I
.--J o 74 w
z I
~ 32 o
L I TIME I
I I 4.59 The Indian Point triplicate data, analyzed for each pair of sampling loca-I trons "(Pump 1 vs. Pump 2, Pump 1 vs. AUTOSAM, Pump 2 vs. AUTOSAM), decisively reject the hypothesis of replicate Poisson observations, as shown by the very large values of Sp' The negative binomial distribution's I goodness of fit is marginal: about half of the values of SNB lead to rejec-tion of the negative binomial hypothesis at the .05 level. For the Roseton data, the Poisson distribution is rejected for three of the five family-I life stage combinations examined; the negative binomial is rejected for one of these five. I The evidence against the Poisson distribution is very strong. Even at the family level, where classification is not causing any difficulty, the data do not conform to the Poisson distribution. Thus some problem other I than classification must be largely or wholly responsible for the failure of the data to fit the Poisson distribution. We conclude that the lumped data do not support the classification model. I 4.5.4 THE PARAMETERS K AND c2 I As has been pointed out in (4.39) and elsewhere, K and c2 are connected by the relationship c 2 = l/K. We can benefit from this. ~or example, I we could estimate c2 by using (4.18) to obtain K+ and then taking l/K+ as an estimate of c 2. Similarly, we could estimate K by 1/~2, using I ~2 from (VI.5) of Appendix VI. It is clear that little would be lost if we were to select either K or c2 for sole use, dropping the other. I It was not always evident during the development of this work, however, that K and -c 2 would turn out to be essentially interchangeable. It seemed I at times that one of these might have substantial advantages over the other, based on some property of the classification model or similar considera-tions. For this reason, both K and c2 were retained and used in different I parts of the study. For example, K+ appears in the test of the negative binomial model in (4.17) and (4.18), while ~2 appears in the confidence I interval given by (5.14) and (5.15). It would be routine to substitute c 2 for I/K or K for l/c 2 and to make the necessary adjustments if one wanted to do so. We have chosen not to go to this trouble. I I I I
I 4.60 I 4.6 MODEL CHOICE FOR ENTRAINMENT OF JUVENILES I The entrainment of ichthyoplankton of the juvenile life stage is of par-ticular ioterest. To some extent, this ;s because juveniles have sur- I vived through the earlier life stages, making them more valuable both ecolo-gically and economically. The numbers of juveniles entrained at Bowline Point, Indian Point, and Roseton are lower than the numbers of early life stage ichthyoplankton entrained. (EA, 1984a, 1984b, 1984c). Nevertheless, I estimation of juvenile entrainment is an important problem. One point that is worth stressing is the fact that when a sample is analyzed I in the laboratory and is found to contain no juveniles of a particular species, a useful piece of information about juvenile entrainment has been obtained. Zero CQunts, for juveniles or any other life stage for that I matter, are informative about entrainment just as counts greater than zero are. In other words, zero counts should not be confused with a lack of information, such as might occur in the absence of sampling. I The methods of estimation to be treated in Chapter 5 can be applied to juveniles. These involve the derivation of confidence intervals for total entrainment under the negative binomial model and the Poisson model, which I is obtained as the negative binomial model with the parameter c 2 = o. If the use of the Poisson model can be justified, Poisson-based confidence intervals will be appropriate. These will be beneficial because they I are shorter than the confidence intervals for any other negative binomial distribution. (Chapter 5 will contain more details on this point.) The statement by Cassie, cited in Section 4.2.1, that the negative binomial I distribution approximates to Poisson when the mean is small, suggests that
~he n~gative binomial and Poisson models may give similar results for Juveml es.
I To investigate whether this theory holds, we can test whether repli-cate sample counts of juveniles satisfy the Poisson model of Section I 4.1 if appropriate data are available. The low level of juvenile entrain-ment makes it difficult to obtain suitable data, but recent replicate sampling has produced three such data sets. All three support the Poisson I modl~l for juvenile counts. The first data set, Table 4.10, is paired counts of bay anchovy juveniles from Bowline Point in 1983. When the Poisson test of Section 4.1 is applied to the 36 nonzero pairs (a paired count of I (0,0) is called a zero count and is discarded for purposes of this test), the statistic S = 38.80, which is much smaller than the upper 5% point of d X~6 distri~ution. In the Indian Point 1983 paired pump observations I discussed in Section 4.4, paired observations of juveniles. were obtained for white perch; I I I
I 4.61 I Table 4.10 I Collected to Evaluate the Assum~tion Bowline Point Plant, 1983 1984a: Table B-2 I I IIIIUUUI I R!:elicate 2 IIx 4o'ba!! !I! An!Ob2U I Vol. Vol. D,c,
~ ...hu. Jettw. J.u... ~ -1.!J. .hu. 1!att ...hL ~
08 AUluet 62.7 S 158 4 1 63.7 10 147 4 1 4S.8 0 143 5 2 46.9 2 I 09 Aueun 58.0 72 2 1 0 106 2 0 S7.4 0 S9.8 0 S7 71 0 1 1 1 S8.9 0 74 2 0 60.0 0 91 1 2 62.6 S9.3 0 141 3 0 7 80 2 0 63.3 10 85 67.4 13 109 0 2 I 10 AUIun 64.8 62.8 63.1 0 1 5 109 350 224 0 2' 0 0 1 0 2 2 69.4 Sa.8 63.8 U.2 7 0 0 4 119 96
!l42 Rn 2
1 1 0 1 0 0 1 63.2 2 109 1 0 64.7 2 Al7 0 3 I U AUIun 12 Aueuat 62.1 Sa.6 71.6 S4.1 22 12 0 1
*214 84 37 67 0
0 1 2 2 1 1 2 61.2 55.S 71.4 4 14 1 80 4S 9 0 1 1 1 0 2 51.6 2 24 0 0 53.9 4 46 2 1 54.4 1 24 I 13 Aueuat 15 AUlun 76.4 55.6 51.6 71.S 1 0 0 44 118 30 0 0 0 0 1 0 17.8 S6.9 51.5 0 0 1 38 142 48 2 0 0 0 0 1 0 0 0 116 1 0 71.9 0 93 16 Auauat 64.0 0 1 1 I 137 1 4 66.1 0 59 0 61.9 0 61 4 0 0 71.6 0 so 0 1 66.2 0 49 0 0 67.1 0 17 !ulun 64.3 76 0 1 0 37 0 2 61.3 0 9 0 57.2 0 91 0 18 Au,ue 1 1 59.8 0 68 0 0 I 19 !ulun 76.2 75.5 61.5 66.9 0 0 0 0 55 91 50 63 1 0 2 1 1 2 76.5 77.2 58.8 0 0 0 99 63 38 3 0 0 2 1 0 1 1 63.3 0 69 2 1 64.1 0 28 I
.1 0 63.4 0 34 2 0 63.1 0 41 3 0 69.3 0 38 1 73.1 0 44 1
2 1 71.1 0 26 0 1 22 !ueun 69.5 0 53 3 0 64.8 0 54 5 0 62.2 0 46 8 1 66.0 0 13
..23 Auaun 1 1 I 24 AUluU 80.1 60.0 S6.9 69.2 0
0 0 0 72 24 19 32 5 4 1 0 2 0 0 0 79.8 59.5 60.5 70.2 0 0 0 0 100 36 8 25 6 3 1 1 0 0 0 70.1 0 29 1 2 0 73.8 0 34 0 0 I 7.5 AUluet 26 Aueun 69.6 66.2 17.0 0 0 0 13 8 12 0 0 0 1 0 1 68.2 66.3 70.5 0 0 0 31 20 43 1 1 0 0 0 2 55.0 0 16 0 1 52.7 0 24 0 71.8 0 20 I 0 1 1 66.9 0 17 0 0 61.0 0 14 0 0 64.6 0 13 0 0 75.5 0 17 3 0 76.9 0 10 0 0
- l7 Auauat 68.3 0 22 0 0 67.2 0 28 0 1 7S.S 0 18 1 0 74.1 0 IS 0 0 I 'rotal Count llIun 72.4 65.23 0
73 1.49 3,483 23 71.1 6S 1.33 1 0 39 72.9 0.80 65.17 0 58 2,922 60 1 48 1 54 1.18 59.6 0.98 0.67 llIeelbn 64.10 0.0 SO.O 1.00 1.00 64.80 0.0 48.0 I StaDelarei n.. iatioa CoefficieDt of Variation (1) 7.78 11.9 4.11 275.8 64.S 90.7 1.66 124.8 0.89 7.69 111.2 11~8 2.97 251.7 43.4 72.8 0.0 1.39 141.8 0.0 0.87 129.8 I
I 4.62 I 1 1 1 2 4 1 3 5 2 1 S P
= 6,133 I 1 a a 2 1 3 5 3 1 1 and for striped bass:
I 1 1 2 0 3 a 4 a I 8.667 1 1 4 1 1 1 a 1 I The test statistic Sp was used here because the volumes of the paired I samples satisfied vi ~ v~ for each p.s.u. i. Neither observed value of Sp was close to the corresponding critical value. Thus all three of these I data sets are consistent with the Poisson model for juvenile counts. At Roseton juvenil es were not co 11 ected in suffi ci ent quantity to perform this test on any species. I The data sets of paired juvenile counts are all quite small, and the power of Sp to detect various departures from the Poisson distribution is I unknown. Consequently, the support shown here for the Poisson structure of juvenile counts must be regarded as tentative. Further replicate sampling data must be gathered before a final conclusion can be reached. I 4,7 MODELING ENTRAINMENT ABUNDANCE USING PLANT OPERATING FACTORS I The task of modeling entrainment abundance as a function of plant out-ages has received little attention in this study for two reasons. First, I the modeling of the discharge channel required a much greater effort than was originally expected. This effort has been well spent, though, for without accurate entrainment estimates it would be foolhardy to attempt I any further modeling. The second, and more important, reason that this task has not been given I attention is the lack of sufficient data to test a model against, To see the severity of this problem, consider the following: The desired end result is a model relating entrainment abundance (ea) to plant operation (po) through a functional relationship I ea = f(po). (4.43 ) I I I
....._111. _ _
- _ _ _11. ~!. ...1_-- --
- -..III_mn_ _. . ..-n _ _ _ _ _ _ _ _ _ IIII n
--1.----.--...**-***.***-**-----
I I 4.63 However, entrainment abundance is a function of both species abundance I (sa) and flow rate in the discharge channel (fr), where species abundance results from a complex pattern involving the density of orcanisms in the plant's immediate vicinity, diurnal variation, weekly variation, and I other factors. Thus a more realistic model ;s
= f(sa,fr). (4.44)
I ea Any attempt to model entrainment abundance as a function of plant opera-I tion while ignoring other variables cannot possibly lead to a realistic model. To remedy this situation, entrainment abundance must be measured over a variety of plant operating conditions while there is an attempt to I hold the other variables constant. Part of this problem can be overcome by using these other variables as covariates; however, with the data pre-sently available, the worth of such an endeavor is questionable. (Such an attempt should be based on a designed rather than an observational I data set.) Therefore, although a model can be fitted with the data pre-sently available, we feel that such a model can be given little credence due to the large number of uncertain relationships. I The model used to relate flow rate f to entrainment T over a time interval (O,t) is I T = f~A(x) f( x) dx = fJ~A(x )dx = tfX (4.45) I where ).(x) is the ichthyoplankton density in the discharge channel at the I instant x, a < x < t, and I is the average density over this period (O,t) of constant flow rate f. The question at issue is whether I I depends on f. This is critical for the estimation of entrainment reduction due to outages. Let T+T' denote the entrainment that would occur under full flow conditions, so T' is the entrainment reduction due to outage during the time interval (O,t). Similarly, let f+f' denote the I flow rate that would occur under full flow conditions. If I and the flow rate are independent, then I T' = tflI; (4.46) I T' can be estimated from the known values of t and f' and the estimated value of I, which is easily o~tained from the estimated true entrainment, I as
'" A"" ,..
I T' = tf'I = tf'(T/tf) = (f'/f)T. (4.47) I
I 4.64 I If, on the other hand, I is a function A(f) of the flow rate, then (4.46) must be replaced by I T' = t[(f+f')X(f+f ' ) - fI(f)]; (4.48 ) I to estimate T' here, the average density of organisms at full flow, I(f+f ' ), must be estimated from the knowri f and f', the estimated value I of I(f), and the furictional relationship connecting I to the flow rate. This is a much more complex estimation process than (4.47). I In the face of such uncertainty, it is usually the case that the simplest solution is the most reasonable. Thus, the linear extrapolation of (4.47) seems to be the most reasonable approach until experimental data from a I designed study become available. An attempt was recently made to perform a designed study of this kind at I the Roseton Generating Station (EA, 1984c, Sec. 5.7). To examine the relationship betwel~n entrainment densities and cooling water flow rates, continuous samplin9 was scheduled immediately before and after planned flow rate changes. A problem with the Roseton flow rate study is that it I took two hours for the flow rate to change from its high setting to its low one (or vice versa); in two hours the river conditions can change a great deal, which,makes it difficult to draw inferences from the com-I parison of densities before and after the flow rate change. If the I change could be made over a shorter transition period, say 15 minutes instead of two hour's, this difficulty would not be as great. I Another study of the relationship between plant flow and ichthyoplankton density was recently performed at the Bowline Point Generatirig Station I (EA, 1984a, Sec. 3.4, 4.6). Samples were collected at night, when den-sities were greatest. Analysis centered on bay anchovy, the only species expected to occur in large numbers. Two patterns of pumping were tested: I 2 pumps to 3, then back to 2; and 3 pumps to 2, then back to 3. Samples were co 11 ected on four of the schedu 1ed ten samp 1i ng dates. The study IS results "seem to indicate that flow and density are inversely related, I [although] this conclusion should not yet be too strongly embraced." (EA, 1984a, p. 4-17). I In order to address this question directly, it would be desirable, but impossible, to measure entrainment simultaneously at two or more dif-ferE~nt flow rates. Recommendations for special studies to determine I whether entrainment densities and cooling water flow rates are related will be made in Section 9.3. I I I
I I I 5. ESTIMATION OF ENTRAINMENT ABUNDANCE AND ENTRAINMENT REDUCTION I
5.0 INTRODUCTION
I The performance of a sampling design can be assessed by statistical cri-teria that evaluate how well the design achieves its objectives. The I main objective of a sampling design is the estimation of two quantities: the total entrainment abundance at a plant over the entire entrainment season and the effect of outages in reducing entrainment. Statistical I properties of estimators can therefore be used to make comparisons among the estimators associated with different sampling designs. A good sampl-ing design will lead to estimators of entrainment abundance and entrain-ment reduction that have low bias and low variance, relative to the I estimates from other designs, over a variety of conditions. In particular, a good sampling design must be able to deal with rapid changes in the level of entrainment over a short period of time. I Our goal is to obtain sampling designs that give the most accurate esti-mates possible for a fixed level of sampling effort. Changing the cost I of sampling judged acceptable would lead to adjustment of the design, resulting in a different design appropriate for the new level of sampling effort. I 5.1 PERFORMANCE CRITERIA CONSIDERED I Performance criteria will be discussed from the perspective of estimation. Our objective is to estimate both entrainment abundance and the reduction in entrainment due to outages, over the entire entrainment season. Esti-I mation is to be as accurate as possible for a fixed level of cost, over a wide range of conditions. The performance of a sampling design will therefore be judged on the basis of accurate estimation. This makes it I crucial to keep in mind, as Section 3.0 pointed out, that a sampling design must be viewed as an overall process that includes field collection of samples, laboratory analysis of these samples, and statistical estima-tion based on the laboratory data. In this framework it makes sense to I investigate the quality of estimation resulting from a given sampling design. I Low bias is one desirable feature of a design. Bias in a design may pro-duce estimates having a systematic tendency either to be too high (posi-tive bias) or to be too low (negative bias). A positively biased design I gives estimates of entrainment that are, on the average over a long series of entrainment seasons, higher than the (unknown) *actual level of entrain-ment. A negatively biased design does the opposite. A design is called I I
I 5.2 unbiased if its entrainment estimates are correct on the average. For an unbiased design, entrainment will be overestimated in some years and I underestimated in others, but over a long series of seasons the average estimate will approach the average true level of entrainment. Formally, A I bias is defined in terms of the expected value of the estimator T of the true entrainment level T: A E[l] T is positive bias A I E[T] = T is zero bias or unbiasedness (5.1) A I E[T] < T is negative bias. I A possibility that must be considered is the presence of bias in entrainment estima.tes due to the physical characteristics of the sampl ing gear. Poten-tial sources of bias are in the location of the intake orifice for the pumping samples, the velocity of water at the intake orifice, and the I direction of the intake orifice. Intakes for the pumps were more or less fixed and sampled from the same pOint in the plant discharge. The assump-tion of random mixing of the ;chthyoplankton in the discharge water is I necessary in order to consider the sampling to be representative. Accu-rate estimates of the entrained ichthyoplankton are also dependent upon the plant discharge water velocity matching the water velocity at the sampler I intake. There could conceivably be a positive bias if the water velocity at the sampler intake exceeds the plant discharge water velocity, a nega-tive bias if the opposite ;s the case. The direction of the sampler intake I orifice will most likely cause a negative bias unless facing directly into the plant discharge flow. Open-ended pipes pointing directly downward into the discharge would result in underestimation of the ichthyoplankton entrainment. Details are given by Martin Marietta (1981). I Low variance of entrainment estimates is another important feature that I any good design must have. Variance measures the degree to which an estimator is likely to differ if estimation takes place many times under identical conditions so high variance indicates a lack of precision in l I the es timati on procedure. Low vari ance means that if severa 1 independent samples were drawn from a single random process and each sample were used independently to obtain an estimate (e.g., for entrainment abundance), I the several independent estimates of the same underlying quantity would be very close to each other. This allows us to have confidence that the one sample we actually draw in practice will give an estimate that is I highly consistent with other samples we might have drawn. Conversely, high variance in an estimator means that several independent samples from a process are likely to produce estimates differing greatly from one another. This dispersion implies that the estimator cannot do a precise I job of estimating the quantity of interest. I I
I 5.3 I It is sometimes convenient to consider the standard deviation of an entrainment estimator, which is the square root of the variance of the estimator. The standard deviation is expressed in the same units of . I measurement as the observations, which often puts it on a scale that is appropriate for interpreting its magnitude. The variance and standard deviation are equivalent pieces of information, since knowing either one I makes it easy to calculate the other. Another useful measure of variation is the coefficient of variation or CV, I which is the ratio of standard deviation to mean. This measure of rela-tive dispersion is often expressed as a percentage, 100(standard dev)/ (mean) %. It measures variability, as expressed by the standard deviation, I but is scaled by the size of the mean. Thus the CV gives the size of the standard deviation as a fraction (or multiple) of the mean. One particu-larly useful property of the CV is that it enables us to calculate the I probability of erring by any specified percentage. For example, if the CV of an entrainment estimator is 25 percent, then we can be 95 percent certain that this estimator will not err by more than 50 percent, and 80 percent certain that the estimator does not differ from its target by more I than 33 percent. In general, the probability of erring by at most 100e% is (approximately) I (5.2) where ~(z) is the standard normal distribution. Since 1 - 2~(z) = .95 I for z = -1.96 ~ -2, then with 95 percent certainty the fractional error of the estimate will not exceed twice the CV. I The coefficient of variation can be viewed as a scaled version of the standard deviation; instead of expressing the level of variability in the same units as the observations, however, the CV expresses variability rela-I tive to the mean. This is useful as a criterion for comparing estimators and sampling designs because, for distributions such as the Poisson and negative binomial, the standard deviation increases as the mean increases. I The fraction by which an estimator is likely to be in error, which can be found from (5.2), is often more important than the absolute magnitude by which the estimator is likely to be in error, which can be found from the standard deviation. I Both bias and variance should be considered in evaluatin~ an estimator, as it is possible for an estimator to behave well on one of these criteria I but poorly on the other. For example, if we are concerned only with achieving zero bias and we ignore variance, an estimator based on data gathered on a single randomly chosen day during the entrainment season I can be unbiased, but any such estimate will have an enormous variance, making it practically useless. Conversely, if we are concerned only with minimizing variance and we ignore bias, an estimator that jgnores the data I and gives a fixed value, e.g., the previous year's estimated entrainment level, has a variance of zero--any sample collected this year leads to exactly the same estimation result--but has a huge bias because of year-I to-year fluctuation in entrainment. As these examples show, it is easy for an estimator to perform well in terms of either bias or variance by ignoring the other criterion; to be useful, an estimator must perform well I in both of these areas.
I 5.4 I A standard approach to estimation that combines variance and bias is to introduce the mean-squared error of an estimator. As the name suggests, this is the average value, over a long series of replications, of the I squared difference between the estimator and the quantity it seeks to estimate. This is the long-run average of the square of the distance between the estimator and its target. The relationship connecting the I mean-squared error (MSE), variance, and bias of an estimator is MSE = variance + (bias)2. (5.3) Thus an estimator that has low MSE is performing well as measured by a combination of variance and bias. An unavoidable complication is that variance, bias, and MSE are affected I by entrainment conditions. That is, the general level of entrainment, the distribution of entrainment across the season, and other similar factors are involved in detl:!rmining the variance, bias, and mean-squared error of I estimation for any design. If we knew the values of such factors, we could use this knowledge in comparing designs, but unfortunately these values are almost all/lays unknown. It is therefore necessary to seek I designs that do relatively well (compared to other designs) over a broad range of entrainment patterns. In particular, any good design must handle, as well as possible, entrainment patterns including several steep peaks I of entrainment. These occur at times that can be predicted only approxi-mate'ly, perhaps to \/li thi n a peri od of two to three weeks. Such peaks may be for different taxa, different life stages of a particular taxon, or both. I Note that no sampling design can be optimal under all conditions. To see this, observe that if we knew entrainment to be concentrated within a I single week, any optimal design would concentrate all sampling effort within that week. Yet no design of this type merits serious considera-tion, because its optimality under the conditions of a one-week entrain-I ment season is overshadowed by its extremely poor performance under vir-tually all other conditions, including those we are likely to encounter. I It would be mathematically convenient to restrict attention to the class of unbiased estimators. There are several reasons for this: I (1) It limits the class of estimators to be dealt with, giving us fewer candidates to compare. I (2) Since all estimators in this class are unbiased under all possible entrainment conditions, it is natural to judge estimators by comparing their variances under various entrainment conditions. I This is because, for any unbiased estimator, we have I I
I 5.5 I MSE = variance + (bias)2 (5.4) I = variance I since bias = O. Therefore, in dealing with the class of unbiased estimators, comparisons of MSE reduce to comparisons of variance. I (3) Unbiased estimators can often be derived without any distribu-tional assumptions, as in Cochran (1977, Ch. 10). I Unfortunately, the restriction to unbiased estimators has the major draw-back that it eliminates from consideration many potentially useful esti** rnators. An estimator with a small amount of bias, perhaps a few percent I or even less, can have a variance that is much lower than the best (minimum-' variance) unbiased estimator. A slightly biased estimator of this kind has much to recommend it and should not be discarded without consideration. As will be seen in later chapters of this report, the estimators that are I reasonable candidates exhibit little or no bias, so the MSE comparisons reduce to comparisons of variance. I Thus far, our discussion has assumed that the level of sampling effort, as measured by cost, is fixed. Variance, bias, and MSE of competing sampling designs can be compared under this fixed-cost assumption. There is another I perspective that can be useful in thinking about design comparisons: the sampling effort needed to obtain a given degree of accuracy. Among designs that are unbiased or only slightly biased, the best designs are those that I require the least sampling effort to achieve a specified standard error or variance for the estimates of interest. Sampling effort will not be known in advance if certain aspects of the sampling process, such as the I number of ichthyoplankton collected in samples throughout the entrainment season, are random. In this case, the expected value of sampling effort can be used in place of sampling effort. I 5.2 PERFORMANCE CRITERIA SELECTED I Two criteria have been chosen for the comparison of sampling design per** formance. The first is the bias of entrainment abundance and entrainment I reduction estimates. This bias should be zero or nearly zero. The second is the variance of these estimates for a fixed level of sampling effort, or equivalently, the amount of sampling effort required for the I estimates to attain a specified level of accuracy. Where appropriate, we will consider the coefficient of variation, which gives the level of accuracy relative to the mean, rather than the variance or standard devia-tion, which gives the absolute level of accuracy. I A good sampling design must perform well on both of these criteria over a I broad range of entrainment c~nditions, including situations of rapidly fluc-tuating entrainment abundance. I
5.6 I The absence of substantial bias is highly desirable in any estimator of entrainment levels. A systematic tendency to overestimate entrainment I abundance, for example, would create obvious problems. A systematic ten-dency in the other direction would be even worse, as it would raise ques-tions of whether entrainment abundance was being underestimated inten-tionally for purely Ilonstatistical reasons. The overwhelmingly preferable I behavior of an estimator is unbiasedness, guaranteeing that estimates are correct on average, so overestimates and underestimates will average out over time. However, if more precise estimates can be obtained by allowing I a small amount of bias to be present, this tradeoff may be justified. Minimizing the variances of entrainment estimates for a specified amount I of sampling effort is the second aspect of sampling design performance. Alternatively, this can be thought of as minimizing the amount of sampling effort necessary to a.chieve a particular level of accuracy in the estima-I tion, that is, to reduce the estimated variances of the entrainment esti-mates to specified levels. The variance or standard error of an estimate provides a measure of the precision of the estimate, with low variance I reflecting a high degree of precision. An optimal sampling design gives the greatest precision possible if the amount of sampling effort is fixed in advance; it requil'"es the minimum necessary sampling effort if the pre- I cision of the estimates is fixed in advance. I 5.3 SOME ASPECTS OF THE ESTIMATION PROBLEM One of the major goals of this study is to provide accurate point and I interval ~stimates of the total number of entrained ichthyoplankton. Using the models devl~loped by both empirical and theoretical means, this can be accomplished. Furthermore, by inspection of the form of the inter-val estimates, it is possible to get some idea of how to optimize the I sampling and estimation process. As will be seen, it is impossible to complete a total optimization of the process because of the lack of cer-tain necessary infonnation; however, with certain assumptions (to be I specified later), a design can be recommended. With further study, it may be possible to v':rify these assumptions. I The estimation probl,:m treated here has two aspects. One is to estimate the entrainment actually achieved through the realized plant discharge regime. The other is to estimate the reduction in entrainment due to the I difference between the potential and the realized discharge. Both aspects pertain to the actual temporal sequence of ichthyoplankton densities in the source water during the year in question, since the realized entrain- I ment at any given time is a sample from that source and the potential entrainment is simply a larger sample from the same source. In other words, the realized entrainment is a sample from the potential entrain- I ment, and hence is subject to subsampling error. (Experiments have been undertaken in the past to test whether realized I entrainment is indeed a representative sample of potential entrainment. This was tested by controlled changes in the discharge rate over short intervals to determine whether the density of entrained organisms changed with the discharge rate (EA, 1984a, Sec. 4.6; 1984c, Sec. 5.7).) I I
I 5.7 I While actual' density in the source water does vary through time according to some unknown stochastic process, our direct concerns are not with that I process but with the actual density. Indirectly, we must be concerned with the temporal pattern over the year as it affects our sampling plan and sampling error, but chance variations in actual density are incorpora-I ted into both the actual and potential entrainment for that year and are thus included in the estimation target. If each entrained organism had a price tag, the total cost for the year would be determined by the I realized entrainment, not by the process that led to the realized entrain-ment. I It has been suggested that time series techniques for investigating the underlying stochastic process can be used directly in estimating entrain-ment abundance at power plants. A recent paper by Madenjian and Jude (1983) takes this time series approach to the estimation of entrainment I abundance. Their approach attempts to explain the underlying process variation, which, as stated in the preceding paragraph, is not of direct concern. Therefore, the time series approach is not appropriate for the I. estimation problems addressed in this study. A fuller explanation appears in Appendi x VII. I 5.4 CONFIDENCE INTERVALS I For convenience, we will deal with each species-life stage combination separately. Thus, suppose we observe X1'X 2 ' ... 'X N' the observed counts of a particular species-life stage combination, over N p.s.u.1s. If we I define I A. 1
= number of organisms per unit volume in p.s.u. i ,
Vi = volume of water sampled in p.s.u. i t I Vi = volume of water in discharge channel in p.s.u. i t I , then the total number of organisms entrained during the N p.s.u.1s is N T= LV.A .* (5.5) i=l 1 1 I The quan.tity Ai is the mean density in the discharge channel during p.s.u. I i, hence ViAi is the true total entrainment during p.s.u. i. Under the assumption of randomness in the discharge channel, the observed X; is a I realization of a random variable whose mean is viA; = ~;' Therefore, I
I 5.8 the total number of organisms entrained can be expressed in terms of I the )J. S as I I 1 N T L (v./v*h*
= i=1 1 1 1 (5.6) I For now, assume that the sampling fraction Vi/Vi for p.s.u. i is the I
same for all i=I, ... ,N. This assumption will be dropped in Section 5.9. (It does, however, have some basis in fact. The quantity Vi is I not measured directly, but is calculated from engineering specifica-tions of the pumps involved. This flow rate is constant, or virtually constant, over long intervals of time, allowing us to take V.~V 1 for all I i in such intervals. Furthermore, when the sampling is done using con-stant velocity pumps, we also have vi=v for all i.) Under these conditions, I N T = (V/v) L ~.,
. 1 1 (5.7) I 1=
and we now concentrate on a confidence interval for N r u*, The observed I
. 1 1 1=
counts Xl' X2 "o"X N can be assumed to have a variance that is a quadratic I function of their means, i.e., 2 2 I Var(X.) =~. + Cu., (5.8) 1 1 1 I where the parameter c 2 has been discussed in Sections 4.2 and 4.5. Simul-taneous replicate sampling is necessary in order to estimate the para-meter c2 ; however, as an alternative approach applicable when there has I been no replicate sampling, the empirical conservative bound of Section 4.5.2, c 2 < 1, gives I 2 Var(X.) 1
< u* +
1
~ ..
1 (5.9) I Using the fact that LXi is a point estimate for LUi' two different con-fidence intervals can be constructed. Any confidence interval for LU' I 1 gives rise directly to a corresponding confidence interval for T: if I I
I 5.9 I ('1"2) ;s a confidence interval for LUi' then (V'I/ v'V'2/ v) is the cor-I responding confidence interval for T. I 5.4.1 CONFIDENCE INTERVAL 1 Since I' N N Var( LXi) = L u* + c L u*, 2N 2 (5.10) i=1 ;=1 1 i=1 1 I estimation of this quantity would lead to a natural confidence interval I for LUi' Under the assumption that the Xi'S are independent with E(X i ) = ui and Var(X i ) = ui + C2\.1~' it follows by substitution that I I (5.11)
= u*1 + (c 2+1)\.1.2 I 1 and hence I
I I N X.(I+c X.) E L' i=1 l+c 22 1 I 1
= ~ LE(X.) + ~
l+c i ' c l+c; 2 2 LE(X.) 1 I I (5.12) I I N
= L \.I.
i=1 '
+c 2 N 2 L \.I., ;=1 1 I
I I
I 5.10 g1vlng us two methods of estimating the variance of EX., First, recalling I 1 2 from Section 4.5.2 the empirical bound c ~ 1, we can estimate the vari-ance conservativelJ( by taking c2 = 1, which leads to I A2 S = I N X.(l + X.) I 1 1 (5.13)
-j =1 2 'I Second, if the estimate ~2 of (VI.5) in Appendix VI is calculated from rep'licate sampling data, the variance can be estimated by I -2 S = L\ ' "2 2 N X. + c X. ~'2 , (5.14)
I i=11+(: I In either case, a confidence interval for I~i is then given by I N N I A I ~.
. l' e: I x. . 1 1 +/- bS, (5.15) ,= 1=
where b is a constant chosen to give the desired confidence level, e.~., I 90% or 95%. 5.4.2 CONFIDENCE INTERVAL 2 I An alternative confidence interval can be derived by noting that I N N 2 N 2 Var{ LX.) = ;=1
;=1 1 I~* + c 1
L~' i=1 1 (5.16) I
=
N I~. + 2 (c /N) N L~' [2 + c 2 IN{~. - -)2 .
~
I i =1 1 [ i=l 1 i=1 1 If N L (~.1 - ~)2 is negligible as compared with I
. 1 1=
I I I
I 5.11 I I which will be the case if the ~.IS 1 are close together, then we I have I Var [ I X.] ;; (5.ll) i=l 1 I I N where we define T = L ~.. Taking c2 ,= 1 as an empirical upper I for which i =1 bound, a confidence interval for 1 T is then the set of all values of T I N 2 L X. I [ i=l 1
- t a2 (S.lB)
I or, equivalently, I N N 2 I N 2 L Xi 4 L x.1 + a 2 + (4/N) U1X; L ~. i=l ;=1 , (5.19) I i =1 1 2(1 - a 2/N) I where a 2 is a constant chosen to give the desired confidence level. This confidence interval is usable only if a 2< N, that is, the probability I constant a 2 must be less than the number of p.s.u.1s. If this does not hold, then the set of T satisfying the constraint ;s not an interval. I 5.5 EVALUATION OF CONFIDENCE INTERVALS I Confidence intervals are evaluated according, to two criteria, confidence I
I 5.12 level and length. Informally speaking, one wants to get the greatest amount of confidence for the smallest length. I The two confidence "intervals given here were compared by a simulation study. Five different valuE~s of the season length N were examined: N = 3, 6, 8, I 16, and 24. The number of p.s.u.1s in the season, N, may be thought of as the sample size in the confidence interval computation, so we should expect better performance as N increases. The true, unobserved means Pi I of the observed counts X. were drawn from three different populations: the uniform distribution'on the interval (1,10), the uniform distribution on (1,100), and the uniform distribution on (1,200). These distributions I represent entrainment sampling from low, moderate, and high organism den-sities, respectively. Each observed count Xi was simulated as an observa-tion from a negative binomial distribution with mean Pi and variance given I by (5.8) with c2 = 1, which is the empirical conservative bound for c2 described in Section 4.5.2. This choice of c 2 was made to provide a situa- I tion with high variance of observed counts, in which the confidence inter-vals for Pi will be wide; smaller values of c 2 would lead to shorter (bet-ter behaved) confidence intervals. I These five values of N and three populations for the mean Pi result in 15 I (= 5 x 3) combinations. For each of these, 1000 repetitions were performed as follows: (1) N means p., i=l, *** ,N, were drawn from the uniform dis-I tribution specified as the parent population of Pi; (2) for each value of Pi drawn, a random count Xi was simulated, having a negative binomial dis-tribution with mean p. and corresponding variance (the simulation of nega- I tive binomial random ~ariables is discussed in Appendix II); (3) the N means p. and N observations X. were used to calculate the true mean Lp., I the confidence intervals given by (5.15) and (5.13) for s~veral values of the constant b, and the confidence intervals given by (5.19) for several values of the constant a 2; (4) for each value of b in confidence interval 1, ~iven by (5.15), and each value of a 2 in confidence interval 2, given I by (5.19), the length of the confidence interval was found and it was determined whether the true mean LPi was covered by the confidence interval. I From the 1000 repet"itions of these steps, the average length of the confi-dence interval and the probability of covering the true mean were calcu-lated for each value of the constant, b in confidence interval 1 and I a 2 in confidence interval 2. These average lengths and coverage prob-abilities are reported for all five values of N and the first, second, and third populations for Pi in Tables 5.1, 5.2, and 5.3, respectively. I I I I
-~~-
5.13 Table 5.1 Simulation Study of Confidence Intervals N for I lJ* : i=1 ' Observed Coverage Probabilities and Average Lengths When lJiiS Are Uniform ~1,10) SamQle Coverage Probability Average Length Size N Constant Interval 1 Interval 2 Interval 1 Interval 2 3 1.50 .74 .90 25.63 113.96 2.00 .80 34.17 2.50 .83 42.71 3.00 .85 51. 25 6 1.50 .76 .87 39.29 63.88 2.00 .84 .95 52.39 159.95 2.50 .89 65.48 3.00 .92 78.58 8 1.50 .77 .85 46.77 64.21 2.00 .86 .95 62.35 123.21 2.50 .90 .98 77.94 352.53 3.00 .92 93.53 16 1.50 .80 .82 69.92 76.38 2.00 .89 .92 93.22 116.74 ~ 2.50 .93 .97 116.53 179.72 3.00 .96 .99 139.83 300.63 24 1.50 .82 .81 86.66 87.99 2.00 .90 .92 115.54 127.63 2.50 .94 .97 144.43 179.84 3.00 .97 .98 173.31 255.51 Coverage Probabilities and Average Lengths are calculated based on 1000 simulations
5.14 Table 5.2 Simulation Study of Confidence Intervals N for L ',.l.:
- i=l l.
Observed Coverage Probabilities and Average Lengths when ',.l. is Are Unl.form (1, 100) l. Sample Coveras:e Probabili tv Average Len9th Size ~ Constant Interval I Interval 2 Interval I Interval 2 3 1. SO .70 .87 235.73 1043,02 2.00 .75 314.31 2.50 .80 392.89
- 3.00 .84 471. 47 6 1.. 50 .78 .83 381.73 607.00 2.00 .84 .93 508.98 1517 . 54 2.50 .87 636.22 3.00 .90 763.46 8 1. 50 .79 .82 455.24 612.00 2.00 .87 .93 606.99 1173.01
;Z.50 .91 .96 758.73 3351.51 3.00 .92 910.48 16 1. 50 .84 .82 666.59 714.59
- 2.00 .91 .92 888.79 1091. 74 2.50 .94 .97 1110.99 1679.61 3.00 .96 .99 1333.19 2807.37 24 1. SO .85 .81 826.21 825.14 2.00 .91 .93 1101.61 1196.46 2.50 .94 .97 1377.01 1685.17 3.00 .97 .99 1652.41 2392.95 Coverage Probabilities and Average Lengths are calculated based on 1000 simulations
I I 5.15 I I I Simulation Stud~ Table 5.3 of Confidence In~ervals N I for L lJ*:
- . 1 l.
l.=: Observed Covera~e Probabilities and I Average Lengths When lJi's Are Uniform (1,200) I SamEle Size N Constant Covera~e prObabi1it~
!ntervaI I Interval Avera~e IntervaI r Len~th Interval 2 3 1.50 .69 .89 466.13 2057.75 I 2.00 2.50 3.00 .75 .80 .84 621. 51 776.89 932.27 I 6 1.50 2.00 2.50 .76 .82 .87 .84 .94 726.48 968.65 1210.81 ll51. 56 2903.91 3.00 .90 "--- 1452.97 I 8 1.50 2.00 .77 .86 .84 .93 867.57 1156.77 1172.71 2247.69 2.50 .90 .95 1445.96 6422.01 I 16 3.00 1.50 .92 . 83* .83 1735.15 1293.03 1390.97 2.00 .90 .92 1724.04 2125.09 I 2.50 3.00 .93 .96 .97 .99 2155.06 2586.07 3269.38 5464.54 24 1.50 .84 .83 1617.82 1609.29 I 2.00 2.50 3.00 .92 .95 .97 .93 .97 .99 2157.09 2696.37 3235.64 2333.47 3286.58 'lt666.96 I
Coverage Probabilities and Average Lengths are I calculated based on 1000 simulations I I I
I 5.16 I Examination of these tables shows that the first confidence interval outper-forms the second for smaller sample sizes (N = 3,6,8): for the same observed coverage probability, the first interval has shorter average length; for the I same observed average length, the first interval has higher empirical cover-age probability. For example, in Table 5.2 for N = 6, when the constant b = 1.50 is used in the second confidence interval, this interval covers the true mean E~i with probability 83% and has average length 607; when the con-I stant a 2 in the first confidence interval takes the value 2.00, this interval I covers the true mean 1~;th (coverage) probability 84% and has average length 509. Comparing these performances, we see that the first confidence interval has a higher coverage probability and a lower average length than the second. In the same table for N ~ 8, confidence interval 2 with empirical coverage I probability 82% has average length 612 (these correspond to the value 1.50 of the constant, b~whereas the competing confidence interval 1 with cover-age probabil i ty 87% has average 1ength 607 (these are obta i ned for the I value 2.00 of the constant, a 2). The same phenomenon occurs throughout Tables 5.1 to 5.3. The difference in performance between the two types of confidence intervals appears to diminish as N, the number of p.s.u. IS, becomes I larger'. Neither of the inter'vals performs particularly well in the case N = 3 (3 p.s.u. I~) in the sense that the confidence coefficients are relatively I low. That is, no matter which value of the constant (which is b for interval 1 or a 2 for interval 2) is chosen, the probability that the inter- I v~l contains the true value of E~i cannot be made satisfactorily large (90% or 95%). For example, when we consider the constant b = 2.00 for A I N = 3, the observed probability that EX.1 -+ 2.00S contains the true r~.1 is
.80 in Table 5.1, .75 in Table 5.2, and .75 in Table 5.3. Thus b = 2.00 does not give a .90 or .95 confidence coefficient. Similarly, the other I
values of b for interval 1 and a 2 for interval 2 fall short of the 90% and 95: confidence levels when N = 3. o For larger values of N, the performance is more satisfactory. In genera 1 , the recommended intl~rval ;s interval 1: I N N [1 N
~ ~. e: ~ X. ~b ~5" ~ X. (1 + X.) ] 1/2 (5.20) I i=1 ' ;=1' '-;=1' ,
where, for a 90 percent confidence interval, the recommended values of I b, based on Tables 5.1 to 5.3, are I I I
I 5.17 I N I 6 8 3.0 2.5 a 16 2.0 I For other values of N, linear interpolation will suffice for now; more I precise calculations can be done when a value of N is decided upon. rf replicate sampling were employed, it would be possible to estimate the I parameter c 2, and the result would be a smaller estimate of standard error. N More precisely, the estimated variance of r Xi from {5.14} can be written I a.s the sum of two pi eces: i=l I (5.21) I I The first piece, EX i , is the variance under the Poisson assumption; the remaln1ng piece can be thought of as measuring the error incurred because I of the departure of the data from the Poisson model. This departure may be attributable to causes discussed in Sections 4.2 and 4.5. If c2 = 0, the second term on the right-hand side is zero, and the estoimated vari-I ance becomes the expression (EX.) that is the optimal estimator under the Poisson model of variation. 1 I Conservative estimates of error (this means that the confidence intervals may tend to be wider than needed) are obtained by taking c2 equal to the empirical conservative bound of c2 = 1. However, this approach can be I very costly in terms of the accuracy of estimation, because a great improvement is possible from estimating c2 , using (VI.5) of Appendix VI, I and substituting (5.14) rather than (5.13) into the confidence interval f 0 rmu 1a (5. 15) . I An example will illustrate the techniques for obtaining the estimated variance and standard error of EX i . To focus attention on the central issues, we will deal with an artificially simple situation. Consider I an entrainment season divided into eight p.s.u. IS, each of which has a sample volume v drawn from a total volume in the discharge channel of V. The number of striped bass yolk*sac larvae identified in these samples is I as fa 11 ows: I
5.18 I X. 1 (5.22) I 1 2 13,000 14,000 I 3 18,000 4 5 43,000 69,000 I 6 63,000 7 8 7,000 3,000 I To obtain a confidence interval for Iu., we may proceed in any of three I di fferent ways: . 1 I
- 1. We may assume that the Xi'S follow a Poisson model and estimate the variance of EX , which is our estimator of Iu i , by IX;. The estimated i
standard error is the square root of this variance. An approximate 90% I confidence interval for EUi' based on the normal approximation to the Poisson distribution (Mood, Graybill, and Boes, 1974, p. 120), is I IX i
+/- 1.645(IX )1/2 i = 230,000 +/- 789, (5.23) I whe*re the value 1.645 is the 95 percent point of the standard normal I
distribution. This method is, unfortunately, not appropriate in many entrainment situations. As detailed in Section 4.1, the Poisson model is not consistent with the pattern of entrainment data that has emerged from I studies at all three power plants in which replicate -observations were taken. The Poisson model is mentioned here primarily as a remihder that although it gives a simple method and a short confidence interval, its I failure to fit the data adequately means that the confidence interval will be erroneous. That is, the interval that is purported to include the true underlying (unknown) value of Iu; with probability 90% may in I fact include this value with a much smaller probability.
- 2. A conservative estimate of the ~tandard error comes from equation I (5.13), in which c" has been taken to be 1. Applying this formula to the N = 8 counts given above, or equivalently applying equation (5.21) 't'lith A2 I
c **1, gives I S = [230,000 + i (11,326,000,000 - 230,000)]1/2 = 75,254. (5.24) I The corresponding 90% confidence interval for Eu. is 1 I
I 5.19 I EX; t 2.50 (75,254) = 230,000 t 188,135, (5.25) I where the constant 2.50 comes from the value of b corresponding to N = 8 in the table below (5.20). This interval is correct, but it is so large I that it may not be very useful: the length of this confidence interval is approximately 160% of the mean. This happens because the example was structured to have only eight p.s.u.ls. The number of p.s.u.ls plays a crucial role in determining the length of confidence intervals that can I be obtained from the entrainment data. To illustrate this, take the eight p.s.u. IS in the example each to be one week long. Assume that these eight weeks span the entire entrainment season for a particular species and life I stage of ichthyoplankton. (This is realistic for many species and life stages.) We could instead adopt a p.s.u. of 24 hours, which would give us 56 one-day p.s.u.'s instead of 8 seven-day p.s.u.ls. I Analyzing a given body of entrainment data as 56 one-day p.s.u. IS would give results that are very different from an analysis of the same data I structured as 8 seven-day p.s.u. IS. This is because the variance 52 is computed from equation (5.13) or (5.14). A 7-day p.s.u. produces one large term Xi(l + Xi )/2 in (5.1~ or a similar large term in (5.14); I if the 7-day period is divided into 7 one-day p.s.u. IS, though, the result is seven small terms of the same type, whose sum is less than the single I large term present when the seven days formed a single p.s.u. To illus-trate this point, assume that the data of (5.22) come from an entrainment season that has been divided into eight 7-day p.s.u.'s. The data from these 7-day p.s.u.'s give the confidence interval (5.25) for r~;. The I original data were undoubtedly structured with shorter p.s.u.ls, however, and p.s.u. IS of one day in length would give the following data for striped I bass yolk-sac larvae in 56 p.s.u.'s (where it has been assumed for conveni-ence that each of the seven days constituting a p.s.u. in the data of (5.22) has the same observed value, e.g., each of the first seven ene-day p.s.u. IS has observed value 13,000/7 = 1,857): I ; Xi for each of these (5.26) I 1 to 7 1,857 8 to 14 2,000 I 15 22 to to 21 28 2,571 6,143 29 to 35 9,857 36 to 42 9,000 I 43 50 to to 49 56 1,000 429 Applying formula (5.13) to these data, the first seven days now produce a combined term of 7 x 1,857{1 + 1,857)/2 = 12,076,071 instead of 13,000 x 13,001/2 = 84,506,500. *Similar reductions occur in the remaining terms, and (5.13) now yields
I 5.20 I S = [1,857(1+1,857)/2 + **.. + 429(1+429)/2J 1/ 2 ; (809,103,001)1/2 I
= 28,445. (5.27)
I The corresponding 90% confidence interval for Eu* is , EX i : 2.00 (28,445) = 229,999 : 56,890, (5.28) I where b = 2.00 is chosen pecause N = 56 is greater than 24. I This demonstrates forcefully that dividing the entrainment season into many p.s.u.ls, each of short duration, leads to more accurate estimates I of entrainment than using few p.s.u.1s of long duration. This fact was discussed earlier in Section 4.3. The variance, standard deviation, and coefficient of variation of entrainment estimates are all reduced by I adopting short p.s.u.1s instead of moderate to long ones.
- 3. An accurate and appropriate estimate of the standard error comes from I equation (5.14), in which c 2 is estimated instead of being set equal to 1.
If a series of replicate paired samples were taken during this entrainment season, yielding an estimate ~2 = .05, then applying (5.21), which is equi-I valent to (5.14), to the data of (5.22) gives I S = [230,000 ... ~ (11,326,000,000 - 230,000)J 1/ 2 = 23,228 1.05 (5.29) I The corresponding 90% confidence interval for Eu i , again using b is
= 2.50, I EX i : 2.50(23,228) = 230,000 =58,070. (5.30)
I I A value of c 2 like~ .05 is quite consistent with the values of ponding to several of the species-life stage combinations in Table 4.8. K corres- '" As Cassie has notE!d, values of c 2 between .1 and .2 are" commonly encount-ered (see Section 4.2.1). Changing the value of c 2 in this example would produce the following resul ts:
I I 2 5.21
" of !~aximum S C -S /0 I 480 0.6 °.02 .01 10,600 14,910 14 20 I .03 .05 18,169 23,228 24 31 .10 32,091 43 I .20 .50 43,449 61,445 58 82 .75 69,672 93 I 1.00 75,254 100 This table illustrates that there ;s substantial improvement possib1e I from replicate sampling, since it allows the estimation of c 2. It also shows that the Poisson assumption results in an unrealistically small variance. In many cases, using the Poisson variance will give systematic I underestimation, leading to erroneous confidence statements. .
To estimate the total seasonal entrainment T from (5.30), multiply through I by Vivo For instance, let v = 8,400 and V = 60,480,000, as might be the case for a one-week p.s.u. at Indian Point. Then V/v = 7,200, and a 90% confidence interval for T is I 1,656,000,000 ~ 418,104,000. I Estimates of total entrainment of striped bass yolk-sac larvae and juveniles for Bowline Point and Indian Point in 1983 are computed in Tables 5.4 to 5.9. P.s.u.'s are 3-hour periods. The confidence interval formulas in Section 5.4.1 are generalized in Section 5.9 to the case of II unequal sampling ratios Vi/vi; these generalizations are used in the calculations leading to Tables 5.4, 5.5, 5.7 and 5.8. For convenience in calculation and for information about the entrainment season pattern, II these tables are organized by one-week segments. The CV of the total number entrained during the entire season could be decreased by using replicate sampling to estimate c2 and then obtaining the estimated standard I error from formula (5.14) instead of (5.13). . Notice that in each table the coefficient of variation of the estimated I total number of striped bass entrained is much lower than the CV of the one-week estimated entrainment values. To understand the reason for this,
.. consider the simple ~uation ;n which N weekly entrainment estimates I T; are made, each having the same underlying mean T1 and variance var 1.
The seasonal entrainment estimator I .. N ... T = L T. (5.31)
. 1 1 I 1=
has variance equal to Nvar l and mean equal to NT l , so I
I 5.22 I Table 5.4 I Estimate of Total Entrainment of Striped Bass Yolk-Sac Larvae from Bowline Point 1983 Data I Week Beginning Total Estimated Entrainment Standard Error
- Coefficient of Variation ** I 22MAY83 6,649 4,077 61.3%
29MAY83 10JUN83 13,139 16,619 9,704 8,717 73.9% 52.5% I 12JUN83 102,896 27,852 27.156 19JUNB3 26JUN83 411,330 13,815 80,110 5,308 19.5% 38.4% I Entire Season 564,447 86,071. 7 15.2% I Approximate 90% confidence interval for total number entrained *** = 392,304 to 736,590 I
- Standard error calculated from formula (5.9) o
** Coefficient of variation = 100% x (Standard Error)/(Total Estimated Entrainment)
I
*** Confidence interval based on + 2 standard errors D
I I I
5.23 Table 5.5 Estimate of Total Entrainment of Striped Bass Juveniles from Bowline Point 1983 Data Week Total Estimated Standard Coefficient of Beginning Entrainment Error
- Variation **
19JUN83 26,496 9,857 37.2% 26JUN83 60,531 16,711 27.6% 03JUL83 54,673 16,336 29.9% lOJUL83 27,275 7,992 29.3% 17JUL83 13,805 4,831 35.0% 24JUL83 1,031 1,031 100.0% 31JUL83 0 07AUG83 7,107 3,381 47.6% Entire Season 190,918 27,257.5 14.3% Approximate 90% confidence interval for total number entrained *** = 136,403 to 245,433
- Standard error calculated from formula (5.9)
** Coefficient of variation = 100% x (Standard Error)/(Total Estimated Entrainment)
- Confidence interval based on + 2 standard errors
5.24 Table 5.6 Estimate of Total Entrainment of Striped Bass Yolk-Sac Larvae and Juveniles, Bowline Point 1983: Coefficient of Variation for Selected Sampling Ratios Yolk-Sac Larvae Sampling Estimated Season Standard Coefficient of Ratio Entrainment Error
- Variation **
Actual 564,447 86,072 15.2% 1,000 564,447 85,302 15.1% 2,000 564,447 86,791 15.4% 4,000 564,447 89,696 15.9% 8,000 564,447 95,240 16.9% Juvenil es Actual 190,918 27,257 14.3% 1,000 190,918 26,886 14.1% 2,000 190,918 28,576 15.0% 4,000 190,918 31,686 16.6% 8,000 190,918 37,133 19.4%
- Standard error calculated from formula (5.9)
- Coefficient of variation = 100% x (Standard Error)/(Total Estimated Entrainment)
_ ____11._.___...._1.,*** ______----
I 5.25 I Table 5.7 I I Estimate of Total Entrainment of Striped Bass Yolk-Sac Larvae from Indian Point 1983 Data I I I Week Total Estimated Standard coefficient of Beginning Entrainment Error
- Variation **
I 03MAY83 lOMAY83 11 ,318 38,279 7,524 14,331 66.5% 37.4% 17MAY83 62,249 20,240 32.5% I 24MAY83 31MAY83 105,991 267,897 29,100 60,003 27.5% 22.4% 07JUN83 500,861 105,608 21.1% I 14JUN83 21JUN83 241,726 28,261 115,204 12,452 47.7% 44.1% 28JUN83 9,216 6,518 70.7% I Entire Season 1,265,798 172,456.6 13.6% I Approximate 90% confidence interval for total number entraine~ *** = 920,885 to 1,610,711 I
- Standard error calculated from formula (5.9)
I ** Coefficient of variation = 100% x (Standard Error)/(Totalj Estimated Enrrainment)
*** Confidence interval based on ~ 2 standard errors I I
I I I I I I
I 5.26 I Table 5.8 I Estimate of Total Entrainment of Striped Bass I Juveniles from Indian Point 1983 Data Week Tota 1 Es tima ted Standard Coefficient of I Beginning E:ntrainment Error
- Variation **
2lJUN83 4,856 4,856 100.0% I 28JUN83 68,498 30,718 44.8% 0~jJUL83 1£~JUL83 122,755 78,147 32,539 23,821 26.5% 30.5% I 19JUL83 57,937 19,519 33.7% 26JUL83 26,303 10,563 40.2% I Entire Season 358,497 55,551.6 15.5% I Approximate 90% confidence interval for total number entrained *** = 247,394 to 469,600 I
- Standard error calculated from formula (5.9) I
** Coefficient of variation = 100% x (Standard Error)/(Total Estimated
- Confidence interval based on + 2 standard errors Entrainment)
I I I I I I I I
I 5.27 I I Table 5.9 Estimate of Total Entrainment of Striped Bass I Yolk-Sac and Juveniles, Indian Point 1983: Coefficient of Variation for Selected Sampling Ratios I Yolk-Sac Larvae I Sampling Ratio Estimated Season Entrainment Standard Error
- Coefficient of Variation **
Actual 1,265,798 172,457 13.6% I 1,000 2,000 1,265,798 1,265,798 162,996 164,898 12.9% 13.0% 4,000 1,265,798 168,638 13.3% I 8,000 1,265,798 175,878 13.9% I Juveniles Actual 358,497 55,552 15.5% 1,000 358,497 49,668 13.9% 2,000 358,497 51,441 14.3% 4,000 358,497 54,815 15.3% 8,000 358,497 61,006 17.0%
I 5.28 (5.32) I In other words, the CV of the sum of N weekly estimates with the same mean I and variance will be N- 1/ 2 times the CV of any weekly estimate. When the weekly estimates are somewhat different in mean and variance, a more general formulation of this result continues to hold. I For example, consider a season with N = 9 p.s.u. IS, in each of which A I the estimated entrainment A l l has a mean of E(T.) = T. = 100,000 and a var"iance of Var(T.) = 2,500,000,000, hence a standard deviation of A 1 SD(T } = 50,000. In each p.s.u., the coefficient of variation of the I i estimate is A A I
= SD{Ti)/E(T i ) = 50,000/100,000 = .500 = 50% (5.33)
I For the entire season, E(T) = 9(100,000) = 900,000 and , A A A Var(T) = Var(T 1) + .** + Var(T g) = 22,500,000,000, A SO(T) = 150 1 000, (5.34) CV(T) = 150,000/900,000 = .1667 = 16.67% = 9- 1/ 2(50%). Ta.bles 5.6 and 5. 9 compare the accuracy of the sampling actually performed with the accuracy that would have resulted from fixed sampling ratios V./V? 1000, 2000, 4000, and 8000 for every p.s.u. It is assumed that the pla~t 1 flow volume and the organism density observed in the sample are, for each 3-hour p.s.u., equal to the values that actually occurred. Such regular behavior of sample organism density, remaining fixed as sampling ratio V./v. varies from 1000 to 8000, should not be expected, as Chapter 6 will deta'n.' Nevertheless, assuming this regularity allows a simple comparison of dif-ferent sampling ratios. An important assumption in the method that produced Tables 5.4 to 5.9 is that each daily estimated entrainment figure is based 2" the number of organisms entrained in that day's sample. Each daily T. is an estimate of 1 Ti based on the sample taken on day i, so the daily estimates are inde-pendent of each other. ("Independent" is used here in its technical sense, A which means that knowing the error of estimation T.1 - T.1 would not give h you any information whatsoever about T. - T. for any j t i.) J J
I 5.29 I This .independence plays a key role in the method used in combining daily estimates into weekly and seasonal total estimates. The data from Roseton I do not have this independence, because there are days when no sampling occurred: for any such day, the estimated daily entrainment is derived by linear interpolation between the closest actual observations before and I after this day. The Roseton data are therefore not suitable for the approach used to construct Tables 5.4 to 5.9. A method of dealing with gaps in the entrainment sampling will be discussed in Chapter 8. I 5.6 CONFIDENCE INTERVAL LENGTH I AS A FUNCTION OF v We now consider the relationship between the sample volume and the length I of the corresponding confidence interval for the total entrainment T. We restrict attention to the case in which all samples have a common volume v and the volume of water in the discharge channel equals the same value V in every sampling period. I The total number of organisms entrained during the entire entrainment season is T = (V/v)t~i' It was shown in Sections 5.4 and 5.5 that a I A (conservative) confidence interval forAt~i can be constructed from the S of (5.13). This interval has length 2bS, so the corresponding confidence I interval for T, I (V/v)tX. + (V/v)bS, 1 - (5.35) I has length I (5.36)
= v.A. = VA.,
I Using the fact that EX l* 1 1 1 we see that (5.37) I I Tnerefore I (5.38) I I
I 5.30 It is clear that when all else is held fixed, the confidence interval length L decreases as v increases. The rate of this change in L depends I on the term I (5.39) I When v is much smaller than tAi' a small increase in v will cause this term G(v) to decrease substantially; however, when v is much 1arger than I rA i , the same incr~ase in v will cause only a" negligible decrease in G(v). No matter h,ow large v becomes, G(v) remains greater than L\~. By , I using tX~/v2 to approximate rA~, and EX./v to approximate rAo, we can
, 1 ",
investigate the relationship between sample volume and the length L of I this confidence interval for T. If replicate sampling data are available and the constant c 2 can be I estimated, the confidence interval for T is still A I (v/v)rx; +/- (V/v)bS, (5.40) A I but 5 is now given by (5.14). Routine algebra shows that I (5.41) I Again L decreases as v increases, and the relationship between sample volume and L can be examined. If ~2 is sufficiently close to zero, the I term ~2rA~/(1 + ~2) in L will be negligible, which would make the length I of the confidence interval proportional to v- 1/ 2 S.7 THE ~FFECT OF SAMPLE VOLUME I ON THE PRECIS[ON OF ESTIMATION Assume that there is continuous sampling in the discharge channel, so I every p.s.u. in the entrainment season is sampled. The annua1 entrain-ment T is the sum over all N p.s.u.1s of the within-period entrainment, I N T= }' T.,
. "1 1 =. 1 (5.42) I I
I 5.31 I and is estimated by I T* (5.43 ) I where the entrainment T; in p.s.u. i is estimated by I ... ... T. * (V./v.)',J.
", for i == 1, ... ,N. (5.44)
I If the variance function among rep 11cate
- counte d samp 1es 1S
. a 2==~ oj.
C22~ , then the coefficient of variation of the annual entrainment estimator I .. T is given approximately by I (5.45) I I The ev of the annual entrainment estimator T is estimated by I I cv <T)' [L T~ (!- + c2 ) ]11YI Ti (5. 46 ) lli I ihe effect of the sample volume on eV(T) depends on the magnitude of the constant c2* If c2 is equal .to . or very close to zero,. so the c2 term in I (5.46) can be ignored, then eV(T) is approximately proportional to v- 1/2 through the term l/~.. In other words, doubling every sample volume I Vj ~ould multiply eVer) b/2- 1/2 :: .71, quadrupling every v'i would mUltiply eV(T) by 4- 1/ 2 :: .5, and so on. This situation would occur if the observed entrainment data were found to satisfy the Poisson model, in I which c 2 = a holds. On the other hand, .. when c 2 is not zero, it will, in most cases, be larger than Vi/ViTi = llX i (since Xi is usually a large I number), so increasing... .. the sample volume v will have a negligible effect on the value of eV(T). This situation would occur if the observed I entrainment data were found to satisfy the negative* binomial model but not the Poisson model.
I 5.32 Increasing the length of the time period over which the samples are com-posited serves to increase the sample volumes vi and possibly the within-I period entrainment levels Ti . (This assumes that the flow rates in the sampler and the discharge channel are not changed while the p.s.u. 's are I lengthened.) Ideally, however, vi will be increased only during periods when Aiis relatively small, so*as to retain relatively constant entrain- I ment Ti across p.s.u.1s. This is because for a fixed number of p.s.u.'s N, the magnitude of o:rf)lIJ1T i is minimized when all of the T; are equal, I This results in a small CV(T). Thus, during periods of low entrainment the composite duration should be extended by combining adjacent p.s.u. IS, but only if these p.s.u.'s have a constant discharge volume V. I s.e, THE ESTIMATION OF I ENTRAINMENT REDUCTION For a given p.s.u. i, let Vi denote the realized discharge volume and let I V.1 + V!1 denote the potential discharge volume under full flow conditions. Similarly, let Ti denote the realized entrainment and Ti + T~ the potential I entrainment when the discharge volume is V., + V~. 1 A tion due to reduced flow is T~ and its estimator T~ is given by The entrainment reduc-I I T'., = (V!/V.)T 1 1 1 (5.47) I Temporarily, omit the subscript i on all terms (for notational convenienc:) until (5.52) below. Using the fact that I T' - T' = (V'/V)(T - T) + (V1/V)T - E(T') + E(T') - T' (5.48) I it fa 11 ows tha t I E[(T' .' T' )2JTJ = (V'/V)2T2(~ + c 2) + E(T' - TV'/V)2 fl (5.49) I
= (V'/V)2T2(~ c 2) [TV'/V - E(T')J 2 ).J + +
I
+ E[T' - E(T' )J2.
I I i', I , /. ,, II I I
, II
I 5.33 I Assuming a Poisson division of the entrainment between the volumes V and VI , we see tha t I (5.50) I Averaging over T gives E(T) = VA and Var(T) = VA, T' = VIA, and I E(1" .. T,)2 = ;:-;[1 + V'/V + (f.' + V'/V)(~ + c2)] (5.51) I = f72(lIT' + lIT + (l + l/T)(~ lJ
+ c2)].
I lJ Therefore I Cvcr~) [lIT!1
- liT. .;. (1 + l/r.)(~ + c2 )]1/2 I (5.52)
- I 1 1 1 lJ; I clnd, 'letting CV(I:Ti) denote the coefficient of variation of the estimator N ...
I 'j L Ti or the total entrainment reduction over all N periods,
=1 I
I I An estimator of the latter ;s given by I cv(d~), = (11th" tT~ l 2 1 1/lJ.1 + c
+ "'2 "2 'C + l/T~
1
+ lIT. , F2 (5.54)
I Note that Ti and Ti will ordinarily be large integers, so I ... . ... V./v.T .
+ c2 . ( 1/2 I CV(tTi) : (l/ET! ) , ) tT; 2 1 " (5.55) 1 + c"2 I
I 5.34 The task of decreasing CV(ET~) can be approached from several directions. The most clear and direct approach is to control the sample counts I X.1 : (v./V.)T. by increasing sample volume v., during periods when ichthyo-1 1 , plankton entrainment density is low, through more extensive compositing I of samples. This could also have the side effect of equalizing the T~ values, and for a fixed number of composites--that is, for a fixed 1 N I numbel~ of terms N in L r1 2--the magnitude of (ET~2)1/2/ET~ is minimized 1 1
.1 1=
when all of the T~ are equal. The latter effect could be deliberately I rather than incidentally achieved by using historic seasonal entrain-ment patterns in conjunction with the current outage regime to make the N values Ti for i=1, *** ,N approximately equal for the more important species I and life stages through appropriate compositing. I Enlar~~ement of the s:ampler ' s intake capacity to increase the sample volume per unit time would serve to counteract any crass-sectional clumping in the spatial distribution of entrained organisms in the discharge channel, and thereby to decrease the contribution of this factor to the value of I 2 c
- The resulting increased cost of counting and sorting larger numbers of organisms captured in a continuous sampling program could then be I counteracted by invoking subsampling at this stage of the monitoring pro-cess. If random mixing can be achieved in the plankton splitter then formula (5.53) for CV(ETi) applies and the clumping component of c2 I
will have been reduced. Quantification of this reduction can only be determined experimentally. I Formulas for estimation of total potential entrainment, T + T', can be derived by the same methods used for the reduction in entrainment, T'. I Total potential entrainment is estimated by
...'" N A A N I T+T' = T+T' = L (T.+T!) = L ((V.+V!)/v.)X. (5.56) i=1 ' i=1 ' ,
1 1 1 I For each p.s.u. i, using the fact that E(T;) = T; and (5.47), I E[(i 1*-T,.)(i 1!-T,!)] = E[(i 1*-T,.)aV!/v.)(i.-T.) + (V~/V.)T. - T!))
'11, 111, I = (V1~/V1*)E[(i.-T.)2] + [(V!/V.)T.-T!]E[T.-T.]
11
'11'11 I = (V~/V.)Var(T.) + O.
1 1 1 (5.57) I I
I "5.35 I Consequently, N I '" Var(T+T') A A
= L Var(T.+T~) = tE(T.+T!-T.-T~) #Ii, A 2 ] ,='1 " 1111 I
I A A ~
= Var(T) + Var(T ' ) + 2r(V!/V.)Var(T.). 1 1 1 (5.58)
I Thus
... ........... A " .... .....
N Var{T+T ' ) = Var{T) + Var{T ' ) + 2 I (Vi/V.)Var(r.) I where i =1 ' 1 (5.59) I A.. '" "" N
= I TI. . 2[V./V.T.+C ... "'2 J
Var{T ' ) = Var(tT!) 1 i 1 1 1 x2 + liT,! + liT,. * (5.60) I i=1 1 + C I Then (5.61) I ,.. ,.. A ""'" A A The formula for Var(Ti), from which Var(tTi) and CV(ETi) of (5.54) follow I immediately, is derived as follows. Error variance in the predicted reduction Ti due to an outage of volume Vi is comprised of two uncorrelated components of error. The first component is due to estimation error in I '" the achieved entrainment estimate Ti for th: realize~ discharge ~f volume Vi; this error contributes to the error in Ti sinc: Ti = (Vi/V;)Ti, and I the magnitude of this estimation error is (V,~/Vl*)(Tl* - Ti ). Its squared 2'" . 2 '" 2 l. "2 value is estimated by (Vi/vi) Var(X i ) =~i/vi) (Xi + c Xi)/(l + c ). I The second component of error is due to the deviation of the predictor ViTi/Vi from an entrainment Ti that would have occurred in an additional discharge of volume Vi. Here Ti is a chance variable, and assuming that I the total potential entrainment Ti +T1 during,p.s.u. i would be randomly partitioned between Vi and Vi, the squar:d error of the prediction, I [(ViTi/V;) - Ti]2, ;s then estimated by Ti[l + (Vi/V;)]. combining the two estimated components gives I Var(T~) = T~
"",, A 2(lIX; +c "2 "2 + l/T~ + l/T.
A A ) (5.61a} 1 1 1 +"C 1 1 I
I 5.36 5.9 THE CASE OF UNEQUAL I SAMPLING FRACTIONS v./V. The sampling fraction in the ith p.s.u. is the ratio vi/Vi' It has been I assumed until now that this ratio, or equivalently its reciprocal Ri = Vi/vi' is constant over all sampling periods, that is, R1 , R2 , R3 , I and so on have a common value. Violation of this assumption can lead to many difficulties, some more subtle than others. To illustrate the problems associated with unequal sampling fractions, we shall examine its I effect on the coefficient of variation (CV) of T, the estimator of total entrainment abundance. Before doing so, though, we shall discuss the derivation of confidence intervals in the presence of unequal sampling I fractions. Assume that we observe ichthyoplankton counts X ,X 2, ... ,X N collected 1 I during N p.s.u.ls. When sampling is continuous, N is the total number of p.s.u.1s ;n the entrainment season. Define I A. 1
= mean density of organisms per unit volume in the discharge channel during p.s.u. i ,
I
- v. = volume of water sampled in p.s.u. i ,
1 I V. = volume of water in discharge channel in p.s.u. i. I Then, defining ~i = viA;, we have I E(X,.) = ~i ' Var (X),
. = , + C2112... ,.* ~. (5.62)
I The total entrainment abundance ;s given by I N N T L (V./V.)ll. = L R.ll.,
= i=l ' , , ,= *1' ,
(6.63) I which is estimatE!d by I I I I
I 5.37 I ... N N T = L (V 1*/v.)X. = L R. X.* (5.64) I i=l " ;=1 ' , This estimate has variance I I Var(T) (5.65) I To develop an estimator of this variance, note that, as was pointed out in Section S.4.1, I I E X. + c X. 22[ 2' = lJ'1 + 2 2 (5.66) [ 1 + C C lJ*. 1 I This leads to two estimators that are analogous to the $2 of (513) and (S.14). First, if we use the empirical bound (see Section 4.5.2) c2 < 1 I and set c2 equal to 1, wP.' obtain the conservative p.s~imator - I (5.67) I Second, if we are able to estimate c2 for the particular species-life stage combination being considered, we obtain the estimator I (5.68) I As has been discussed in Section 5.5, although replicate sampling must be performed in order to calculate c2, the extra effort required is worth-I while: when c2 is very small, ... ... i.e., near zero rather than near one, the consequent reduction in Var(T) can be dramatic. I Based on studies of the equal sampling fraction case, where all Ri's are the same, and on asymptotic approximations of a theoretical nature, it I appears acceptable to use the values of the constant b, which were tabled in Section 5.5, in the case of unequal sampling fractions. An important caution is called for, though: the further from equality the Ri's are, the I I
I 5.38 less accurate this method will be. In other words, the nominal 90% con-I fidence interval may actually have a confidence level of 70%, 97%, or some other value far from 90%. It would take extensive simulation work to specify precisely the patterns of inequality among the R.ls that will I cause this inaccuracy. This possibility should provide an inducement to keep the R; IS as similar as possible. I Relatively small differences among the sampling ratios Ri will cause only minor inaccuracy. An example later in this section will show that if the I largest R; is twice as great as the smallest, the effe:t on the accuracy of estimation, as reflected in the value of the CV of T, is not large, while a greater discrepancy between the largest and smallest R. can lead I to substantial inaccuracy. ' Note that the preceding discussion of accuracy is not concerned with I absolute sample volumes, but rather with the allocation of a given sample volume. For example, suppose one has decided on a sample volume of 3600 m3 in a 24-hour period. How are those 3600 best allocated? The I answer is to allocate them so that the R.ls are kept as constant as possible. Deviation from this will caus~ a loss in accuracy, which becomes greater as the R.ts become more unequal. I
. 1 We now consider thE! effect of unequal sampling fractions on the coefficient A
I of variation (CV) of T, the estimator of total entrainment abundance. The coefficient of variation is the ratio of the standard deviation to the mean, which measures relative error. The square of the CV of I T ;s given by I N 2 2 2 L R.(u.+
. 1 ' .,
1= (CV)2 = ~~~--~~-- cu.) 1 I (5.69) [ I R'U.]2 i =1 1 1 I When Vi is constant over all p.s.u.1s, Ri will be constant only if Vi is, I that is, only if sampling is done with equal volume. It will now be shown that having constant Ri is an optimal case, which, in the case of constant Vi' means that sampling should be done with constant Vi. It should be I realized, however, that if V. varies, v. should also vary in such a way I that R., 1. o~ V./v., is constant.
, 1 If R.is 1
constant, say R.1 = R, then I I
I 5.39 I N 2 2 I (CV)2 R L (lJ*
= i =1' + C lJ.) , (5.70)
I N L Il*] [ i =1 ' . 2 I while whether Ri is or is not constant, it is true that I 1-1 I I (CV)~. ~.~N 1 I,-1 lli lJi2
+ c2 2 lli (5.71)
We cannot answer whether (CV)~. ~ (CV)~ in general, but sor1e approximate calculations indicate that in most cases this is so. Begin by writing I I (5.72) I The second term on the far right-hand side will, in many cases, be I negligible in comparison to the first term, so we have I (5.73) I where the second inequality always holds (its proof is an application of I Chebyshev's inequality). Relating this .last quantity to our lower bound on (CV)~., we have that, to our level of approximation, I 2 2
~
I , ( CV ) R. (CV) R . (5.74) I I
I 5.40 Therefore, unequal volume ratio sampling will lead to a higher degree of I relative error than equal volume sampling when E~~ >> E(~i-~)2, a cir-cumstance that occurs frequently. I To examine the effects of unequal R.'s, it is helpful to rewrite formula (5.69), using the relationships R.p. 1 1 1
= V.l.
1 1
= T.1 and T = IT.,
1 where Ti is I entrainment in p.s.u. i and summation is over all N p.s.u. IS. Substi-tution into (5.69) gives I I (5.75) I The first expression on the right-hand side depends only on the constant c2 and the daily entrainments T., and not on the sampling ratios R.. Con-1 1 I sider an entrainment season consisting of 320 3-hour p.s.u. IS (that is, 40 days), in which the actual entrainment is 10,000 organisms in each of 32 p.s.u. IS, 5,000 organisms in each of 48 p.s.u. IS, and zero in the remain- I ing 240 p.s.u.ls. (These simple numbers have been chosen to simplify the computations, but are not entirely unrealistic.) Assume that c 2 : .2 and that Vi is constant for all p.s.u.ls. We now compare three sets of Rils: I Number of p.s.u.ls Entrainment/p.s.u. First Rils Second Rils Third Ri IS I 32 48 10,000 5,000 2,500 2,500 4,000 2,000 10,000 1,666.7 I 240 o I The sampling ratios Ri in p.s.u.ls with Ti = 0 do not affect the computa- I tion, so they are omitted from the table. The three sets of R.IS 1 here distribute the same total sample volume differently among the p.s.u.ls. (If Vi is 600,000 cubiic meters per 3-hour p.s.u., the volume sampled using the I first R.IS 1 totals (32)(600,000/2,500) + (48)(600,000/2,500) = 19,200 cubic meters; I the second and third R. IS yield the same total.) case: 1 Computing the CV for each I I I
I 5.41 I T = 32(10,000) + 48(5,000) = 560,000 I ET~ = 32(10,000 2) + 48(5,000 2) = 4,400,000,000 I , c2ET~/T2 = .2(4,400,000,000)/560,000 2 = .00281 I First R;IS: LT;R;/T2 = [32(10,000) (2,500) + 48(5,000)(2,500)]/560,000 2 I = 1,400,000,000/560,0002 = .00446 I CV = (.00281 + .00446)1/2 = .0853 I Second R;IS: LT;R;/T2 = [32(10,000)(4,000) + 48(5,000)(2,000)]/560,000 2 I = .00561 I CV = (.00281 + .00561)1/2 = .0918
+ 48(5,000)(1,666.7)]/560,000 2 I Third R;IS: ETiRi/T2 = [32(10,000)(10,000)
I = .01148 I CV = (.00281 + .01148)1/2 = .1195 The difference between the first case, in which all Ri IS are equal, and the I , second case, ;n which the largest R. ;s twice the smallest, ;s not great;
~
the CV of T has increa~ed only a small amount because of the unequal R;IS, I so the variability of T is almost the same as when all R; IS are equal. Using the constant b with these unequal R;IS is therefore quite accurate. I In the third case, though, ;n which the largest Ri ;s six times the smal-lest, the CV has increased by 40% over its value when all R;IS are equal, I so using the constant b here does not give a correct picture of the
~
variability of T. I I
I 5.42 The sampling in this example was constructed to give an idea how much the CV can be affected by unequal Ri's: sampling is performed at a lower rate I (because Ri is higher) during high entrainment conditions than during low entrainment conditions, which gives less precise information about high I entrainment periods, producing ultimately a less accurate estimate of T. When the p.s.u. IS with high R.1 are spread more evenly among high and low A entrainment conditions, the estimate T will be less affected than in the I extreme case analJ1zed here. Thus the analysis of situations in which the largest Ri is twice the smallest should be quite accurate. I In principle, one might improve on the accuracy of estimation resulting from equal Ri's by doing the opposite of what was done in this example: sampling at a higher rate during high entrainment conditions than during I low entrainment conditions. In practice, this should not be attempted for several reasons. First, a gain in accuracy will be made only if it is correctly anticipated which p.s.u.ls will have high entrainment and which I will have low entrainment; if these forecasts are incorrect, a loss in accuracy will result. Second, any potential gain will be negligible unless the departures from equality of the Ri's are extreme, leading to a I great loss in accuracy if preseason judgments about the occurrence of high and low entrainment p.s.u.ls are incorrect. Third, while the process of predicting high and low entrainment periods might be feasible for a single I taxon and life stage, it is intractably complex to attempt to choose unequal Ri's that will simultaneously be effective for many taxa and life stages of ichthyoplankton. In short, any attempt to improve on equal Ri's I carries a high risk of producing inaccurate results for one or more taxa and life stages of interest. We conclude, as stated earlier in this section, that the Ri's should be kept as similar as possible. I Throughout this discussion it has been assumed that V., 1 v., 1 and v./V. 1 1 are I known. If they are not known and must be estimated, there is a random amount of difference between our data value for Vi and the underlying, unknown true value of V., and similarly for v.1 and v./V .. The presence of I 1 1 1 these random error's would have serious mathematical consequences for the entire analysis. We therefore reiterate the critical assumption that Vi and vi have been fixed, known constants wherever they have appeared. I I I I I I
I I I
- 6. A SIMULATION STUDY OF ENTRAINMENT I
6.0 INTRODUCTION
The behavior of entrainment, the estimators of entrainment, and their I variances will be examined in this chapter. Entrainment sampling is first examined over the course of a single day. This is done for a wide range of organism densities, sample volumes, and values of c2, the parameter I . t he qua d 1n ' var1ance rat1c . mo de1 a2 = p + C2P2 d'1scusse d ear 1ler
. 1n. th 15 report (e.g., Section 4.2). Both uniform variation throughout the I sampling day and diel variation are considered. Estimators based on individual sampling days are discussed in detail. The effects of sample volume, organism density, temporal pattern (uniform or diel), and the I quadratic variance parameter c2 on accuracy of entrainment estimation are treated. An important conclusion is that sample volumes of 150 cubic meters per 3-hour period, or 1200 cubic meters per 24-hour day, are suf-I ficient for purposes of entrainment estimation; larger samples produce only slight gains in accuracy of estimation.
I The estimation of entrainment over an entire sampling season is then treated. A season consists of a series of sampling days on which organisms are present with density that diffe~s from one day to another. Both a short entrainment season of 18 days and a long season of 34 days I are simulated *. The estimation techniques developed in earlier chapters of this report are shown to produce accurate estimates of entrainment. I 6.1 STRUCTURE OF THE SIMULATION I STUDY OF DAILY ENTRAINMENT The simulation of entrainment sampling presented here ;s structured in terms of a 24-hour day divided into eight 3-hour periods. The behavior I of entrainment sampling over the course of a day is examined over a wide range of A, the density of organisms in the discharge channel; c 2, the I quadratic variance model parameter; and v, the volume of a sample drawn from the discharge flow over a three-hour period. All combinations of the following values have been examined: I c 2 = 0, .05, .10, .33, .50, and 1.00 I A = 10, 50, 100, and 250 organisms/IOOO cubic meters (6.1)
*- v = 100, 150, 200, and 300 cubic meters/3-hour period
6.2 I I The values of c 2 represent various degrees of departure from the Poisson model, all of which are within the negative binomial family of distribu-tions. When c 2 = 0, there is no departure, because the negative binomial I distribution coincides with the Poisson in this case. As c 2 increases, the corresponding negative binomial distribution departs from the Poisson to a greater and greater degree. In particular, as was discussed in I Section 5.5, the confidence intervals for entrainment estimates must be made wider as c 2 becomes larger. I The values of A represent a variety of possible organism densities within the discharge flow. A density of 10 organisms per 1000 cubic meters ;s rather sparse. A density of 250 organisms per 1000 cubic meters, on the I other hand, is relatively high. The density A is for the entire discharge flow, not merely for the sample that is drawn. The density during a 3-hour period equals the number of organisms present in the entire discharge I flow during the period divided by the number of lOOO-cubic-meter volumes of water in the discharge flow during the period. The entrainment sampl-ing process aims to estimate the organism density in the entire discharge I flow during a given period from a small sample of the discharge flow. (Estimating discharge flow density and total organism count during a given period are equivalent problems because the discharge flow volume is I assumed known.) This simulation study addresses the question of how well the number of organisms entrained during the sampling day is estimated when samples are of the volumes v, ranging from 100 to 300 cubic meters I per 3-hour period, listed above. Entrainment sampling procedures in which 9ne sample per period ;s drawn are examined, as are procedures involving replicate sampling of two samples per period. I A channel flow volume of 100,000 cubic meters per 3-hour period has been adopted for numerical convenience. There is no loss crf generality from this choice; conversion to different discharge flow volumes will be I described in Section 6.3, after the details of the simulation have been treated. I At each of the 96 (6 x 4 x 4) possible combinations of c 2 , A, and v, 100 simulated days were run. To simulate one day with a specified combination of c 2 , A, and v, the following steps were taken. I
- 1. Random values of the density, based on the fixed value A, were genera-ted independently for the day's eight 3-hour periods. These random den-I sities, which will be denoted by L, reflect the fact that the density of organisms is not fixed throughout the day; density varies from one 3-hour period to another during the day. Two patterns of density variation were considered, uniform variation and .. diel variation:
I 6.3 I
- a. Uniform variation. The density A (=10, 50, 100, or 250) was per-I turbed by adding uniformly distributed random noise. This resulted in independent values of L having a Uniform(.SA,l.SA) distribution. That is, starting with the given density A, random I values were gener~ted from the interval between .SA and 1.SA; each random value was independent of all others and was drawn from a uniform distribution on this interval. This pattern of varia-I tion can be interpreted as consisting of 3-hour organism den-sities that vary from -50% to +50% of the density A, throughout the entire day. Thus the average organism density that is I realized in the simulated day's set of eight 3-hour periods ;s approximately A.
I b. Diel variation. In this pattern of variation, three of the eight 3-hour periods have high densities relative to the other five periods. For comparability with the uniform variation case, we want the average organism density over the eight 3-hour periods I to be approximately A, where A is 10, 50, 100, or 2S0 as before. To accomplish this, the random densities L during the three high-density periods are equal to 2.2SA plus uniformly distribu-I ted random noise introducing variation from -50% to +SO% of the density 2.2SA. During the five low-density periods, the random densities L are equal to .2SA plus uniformly distributed random I noise introducing variation from -100% to +100% of the density
.2SA. In other words, the densities L during the five low-density periods are independent draws from a Uniform(O,.SA) distribution.
The average organism density obtained in the simulated day's set I* of eight 3-hour periods is approximately I [3(2.2SA) + S(.25A)]/8, (6.2) I which reduces to A. This makes the simulations of uniform vari-ation and diel variation for a common A comparable, because they I have the same underlying daily mean density. These two patterns of density variation were chosen to reflect important I biological patterns, although the choices inevitably were somewhat arbi-trary. The uniform pattern of variation represents the situation in which no strong diel effect is present, so the density fluctuates about a constant level throughout the day. Fluctuations varying from 50% below to 50% above I the baseline density were judged to be a wide enough range to avoid the pat-tern of a density that remains essentially constant throughout the day, and yet narrow enough to be consistent with the interpretation that density I varies about a fixed baseline level. The diel pattern of variation repre-sents the situation in which three of the day's eight periods have densities that are quite high compared to the remaining five periods. This would I occur if organism densities were systematically higher during nine hours (not necessarily nine consecutive hours, but any three 3-hour periods during I
6.4 I the day) than during the rest of the day. Random variation in the diel pattern was motivated by the same considerations as in the uniform I pattern. To simplify the exposition, the case of uniform variation will be I described. The case of diel variation can be similarly described with only minor modifications.
- 2. Two random sample counts were generated from each of the simulated I
day's eight 3-hour periods. This simulated replicate sampling at two locations in the discharge channel. The first step in this process was to calculate p, the expected value of an observed sample count, for a I specified period, for example, the first period. This is found as the product of the sample volume and the mean organism density L, randomly generated in step (l){a), for this period: I p = {Sample volume}*{Mean density} (6.3) I
= v*L I
The sample volume and mean density must be expressed in compatible units: if L is in organisms per 100 cubic meters, then v must be expressed as a fraction (or multiple) of 1000 cubic meters, so a sample of 300 cubic I meters in a 3-hour period must be written as v = .3. Thus p is the mean sample count u~der these sampling conditions. I Two independent random sample counts, X and Y, were then simulated using
- p. For c 2 = 0, these counts were simulated as Poisson(p) random vari-ables with mean, or intensity parameter, p. For any c 2 greater than 0, I
the two counts were simulated as negative binomial (K,p) random variables with parameter values K and p determined by c 2 and p: I (6.4) I p = l/(c 2p + I}. I The relationships in {6.4} and the method of simulation are both detailed in Appendix II. The parameter p can be viewed in any of three different ways. First, it can be thought of as a function, given by (6.4), of the I 2 constant c discussed by Cassie in Section 4.2 and the mean count p of a sample of specified volume taken in a specified p.s.u. This view of p is I essentially as an empirical constant, which changes with the p.s.u. mean count p. Second p can be thought of as equalling S/(S+v} of (4.15), 1 where the negative binomial distribution was derived as a Gamma (K,S) mixture of Poisson variables, so p depends on the parameters of the I I I
I 6.5 I Gamma mixing distribution and on the sample volume v. Third, p can be thought of as given by (6.4), the mean count ~, and the classification I model constant c2 given by (4.33) in Section 4.5. Of these three views, the first is the least restrictive, and therefore the most satisfactory. In all three interpretations, p changes from one p.s.u. to another. I (It was pointed out in Section 4.2 that the negative binomial family includes the Poisson family. Thus, although the parametrization used I here for computational purposes makes it appear that the two families are entirely different, this is not the case.) This process was carried out for each period in a simulated day. Two I independent sample counts, X and Y, were generated for each of the eight 3-hour periods, based on that period's randomly generated mean density L and the sample volume v. I
- 3. These simulated sample counts were used to compute one-sample and two-I sample statistics for the sampling day. One-sample statistics were based
, on equations (5.1~ and (5.15), using only the simulated X counts, and assuming that c 2 = 1. This approach ignores the Y counts, which are replicates of the X counts in the eight periods, proceeding as if only the X counts, one per period, were available. Two-sample statistics were calculated from equation (5.21), as it appears and with c replacing I A c, using both the simulated X and Y counts. This was done in two ways: using the known value of c 2, as would be done if c2 were known from his-torical data; and using an estimated value ~2 based on replicate observa-I tions X and Y. . The one-sample statistics computed for each simulated day are estimates I of: I Daily total (the true daily total entrainment) T A Variance of T I Coefficient of Variation of T A I All of these are computed, as discussed in Section 6.2, from. data con-sisting of one sample per period. The estimated variance and coefficient of variation are based on the assumption that c2 = 1. I The two-sample statistics computed for each simulated day are estimates of: I I I
I 6.6 Da ily Tota 1 T I Variance of i (c 2 known) Coefficient of Variation of i (c 2 known) I Variance of i (c 2 estimated) I Coefficient of Variation of i (c 2 estimated) I All of these are computed, as discussed in Section 6.2, from data con-sisting of two samples per period. I
- 4. The true realized value T of the sampling day's total organism count was computed from the formula I (6.5) I I
where V is the v61ume of discharge flow during a 3-hour period, which is 100,000 cubic meters in this simulation. The true values of the variance I and the coefficient of variation of the one-sample estimator of T were obtained from formlJlas to be treated in Section 6.2. Steps (1) to (4) were executed to simulate a single day. For each com-I bination of c 2, A, v, and temporal pattern (uniform or diel variation), 100 of these simulated days were generated. The true values discussed in I (4) and the one-sample and two-sample statistics described in (3) were averaged over the 100 simulated days; these averages appear in Tables 6.1 to 6.8. Standard deviations rather than variances are given in these I tables, each being computed as the square root of the average variance over the 100 simulated days. - I 6.2 FORMULAS FOR THE SIMULATION STUDY OF DAILY ENTRAINMENT I A summary of the entrainment process described in Section 6.1 begins with the parameters defined there for an eight-period sampling day: I [text continues on page 6.55J I I I
I I SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS I Channel volume=100,000 cubic meters per 3-hour period. Mean density subjected to uniform perturbations. I Mean Density of Organisms= 10.01 1000 cubic meters Value of c-squared=(I.00 Volume of Sample in cubic meters I 100 150 200 300 TRUE DAILY 8010.23 8010.23 8010.23 8010.23 I TOTAL TRUE STD DEV 2830.24 2310.88 2001. 28 1634.04 I TRUE COEFF. OF VARIATION 35.46 28.96 25.08 20.47 I DAILY TOTAL (one sample) 7980.00 8120.00 7975.00 8106.67 STANDARD DEV. 3485.69 3126.94 2891.80 2649.74 I (one sample) COEFF. OF VAR. 44.63 38.53 36.11 32.46 (one sample) I DAILY TOTAL (two sample) 7655.00 8040.00 7870.00 8111. 67 I STANDARD DEV. (ceq known) 2766.77 2315.17 1983.68 1644.35 STANDARD DEV. 3000.83 2472.07 2196.59 1795.52 I (csq estimated) COEFF. OF VAR. 37.30 29.40 25.60 20.50 (csq known) I COEFF. OF VAR. (csq estimated) 40.27 31.18 27.97 22.15 I ESTIMATE OF C-SGlUARED
.41 .14 .18 .07 I The value given is the average over 100 simulations.
I I I I
6.8 I Table 6.1 (continued) SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS I Channel volume=100,000 cubic meters per 3-hour period. Mean density subjected to uniform perturbations. I Mean Density of Organisms= 10.01 1000 cubic meters Value of c-squared= .05 I Volume of Sample in cubic meters 100 150 200 300 I THUE DAILY 8010.23 8010.23 8010.23 8010.23 TOTAL TF<UE STD DEV 2905.78 2402.80 2106.75 1761.64 I TRUE COEFF. OF VARIATION 36.40 30.09 26.38 22 .. 05 I D~HLY TOTAL (one sample) 7990.00 7826.67 8045.00 8066.67 I STANDARD DEV. 3519.94 3071.73 2896.55 2697.32 (one sample) COEFF. OF VAR. 45.09 39.46 35.99 33.20 I (cme sampl e) DAILY TOTAL (two sample) 8330.00 7860.00 7792.50 7990.00 I STANDARD DEV. (c:sq known) 2967.14 2377.95 2073.49 1756.86 I STANDARD DEV. 3228.00 2514.84 2175.14 1807.08 (c:sq estimated) COEFF. OF VAR. 36.52 30.76 26.93 22.16 I ( c:sq known) C[)EFF. OF VAR. (c:sq estimated) 39.34 32.23 28.07 22.61 I ESTIMATE OF C--SQUARED
.41 .19 .15 .08 I
The value given is the average over 100 simulations. I I I I I
I 6.9 I Table 6.1 (continued) SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS I Channel volume=100,000 cubic meters per 3-hour period. Mean density subjected to uniform perturbations. I Mean Density of Organisms= 10.01 1000 cubic meters Value of c-squared= .10 I 100 Volume of Sample in cubic meters 150 200 300 I TRUE DAILY TOTAL 8010.23 8010.23 8010.23 8010.23 TRUE STD DEV 2979.40 2491.34 2207.20 1880.60 I TRUE COEFF. 37.31 31.19 27.63 23.53 OF VARIATION I DAILY TOTAL (one sample) 8220.00 8273.33 7705.00 7903.33 I STANDARD DEV. (one sample) 3641. 43 3247.56 2891.37 2699.18 COEFF. OF VAR. 44.81 39.24 37.14 33.76 I (one sample) DAILY TOTAL 7930.00 7940.00 7772.50 7823.33 (two sample) I STANDARD DEV. (csq known) 2960.13 2483.60 2171. 61 1853.45 I STANDARD DEV. (csq estimated) 3193.74 2624.67 2240.26 1962.42 COEFF. OF VAR. 38.35 31.80 28.36 23.83 I (csq known) COEFF. OF VAR. 40.73 33.06 28.84 24.67 (csq estimated) I ESTIMATE OF C-SQUARED
.49 .30 .16 .17 I The value given is the average over 100 simulations.
I I I I
6.10 I Table 6.1 (continued) SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS I Channel volume=100,000 cubic meters per 3-hour period. Mean density subjected to uniform perturbations. I Mean Density of Organisms= 10.01 1000 cubic meters Value of c-squared= I Volume of Sample in cubic meters 100 150 200 I TRUE DAILY 8010.23 8010.23 8010.23 8010.23 tOTAL TRUE STD DEV 3301.34 2868.59 2625.59 2357.69 I TRUE COEFF. 41.30 35.86 32.81 29.44 OF VARIATION DAILY TOTAL (one* sampl e) 7440.00 8393.33 8145.00 8250.00 I STANDARD DEV. 3502.86 3447.38 3154.36 3006.10 (one sample) COEFF. OF VAR. 48.10 40.96 37.99 35.63 I (one sampl e) DAILY TOTAL (two sample) 7640.00 8093.33 7867.50 8055.00 I STANDARD DEV. (csq known) 3180.52 2889.82 2599.89. 2361.54 I STANDARD DEV. 3340.66 2935.61 2667.63 2471.50 (csq estimated) COEFF. OF VAR. 42.36 36.20 33.08 29.25 I (csq known) COEFF. OF VAR. (csq estimated) 43.69 35.99 33.07 30.09 I ESTIMATE OF C-SQUARED
.79 .58 .61 .59 I
The! value given is the average over 100 simulations. I I I I I
I 6.11 I Table 6.1 (continued) SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS I Channel volume=100,000 cubic meters per 3-hour period. Mean density subjected to uniform perturbations. I Mean Density of Organisms= 10.01 1000 cubic mete~s Value of c-squared= .50 I 100 Volume of Sample in cubic meters 150 200 30t) I TRUE DAILY TOTAL TRUE STD DEV 8010.23 3513.29 8010.23 3110.17 8010.23 2887.58 8010.23 2646.33 I TRUE COEFF. 43.92 38.86 36.06 33.02 OF VARIATION I DAILY TOTAL (one sample) 7690.00 8120.00 7625.00 8273.33 I STANDARD DEV. (one sample) 3620.77 3446.09 3051.23 3082.39 COEFF. OF VAR. 47.80 41.93 39.26 36.05 I (one sample) DAILY TOTAL 8090.00 8063.33 7817.50 8098.33 (two sample) I STANDARD DEV. (csq known) 3520.80 3104.91 2813.58 2663.56 I STANDARD DEV. (csq estimated) 3522.78 3192.00 2859.41 2638.29 COEFF. OF VAR. 44.28 38.48 35.86 32.49 I (csq known) COEFF. OF VAR. 43.42 38.77 35.22 30.89 (csq estimated) I ESTIMATE OF C-SQUARED
.85 1.10 .73 .59 I The value given is the average over 100 simulations.
I I I I
6.12 I Table 6.1 (continued) SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS I Channel volume=100,OOO cubic meters per 3-hour period. Mean density subjected to uniform perturbations. I Mean Density of Organisms= 10.01 1000 cubic meters Value of c-squared=I.00 I Volume of Sample in cubic meters 100 150 200 300 I TF~UE DAILY 8010.23 8010.23 8010.23 8010.23 I TOTAL THUE STD DEV 4083.65 3742.47 3559.65 3366.90 TFtUE COEFF. 51.00 46.70 44.40 41.97 OF VARIATION D':ULY TOTAL
<<me sample) 7630.00 7766.67 8220.00 7900.00 I
STANDARD DEV. 3836.67 3575.22 3616.63 3336.17 (cIne sample) CDEFF. OF VAR. 51.21 44.74 41.92 39.86 I (one sample) DAILY TOTAL (t.wo sample) 7910.00 7803.33 8122.50 8093.33 I STANDARD DEV. (c:sq known) 4029.47 3628.79 3568.91 3428.80 I STANDARD DEV. 3940.81 3603.39 3635.76 3421.01 (csq estimated) CCiEFF. OF VAR. 50.37 45.17 42.90 40.56 I (csq known) CCiEFF. OF VAR. (csq estimated) 47.38 44.40 41.31 39.92 I ESTIMATE OF C-*SQUARED
- 1. 27 1. 89 2.67 1. 74 I
The value given is the average over 100 simulations. I I I I I
I 6.13 Table 6.2 I Simulation of Sampling for One Day with Uniform Variation of Density About Level of 50 Organisms/1000 m3 SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS I Channel volume=100,000 cubic meters per 3-hour period. Mean density subjected to uniform perturbations. I Mean Density of Organisms= 50.0/ 1000 cubic meters Value of c-squared=O.OO I 100 Volume of Sample in cubic meters 150 200 300 I TRUE DAILY TOTAL 39744.21 39744.21 39744.21 39744.21 TRUE STD DEV 6304.30 5147.44 4457.81 3639.79 I TRUE COEFF. 15.93 13.00 11.26 9.20 OF VARIATION I DAILY TOTAL (one sample) 40450.00 40093.33 39030.00 39463.33 I STANDARD DEV. (one sample) 12319.90 11674.28 11095.04 10960.64 COEFF. OF VAR. 30.18 28.84 28.19 27.62 I (one sample) DAILY TOTAL 40000.00 40016.67 39227.50 39420.00 (two sample) I STANDARD DEV. (ceq known) 6324.~6 5165.05 4428.74 3624.91 I STANDARD DEV. (csq estimated) 6989.28 ~717.23 4955.68 3958.54 COEFF. OF VAR. 15.93 12.99 11.36 9.25 I (csq known) COEFF. OF VAR. (csq estimated) 17.46 14.24 12.57 10.05 I ESTIMATE OF C-SQUARED
.05 .03 .03 .01 I The value given is the average over 100 si mul at i ':mB.
I I I I
6.14 I Table 6.2 (continued) SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS I Channel volume=100,000 cubic meters per 3-hour period. Mean density subjected to uniform perturbations. I Mean Density of Organisms= 50.0/ 1000 cubic meters Value of c-squared= .05 I Volume of Sample in cubic meters 100 150 200 300 I TFi:UE DAILY 39744.21 39744.21 39744.21 39744.21 TOTAL TFi:UE STD DEV 7105.65 6102.64 5533.37 4898.38 I TF:UE COEFF. OF VARIATION 17.93 15.39 13.94 12 .. 33 I DP,ILY TOTAL (one sampl e) 39790.00 40766.67 39515.00 39393.33 I STANDARD DEV. 12360.42 11938.73 11546.97 11067.82 (one sample) COEFF. OF VAR. 30.64 29.02 28.81 27.94 I (c*ne sample) DAILY TOTAL (two sample) 40210.00 40333.33 39557.50 39850.00 I STANDARD DEV. (c:sq known) 7154.53 6146.11 5520.23 4906.72 I STANDARD DEV. 7427.65 6258.86 5916.40 4824.82 (csq estimated) COEFF. OF VAR. 17.89 15.29 13.99 12.32 I (csq known) CDEFF. OF VAR. (csq estimated) 18.30 15.38 14.70 11.89 I ESTIMATE OF C-*SQUARED
.07 .06 .08 .05 I
The value given is the average over 100 simulations. I I I
I 6.15 I Table 6.2 (continued) SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS I Channel volume=100,000 cubic meters per 3-hour period. Mean density subjected to uni~orm perturbations. I Mean Density o~ Organisms= 50.0/ 1000 cubic meters Value of c-squared= .10 I 100 Volume of Sample in cubic meters 150 200 300 I TRUE DAILY TOTAL 39744.21 39744.21 39744.21 39744.21 TRUE STD DEV 7925.36 6927.35 6431.50 5994.09 I TRUE COEFF. 19.72 17.44 16.19 14.92 OF VARIATION I .DAILY TOTAL (one sample) 39430.00 40133.33 40275.00 40396.67 I STANDARD DEV. (one sample) 12472.37 12090.77 11962.12 11619.09 I:OEFF. OF VAR. 31.19 29.81 29.00 29.46 I (one sample)
)AILY TOTAL 39500.00 39900.00 39665.00 40106.67 (two sample) 1 STANDARD DEV.
(csq known) 7904.36 6950.27 6399.43 5939.06 I STANDARD DEV. (csq estimated) 7742.09 6799.37 6394.14 5934.10 C:OEFF. OF VAR. 19.81 17.43 16.12 14.77 I (csq known) COEFF. OF VAR. 19.29 16.79 15.33 14.06 (csq estimated) I ESTIMATE OF C-SQUARED
.10 .10 .10 .10 I* The value given is the average over 100 simulations.
I I I I
6.16 I Table 6.2 (continued) SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS I Channel volume=100,OOO cubic meters per 3-hour period. Mean density subjected to uniform perturbations. I Mean Density of Organisms= 50.0/ 1000 cubic meters yalue of c-squared= I Volume of Sample in cubic meters 100 150 200 300 I THUE DAILY 39744.21 39744.21 39744.21 39744.21 TOTAL H~UE STD DEV 10553.85 9906.34 9566.17 9213.45 I THUE COEFF. OF VARIATION 26.53 24.89 24.02 23.12 I D,qIL Y TOTAL (one sample) 39950.00 39700.00 40615.00 39490.00 I STANDARD DEV. 13375.35 13030.56 12965.15 12373.58 (one sample) COEFF. OF VAR. 32.98 31.89 31.10 30.47 I (one sample) DAILY TOTAL (two sample) 40245.00 39763.33 40437.50 39531.67 I STANDARD DEV. (csq known) 10613.41 9907.85 9688.50 9131.26 I STANDARD DEV. 10593.16 10188.12 9500.07 9028.38 (csq estimated) COEFF. OF VAR. 26.22 24.64 23.70 22.85 I (csq known) COEFF. OF VAR. (csq estimated) 25.29 24.29 22.66 21.94 I ESTIMATE OF C-SQUARED
.39 .39 .34 .36 I
Th~ value given is the average over 100 simulations. I I I I I
I 6.17 I Table 6.2 (continued) SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS I Channel volume=100,000 cubic meters per 3-hour period. Mean density subjected to uniform perturbations. I Mean Density of Organisms= 50.0/ 1000 cubic meters Value of c-squared= .50 I 100 Volume of Sample in cubic meters 150 200 300 I TRUE DAILY TOTAL 39744.21 39744.2i 39744.21 39744.21 TRUE STD DEV 12132.79 11573.96 11284.17 10986.74 I TRUE COEFF. 30.48 29.06 2B.32 27.56 OF VARIATION I DAILY TOTAL (one sample) 39480.00 39446.67 40785.00 38196.67 I STANDARD DEV. (one sample) 14011.07 13625.79 13979.36 12335.14 COEFF. OF VAR. 34.36 33.16 32.68 31.09 I (one sample) DAILY TOTAL 40565.00 40146.67 40952.50 37991. 67 (two sample) I STANDARD DEV. (csq known) 12215.77 11642.07 11613.54 10461.04 I STANDARD DEV. (csq estimated) 11938.80 11722.53 11894.17 10092.38 COEFF. OF VAR. 29.88 28.44 27.78 26.94 I (csq known) COEFF. OF VAR. 28.83 27.25 27.28 24.98 (csq estimated) I ESTIMATE OF C-SQUARED
.58 .58 .65 .49 I The value given is the average over 100 simulations.
I I I I
6.18 I Table 6.2 (continued) SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS I Channel volume=100,000 cubic meters per 3-hour period. Mean density subjected to uniform perturbations. I Mean Density of Organisms= 50.0/ 1000 cubic meters Value of c-squared=I.00 I Volume of Sample in cubic meters 100 150 200 300 I TRUE DAILY 39744.21 39744.21 39744.21 39744.21 TOTAL TRUE STD DEV 15958.22 15537.59 15322.95 .15105.26 I TRUE COEFF. 40.05 38.98 38.43 37.88 OF VARIATION DAILY TOTAL (one sampl e) 41780.00 37006.67 42145.00 40316.67 I
~;TANDARD DEV. 16257.00 14609.59 16277.67 15402.78 (one sample)
C:OEFF. OF VAR. 37.12 36.41 36.00 35.24 I (one sample) DAILY TOTAL (two sample) 41200.00 37653.33 40957.50 40245.00 I STANDARD DEV. (csq known) 16419.60 14661. 77 15646.53 15264.72 I STANDARD DEV. 16443.24 14705.55 15700.76 15341.83 (,:sq estimated) COEFF. OF VAR. 38.47 37.45 36.82 36.43 I (csq known) COEFF. OF VAR. 37.35 36.96 35.80 34.92 (csq estimated) ESTIMATE OF 1. 37 1. 46 1 a 52 1. 26 C-SQUARED The value given is the average over 100 simulations. I I I I
I 6.19 I Table 6.3 Simulation of Sampling for One Day with Uniform . Variation of Density About Level of 100 Organisms/lOOO m3 SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS I Channel volume=100,000 cubic meters per 3-hour period. Mean density subjected to uniform perturbations. I Mean Density of Organisms=100.01 1000 cubic meters Value of c-squared=O.O( I 100 Volume of Sample in cubic meters 150 200 300 I TRUE DAILY TOTAL 79709.19 79709.19 79709.19 79709.19 TRUE STD DEV 8928.00 7289.68 6313.05 5154.58 I TRUE COEFF. 11.24 9.18 7.95 6.49 OF VARIATION I DAILY TOTAL (one sample) 79740.00 81120.00 79720.00 79520.00 I STANDARD DEV. (one sample) 22524.65 22397.02 21763.44 21351. 79 COEFF. OF VAR. 28.05 27.45 27.18 26.69 I (one sample) DAILY TOTAL 79550.00 80390.00 79940.00 79618.33 (two sample) I STANDARD DEV. (csq known) 8919.08 7320.75 6322.18 5151.64 I STANDARD DEV. (csq estimated) 9740.12 8034.51 6819.82 5657.98 COEFF. OF VAR. 11.28 9.14 7.94 6.50 I (csq known) COEFF. OF VAR. 12.23 9.91 8.48 7.08 (csq estimated) I ESTIMATE OF C-SQUARED
.02 .01 .01 .01 I The value given is the average over 100 simulations.
I I I I
6.20 I Table 6.3 (continued) SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS I Channel volume=100,000 cubic meters per 3-hour period. Mean density subjected to uniform perturbations. I Mean Density of Organisms=100.01 1000 cubic meters Value of c-squared= .05 I Volume of Sample in cubic meters 100 150 200 300 I TRUE DAILY 79709.19 79709.19 79709.19 79709.19 TOTAL TRUE STD DEV 11089.17 9818.35 9116.75 8356.45 I TRUE COEFF. OF VARIATION 13.93 12.33 11..44 10.48 I DAILY TOTAL (one sample) 79450.00 80286.67 78530.00 78620.00 I STANDARD DEV. 23089.17 22740.47 22004.09 21545.25 (one sampl e) COEFF. OF VAR. 28.85 28.07 27.73 27.17 (one sample) DAILY TOTAL (two sample) 79325.00 80560.00 78980.00 79245.00 I STANDARD DEV. (csq known)
*11077.45 9912.35 9051.37 8300.28 I
STANDARD DEV. 11032.91 10002.22 9494.47 8173.30 (c:sq estimated) COEFF. OF VAR. 13.99 12.31 11.45 10.46 I (csq known) C:OEFF. OF VAR. (c:sq estimated) 13.76 12.28 11.54 9.97 I ESTIMATE OF C-SQUARED
.06 .06 .06 .05 I
The value given is the average over 100 simUlations. I I I I I
I 6.21 I Table 6.3 (continued) SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HDUR PERIODS I Channel volume=100,000 cubic meters per 3-hour period. Mean density subjected to uniform perturbations. I Mean Density of Organisms=100.0/ 1000 cubic meters Value of c-squared= .10 I 100 Volume of Sample in cubic meters 150 200 300 I TRUE DAILY TOTAL 79709.19 79709.19 79709.19 79709.19 TRUE STD DEV 12893.03 11817.81 11241.70 10634.42 I TRUE COEFF. 16.18 14.82 14.09 13.32 OF VARIATION I DAILY TOTAL (one sample) 78410.00 80080.00 81670.00 78806.67 I STANDARD DEV. (one sample) 23181. 03 2324~.36 23208.19 22064.68 COEFF. OF VAR. 29.14 28.57 28.05 27.62 I (one sample) DAILY TOTAL 79220.00 79923.33 81330.00 78780.00 (two sample) I STANDARD DEV. (ceq known) 12896.34 11819.56 11396.96 10502.29 I STANDARD DEV. (csq estimated) 12436.64 12059.94 11965.58 10685.24 COEFF. OF VAR. 16.27 14.75 13.97 13.28 I (csq known) COEFF. OF VAR. 15.36 14.45 14.09 13.10 (ceq estimated) I ESTIMATE OF C-SQUARED
.09 .11 .12 .11 I The value given is the average over 100 simulations.
I I I I
6.22 I Table 6.3 (continued) SIMULATION ClF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS I Channel volume=100,000 cubic meters per 3-hour period. Mean density subjected to uniform perturbations. I Mean Density of Organisms=100.0/ 1000 cubic meters Value of c-squared= I Volume of Sample in cubic meters 100 150 200 300 I TRUE DAILY 79709.19 79709.19 79709.19 79709.19 TOTAL TRUE STD DEV 19186.20 18480.81 18117.82 17747.41 I TRUE COEFF. OF VARIATION 24.03 23.14 22.68 22.21 I DAILY TOTAL (one sample) 76240.00 77546.67 77905.00 77246.67 I STANDARD DEV. 24137.73 24034.60 24119.65 23730.95 (one sample) COEFF. OF VAR. 30.74 30.09 29.99 30.13 I (one sample) DAILY TOTAL (two sample) 78175.00 79376.67 79312.50 79413.33 I STANDARD DEV. (csq known) 18580.48 18393.37 18012.44 17731.91 I STANDARD DEV. 18244.59 17839.10 18664.77 17536.72 (csq estimated) COEFF. OF VAR. 23.48 22.92 22.39 22.00 I
<csq known)
COEFF. OF VAR. (csq estimated) 22.77 21.24 22.28 21.13 I ESTIMATE OF C-SQUARED
.36 .33 .39 ,c .. _I...)
I The value given is the average over 100 simulations. I I I I I
I 6.23 I Table 6.3 (continued) SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS I Channel volume=100,000 cubic meters per 3-hour period. Mean density subjected to uniform perturbations. I Mean Density OT Organisms=100.01 1000 cubic meters Value of c~squared= .50 I 100 Volume of Sample in cubic meters 150 200 300 I TRUE DAILY TOTAL 79709.19 79709.19 79709.19 79709.19 TRUE STD DEV 22634.38 22039.63 21736.15 21428.38 I TRUE COEFF. 28.34 27.58 27.20 26.81 OF VARIATION I DAILY TOTAL (one sample) 77120.00 78673.33 78160.00 77700.00 I STANDARD DEV. (one sample) 25606.64 26645.99 25495.98 25276.74 COEFF. OF VAR. 32.34 32.43 31.48 30.97 I (one sample) DAILY TOTAL 77610.00 78830.00 78077.50 79923.33 (two sampl e) I STANDARD DEV. (c:sq known) 21833.51 219:18.62 21438.55 21455.11 I S1"ANDARD DEV. (C51q estimated) 21866.18 22253.04 20627.68 21611.11 COEFF. OF VAR. 27.74 27.18 26.88 26.16 (csq known) COEFF. OF VAR. 27.07 26.19 25.61 25.64 (csq estimated) ESTIMATE OF .58 .61 .55 .56 C-SQUARED The value given is the average over 100 Simulations.
6.24 I Table 6.3 (continued) SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS I Channel volume=100,000 cubic meters per 3-hour period. Mean density subjected to uniform perturbations. I Mean Density of Organisms=100.0/1000 cubic meters Value of c-squared=1.00 I Volume of Sample in cubic meters 100 150 200 300 I TRUE DAILY 79709.19 79709.19 79709.19 79709.19 TOTAL TRUE STD DEV 30739.56 30304.31 30084.32 29862.71 I TRUE COEFF. DF VARIATION 38.46 37.91 37.63 37.35 I DAILY TOTAL (one sample) 75880.00 78700.00 85180.00 82463.33 I STANDARD DEV. 30189.90 30359.33 33012.99 31318.42 (one sample) COEFF. OF VAR. 36.17 35.36 35.46 35.04 I (one sample) DAILY TOTAL (two sample) 78220.00 80683.33 81990.00 81563.33 I STANDARD DEV. (csq known) 30188.19 30760.52 31533.24 30357.47 I STANDARD DEV. 31260.84 32213.35 30833.10 30460.63 (csq estimated) COEFF. OF VAR. 36.88 36.60 36.82 35.7"::'- (csq known) COEFF. OF VAR. 36.38 36.59 34.96 34.87 (csq est i mated:' ESTIMATE OF 1. 45 1. 41 1.25 1. 33 C-SQUARED The value given is the average over 100 simulations.
I 6.25 Table 6.4 I Simulation of Sampling for One Day with Uniform Variation of Density About Level of 250 Orqanisms/lOOO m3 SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS I Channel volume=100,OOO cubic meters per 3-hour period. Mean density subjected to uniform perturbations. I Mean Density oT Organisms=250.01 1000 cubic meters Value of c-squared=O.OO I 100 Volume of Sample in cubic meters 150 200 300 I TRUE DAILY TOTAL 201714.34 201714.34 201714.34 201714.34 TRUE STD DEV 14202.62 11596.39 10042.77 8199.89 I TRUE COEFF. 7.07 5.78 5.00 4.08 OF VARIATION DAILY TOTAL 201060.00 202180.00 201315.00 201910.00 (one sample) STANDARD DEV. 54180.44 53806.15 53175.35 53136.08 (one sample) COEFF. OF VAR. 26.80 26.46 26.29 26.16 (one sample) DAlLY TOTAL 201535.00 201033.33 200425.00 201473.33 (two sample) STANDARD DEV. 14196.30 11576.80 10010.62 8194.98 (csq ~,"own) STANDARD DEV. 15573.86 13160.21 10928.75 8853.12 (csq estimated) COEFF. OF VAR. 7.08 5.79 5.02 4.09 (csq known) COEFF. OF VAR. 7.69 6.50 5.43 4.39 (csq estimated) ESTIMATE OF .01 .01 0.00 0.00 C-SGlUARED The value given is the average over 100 simulations.
6.26 I Table 6.4 (continued) SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS I Channel volume;100,000 cubic meters per 3-hour period. Mean density subjected to uniform perturbations. I Mean Density of Organisms=250.0/ 1000 cubic meters Value of c-squared= .05 Volume of Sample in cubic meters 100 150 200 300 I TRUE DAILY 201714.34 201714.34 201714.34 201714.34 TOTAL TRUE STD DEV 21829.96 20231.39 19382.73 18495.16 I TRUE COEFF. 10.82 10.02 9.59 9.15 J OF VARIATION DAILY TOTAL 201830.00 200626.67 200440.00 199596.67 (one sample) STANDARD DEV. 55293.22 54863.10 54445.27 53840.27 (one sample) COEFF. OF VAR. 27.14 27.03 26.93 26.74 (one sample) DAILY TOTAL 202800.00 201753.33 200035.00 199636.67 (two sample) STANDARD DEV. 21906.75 20276.63 19257.12 18323.55 (csq known) STANDARD DEV. 21982.49 20893.38 19024.13 18126.84 (csq estimated) COEFF. OF VAR. 10.78 10.02 9.60 9.14 (csq known) COEFF. OF W~,R. 10.55 9.93 9.20 8.78 (csq estimatE!d) ESTIMATE OF .05 .05 .05 .05 C-SQUARED The value given is the average over 100 simulations.
I 6.27 I Table 6.4 (continued) SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS I Channel volume=100,000 cubic meters per 3-hour period. Mean density subjected to uniform perturbations. I Mean D'ensity of Organisms=250.01 1000 c:ubic meters Value of c:-squared= .10 I 100 Volume of Sample in cubic meters 150 200 300 I TRUE DAILY TOTAL 201714.34 ,201714.34 201714.34 201714.34 TRUE STD DEV 27411. 32 26156.11 25505.35 24837.55 I TRUE COEFF. 13.57 12.94 12.61 12.27 OF VARIATION I DAILY TOTAL (one sample) 201670.00 206053.33 204905.00 200580.00 I STANDARD DEV. (one sample) 56705.82 57200.23 57002.94 55451. 55 COEFF. OF VAR. 27.78 27.35 27.36 27.30 I (one sample) DAILY TOTAL 202955.00 203683.33 204710.00 203793.33 (t .... o sample) I STANDARD DEV. (csq kno .... n) 27649.18 26352.31 25920.95 25135.28 I STANDARD DEV. (csq estimated> 27449.86 27008.35 26114.99 25020.46 COEFF. OF VAR. 13.56 12.85 12.57 12.26 I (c:sq kno .... n) COEFF. OF VAR. 12.94 12.81 12.14 11.99 (csq estimated) I ESTIMATE OF C-SGlUARED
.10 .11 .11 .11 I The value given is the average over 100 simulations.
I I I I
6.28 I Table 6.4 (continued) I SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS Channel volume=100,OOO cubic meters per 3-hour period. I Mean density subjected to uniform perturbations. Mean Density of Organisms=250.01 1000 cubic meters Value of c-squared= .33 I 100 Volume of Sample in cubic meters 150 200 300 I TRUE DAILY TOTAL 201714.34 201714.34 201714.34 201714.34 I TRUE STD DEV 45098.96 44347.24 43966.56 43582.56 TRUE COEFF. 22.28 21.90 21.71 21.52 I OF VARIATION DAILY TOTAL (one sample) 199070.00 210920.00 199875.00 203116.67 I STANDARD DEV. (one sample) 60529.74 63189.17 61446.56 61520.86 I COEFF. OF VAR. 29.59 29.00 29.82 29.27 I (one sample) DAILY TOTAL 201720.00 209593.33 200425.00 203663.33 (two sampl e) STANDARD DEV. (csq known) 44805.71 45643.61 43811.97 44257.00 I STANDARD DEV. (csq estimated) 46917.05 44801.44 42960.13 41653.66 I COEFF. OF VAR. 21. 88 21.43 21.48 21.25 (csq known) COEFF. OF VAR. (csq estimated) 21.92 20.33 20.85 19.44 I ESTIMATE OF C-SQUARED
.40 .33 .36 .31 o ThE! value given is the average over 100 simulations.
I I I I I
***U_II*_ _. _
I 6.29 I Table 6.4 (continued) SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS I Channel volume=100,000 cubic:: meters per 3-hour period. Mean density subjected to uniform perturbations. I Mean Density of Organisms=250.0/ 1000 cubic meters Value of c-squared= .50 I 100 Volume of Sample in cubic meters 150 200 300 I TRUE DAILY TOTAL 201714.34 201714.34 201714.34 201714.34
'"RUE STD DEV ~4314.31 53691.77 53377.78 :53061.92 I TRUE COEFF. 26.83 26.51 26.35 26.20 OF VARIATION I DAILY TOTAL (one sample) 195740.00 203673.33 210075.00 200853.33 I STANDARD DEV.
(one sample) 63652.18 66512.52 67444.40 66310.29 COEFF. OF VAR. 31.31 31.01 30.61 31.24 I (one sample) DAILY TOTAL 201245.00 205940.00 206165.00 201586.67 (two sample) I STANDARD DEV. (csq known) 53970.54 55062.82 54218.29 53457.57 STANDARD DEV. 52566.67 53476.85 56345.03 55124.74 (csq estimated) COEFF. OF VAR. 26.29 25.96 25.57 25.86 (csq known) COEFF. OF VAR. 24.95 24.43 25.29 25.85 (csq estimated) ES1"IMATE OF .52 .51 .58 .60 C-SiGlUARED The value given is the average over 100 simUlations.
6.30 I Table 6.4 (continued) I SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS Channel volume=100,000 cubic meters per 3-hour period. I Mean density subjected to uniform perturbations. Mean Density of Organisms=250.01 1000 cubic meters Value of c-squared=1.00 I Volume of Sample in cubic meters 100 150 200 300 I
-----------~--------------------------------------------~-
TRUE DAILY 201714.34 201714.34 201714.34 201714.34 TOTAL TRUE STD DEV 75487.57 75040.89 74816.55 74591.54 I TRUE COEFF. OF VARIATION 37.27 37.05 36.93 36.82 I DAILY TOTAL (one sample) 203900.00 204320.00 210225.00 192256.67 I STANDARD DEV. 77385.08 74912.72 78845.56 73258.33 (one sample) COEFF. OF VAR. 35.09 34.66 34.40 I 33.72 (one sample) DAILY TOTAL 203810.00 204473.33 204277.50 190016.67 (two sample) STANDARD DEV. 77094.85 74346.66 75832.91 70290.71 (csq known) STANDARD DEV. 77695.24 72854.31 74297.89 75305.56 (csq estimated) COEFF. OF VAR. 36.14 35.29 35.40 34.95 (c:sq known) COEFF. OF VAR. 34.38 34.31 34.99 35.14 (csq estimated) ESTIMATE OF 1. 32 1.25 1. 28 1. 41 C-SQUARED The value given is the average over 100 simulations.
I 6.31 I Table 6.5 Simulation of Sampli"C for One Day with Diel Variation of Density Aboutevel of 10 Organisms/1000 m3 I SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS Channel volume=100,000 cubic meters per 3-hour period. I Mean density subjected to diurnal perturbations. Mean Density of Organisms= 10.01 1000 cubic meters Value of c-squared=O.OO I Volume of Sample in cubic meters 100 150 200 300 I TRUE DAILY TOTAL 7905.42 7905.42 7905.42 7905.42 I TRUE STD DEV 2811. 66 2295.71 1988.14 1623.31 TRUE COEFF. 35.83 29.26 25.34 20.69 I OF VARIATION DAILY TOTAL 8030.00 8180.00 7900.00 8006.67 (one sample) I STANDARD DEV. (one sample) 3984.97 3757.66 3477.43 3328.66 I COEFF. OF VAR. (one sample) 49.87 44.92 43.31 40.51 DAILY TOTAL 7845.00 8150.00 7950.00 7991.67 I (two sample) STANDARD DEV. 2800.89 2330.95 1993.74 1632.14 (csq known) I STANDARD DEV. (csq estimated) 3083.02 2554.73 2265.50 1863.84 I COEFF. OF VAR. (csq known) 37.27 29.21 25.59 20.73 COEFF. OF VAR. 40.62 31.70 28.52 23.28 I (csq estimated) ESTIMATE OF .26 .11 .09 .06 C-SGlUARED I The value given is the average over 100 simulations. I I I I
6.32 I Table 6.5 (continued) SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS Channel volume=100,000 cubic mete~s per 3-hour period. I M~an Mean density subjected to diurnal perturbations. Density of Organisms= 10.01 1000 cubic meters Value of c-squared= .05 I Volume of Sample in cubic meters I 100 150 200 300 TR:UE DAILY TOTAL 7905.42 7905.42 790:5.42 7905.42 I TRUE STD DEV 2952.32 2465.98 2182.54 1856.32 I TRUE COEFF. 37.56 31.35 27.73 23.55 OF VARIATION DAILY TOTAL 7430.00 8193.33 7650.00 7913.33 I (one sample) STANDARD DEV. (one sample) 3845.78 3784.18 3420.53 3314.61 I COEFF. OF VAR. (one sample) 52.07 44.84 43.84 40.86 I DAILY TOTAL 7745.00 7960.00 7837.50 7886.67 (two sample) STANDARD DEV. 2922.14 2472.57 2169.78 1849.45 I (csq known) STANDARD DEV. (csq estimated) 3351.87 2703.91 2492.74 2037.43 I COEFF. OF VAR. (csq known) 39.06 31.71 28.00 23.63 I COEFF. OF VAR. 43.35 33.51 31.18 25.12 (csq estimated) ESTIMATE OF .46 .17 .18 .12 I C-SQUARED The value given is the average over 100 simulations. I I I I I I
I 6.33 Table 6.5 (continued) I SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS Channel volume=100,000 cubic meters per 3-hour period. I Mean density subjected to diurnal perturbations. Mean Density of Organisms= 10.01 1000 cubic meters Value of c-squared= .10 I Volume of Sample in cubic meters I TRUE DAILY 100 7905.42 150 7905.42 200 7905.42 300 7905.42 TOTAL I TRUE STD DEV 3086.58 2625.23 2360.99 2063.18 I '"RUE COEFF. OF VARIATION 39.21 33.31 29.92 26.10 DAILY TOTAL 8230.00 7733.33 7800.00 7953.33 I (one sample) STANDARD DEV. (one sample) 4159.33 3839.56 3561. 95 3389.53 I COEFF. OF VAR. (one sample) 50.31 47.67 44.18 41.40 I DAILY TOTAL (two sample) 8170.00 7673.33 7792.50 8061. 67 STANDARD DEV. 3147.86 2594.46 2344.33 2088.69 I (csq known) STANDARD DEV. 3419.06 2843.32 2509.23 2181.87 (csq estimated) I COEFF. OF VAR. (csq known) 39.63 34.31 30.31 26.00 COEFF. OF VAR. 41.99 36.74 31.61 26.84 I (csq estimated) ESTIMATE OF .51 .38 .22 .18 C*-SQUARED I The value given is the average over 100 simUlations. I I I I I
6.34 I Table 6.5 (continued) SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS Channel volume=100,000 cubic meters per 3-hour period. I Mean density subjected to diurnal perturbations. Mean Density of Organisms= 10.01 1000 cubic meters Value of c-squared= .33 I Volume of Sample in cubic meters I 100 150 200 300 T::;:UE DAILY TOTAL 7905.42 7905.42 7905.42 7905.42 I TRUE STD DEV 3648.37 3267.34 3059.07 2835.55 I TRUE COEFF. 46.13 41.22 38.53 35.63 OF VARIATION DAILY TOTAL 8150.00 7673.33 7405.00 7506.67 I (one sample) STANDARD DEV. (one sample) 4418.14 3916.91 3607.28 3538.99 I COEFF. OF VAR. (one sample) 52.65 48.38 46.13 43.95 o DAILY TOTAL 8125.00 7716.67 7797.50 7771.67 (two sample) STANDARD DEV. 3731.04 3230.45 3055.05 2834.66 I (csq known)
~;TANDARD DEV.
(c:sq estimated) 3767.63 3249.96 3166.43 3083.92 I COEFF. OF VAR. (csq known) 45.96 41.49 38 .. 23 35 .. 42 I COEFF. OF VAR. 44.14 40.02 37.91 36.22 (c:sq estimated) ESTIMATE OF .84 .61 .68 .58 I C-SQUARED The value given is the average over 100 simulations. I I I I I I
I 6.35 Table 6.5 (continued) I SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS Channel volume=100,000 cubic meters per 3-hour period. I Mean density subjected to diurnal perturbations. Mean Density of Organisms= 10.01 1000 cubic meters Value of c-squared= .50 I Volume of Sample in cubic meters I TRUE DAILY 100 7905.42 150 7905.42 200 7905.42 300 7905.42 TOTAL I TRUE STD DEV 4001. 66 3657.62 3472.84 3277.65 I TRUE COEFF. OF VAR I ATI ON 50.48 46.03 43.64 41.10 DAILY TOTAL 8040.00 8326.67 8595.00 8050.00 I (one sample) STANDARD DEV. 4371.50 4396.97 4350.86 3875.42 (one sample) I COEFF. OF VAR.
~one sample) 53.07 48.83 46.93 44.10 I DAILY TOTAL (two sample) 8090.00 8003.33 8005.00 7951. 67 STANDARD DEV. 4043.51 3751. 21 3439.91 3328.05 I (csq known)
STANDARD DEV. 4054.63 3813.72 3638.68 3001.76 (c:sq estimated) I COEFF. OF VAR. (csq known) 49.29 45.41 42.13 39.94 I COEFF. OF VAR. (csq estimated) 48.07 44.05 42.82 36.15 ESTIMATE OF 1. 15 1.00 1.20 .73 I C-SQUARED The value given is the average over 100 simulations. I I I I I
6.36 Table 6.5 (continued) I SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS Channel volume=100,000 cubic meters per 3-hour period. I Mean density subjected to diurnal perturbations. Mean Density of Organisms= 10.0/ 1000 cubic meters Value of c-squared=I.00 I Volume of Sample in cubic meters I 100 150 200 300 TF~UE DAILY TOTAL 7905.42 7905.42 7905.42 7905.42 I TF(UE STD DEV 4911.33 4635.30 4490.93 4341.76 I TF<UE COEFF. 61.71 58.12 56.24 54.29 OF VARIATION D/HLY TOTAL 7810.00 7806.67 8080.00 8880.00 I (one sample) STANDARD DEV. (one sample) 4929.50 4379.75 4513.04 5203.31 I C()EFF. OF VAR. (one sample) 58.85 51.94 50.73 47.78 I DIULY TOTAL (two sample) STANDARD DEV. ( I:sq known) 7740.00 4846.99 7706.67 4400.51 7760.00 4314.90 8166.67 4575.74 o STANDARD DEV. (csq estimated) 4354.31 4394.44 4122.20 4933.45 B COEFF. OF VAR. (csq known) 59.02 53.47 52.45 50.69 I COEFF. OF VAR. 53.71 51.38 47.51 49.19 (csq estimated) ESTIMATE OF 1. 39 2.63 3.36 2.90 I C-SQUARED ThE! value given is the average over 100 simulations. I I I I
I 6.37 I Table 6.6 Simulation of SamPlin for One Day with Diel C of 50 Organisms/lOOO m3 Variation of Density Aboutevel I SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS Channel volume c l00,000 cubic meters per 3-hour period. I Mean density subjected to diurnal perturbations. Mean Density of Organisms= 50.0/ 1000 cubic meters Value of c-squared=O.OO I Volume of Sample in cubic meters 100 150 200 300 I TRUE DAILY TOTAL 39562.33 39562.33 39562.33 39562.33 I TRUE STD DEV 6289.86 5135.65 4447.60 3631. 45 TRUE COEFF. 16.01 13.08 11.32 9.25 I OF VARIATION DAILY TOTAL 38270.00 39233.33 39855.00 39640.00 (one sample) I STANDARD DEV. (one sample) 15115.55 15041.72 15150.66 14865.92 I COEFF. OF VAR. (one sample) 39.05 37.68 37.27 36.98 DAILY TOTAL 39440.00 39500.00 40012.50 39516.67 I (two sample) STANDARD DEV. 6280.13 5131.60 4472.83 3629.36 (csq known) I STANDARD DEV. (csq estimated) 6855.65 5949.79 4879.68 4015.66 I COEFF. OF VAR. (csq known) 16.09 13.12 11. 31 9.28 COEFF. OF VAR. 17.38 14.93 12.18 10.18 I (c:sq estimated) I::STIMATE OF .02 .02 .01 .01 C-SGlUARED I The value given is the average over 100 simulations. I I I I
6.38 I Table 6.6 (continued) SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS Channel volume=100,OOO cubic meters per 3-hour period. I Mean density subjected to diurnal perturbations. Mean Density of Organisms= 50.0/ 1000 cubic meters Value of c-squared= .05 I Volume of Sample in cubic meters I 100 150 200 300 THUE DAILY TOTAL 39562.33 39562.33 39562.33 39562.33 I THUE STD DEV 7753.99 6851.05 6351.63 5809.43 I TF~UE COEFF. 19.63 17.31 16.03 14.63 OF VARIATION D~HLY TOTAL 37950.00 39793.33 39615.00 39660.00 I (one sample) STANDARD DEV. (<)ne sample) 15450.24 15600.43 15278.91 15143.72 I C()EFF. OF VAR. (one sample) 39.37 38.20 37.63 37.24 I DiULY TOTAL 38195.00 40043.33 40032.50 39708.33 (two sample) STANDARD DEV. 7577.85 6908.70 6389.39 5835.47 I (csq known) STANDARD DEV. (c';;q estimated) 7843.15 7029.30 6525.81 5729.12 I COEFF. OF VAR. (csq known) 19.90 17.25 15.91 14.61 I COEFF. OF VAR. 20.08 16.96 15.62 13.39 (csq estimated) ESTIMATE OF .07 .06 .06 .05 I C-SQUARED The value given is the average over 100 simulations. I I I I I I
I 6.39 Table 6.6 (continued) I SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS Channel volume=100,000 cubic meters per 3-hour period. I Mean density subjected to diurnal perturbations. Mean Density of Organisms= 50.0/ 1000 cubic meters Value of c-squared= .10 I Volume of Sample in cubic meters I TRUE DAILY 100 39562.33 150 39562.33' 200 39562.33 300 39562.33 TOTAL I TRUE STD DEV 8982.56 8215.78 7804.18 7369.64 I TRUE COEFF. (IF VARIATION 22.66 20.69 19.62 18.50 DAILY TOTAL 39690.00 39460.00 39490.00 39813.33 I (one sample) STANDARD DEV. 16148.99 16062.66 15768.72 15462.68 (one sample) I COEFF. OF VAR. (one sample) 39.13 38.80 38.64 37.56 I DAILY TOTAL (two sample) 38690.00 39073.33 39735.00 39805.00 STANDARD DEV. 8772.01 8146.59 7861.59 7425.07 I (csq known) STANDARD DEV. 9005.55 8165.~1 8236.05 7673.22 (csq estimated) I COEFF. OF VAR. (csq known) 22.58 20.57 19.56 18.35 I COEFF. OF VAR. (csq estimated) 22.57 19.88 18.88 17.86 ESTIMATE OF .16 .12 .12 .12 I C-SQUARED The value given is the average over 100 simulations. I I I I I
6.40 Table 6.6 (continued) I SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS Channel volu~e=100,000 cubic meters per 3-hour period. I Mean density subjected to diurnal perturbations. Mean Density of Organisms= 50.0/ 1000 cubic meters Value of c-squared= .33 I Volume of Sample in cubic meters I 100 150 200 300 Tf':UE DAILY TOTAL 39562.33 39562.33 39562.33 395621>33 I TFi'UE STD DEV 13290.65 12784.91 12524.38 12258.32 I TRUE COEFF. 33.35 32.03 31.36 30.66 OF VARIATION DAILY TOTAL 37710.00 37700.00 37100.00 41733.33 I (c-ne sample) STANDARD DEV. (one sample) 16841.32 16143.39 16062.61 17885.81 I COEFF. OF VAR. (one sample) 42.15 40.11 39.52 39.21 I DAILY TOTAL 38055.00 39360.00 37280.00 41216.67 (two sampl e) STANDARD DEV. 12810.03 12662.11 11689.38 12798.71 I (csq known) STANDARD DEV. (csq estimated) 12701.38 12600.00 11596.66 12226.16 I COEFF. OF VAR. (csq known) 32.67 31.11 30.28 29.74 I COEFF. OF VAR. 31.33 28.91 28.49 25.72 (csq estimated) ESTIMATE OF .44 .43 .43 .,33 I C-SQUARED The value given is the average over 100 simulations. I I I I I I
I 6.41 Table 6.6 (continued) I SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS Channel volume=100,000 cubic meters per 3-hour period. I Mean density subjected to diurnal perturbations. Mean Density of Organisms= 50.01 1000 cubic meters Value of c-squared= .50 I Volume of Sample in cubic meters I TRUE DAILY 100 150 200 300 39562.33 39562.33 39562.33 39562.33 TOTAL I TRUE STD DEV 15658.32 15231.40 15013.39 14792.16 I TRUE COEFF. OF VARIATION 39.23 38.12 37.55 36.98 DAILY TOTAL 36100.00 40506.67 40825.00 37390.00 I (one sample) STANDARD DEV. 16428.33 18816.90 19006.38 16908.64 (one sample) I COEFF. OF VAR. (one sample) 41.64 41.~2 40.75 40.21 I DAILY TOTAL (two sample) 37435.00 39366.67 39302.50 39071.67 STANDARD DEV. 14635.40 15231. 87 15222.89 14815.46 I (csq known) STANDARD DEV. 15835.25 15229.80 16166.44 14832.30 (csq estimated) I COEFF. OF VAR. (ceq known) 37.35 36.56 35.86 35.53 I COEFF. OF VAR. (csq estimated) 37.57 32.88 34.93 33.15 ESTIMATE OF .80 .58 1.09 .63 I C-SQUARED The value given is the average over 100 simulations. I I I I I
6.42 I Table 6.6 (continued) SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS Channel volume=100,OOO cubic meters per 3-hour period. I Mean density subjected to diurnal perturbations. Mean Density of Organisms= 50.0/ 1000 cubic meters Value of c-squared=1.00 I Volume of Sample in cubic meters I 100 150 200 300 Tf~UE DAILY TOTAL 39562.33 39562.33 :39562.33 39562.33 I THUE STD DEV 21232.14 20919.28 20761.08 20601.67 I TF~UE COEFF. 53.11 52.29 51.88 51.46 OF VARIATION DfHLY TOTAL 41360.00 35640.00 37990.00 36420.00 I (one sample) STANDARD DEV. (one sample) 22547.51 19210.65 20103.92 18136.73 I COEFF. OF VAR. (one sample) 46.02 43.44 43.00 43.08 I NHLY TOTAL 40240.00 38396.67 38757.50 37743.33 (1:wo sample) STANDARD DEV. 21601,,31 19640.38 20813.71 19767.19 I (c:sq known) STANDARD DEV. (csq estimated) 23337.52 21854.42 21581.10 20738.08 I COEFF. OF VAR. (csq known) 49.27 46.98 46.79 45.91 I CDEFF. OF VAR. 49.19 48.31 43.88 43.34 (c~;q estimated) ESTIMATE OF 5.54 2.92 1. 86 1. 68 I C--SQUARED The value given is the average over 100 simulations. I I I I I I
I 6.43 I I SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS Channel volume=100,000 cubic meters per 3-hour period. I Mean density subjected to diurnal perturbations. Mean Density of Organisms=100.0/ 1000 cubic meters Value of c-squaredcO.OO I Volume of Sample in cubic mete~s 100 150 200 300 I TRUE DAILY TOTAL
------------------------------~----------------------- ----
78634.06 78634.06 78634.06 78634.06 I TRUE STD DEV 8867.58 7240.35 6270.33 5119.70 TRUE COEFF. 11.34 9.26 8.02 6.55 I OF VARIATION DAILY TOTAL 78320.00 78926.67 78895.00 78500.00 (one sample) I STANDARD DEV. (one sample) 29791.78 29603.75 29142.84 28815.39 I COEFF. OF VAR. (one sample) 37.38 36.77 36.48 36.22 DAILY TOTAL 77425.00 78310.00 78735.00 79225.00 I (two sample) STANDARD DEV. 8799.15 7225.42 6274.35 5138.90 (csq known) I STANDARD DEV. (csq estimated) 10425.69 8140.84 6882.59 5908.42 I COEFF. OF VAR. (ceq known) 11.45 9.30 8.03 a.53 COEFF. OF VAR. 13.24 10.38 8.72 7.38 I (csq estimated) ESTIMATE OF .02 .01 .01 .01 C-SQUARED I The value given is the average over 100 s1 mul ilti ons. I I I I
6.44 Table 6.7 (continued) I SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS Channel volume=100,OOO cubic meters per 3-hour period. I Mean density subjected to diurnal perturbations. Mean Density of Organisms=100.01 1000 cubic meters Value of c-squared= .05 I Volume of Sample in cubic meters I 100 150 200 300 THUE DAILY TOTAL 78634.06 78634.06 "78634.06 78634.06 I TBUE STD DEV 12623.03 11538.18 10955.54 10340.12 I TBUE COEFF. 16.03 14.62 13.87 13.07 OF VARIATION DiHLY TOTAL 78490.00 78246.67 77380.00 79433.3:;, I (one sample) STANDARD DEV. (one sample) 30343.04 29923.61 29104.77 30299.34 I C()EFF. OF VAR. (one sample) 37.60 37.45 36.70 37.06 I D?\ILY TOTAL 79275.00 78526.67 78185.00 79340.00 (two sampl e) STANDARD DEV. 12680.80 11550.62 10864.95 10479.61 I (c:sq known) STANDARD DEV. (csq estimated) 13084.53 12072.19 11538.20 10518.08 I CClEFF. OF VAR. (csq known) 15.94 14.62 13.80 13.06 I CClEFF. OF VAR. 15.81 14.32 13.65 12.30 (c~;q estimated) ESTIMATE OF .06 .06 .06 .06 I C-*SQUARED The value given is the average over- 100 simulations. I I I I I I
I 6.45 Table 6.7 (continued) I SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS Channel volume=100,OOO cubic meters per 3-hour period. I Mean density subjected to diurnal perturbations. Mean Density of Organisms=100.01 1000 cubic meters Value of c-squared= .10 I Volume of Sample in cubic meters 100 1:50 200 300 I TRUE DAILY TOTAL 78634.06 78634.06 78634.06 78634.06 I TRUE STD DEV 15493.47 14623.14 14167.94 13697.63 TRUE COEFF. 19.62 18.49 17.90 17.29 I OF VARIATION DAILY TOTAL 78210.00 77720.00 77145.00 78856.67 (one sample) I STANDARD DEV. (one sample) 30873.61 30:5:50.43 30134.24 30629.:59 I COEFF. OF VAR. (one sample) 38.29 39.12 37.59 37.44 DAILY TOTAL 78525.00 78660.00 77715.00 79:501.67 I (two sample) STANDARD DEV. 1:5477.32 14659.18 14078.84 13849.14 (csq known) I STANDARD DEV. (csq estimated) 16310.58 148:57.5:5 14222.87 13852.94 I COEFF. OF VAR. (csq known) 19.55 18.39 17.75 17.15 COEFF. OF VAR. 19.60 17.92 17.21 16.00 I (csq estimated) ESTIMATE OF .14 .12 .11 .10 C-SQUARED I The value given is the average over 100 simulations. I I I I I
6.46 Table 6.7 (continued) I SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS Channel volume=100,000 cubic meters per 3-hour period. I Mean density subjected to diurnal perturbations. Mean Density of Organisms=100.01 1000 cubic meters Value of c-squared= .33 I Volume of Sample in cubic meters I 100 150 200 300 TF~UE DAILY TOTAL 78634.06 78634.06 78634.06 78634.06 I TF<UE STD DEV 24832.92 24299.44 24028.26 23753.98 I TF<UE COEFF. 31.33 30.63 30.28 29.92 OF VARIATION D/ULY TOTAL 78820.00 76206.67 81320.00 75736.67 I (one sample) STANDARD DEV. (one sample) 33701.63 32764.62 34547.50 31187.28 I C[JEFF. OF VAR. (one sample) 40.01 39.30 39.18 38.28 I DIHLY TOTAL 79810.00 79520.00 79085.00 77495.00 (t:wo sample) STANDARD DEV. 24830.25 24733.47 24189.48 23419.02 I (c:sq known) STANDARD DEV. (csq estimated) 26076.62 27165.26 23848.11 25388.81 I CCIEFF. OF VAR. (csq known) 30.09 29.93 29.27 28.63 I CCIEFF. OF VAR. 29.28 28.14 26.33 27.65 (cs.q estimated) ESTIMATE OF .44 .51 .36 .44 I C--SQUARED The value given is the average over 100 simUlations. I I I I I I
--------~-----
I 6.47 Table 6.7 (continued) I SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS Channel vOlume=100,000 cubic meters per 3-hour period. I Mean density subjected to diurnal perturbations. Mean Density of Organisms=100.01 1000 cubic meters Value of c-squared= .50 I Volume of Sample in cubic meters I TRUE DAILY 100 78634.06 150 78634.06 200 78634.06 300 78634.06 TOTAL I TRUE STD DEV 29760.75 29317.07 29092070 28866.58 I TRUE COEFF. OF VARIATION 37.52 36.94 36.65 36.35 DAILY TOTAL 74240.00 78960.00 76245.00 78360.00 I (one sample) STANDARD DEV. 32922.18 35167.60 34626.15 34388.43 (one sample) I COEFF. OF VAR. (one sample) 39.74 39.85 39.98 39.91 I DAILY TOTAL (two sample) 77070.00 78400.00 72532.50 77905.00 STANDARD DEV. 28932.82 29127.03 26556.29 29042.61 I (csq known) STANDARD DEV. 27273.06 28516.58 28219.30 24643.27 (csq estimated) I COEFF. OF VAR. (csq known) 35.29 35.01 34.42 34.37 I COEFF. OF VAR. (csq estimated) 31.33 32.94 32.77 29.05 ESTIMATE OF .51 .68 .70 .50 I C-SQUARED The value given is the average over 100 simulations. I I I I I
6.48 I Table 6.7 (continued) SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS Channel volume=100,000 cubic meters per 3-hour period. I Mean density subjected to diurnal perturbations. Mean Density of Organisms=100.01 1000 cubic meters Value of c-squared=I.00 I Volume of Sample in cubic meters I 100 150 200 300 TRUE DAILY TOTAL 78634.06 78634.06 78634.06 78634.06 I T.RUE STD DEV 41143.29 40823.51 40662.68 40501.21 I TRUE COEFF. 51.83 51.41 5l..20 50.99 OF VARIATION DAILY TOTAL 71010.00 79933.33 75930.00 81146.67 I (one sample) STANDARD DEV. (one sample) 36668.38 40420.68 39048.91 41!516.41 I COEFF. OF VAR. (one sampl e) 42.87 43.11 43.21 43.19 I DAILY TOTAL 70160.00 80856.67 74727.50 80931.67 (two sample) STANDARD DEV. 35488.73 42449.28 38651.39 41050.04 I (csq known) STANDARD DEV. (csq estimated) 35877.99 36801.18 37426.75 40008.84 I COEFF. OF VAR. (csq known) 45.91 46.36 46.48 46.57 I COEFF. OF VAR. 43.~6 40.95 43.69 43.54 (csq estimated) ESTIMATE OF 1. 73 1.58 1. 61 1. 53 I C-SQUARED The' value given is the average over 100 simulations. I I I I I I
6.49 Table 6.8 Simulation of Sampling for One Day with Diel . Variation of Density About Level of 250 Organisms/IOOO m3 SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS Channel volume=100,000 cubic meters per 3-hour period. Mean density subjected to diurnal perturbations. Mean Density of Organisms=250.01 1000 cubic meters Value of c-squared=O.OO Volume of Sample in cubic meters 100 150 200 300 TRUE DAILY 197901. 02 197901.02 197901.02 197901.02 TOTAL TRUE STD DEV 14067.73 11486.25 9947.39 8122.01 TRUE COEFF. 7.17 5.85 5.07 4.14 OF VARIATION DAILY TOTAL 199370.00 197673.33 196330.00 198413.33 (one samp'! e) STANDARD DEV. 74067.94 72579.92 72088.14 72568.70 (one sample) COEFF. OF VAR. 36.40 36.16 36.17 36.04 (one sample) DAILY TOTAL 199010.00 198050.00 196752.50 198248.33 (two sample) STANDARD DEV. 14107.09 11490.58 9918.48 8129.13 (csq known) STANDARD DEV. 15876.08 12570.07 11135.92 9278.50 (csq estimated) COEFF. OF VAR. 7.16 5.86 5.08 4.14 (ceq known> COEFF. OF VAR. 7.94 6.30 5.60 4.65 (csq estimated) ESTIMATE OF .01 0.00 0.00 0.00 C-SQUARED The value given is the average over 100 simulations.
6.50 Table 6.8 (continued) SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS Channel volume=100,000 cubic meters per 3-hour period. Mean density subjected to diurnal perturbations. Mean Density of Organisms=250.01 1000 cubic meters Value of c-squared= .05 Volume of Sample in cubic meters 100 150 200 300 TRUE DAILY 197901.02 197901.02 197901.02 197901.02 TOTAL TRUE STD DEV 26738.78 25475.39 24819.59 24145.98 TRUE COEFF. 13.41 12.75 12.41 12.06 OF VARIATION DAILY TOTAL 198290.00 196013.33 197585.00 197940.00 (one sampl e) STANDARD DEV. 74681. 39 73976.00 74101.48 74168.61 (one sample) COEFF. OF VAR. 36.72 36.81 36.48 36.40 (one sampl e) DAILY TOTAL 198590.00 197836.67 196885.00 197965.00 (two sampl e) STANDARD DEV. 26759.34 25461.84 24691.99 24119.79 (c:sq known) STANDARD DEV. 26872.48 26108.28 24899.05 24801.20 (csq estimated) COEFF. OF VAR. 13.35 12.72 12.35 11.96 (csq known) COEFF. OF VAR. 12.75 12.24 11.54 11.34 (csq estimated) ESTIMATE OF
- 0"-
,J .06 .06 .05 C-SQUARED The value given is the average over 100 simulations.
I 6.51 Table 6.8 (continued) I SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS Channel volume=lOO,OOO cubic meters per 3-hour period. I Mean density subjected to diurnal perturbations. Mean Density of Organisms=250.01 1000 cubic meters Value of c-squared= .10 I Volume of Sample in cUbic meters; I TRUE DAILY 100 150 200 300 197901.02 197901.02 197901. 02 197901.02 TOTAL I TRUE STD DEV 35100.20 34147.57 33661. 15 33167.60 I TRUE COEFF. OF VARIATION 17.55 17.05 16.80 16.54 DAILY TOTAL 201120.00 197320.00 199060.00 196833.33 I (one sample) STANDARD DEV. 77949.73 73639.50 76383.78 75424.28 (one sample) I CCIEFF. OF VAR. (one sample) 37.15 37.07 36.78 36.99 I DAILY TOTAL (two sample) 199580.00 197660.00 196995.00 196591. 67 STANDARD DEV. 35279.40 33867.13 33460.60 32894.39 I (csq known) STANDARD DEV. 38917.99 32860.79 30520.12 33284.66 (csq estimated) I COEFF. OF VAR. 17.37 16.82 16.56 16.42 (csq known) COEFF. OF VAR. 17.23 14.76 14.30 15.56 (csq estimated) ESTIMATE OF .12 .10 .09 .10 C-SGlUARED The value given is the average over 100 simulations.
6.52 I Table 6.8 (continued) SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS I Channel vo18me=100,000 cubic meters per 3-hour period. Mean density subjected to diurnal perturbations. I Mean Density of Organisms=250.01 1000 cubic meters Value of c-squared= .33 Volume of Sample in cubic meters I 100 150 200 300 TRUE DAILY 197901.02 197901.02 197901.02 197901.02 TOTAL TRUE STD DEV 60373.36 59824.54 59548.23 59270.64 I 29.82 I TRUE COEFF. 30.10 29.67 29.52. OF VARIATION DAILY TOTAL 206010.00 207346.67 198025.00 207193.33
':one sampl e) !5TANDARD DEV.
(one sample) 88565.74 87946.42 82324.18 87123.85 I COEFF. OF VAR. (one sampl e) 39.01 39.03 38.43 38.95 I DAILY TOTAL 200535.00 206943.33 198322.50 201836.67 (two sample) 61349.67 63769.10 I STANDARD DEV. 58996.41 60888.17 (csq known) STANDARD DEV. 66573.83 62657.00 61095.86 59292.57 (csq est. i mated) COEFF. OF VAR. 29.10 29.08 28.58 28.65 (csq known) COEFF. OF VAR. 28.56 25.20 26.54 25 .. 92 (csq estimated) ESTIMATE OF .44 .33 .41 .36 C-SQUARED The value given is the average over 100 simulations.
I 6.53 Table 6.8 (continued) I SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS Channel volume=100,000 cubic meters per 3-hour period. I Mean density subjected to diurnal perturbations. Mean Density of Organisms=250.0/ 1000 cubic meters Value of c-squared= .50 I Volume of Sample in cubic meters I "rRUE DAILY 100 197901.02 150 197901.02 200 197901. 02 300 197901.02 TOTAL I TRUE STD DEV 73270.15 72818.60 72591. 77 72364.23 I TRUE COEFF. OF VARIATION 36.52 36.28 36.16 36.04 DAILY TOTAL 204930.00 199560.00 203765.00 197510.00 I (one sample) STANDARD DEV. 91791. 23 89631.42 93073.29 91016.73 (one sample) I COEFF. OF VAR. (one siiimple) 40.32 39.83 39.70 39.17 I I>AILY TOTAL (two sample) 203700.00 198363.33 203932.50 208858.33 STANDARD DEV. 76008.41 .72889.67 76356.38 75987.06 I (csq known) STANDARD DEV. 74049.65 72163.53 72822.77 81650.13 (csq estimated) I COEFF. OF VAR. (cliq known) 34.90 34.50 34.09 34.07 I COEFF. OF VAR. (csq estimated) 31.22 32.85 30.94 33.75 ESTIMATE OF .59 .64 .54 .76 I C-SQUARED The value given is the average over 100 simulations. I I I I I
6.54 *1 Table 6.8 (continued) SIMULATION OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERIODS Channel volume=100,000 cubic meters per 3-hour period. I Mean density subjected to diurnal perturbations. Mean Density of Organisms=250.01 1000 cubic meters Value of c-squared=1.00 I Volume of Sample in cubic meters I 100 150 200 300 TRUE DAILY TOTAL 197901.02 197901.02 197901. 02 197901.02 I TRUE STD DEV 102660.26 102338.47 102177.19 102015.66 I TRUE COEFF. 51.14 50.97 50.88 50.80 OF VARIATION DAILY TOTAL 199140.00 194400.00 200850.00 191456.67 I (one sample) STANDARD DEV. (one sample) 99741.97 105160.68 101733.43 96379.72 I COEFF. OF VAR. (one sample) 42.20 42.28 41.84 41.69 I DAILY TOTAL 205700.00 204720.00 200625.00 200303.33 (two sample) STANDARD DEV. 103691.82 106867.63 103907.53 101982.63 I (csq known) STANDARD DEV. (csq estimated) 102056.85 116753.07 97171.64 106045.32 I COEFF. OF VAR. (csq known) 45.65 46.30 46.04 45.74 I COEFF. OF VAR. 43.07 43.00 41.83 45.66 (csq estimated) ESTIMATE OF 2.05 1.70 1. 32 1. 96 I C-SQUARED The value given is the average over 100 s1 mLtl ati ons. I I I I I I
I 6.55 I c2 'I A, used to generate L., 1=1, *** ,8, in uniform or diel patter'n v ' I The index i represents the period within the day, ranging from 1 to 8. I From v and Li , i-1, *** ,8, we can obtain the expected value, or mean, of the number of organisms in a sample of volume v drawn during period i, I
",.
- vL .* (6.6)
I I Then, using c 2 and "'i' we generate two independent, identically distribu-ted random observations Xi and Yi from the distribution I I whose probability density function is (11.3) in Appendix II, using (11.4) I and (II.6). Applying general formulas for the mean and variance of a Nega-tive binomial random variable (see Mood, Graybill, and Boes, 1974, p. 538) to this distribution shows routinely that for ;=1, *.. ,8, I I = u.
.1 + C 211.,
2
"" 1 (6.7)
I and likewise for Yi . I The parameters of the situation are the true underlying values of the I various quantities of interest. The following important parameters can-not be calculated from the sample data, observations X. and Y.; they can 1 1 be calculated in this simulation from the values of c2 and )J., as well as I v and V, but would be unknown in a real sampling situation. Throughout this section, all summations E will indicate sums as i ranees from 1 to 8. In addition, it will be useful to define the samplingrati~ R = V/v and I the variable for i=1, ... ,8 I
I 6.56 Z.1 = (X.1 + Y.)/2,, (6.8) I which has E(Z,') = u., and var(Z.)
= (u.1 + c2u~)/2.
1 I' As was noted in Section 6.1, the true realized value T of the sampling I dayHs total organism count is given by
= REu.*
I T 1 (6.9) I The one-sample and two-sample estimators of Tare
.. "I Tl = R!X.1 (6.10) .. ,I T2 = R!Z; .
Both are unbiased estimators of T, since (for example) I E(T 1 ) = R!E(X.) = R!u.1 = T (6.11) I 1 Not surprisingly, the variance of the one-sample estimator is twice that I of the two-sample I:sti ma tor:
,I = R2!var(X,') = R2 22 !~u.+c u.)
I 1 1 (6.12) I Standard deviations (SO) and coefficients of variation (CV) are easily obtained as I I (6.13)
..... A CV(T 1) = SO(T 1)/E(T 1) = SO(T 1)/T I and similarly for T2.
I I
I 6.57 I We now examine the variability of the one-sample estimator of T. Under the assumption that c2 = 1, the estimated variance, SO, and CV of T1 are !I est var(T 1)
- R2test var(X i ) = R2rX i (1+X i )/2 I est SC(T 1) = [est var(T )]1/2 1 (6.14)
I ~ A A est CV(T 1)
- est SO(T 1)/T 1.
The variability of the two-sample estimator of T must be ~ons;dered in two I A ~ 2 ~c ways. Let T~ denote the estimator T2 when c is known, T, the estima-tor when c2 must be estimated. When the value of c2 is knowr. I (6.15) I This choice of estimator for ~he variance of T~ is based on this esti- ° 1 mator's unbiasedness for var(T 2), that is, E[est var(T~)] = var(T 2). (6.16) I When the value of c2 is unknown and must be estimated, substituting the I estimate ~2 for c2 gives
'I A
est var(TCz) = R2t(Z.+~2Z~)/(2+~2). 1 1 (6.17) I For both of these situations, estimated 50's and CV's are calculated in the usual way. I It is informative to see the effect of erroneously assuming in these computations that each observation Xi or Yi has a Poisson distribution: I ~ est var p(T 1) A
= R2rx.1 = RT1 (6.18)
I ~ .. est var p{T 2) = R2rz;l2 = RT 2/2 . I These formulas will be instrumental in assessing the accuracy of various I entrainment estimation procedures over an entire sampling season. This will be treated in Section 6.4. I
I 6.58 6.3 RESULTS OF THE SIMULATION STUDY OF DAILY ENTRAINMENT I The accuracy of entrainment estimation is affected by the sample volume I v, the organism density ~, the quadratic variance parameter c2 , and the temporal pattern (uniform or diel variation). The relative performance of one-sample and two-sample procedures is also of interest. The behavior of estimation procedures can be judged by examining the standard devia-I tions and coefficients of variation associated with them. The results reported in Tables 6.1 to 6.8. will now be discussed in detail. I The sample volume v assumed values of 100, ISO, 200, and 300 cubic meters per 3-hour period of this study. Over the range from 150 to 300, changes in sample volume can be seen to have relatively little impact on the I accur!cy of entrainrnent estimation. Increasing the sample volume results in lower standard deviations and coefficients of variation, as we should expect, but the changes are sma 11 as v increases from 150 to 300. These
'I changes can be observed by reading from left to right across any row of Tables 6.1 to 6.8. The slight gains in accuracy are not sufficient to justify using $amplE~ volumes larger than 150; it would be much more pro-I ductive to in~rease the amount of replicate sampling done instead of increasing the volume v of each individual sample. This point will be returned to in Section 6. ~ where sampling season calculations are dis-I cussed. (Other values of v were considered in preliminary work but were found to lie outSide the range of usefulness: they were either too small to provide reasonably accurate estimation or too large to be practical I
for the same reasons as v = 300.) The density ~ of organisms in the discharge channel has a greater effect I than the sample volume does on entrainment estimation. This confirms the intuitive belief that the estimation of sparsely distributed organisms and the estimation of more densely distributed ones can be achieved with I differing levels of relative and absolute accuracy. The values ~ = 10, S0, 100, and 250 organisms per 1000 cubic meters were used in the simu-lation study. As ~ increases while the combination of c 2 , v, and temporal I pattern is fixed, standard deviations increase but coefficients of varia-tion decrease. For example, with c 2 = .10, v = 150, and uniform variation, I two-sample estimates with c 2 estimated (as it would be if its value were not known in advance) can be extracted from Tables 6.1 to 6.4, giving I Density), 10 50 100 250 Estd daily total t 7,940.00 39,900.00 79,923.33 203,683.33 I
~
Estd std dev of T A Estd CV of T 2,624.67 6,798.37 12,059.94 27,008.35 I 33.06% 16.78% 14.45% 12.81% I I
I 6.59 I This display illustrates that as density increases, the variability of I entrainment estimation increases when viewed in absolute ':erms, as a standard deviation, but decreases when viewed in relative terms, as a coefficient of variation. This distinction makes it clear that the decision whether to measure the efficiency of entrainment estimation pro-I cedures in terms of relative or absolute error is an important choice with far-reaching implications. 'I Two temporal patterns were examined. Under uniform variation, each of a simulated day's eight 3-hour sampling periods had its density L drawn independently from a uniform distribution centered at A. Under diel I variation, three of a dayls sampling'periods had their de~sit;es L drawn independently from a uniform distribution centered at 2.25A, while the remaining five periods had densities L drawn independently from a uniform I distribution centered at .2SA. Once these Lis had been randomly genera-ted for a given day, sample counts were simulated uSing these realized densities L, as described in Section 6.1. I There is little difference between these two temporal patterns in the simulation results. For given values of the average daily density A, I c2, and v, the diel pattern produces greater variability in the entrain~ ment estimates than the uniform pattern does. This is not surprising, because the diel pattern puts most of the organisms that appear during a I day into three relatively dense periods rather than spreading the organisms fairly evenly over eight periods. Therefore the bulk of the work in esti-mating entrainment must be done by three samples of volume v rather than I by eight samples. Aside from this natural difference between the uniform and diel patterns, the simulation results for these two te~poral patterns are quite similar. I It is straightforward to adapt the results of this simulation study to situations in which the volume V of the discharge channel flow differs I from the value of 100,000 cubic meters per 3-hour period used here. Con-version of the results in Tables 6.1 to 6.8 to apply to a discharge channel flow of V cubic meters per 3-hour period requires that: I (i) daily totals, both true and estimated, must be ad~usted by multiplying each table entry of this type by V/100,OOO; I (ii) standard deviations, both true and estimated, must be adjusted by multiplying each table entry of this type by V/100,OOO; I (iii) coefficients of variation, both true and estimated, are not changed and thus need no adjustment. I I I
I 6.60 ThE! simulation of daily entrainment sampling is not intended to illustrate eVE!ry sampling situation that actually occurs, but rather a representa-I tive range of situations. There are two reasons for this: first, imple-menting an exhaustive approach would require prohibitive amounts of time and effort; second, the consequent proliferation of detail would compli-I cate enormously ti1e tasks of assimilating, presenting, and comprehending the study's results, without improving our understanding of the entrain-ment process in any w~y. Any actual one-day sampling situation is a com-I bination of a density pattern over the day*s eight 3-hour periods, an average organism density A, a sample volume v, a discharge channel flow volume V, and a quadratic variance parameter c 2 A broad range of com-I binations of these variables was analyzed in the simulation study. Any actual sampling situation can be handled by (i) approximation if it is vet*y similar to one of the cases analyzed, (ii) interpolation if one or I more variables assume values between those analyzed, (iii) extrapola-tion if one or more variables assume values outside, but not too far from, the range analyzed, or (iv) a combination of these three methods. I This is a more effective and illuminating approach than attempting to I simulate every potential sampling situation that might occur. The daily simulation study can be used to examine the behavior of entrain-ment and its estimation over an entire sampling season. This is the pri-I mary issue in the ,evaluation of entrainment sampling designs. It is the topic of the rest of this chapter. I 6.4 SIMULATION RESULTS CONCERNING ESTIMATION OF c 2 I l The estimates of c in Tables 6.1 to 6.8 are somewhat unsatisfactory. Comparing the true value of c2, given in the table headings, to the esti- I mates on the bottom line of each page of these tables, we see that the 2 average estimate of c over 100 simulations can be far from the true value of ~2. T~i~ happens when organism density is low, c 2 is high, or I both. Th1S def1C1E!ncy must be remedied before we can seriously consider
. A2. *.
uS1ng C 1n our est1mat10n procedures for entrainment abundance. I I~ Table~ 6.1 to 6.8, c 2 is :stimated in each simulation based on only e1ght pa1red 3-hour observat10ns. This small number of pairs cannot be I expected to lead to very accurate estimates of c 2. Not only can the A2 average c over 100 simulations under specified conditions differ substan-tially from the trUie value of c2 , but the individual 2*s vary greatly c I from one simulation to another. . I I I
I 6.61 I Using a larger number of paired observations should improve this situation. I T~is is logical on both mathematical and practical grounds: we would be uncomfortable uSing only eight paired observations to estimate c2 and using the resulting ~2, which is likely to be inaccurate, in seasonal I calculations. The key issue to be resolved is the number of paired observations needed to produce more accurate estimates of c 2
- I Tables 6.9 to 6.22 address this question. Tables 6.9 to 6.12 report the results of simulating one week, consisting of 56 3-hour periods, rather than one day, which contains only a 3-hour periods. The mean density of I organisms is A = 10, 50, 100, and 250 organisms per 1000 cubic meters in Tables 6.9, 6.10, 6.11, and 6.12, respectively. Perturbation of the density about these mean levels is uniform, as in Tables 6.1 to 6.4. Results are I quite similar for die1 variation of the density, which is '!lot tabled in this report. Tables 6.13, 6.14, 6.15, and 6.16 are identical to Tables 6.9 to 6.12, except that they simulate five days, consisting of I 40 3-hour periods, rather than 7 days. As with Tables 6.9 to 6.12, the density is uniformly perturbed, and similar results for diel varia-tion are not reported. Tables 6.17 to 6.20 are also identical to Tables 6.9 to 6.12, except that they simulate three days, consisting of 24 3-hour I periods, rather than 7 days. Similar results for diel variation are not reported. Tables 6.21 and 6.22 report on simulations of la-day and 34-day entrainment seasons, each consisting of a series of days of different I mean organism intensity. Again, only uniform variation of the density level is discussed in detail; results for the case of diel variation are quite similar to these.
I In these tables, the average estimate of c2 over 100 simulations is always very close to the true value of c2* Estimation improves as the mean I density of organisms increases, becoming very good indeed for A = 100 and 250 organisms per 1000 cubic meters. This tells us that when we have a reasonably large number of paired observations, e.g., 24 to 56 of them, the I I [Text continues on page 6.146J I I I I I
I 6.62 I I Table 6.9 Simulation of Sampling for One Week with Uniform 3 Variation of Density About Level of 10 Organisms/1DDD m I SIMULATION OF SAMPLING FOR ONE WEEK- FIFTY SIX 3-HOUR PERIODS Channel volume=100,000 cubic meters per 3-hour period. I Mean density subjected to random perturbations. Mean Density of Organisms= 10.01 1000 cubic meters Value of c-squared=O.OO I Volume of Sample in cubic meters 100 150 200 300 I TRUE WEEKLY 56132.56 56132.56 56132.56 56132.56 TOTAL TRUE STD DEV 7492.17 6117.33 5297.76 4325.60 I TRUE COEFF. OF VARIATION 13 .. 35 10.90 9.44 7.71 I
'.... EEKLY TOTAL. 55900.00 57293.33 57055 .. 00 55423~33 (one sample)
STANDARD DEV. (one sample) 9269.,30 8341. 06 7769.97 6949.50 I COEFF. OF VAR. (one sample) 16.61 14.56 13.59 12.53 I WEEKLY TOTAL 55915.00 56080.00 56645.00 55666.67 (two sample) STANDARD DEV. 7477.63 6114.46 5321.89 4307.62 I (csq known) STANDARD DEV. (,=sq estimated) 7694.48 6316.47 5594.86 4464.55 I COEFF. OF VAR. 13.41 10.93 9.42 7 .. 75 (csq known) COEFF. OF VAR. 13.78 11.30 9.87 8.03 I (,=sq estimated) ESTIMATE OF C-SQUARED
.06 .05 .. 05 .03 I
The value given is the average over 100 sImulations. I I I I
I I 6.63 I I Table 6.9 (continued) I SIMULATION OF SAMPLING FOR ONE WEEK- FIFTY SIX 3-HOUR PERIODS eh.nnel lIoluma-l00,000 cubic mat.r. par 3-hour pErriod. I Maan density subjected to r.ndom perturbations. Mean Dansity of Orgoanisms= 10.01 1000 cubic maters Value of c-squarad= .05 I 100 Volume of Sample in cubic meters 150 200 300 I TRUE WEEKLY TOTAL 56132.56 56132.56 56132.56 56132.56 TRUE STD DEV 7692.68 6361.3:: 5577.73 4664.29 I TRUE eOEFF. OF VARIATION 13.71 11.34 9.94 8.31 I WEEKLY TOTAL (one sample) 53720.00 55106.67 55225.00 56133.33 STANDARD DEV. 9131.81 8212.73 7649.84 7122.89 I (oroe sample) eOEFF. OF VAR. (one sample) 17.04 14.90 13.83 12.67 I WEEKLY TOTAL (two soample) 5:5300.00 55843.33 56025.00 56228.33 STANDARD DEV. 7633.99 6345.36 :5573.18 4669.48 I (csq known) STANDARD DEV. 7863.84 6491.53 5677 .15 4679.45 (C:!lq estimated) I eOEFF. OF VAR. (C:liq known) 13.87 11.40 9.97 8.32 eOEFF. OF VAR. 14.26 11.64 10.11 8.29 I (csq estimated) ESTIMATE OF .13 .09 .07' .05 C-SQUARED I The value gillen is the average over 100 simulations. I I I
I 6.64 I I Table 6.9 (continued) I SIMULATION elF SAMPLING FOR ONE WEEK- FIFTY SIX 3-HOUR PERIODS Channe'l vOlume=100,000 c:ubic: meters per 3-hour period. I Me.lin densi ty subjected to random perturbations. Mean Density of Cirganisms= 10.01 1000 cubic meters Value of c-squared= .10 I I Volume of Sample in cubic: meters 100 150 200 300 TRUE WEEKLY 56132.56 56132.56 56132.56 56132.56 TOTAL TRUE STO DEV 7888.10 6596.31 5844.30 4980.00 I TRUE COEFF. OF VARIATION 14.06 11.76 10.42 8.87 I WEEKLY TOTAL 57170.00 56660.00 55670.00 54536.67 (one sample) STANDARD DEV. 9534.15 8493.66 7833.10 7047.54 I (one sample) COEFF. OF VAR. (one sample) 16.72 15.00 14.05 12.89 I WEEKLY TOTAL 55995.00 56420.00 55717.50 55583033 (two sample) STANDARD DEV. 7879.25 6613.85 5826.11 4950.73 I (c:sq known) STANDARD OEV. (c:sq estimated) 7933.16 6850.47 5915.97 4905.33 I I COEFF. OF VAR. 14.13 11.75 10.48 8.92 (c:sq known) COEFF. OF .... AR. 14 .. 22 12.13 10.61 8.80 (c:sq estimated) ESTIMATE OF C-SQUARED
.13 .17 .13 .09 I The value given is the average over 100 simulations.
I I I I
I 6.65 I I I Table 6.9 (continued) I SIMULATION OF SAMPLING FOR ONE WEEK- FIFTY SIX 3-HOUR PERIODS Channel volume=100,000 cubic meters per 3-hour period. Mean denSity subjected to random perturbations. I Mean Density of Organisms= 10.01 1000 cubic meters Value of c-squared= .33 I 100 Volume of Sample in cubic meters 150 200 300 TRUE WEEKLY 56132.:56 :56132.:56 56132.:56 56132.56 I TOTAL TRUE STD DEV 8742.48 7597.37 6954.47 6245.74 I TRUE COEFF. OF VARIATION 15.:58 13.54 12.39 11.13 WEEKLY TOTAL 54800.00 54880.00 55920.00 57023.33 I (one aample) STANDARD DEV. 9558.77 8707.09 8270.13 7745.68 (one vampl.) I COEFF. OF VAR. (one vampl.) 17.43 15.86 14.74 13.:55 56550.00 I WEEKLY TOTAL 55855.00 56193.33 55860.00 (two sample) STANDARD DEV. 8693.21 7620.26 7002.56 6211.84 (cmq known) I STANDARD DEV. (csq e.timatlld) 8704.88 7688.38 6970.47 6089.52 I COEFF. OF VAR. (csq known) COEFF. OF VAR. 15.60 15.59 13.58 13.60 12.38 12.26 11.11 10.B7 (csq estimated) I ESTIMATE OF C-SQUARED
.40 .37 . ""!I"~ ~..,
I The value given is the average over 100 simulations. I I I
6.66 I I I Table 6.9 (continued) I SIMULATION (IF SAMPLING FOR ONE WEEK- FIFTY SIX 3-HOUR PERIODS ChannE!l volume=100,000 c:ubic: meters per 3-hour period. MeeLn densi ty subjec:ted to random perturbations. I Mean Density of Organisms= 10.01 1000 c:ubic: meters Value of c:-squared= .50 I Volume of Sample in c:ubic: meters 100 150 200 300 I TRUE WEEKLY 56132 .. 56 56132.56 56132.56 56132.56 I TOTAL TRUE STD DEV 9304.86 8238.30 7649.45 7011. 33 TRUE COEFF. OF VARIATION 16.58 14.68 13.63 12.49 I WEEKLY TOTAL 56910.00 55673.33 53980.00 54573.33 (one sample) STANDARD DEV. (one sample) 10173.99 9126.03 8350.90 7840.56 I COEFF. OF VAR. (one sample) 17.83 16.29 15.38 14.31 I WEEKLY TOTAL 57320.00 55630.00 55342 .. 50 55481. 67 (two sample) STANDARD DEV. 9451. 67 8234.70 7559.35 6953.30 I (csq known) STANDARD DEV. ':c:sq estimated) 9633 .. ::8 8263.04 7553.06 6980.41 I COEFF. OF VAR. 16.51 14.79 13.64 12 .. 52 (csq known) COEFF. OF VAR. 16.73 14.79 13.55 12.52 I ':csq estimated) ESTIMATE OF C-SQUARED
.64 .58 c-.-, ... -..1 ..... .. S3 I
The value given is the average over 100 simulations. I I I I
I I 6.67 I I Table 6.9 (continued) I SIMU~ATION OF SAMP~ING FOR ONE WEEK- FIFTY SIX 3-HOUR PERIODS Channel volume-100,000 cubic meters per 3-hour period. I Mean density subjected to random perturbations. Mean DenSity of Organisms a 10.01 1000 cubic meters Value of c-squared~1.00 I 100 Volume of Sample in cubic meters 150 200 300 I TRUE WEEK~Y TOTA~ 56132.56 :56132.56 :56132.56 56132.56 TRUE STD DEV 10817.96 9915.:51 9431.97 892.2.25 I TRUE COEFF. OF VARIATION 19.27 17.66 16.80 15.89 I WEEK~Y TOTA~ (one s_mple)
- 54440.00 55686.67 57385.00 56533.33 STANDARD DEV. 10:582.06 9881.75 9467.44 9007.03 I (one sample)
COEFF. OF VAR. (one s_mple) 19.38 17.62 16.40 15.79 I WEEK~Y TOTA~ (two sample) . 56215.00 565:0:6.67 55522.50 56863.33 STANDARD DEV. 10853.9~ 9964.~ 9286.03 9078.85 I (csq known) STANDARD DEV. 10830.28 987:5.22 9184.70 9229.00 (csq estimated) I COEFF. OF VAR. (csq known) 19.29 17.58 16.67 15.88 COEFF. OF VAR. 19.11 17.32 16.45 16.08 I (csq estimated) ESTIMATE OF 1.15 1.03 1. OS 1. 15 C-SQUARED I The value given is the average over 100 simulations. I I I
I 6.68 I I Table 6.10 Simulation of Sampling for One Week with Uniform 3 Variation of Density About Level of 50 Organisms/lOOO m I SIMULATION OF' SAMPLING FOR ONE WEEK- FIFTY SIX 3-HOUR PERIODS Channel volume=100,000 cubic meters per 3-hour period. Mean density subjected to random perturbations. I
/'lean Density of O,."ganisms= 50.01 1000 cubic meters Value of c-squared=O.OO I
Volume of Sample in cubic meters 100 150 200 300 I TRUE WEEKLY 279716.68 279716.68 279716.68 279716.68 I TOTAL TRUE STD DEV 16724.73 13655.69 11826.17 9656.03 TRUE COEFF. OF VARIATION 5.98 4.88 4.23 3.45 I WEEf(LY TOTAL 279550.00 279206.67 277540.00 278406.67 (one sample) STANDARD DEV. {one sample) 32201.40 30638.61 29728.35 29041.31 I C:OEFF. OF VAR. (one sample) 11.50 10.96 10.70 10.42 I L-IEEKLY TOTAL 278810.00 278303.33 278800.00 279345.00 (two sample) STANDARD DEV. 16697.60 13621. 14 11806.78 9649.61 I (csq known) STANDARD DEV. (c:sq estimated) 17227.01 14113.59 12228.86 10019.84 I C:OEFF. OF VAR. 6.00 4.90 4.24 3.46 (csq known) C:OEFF. OF \jAR. 6.18 5.07 4.38 3 .. ::;8 I (c:sq estimat:ed) ESTIMATE OF C-SQlJARED
.01 .01 .. 01 0.00 I
The value given is the average over 100 5iillulati~ns. I I I I
I I 6.69 I I Table 6.10 (continued) I SIMULATION OF SAMPLING FOR ONE WEEK- FIFTY SIX 3-HOUR PERIODS Channal voluma-l00,OOO cubic maters per 3-hour period. I Mean dansity subjected to random perturbations. Mean Density of Organisms= 50.0/ 1000 cubic meters Value of c-squared= .05 I 100 Volume of Sample in cubic meters 150 200 300 I TRUE WEEKLY TOTAL 279716.68 279716.68 279716.68 279716.68 TRUE STD DEV 18851.82 16191.12 14681. 04 12996.67 I TRUE COEFF. OF VARIATION 6.74 5.79 5.25 4.65 I WEEKLY TOTAL (one sample) 277640.00 277960.00 282005.00 280680.00 STANDARD DEV. 32563.17 31092.62 30750.98 29883.94 I (one sample) COEFF. OF VAR. (one sample) 11.71 11.17 10.89 10.63 I WEEKLY TOTAL (two sampla) 279810.00 280396.67 279155.00 280436.67 STANDARD DEV. 18858.57 16218.14 14666.57 13014.75 I (csq known) STANDARD DEV. (csq estimatad) 18917.19 16066.60 14417.70 13223.97 I COEFF. OF VAR. (csq Imown) 6.75 5.79 :5.26 4.64 COEFF. OF VAR. 6.74 5.71 5.14 4.69 I (csq estimated) ESTIMATE OF .05 .05 .05 .05 C-SQUARED I The valul! given is the average over 100 simulations. I I I
I 6.70 I I Table 6.10 (continued) I SIMULATION OF SAMPLING FOR ONE WEEK- FIFTY SIX 3-HOUR PERIODS Channel volume=100,OOO cubic meters per 3-hour period. I Mean density subjected to random perturbations. Mean Densit.y of Organisms= 50.0/ 1000 cubic meters Value of c-squared= .10 I Volume of Sample in cubic meters 100 150 200 300 I TRUE WEEKLY 27'9716.68 279716.68 279716.68 279716.68 TOTAL TRUE STD DEV 20762.12 18380.06 17064.80 15639.31 I TRUE COEFF. OF W~RIATION 7.42 6.57 6.10 5.59 I WEEKLY TOTAL 2713980.00 276546.67 274215.00 277360.00 (one sample) STANDARD DEV. 3:)402.25 31643.50 30490.20 30103.32 I (one sample) COEFF. OF IJAR. (one sample) 11.95 11.41 11.09 10.83 I WEEI<LY TOTAL 277765.00 279230.00 277080.00 279543033 (two sample) STANDARD DEV. 20676.21 18361.40 16933.33 15612.36 I (csq known) STANDARD DEV. (csq estimated) 21055.52 18201.04 16861. 94 15669.04 I COEFF. OF VAR. 7.45 6.58 6.11 5.58 (csq known) COEFF. OF VAR. 7.55 6.48 6.06 5.57 I (csq estimated) ESTIMATE OF C-SQUARED
- 11 .10 .10 " 10 I The value given is the average over 100 simulations.
I I I
I 6.71 I I I Table 6.10 (continued) I SIMU~ATION OF SAMP~ING FOR ONE WEEK- FIFTY SIX 3-HOUR PERIODS Channel vo!um.-l00,000 cubic meters par 3-hour period. M.an density subjected to random perturbations. I Mean Density of Organisms- ~O.O/ 1000 cubic meters Val u. of c-!!iquared= .33 I 100 Volume of Sample in cubic meters 1~0 200 300 TRUE 279716.68 279716.68 279716.68 279716.68 I WEEK~Y TOTA~ TRUE STC DEV 28003.72 26286.30 2~384.05 24448.53 I TRUE COEFF. OF VARIATION 10.01 9.40 9.07 8.74 I WEEK~Y TOTA~ (one sample)
.STANDARD DEV.
276130.00 35304.11 281446.67 345~9.51 282130.00 34155.78 279393.33 33204.9~ (one sample) I COEFF. OF VAR. (one liollmple> 12.73 12.23 12.03 11.83 WEEK~Y TOTA~ 279905.00 277986.67 282552.50 278235.00 (two sample> STANDARD DEV. 27993.30 26071.86 25670.49 24360.03 Ceaq known) STANDARD DEV. 27647.33 25680.60 25853.70 24925.22 (csq estimated) COEFF. OF VAR. 9.99 9.36 9.07 8.74 (csq known) COEFF. OF VAR. 9.79 9.17 9.05 8.85 (csq estimated) ESTIMATE OF .32 .32 .34 .36 C-SQUARED The value given is the average over 100 simUlations.
I 6.72 I I Table 6.10 (continued) I SIMULATION OF SAMPLING FOR ONE WEEK- FIFTY SIX 3-HOUR PERIODS Channel volume=100,OOO cubic meters per 3-hour period. I Mean density subjected to random perturbations. Mean Densi ty e,f Organisms= 50.0/ 1000 cubic meters Value of c-squared= .50 I I Volume of Sample in cubic meters 100 150 200 300 TRUE WEEKLY 279716.68 279716.68 279716.68 279716.68 TOTAL TRUE STD DEV 32194.12 30711.93 29943.33 29154.48 I I TRUE COEFF. 11. 51 10.98 10.70 10.42 OF VARIATION WEEKLY TOTAL 272190.00 271060.00 277635.00 275390.00 (one sample) STANDARD DEV. 36860.95 35147.50 35495.28 34371.21 (one sample) COEFF. OF VA". 13.46 12.97 12.71 12.40 (one sample) WEEKLY TOTAL. 277135.00 276073.33 281080.00 279421.67 (two sample) STANDARD DEV. 32030.81 30337.60 30097.53 ::9155.59 (csq known) STANDARD DEV. 31850.67 30016.29 30542.14 28940.45 (csq esti matE!d) COEFF. OF W~R. 11.53 10.96 10.68 10.41 (csq known) CCEFF. OF Vj~R. 11.39 10.81 10.75 10.27 (csq esti mat'2d) ESTIMATE OF .51 .. 50
..... .e, .. 50 C-SQUARED The value given is the average over 100 simulations.
I 6.73 I I I Table 6.10 (continued) I SIMULATION OF SAMPLING FOR ONE WEEK- FIFTY SIX 3-HOUR PERIODS I Mean Density Mean density subjected to random perturbations. o~ Organisms- 50.01 1000 cubic metars Value o~ c-squared-l.00 I 100 Volume o~ 150 Sample in cubic meters 200 300 I TRUE WEEKLY TOTAL TRUE STD DEV 279716.68 42346.27 279716.68 41230.66 279716.68 40661.38 279716.68 40084.02 I TRUE COEFF. OF VARIATION 15.14 14.74 14.53 14.33 I WEEKLY TOTAL (one sample) 283050.00 282066.67 279990.00 272660.00 STANDARD DEV. 42584.50 41239.84 40726.47 39847.77 (one sample) I COEFF. OF VAR." (one sample) 14.92 14.41 14.39 14.31 I WEEKLY TOTAL (two sample) 276495.00 282426.67 279132.50 277048.33 STANDARD DEV. 41630.06 41593.91 40610.87 40157.69 I (ceq known) STANDARD DEV. (csq .stimated) 41206.01 41226.53 41346.54 40390.43 I COEFF. OF VAR. (csc! known) 14.98 14.63 14.46 14.32 COEFF. OF VAR. 14.78 14.44 14.63 14.27 I (csq .stimated) ESTIMATE OF C-SQUARED 1.02 1.02 1. 14 1. 08 I The value given is the average over 100 si mul ati ons. I I I
I 6.74 I I Table 6.11 Simulation of Sampling for One Week with Uniform Variation of Density About Level of 100 Organisms/lOOO m3 I SIMULATION OF SAMPLING FOR ONE WEEK- FIFTY SIX 3-HOUR PERIODS Chcknnel volume=100,000 c:ubic: meters per 3-hour period. I Mean density subjec:ted to random perturbations. Mean Density e,f Organisms=100.01 1000 c:ubic: meters Value of c:-squared=O.OO I 100 Volume of Sample in c:ubic: meters 150 200 300 I I TRUE WEEKLY 559434.61 559434.61 559434.61 559434.61 TOTAL TRUE STD DEV 23652.37 19312.08 16724.75 13655.70 TRUE COEFF. OF VAR IATION 4.23 3.45 2.99 2.44 I WEEKLY TOTAL (Qne sample) STANDARD DEV. 558880.00 59884.47 558633.33 58264.07 558720.00 57409.99 561216.67 56904.05 I (one sample) COEFF. OF VAR. (vne sample) 10.70 10.42 10.27 10.13 I WEEKLY TOTAL (two sample) 558680.00 559696.67 558102.50 16704.83 560765.00 1::671.93 I STANDARD DEV. 23636.41 19316.60 (c:sq known) STANDARD DEV. (c:sq estimated) 24528.55 20106.66 17449.75 14284.43 I COEFF. OF VAR. (c:sq known) 4.23 3 .. 45 2.99 2.44 I COEFF. OF VAR. 4.::9 3 .. 59 3.12 ::.54 I ( c:sq estimated) ESTIMATE OF .01 .01 0.00 0.00 C-SQUARED The value given is the average over 100 simulations. I I I I
I 6.75 I I I Table 6.11 (continued) I SIMULATION OF SAMPLING FOR ONE WEEK- FIFTY SIX 3-HOUR PERIODS Channal voluma-l00,000 cubic meters per 3-hour period. Mean density subjected to random perturbations. I Mean Density of Organisms-l00.0/ 1000 cubic meters Value of c-squarad- .05 I 100 Volume of Sample in cubic meters 150 200 300 I TRUE WEEKLY TOTAL TRUE STD DEV
~9434.61 293:59.08 559434.61
- 5989.95 559434.61 24129.62
OF VARIATION 5.25 4.6:5 4.31 3.9~ I WEEKL.Y TOTAL (one sample) STANDARD DEV. 560490.00 61369.05 556440.00
- 59379.27 557255.00 58650.98 563413.33 58382.84 (one SAmple)
- 1 COEFF. OF VAR.
(one sample) 10.93 10.66 10.:51 10.35 I WEEKLY TOTAL. (two sample)
- 562300.00 557700.00 556107.50 559011. 67 STANDARD DEV. 29478.30 25935.47 24019.64 22113.86 (csq I:nown)
I STANDARD DEV. (csq estimAtad) 29671.20 25743.43 24157.94 21780.38 I COEFF. OF VAR. (csq known) 5.24 4.65 4.32 3.96 COEFF. OF VAR. 5.25 4.60 4.3~ 3.88 I (csq estimated) ESTIMATE OF C-SQUARED
.05 .05 .05 .05 I The value given is the average over 100 simulations.
I I I
I 6.76 I I Table 6.11 (continued) I SIMULATION OF SAMPLING FOR ONE WEEK- FIFTY SIX 3-HOUR PERIODS Ch,mnel volume=100,000 c:ubic: meters per 3-hour period. I Mean density subjec:ted to random perturbations. Mean Density of Organisms=100.01 1000 cubic: meters Value of c:-squared= .10 I 100 Volume of Sample in c:ubic: meters 150 200 300 I TRUE WEEKLY 559434.61 559434.61 559434.61 559434.61 TOTAL TRUE STD DEV 34124.43 31272.97 29744.91 28133.97 I TRUE COEFF. OF VARIATION 6.10 5.59 C" ~~
...J * ...;, .... 5.03 I WEEKLY TOTAL 560860.00 5~8493.33 554360.00 551643.33 (one sample)
STANDARD DEV. 62461.75 60809.68 59604.19 58374.87 I (one sample) COEFF. OF VAR. (one sample) 11
- 11 10.87 10.72 10.57 I WEEKLY TOTAL (two sample)
STANDARD DEV. 561320.00 34194.06 557283.33 31190. 11 559305.00 29769.51 555203.33 27926.91 I (c:sq known) STANDARD DEV. (csq esti mated)' 34406.25 31893.47 29651.22 27797.92 I COEFF. OF VAR. 6.09 5.60 I
~'"'"
II:::'
..J * ..j~ 5.03 (c:sq known)
COEFF. OF VAR .. 6.10 5.69 5.27 4.97 (csq esti mated;' ESTIMATE OF C-SQUARED
.10
- 11 .10 .10 I The value given is the average over 100 simulations. I I
I I
I 6.77 I I I Table 6.11 lcont1nued) I SIMULATION OF SAMPLING FOR ONE WEEI<- FIFTY SIX 3-HOUR PERICIDS Mean density subjected to random perturbations. I Mean Density of Organisms=100.01 1000 cubic meters Val ue of c:-squared= .33 I 100 Volume of Sample in cubic meters l~O 200 300 I TRUE WEEKLY ~~9434.61 359434.61 5~9434.61 ~~9434.61 TOTAL TRUE STD DEV 307~6.~0 4aa8~.01 47921.86 46938.96 I TRUE COEFF. OF VARIATION 9.07 8.74 8.56 8.39 I WEEKLY TOTAL (one sample) STANDARD DEV. 354730.00 67384.49 5~6646.67 66063.91 566120.00 66062.21 563926.67 65412.20 (one sample) I COEFF. OF" VAR. (one sample) 12.09 11.81 11.60 11.34 I WEEKLY TOTAL (two sample) STANDARD DEV. 5~9393.00 S0837.81 563360.00 49255.70 555937.50 47520.22 563626.67 47302.47 (csq known) I STANDARD DEV. (csq estimated)
- 50938.15 48674.02 46967.4~ 48152.07 I COEFF. OF VAR.
(csq known) 9.07 8.73 8.52 8.38 8.47 COEFF. OF VAR. 9.03 8.59 8.36 I (ceq estimated) ESTIMATE OF C-SGlUARED
.34
- oJ...,) .33 .:;5 I The value given is the average over 100 simulations.
I I I
6.78 o I I Table 6.11 (continued) I SIMULATION OF SAMPLING FOR ONE WEEK- FIFTY SIX 3-HOUR PERIODS Channel volume=100,OOO cubic meters per 3-hour period. I
lean densi ty subjected to random perturbations.
Mean Densi ty o,f Organisms=100.0/ 1000 cubic meters Value of c-squared= .50 I Volume of Sample in cubic meters 100 150 200 300 I TRUE WEEKLY 5::;9434.61 ::;59434.61 559434.61 559434.61 TOTAL TRUE STD DEV 59871.91 58293.80 57488.51 56671.77 I TRUE COEFF. OF VARIATION 10.70 10.42 10.27 10.13 I WEEKLY TOTAL 556820.00 548866.67 542975.00 552023.33 (one sample) STANDARD DEV. (one sample) 71530.20 68888.99 67360.54 67873.61 I COEFF. OF VAR. (one sample) 12.74 12.48 12.32 12.22 I WEEKLY TOTAL 553640.00 558400.00 550707.::;0 560683.33 (two sample) STANDARD DEV. 59355.:::: 58219.43 56628.14 56747.65 I (csc known) STANDARD DEV. (csq estimated) 60166.52 58607.32 56190.05 57164.58 I COEFF .. OF VAR. 10.68 10.40 10.26 10.08 (csq known) COEFF .. OF VAR. 10.78 10.39 10.14 10.08 I (csq estimated) ESTIMATE OF C-SQUARED
. co~ ~,.;,
co.,.
* ..J ..... .50 co.,. .~~
I The value given is the average over 100 simulations. I I I I
- *
- 0 _ 1 *** _ _. _ : c _ I I_ _ =_I _ _ '
I 6.79 I I I Table 6.11 (continued) I SIMULATION OF SAMPLING FOR ONE WEEK- FIFTY SIX 3-HOUR PERIODS Channal volumac 100,000 cubic maters per 3-hour period. Mean density subjected to random perturbations. I Mean Density of Organisms z l00.01 1000 cubic meters Val u. of c*-squ.red= 1.00 I 100 Volume of Sample in cubic meters
----------------~-----------------------------~-----------
150 200 300 I TRUE WEEKLY TOTAL TRUE STD DEV 559434.61 81301.03 559434.61 80145.99 559434.61 79562.18 559434.61 78974.05 I TRUE COEFF. OF VARIATION 14.53 14.32 14.22 14.11 I WEEKLY TOTAL (one sample) STANDARD DEV. 557310.00 80732.71 561313.33 79212.63 564605.00 79419.88 556296.67 78420.95 (on. sample) I COEFF. OF VAR. (one sample) 14.30 13.93 13.96 13.96 I WEEKL.Y TOTAL (two sample) 563030.00 553433.33 567392.50 555438.33 STANDARD DEV. 81961.45 78821.66 80738.62 78167.78 (csq known) I STANDARD DEV. (csq estimated) 80692.13 78306.72 80137.64 79473.77 I COEFF. OF VAR. (csq known) 14.49 14.15 14.15 14.00 COEFF. OF VAR. 14.20 14.01 13.95 14.15 I (csq estimated) ESTIMATE OF C-SIiIUARED
.99 1.04 1.03 1.13 I The value given is the average over 100 simulations.
I I I
I 6.80 I I Table 6.12 Simulation of Sampling for One Week with Uniform Variation of Density About Level of 250 Organisms/lOOO m3 I SIMULATION OF SAMPLING FOR ONE WEEK- FIFTY SIX 3-HOUR PERIODS Charlnel volume=100,000 cubic meters per 3-hour period. I
"'lean densi ty subjected to random perturbations.
Mean Density of Organisms=250.01 1000 cubic meters Value of c-squared=O.OO I I Volume of Samcle in cubic meters 100 150 200 300 TRUE WEEKLY 1405554.17 1405554.17 1405554.17 1405554.17 TOTAL TRUE STD DEV 37490.72 30611.05 26509.94 21645.28 I TRUE COEFF. 2.67 2.18 1.89 1.54 OF VARIATION WEEKLY TOTAL 1408470.00 1402053.33 1401735.00 1402340.00 (one sample) STANDARD DEV. 143535.19 141176.09 140394.88 139458.59 I (one sample) COEFF. OF VAR. (one sample) 10.19 10.06 10.01 9.94 I WEEKLY TOTAL 1405470.00 1404433.33 1402070.00 1403565.00 (two sample) STANDARD DEV. 37489.60 30598.84 26477.07 21629.96 I (csq known) STANDARD DEV. (csq estimated) 39355.05 31744.05 27411.95 22413.77 I I COEFF. OF VAR. 2.67 2.18 1. 89 1. 54 (csq known) COEFF. OF ',JAR. 2.80 2c26 1. 95 1. 60 (csq estimated) ESTIMATE OF C-SQUARED 0.00 0.00 0.00 0.00 I The value given is the average over 100 simulations. I I I I
I 6.81 I I I Table 6.12 (continued) I SIMULATION OF SAMPLING FOR ONE WEEK- FIFTY. SIX 3-HOUR PERIODS Mean density subjected to random perturbations. I Mean Density of OrganismR-2~0.01 1000 cubic meters Value of c-squared= .03 I 100 Volume of Sample in cubic meters 1~0 200 300 TRUE WEEKLY 140~~~4.17 140~~4.17 140~554.17 140~~~4.17 I TOTAL TRUE STD DEV ~7~69.0~ ~3344.89 51102.04 467~6.12 I TRUE COEFF. OF VARIATION 4.10 3.79 3.64 3.47 WEEKLY TOTAL 1395145.00 I 1407~30.00 1408980.00 1404303.33 (one sample) STANDARD DEV. 146672.77 144916.4~ 142770.69 142992.51 (one sample) I COEFF. OF VAR. (one sample) 10.41 10.26 10.22 10.17 I WEEKLY TOTAL (two sample) STANDARD DEV.
*1412855.00 ~7809.05 1404743.33 53264.93 1398350.00 50857.15 1402126.67 48615.63 (csq known)
I STANDARD DEV. (csq estimated)
~8:Z34.:Z3 ~16:Zb.4b 50359.b3 49341.63 I COEFF. OF VAR.
(csq known) COEFF. OF VAR. 4.09 4.09 3.79 3.66 3.64 3.58 3.47
- .50 (c:eq estim.-ted)
I ESTIMATE OF C-SQUARED
.05 .05 .05
- Oe-
;;;J I The value given is the average over 101) simulatl.ons.
I I I
I 6.82 I I Table 6.12 (continued) I I SIMULATION OF SAMPLING FOR ONE WEEK- FIFTY SIX 3-HOUR PERIODS Channel volume=100,OOO c:ubic: meters per 3-hour period. M'ean densi tv subjec:ted to random perturbations. Mean Density of Organisms=250.01 1000 c:ubic: meters Value of c:-squared= .10 I 100 Volume of Sample in c:ubic: meters 150 200 300 I TRUE WEEKLY 1405554.17 1405554<117 1405554.17 1405554.17 TOTAL TRUE STD DEV 72269.20 68951.57 67231. 39 65466.03 I TRUE COEFF. OF VARIATION 5.14 4.90 4.78 4.66 I WEEKLY TOTAL 1412490.00 1409786.67 1403330.00 1412233.33 (one sample) STANDARD DEV. 150525.78 148492.89 147267.21 147454.96 I (one sample) COEFF. OF VAR. (one sample) 10.64 10.51 10.47 10.42 I WEEKLY TOTAL (two sample) STANDARD DEV. 1410560.00 72501.79 1405913.33 69041.40 1409927.50 67516.61 1410490.00 65784.03 I (c:sq known) STANDARD DEV. (c:sc estimated) 73367.64 68759.19 66931.73 65684.96 I COEFF. OF VAR. ( c:sc known) COEFF. OF VAR. 5.14 5.17 4.91 4.85 4.78 4.71 4.66 4.63 I ( c:sq estimated) ESTIMATE OF C-SGUARED
.10 .10 .10 .10 I The value given is the average over 100 simulations. I I
I I
I 6.83 I I I Table 6.12 (continued) I SIMULATION OF SAMPLING FOR ONE WEEK- FIFTY SIX 3-HOUR PERIODS Channal voluma-l00,OOO cubic matars per 3-hour period. I Maan dansity subjected to random perturbations. Maan Density of Organisms=250.0/ 1000 cubic meters Value of c-squared~ .33 I 100 Volume of Sample in 130 cubi~ 200 maters 300 I TRUE WEEKLY TOTAL TRUE STD DEV 1405334.17 118868.43 1403334.17 116881.08 1403534.17 113874.62 1403534.17 1148:59.34 I TRUE COEFF. OF VARIATION 8.45 8.31 8.24 8.17 I WEEKLY TOTAL (on. sample> 1418790.00 1404733.33 1417380.00 1385730.00 STANDARD DEV. 165870.01 163093.67 162878.51 158533.19 (one sample) I COEFF. OF VAR. (one sample> 11.63 11.33 11.44 11.40 I WEEKLY TOTAL (two sample) 1407465.00 1414450.00 1403277.50 1404611.67 STANDARD DEV. 118941.64 117464.90 115287.60 115182.13 I (csq known) STANDARD DEV. (csq e.timated) 118489.94 118640.94 114843.90 115621.25 I COEFF. OF VAR.* (csq known) 8.43 8.28 8.20 8.19 COEFF. OF VAR. 8.36 8.31 8.10 8.16 I (csq est1mated) ESTIMATE OF C-SQUARED
.33 .35 .34 .34 I The valulil given is the average over 100 simulations.
I I I
I 6.84 I I Table 6.12 (continued) I SIMULATION OF SAMPLING FOR ONE WEEK- FIFTY SIX 3-HOUR PERIODS Channel volume=100,OOO cubic meters per 3-hour period. I Mean density subjected to random perturbations. Mean Density 0+ Organisms=250.01 1000 cubic meters Value of c-squared= .50 I Volume of Sample in cubic meters 100 150 200 300 I TRUE WEEI<LY 1405554.17 1405554.17 1405554.17 1405554.17 TOTAL TRUE STD DEV 143150.16 141504.25 140674.06 139838.96 I TRUE COEFF. OF VARI~TION 10.18 10.06 10.00 9.94 I WEEKLY TOTAL 1426410.00 1410980.00 1412735.00 1427950.00 (one sample) STANDARD DEV. 175567.37 171628.79 172980.22 176234.23 I (one sample) COEFF. OF VAR. (one sample) 12.23 12.08 12.17 12.23 I WEEKLY TOTAL 1426480.00 1407846.67 1407460.00 1417908.33 (two sample) STANDARD DEV. 145927.87 141649.88 141044.85 141878.29 I (csq known) STANDARD DEV. (csq estimated) 143985.59 141590.35 139916.56 143206.88 I 9.99 9.96 I COEFF. OF VAR. 10.19 10.03 (csq known) COEFF. OF 'JAR. 9.99 9.94 9.87 10.01 (CSq estimated) ESTIMATE OF C-SQUARED
.50 .50 .,50 * =~ ...J~
I The value given is the average ~ver 100 simulations. I I I I ,** "-,,,= _ _ ._11=
I 6.85 I I I Table 6.12 (cont;nued) I SIMULATION OF SAMPLING FOR ONE WEEK- FIFTY SIX 3-HOUR PERIODS Channel volume-l00,000 cubic metal"'. per 3-hour p.riod. Mean density subjected to random perturbations. I Mean Density o~ Organisms-250.0/ 1000 cubic meters Value of c-squared=1.00 I 100 Volume o~ 1~0 Sample in cubic meters 200 300 TRUE WEEKLY 1405~~4.17 140~5~4.17 140~~54.17 1':~055~4. 17 I TOTAL TRUE STD DEV 198943.17 197762.15 197168.98 196~74.03 I TRUE COEFF. OF VARIATION 14.15 14.06 14.02 13.98 I WEEKLY TOTAL 1418480.00 1417106.67 1382510.00 14221~6.67 (on. sample) STANDARD DEV. 204112.15 198366.22 193167.41 198437.87 (on. sample) Ii COEFF. OF VAR. 14.10 13.87 13.74 13.70 (one sample) WEEKLY TOTAL 1420615.00 1402173.33 1408890.00 14~8013.33 (two sample) STANDARD DEV. 200883.77 195432.32 199019.10 200637.73 (cs" known) STANDARD DEV. 201967.56 197644.80 199128.19 197043.70 (esC! estimatad) CDEFF. OF VAR. 14.00 13.87 14.00 13.95 (esC! known) COEFF. OF VAR. 13.95 13.97 13.90 13.55 (esq estimated) ESTIMATE OF 1.06 1.09 1.04 .98 C-SQUARED The value given is the average over 100 simulations.
!l'_PWW*._'=_
I 6.86 I Table 6.13 Simulation of Sampling for Five Days with Uniform I Variation of Density About Level of 10 Organisms/IOOO m3 SIMULATION OF SAMPLING FOR FIVE DAYS - FORTY 3-HOUR PERIODS I Channel volume=100,OOO cubic meters p2r 3-hour period. Mean density subjected to random perturb~tions. I Mean Density of Organisms= 10.01 1000 cubic meters Value of c-sGuared=O.OO Volume of Sample in cubic meters I TRUE SEASON 100 40039.39 150 40039.39 200 40039.39 300 40039.39 I TOTAL TRUE STD DEV 6327.67 5166.52 4474. :3A 3653.28 I TRUE COEFF. 15.82 12 .. 92 11.18 9.13 OF VARIATION SEASON TOTAL 38770.00 39866.67 40435.00 40276.67 (one sample) STANDARD DEV. 7681.15 6918.25 6500.77 5944.00 (one sample) COEFF. OF VAR. 19.86 17.36 16.07 14.74 (one sample) SEASON TOTAL 39365.00 40140.00 40020.00 40151.67 (two sample) STANDARD DEV. 6274.15 5173.01 4473.25 3658.40. (csq known) STANDARD DEV. 6518.05 5366.98 4670.92 38:::0.80 (csq estimated) COEFF. OF VAR. 15.99 12.93 11. 21 c;-.13 (csq known) COEFF. OF VAR. 16.58 13.40 11.69 9.54 (csq estimated) ESTIMATE OF .08 .05 .05 .03 C-SQUARED The value given is the average over 100 simulatIons. __ ._1...... _. __...
I I 6.87 I Table 6.13 continued I SIMULATION OF SAMPLING FOR FIVE DAYS - FORTY 3-HOUR PERIODS Channel volume=100,000 cubic meters per 3-hour period. I Mean density subjected to random perturbatlons.
~ean Density of Organisms= 10.01 1000 cubic meters Value of c-squared= .05 I Volume of Sample in cubic meter"s I TRUE SEASON 100 150 200 300 40039.39 40039.39 40039.39 40039.39 TOTAL I TRUE STD DEV 6496.92 5372.47 4710.65 3939.16 I TRUE COEFF.
OF VARIATION 16.24 13.43 11.77 9.84 SEASON TOTAL 39860.00 39953.33 40670.00 39893.33 I (one sample) STANDARD DEV. 7921.49 7001.90 6621.56 6011. 01 (one sample) I COEFF. OF VAR. (one sample) 19.89 17.54 16.25 15.03 I SEASON TOTAL (two sample) 40715.00 40010.00 40360.00 40188.33 STANDARD DEV. 6552.78 5369.79 4730.16 3948.13 I (csq known) STANDARD DEV. 6795.22 5584.70 4887.74 4080.65 (csq estimated) I COEFF. OF VAR. (csq known) 16.17 13.48 11.76 9.84 I COEFF. OF VAR. (csq estimated) 16.72 13.97 12.11 10.11 ESTIMATE OF .15 .12 .09 .08 I C-SQUARED The value given is the average over 100 simulations. I I I
I 6.88 I Table 6.13 continued I SIMULATION OF SAMPLING FOR FIVE DAYS - FORTY 3-HOUR PERIODS Channel volume=100,OOO cubic meters per 3-hour period. I Mean density subjected to random perturbations. Mean Density of Organisms= 10.01 1000 cubic meters Value of c-squared= .10 I Volume of Sample in cubic meters I 100 150 200 300 TRUE SEASON TOTAL 40039.39 40039.39 40(139.39 40039.3'7 I TRUE STO DEV 6661.86 5570.81 4935.66 4205 .. 65 I TRUE COEFF. 16.65 13.92 12.33 10.51 OF VARIATION
~3EASON TOTAL 39360.00 40413.33 39755.00 40620.00 I
(one sample) STANDARD DEV. (one sample) 7886.70 7165.66 6605.11 6178.37 I COEFF. OF VAR. (one sample) 20.04 17.73 16.59 15.16 I SEASON TOTAL 39825.00 40540.00 39617.50 40766.67
':two sample)
STANDARD DEV. 6635.96 5607.69 4900.63 4249.72 I (csq known) STANDARD DE\,'. ( c:sq estimated) 6768.68 5782.73 4969.28 4357.75 I COEFF. OF VAR. (csq known) 16.73 13.88 12 .. 42 10.44 I C:OEFF. OF VAR. 17.00 14.26 12 .. 52 10.61 (csq estimated) ESTIMATE OF .17 .17 .12 1':' I (-SQUARED The value given is the average over 100 simulat~ons. I I I I
I I 6.89 I Table 6.13 continued I SIMULATION OF SAMPLING FOR FIVE DAYS - FORTY 3--HOUR FERIODS Channel volume=100,000 cubic meters per 3-hour period. I Mean density subjected to random perturb&tions. Mean Density of Organisms= 10.01 1000 cubic meters Value of c-squared= .33 I Volume of Sample in cubic meter's I TRUE SEASON 100 40039.39 150 40039.39 200 40039.39 300 40039.39 TOTAL TRUE STD DEV 7383.04 6415.83 5872.79 5274.13 I TRUE COEFF. OF VARIATION 18.45 16.03 14.67 13.17 SEASON TOTAL 38900.00 40573.33 40195.00 40333.33 I (one sample) STANDARD DEV. 8085.17 7492.22 7030.65 6565.65 (one sample) I COEFF. OF VAR. (one sample) 20.77 18.43 17.44 16.21 I SEASON TOTAL (two sample) 39970.00 40096.67 39910.00 40743.33 STANDARD DEV. 7352.06 6410.47 5864.08 5332.01 I (csq known) STANDARD DEV. 7353.23 6407.81 5919.25 5381.24 (csq estimated) I COEFF. OF VAR. (csq known) 18.43 16.02 14.70 13.08 I COEFF. OF ',JAR. (csq estimated)
'18.33 15.90 14.71 13.08 ESTIMATE OF .38 .36 .38 .36 I C-SQUARED The value given is the average over 100 simulations.
I I I
I I 6.90 Table 6.13 continued I SIMULATION OF SAMPLING FOR FIVE DAYS - FORTY 3-HDUR PERIODS Channel volume=100,000 cubic meters per 3-hour period. Mean density subjected to random perturbations. I Mean Density of Organisms= 10.01 1000 cubic meters Value of c-squared= .50 Volume of Sample in cubic meters I TRUE SEASON 100 40039.39 150 40039.39 200 40039.39 300 40039.39 I TOTAL TRUE STD DEV 7857.77 6956.87 6459.48 5920.44 I TRUE COEFF. OF VARIATION 19.63 17.38 16.13 14.78 I SEASON TOTAL 40010.00 40573.33 40490.00 40166.67 (one sample) STANDARD DEV. 8410.11 7763.73 7267.05 6767.57 I (one sample) COEFF. OF 'JAR. (.:Jne sample) 21.00 19.09 17.85 16.73 I SEASON TOTAL (two sample) 40340.00 40183.33 40527.50 40523.33 I STANDARD DEV. 7880.23 6964.58 6525.20 5944.58 (csq known) STANDARD DEV. 7811. 53 6983.63 6590.62 5992.12 I (csq estimated) C()EFF. OF VAR. (csq known) 19.54 17.34 16.08 14.63 I CClEFF. OF '.,IAR. ~cs.q ~stimated) 19.22 17.24 16.10 14.60 I ESTIMATE OF . 54 .56 .56 c"o=-
.-.J-.J C-SQUARED I
The value given is the average over 100 simulations. I I I
I I 6.91 I Table 6.13 continued I SIMULATION OF SAMPLING FOR FIVE DAYS - FORTY 3-HOUR PERIODS Channel volume=100,OOO cubic meters per 3-hour period. I Mean density subjected to random perturbations. Mean Density of Organisms= 10.01 1000 cubic meters Value o~ c-squared=1.00 I Volume of Sample in cubic meter"s 100 150 200 300 I TRUE SEASON TOTAL 40039.39 40039.39 40039.39 . 40039.39 I TRUE STD DEV 9135.08 8372.77 7964.27 7533.71 TRUE COEFF. 22.82 20.91 19.89 18.81 I OF VARIATION SEASON TOTAL 39540.00 40646.67 40110.00 39516.67 (one sample) I STANDARD DEV. (one sample) 8943.15 8422.98 8035.23 7375.03 I COEFF. OF VAR. (one sample) 22.64 20.50 19.73 1B.47 SEASON TOTAL 40370.00 40160.00 40962.50 39860.00 I (two sample) STANDARD DEV. 9196.19 8418.23 8124.58 7441. 87 (csq known) I STANDARD DEV. (csSl estimated) 9140.57 8544.00 8228.53 7441.62 I COEFF. OF VAR. (csq known) 22.75 20.86 19.73 18.55 COEFF. OF VAR. 22.54 20.89 19.77 18.43 I (csq estimated) ESTIMATE OF 1.21 1.21 1. 19 1. 11 C-SQUARED I The value given is the average over 100 si mul at.i on5. I I I
D I 6.92 Table 6.14 Simulation of Sampling for Five Days with Uniform Yariation of Density About Level of 50 Organisms/lOOO m3 I S IMULA~" ION or SAMPLING FOF: FIVE CAYS - FC:'<TY 3-i-IOUR PERl DOS Channel volume=100,OOO cubic ffietEr3 per 3~hour perlOd. Mean density subjected to random perturbations. Mean Density of Organisms= 50.01 1000 cubic meters ValUE 0f =-squared=O.OO I Volume of Sample in cubic meters I 100 150 200 300 TRUE SEASON TOTAL 200835.19 200835.19 200835.19 200835.19 I TRUE STn DEV 14171.63 11571.09 I:j020.86 8182.00 I TF:UE COEFF. 7.06 5 .. 77 4.99 4.08 OF VARIATION SEASm~ TOTAL 201960.00 200826.67 201290.00 20(:846.07 I (one sample) STANDARD DEV. (one sample) 27468.53 26079.54 25444.89 24657.97 I COEFF. OF VAR. (one sample) 13.57 12.97 12.62 12.26 I SEASON TOTAL 201415.00 200893.33 200575. (1(: ::00431.67 (two sample) STANDARD DEV. 14192.08 11572.76 10014.36 817-:!.... 77 I (csq known) STANDARD DEV. (c:sq estimated) 14896.14 12046.58 10239.51 8410.87 I I COEFF. OF VAR. 7.06 5.77 5.00 4.08 (csq known) COEFF. OF VAR. 7.39 5.99 5. 11 4.20 (csq estimated) ESTIMATE OF .02 .01 0.00 0.00
-I C-SQUARED The value given is the average over 100 simulations.
I I I I
I I 6.93 I Table 6.14 continued I SIMULATION OF SAMPLING FOR FIVE DAYS - FORTY 3-HOUR PER!OOS Channel volume=100,000 cubic meter-s per 3-hour pe,-ioCi. I Nean density subjected to random perturbatione. Mean Density of Organisms= 50.0/ 1000 cubic metel-s Value of c-squar~d= .05 I Volume of Sample in cubic 200 met~-s 300 100 150 I TRUE SEASON TOTAL 200835.19 200835.19 200835.19 200835.19 12449.46 11023.45 I TRUE STD DEV 15981.45 13728.13 TRUE COEFF. 7.96 6.84 6.20 5.49 OF VARIATION I SEASON TOTAL (one sample) 200960.00 200720.00 200655.00 200566.67 I STANDARD DEV. (one sample) 27703.61 26473.13 25845.65 25182.60 I COEFF. OF VAR. 13.75 13.16 12.85 12.54 (one sample) SEASON TOTAL 200160.00 200293.33 200440.00 199946.67 (two sample) I STANDARD DEV. (csq known) 15955.22 13709.36 12443.49 10987.15 I STANDARD DEV. (csq estimated) 15446.36 13214.64 11985.62 10531.22 COEFF. OF VAR. 7.98 6.85 6.21 5.50 I (csq known) COEFF. OF VAR. 7.72 6.57 5.97 5.23 (csq estimated) I ESTIMATE OF C-SQUARED
.04 .04 .04 .04 I The value given is the average over 100 simulations.
I I I
I 6.94 D Table 6.14 continued I SIMULATION OF SAMPLING FOR FIVE DAYS - FOF:TY 3-HOUF< F'ERI:JDS Channel volume=100,OOC cubic meters Der 3-hour period. Mean density subjected to random perturbations. M2an Density of Organisms= 50.0/ 1000 cubic meters \/al Lie <J f- .. 1 (/ I 100 Volume of Sample in 150
~ubic 200 meters 300 I
TRUE SEASON TOTAL 200835.19 200835.19 200835.19 200835.19 I TRUE STD DEI,,' 17606.19 15589.52 14476.20 TRUE COEFF. 8.77 7.76 7.21 6.61 I OF VARIATION SEASON TOTAL (one sample) 201540000 201333.33 200840.00 201946.67 I STANDARD DEV. (or,e sar,lple) 28211.52 27066.09 26385 .. 65 25930.38 I CDEFF. OF VAR. 1::'.* 96 13.40 13.11 12.80 (one sample) SEASON TOTAL 200160.00 200590.00 20030:'.50 200683.33 I (two sample) STANDARD DEV. (csq known) 17585~29 15585.94 14471.51 13286.31 I STANDARD DEV. (csq estimated) 16863.87 14960.61 13927.18 12818.87 I C[)EFF. OF VAR. 8079 7.77 7.22 6.62 (csq known) CDEFF. OF VAR. 8.41 7.41 6.91 6.34 I (csq estimated) ESTIMATE OF C--SQUARED
.03 .08 .09 .09 I The value given is the average over 100 simulations. I I
I I
I I 6.95 I Table 6.14 continued SIMULATION OF SAMPLING FOR FIVE DAYS - FORTY 3-HOUR PERIODS I Ch~nnel volume=100,000 cubic meters per 3-hour period. Mean densi ty subjected to random pel~turbati on5. I Mean Density of Organisms= 50.01 1000 cubic meters Value of c-s-:juared= I 100 Volume of Sample in cubic meters 150 200 300 I
-----------------------------------------.--.-~--.--.-- ----------
TRUE SEASON 200835.19 200835.19 200835.19 200835.19 TOTAL TRUE STD DEV 23762.18 22309.10 21545.85 20754.54 I TRUE COEFF. 11.83 11.11 10.72 10.33 OF VARIATION I SEASON TOTAL (one sample) 199620.00 200200.00 200205.00 200660.00 I STANDARD DEV. (one sample) 30056.95 28960.51 28499.74 28018.90 COEFF. OF VAR. 14.97 14.36 14.15 13.84 I (one sample) SEASON TOTAL 199020.00 198896.67 199505.00 199721. 67 (two sample) I STANDARD DEV. (csq known) 23683.62 22122.55 21503.17 20774.67 I STANDARD DEV. (c:sq estimated) 22764.45 21339.84 20687.07 19843.12 COEFF. OF VAR. 11.87 11.09 10.75 10.35 I (c:sq known) COEFF. OF VAR. 11.38 10.62 10.28 9.84 (csq estimated) I ESTIMATE OF C-SQUARED
.30 .31 .30 .30 I The value given is the average over 100 simulations.
I I I
I 6.96 u Table 6.14 continued I SIMULATION OF SAMPLING FOR FIVE DAYS - FORTY 3-HOUR PERIODS Channel volume=100,000 cubic meters per 3-nour period. I density subjected to random perturbatjons. I Me~n Mean Density of Organisms= 50.01 1000 cubic meters ValUE cf c*-~.auarEd= . SO 100 Volume of Sample in cubic meters 150 200 I TF:UE SEASON TOTAL 200835.19 200835.19 200835.1 c1 200835.19 I TRUE STD DEV 26069.23 25419.13 24751.96 I 27323.06 TRUE COEFF. 13.60 12.98 12.65 12.32 I OF VARIATION SEASON TOTAL 198490.00 199226.67 199805.00 199390.00 (one sample) STANDARD DEV. (one sample) 31208.49 30456.82 29744.16 29281.56 I COEFF. OF VAR. 15.59 15.10 14.73 14.51 (one sample) SEASON TOTAL 197855.00 198226.67 199372.50 199200.00 I (two sample) STANDARD DEV. C=sq known) 27090.35 26015.26 25422.06 24753.42 I STANDARD DEV. 26026.81 24925.04 23997.32 23715.35 (csq estimated) CDEFF. OF VAR. 13.62 13.03 12.67 1.2.32 (csq known) C()EFF. OF VAR. 13.03 12.42 11.92 11.79 I (csq estimated) ESTIMATE OF C--SQUARED
.46 .46 .45 .47 I The value given is the average over 100 simulations. I I
I I
I 6.97 Table 6.14 continued I SIMULATION OF SAMPLING FOR FIVE DAYS - FORTY 3-HOUR PERIODS I Channel volume=100,000 cubic meters per 3-hour period. Mean density subjected to random perturbations. I ~1ean DenSity of O'-ganisms= 50.0/ 1000 cubic metErs Value of c-sQuared=1.00 Volume of Sample in cubic meter~ I 100 150 200 3(;0 TRUE SEASON 200835.19 200835.19 200835.19 200835.19 I TOTAL TRUE STD DEV 35948.07 35004.56 34523.13 34034.89 I TRUE COEFF. or VARIATION 17.89 17.42 17.18 16.94 I SEASON TOTAL (one sample) 197930.00 198920.00 197995.00 195540.00 STANDARD DEV. 35003.00 34204.09 33785.17 32558.85 I (one sample) COEFF. OF VAR. 17.29 16.85 16.74 16.32 (one sample) I SEASON TOTAL (two sample) 198070.00 198570.00 197912.50 196275.00 I STANDARD DEV. (csq known) 36138.30 35496.47 34635.09 33702.18 STANDARD DEV. 34486.08 33253.14 32741.81 32262.01 I (csq estimated) COEFF. OF VAR. 17.99 17.55 17.25 16.91 (csq known) I COEFF. OF VAR. (csq estimated) 17.07 16.41 16.35 16.11 I ESTIMATE OF C-SQUARED
.96 .91 .96 .96 I The value given is the average over 100 simulations.
I I I
I I 6.98 Table 6.15 Simulation of Sampling for Five Days with Uniform I Jariation of Density About Level of 100 Organisms/lOOO m3 SIMULATION OF SAMPLING FOR FIVE DAYS - FORTY 3-HOUR PERIODS Channel volume=100,OOO cubic meters per 3-hour perlod. I Mean density subjected to random perturbations. Mean Density of Drganisms=100.01 1000 cubic meters Val ue of c--sqL,ared=C.OO I Volume of Sample in cubic met2r~ I 100 150 20(J 30(: TRUE SEASON 39904*6.10 399046.10 399046.10 399046.10 TOTAL
-rRUE STD DE'J 19976.14 16310.45 14125.26 I
TRUE COEFF. 5.01 4.09 3 .. 54 2.8':;> OF VARIATION SEASON TOTAL 400980.00 400360.00 400125.00 399943.33 I (*one sample) STANDARD DEV. (<:>ne sample) 50707.59 49425.77 48678.18 47931.8:':;: I C[JEFF. OF VAR.
<<)ne sample) 12.63 12.33 12.15 11.97 I
SEASON TOTAL 398265.00 400033.33 399120.00 399801.67 (two sample) STANDARD DEV. 19956.58 16330.61 14126.57 11544. 14 I (c:sq known) STANDARD DEV. (c~;q estimated) 20812.14 17097.63 14935.36 12048.07 I CDEFF. OF VAR. (csq known) 5.02 4.09 3.54 .2.89 I COEFF. OF VAR. IC'" ~,....
...J.L'" 4.27 -3 .. 74 3.01 (c=q estimated)
ESTIMATE OF .01 .01 .01 O.OU I C-SQUARED The value given is the average over 100 simulations. I I I I
I I 6.99 Table 6.15 continued I SIMULATION OF SAMPLING FOR FIVE DAYS - FORTY 3-HOUR PERIODS I Channel volume=100,000 cubic meters per 3-hour period. Mean density subjected to random perturbations. I Mean Density of Organisms=100.0/ 1000 cubic meters Value of c-squareO= .05 Volume of Sample in cubic meters I 100 150 200 30C TRUE SEASON 399046.10 399046.10 399046.10 399046.10 I TOTAL TRUE STD DEV 24800.23 21955.32 20384.51 l8682.10 I TRUE COEFF. OF VARIATION 6.22 5.50 5.11 4.68 I SEASON TOTAL (one sample) 398080.00 399860.00 397640.00 397100.00 STANDARD DEV. 51571.31 50557.16 49495.03 40805.71 I (one sampl e) COEFF. OF VAR. 12.92 12.62 12.42 12.27 (one sample) I SEASON TOTAL (two sample) 398745.00 399946.67 397607.50 398998.33 I STANDARD DEV. (csq known) 24787.13 22017.51 20345.10 18667.23 STANDARD DEV. 24722.16 21862.19 20171.42 18834.73 I (csq estimated) COEFF. OF VAR *. 6.22 5.51 ..,.
..J * .1...:.. 4.68 (csq known)
I COEFF. OF VAR. (csq estimated) 6.16 5.42 5.03 4.69 I ESTIMATE OF C-SQUARED
.05 .05
- 0"" oJ .05 I The value given is the average over 100 simulations.
I I I
I' 6.100 I Table 6.15 continued I SIMULATION OF SAMPLING FOR FIVE DAYS - FORTY 3-HDUR PERIODS Channel volume=100,000 cubic meters per 3-hour period. I Mean density subjected to random perturbations. Mean Density of Organisms=100.01 1000 cubic meters Value 0": c--squared= .. 10 I 100 Volume of Sample in cubic meter--s 150 20G 300 I Tf'<UE SEASON 399046.10 399046.10 ::9904,S.1(1 399(:46. 10 TOTAL n~UE SiD DE\} 28828.05 26420.47 2513G.33 23770.28 n~uc COEFF. -r
/ .. ...:... ..... ~..,
l,.62 6.,30 5.95 I OF VAF\IATIDrl SEASON TOTAL (one sample) 396190.00 399186.67 3'?5440.0,-) 396826.67 I STANDARD DEV. (one saollFl e) 52367. 17 51703.94 50329.79 49862.48 I CDEFF. OF VAR. 13.17 12.91 12.69 12.53 (cone sample) SEASON TOTAL 398565.00 401143.33 398892.50 398463.33 I (twc, sample) STANDARD DEV. (csq known) 28827.93 26565 .. 62 25155.47 2372~.50 I STANDARD DEV.
- csq estimated) 28968.34 27245.18 25129.84 23893.84 I
COEFF. OF VAR. 7.23 6.62 6.30 5.95 (csq known) COEFF. OF VAR .. 7. 19 6.74 6.25 5.94 I (csq estimated) EST l11ATE OF C-SQUAF:ED
.10
- 11 10 .10 I The value given is the average over 100 simulations. I I
I I
I I 6.101 I Table 6.15 continued SIMULATION OF SAI"IPLING FOR FIVE DAYS - FORTY 3 -HOUf: F'EFdODS I Channel volume::;100,000 cubic metel-S Pi;t- 3-h..:>w- pet-iod. Mean density subjected to random perturbatior.s. I Mean Density of Organisms=100.01 1000 cubic meters Value of c-squared= I 100 Volume of Sample in cubic: meters 150 200 300 I TRUE SEASON TOTAL 399046.10 399046.10 399046.10 399046.10 TRUE STD DEV 42884.35 41304.38 40491.28 39661.52 I TRUE COEFF. 10.74 10.35 10.14 9.93 OF VARIATION I SEASON TOTAL Cone sample) 397340.00 397113.33 394735.00 393103.33 I STANDARD DEV.
<one sample) 57526.43 55857.08 55042.48 54258.63 COEFF. OF VAR. 14.35 13.98 13.81 13.70 I (one sample)
SEASON TOTAL 401935.00 397653.33 398757.~0 397326.67 (two sample) I STANDARD DEV. (csq known) 43245.77 41158.12 40349.64 39580.40 I STANDARD DEV. (csq estimated) 43689.76 40915.25 41083.04 40126.77 COEFF. OF VAR. 10.72 10.32 10.08 9.94 I (csQ known) COEFF. OF VAR. 10.75 10.15 10.22 9.94 (csq estimated) I ESTIMATE OF C-SQUARED
.35 ............. ~-;r .36 .35 I The value given is the average over 100 ;;imulations.
I I
I I 6.102 Table 6.15 continued I SIMULATlcr~ OF ~;Af'lF'LING FOF~ }=!VE DA'VS - FORT\{ .:'-HCUF~ r:EF~';:ODS I Mean dens1ty subjected to random Mean Density of Organisms=100.01 1000 cubic pertur~atl0ns. met~rs I Volume of Sample in cubic mEters I 100 150 200 300 TRUE SEASON TOTi);L 399046.10 399046.10 399046.1;) 39'7C46.10 I TRUE STD DE'*j 50587.54 49255.29 48575.47 47885.99 I TF,UE COEFF. 12.67 12.34 1::.16 11.99 DF I,/ARIATICN SEASON TOTAL 3942.30.00 392706.67 401140.00 39231::::,.33 I (one sample) STANJ::ARD DEV. (onE sample) 59382.57 58349.86 59564.86 572;)2.18 I COEFF. OF VAR. (one sample) 14.91. 14.70 14.70 14.42 I SEASON TOTAL 398820.00 396826.67 403925.00 398343.33 (two sample) STANDARD DEV. 50148.C.6 49035.03 49346.95 47970.75 I (csq known) STANDARD DEV. ~csq estimated) 51253.68 49317.66 50056.27 47909.08 I COEFF. OF VAR. (csq known) 12.51 12.29 12.17 11.98 I COEFF. OF VAR. 12.67 12.19 1::.27 11.81 (csq estimated) ESTIMATE OF .. 54 .54 .50 I C-SQUARED lhe value given is the average over 100 simulations. I I o _1.1_._.-.
I I 6.103 I Table 6.15 continued SIMULATION OF SA~lPLING FOR FIVE DAYS - FORTY ;::,--HOUR FERIODS I Channel valume=100,OOO cubic meters per 3-hour period. Mean density subjected to random perturbations. I Mean Density of Organisms=100.01 1000 cubic meters Valu~ of c-squar8d=1.00 I 100 Volume of Sample in cubjc meter's ISO 200 300 I TRUE SEASON TOTAL 399046.10 399046.10 399046.10 399046.10 TRUE STD DEV 68696.09 67721.02 67228.19 66731. 71 I TRUE COEFF. 17.20 16.96 16.83 16.71 OF VARIATION I SEASON TOTAL (one sample) 394660.00 396013.33 391585.00 396790.00 I STANDARD DEV. (one sample) 68408.19 67443.54 66574.36 66964.38 COEFF. OF VAR. 16.97 16.64 16.64 16.44 I (one sample) SEASON TOTAL 396980.00 402566.67 397277.50 401463.33 (two sample) I STANDARD DEV.
<csC! known) 68393.45 68136.28 67531.19 ,67316./0 I STANDARD DEV.
(csq estimated) 68023.30 68671.29 67986.66 67925.40 COEFF. OF VAR. 17.05 16.74 16.84 16.57 I (esq known) COEFF. OF VAR. 16.78 16.73 16.75 16.64 (c:sC! estimated) I ESTIMATE OF C-SQUARED 1.07 1.10 1.09 1. 12 I The value given is the average over 100 simulatic'ns. I I I
I 6.104 Table 6.16 I Simulation of Sampling for Five Days with Uniform
~'ariation of Density About Level of 250 Organisms/lOOO m3 SIMULATION OF SAMPLING FOR FIVE DAYS - FORTY 3-HOUR PERIODS I
Channel volume=100,OOO cubic meters per 3-hour perioG. ME,an densi ty subjected to raOidom pertL:rbat:i or,s. I Mean Density of Crganisms=250.01 1000 cubic m2ters I Volume of Sampl2 in cub~c meters 100 150 200 .300 I TRUE SEASON 998845.50 998845.50 998845.50 996845.50 TOTAL TRUE STD DEV 31604.52 25804.98 22347.77 18246.88 I TRUE COEFF. OF VARIATION 3.17 2.59 2.24 L83 I SEASON TOTAL (one sample) 998880.00 994953.33 996255.00 998766.67 I STANDARD DEV. 120553.93 118682.81 118158.88 117700.8::; (one sample) CDEFF. OF VAR. 12.06 11.92 11.85 i 1
- 7E:
I (one sample) SEASON TOTAL (two sample) 997860.00 996536.67 997540.00 998346.67 I STAt*JDARD DEV. (csq known) 31588.92 25775.14 22333.16 18242.3:2 I STANDARD DEV. 33214.30 27115.35 23450.75 19171.51 (csq estimated) COEFF. OF VAR. 3.17 2.59 2.24 1. 83 I (csq known) COEFF. OF VAR. ~csq estimated) 3.32 ..., -' J/O- * * .L.. 2 .. 35 1. 92 I ESTIMATE OF C-SQUARED 0.00 0.00 0.00 O.VO I The value given is the average over 100 simulations. I I D I
I ..... I 6.105 I Table 6.16 continued SIMULATION OF SAMPLING FOR FIVE DAYS - FORTY 3-HOUR PERIODS I Channel volume=100,OOO cubic meters per 3-hour period. Mean density subjected to random perturbations. I Mean Density of Organisms=250.0/. 1000 cubic meters Value of c-squared= .05 I 100 Volume of Sample in cubic 150 200 met~-s 30',) I TRUE SEASON TOTAL 998845.50 998845.50 998845.50 99884~.50 TRUE STD DEV 48481.52 44916.69 43023.65 41043.40 I TRUE COEFF. 4.85 4.50 4.31 4.11 OF VARIATION I SEASON TOTAL (one sample) 998850.00 1000066.67 1003355.00 999093.33 I STANDARD DEV. (one sample) 122862.12 121942.10 121849.59 120436.63 COEFF. OF VAR. 12.28 12.17 12.13 12.03 I (one sample) SEASON TOTAL 996390.00 997840.00 1000592.50 995990.00 (two sample) I STANDARD DEV. (csq known) 48394.17 44910.20 43176.69 40922.11 I STANDARD DEV. (c:sq estimated) 46630.14 43371.68 41346.81 39628.88 COEFF. OF liAR. 4.86 4.50 4.31 4.11 I (csq known) COEFF. OF VAR. 4.65 4.32 4.11 3.95 (c:sq estimated) I ESTIMATE OF C-SQUARED
.04 .05 .04 .05 I The value given is the average over 100 simulations.
I I I
I 6.106 I Table 6.16 continued SIMULATION OF SAMPLING FOR FIVE DAYS - FORTY 3-HOUR PERIODS I Channel volume=100,OOO cubic meters per 3-hour MEan density subjected to random oErturb6tions. p~ri0d. I Mean Density of Organisrns=250.01 1000 cubic meters ValUE of c squared= .10 I Volume of Sam~le in cubic meters 100 150 2(1(; I TRUE SEASON 998845.50 998845.50 9'7'8845.50 7'9884=.50 TOTAL TF,UE STD DEV 60844.63 58044.13 56591. "*3 5:5101.47 I TRUE COEFF. (iF '.JAR I ATI ON 6.09 5.81 5.66 5.51 I SEASON TOTAL (one sample) 999380.00 1000153.33 998000.00 997276.67 I STANDARD DEV. 125904.05 124455.08 123164.78 122580.92 (one sample) COEFF. OF VAR. (one sample) 12.58 12.42 12.31 12.26 I SEASON TOTAL (two sa.mple) 994840.00 996506.67 995315.00 994361.67 I STANDARD DEV. (csq known) 60683.21 57954.61 56364.64 54795.81 I STANDAFm DE'J. 58486.32 55597.92 53810.78 529~.5. 81 (csq est.i mated) COEFF. OF VAR. 6.10 5.81 5.66 5.50 I (csq known) COEFF. OF VAR. (csq estimated) 5.84 5.54 5.37 5.28 I I ESTIMATE OF .09 .09 .09 .09 C-SQUARED I The value given is the average over 100 simulations. I I I
6.107 Table 6.16 continued SIMULATION OF SAMPLING FOR FIVE DAYS - FORTY 3-HOUR PERIODS Channel volume=100,000 cubic meters per 3-hour period. Mean density subjected to random perturbations. Mean Density of Organisms=250.01 1000 cubic meters Value of c-sq~ared= .33 Volume of Sample in cubic meters 100 150 200 300 ~
----------------------------- -------- --- - ------~- _._-._--- . . _._-
TRUE SEASON 998845.50 998845.50 998845.50 998845.50 TOTAL TRUE STD DEV 100047.50 98369.47 97519.63 96662.32 Tr:UE COEFF. 10.01 9.84 9.76 9.67 OF VARIATION SEASON TOTAL 995950.00 998273.33 993960.00 992436.67 (one sample) STANDARD DEV. 136048.74 136012.81 134142.44 132913.73 (one sample) COEFF. OF VAR. 13.56 13.54 13.43 13.34 (one sample) SEASON TOTAL 991685.00 993736.67 994160.00 990730.00 (t""o sample) STANDARD DEV. 99629.35 98462.13 97638.23 95923.07 (csq ~mown) STANDARD DEV. 95549.54 94064.79 92982.73 91751. 25 (csq estimated) COEFF. OF VAR. 10.01 9.87 9.79 9.65 (csq known) COEFF. OF VAR. 9.54 9.39 9.2'7 9.18 (csG estimated) ESTIMATE OF .31 .31 .30 .31 C-SQUARED The value given is the average over 100 simUlations.
6.108 Table 6.16 continued SIMULATION OF SAMPLING FOR FIVE DAYS - FORTY 3-HOUR PERIODS Channel volume=100,OOO cubic meters per 3-hour period. Mean density subjected to random pE~turbaticns. Mean Density of Organisms=250.01 1000 cubic meters ValUE of c-squared= .~O Volume of Sample in cubic meters 100 150 20(: 30(: TF,UE SEASON 998345.50 998845.50 998845.50 998845.50 TOTAL TRUE STD DEV 120478.07 119088.27 118387.26 1176E;2.C6 rRUE COC:FF. 12.06 11.92 :i.l. 8~' 1 ~ .. 78 'JF V~RIATION SEASON TOTAL 998810.00 993080.00 994600.00 99445(l.00 (.one sample) STANDARD DEV. 145997.95 143774. e'7 142595.91 141622.41 (one sa.mpIE) COEFF. OF VAR. 14.40 14.26 14.19 14.09 (one sample) SEASON TOTAL 994380.00 989196.67 988282.50 992251.67 (t~.. o sample) STANDARD DEV. 121314.01 119128.08 118054.22 117679.52 (csq known) STANDARD DEV. 115329.74 114273.27 111880.38 112567.26 (csq estimated) COEFF. OF VAR. 12. 11 11.95 11 .. 85 11.78 (csq known) COEFF. OF VAR. 11.43 11.41 11.22 11.21 (csq estimated) ESTIMATE OF .45 .47 .45 .46 C-SQUARED The va.lue glven is the average over 100 simulations.
I I 6.109 I Table 6.16 continued SIMULATION OF SAMPLING FOR FIVE DAYS - FORTY 3-HOUR PERIODS I Channel volume=lOO,OOO cubic meters per 3-hour period. Mean density subjected to random perturb~t10ns. I Mean Density of Organisms=250.01 1000 cubic meters Value of c-squared=1.0U Volume of Sample in cubic meters I 100 150 20(1 300 TRUE SEASON 998845.50 998845.50 998845.50 998845.50 I TOTAL TRUE STD DEV 167424.86 166427.57 165926.68 165424.27 I TRUE COEFF. OF VARIATION 16.75 16.65 16.60 16.55 I SEASON TOTAL (one sample) 989790.00 994713.33 988310.00 980526.67 STANDARD DEV. 163394.55 164043.96 163636.59 159597.23 I (one sample) COEFF. OF VAR. 16.18 16.10 16.12 15.94 (one sample) I SEASON TOTAL (two sample) 992135.00 989896.67 982787.50 979835.00 I STANDARD DEV. (csq known) 168822.33 168865.63 165399.05 164588.06 STANDARD DEV. 160229.16 158598.00 158803.14 154121.89 I (csq estimated) COEFF. OF VAR. 16.82 16.73 16.60 16.56 (csq known) I COEFF. OF VAR. (csq estimated) 15.92 15.79 15.80 15.43 I ESTIMATE OF C-SQUARED
.95 .92 .95 .90 I The value given is the average over 101) simulatioi1s.
I I I
6.110 I Table 6.17 Simulation of Sampling for Three Days with Uniform Variation of Density About Level of 10 Organisms/1000 m3 I SIMULATION OF SAMPLING FOR 3 DAYS - TWENTY-FOUR 3-HOUR PERIODS Channel volume=100,000 cubic meters per 3-hour period. I Mean density subjected to uniform perturbations. Mean Density of Organisms= 10.01 1000 cubic meters Value of c-squared=O.OO I 100 Volume of Sample in cubic meters 150 200 300 I TRUE 3-DAY TOTAL. 24250.09 24250.09 24250.09 24250.09 I TRUE STD DEV 4924.44 4020.79 3482.10 2843.13 TRUE COEFF. 20.33 16.60 14.38 11.74 I (IF VARIATION 3-DAY TOTAL (one sample) 24450.00 24526.67 24160.00 24073.33 I STANDARD DEV. (one sample) 6183.04 5456.09 5015.23 4592.02 I COEFF. OF VAR. 25.33 22.25 20.75 19.05 (one sample) 3-DAY TOTAL 24175.00 23663.33 24200.00 23998.33 I (two sample) STANDARD DEV. (csq known) 4916.81 3971.85 3478.51 2828.33 I STANDARD DEV. (csq estimated) 5280.62 4166.80 3635.93 2982.63 I COEFF. OF VAR. 20.57 16.91 14.45 11.84 (csq known) COEFF. OF VAR. 21.97 17.70 15.08 12.45 I (csq estimated) ESTIMATE OF C-SQUARED
.18 .08 .05 .04 I The value given is the average over 100 simUlations. I I
I I I
I I 6.111 I Table 6.17 (continued) I SIMULATION OF SAMPLING FOR 3 DAYS - TWENTY-FOUR 3-HOUR PERIODS I Channel volume=100,000 cubic meters per 3-hour perlod. Mean density subjected to uniform perturbations. Mean Density of Organisms= 10.01 1000 cubic meters I Value of c-squared c Volume of Sample in cubic meters
.05 I TRUE 3-DAY 100 24250.09 150 24250.09 200 24250.09 300 24250.09 TOTAL I TRUE STD DEV 5057.14 4182.26 3667.37 3067.23 I TRUE COEFF.
OF VARIATION 20.88 17.26 15.14 12.66 3-DAY TOTAL 23540.00 24026.67 25245.00 23603.33 I (one sample) STANDARD DEV. 6043.18 ~421. 77 5305.89 4562.89 (one sample) I COEFF. OF VAR. (one sample) 25.86 22.58 20.91 19.27 I 3-DAY TOTAL (two sample) 23495.00 24126.67 24847.50 23956.67 STANDARD DEV. 4974.09 4169.2~ 3719.78 3044.33 I (csq known)
!3TANDARD DEV. 5177.35 4316.89 3827.37 3126.23 (c:sq estimated)
I COEFF. OF VAR. (csq known) 21.37 17.37 15.04 12.74 I C:OEFF. OF VAR. (c:sq estimated) 22.19 17.93 15.39 13.00 ESTIMATE OF .22 .12 .09 .07 I C:-SQUARED The value given is the average over 100 si mul ati c.ns. I I I
I 6.112 I Table 6.17 (continued) I SIMULATION OF SAMPLING FOR 3 DAYS - TWENTY-FOUR 3-HOUR PERIODS I o Channel volume=100,000 cubic meters per 3-hour period. Mean density subjected to uniform perturbations. Mean Density of Organisms= 10.01 1000 cubic meters Value of c-squared= .10 Volume of Sample in cubic meters I TRUE 3-DAY 100 24250.09 150 24250.09 200 24250.09 300 24250.09 I TOTAL TRUE STD DEV 5186.45 4337.72 3843.72 3276.05 I TRUE COEFF. OF VARIATION 21.41 17.90 15.86 13.52 I 3-DAY TOTAL 24350.00 24520.00 24395.00 24446.67 (one sample) STANDARD DEV. 6246.60 5628.10 5171.32 4828.62 I (one sample) COEFF. OF VAR. (one sample) 25.73 22.96 21.15 19066 I 3-DAY TOTAL (two sample) 24185.00 24726.67 24155.00 24505.00 I STANDARD DEV. 5178.00 4385.93 3834.41 3302.72 (csq known) STANDARD DEV. 5399.54 4544.10 3906.72 3306.48 I (csq estimated) COEFF. OF VAR. (csq known) 21.60 17.81 15.96 13.50 I COEFF. OF VAR. (csq estimated) 22.37 18.37 16.13 13.45 I ESTIMATE OF .24 .18 .14
- 11 C-SQUARED I
The value given is the average over 100 simulations. I iliii iiii Ui i~i'_"'_.i
I I 6.113 Table 6.17 (continued) I I SIMULATION OF SAMPLING FOR 3 DAYS - TWENTY-FOUR 3-HOUR PERIODS Channel volume=100,000 cubic meters per 3-hour period. I Mean density subjected to uniform perturbations. Mean Density of Organisms= 10.0/ 1000 cubic meters Value of c-squared= .33 I Volume of Sample in cubic meter's I TRUE 3-DAY 100 150 200 300 24250.09 24250.09 24250.09 24250.09 TOTAL I TRUE STD DEV 5751. 56 4999.71 4577.71 4112.63 I TRUE COEFF. OF VARIATION 23.73 20.63 18.88 16.96 3-DAY TOTAL 24650.00 24220.00 23590.00 24340.00 I (one sample) STANDARD DEV. 6581. 79 5801.92 5316.01 5102.72 (one sample) I COEFF. OF VAR. (one sample) 26.61 23.91 22.49 20.70 I 3-DAY TOTAL (two sample) 25060.00 24293.33 23902.50 24256.67 STANDARD DEV. 5860.39 4996.25 4518.54 4103.55 I (csq known) STANDARD DEV. 5914.39 5024.38 4533.62 4164.13 (csq estimated) I COEFF. OF VAR. (csq kn()wn) 23.53 20.61 18.91 16.90 I COEFF. OF VAR. (csq estimated) 23.51 20.58 18.79 16.88 ESTIMATE OF .53 .41 .39 .38 I C-SGlUARED The value given is the average over 100 simulations. I I I
I 6.114 I Table 6.17 (continued) I SIMULATION OF SAMPLING FOR 3 DAYS - TWENTY-FOUR 3-HOUR PERIODS I Channel volume=100,000 cubic meters per 3-hour period. Mean density subjected to uniform perturbations. I Mean Density of Organisms= 10.01 1000 cubic meters Value of c-squared= .50 Volume of Sample in cubic meters I TRUE 3-DAY 100 24250.09 150 24250.09 200 24250.09
.300 24250.09 o
TOTAL TRUE STD DEV 6123.38 5423.32 5036.93 4618.33 I TRUE COEFF. OF VARIATION 25.26 22.37 20.77 19.04 I
- -DAY TOTAL 22860.00 23400.00 23845.00 24723.33 (one sample)
STANDARD DEV. 6349.02 5881.42 5561.92 5380.93 I (one sample) COEFF. OF VAR. (one sample) 27.86 24.84 23.16 21.52 I
- ~-DAY TOTAL (two sample) 23520.00 24383.33 24090.00 24370.00 I
STANDARD DEV. 5971.60 5485.54 5008.99 4634.99 (csq known)
~3TANDARD DEV. 5984.98 5483.11 4881.60 4716.17 I
(csq estimated) COEFF. OF VAR. (csq known) 25.56 22.46 20.76 18.94 I COEFF. OF VAR. (csq estimated) 25.22 22.25 20.07 18.92 I ESTIMATE OF .62 .59 .50 .57 C-SQUARED I The value given is the average over 100 simulations. I I I
I I 6.115 Table 6.17 (continued) I I SIMULATION OF SAMPLING FOR 3 DAYS - TWENTY-FOUR 3-HOUR PERIODS Channel volume=100,000 cubic meters per 3-hour period. I Mean density subjected to uniform perturbations. Mean Density of Organisms= 10.01 1000 cubic meters Value of c-squared=1.00 I Volume of Sample in cubic meters I TRUE 3-DAY 100 24250.09 150 24250.09 200 24250.09 300 24250.09 TOTAL I TRUE STD DEV 7123.30 6531.31 6214.21 5880.02 I TRUE COEFF. OF VARIATION 29.38 26.93 25.62 24.23 3-DAY TOTAL 24270.00 24293.33 24750.00 23923.33 I (one sample) STANDARD DEV. 7107.04 6741.58 6296.63 5930.43 (one sample) I COEFF. OF VAR. (one sample) 29.13 26.72 25.05 24.07 I 3-DAY TOTAL (two sample) 24485.00 24296.67 24552.50 24005.01) STANDARD DEV. 7213.76 6619.56 6223.31 5880.81 I (csq known) STANDARD DEV. 6938.66 6511.02 6190.01 5880.05 (csq estimated) I COEFF. OF VAR. (csq known) 29.29 26.90 25.15 24.06 COEFF. OF VAR. I (csq estimated) 28.00 25.97 24.56 23.72 ESTIMATE OF 1.20 1.22 1. 22 1. 13 C-SQUARED I The value given is the average over 100 simulations. I I I
I 6.116 Table 6.18 I Simulation of Sampling for Three Days with Uniform Variation of Density About Level of 50 Organisms/lOOO m3 I SIMULATION OF SAMPLING FOR 3 DAYS - TWENTY-FOUR 3-HOUR PERIODS Channel volume=100,000 cubic meters per 3-hour period. I Mean density subjected to uniform perturbations. Mean Density of Organisms= 50.0/ 1000 cubic meters Value of c-squared=O.OO I Volume of Sample in cubic meters I 100 150 200 300 TRUE 3-DAY TOTAL 119787.21 119787.21 119787.21 119787.21 I TRUE STD DEV 10944.73 8936.34 7739.10 6318.95 I TRUE COEFF. 9.15 7.47 6.47 5.28 OF VARIATION 3-DAY TOTAL 119450.00 120186.67 119970.00 119326.67 I (one sample) STANDARD DEV. (one sample) 20995.00 20101.41 19564.76 19036.92 I COEFF. OF VAR. (one sample) 17.51 16.69 16.27 15.92 I 3-DAY TOTAL 119205.00 120150.00 119915.00 119408.33 (two sample) STANDARD DEV. 10918.10 8949.86 7743.22 6308.94 I (csq known) STANDARD DEV. (csq estimated) 11619.17 9344.16 8210.36 6691.99 I COEFF. OF VAR. (csq known) 9.19 7.47 6.47 5.29 I COEFF. OF VAR. 9.77 7.78 6.84 5.59 (csq estimated) ESTIMATE OF .03 .01 .01 ,01 I C-SQUARED The value given is the average over 100 simulations. I I I I
I I 6.117 I Table 6.18 (continued) I SIMULATION OF SAMPLING FOR 3 DAYS - TWENTY-FOUR 3-HOUR PERIODS I Channel volume=100,000 cubic meters per 3-hour period. Mean density subjected to uniform perturbations. I Mean Density of Organisms= 50.0/ 1000 cubic meters Value of c-squared= .05 Volume of Sample in cubic meters I 100 150 200 300 TRUE 3-DAY 119787.21 119787.21 119787.21 119787.21 I TOTAL TRUE STD DEV 12336.30 10595.06 9606.81 8504.48 I TRUE COEFF. OF VARIATION 10.31 8.85 8.02 7.10 I 3-DAY TOTAL (one sample) 119890.00 117880.00 118180.00 118030.00 STANDARD DEV. 21469.51 20168.29 19738.48 1925:5.85 I (one sample) COEFF. OF VAR. 17.86 17.04 16.63 16.23 (one sample) I 3-DAY TOTAL (two sample) 119905.00 118680.00 118345.00 119701. 67 I STANDARD DEV. (csq known) 12334.79 10525.98 9530.59 8508.33 STANDARD DEV. 12569.21 10688.52 9218.87 8450.74 I (ceq estimated) COEFF. OF VAR. 10.31 8.88 8.06 7.11 (csq known) I COEFF. OF VAR. (csq estimated) 10.43 8.94 7.71 6.98 I ESTIMATE OF C-SQUARED
.06 .06 .04 .05 I The value given is the average over 100 simUlations.
I I
I 6.118 I Table 6.18 (continued) I SIMULATION OF SAMPLING FOR 3 DAYS - TWENTY-FOUR 3-HOUR PERIODS I Channel volume=100,000 cubic meters per 3-hour period. Mean density subjected to uniform perturbations. I Mean Density of Organisms= 50.0/ 1000 cubic meters Value of c-squared= .10 Volume of Sample in cubic meters I TRUE 3-DAY 100 119787.21 150 119787.21 200 119787.21 300 119787.21. I TOTAL TRUE STD DEV 13586.08 12027.15 11166.37 10233.44 I I TRUE COEFF. 11.35 10.04 9.32 8.54 OF VARIATION
- -DAY TOTAL 118170.00 119606.67 1.19480.00 119036.67 (one sample)
S;TANDARD DEV. 21443.65 20704.70 20345.88 19803.28 I (one sample) COEFF. OF VAR. (one sample) 18.08 17.23 16.95 16.57 I 3-DAY TOTAL (two sample) 119500.00 119076.67 119340.00 120091.67 I STANDARD DEV. 13534.70 11938.84 11147.16 10257.38 (csq known) STANDARD DEV. 13236.69 12187.15 10993.86 10065.48 I (csq estimated) COEFF. OF VAR. (csq known) 11.34 10.03 9.33 8.53 I I COEFF. OF VAR. 11.02 10.14 9. 10 8.29 (csq estimated) ESTIMATE OF .09
- 11 . 10 .10 C-SQUARED ThE~ value given is the average over 100 simulations.
I I I I
I I 6.119 Table 6.18 (continued) I I SIMULATION OF SAMPLING FOR 3 DAYS - TWENTY-FOUR 3-HOUR PE~IODS Channel volume=100,000 cubic meters per 3-hour peri*:)d. I Mean density subjected to uniform perturbations. Mean Density of Organisms= 50.01 1000 cubic meters Value of c-squared= .33 I Volume of Sample in cubic meters I TRUE 3-DAY 100 119787.21 150 119787.21 200 119787.21 300 119787.21 TOTAL I TRUE STD DEV 18323.92 17199.91 16609.41 15997.12 I TRUE COEFF. OF VARIATION 15.29 14.35 13.86 13.34
- 3-DAY TOTAL 122900.00 118073.33 118565.00 117036.67 I (one sample) lSTANDARD DEV. 23921.12 22126.71 21854.86 21196.86 (one sample)
I COEFF. OF VAR. (one sample) 19.17 18.51 18.26 17.86 I ;S-DAY TOTAL (two sample) 121670.00 118830.00 120130.00 118345.00 STANDARD DEV. 18445.58 17032.21 16611.88 15748.53 I (csq known) STANDARD DEV. 18353.47 16622.27 16612.65 15993.21 (csq estimated) I COEFF. OF VAR. (csq known) 15.11 14.26 13.76 13.26 I COEFF. OF VAR. (c:sq estimated) 14.73 13.73 13.55 13.28 ESTIMATE OF .34 .32 .34 .36 I C-SQUARED The value given is the average over 100 simulations. I I I
I 6.120 I Table 6.18 (continued) I SIMULATION OF SAMPLING FOR 3 DAYS - TWENTY-FOUR 3-HOUR PERIODS Channel volume=100,000 cubic meters per 3-hour period. I Mean density subjected to uniform perturbations. Mean Density of Organisms= 50.0/ 1000 cubic meters Value of c-squared= .50 I Volume of Sample in cubic meters I 100 150 200 300 TRUE 3-DAY TOTAL 119787.21 119787.21 119787.21 119787.21 I TRUE STD DEV 21065.58 20095.51 19592.47 19076.18 I TRUE COEFF. 17.58 16.76 16.34 15.91 OF VARIATION
- -DAY TOTAL 115040.00 119726.67 117795.00 120586.67 I
(one sample) STANDARD DEV. (one sample) 23700.21 24003.43 23095.94 23002.99 I COEFF. OF VAR. (one sample) 20.13 19.73 19.37 18.71 I
- )-DAY TOTAL 117680.00 120940.00 118535.00 119206.67 (two sample)
STANDARD DEV. 20789.18 20366.60 19421.03 18965.08 I {csq known) STANDARD DEV. (c:sq estimated) 20716.42 19843.39 19412.05 18968.31 I COEFF. OF VAR. (csq known) 17.54 16.69 16.28 15.79 I COEFF. OF VAR. 17.04 16. 11 16.06 15.44 (c:sq estimated) ESTIMATE OF .51 .48 c::...,
.;;;)"'- .51 I
C-SQUARED The value given is the aver-age over 100 simulations. I I I I
I I 6.121 Table 6.18 (continued) I I SIMULATION OF SAMPLING FOR 3 DAYS - TWENTY-FOUR 3-HOUR PI~RIODS Channel volume=100,OOO cubic meters per 3-hour period. I Mean density subjected to uniform perturbations. Mean Density of Organisms:: 50.01 1000 cubic meters Value of c-squared=1.00 I Volume of Sample in cubic meters I TRUE 3-DAY 100 119787.21 150 119787.21 200 119787.21 300 119787.21 TOTAL I TRUE STD DEV 27707.94 26977.79 26605.20 26227.31 I TRUE COEFF. OF VARIATION 23.11 22.50 22.19 21.87 3-DAY TOTAL 119120.00 121433.33 119550.00 117140.00 I (one sample) STANDARD DEV. 27885.66 27356.98 26241. 00 26205.66 (one sample) I COEFF. OF VAR. (one sample) 22.76 21.60 21.34 21.58 I 3-DAY TOTAL (two sample) 120865.00 121396.67 120287.50 119046.67 STANDARD DEV. 28161.59 26974.71 26604.74 26281.36 I (csq known) STANDARD DEV. 27329.01 27149.18 26601.01 26446.02 (csq.estimated) I COEFF. OF VAR. (csq known) 22.90 21.88 21. 74 21.66 I COEFF. OF VAR. (csq estimated) 22.19 21.54 21.27 21.35 ESTIMATE OF 1. 06 1.14 1.08 1.15 I C-SQUARED The value given is the average over 100 simulations. I I I
6.122 Table 6.19 I Simulation of Sampling for Three Days with Uniform Variation of Density About Level of 100 Organisms/lOOO m3 SIMULATION OF SAMPLING FOR 3 DAYS - TWENTY-FOUR 3-HOUR PERIODS Channel volume=100,000 cubic meters per 3-hour period. I Mean density subjected to uniform perturbations. Mean Density of Organisms=100.01 1000 cubic meters Value of c-squared=O.OO I Volume of Sample in cubic meters I 100 150 200 300 TRUE 3-DAY TOTAL 237817.32 237817.32 237817.32 237817.32 I TRUE STD DEV 15421. 33 12591.46 10904.52 8903.51 I TRUE COEFF. 6.49 5 .. 30 4.59 3.75 OF VARIATION
- 3-DAY TOTAL 238700.00 238306.67 237505.00 239013.33 I
(one sample) STANDARD DEV. (one sample) 39144.73 37966.59 37301.71 36940.33 I COEFF. OF VAR. (one sample) 16.36 15.90 15.67 15.42 I 3-DAY TOTAL 238870.00 237283.33 238335.00 238801.67 (two sample) STANDARD DEV. 15455.42 12577.32 10916.39 8921.91 I (csq known) STANDARD DEV. (csq estimated) 16229.91 13276.13 11317.57 9484.70 I COEFF. OF VAR. (csq known) 6.48 5.31 4.59 3.74 I COEFF. OF VAR. 6.79 5.59 4.75 3.97 (csq estimated) ESTIMATE OF .01 .01 0.00 0.00 I C-SQUARED The value given is the average over 100 simulations. I I I I
I I 6.123 Table 6.19 (continued) I I SIMULATION OF SAMPLING FOR 3 DAYS - TWENTY-FOUR 3-HOUR PERIODS Channel volume=100,000 c:ubic: meters per 3-hour period. I Mean density subjected to uniform perturbations. 11ean Density of Organisms=100.01 1000 cubic meters Value of c-squared= .~5 I Volume of Sample in cubic meteni 100 IS0 200 300 I "rRUE 3-DAY TOTAL 237817.32 237817.32 237817.32 237817.32 I TRUE STD DEV 19114.48 16914.22 15698.88 14381.19 TRUE COEFF. 8.04 7.11 6.60 6.05 I OF VARIATION 3-DAY TOTAL 238920.00 236306.67 236350.00 235423.33 (one sample) I STANDARD DEV. (one sample) 39910.40 38372.56 37914.01 37279.98 I COEFF. OF VAR. (one sample) 16.65 16.18 15.99 15.78 3-DAY TOTAL 239170.00 237363.33 237247.50 236383.33 I (two sample) STANDARD DEV. 19188.67 16891.86 15658.96 14323.12 (c:eq known) I STANDARD DEV. (csq estimated) 19098.95 1693:5.63 15660.98 14160.04 I COEFF. OF VAR. (csq known) 8.03 7.12 6.60 6.06 C:OEFF. OF VAR. 7.95 7.02 6.54 5.93 I (csq estimated) ESTIMATE OF .05 .05 .05 .05 C-SGlUARED I The value given is the average over 100 simulations. I I I
I 6.124 Table 6.19 (continued) I I SIMULATION OF SAMPLING FOR 3 DAYS - TWENTY-FOUR 3-HOUR PERIODS Channel volume=100,000 cubic meters per 3-hour period. I Mean density subjected to uniform perturbations. Mean Density of Organisms=100.01 1000 cubic meters Value of c-squared= .10 I Volume of Sample in cubic meters I 100 150 200 300 TRUE 3-DAY TOTAL 237817.32 237817.32 237817.32 237817.32 I TRUE STD DEV 22201.57 20338.07 19339.10 18285.64 I TRUE COEFF. 9.34 8.55 8.13 7.68 OF VARIATION 3-DAY TOTAL 234690.00 235640.00 239190.00 237256.67 I (one sample) STANDARD DEV. (one sample) 40066.20 39099.47 39416.30 38221.75 I COEFF. OF VAR. (one sample) 16.98 16.53 16.39 16.05 I
- 3:-DAY TOTAL 238830.00 236750.00 237347.50 238390.00 (two sample)
STANDARD DEV. 22293.61 20237.30 19319.37 18334.57 I (csq known) STANDARD DEV. (csq estimated) 22000.68 19939.30 19597.03 17584.15 I COEFF. OF VAR. (csq known) 9.33 8.54 8.13 7.68 I C:OEFF. OF VAR. 9. 11 8.33 8.16 7.30 (c:sq estimated) ESTIMATE OF .10 .10 . 11 .09 I C;-SQUARED The value given is the average over 100 simulations. I I I I
I I 6.125 Table 6.19 (continued) I I SIMULATION OF SAMPLING FOR 3 DAYS - TWENTY-FOUR 3-HOUR PERIODS Channel volume=100,000 cubic meters per 3-hour period. I Mean density subjected to uniform perturbationE. Mean Density of Organisms=100.01 1000 cubic meters Value of c-squared=.33 I Volume of S.mple in cubic meters I TRUE 3-DAY 100 237817.32 150 237817.32 200 237817.32 300 237817.32 TOTAL TRUE STD DEV 32986.60 31762.30 31132.10 30488.87 I TRUE COEFF. OF VARIATION 13.86 13.34 13.08- 12.81 3-DAY TOTAL 240170.00 238833.33 233550.00 236406.67 I (one sample) STANDARD DEV. 44616.59 42860.29 41740.09 42046.63 (one s.mple) I COEFF. OF VAR. (one sample) 18.33 17.78 17.65 17.61 I 3-DAY TOTAL (two sample) 239305.00 239840.00 236192.50 237298.33 STANDARD DEV. 33138.74 31828.10 30840.46 30363.18 I (csq known) STANDARD DEV. 32647.43 31129.05 30764.94 30931. 12 (csq estimated) I COEFF. OF VAR. (csq known) 13.78 13.22 12.97 12.73 I COEFF. OF VAR. (csq esti mated) 13.46 12.78 12.73 12.74 ESTIMATE OF .33 .32 .34 .36 I C-SQUARED The value given is the average over 100 simulations. I I I
I 6.126 Table 6.19 (continued) I SIMULATION OF SAMPLING FOR 3 DAYS - TWENTY-FOUR 3-HOUR PERIODS Channel volume=100,000 cubic meters per 3-hour period. I Mean density subjected to uniform perturbations. Mean Density of Organisms=100.0/ 1000 cubic meters Val t.te of c-squar-ed= .50 I 100 Volume of Sample in cubic meters 150 200 300 I TRUE 3-DAY TOTAL 237817.32 237817.32 237817.32 237817. :*2 I TRUE STD DEV 38900.88 37868.27 37341.26 36806.70 TRUE COEFF. 16.34 15.91 15.69 15.46 I OF VARIATION 3-DAY TOTAL (one sample) 236470.00 229253.33 236205.00 235706.67 I STANDARD DEV. (one sample) 45910.89 43980.65 44735.50 44102.42 I COEFF. OF VAR. 19.18 18.83 18.65 18.45 (one sample) 3-DAY TOTAL 236900.00 236076.67 239562.50 236533 .. 33 I (two sample) STANDARD DEV. (csq known) 38675.06 37850.24 37590.29 36487.64 I STANDARD DEV. (csq estimated) 38683.85 37549.64 37161.32 36885.23 I COEFF. OF VAR. 16.24 15.91 15.57 15.29 (csq known) COEFF. OF VAR. 15.98 15.50 15.22 15.22 I (csq estimated) ESTIMATE OF C-SQUARED
.52 .51 .50 .54 I The value given is the average over 100 simulations. I I
I I I
I I 6.127 Table 6.19 (continued) I SIMULATION OF SAMPLING FOR 3 DAYS - TWENTY-FOUR 3-HOUR PERIODS Channel volume=100,000 cubic meters per 3-hour period. I Mean density subjected to uniform perturbations. Mean Density of Organisms=100.0/ 1000 cubic meters Value of c-squared=1.00 I Volume of Sample in cubic meter!; 100 150 200 300 I TRUE 3-DAY 237817.32 237817.32 237817.32 237817.32 TOTAL I TRUE STD DEV 52808.51 52052.54 51670.40 51285.42 TRUE COEFF. 22.18 21.86 21.70 21.54 I 01= VARIATION 3-DAY TOTAL 234510.00 239513.33 241425.00 240930.00 (one sample) I !3TANDARD DEV. (one sample) 51853.93 52402.16 52746.16 51828.46 I I:OEFF. OF VAR. (one sample) 21.53 21.29 20.96 20.88 3-DAY TOTAL 233075.00 241120.00 241807.50 238586.67 I (two sample) STANDARD DEV. 52047.69 52738.01 52132.18 51339.86 (csq known) I STANPARD DEV. (csq estimated) 50657.63 49353.33 52609.23 50460.35 I COEFF. OF VAR. (csq known) 21.94 21.54 21.27 21.16 COEFF. OF VAR. 21.18 20.06 21.07 20.51 I (ceq estimated) ESTIMATE OF 1.07 .93 1. 21 1.04 C-SClUARED I The value given is the average over 100 simulations. I I I I
I 6.128 Table 6.20 I Simulation of Sampling for Three Days with Uniform 3 Variation of Density About Level of 250 Organisms/lOOO m SIMULATION OF SAMPLING FOR 3 DAYS - TWENTY-FOUR 3-HOUR PERIODS I Channel volume=lOO,OOO cubic meters per 3-hour period. MeBn density subjected to uniform perturbations. I Mean Density of Organisms=250.01 1000 cubic meters Value of c-squared=O.OO Volume of Sample in cubic meters I TRUE 3-DAY 100 600996.93 150 600996.93 200 600996.93 300 600996.93 I TOTAL TRUE STD DEV 24515.24 20016.61 17334.89 14153.88 I TRUE COEFF. OF VARIATION 4.08 3.33 2089 2.,36 I
;::-DAY TOTAL 603570.00 599620.00 602670.00 601476.67 (one sample)
STANDARD DEV. 93792.43 92197.32 92144.82 91363.26 I (one sample) COEFFo OF VAR. (one sample) 15.52 15.35 15.27 15.17 I
- -DAY TOTAL (two sample) 601285.00 600583.33 601782.50 602045.00 I
STANDARD DEV. 24521.11 20009.72 17346.22 14166.22
<csq known)
STANDARD DEV. 26004.90 21119.50 18212.46 15050.05 I (c:sq estimated) COEFF. OF VAR. (csq known) 4.08 3034 2089 2.36 I COEFF. OF VAR. (c:sq estimated) 4.32 3.51 3002 2.50 I ESTIMATE OF 0.00 0.00 0.00 0.00 C-SQUARED I The value given is the average over 100 simulations. I I I I
I I 6.129 Table 6.20 (continued) I SIMULATION OF SAMPLING FOR 3 DAYS - TWENTY-FOUR 3-HOUR PERIODS Channel volume=100,000 cubic meters per 3-hour period. I Mean density subjected to uniform perturbations. Mean Density of Organisms=250.0/ 1000 cubic meters Val ue o*f c-squared= .05 I Volume of Sample in cubic meters I TRUE 3-DAY 100 600996.93 150 600996.93 200 600996.93 300 600996.93 TOTAL I TRUE STD DEV 37625.45 34861.76 33394.26 31859.22 I TRUE COEFF. OF VARIATION 6.26 5.80 5.55 5.30 3-DAY TOTAL 597310.00 605913.33 603725.00 601290.00 I (one sample) STANDARD DEV. 95105.84 95375.52 94561.00 93433.02 (one sample) I COEFF. OF VAR. (one sample) 15.87 15.70 15.62 15.51 I 3-DAY TOTAL (two sample) 598950.00 602456.67 606362.50 600691.67 STANDARD DEV. 37558.13 34922.16 33669.61 31838.99 I (csq known) STANDARD DEV. 36725.06 34881.48 33295.78 30840.48 (csq estimated) I COEFF. OF VAR. (csq known) 6.27 5.79 5.55 5.29 I COEFF. OF VAR. (csq estimated) 6.05 5.74 5.40 5.07 ESTIMATE OF .05 .05 .05 .05 I C-SGlUARED The value given is the average over 100 simulations. I I I I
I 6.130 I Table 6.20 (continued) SIMULATION OF SAMPLING FOR 3 DAYS - TWENTY-FOUR 3-HOUR PERIODS I Channel volume=100,000 cubic meters per 3-hour period. Mean density subjected to uniform perturbations. I Mean Density of Organisms=250.01 1000 cubic: meters Value of c-squared= .10 Volume of Sample in cubic meters I 100 150 200 300 TRUE 3-DAY 600996.93 600996.93 600996.93 600996.93 I TOTAL TRUE STD DEV 47226.61 45055.75 43930.11 42774.85 I TRUE COEFF. 7.85 7.49 7.30 7.11 OF VARIATION 606290.00 600273.33 603620.00 607360.00 I 3-DAY TOTAL (one sample) STANDARD DEV. (one sample) 98905.21 96562.45 97057.14 96247.87 I COEFF. OF VAR. (one sample) 16.22 16.02 16.01 15.79 I
- 5-DAY TOTAL 601525.00 598083.33 598280.00 602650.00 (two sample)
I STANDARD DEV. 47273.61 44748.25 43815.10 42804.38 (csq known) STANDARD DEV. (c:sq estimated) 47575.36 44845.83 44770.05 42105.17 I COEFF. OF VAR. (c:sq known) 7.85 7.47 7.31 7.09 I COEFF. OF VAR. 7.79 7.39 7.35 6.89 (c:sq estimated) ESTIMATE OF .10 .10
- 11 .10 I
C-SQUARED Th,e value given is the average over 100 simulations. I I I I I
I I 6.131 Table 6.20 (continued) I SIMULATION OF SAMPLING FOR 3 DAYS - TWENTY-FOUR 3-HOUR PERIODS Channel volume=100,000 cubic meters per 3-hour period. I Mean density subjected to uniform perturbations. Mean Density of Organisms=250.01 1000 cubic meters Value of c-squared= .33 I Volume of Sample in cubic meter's 100 150 200 300 I TRUE 3-DAY TOTAL 600996.93 600996.93 600996.93 600996.93 I TRUE STD DEV 77666.78 76366.20 75707.53 75043.08 TRUE COEFF. 12.91 12.70 12.59 12.47 I OF VARIATION 3-DAY TOTAL 602650.00 592126.67 610690.00 611403.33 (one sample) I STANDARD DEV. (one sample) 107687.51 105019.57 107670.98 106413.98 I COEFF. OF VAR. (one sample) 17.68 17.49 17.45 17.22 3-DAY TOTAL 594755.00 598690.00 605035.00 608488.33 I (two sample) STANDARD DEV. 76831.68 76088.91 76202.96 75783.38 (csq known) I STANDARD DEV.
<csq estimated) 77406.30 75427.71 75930.69 75572.13 I COEFF. OF VAR.
(csq known) 12.85 12.64 12.52 12.39 COEFF. OF VAR. 12.77 12.37 12.33 12.10 I (csq estimated) ESTIMATE OF .35 .33 .34 .33 C-SQUARED I The value given is the average over 100 simulations. I I I I
I 6.132 I Table 6.20 (continued) SIMULATION OF SAMPLING FOR 3 DAYS - TWENTY-FOUR 3-HOUR PERIODS Channel volume=100,000 cubic meters per 3-hour period. I M~an Mean density subjected to uniform perturbations. Density of Organisms=250.01 1000 cubic meters Value of c-squared= .50 I Volume of Sample in cubic meters I 100 150 200 300 TFCUE 3-DAY TOTAL 600996.93 600996.93 600996.93 600996.93 I TFtUE STD DEV 93529.55 92452.39 91909.07 91362.53 I TF~UE COEFF. 15.55 15.37 15.28 15.19 OF VARIATION 3--DAY TOTAL 601830.00 604866.67 598755.00 597956.67 I (one sample) STANDARD DEV. (one sample) 112318.70 112681.50 111790.33 110571. 18 I CDEFF. OF VAR. (one sample) 18.40 18.27 18.33 18.23 I 3--DAY TOTAL 603970.00 596753.33 599592.50 589971. 67 (two sample) STANDARD DEV. 93962.89 91711.32 91603.50 89230.00 I (csq known) STANDARD DEV. (csq estimated) 92454.64 90996.46 92236.67 91400.54 I CIJEFF. OF VAR. (csq known) 15.42 15.25 15.13 15.00 I CIJEFF. OF VAR. 14.92 14.86 15.04 15.05 (c';;q estimated) ESTIMATE OF .50 .51 m53 .54 I C-SQUARED ThE! value given is the average over 100 simulations. I I I I I
I I 6.133 Table 6.20 (continued) SIMULATION OF SAMPLING FOR 3 DAYS - TWENTY-FOUR 3-HOUR PERIODS I Channel volume=100,000 cubic meters per 3-hour period. Mean density subjected to uniform perturbations. I Mean Density of Organisms=250.0/ 1000 cubic meters Value of c-squared=1.0 I 100 Volume of Sample in cubic meter"s 1~0 200 300 I TRUE 3-DAV TOTAL 600996.93 600996.93 600996.93 600996.93 TRUE STD DEV 129979.06 129206.12 128817.92 128428.54 I TRUE COEFF. 21.61 21.48 21.41 21.35 OF VARIATION I 3-DAV TOTAL (one sample) 632900.00 603733.33 596575.00 602816.67 I STANDARD DEV. (one sample) 139546.26 128629.15 127829.57 128426.55 COEFF. OF VAR. 21.25 20.63 20.82 20.68 I (one sample) 3-DAY TOTAL 605405.00 592306.67 605167.50 601176.67 (two sample) I STANDARD DEV. (csq known) 133210.42 126399.54 129707.87 128029.47 I STANDARD DEV. (csq estimated) 132288.45 125631.05 130780.88 126248.32 COEFF. OF VAR. 21.56 20.96 21. 10 . 21.01 I (csq known) COEFF. OF VAR. 21.08 20.50 20.90 20.35 (csq estimated) I ESTIMATE OF C-SQUARED 1.07 1.02 1. 10 1.06 I The value given is the average over 100 simulations. I I I I
I 6.134 Table 6.21 I I Simulation of la-Day Entrainment Season SIMULATION OF SAMPLING FOR 18 DAYS - 144 3-HOUR PERIODS Channel volume=100,000 c:ubic meters per 3-hour period. Mean density subjec:ted to random perturbations. I Mean Densitylper 1000 c:ubic: meters)/No. of Days: Value of c:-squared=O.OO 10/8, 50/4, 100/4, 250/2 I Volume of Sample in c:ubic: meters I 100 150 200 300 TRUE SEASON TOTAL 944854.79 944854.79 944854.79 944854.79 I TRUE STO DEV 30738.49 25097.87 21735 .. 39 17746.88 TRUE COEFF. .,.
.. j . ...,C'" "::'...J 2.66 2.30 1.88 I
OF VARIATION SEASON TOTAL lone sample) 951750.00 945973.33 951.365.00 9433=3.33 I STANDARD DEV. 93390.15 91299.63 90769.90 890.32.07 (one sample) COEFF. OF VAR. 9.80 9.64 9053 9.42 I (one sample) SEASON TOTAL (two sample) 948960.00 945903.33 949760.00 943813.33 I STANDARD DEV. 30805.19 25111. 79 21791.74 17737.09 (c:sq known) STANDARD DEV. 31810.22 26221.75 22482.49 18325.67 I (csq estimated) COEFF. OF VAR. ( c:sq known)
.,. ..,c:- ":'O ..... ..J 2.66 2.30 1.88 I 94 I .,. ..... C'"
COEFF. OF VAR. .~ **..;...,J 2.77 2.37 1. ':c:sq estimated) ESTIMATE OF 0.00 0.00 0.00 0.00 C-SQUARED The value given is the average over 100 simulations. I I I I
I I 6.135 Table 6.21 continued I I I SIMULATION OF SAMPLING FOR 18 DAYS 144 3-HOUR PERIODS Chann.l velum.-100,000 cubic m.tars par 3-heur paried. I Mean Den5ity(par 1000 cubic matar5)/Ne * .of Days: 10/8, 50/4. 100/4, 250/2 I Velume .of Sample in cubic meters 100 lS0 200 I 300 TRUE SEASON 9448S4.79 9448:54.79 944854.79 944854.79 TOTAL I TRUE STD DEV 41351.3:5 37349.46 35178.21 32863.82 TRUE COEFF. 4.38 3.95 3.72 3.48 I OF VARIATION SEASON TOTAL 945750.00 942746.67 938410.00 94:5:;23.33 (.one sample) I STANDARD DEV. (.one sample) 94962.62 93142.42 91792.73 91675.79 COEFF. OF VAR. 10.01 9.85 9.76 9.66 I (.one sample) SEASON TOTAL (two sample) 941815.00 944053.33 942245.00 945190.00 I STANDARD DEV. (c:sq knewn) 41224.74 37305.71 35112.45 32893.47 STANDARD DEV. 40981.40 37220.66 34456.39 32927.34 I (csq estimated) COEFF. OF VAR. (csq knewn) 4.38 3.95 3.72 3.47 I COEFF. OF VAR. (c:sq astimated) 4.32 3.92 3.63 3.45 ESTIMATE OF .05 .05 .05 .05 I C-SQUARED The value given is the averaga over 100 simulatlons. I I I
I 6.1.36 Table 6.21 continued I I I SIMULATION OF SAMPLING FOR 18 DAYS - 144 3-HOUR PERIODS Channel Yolume=100,OOO cubic meters per 3-hour period. I r1ean densi ty subjected to random perturbati ons. Mean DensityCper 1000 cubic meters)/No. of Days: Value of c-squared= .10 10/8, 50/4, 100/4, :50/~ I Volume of Sample in cubic meters I 100 150 200 300 TRUE SEASON TOTAL 944854.79 944854.79 944854.79 944854.79 I TRUE STD DEV 49749.50 46476.46 44750.26 42954.74 TRUE COEFF. 5.26 4.92 4.,73 4.54 I OF VARIATION SEASON TOTAL (one sample) 941090.00 934826.67 94:385.00 945956.67 I STANDARD DEV. 97170.73 93558.99 94106.47 93988.62 (one sample) COEFF. OF VAR. 10.27 9.96 9.95 9.87 I (one sample) SEASON TOTAL (two samole) 942575.00 942776.67 940532.50 946643.33 I I STANDARD DEV. 49719.29 46414.12 44497.18 43155.30 (csq known) STANDARD DEV. 49303.60 45206.61 44616.21 42326.99 Ccsq estimated) COEFF. OF VAR. (csq known) 5.27 4.92 4.72 4.54 I COEFF. OF VAR. (csq estimated) ESTIMATE OF 5.19
.10 4.76 .09 4.69 .10 4.43 .10 I
C-SQUARED The value given is the average over 100 simulations. I I I
I 6.137 I Table 6.21 continued I I I SIMULATION OF SAMPLING FOR 18 DAVS - 144 3-HOUR PERIODS Channel volume-l00,OOO cubic meters per 3-hour period. Me.n density subjected to r.ndom perturb.tions. I Mean Density(per 1000 cubic meters)/No. of Days: 10/8, 50/4, 100/4, 250/2 V.lue of c-5quared- .33 I Volume of Sample in cubic meters 100 150 200 300 944854.79 TRUE SEASON 944854.79 944854.79 944854.79 TOTAL TRUE STD DEV 77751.73 75699.27 74651.88 73589.59 TRUE COEFF. 8.22 8.00 7.89 7.78 I OF VARIATION SEASON TOTAL 928690.00 938673.33 941915.00 931543.33 (one sample) I STANDARD DEV. (one sample) 103544.82 102558.36 103980.38 100218.82 COEFF. OF VAR. 11.01 10.81 10.92 10.67 I (one sample) SEASON TOTAL (two sample) 935840.00 941813.33 942077.50 941593.33 I STANDARD DEV. (csq known) 76929.06 75202.85 74257.39 73145.'::0 STANDARD DEV. 7:5447.60 75173.16 74379.58 71156.29 I (csq estimated) COEFF. OF VAR. (c:sq known) 8.18 7.94 7.84 7.73 I COEFF. OF VAR. (c:sq Itstimatad) 7.94 7.86 7.78 7.47 ESTIMATE OF .32 .34 .34 .32 I C-SQUARED The value given is the average over 100 si mul ati on5 .* I I I
I 6.138 Table 6.21 continued I I I SIMULATION OF SAMPLING FOR 18 DAYS - 144 3-HOUR PERIODS Channel volume=100,000 c:ubic: meters per 3-hour period. I"lean densi ty subjec:ted to random perturbati .ons. I Mean Density<pe'r 1000 c:ubic: meters)/No . .of Days: Value of c:-squared= .50 10/8,50/4,100/4,250/2 I 100 Volume of Sample in c:ub.ic: meters 150 200 300 I TRUE SEASON TOTAL 944854.79 944854.79 944854.79 944854.79 I TRUE STD DEV 92712.70 90998.32 90128.90 89251.01 TRUE COEFF. 9.80 9.62 9.53 9.43 I OF VARIATION SEASON TOTAL (one sample) 934450.00 94137:::.. 33 929490.00 936900.00 I STANDARD DEV. 109867.51 109574.10 108382.12 106943.07 (.one sample) COEFF. OF liAR. 11.57 11.43 11.44 11.24 I (.one sample) SEASON TOTAL (two samole) 934475.00 944113.33 937785.00 941418.33 I STANDARD DEV. 91139.72 90785.03 89754.52 88792.01 ( c:sq kno .... n) STANDARD DEV. 91972.20 89923.37 90028.54 88373.30 I (c:sq estimated) COEFF. OF VAR. ( c:sq kn.o .... n) 9.70 9.54 9.50 9.37 I COEFF. OF VAR. 9.70 9.38 9.45 9.22 ':csq estimated) ESTIMATE OF .,--
* ...J..;. .51 C"~ ......J . . " 51 I
C-SQUARED
'~e value given is the average over 100 simulations.
I I I I
I 6.139 I Table 6.21 continued I SIMULATION OF SAMPLING FOR 18 DAYS - 144 3-HOUR PERIODS I Channel volume-100,000 cubic meters per 3-hour period. Mean density subjectad to random perturbations. I Mean Oansity<per 1000 cubic meters)/No. of Oays: 10/8, 50/4, 100/4, 250/2 Value of c-squared-1.00 I Volume of Sample in cubic meters 100 1:;0 200 300 TRUE SEASON 9448:;4.79 944854.79 944854.79 944854.79 TOTAL TRUE STO OEV 127461.51 126219.98 125594.62 124966.13 I TRUE COEFF. 13.47 13.34 13.28 13.21 OF VARIATION I SEASON TOTAL (ona sample)
<<;':;8880.00 922193.33 918240.00 942323.33 I
STANDARD DEV. 133072.31 122166.15 120520.75 124311.24 (ona sample) COEFF. OF VAR. 13.48 12.92 12.79 12.94 (ona sample) I SEASON TOTAL (two sample) 948935.00 938376.67 929012.50 <<;'44265.00 I STANDARD DEV. (csq known) STANDARD DEV. 128896.08 1281<<;'0.27 12:;723.:;2 123248.86 121647. :)1 123324.43 124982.25 125472.73 (c!lq astimillted) I COEFF. OF VAR. (csq known) 13.40 13.20 12.97 13.07 I COEFF. OF VAR. (CSQ estimated) ESTIMATE OF 13.18 1.05 12.89 1.0:: 12.97
- 1. 12 1::.05
- 1. 08 C-StlUARED I The value given is tha average over 100 si mul ati or,s.
I I
I 6.140 Tab 1e 6 *. 22 I I Simulation of 34-Day Entrainment Season I SIMUILATION OF SAMPLING FOR 34 DAYS - 272 3-HOUR PERIODS Channel volume=100,OOO cubic meters per 3-hour period. I Mlean densi ty subjected to random perturbati on.s. Mean DensityCper 1000 cubic meters)/No. of Days: Value of c-squared=O.OO 10/10, 50/10, 100/8, 250/6 I Volume of Sample in cubic meters I 100 150 200 300 TRUE SEASON TOTAL 2320279.09 2320279.09 2320279.09 2320279.09 I TRUE STD DEV 48169.28 39330.05 34060.82 27810.54 TRUE COEFF. 2.08 1. 70 1.47 1.20 I OF VARIATION SEASON TOTAL Cone sample) 2327440.00 2321626.67 2321045.00 2321400.00 I STANDARD DEV. 153064.50 148552.18 147357.17 I 150103.22 (one sample) COEFF. OF VAR. 6.57 6.46 6.40 6.34 (one sample) SEASON TOTAL (two sample) 2321970.00 2320270.00 2321660.00 2319085.00 I STANDARD DEV. 48186.82 39329.97 34070.96 27803.39 (csq known) STANDARD DEV. 49089.71 40292.51 35034.27 28575.48 (csq estimated) COEFF. OF VAR. Ccsq known) 2.08 1. 70 1. 47 1. 20 I COEFF. OF VAR. (csq estimated) ESTIMATE OF 2.11 0.00
- 1. 74 0.00
- 1. 51 0.00
- 1. 23 0000 I
C-SQUARED The val~e given is the average over 100 simulations. I I I I
I I 6.141 Table 6.22 continued I I I SIMULATION OF SAMPLING FOR 34 DAYS - 272 3-HOUR PERIODS Channel volumaal00,000 cubic meters per 3-hour period. Mean density subjected to random perturbationa. I Mean Density(per 1000 cubic meter.)/No. of Days: 10/10, ~0/10, 100/B, 250/6 Value of c-squared= .05 I Volume of Sample in cubic meters 100 1~0 200 300 I TRUE SEASON TOTAL 2320279.09 2320279.09 2320279.09 2320279.09 I TRUE STD DEV TRUE COEFF. 66430.77 2.B6 60329.27 2.60 57034.26 2.46 53536.B5 2.31 OF VARIATION I SEASON TOTAL (one sample) 2328610.00 2317713.33 2321615.00 2321783.33 I STANDARD DEV. (one sample) 156616.70 152930.94 1~2081. 34 151042.60 COEFF. OF VAR. 6.72 6.59 6.54 6.50 I (one sample) SEASON TOTAL 2325270.00 2320403.33 2318430.00 2323265.00 (two sample) I STANDARD DEV. (csq known) 66568.43 60323.10 56971.62 53669.29 STANDARD DEV. 66503.38 60510.11 57110.bB 5::5b2.62 I (csq estimated) COEFF. OF VAR. (csq known) 2.86 2.60 2.46 2.31 I COEFF. OF VAR. (csq estimated) 2.85 2.60 ~.45 2.::9 ESTIMATE OF .05 .05 .05 .05 I C-SQUARED The value given is the average over 100 simulatlvr.s. I' I I
I 6.142 Table 6.22 continued I I SIMULATION OF SAMPLING FOR 34 DAYS - 272 3-HOUR PERIODS Channel volume=100,000 cubic meters per 3-hour period. Mean density subjected to random perturbations. I Mean Density(per 1000 cubic meters)/No. of Days: Value of c-squared= .10 10/10, 50/10, 100/8, 250/6 I 100 Volume of Sample in cubic meters 150 200 300 I TRUE SEASON TOTAL 2320279.09 2320279.09 2320279.09 2320279.09 I TRUE STO OEV 80658.63 75712.54 73114.12 70419.90 TRUE COEFF. OF VARIATION 3.48 3.26 3.15 3.03 I SEASON TOTAL (one sample) 2324740.00. 2323893.33 2324500.00 2321886.67 I STANDARD DEV. 159450.56 156837.44 155689.61 154270.06 (one sample) CCEFF. OF VAR. 6.85 6.74 6.68 6.63 I (one sample) SEASON TOTAL (two sampl e) 2324535.00 2325800.00 232~390.00 2324370.00 I STANDARD DEV. 80787.37 75900.29 73341.46 70539.91 (csq kno .... n) STANDARD DEV. 81447.74 76039.73 73781.47 71570.18 I (csq estimated) COEFF. OF VAR. (c:sq kno .... n) 3.47 3.26 3.15 3.03 I COEFF. OF VAR. 3.49 3.26 3.16 3.07 ( c:sq estimated) ESTIMATE OF .10 .10 .10 .. 10 I C-SQUARED The value given is the average over 100 simulations. I I I
I 6.143 I Table 6.22 continued I I SIMULATION OF SAMPLING FOR 34 DAYS - 272 3-HOUR PERIODS I Channel volum.-l00,OOO cubic meters per 3-hour period. Mean density subjected to random perturbations. I Mean Density(per 1000 cubic meters)/No. of Days: 10/10, 50/10, 101)/8, 250/6 Value of c-squared= .33 I 100 Volume of Sample in cubic m.ters 150 300 200 I TRUE SEASON TOTAL 2320279.09 23;;0279.09 2320279.09 :2320279.09 TRUE STO DEV 127:561.46 124492.97 122930.00 121346.91 I TRUE COEFF. 5.S0 5.36 5.30 5.23 OF VARIATION I SEASON TOTAL (one sample) 2325820.00 2328093.33 2325510.00 2328600.00 STANDARD DEV. 174018.33 172738.96 170295.90 171296.33 I (one sample) COEFF. OF VAR. 7.45 7.38 7.29 . 7 ....... (one sample) I SEASON TOTAL (two sample) 2329760.00 2323266.67 2329982.50 2327271.67 STANDARD DEV. 128288.78 124743.11 123358.58 122228.59 I (ceq known) STANDARD DEV. 130150.80 125344.49 124362.21 123913.60 (csq estimated) I COEFF. OF VAR. (csq known) 5.49 5.35 5.28 5.24 COEFF. OF VAR. 5.55 S.36 5.29 ... ...
~
I
.J.~,
(csq estimated) ESTIMATE OF .35 .34 .34 .35 C-SQUARED I The value given is the average over 100 simulations. I I I
I 6.144 Table 6.22 continued I I I SIMUL.ATION OF SAMPLING FOR 34 DAYS - 272 3-HOUR PERIODS Chanrlel volume-100,000 cubic meters per 3-hour period. Me,an density subjected to random perturbations. I Mean Density(per 1000 cubic meters)/No. of Days: Value of c-squared= .50 10/10. 50/10, 100/8. 250/6 I 100 Volume of Sample in cubic: meters 150 200 300 I TRUE SEASON TOTAL 2320279.09 2320279.09 2320279.09 2320279.09 I TRUE STD DEV 152472.81 149915.08 148619.71 147312.95 TRUE COEFF. 6.57 6.46 6.40 6.35 I OF VARIATION SEASON TOTAL (one sample) 2334760.00 2335673.33 2338165.00 2347490.00 I STANDARD DEV. 186216.57 183610.51 182332.15 182491.59 (one samole) COEFF. OF VAR. 7.92 7.82 7.74 7.72 I (one sample) SEASON TOTAL (two sample)
- !332955.00 2328290.00 2333447.50 2340266.67 I
STANDARD DEV. 153690.54 150349.59 149516.37 149075.54 (csq known) STANDARD DEV. 154240.45 149647.15 151143e75 149851.30 I (csq estimated) COEFF. OF VAR. (csq known) 6.56 6.44 6.39 6.35 I COEFF. OF VAR. 6.54 6.40 6.42 6.36 (CSQ estimated) ESTIMATE OF .51 .,~O co"">
* ..J .. ,,51 I
C-SQUARED The value given is the average over 100 simulations. I I I I
I I 6.14.5 Table 6.22 continued I I SIMULATION OF SAMPLING FOR 34 DAYS - 272 3-HOUR PERIODS I Channel volume-l00,000 cubic meters per 3-hour period. Mean density subjectad to random perturbationa. I Mean Density(per 1000 cubic matara)/No. oT Days: 10/10, 50/10, 100/8, 250/6 Valua of c-squared=1.00 I Volume of Sample in cubic maters 100 1:50 200 300 I TRUE SEASON TOTAL 2320279.09 2320279.09 2320279.09 2320279.09 I TRUE STD DEV TRUE COEFF. 210180.01 9.0:5 208331.97 8.97 207401. 77 8.93 206467.39 8.89 OF VARIATION I SEASON TOTAL (one sampla) 2342240.00 2357720.00 2318155.00 2318063.33 I STANDARD DEV. (one sample) COEFF. OF VAR. 212585.96 8.96 213809.20 8.93 205037.91 8.71 204160.27 8.71 (one sample) I SEASON TOTAL (two sample) 2344975.00 23:50770.00 2332317.30 2=40256.67 I STANDARD DEV. (csq known) STANDARD DEV. 213940.30 214003.98 212537.74 2143:5:5.56 208649.70 210062.12 208090.62 206664.11 (csq estimAted) I COEFF. OF VAR. (csq known) 9.07 8.98 8.89 8.84 I COEFF. OF VAR. (csq estimated) 9.05 9.05 8.91 8.74 ESTIMATE OF 1.05 1.07 1.06 1.01 I C-SQUARED The value given is tha average over 100 simulations. I I I
I 6.146 resulting estimate of c 2 can be used effectively in achieving our goal I of entrainment estimation. Note that although the 5~ 40, or 24 paired observations were viewed here as data from 7, 5, or 3 days, they could in practice be scattered over an entrainment sampling season of several months. I In Tables 6.9 to 6.12, 6.13 to 6.16, or 6.17 to 6.20, factors that affect the accuracy of estimation of c 2 include the mean organism density, the value of c 2 , and I the pattern of daily variation (diel or uniform). Low organism density, high values of c:~, and diel variation increase the variability of ~2. I Thus, although the estimation of c 2 is generally quite satisfactory, factor combinations that increase the inherent difficulty of this estima-tion may require special attention. I 6.5 STRUCTURE OF THE SIMULATION STUDY OF SEASONAL ENTRAINMENT I The simulation of entrainment sampling presented here is structured as a series of 24-hour days, each of which is individually described by the study of daily entrainment in Sections 6.1 to 6.3. The behavior of I I?ntrainment sampling and estimation is examined for two simulated seasons, il short season of 18 days and a long season of 34 days. Organism density
'is zero during the rest of the entrainment sampling period of about three I
months. The short entrainment season consists of 18 days, of which 8 have organism I density A = 10, 4 have A = 50, 4 have A = 100, and 2 have A = 250 (organism density A is expressed here in organisms per 1000 cubic meters, as in Sections 6.1 to 6.3). This seasonal set of AIS would be a good approximation to the density pattern of a species-life stage combination I appearing only in a brief spike during the entrainment sampling period. That is, this type of organism has density zero for part of the entrain-ment sampling period, then might have 4 days with A = 10, 2 days with I A = 50, 2 days with A = 100, 2 days with A = 250, then 2 days with A = 100, 2 days with A = 50, 4 days with A = 10, followed by density A = 0 for the remainder of the entrainment season. It is important to note, I however, that this plausible pattern is only one pattern out of many that generate the specified 18 AIS of our short entrainment season. This is because the timE! order of the 18 days with these AIS is immaterial, i.e., has no effect whatsoever on the analysiS to be presented. Scattering I the 18 specified AIS randomly, systematically, or arbitrarily throughout the overall entrainment period of three months or so and setting A equal to zero on eveD' remaining day would produce the same analysis as the I spiked entrainmE!nt pattern just described. The long entrainment season consists of 34 days, of which 10 have organism I density A = 10, 10 have A= 50, 8 have A = 100, and 6 have A = 250. This seasonal set of AIS would be a good approximation to the density pattern of a species-life stage combination appearing in a long spike I occurring at some point in the entrainment sampling period. For example, this would be the case for an organism whose density is zero for part of I I
I I 6.147 the entrainment sampling period of three months, then rises to A = 10 for 5 days, then to A = 50 for 5 days, A = 100 for 4 days, A = 250 for 6 I days, then falls to A = 100 for 4 days, A = 50 for 5 days, A = 10 for 5 days, and finally to zero for the rest of the sampling period. As -with the l8-day entrainment season, this single spike is only one of a multi-I plicity of possible seasonal patterns resulting in the specified 34 A'S; any distribution of the 34 days across the 3-month entrainment sampling period, with all remaining days having density zero, will pro-duce the same analysis as the single long spike entrainment pattern. This I class of distributions includes everything from double spike patterns (two spikes occurring at different times) to randomly located days with nonzero density. I As in Sections 6.1 to 6.3, we investigate the behavior of entrainment sampling and estimation over a range of sample volumes (v = 100, 150, 200, I and 300 cubic meters per 3-hour period), temporal patterns (uniform and diel), and values of c 2 (.0, .05, .10, .33, .50, and 1.00). For any combination of these values, our analysis of an entrainment season pro-I ceeds as follows. We begin by introducing some necessary notation, much of which coincides I with previously introduced notation. Let T = true total entrainment over the entire season, I T(j) = true daily total entrainment for day j, m = number of days in the entire season, I m_ A = number of days for which mean organism density is A. Then I T= m I T{j) = T{l) + T(2) + *** + T(m) (6.1) j=l I and
,. m ,.
T = L T(j), I j=l (6.2) I '" ,. where T and T(j) are estimates of T and T(j), respectively. From (6.2) T to (6.3) below, the analysis applies ~o l , T~, or T~ of Section 6.2, I any of which may be what is meant by T in (6.2) to (6.3). The expected value of T, E(T), and the simulation estimate of E(T) are I I I
I 6.148 I A m A E(T) = j~IE(T(j)) = m10 D10 + msoDso + m100D100 + m2S0D2S0' I I where A I DA = expected value of T(j) when mean organism density is A, I dA = simulation estimate of DA. A I The variance of T and its simulation estimate are I Var(T) = IVar(l-(j )) = m10U 10 + m50 U50 + mlOOUlOO + m250U250' I Var(T) = Ivar(l-(j )) = m10u10 + mSOu SO + ml00uI00 + m2S0u2S0' I where I UA = variance of T(j). when mean organism density is A, uA = simulation estimate of UAo I The coefficient of variation of T and its simulation estimate are I I (6.3) I An example of the calculations can be constructed using Tables 6.1 to 6.4, in which the variation in organism density within a day is uniform. I Take sample volume v = ISO and quadratic variance parameter c2 = .10, so all figures will come from the second column of the third page of Tables I 6.1 to 6.4. Take V = 100,000 and consider the short, la-day simulated season. Simulation estimates for the true entrainment are I I
I I 6.149 E(T) = 8(8,036.39) + 4(40,174.97) + 4(80,680.24) + 2(199,403.12) I = 946,518.2 I Var(T) = 8(2,495.99)2 + 4(6,971.71)2 + 4(11,906.71)2 + 2(25,966.30)2 I = 2,159,835,133 SO(T) = [Var(T)]1/2 = 46,474 I . eV(T) = 46,474/946,518 = 4.91% I . Simulation estimates for the one-sample estimator Tl are I E(l!) = 8(8,306.67)+4(40,046.67)+4(80,240.00)+2(198,546.67)=944,693.4 I I
= 8,974,862,755 I A A SD(T 1) = 94,736 I ,. ,.
CV(T 1) = 10.03% I Simulation estimates for the two-sample estimator with c 2 estimated, I . l~, are (with the superscript omit"ted for brevity) I ,. ,. E(T 2) = 8{8,170.00)+4(39,636.67)+4(80,520.00)+2(201,320.00)=948,626.7 I I = 2,882,621,829 I SD(T 2) = 53,690 I
I 6.150 I These numbers are not identical to the entries in the second column of the third page of Table 6.21 (page 6.136), which reports the simulation I results from the situation just described. However, the numbers are quite close to the table entries. The slight discrepancies are due to the fact that Table 6.21 was produced by a second simulation run, based I on different random numbers from the ones that led to Tables 6.1 to 6.4. The effects of erroneously assuming that the data are Poisson rather than I negative binomial are easily shown. The Poisson assumption gives A A A Var(T 1) = (V/v)T 1 = (666.667)(944,693.4) = 629,795,914.9 I A A Var(T 2) A
= (V/v)T 2/2 = (333.333)(948,626.7) = 316,208,583.8 I A A The first of these is much lower than Var(T 1) = 8,974,8~2,755 given I above, which is Cipp~oxi~ately equal to the variance of T1. The second is much lower than Var(T 2) = 2,882,621,~29 given above, which is I approximately equal to the variance of T2. The Poisson assumption thus gives estimated variances, standard deviations, and coefficients of variation that are substantially smaller than those obtained from the I
correct calculations. This produces confidence intervals whose true confidence levels are substantially lower than their nominal confidence levels, which demonstrates the severe shortcomings of using Poisson- I based methods in situations when the Poisson assumption fails to hold.
6.6 CONCLUSION
S FROM I THE SIMULATION STUDY I The simulation study is based on a reasonable approximation of the actual . entrainment conditions encountered in practice. The study does not reflect true entra,inment conditiorrs exactly: the entrainment seasons of I organisms of various species and life stages will not be exactly 3 days, 5 days, 7 days, 18 days, or 34 days in length; organism densities will not behave exactly as specified here; and discharge channel volume and sample volume will not be identical in all 3-hour periods or p.s.u.1s. I Thf~se approximations are inevitable in a simulation study of a phenomenon as complex as entrainment sampling. We can draw general conclusions from our simulations on the basis of the strong similarities observed between I unHorm and diel variation of organism density and among 3-day, 5-day, 7-day, 18-day, and 34-day entrainment seasons. I I I
I I '.151 I A major conclusion from this study is that the gain from drawing samples larger than 150 cubic meters per 3-hour period would be small relative to the cost involved. This can be seen both in the daily simulation I study (Tables 6.1 to 6.8) and the seasonal study (Tables 6.9 to 6.22). The present level of sampling effort is therefore adequate for the type of entrainment estimation considered in this project. I A second conclusion is that, as expected, estimation of entr'ainment is more difficult when the density of organisms is low. That is, the coefficient of variation of the estimate of seasonal entrainment increases I as organism density throughout the season decreases. A third conclusion is that under many entrainment conditions there are I substantial benefits from estimating c2 and using these estimates to calculate confidence intervals, coefficients of variation, and standard deviations. Estimating c2 gives shorter confidence intervals and smaller I coefficients of variation and standard deviations than the one-sample estimation method, which does not estimate c2 from paired observations. I It also avoids the necessity (discussed in Section 4.5) of using the empirical conservative upper bound, c2 < 1. I A final conclusion is that, if the entrainment process conforms to the assumptions (e.g., paired observations are replicates from a negative binomial process), the estimation techniques developed in earlier chapters I provide an effective means of obtaining entrainment estimates with acceptable margins of error, as measured by standard deviations and co-efficients of variation of these estimates. I I I I I
I I I
- 7. SUM-QUOTA SAMPLING IN ENTRAINMENT ESTIMATION I
7.0 INTRODUCTION
I It is possible that nearly all of the entrainment information contained in a sample could be obtained from an analysis of only part of the sample's contents. For example, it might be possible to extract 95% of a I sample's information from an analYSis of 50% of the sample volume, that is, sorting and identifying ichthyoplankton from only half of the full sample. This is quite a natural prospect: a sample collected during a I p.s.u. gives accurate information about the entrainment in the entire dis-charge channel flow during the p.s.u., which is a much larger volume of cooling water; similarly, a fraction of the sample's contents, if properly analyzed, may give accurate information about the entrainment present in I the entire sample. This idea could be exploited by dividing each sample into several equal smaller volumes, called subsamples or splits, and per-forming laboratory work on some but not all of these. For instance, a I sample of volume 160 cubic meters might be divided into eight splits, each of volume 20 cubic meters, and laboratory identification of ichthyo-plankton might be performed on four of these splits. The advantages of I such a procedure are clear: identification is performed for only part of each sample, so for a given cost we could process larger samples, more samples per p.s.u., or more p.s.u.'s in a noncontinuous sampling plan, giving more accurate entrainment estimates. Alternatively, we could I leave the level of accuracy unchanged, choosing instead to lower the cost of estimation. I This general approach of subdividing each sample for analysis leads to three-stage sampling designs, which were described in Section 3.2. As was noted there, the number of splits analyzed from each sample may be either fixed I or adaptive. With a fixed sampling plan, the same number of splits is selected at random from each sample and analyzed. With an adaptive sampl-ling plan, splits are selected at random from a given sample and analyzed until Gertain conditions are met, at which time any remaining splits from I the sample are discarded and laboratory analysis begins on the next sample. The observed subsample counts obtained during a sampling period are sub-I ject to two major sources of variation. The first is the variation in the discharge channel, which was shown in Chapter 4 to be adequately modeled by the negative binomial distribution. The second source is the varia-I tion among subsamples induced by the splitting of the samples. In order to utilize sample splitting in entrainment estimation, it is necessary to know how split or subsample ichthyoplankton counts are generated from sample counts. (This discussion of counts applies to a particular species-I life stage combination; overall entrainment estimates are obtained by sum-ming over these combinations.) In particular, it is important to know how much variation occurs among split counts within a given sample. A low I I
I 7.2 level of variation among splits leads to great improvements in the effi-I ciency of entrainment estimates. A high level of variation among splits has the opposite result. I To illustrate the key role of variation among splits in estimation proce-dures for three-stage sampling designs, two extreme situations will now be described. It can be virtually guaranteed that neither of these will be observed in practice, but the true situation could be close to one of I them. First, if among-splits variance is zero, which means that each split has exactly the same number of ichthyoplankton, then analyzing a single split gives full information about the entire sample, since the I exact sample count can be found by multiplying the split count by the number of splits in the sample. This would allow the cost of analysis to be cut greatly without any change in entrainment estimates or their estimated I variances. Second, if extreme clumping of organisms causes most splits from each sample to have organism counts of zero, so each sample's entrain-ment is concentrated in only a few of the splits created from it, then analyzing only some splits will give very uncertain information about the I whole sample. This is because the splits not processed could have either very high or very 1m*, counts, so the variability introduced by leaving some splits unprocessed would be quite large. I A model intermediate between these two extremes assumes thorough mixing of the sample before splitting. This means that any organism is allocated to I a split independently of all other organisms, with an equal likelihood of being placed in each split. Achieving this complete randomization of the sample is the goal of plankton splitting devices, such as the Pneumatic Mix-ing Rotational Depositing Plankton Splitter (EA, 1983a, p. 3-5). Conse-I quently, the treatment of three-stage sampling plans in this chapter will be based on this assumption of complete randomization of each sample, which produces split counts with uniform multinomial variation. This model for I variation due to splitting will be discussed in Section 7.1. A crucial question, therefore, is whether splitting devices achieve uniform I multinomial variation in split counts. More research on this question is needed, but available information leads to the conclusion that splits exhibiting uniform multinomial variation are very difficult, perhaps impos-sible, to obtain with currently used splitting devices. These devices pro-I duce splits with a greater level of variation among split counts than is cons"istent with the uniform multinomial model, suggesting that organism clumping may be present. This raises the possibility that variation among I splits may be great enough to offset the advantages of splitting. Until more is known about among-splits variation in ichthyoplankton entrainment counts, estimation based on sample splitting must be viewed with caution, I and the use of such methods as questionable. Several additional concerns can be raised about the use of sample split-ting in estimating entrainment. One is the potential for errors to occur I because of incorrect execution of the necessary laboratory procedures. The division of the sample into splits and the termination of split analy-sis at the correct time for each sample are steps at which mistakes can be I made. A second concern is the additional cost of the splitting of samples: the operation of splitting is labor intensive, additional record-keeping I I
I 7.3 I and quality control are needed, and so on. Finally, it is possible that other uses for the entrainment data, not addressed in this study, could I require data from whole samples rather than from the partial samples con-stituted by those splits that have been analyz~d. If the uncertainties concerning among-splits variation can be resolved, I then entrainment estimation techniques based on split counts are likely to provide greater efficiency than techniques based on whole-sample counts. If a splitting device is available that achieves uniform multinomial varia-* I tion among splits, sum-quota sampling plans will be much more efficient than whole-sample plans in cost and effort expended for a specified level of accuracy of estimation. Sum-quota sampling is an adaptive form of I split selection, in which splits from a given sample are selected in random order and analyzed until the total number of ichthyoplankton counted in the sample reaches a specified threshold. Any remaining splits from this sample are then discarded and processing of the next sample's splits is I begun. Sum-quota sampling plans have been used for entrainment data in some recent studies (EA, 1983a, p. 3-5). I The remainder of this chapter examines the use of sum-quota sampling in entrainment estimation. Ichthyoplankton counts from subsamples within a sample will be assumed to follow the uniform multinomial distribution, I which would be the consequence of a thorough uniform mixing of the sample before splitting. If it can be assured that such mixing is achieved, sum-quota sampling will be much more efficient than whole-sample analysis. We strongly recommend that when splitting devices are available that produce I uniform mixing of the sample, all sampling be done on a sum-quota basis. I 7.1 SAMPLE SPLITTING AND SUM-QUOTA SAMPLING: ESTIMATION FOR A SINGLE SAMPLE I We begin by considering the analysis of a single sample. This sample may be a single subunit drawn during a p.s.u. of the entrainment season, or it may be a composite sample formed by mixing several of these subunits together. The sample is split into L=8 subsamples of equal volume, called I subsamples or splits, for analysis. These are processed in random sequence until a minimum of Q=200 ichthyoplankton have been counted; that is, the processing sequence terminates with the tth subsample if the I total count in the t subsamples or splits is at least Q=200. The value Q=200 is used here because it was used in the subsampling done at Indian Point and Roseton. Of course, other values of the quota Q could be used. I Define I = the total count in the first! subsamples processed from the sample i, which may be either a subunit from a single p.s.u. or a composite of several ~u~h subunits. I I I
I 7.4 I In discussions of a single sample, l(£) will be written instead of I l.(£), for convenience and to avoid drawing attention away from £, the 1. number of subsamples processed from the sample. I It will be assumed that the mechanism used to split the sample provides a uniform mixing of the sample before splitting. This mixing results in a thorough randomization of the sample, allowing us to model the variation I in the subsamples with a uniform multinomial distribution. The rest of this paragraph is a formal statistical explanation of the appropriateness of this model for variation in the splits. It is included for complete-ness, and may be skipped by all but the reader interested in a mathematical I proof. For a sample divided into L splits, let Xl ,X 2"",X L denote the counts of organisms present in these splits (either total organisms or a I particular species-life stage combination). The randomization of the sample produced ~I the splitter, e.g., the Pneumatic Mixing Rotational Depositing Plankton Splitter (EA, 1983a, p. 3-51 should be independent of the pattern of organisms present in the sample. Thus Xl to XL are inde-I pendent non-negative integer-valued random variables, and the conditional distribution of X1 ,X 2""X L given the value of their sum Z(L)=EX; is a I multinomial with parameters EXi,l/L,l/L, .*. ,l/L. The total count Z(£) is composed of counts of many different species-life I stage combinations. These Z(£) organisms are sorted and counted by type, resulting in a vector of totals by type, (1(£;1),Z(£;2)Z(£;C))' whose I components sum to Z(£): I (7.1) I where Z(£;j) is the total count of the jth species-life stage combination observed in the £ processed subsamples. The objective of this subsampling I procedure is to es,timate the composite sample totals by type, namelYth (Z(L;1),Z(L;2)"",Z(L;C))' where Z(L;j) is the total count of the j species-life stage combination in the entire sample. If Z(L) < Q then I £=L and, aside from sorting and counting errors, this vector of total counts is then exactly measured. I I I I
I 7.5 I If ~L)>>Q then subsample (split) processing will terminate at some split I number 1<L, at a savings in laboratory cost as compared to the cost of processing all Z(L) organisms. This number n will not be known in advance since Z(L) is not known in advance; however, the amount of processing effort I is known in advance to be "Q plus overshoot." And this excess over Q has an expected value equal to approximately half of the average abundance per split, I
-(7.2)
I Since cost as measured by Z(t) is essentially predetermined under this I sampling scheme while sample size as measured by t is not fixed in advance, this sampling procedure has been referred to as "fixed cost sampl-I ing" to distinguish it from fixed-sample-size procedures (Pathak, 1976). Actual processing costs are not directly proportional to the total number
~t) of organisms processed, however, and the sampling procedure might be I better referred to as "sum-quota sampling" where Q is the predetermined quota imposed on the sample sum Z(t) of an o-bserved non-negative va-riable.
I Sum-quota sampling can be viewed as a generalization of conventional fixed-size sampling, where a sample of predetermined size is collected. In view of this fact it is, perhaps, not surprising that statistical esti-I mation formulas developed under the assumption of fixed-n sampling do carryover almost intact to the more general case of sum-quota or fixed-cost sampling. Pathak (1976) provided mathematical proof that conventional I mean and standard error formulas do, indeed, require no modification when applied to the pre-terminal data of a sum-quota sample (ignoring the data from the terminal sampling unit). He also demonstrated. however, that modifications are required if the data from the terminal sampling unit I are to be utilized in the estimation procedure. This need can be appre-ciated by noting that sum-quota sampling will tend to terminate with a sampling unit having a large value of the sum-quota variable. The data I from the terminal sampling unit are thus "size-biased" with respect to the sum-quota variable, and with respect to any other variables that are correlated with the sum-quota variable. As a case in point, the terminal I (ith) one-eighth subsample from a composite sample tends to have a larger-than-average total number of organisms (unless Z(8)<200) and hence I is also biased upward with respect to the dominant types in the composite. The modifications in the statistical analysis needed to ad~iust for the I effects of this size-bias are simple but do require use of the separate data from each of the t subsamples included in the sum-quota sample. There are :hr:e differe~t candidates for the estimate of Z(L)' which we will I denote by Z, Zpre' and Zadj' I
I 7.6 I Z is simply the usual estimate of the total count in the L subsamples (splits) based on observing the counts in only i subsamp1es: I
.:,L = IL x( total count in the i subsamp1es) (7.3) I I
~ Z pre is obtained by the same calculation except that the count in the h last (it) subsample analyzed is ignored: I zpre = _L_ x (total count in the first t-l subsamp1es) t-l {7. 4) I L
= -t-l Z( t- 1) .
I This estimator is an unbiased estimator of Z(L); the average value of the I estimator is exactly equal to the target value Z(L)' I
~
E(Zpre) = Z(L) . (7.5) I ~ ~ Zpre will, however, have a larger variance than Z, since it is based on I fewer observations (1-1 rather than t observations). Thus, in choosing A A between Z and Zpre' we must choose between an unbiased estimator with a I larger variance and a biased estimator with a smaller variance. A A A ZadJ" is an attempt to combine the good pOints of both Zpre and Z. The A I adjustment algorithm exploits the unbiasedness of Zpre by averaging over all p()ssib1e pre-terminal estimates that could have resulted from dif- I ferent orderings of the observed subsamp1e counts. More precisely, if we let Xj denote the total count in the jth subsample, the adjusted estimator is obtained by avera9ing all possible pre-terminal estimates [L/(i.-l)][Z ( )-X J ] I that can be formed using observed counts Xj that exceed Z(l)-Q, that is, 1 . all observed Xj that could possibly have been the terminal observation. I I I
I 7.7 I (To be a possible terminal observation, a count Xj must exceed Z(t)-Q. This is because if it does not, then the sum of the preceding 1-1 obser-I vations must equal or exceed the quota Q, which would result in the ces-sation of counting before the 1th subsample. For instance, with Q=200 and L=8, sampling would terminate after the sequence of split counts 20, I 52, 46, 39, 25, 49, saving the cost of processing two splits. Then 1=6 and Z(.g,)=231. Neither X1=20 nor XS=2S is a possible terminal count; for example, sampling could not end with an observation of 20 that results in I Z(6)=231, since this would imply Z(S)=211, i.e., the sum of the first five observations would exceed the quota Q=200.) I Letting Xt denote the average of all split counts that are greater than I Z(1)-Q, then the formula for the adjusted, unbiased estimator of Z(L) becomes: I .Z d" L(Z(.q,) - Xt ) a (7.6) a J t - 1 I . If t=l then no adjustment is made and ~adj=LZ(l); likewise t if 1>1 and I all t observations exceed Z(t)-Q then Xt=Z(t)/t and Zadj=LZ(t)/1. An outcome in which none of the t observations exceedAZ(t)-Q can occur only I with 1=L, and then Z(L) is not to be adjusted, so Zadj=Z(L)' A small numerical example will serve both to illustrate the size-bias of I the terminal observation and to demonstrate the adjustment needed for esti-mating the total count. In order to reduce the arithmetic we consider a 3-way split instead of an 8-way split. Suppose we have set a quota of I Q=8 and, unknown to us, the numbers of organisms in the L=3 subsamples are 1,3, and S, respectively, so the estimation target is Z(L)=9. I For this simple example, the possible outcomes of a sum-quota sampling strategy are:
. " A I Subsample Z
(usual Zpre (pre-terminal Zadj (adjusted Sequence estimate) estimate) estimate) I 1,3,5 9 6 7.5 3,1,5 9 6 7.5 I 1,5,3 5,1,3 3,5 9 9 12 9 9 9 12 7.5 7.5 5,3 12 15 12 I Average 10 9 9 I
I 7.8 I Each of the possible sample sequen:es is equally likely to be realized, and on average the usual estimate Z exceeds the true value of Z(L)=9. The I adjusted estimate Zadj' which averages to exactly the true value, may at first glance appear strange, since in four of the six possible outcomes this estimate of the total number of organisms is actually less than the I observed number! For example, in thE! first subsample sequence (1,3,5), either 3 or 5 could I have been the term'inal observation, but 1 could not have been. Thus Xt =(3+5)/2=4, and I (7.7) I When ~=L, the adjusted estimate will be less than the known total count I Z(L) unless all L observations exceed Z(L)-Q. A rational justification for deliberately committing such a known error of underestimation in a sample I is the concomitant knowledge that overestimation will by chance occur in other samples to compensate in the long run; indeed, if the adjustment algorithm is not strictly adhered to, the laboratory will, in the long run, overestimate abundance in the samples when using the sample splitting I strategy. Since annual entrainment totals are the ultimate estimation target and since these are compiled from the counts in numerous samples, there is compelling reason to apply mean-unbiased estimation methods to I the individual samples. In the above numerical example showing the deliberate adjustment from a A I known value of Z(L)=9 down to an estimate of Zadj=7.5, the concomitant knowledge of the chance for compensating overestimation is also plainly revealed in the remaining rows of the table. With all L subsample counts I (1,3,5) in hand, the remainder of the table could be constructed after the fact, showing the exact chance and the exact amount of overestimation that might have occurred and that, we must assume, will occur among some I of the other samples counted incompletely by the split sample, sum-quota method. I Therefore, in deciding among the three estimators, we must assess the amount of bias that may be present in our situation and must weigh this against the amount of variance reduction that is possible. Since these quantities depend on the unknown parameters of the problem, we must exa-I mine a range of likely values of these parameters. Using both theoretical calculations and Monte Carlo simulations, we can arrive at a reasonable choice. I I u
I 7.9 I 7.2 VARIANCE AND BIAS IN SUBSAMPLING ESTIMATION I From the three candidate estimators Z,' Zpre' and Zadj! we must.choose the one that is.. optimal for our purposes. We- know that Zpre and Zad'J are I unbiased, Z is biased, and I Variance(Z) < variance(Zpre) (7.8) I I .. .. however, the variance of Z and Zadj are not directly comparable. In mak-ing a practical decision as to which estimator is best, we must consider, I among other things, I i) The magnitude of the bias of Z ii) The magnitude of variance reduction possible iii) Ease of calculation. I Such comparisons could be carried out on a theoretical level, but the expressions involved are quite complicated, making direct c:omparisons I .. extremely difficult. For example, if we calculate Z based on i out of L possible subsamples, and the underlying distribution is Poisson(A), then I (see Appendix V) the expected- value of Zis given by E(Z) = A~ + L I Q1 e-(i-l)A[(i_l)AJ - ] (7.9) I l i=2 i(Q-l)! I where Q is the chosen quota. Variance calculations are even more involved, so rather than attempt theoretical comparisons, it was decided that a more fruitful attack would be through a simulation study. I 7.2.1 OUTLINE OF SUM-QUOTA I ESTIMATION SIMULATION STUDY 7.2.1.1 Poisson(A) distribution I Fix A and calculate the vectors f and E giving the Poisson pmf (probability mass function) and cdf (cumulative distribution function) respectively: I I
I 7.10 I f = [f(0),f(1),f(2), ... ,f(R)] I F = [F(0),F(1),F(2)"."F(R)] I where R is the first integer making P[Poisson(A) variable 5 R] greater than or equal to .99999: I F(R) ~ .99999 > F(R-1); I then round F to 5 decimal places. I 7.2.1.2. Poisson sample Generate L=8 uniform random numbers Ul'""UL'0 ~ Ui ~ .99999; using the I Poisson(A) cdf F, calculate the vector ~=(X1,***,XL) such that II F(X -1) < Uj ~ F(X j ); j I compute ZL=X 1+ .*. +XL" I 7.2.1.3. Finite population sampling I Calculate M=1000 random permutations of the vector X. For each permu-tation compute ~=(Z(1)'Z(2)"",Z(L)) and I compute 1. such that Z(1._1) < Q 5 Z(1.)' with 1.=L if Z(L) < Q I
~
compute Z = LZ( 1.)/1. I compute X1.' with x~ =0 if 1.=L and i L) < Q I A compute Zpre = L(Z(1.)-X1.)/(1-1) = LZ(1._1)/(1-1) I I I
I I 7.11 order the vector (X~, ... ,X ) and compute the number k of XJ,'S I exceeding
.I.
Z(~)-Q; R. I if k=O set 0=0 and Xk=O; otherwise set 0=1 and set Xk equal to the mean of these k Xj's that satisfy Xj > Z(R.)-Q I I I compute the unit vectors O=[O(O), *** ,O(L)] and E=[E(1), **. ,E(L)], where .. - I O( i)= 1 for i=k and E(i) = 1 for i d. o 1 for i;'k 1o for i ;1. I compute the unit vector W=[W(O), *** W(K)] where I W(i) :t 11 if i=X'1. o otherwise I For the M permutations of ~ calculate the mean and the covariance matrix I (dividing by M-l) of I I and the mean of I C=(O,E,W) I 7.2.1.4 Replication Repeat the sampling *of Sections 7.2.1.2 and 7.2.1.3 10*M times, and compute I the mean and covariance matrix of Y and the mean of ~. I I
I 7.2.2 RESULTS OF THE SIMULATION STUDY 7.12 I Simulation according to the preceding program consisted of generating pseudorancbn data X ,X , ... ,X 8 representing the number of ichthyoplankton I 1 2 in each one-eighth portion of a sample, and then accumulating the counts sequentially: I I I until the cumulative count first reaches or exceeds the sum-quota Q=200 (or until all 8 portions have been counted in cases where the grand total I Z(8)=X 1+X 2+X 3+... +X 8 is less than 200). The total count Z(8) is always known in this artificial situation, so the estimates of Z (8) obtained by I simulated sum-quota sampling can be compared to the actual Z(8) in order to assess bias and variance of the sampling error in the estimates. I This computer-based experiment was undertaken to provide empirical veri-fication of various theoretical properties of the sum-quota sampling pro-cedure and, in pal'"ticular, to determine the adequacy of certain theoretical I ~pproximations concerning bias and variance. The bias and sampling vari-I A - ance of the unadjusted estimator Z=8X are of special concern, first, to ascertain whether the bias is negligible and, 'second, to determine whether its sampling variance is less than that of the adjusted unbiased estima-tor. Both the record keeping and the calculations required for the bias adjustment are a nuisance to be avoided if possible. I As an illustration of the simulator output (Table 7.1) we consider a case in which Z(8) is Poisson distributed with a mean value of 400, so I X1 ,X 2 , *.. ,X 8 are independent Poisson variablesAea~h having expected value A=50. Total variance of an estimator such as Z=8X is then comprised of A I two components corresponding to the sampling error, Z-Z(8)' and the Poisson error, Z(8)-400, respectively: I Z - 400 = (Z Z(8)) + (Z(8) - 400) (7.10) I Because these two components of the estimation error are uncorrelated, the variance of the estimation error is the sum of the two variance com-I ponents: the sampling error variance of the sum-quota Z as an estimate of I I
I 7.13 I I Table 7.1 I ENTER Q 5IM Example of Output from Simulation Study 0: I ENTER 0: 200 L 8 I e:1'ITEj;: L.AMDA 0: 50 ENTER NUMBER OF PERMUTATIONS I 0: 100 ENTER NUMBER OF H VECTORS 0: I 100 ENTER NUMBER OF REPL.ICATES 0: 100 I THE FREQUENCY DISTRIBUTION OF 0 1 2 3 K IS 4 5 6 7 8 0.0000 0.0318 0.0398 0.0:563 0.588:5 0.2828 0.0008 0.0000 0.0000 I THE FREQUENCY DISTRIBUTION OF I.N IS I 1 0.0000 2 0.0000 3 0.0000 4 0.487:5 5 0.5096 6 0.0029 7 0.0000 8 0.0000 I Q: Q is the preassigned quota for the total count; counting of suc-cessive subsamples. each a one-eighth portion of the sample. con-tinues until the cumulative count first equals or exceeds the quota Q. I L: L-S is the number of subsamples into which the sample is split. LAMDA: A is the expected count per subsample. I # PERMUTATIONS: A sample of LaS counts is to be simulated by generating a vector
~=(X1'X2""'XS) of independent Poisson counts from a Poisson distribution with parameter A. The sum-quota sampling rule is to I # X VECTORS:
be applied to 100 independent permutations of this vector.
= NUMBER OF REPLICATES = number of simulated X-vectors.
K is the number of terminal candidates in a sum-quota sample. I LN is the number of observations in a (sum-quota) sample. XLN is the terminal observation in the sample. I I I
I 7.14 I Table 7.1 (continued) I T"'E MEAN ZL OF" T"'E ,. l.N VECTOl"o IS Z1 :.:l.I-I 51.4194 Z2 K ::E<A": Z3 I 399.2100 4.5154 401.4229 398.9238 4.0531 51.3766 399.0352 THE COVAfO:IANCE MATI"oI:{ OF 'rM ZL loN I1 IS
,,:l.N Z2 K :-:K E<AfO: Z3 I
425.0161 -6.2199 431.7611 54.5079 430.5687 -6.2727 ~5.1665 428.7786
-.0.2199 431. 7611 54.5079 430.5687 0.0982 -6.3536 -0.8044 -6.3284 -6.3536 443.8540 57.5881 439.3045 -0.8044 57.5881 8.3268 55.1320 -6.3284 439.3045 55.1320 438.9545 0.0991 -6.7051 -1.0060 -6.3642 -0.8088 57.9~58 8.1377 56.0778 -6.3167 438.3728 55.6357 436.3895 I -6.2727 0.0991 -6.7051 -1.0060 -6.3642 0.1629 -1.0314 -6.3156 55.1665 428.7786 0.0000 -0.8088 -6.3167 0.0000 57.9258 438.3728 0.0000 8.1377 55.6357 0.0000 56.0778 436.3895 0.0000 -1.0314 -6.3156 0.0000 8.2:!54 55.8\?!38 0.0000 55.8988 435.5521.
0.0000 I T ... E .... EAI*' COVAfO:IANCE OF l.N .li
-4.3882 .,. iIIITHP"
- l.N
!.: VECTO":S -eo Z2 1:"..,..,..,
IS K
-0.0752
- KE<A'"
0.0778 Z3
-5.6336 D
0.0000. I 0.1599 0.0526 ~ *
- J#~_
-4.3882 312.3722 37.2248 315.1348 -3.1208 35.8901 318.6008 0.0000 0.0526 -5.57:2 -0.0752 37.2248 315.1348 -3.1208 44.6872 -57.2185 -2.2721 -57 . 2185 545.6664 0.6180 -2.~721 0.6180 0.6299 12.2368 20.4329 -2.2979 2::!.157~
352.7891 0.6689 0.0000 0.0000 0.0000 I 0.0778 35.8901 12.2368 20.4329 -:.2979 12.2389 20.4874 0.0000
-5.6336 0.0000 318,6008 0.0000 22.1572 0.0000 352.7891 0.0000 0.6689 0.0000 20.4874 0.0000 357.0124 0.0000 0.0000 0.0000 I THE MEAI'IS OF Zl, "b. AND Z3 FO": loN APE .1 2 3 4 l.N 5 6 7 8 I Zt 0.0000 0.0000 0.0000 422.7635 381.4606 321.8391 O. \)000 0.000 Z2 0.0000 0.0000 0.0000 42::!.5739 376.7943 376.8528 311.9448 308. '23-15
- 0. *:*(*\)0 .). ')0('
0 .. I Z3 0.0000 0.0000 0.0000 422.7635 0. C'O,)*) ~)G',; I I The V-vector is a vector of summary statistics calculated for each permutation of each X-vector. ZL is the total of the L=8 observations in the X-vector. XBAR=XKBAR is the mean of the k terminal candidates. 0 may be ignored. For each X-vector the mean and covariance of the V-vector were computed first over I permutations within X-vectors and then among X-vectors. The latter is called the covariance matrix of YM, where YM is the mean for a fi xE~d X-vector. I Output variable: ZL LN Z1 XLN Z2 K XBAR Z3 Equivale!nt term I in text: Z k I I
I 7.15 I the actual total Z(8)' and the Poisson variance of Z(8) about its average I value 400: I Va r (Z - 400) = Va r (Z - Z(8>>) + Va r (Z (8) - 400) * (7.11) I Since Poisson variance is equal to the Poisson mean, the second component of variance is LA=8(SO)=400. An approximation to the sum-quota sampling error variance component is given by I Va r( Z - Z( 8) : L 2(A 1) (U.) 2 Q + AI 2 - r A = ~ - LA (7.12) I
= 200(400}2 _ 400 = 711 - 400 = 311 I + 50/2 A
which is very close to the mean of the empirical variances of Z, 312.3722, I ,. listed for ZI=Z in the mean covariance matrix. The empirical value of the Poisson variance of ZL=Z(8) is the leading entry in lithe covariance I matrix of YM,II namely, 425.0161 as compared to the expected value of 400. Such a discrepancy between an empirical variance and a theoretical vari-ance is well within the range of chance variation. I .. The expected value for Z is approximately I E(Z) : LA(1 + 0:1) = 400(1 + 1~9) = 402 (7.13) I ... as compared to the mean of all the 10,000 simulated values of Z1=Z listed in lithe mean of the Y vector as 401. 4229. At 0=200 the approximate bi as ll I is
= 1~0% = 0.5%
I percent bias (7.14) I and the empirical bias is even smaller. Interestingly, this is a situa-tion where the empirical bias is more relevant than the theoretical bias because the... latter is due primarily to extremely - large but extremely rare I values of Z that simply do not occur in 10,000 trials due to their extremely small probability of occurrence. I I
I 7.16 I A hint of this phenomenon is found in the last.lines of output of simula-tion in Table 7.1, where the 10,000 values of Z=Z1 were sorted by 2., the I number of one-eighth portions needed to reach
. the sum-quota of Q=200. The line labeled Zl shows the average value of Z within each sample size class that is represented among the 10,000 trials (with zero printed for I
empty sample size classes). Since
- 2 : ... + \)
I Z = Zl = LX .* LC1 + X (7.15) I
= L (!~ + exce~s over Q)
I
= 8 (-200 +2.excess)
I and since the amount of excess over 200 is only weakly correlated with l, then it is clear that as t increases the class mean must decrease or, equivalently, if 2. is small then the class mean must be large. Only the I chance combination of several unduly large counts can produce a small 2. and hence an unduly large X, and it is the very rare combinations (not even seen in the 10,000 trials) that mainly contribute the 0.5% positive I bias in Z. The small empirical bias in Z found in this simulation experiment and I numerous other simulation experiments at different parameter settings is of negligible magnitude and does not detract from the advantages of this
- A I
estimator over the exactly unbiased estimators Zpre =Z2 and Zadj=Z3. Moreover, the empirical evidence consistently shows that the sampling error variance of Z is smaller than that of the adjusted estimator Z3, as well I as that of 72. Thus, Z is seen to have the advantage of better perform-ance as well as greater simplicity. I The precision of this estimator may be measured by its coefficient of variation (CV) or relative standard error I CV = standard error mean I whlCh in the present context becomes I CV = j Var(Z) (7.16) u. I I
I 7.17 I If the numerator includes both components of variance, the sampling I error variance var(Z-Z(8)) and the Poisson 'variance component Var(Z(8)-Ll)=Ll, then the theoretical approximation of the CV becomes I CV = 1 1
= j 200 + 50/2 = .06667 (7.17)
J 0 + 1../2 I The empirical estimate derived from this simulation is I )312.3722 + 425.0161 = .06765 I 401.4229 7.3 VARIANCE OF THE ESTIMATE I OF ENTRAINMENT ABUNDANCE The results of the simulation study discussed in Section.. 7.2 show that the I variance-bias trade-off in subsamplin~ is favorable to Z = (L/t)Z(t)' To calculate the overall variance of T for the entrainment season, how-I ever, we must also take into account the negative binomial variation in the discharge channel. As in Chapter 5, T represents total entrainment over N sampling periods, T. represents entrainment over sampling period i, and I '" A 1 T and Ti represent estimates of these. A sampling period may be either a single p.s.u. or a composite of several p.s.u.ls, composition making the most practical sense for p.s.u. IS in which entrainment is very low and the corres-I ponding counts are small. Sum-quota (0) Poisson subsampling of a negative binomial(K,p) sample I achieves a coefficient of variation of approximately I CV: )1K 10
+ (7.18)
I in the estimation of I Ui = number of organisms per volume vi I '" by the estimator Z for sampling period i I ~i = Zi = Zi(L)' I
I 7.18 INhere Vi is the volume sampled in sampling period i. With subsampling I of the L=8 splits of the sample whose volume is vi' the number of splits sorted and identified is l i . There is a subsampling error component in I the estimate I
].Ii = Z., = Z,. + (Z. - Z.),
I where Z. denotes the (unknown) complete sample count of all L splits while I Z.
= (LI 1. ]I Z. ( . ) " R. I is based only on the observed sum of 1. counts. Random splitting induces , 2 2 Poisson variance among the subsamples (Si ~ Zi/l and G i ~ Pill), hence I
Var(Z.-Z.) ~ - - ' ].I. l-l. l
= -].I. - ].I,'.
I
" 1 1 *' 1*, ,
I When n. is based on sum-quota sampling, we have (l./L}p. - Q I 1 1 I and hence I Var(Z.-Z.) , , I Adding this subsampling variance component to the sampling variance component I I gives I I I
I I ,. Note that Var(Z~ = Var(Zi) when Q = ~i* I The estimated entrainment for sampling period; is Ti=Vi~i/vi' where I , V. is the total volume discharged in that sampling period. This estimator has variance I (7.19) I If sampling is carried out over N sampling periods, then the estimate of I total entrainment is obtained by summing over these N periods, which gives I .. T = N .. L T; (7.20)
. ; =1 I A Var(T) -
t 2 2 2(1 1) 2(1 1)
~ (Vi/vi)u; K+ Q = ~t Ti K+ Q (7.21)
I and I (7.22) I where I I [CV(T.)]21 1[E T~
= N . , _N(1NE.T.)2]/ , 1 , ~1 r N1 .T.]2 , (7.23)
I I ~ote that if there is no variation among the N sampling periods in discharge channel flow rate (V.=V for each ;) over this period then CV(T.)=CV(~./Vl*).
. , 1 1 I If, in addition, the sampling gear inta~e rate is the same for all sampling periods (vi=v for each i), this reduces to CV(Ti)=CV(~i). If the latter condi~ions are satisfied, so that T=(V/v)~~i and I
I
I 7.20 I U*1 I then I N2 I LU'
. 1 1
N )2 = 1.N (1 1) (1 K + Q + [CV (ui 2
)] ') (7.24)
I ( >u.1
~
1 I The estimated vari an,:e of T ; s gi ven by I I Var(T) (7025) I The estimated coefficient of variation of T is given by I I (7.26) I I where [CV(~i)J2 is obtained by substituting ~i for Ti throughout the equation (7.23). ThE!se formulas are closely related to those in Section 5.7, where subsampling is not performed. For example, when l/u.1 is replaced I by l/Q and c 2 by 11K in the approximate expression for CV(i) in (5.49), the result (after some routine algebra) is (7.22). I The effect of th~ sample volume v on CV(T) under sum-quota sampling is seen to be reflected mainly through Q, the sum-quota invoked in the sub-I sampling process. This means that if the sum-quota Q almost always causes laboratory analysis to terminate long before the entire sample has I I
I 7.21 I been processed, then increasing v will have little or no effect on the accuracy of the estimation of T; as long as Q remains fixed, a larger I v will result only in reaching Q after processing a smaller fraction of the sample. Thus it is appropriate to view the sum-quota Q as an implicit function of the sample volume v, with the choice of the sum-quota Q(v} based on the value of v. I 7.4 THE ESTIMATION OF ENTRAINMENT I REDUCTION UNDER SUM-QUOTA SAMPLING The analysis of Section 5.8, in which the estimation of entrainment I reduction under fixed-sample-size sampling was discussed, can be extended to the case of sum-quota sampling. If subsamp1ing A is performed, A theA treat-ment of Section 5.8 remains valid after the term 1/~ = l/~i = Vi/viTiA I in (S.49), (S.S1), (S.S2), (5.S4), and (5.55) is replaced by l/Q and c2 A is replaced by I/K, giving I I I 7.5 OPTIMIZATION OF SUM-QUOTA SAMPLING I In this section we describe some of the operating characteristics of sum-quota sampling, and compare them to those of fixed-samp1e-size sampling, that is, whole-sample plans. Furthermore, we describe some techniques to optimize the sum-quota sampling plan in which the sum-quota Q = 200 and I the number of splits per sample L = 8; these values were used in a 1982 study at Bowline Point (EA, 1983a, p. 3-5). I Sum-quota strategies that have been used impose a quota of Q=200 on the total number of ichthyoplankton counted, irrespective of taxon and life I stage. Such a strategy can be optimized, however, by imposing double-sampling quotas, with larger quotas on counting and sorting of life stages and smaller quotas on species identification. To illustrate the potential I savings, we consider here the operating characteristics of a sum-quota procedure for estimating the relative numbers in a mixture of two types of organisms. (An example might be the sorting of counted eggs into those I of species A and those not of species A.) I I I
-------~~~-~~~~~~~
I 7.22 I The procedure considered here is to continue sorting into two piles, called the "success" pile and the "failure" pile, until either the number in the success pi 1e reach!s the predetermi ned quota Os or the number in the I fa il ure pil e reaChE!S the predetermi ned quota OF' Sorti ng termi na tes as soon as one of these quotas is attained, so the total number examined under I this strategy can be at most 0S+OF' The total number available for sort-ing, say U, is presumed known and may represent the outcome U=O+excess of a previous stage of sum-quota sampling. The objective of the present I stage is to estimate the numbers TS=UPS and TF=U(l-PS) by estimating the fracti on Ps of typ1e S. I Sorting at this stage might proceed in sequential batches, where a "batch" might represent one portion of a split 'sample, or the number I observable in one field on a microscope slide. If such batch sorting is practical then sorting may terminate with an excess over the quota; i.e., at termination the sample counts ts and tF will satisfy either ts < OS' I tF ~ OF or ts ~ QS' tF < QF (unless the last batch happened to bring both counts over their respective quotas or unless U < Os + OF and TS < OS' I TF < OF' in which case the entire lot would be sorted giving ts < OS' tF < OF' ts + tF=U). I The graph in Figu)~e 7.1 illustrates the operating characteristics of such a procedure, exprl~ssed in terms of the standard error of the estimator A I Ps = tS/(t s + t F) and the expected value of the sample size ts + t F. Batch size of 1 was assumed in these calculations, and other nuisance com-plications were removed by assuming TS > Os and TF > OF so at termination I either tS=Q S and tF < OF or ts < Os and tF=QF' The probability distribu-tion of the outcome (tS,t F) is then a mixture of inverse hypergeometric I distributions with parameters (TS,T F) and (OS,OF) or (if U>>QS+QF) negative binomials with parameters pS and (OS,QF)' Operating characteristics I were based on the latter, I I I I
*t
I 7.23 I I Figure 7.1: Operating Characteristics of Sl.lltI-Qoota San;lling I in Sorting to Estiltlate the Pl:oportion(p) of Organislrls of a Given Typ: (Graph is synmatric al:lout p ::0 1/2 = so t:ercent) I 400 FIXED S>>1PLE SIZE
- 400
/
I /
/
I E X p Os* ~ ". 200 ---> 4/
/ /
I E C T E 300
/
0 /' \ I S A
/ / \~ \ %*
100, OF = 200 M / \ I P L E /'
/ \ \ \ /' \
I ./ S \
\
I ;I' \ z /' ,, E /' ,,
/'
.1 200 I I I 100 I ~------+-------+-------~------~-----+-- o 10 20 30 40 SO TRUE P (PERCENl') I I I
I 7.24 Figure 7.1 (continued) I I I I I I 5 004 ~ = % = 100 T A ~.
',."PI' ..".~~~ .... -- .......
I N ." u = 200 D A 003 /
/'
I R ,I' D Os = QF = 200 I E 002 u = 400 R R 0 I R 0
.01 I F
I P 0 10 20 30 40 50 I TRUE p (PERCENI') I I I I I
I 7.25 I I I (7.27) I F Q + ts - 1) ps ts (l- PS) QF if t I ( t S
< Q and t =Q S F F s
I Five different scenarios are depicted in Figure 7.1, including two fixed sample size procedures and three sum-quota procedures, two with QS=Q F I and one with QF=2Q S' The fixed sample sizes have been fixed at u=QS+Q F, the maximum possible sample size, tS+tE~QS+QF=u, of the slim-quota proce-I dure, so the standard error/ Ps(1-ps)/u of the fixed sample size esti-mator tS/u constitutes a lower bound on the standard error of the sum-I quota estimator while QS+Q F constitutes an upper bound on the expected sample size of the sum-quota procedure. I ... In the case QS=Q F the standard error of pS is largest when the two types are present in the ratio 1 to 2 (or 2 to 1), i.e., when either PS=1/3 or I PS=2/3 (because the graph is symmetric about PS=1/2, only the left half of the graph is shown in this figure). The standard error in this case is ,I for Ps < i (7.28 ) I which is seen to have a maximum at PS=1/3. Expected sample size increases I from QF at PS=O to QS+Q F at PS=1/2 (and then decreases to QS=Q F at PS=l). I Prior information (from previous samples) can be exploitee, in choosing Qs and QF' If prior indications are that PS<1/3 then the choice QF=2Q S I is appropriate. I I
I I I 8. ENTRAINMENT ESTIMATION UNDER NONCONTINUOUS SAMPLING PLANS I
8.0 INTRODUCTION
The decision not to sample the discharge flow on some days in the entrain-I ment season incurs a loss of accuracy in the estimates of entrainment abundance. This choice should not be made without careful consideration of the consequences, because the failure to collect data at certain times I is irrevokab1e. If noncontinuous sampling is judged to be preferable because of cost and other practical limitations, care should be taken to choose a level and pattern of daily sampling that will minimize the con-comitant information loss. I This chapter treats the analysis of noncontinuous sampling plans. Plans of this type present more difficulties than continuous sampling plans. I To obtain unbiased entrainment estimates from noncontinuous sampling, more assumptions are needed than for continuous sampling. I 8.1 PRELIMINARY CONSIDERATIONS AND AN EXAMPLE I Any departure from continuous sampling at a constant rate may be viewed as adding an amount of random error or "noise ll to the estimation process. This decreases the accuracy of estimation. Changes of any kind in the I sampling design may be compared to a standard consisting of a fixed level of continuous sampling effort. I Introducing intermittent rather than continuous sampling decreases the accuracy of estimation. It will be shown later that when sampling is inter-mittent, the estimated variance of T, the estimated seasonal entrainment, I increases by a factor of at least m/h, where m is the number of days in the entrainment season and h is the number of sampling days, over the estimated A variance of T under continuous sampling. The key issues are: I (1) to quantify the bias in estimates of entrainment abundance and to identify conditions and assumptions that result in unbiased estimates, I (2) to quantify the increase in the variance of estimated entrainment abundance, allowing the comparison of various patterns of missing p.s.u.ls, and I (3) to determine how to arrange the days on which sampling is omitted so as to minimize the problems of bias and increased variance for esti-I mates of entrainment abundance. To demonstrate the application of these concepts, we now consider a hypo-I thetical example. Its conditions represent an oversimplification of I
I B.2 actual entrainment conditions at the three power plants; nevertheless, I the example provides an informative view of the difficulties involved. Consider a sampling season of m days, on each of which we can estimate A I the entrainment T(j), j=l, ... , m, by an unbiased estimator T(j) with variance cr 2. Continuous sampling would result in an unbiased estimator of the total entrainment T = IT(j) given by I A T = I m A T(j),. (B. 1) I j=l which has variance of mcr 2. I Now assume that r out of the m days are omitted from the sampling schedule, and that these occur in groups of g consecutive days. For example, if r = 24 and g = 6, then four gaps of six consecutive days occur within the sampling season. To estimate T(j) for days on which no sampling occurs, linear interpolation is used. For instance, if days 1 and 5 are sampled I but days 2, 3, and 4 are not, then I T(2) = 3T(1)/4 + T(5)/4,
" A "
T(3) = 2T(1)/4 + 2T(5)/4, (B.2) I T(4) = T(I)/4 + 3T(5)/4. A " I We now make I Assumption B.1. During each gap in the sampling process, there is a linear trend in the daily entrainment T(j). In other words, if day k is I' sampled and is then followed by a gap of g days, making day k+g+l the next day in the season on which sampling occurs, then the change in daily entrainment from one day to the next is the same throughout this time I peri od: T(k+1) - T(k) = T(k+2) - T(k+l) = .... = T(k+g+1) - T(k+g), (B.3) I An equivalent way of expressing this relationship is I T(k+j) = T(k) + gi 1[T(k+g+1) - T(k)] for j=O,l, ... ,g,g+l. (8.4) I I I
I 8.3 I We also assume that there are at least two consecutive sampled days between any pair of gaps. (This is solely for convenience; if only one I sampled day were to separate a pair of sampling gaps, then an even greater loss of precision would result than the loss when gaps are separated by at least two sampled days. The computations are easily done, but would complicate the example now being presented.) I A A Let T(r,g) denote the estimator T when r days are omitted from the I sampling schedule in groups of g consecutive days. That is, T(r,g) is calculated from (8.1), where linear interpolation is employed to calculate T(j) for each unsampled day, using the closest sampled days I before and ,. after day j. When Assumption 8.1 holds, it is routine to show that T(r,g) is unbiased and that I Var[T(r,g)] = [m + r(g + 2)/2]0 2* (8.5) I 2 Thus each single day missed adds an extra 1.50 to the variance of T, each weekend or other two-day gap adds an extra 40 2, and each six-day I gap adds 240 2. Clearly several small gaps are preferable to a single large one. In fact, if the total number of missed days r is fixed in advance, it is an optimal policy to arrange that all gaps should be one I day in length. The effects of various combinations of rand g when m = 80 are shown in Table 8.1. When r is fixed, notice the heavy penalty for increasing the value of g. I ,. Without Assumption 8.1, the estimator T(r,g) is not unbiased. No unbiased estimator of T can be found without either (i) an assumption I about how daily entrainment T(j) behaves on days when no sampling is done, such as Assumption 8.1, or (ii) random selection of the days on which sampling occurs. If the sampling schedule is determined systematically, rather than randomly, and we do not know the relationship between T(j) I on unsampled days and T(j) on sampled days, then we have no basis on which to calculate estimates T(j) for unsampled days. In a sampling gap I of six days, or even two or one, there could be a sharp spike, an "inverted spike" (a drop to near zero followed by a return to higher levels), or some other pattern substantially different from daily entrainment T(j) immediately before and after the gap; neither linear I interpolation nor any other method can be expected to perform satis-factorily over the range of possibilities. I I I I
I 8.4 I Table 8.1 I A Variance of T(r,g) when m = 80 I m = number of days in sampling season g = length of gap = number of consecutive days in each gap I r = total number of days omitted from sampling schedule I 2 2 Variances are written as multiples of 0 : table entry 80 means 800 , etc. 9 I 1 2 3 5 6 0 80 80 80 4 80 80 80 I 12 98 104 110 116 122 128 I r 24 116 128 140 152 164 176 36 134 152 170 188 206 224 I 48 152 176 200 224 248 272 I I I I I I I I I
I 8.5 I 8.2 ENTRAINMENT ESTIMATION UNDER NONCONTINUOUS SAMPLING I The example of Section 8.1 can be generalized to the entrainment sampling situation. Consider an entrainment season of m days, of which hare sampling days. A sampling day is defined to be a day on which samples I are drawn from the discharge channel during 24 consecutive one-hour p.s.u.'s. No sampling takes place on the remaining m - h days of the entrainment season. These unsampled days may form gaps of several differ'ent lengths. I Let dj denote the day of the season on which the jth sampling day occurred, for example, d2 = 5 if the second day of sampling was on day 5 of the season. On I each of the h sampling days, the daily entrainment T(d j ), j=l, *** ,h, can be estimated by I = 24 24 L (V./v.)X. = L R.X. (8.6) i=l 1 1 1 i=l 1 1 I where i indexes the 24 one-hour p.s.u.'s in the jth sampling day and I " Vi' vi' Xi' and Ri are as defined in Chapter 5. The estimate T(d j ) is unbiased for the true daily entrainment T(d j ). ~sing the continuous sampling I methods of Chapter 5, the estimated variance of T(d j ) can be found tal be I = r 24 2 i=l "2 "2 R.X.(l + c X.)/(l + c ) 1 1 1 (8.7) I if an estimate of c2 can be obtained from replicate sampling. Otherwise take c 2 = 1, giving I (8.8) I For the m-h days on which sampling does not occur, linear interpolation I between the closest preceding day and the-closest succeeding day on which sampling is performed is used to estimate T(j). If day k is sampled and is followed by a gap of g days, making day k+g+1 the next sampling day, I this interpolation gives I T(k+j) = T(k) + g1 1[T(k+g+1) - T(k)] for j=l, ... ,g. (8.9) I We now give formulas for estimating the total seasonal entrainment T. Let j=l, *** ,h index the h sampling days, so j=3 represents the third day on which sampling took- place, regardless of which day in the season this I
I B.6 was. Define for j=I, ... ,h I dj = t he cay I af t e h season on WhlC* h t he J. th samp 11ng
. day occurre d , I I
Wj (d j +1 - dj _1 )/2 for j=2, ... ,h-l (B.I0) I I For example, if days 1, B, 15, 16, and IB of the season are the first I five sampling days, then d1=1, d2=B, d =15, d4=16, and d =IB. The 3 weights Wj are given by 5 I wI = (1 + B - 1)/2 = 4, I W = 2 (15 1)/2 = 7, (B.11) I (16 - B)/2 W3 = = 4, I W = (IB - 15)/2 = 1.5, 4 A I and so on. Then formula (B.l) for T can be rewritten in an equivalent form making explicit use of only the h sampling days, by using (B.9) to estimate the daily entrainment for each unsampled day: B A T = I h A w.T(d.). j=1 J J (B.12) I The mean, variance, and estimated variance of this estimator are I A E( T ) = ,1: h A W ' E( T ( d . ) ) h
= I w.T(d.), (B.13) I J:: 1 J J j=1 J J I
I I
I B.7 I A Var(T) = .L h 2 A wJ.Var(T(d J
.>>, (8.14)
J=l I Var(T) = .I WJ~var(T(dJ'>>' (8.15) J=l I A The estimator T of (B.1~ is biased except under special circumstances. It I is appropriate to condition all calculations on the dates when entrainment sampling occurred, since these are chosen systematically in order to sample when entrainment levels are expected to be high. Conditional on I A these dates, the estimator T of (&12) cannot be guaranteed to be unbiased. Unbiasedness would require I E~T) - T = E{T) - [T{l) + T(2) + **** , + T(m)] = 0, (8.16) I A . where E(T) is given by the right-hand side of (&13). This is because the linear interpolation that led to the weights Wj in (B.12) will unavoidably I miss entrainment spikes or other departures from linearity occurring during sampling gaps. A The estimator T of (8.1~, however, is unbiased (conditional on the sampling I dates) when Assumption 8.1 of Section 8.1 holds, that is, when there is a linear trend in daily entrainment during each sampling gap. The bias in A T will be small if Assumption 8.1 is approximately true, that is, if I actual daily entrainment in each sampling gap is nearly linear. This might be quantified as I Assumption 8.2. Whenever sampling on day k of the entrainment season is followed by a sampling gap of g unsampled days, the daily entrainment levels satisfy I IT(k+j) - {T(k) + 9i1[T(k+9+l) - T(k)]}1 ~ U for j=l, .... ,g. (B.17) I A Under this assumption, the magnitude of the bias in T of (a12) is bounded I by m A m
= I I E(T(k>> L T(k)/ ~ (m-h)U, (8.18)
I k=l k=l I where m-h is the number of days that are unsampled. I I
I 8,8 We now ~emonstrate that noncontinuous sampling results in a larger value I of Var(T) than continuous sampling with the same level of sampling effort, that is, with the same total volume sampled over the course of the entrain-ment season. To simplify the presentation, we treat the noncontinuous I sampling plan in ~/hich sampling occurs every second day throughout the sea-son. This plan's regularity minimizes the mathematical complexity of the reasoning, but thE! same reasoning with more elaborate sets of constants applies to any noncontinuous sampling plan, e.g., sampling every third day, I etc. Let the subscript "ij" denote p.s.u. i on day j, where i=I, ... ,24 and I j=I, ... ,m. (The choice of 24 p.s.u.ls is made for concreteness, but 8 or some other number of p.s.u. IS would be equally acceptable.) Assume that Rij and ~ij change slowly from day to day, that is, Rij is approximately I equal to R;,j+l for all i and j and the same holds for ~ij' This assump-tion of slow change makes conditions as favorable as possible to the use of noncontinuous sampling; if the day-to-day change is not slow, then I variability among days is greater (than for slow change), which makes non-continuous sampling more questionable because of its use of sampled days to estimate entrainment for adjacent unsampled days. I Let (prime) denote quantities associated with the noncontinuous, every-I second-day sampling plan, and the absence of a prime denote quantities I associated with the continuous sampling plan. We are comparing plans with the same total volume sampled over the season, which means that daily sample volume under noncontinuous sampling must be twice as large as under continuous sampling, because sampling occurs on only half as many days. I Therefore vi j = ;:~v i j for every i and j (where j denotes a day on whi ch the noncontinuous sampling plan operates), and consequently I R~. lJ
= v1J . ./v lJ'~. :: V**I (2v .. ) = R.. /2 1J 1J 1J I ~ ~. = AlJ** v!* :: A** ( 2v .. ) = 2~ lJ .. (8.19)
I 1J 1.J lJ lJ I In every-second-*day sampling, the days of sampling are d1 = 1, d2 = 3, d3 = 5, ... ,d h = 2h - 1 = m, (8.20) I and (8.10) gives I
= w_ = 2. (8.21)
I h 1 I I
I B.9 I A For continuous sampling, using (5.65) and Var(T(j}) = Var(T(j+1)), I A
- Var(T(J)) =
24 2 L 2 2 R.. (\.1.. + C u.. ) i=l lJ lJ lJ I A Var(T)
= Var(T(l))
A
+ Var{T(2)) + *** + Var{T{m))
A I A """ A ,..
- 2Var(T{1)) + 2Var{T(3)) + 2Var(T{5)) + *** + 2Var(T(m))
I = jI:I:R~ i lJ
.(2u ..
lJ
+ 2c2u~.)
lJ (B.22) I where i ranges from 1 to 24 and j ranges over the odd-numbered days 1, 3, A A 5, *** of the season. (The coefficients of Var{T(l)) and Var(T{m)) should I be made 1.5, rather than 2, for a better approximation; this could be done but will not, to simplify the presentation.) I For noncontinuous sampling, applying (5.65) to each day gives I I 24 2 2 2
= L (R lJ.. /2) (2u** + 4c u**)
I ;=1 lJ lJ I 24 2 2 2 .
= I RlJ.. ( iu** + C J.I ** ),- (B.23)
I i=1 lJ lJ I I Then (B.14) gives (with j used rather than dj to denote the jth sampling day, as in (B.23)). I I I
I 8.10 A h 2 A I Varl(T) = L WjVarl(T(j)) j=l I h L 4Var l (T(j)) I A j=1 I
= 1:ER .* (2~ .. + 4c2~~ .). (8.24) j1 lJ lJ lJ I
Comparing (8.22) and (8.24), we see that I A Var(T) < Varl(T) A A 2Var(T). (8.25) I An analogous argument involving X.. IS instead of lJ
~ .. IS shows that lJ I
Var(T) < A Varl(T) A ..... 2Var(T). (8.26) D If "ij<< C2"~j for some one-hour p.s.u. organism counts, these p.s.u.'s I will be the ones with the largest values of ~ .. , because the discrepancy
" li I between ~ .. and c2~~. (measured by the ratio c ~?/)1 .. =c2)1 .. ) increases as lJ lJ " lJ lJ lJ
~ij "increases. I~hen the summands in (8.22) and (8.24) make large contri-butions to the variances in these equations, which happens when )1ij is I large, these contributions come mostly from the term C2~fj and hardly at all from the term )1 ... Because c2~~. is multiplied by 4 in (8.24) and by 1J lJA A I 2 in (8.22), this implies that Varl(T) ::: 2Var(T); this approximation is excellent when the organism counts are large enough to make the ratio I C2~fj/)1ij much greater than 1 for a substantial number of p.s.u.ls. This will be the case unless the sample organism counts are almost uniformly very low. Such counts indicate low underlying means, hence (Cassie, I quoted in Section 4.2.1) a low value of c 2 and Poisson-like behavior. Even in this cas.e, the inequalities (8.25) and (8.26) hold. I I I
I 8.11 I A There is, in fact, another term contributing to the variance of T when sampling is noncontinuous. Consider the every-second-day sampling plan, I and imagine letting the sample volume v .. increase to its limit, V.. , so A lJ lJ Tj = Tj for each sampling day; that is, our sample consists of the entire I discharge flow on every second day, so entrainment is kno\~n exactly on these days. There would sti I I be error in our estimate of the seasonal total T, however. This systematic sampling of days amounts to partition-I ing the season into two equal.sized clusters of days and ."andomly select-ing one of the two clusters for complete (v ij = Vij ) enumeration. The estimator of T is then (essentially) 2 times the enumerated entrainment I in the selected half-season, and its standard error is essentially the absolute difference in entrainment between the two halves (the odd-days half and the even-days half). Thus, if TI and T2 are the odd-days total I entrainment and the even-~ays total entrainment, so T = Tl + T2 , then the sampling distribution of T would be I A P(T A
= 2T I ) = P(T = 2T 2) = .5. (8.27)
I Therefore the variance of the estimator of T is I I (8.28) I The standard error of the estimate is (essentially) the absolute difference I in entrainment between the two half-seasons, A I S.E.(T) = IT1-T21. (8.29) I A lower bound on the expected magnitude of this absolute difference can be calculated by assuming a Poisson model for daily entrainment. If each of Tl and T2 is a sum of Poisson entrainments, then the expected value of I their absolute difference is I (8.30) I which is derived by using the normal approximation to the binomial distri-bution of TI , I
I B.12 P( T IT) T
= ~ __ (1/2) 1(1/2) 2, T
(B.31) I 1 T IT* I I' 2" I with E(T1IT) = T/2 and Var(T1IT) = T/4. I A This contribution to the variance of T is more complicated to calculate for other noncontinuous sampling plans, which lack the regularity that made the mathematics (relatively) easy for the situation of I every-second-day sampling. It will suffice here to point out that the addition of the term treated in (B.2B) and (B.30) to the variance of T A I under noncontinuous sampling gives continuous sampling an even larger advan-tage over noncontinuous sampling. The formula (B.15), which does not include A this component of variance, will be used to estimate the variance of T under I noncontinuous sampling. Further analytic work is needed to extend the treat-ment of this additional component of variance from the every-second-day sampling plan to more general noncontinuous sampling plans. I A more detailed mathematical treatment shows that the variance inequality (B.25) generalizes to I I A A A Var(T) < Var'(T) < (m/h)Var(T) (B.32) for noncontinuous sampling plans with gaps of no more than one day. A A Var'(T) is close to (m/h)Var(T) unless the organism counts are almost uni-I formly low. For noncontinuous sampling plans with gaps longer than one daYl' both the upper bound and the typical behavior of Var' (T) are even larger than in (B.32). Thus, noting that m/h is the reciprocal of the I proportion of days sampled, we see that the number of experimental units (days) is much more influential than the size of each unit's sample; cut-ting the number of experimental units in half makes the variance almost I twice as great, even when each experimental unit doubles in size (volume sampled). I A comparison of two sampling plans, one continuous and the other noncontinu-ous, expending the same amount of sampling effort as measured by the total volume of all samples gathered over the whole season, reveals that the noncontinuous plan loses on both important criteria. First, the noncon-I tinuous plan gives an estimate of T that is biased, while the continuous plan provides an unbiased estimate of T. Second, the variance of the non-continuous plan's estimate of T is greater than the variance of the con-I tinuous plan's estimate of T. I I I
I I B.13 A natural interpretation of the inequality (B.32) is based on parti-I tioning the variability of entrainment into two components, a within-p.s.u.'s variance and an among-p.s.u.'s variance. Continuous sampling gives relatively precise information about among-p.s.u.'s variance, because no p.s.u. is unsampled, but spreading the sampling effort across I all p.s.u.'s lowers the amount of information about within-p.s.u.'s variance that is obtained. In contrast to this, a noncontinuous sampling plan with the same sampling effort concentrates the sampling on only some I of the p.s.u.'s, drawing a larger sample from each of these than under con-tinuous sampling. This reduces the amount of information provided by the sampling plan about variance among p.s.u.ls, because some p.s . u. IS are not I sampled, making their contents completely unknown. However, the volume of the sample drawn from each sampled p.s.u. is larger, giving more infor-mation about variance within p.s.u.'s. The superiority of continuous over noncontinuous sampling makes it clear that when a smaller number of I p.s.u.'s are sampled, the increase in knowledge about within-p.s.u.ls variance is heavily outweighed by the greater loss of information about among-p.s.u.'s variance. An intuitive interpretation of these findings I is that variance among p.s.u.'s is a much greater component of overall variability than variance within p.s.u.'s. I A The variance and estimated variance of T are given by (a14) and (B.15). Large weights w. will produce large terms in these equations, which will A J ,.. ,.. . 2 increase Var{!) and VariT)',..except when Wj is multiplied by a small I value of Var(T(d j )) or Var{T(d j )). Two points are relevant here: first, equation (B.1Q makes it clear that a long sampling gap immediately before I and/or after"the jth sampling day is what causes w. t.o.be large.. Second, small
'" "" J values of Var{T(d j )) and Var(T(d j )) occur in entrainment sampling when I entrainment itself is very.low, as the formulas for these quantities show.
The result of these considerations ,.,is that sampling gaps will affect the I variance and estimated variance of T only slightly, if at all, when these gaps occur during portions of the entrainment season known to have organism density of zero, 'or virtually zero. However, sampling gaps that occur I when organism density is greater than zero will increase Var(T) A ... and Var(T). Longer gaps will produce greater increases, all other things I being equal, as in th~ example of Section B.1. This discussion of entrainment estimation under noncontinuous sampling plans can be summarized as follows: I (1) In general, interpolation-based estimators of seasonal entrainment I will be biased; noncontinuous sampling data do not provide any information about the degree and direction of bias present. Under Assumption B.1, however, these estimators will be unbiased. If a I suitable approximation to this assumption, like Assumption B.2, seems more realistic, then bounds on the magnitude of bias can be obtained. I
I (2) 8.14 The variance and estimated variance of estimated entrainment abundance, I given by (8.1~ and (8.15), are determined by the weight and the variance of the daily entrainment estimate for each day on which sampling occurs. Large weights are attached to sampling days that I are adjacent to long sampling gaps. The variances of daily entrain-ment estimates increase with organism densities, which are closely connected to observed organism counts. Sampling gaps during the entrainment season will not increase the variance and estimated I I A variance of T if organism density is known to be zero throughout these gaps. On the other hand, the increase in variance can be very great if gaps occur in high-density periods. (3) r If gaps are necessary, they should be concentrated in portions of the entrainment season in which the density of organisms is zero, I or as close to it as possible. When density is zero, Assumption 8.1 A is satisfied, so T is unbiased; and daily entrainment estimates have I variance equal to zero, so the variance of T is not increased. (4) For much of the entrainment season, it is not possible to be certain I in advance (when sampling schedules are being decided upon) whether a given day's organism density will be high, moderate, or low. We may know, for example, only that an entrainment spike of 7 to 10 days I in duration will probably occur at some point between mid-June and mid-July. During such periods, it is best to have no sampling gaps. If they are necessary, sampling gaps should be as infrequent and as I short as possible. Many short gaps are preferable to a few long ones containing the same total number of days; gaps should be one day in length, two days if this is unavoidable, but no longer. Short gaps make Assumption 8.1 or a suitable approximation to it more tenable, I reducing or eliminating bias in entrainment estimates, and avoid the large weights that would inflate the variances of these estimates. I The simultaneous estimation of entrainment abundance for many species and life stages of ichthyoplankton calls for a conservative approach to the scheduling of gaps. Long sampling gaps during periods of nonzero organism I density introduce bias that cannot be estimated without restrictive assumptions, and also increase the variance of T much more than shorter gaps. The penalty for having long gaps when any important species-life I stage combination ;s present will therefore be great. I 8.3 NONCONTINUOUS SAMPLING AT ROSETON The methods of Section 8.2 will now be applied to 1983 data from the I Roseton Generating Station. Tables 8.2 and 8.3 present daily and seasonal entrainment estimates for striped bass yolk-sac larvae and juveniles. Sampling was performed on 35 days during the 1983 season: May 3, 10; daily I from May 17 to 27, May 31 to June 3, June 7 to 10, June 14 to 22; June 28, July 5,12,19, and 26. For each of these dates, the chronological day of the I I
.. I~
Table 8.2 Estimates of Oailx Entrainment of Stri~ed Bass Yolk-Sac Larvae and Juveniles, Roseton 1983 005 5rJ)ATE J 1) W rURl V1LARI V2LUI TJUVI V1JUYl V2JUVI 1 031UY83 1 1 11.0 0 0.00000£..00 0.00000£..00 0 0.00000£..00 O.ooOOOE.oO 2 1114A183 2 8 7.0 7.1138 1.99291£..01 1.21095£..01 0 0.00000£..00 0.00000£..00 3 171'lAI83 3 15 11.0 2,089 11.36301£..06 11.36301£..06 0 O. 00000£..00 0.00000£..00 II 1814 AY83 II 16 1.0 2,022 II. 09045£..06 11.09045£..06 0 0.00000£..00 0.00000£..00 5 lSHAl83 5 11 1.0 0 O.OOOOQE..oo 0.00000£..00 0 0.00000£..00 O.ooOOOE..oO 6 2OHAY83 6 18 1.0 0 0.00000£..00 0.00000£..00 0 0.00000£..00 0.00000£..00 1 21HA183 7 19 1.0 0 0.00000£..00 0.00000£..00 0 0.00000£..00 0.00000£..00 8 22HAY63 8 20 1.0 0 0.00000£..00 0.00000£..00 0 0.00000£..00 0.00000£..00 9 231'lAI63 9 21 1.0 1,1115 2.00210£..06 2.00270£..06 0 0.00000£..00 0.00000£..00 10 24KAY83 10 22 1.0 0 0.00000£..00 0.00000£..00 0 0.00000£..00 0.00000£..00 11 25HAY83 11 23 1.0 8,540 1.21589£..01 1.21589[..07 0 0.00000£..00 0.00000£..00 12 26HAY83 12 24 1.0 2.738 3.74838£..06 3.748]8£..06 0 0.00000£..00 0.00000£..00 27KAY83 13 25 2.5 11.193 5.862511£..06 5.862511£..06 0 0.00000£..00 0.00000£..00
-13 0 0.00000£..00 0.00000£..00 14 31MAY83 14 29 2.5 5,592 7.8168]£..06 1.81683£..06 01JUH83 15 30 1.0 2.719 3.69527£..06 3.69527£..06 0 0.00000£..00 0.00000£..00 15 02JUN83 16 31 1.0 5.506 9.112151£..06 7.19617£..06 0 0.00000£..00 0.00000£..00 16 11 03JUN83 11 32 2.5 16.621 3.65684£..01 2.46375£..07 0 0.00000£..00 0.00000£..00 18 19 07JUN83 OBJUN83 18 19 36 37 2.5 1.0 8.413 23.510 1.18018[..07 1.08673£..08 1.18018£..07 4.111969£..07 0
0 0.00000£..00 0.00000£..00 0.00000£..00 0.00000£..00 ........ CD 20 09JUH83 20 38 1.0 3,1183 4.85215£..06 4.85215£..06 0 0.00000£.00 0.00000£..00 <J'I lOJUN83 21 39 2.5 15,184 3.06656F,:..o7 2.21010£..07 0 0.00000£..00 0.00000£..00 21 14JUN83 22 113 2.5 29.1119 7.99128£..07 4.119228£..01 0 0.00000£..00 0.00000£.00 22 15JUH83 23 1111 1.0 11.171 11.50720£..01 1.90875£..01 0 0.00000£..00 0.00000£..00 23 211 45 1.0 711.5118 3.13457£..08 1.29522£..08 '0 0.00000£..00 0.00000£..00 24 16JUN83 0.00000£.00 l1JUN83 25 46 1.0 In,II85 1.54874£..08 6 .. 88722£..01 0 0.00000£..00 25 0.00000£..00 18JUN83 26 41 1.0 64,083 3.75192£..08 1.21]18£+08 0 0.00000£..00 26 0.00000£..00 27 48 1.0 63.927 8.61950£..08 1.79968£..08 0 0.00000£..00 27 19JUN83 0.00000£+00 28 49 '1.0 1)4.258 2.74965£..09 11.83091£..08 0 0.00000£..00 28 2OJUN83 50 1.0 150,108 1. 15906£..09 4.421176£..08 0 0.00000£..00 0.00000£.00 29 21JUN83 29
)0 22JUN83 30 ' , 51 3.5 1113,961 1.89695£..09 5.09346£..08 0 0.00000£..00 0.00000£..00 57 6.5 14.2911 4.55625£..07 3.54191£..07 0 0.00000£..00 0.00000£+00 31 28JUN83 31 64 7.0 0 0.00000£..00 0.00000£..00 2,315 5.35725£..06 5.35125£..06 32 05JUL83 32 12JUL83 71 7.0 0 0.00000£..00 0.00000£..00 9, 121 2.60741£..07 2.14225£+01 33 33 19JUL83 311 78 7.0 0 0.00000£..00 0.00000£..00 6,857 1.56727£+01 1.56727£..07 311 26JUL83 85 11.0 0 0.00000£..00 0.00000£..00 0 0.00000£..00 0.00000£..00 35 35 oJ, D, and Ware j, dj and Wj of Section 8.2
°TLARI and TJUVI are estimated daily entrainment of yolk-sac larvae and juveniles from (8.6) °VILAR1 and VIJUVI are estimated variance of TLARI and TJUVI from (8.8) A 2 °V2LARl and V2JUVl are estimated variance of TLARI and TJUV1 from (8.7) with c = .0626.
--- . . .' . _ ' " . _ _ . . . . . . p' I 8.16 I Table 8.3 I Estimates of Total Entrainment of Striped Bass I Yolk-Sac Larvae and Juveni1es, Roseton 1983: Variance and Standard Deviation for Selected Sampling Ratios I Yolk-Sac Larvae Sampling Estimated Variance Std dev Variance Std dev I Ratio Season from (8.4) from (8.4) from (8.3) from (8.3) Entrainment I Actual 1,460,628 3.319644E+10 182,199 1.076977E+10 103,778 1,000 2,000 4,000 1,460,628 1,46{J,628 1,460,628
- 3. 129446E+10
- 3. 322113E+ 10 3.707446E+10 176,902 182,267 192,547
- 7. 189919E+09 1.081624E+10 1.806875E+10 84,794 104,001 134,420 I
B,OOO 1,460,628 4.478113E+IO 211 ,616 3.257378E+10 180,482 I Juveniles Sampling Est'imated Variance Std dey Variance Std dey Ratio Se;~son from (8.4) from (8.4) from (8.3) from (8.3) Entrainment Actual 128,049 2.308096E+09 48,043 2.080168E+09 45,609 1,000 128,049 1. 731408E+09 41,610 9. 947608E+08 31,540 2,000 128,049 2. 179578E+09 46,686 1.838280E+09 42,875 4,000 128,049 3.075918E+09 55,461 3.525317E+09 59,374 8,000 128,049 4.868598E+09 69,775 6.899392E+09 83,063
I I 8.17 season that was the jth sampling day is listed in Table 8.2 as D, and the weight to be attached to this day in the estimate of seasonal entrainment I is listed as W. {D and Ware dj and Wj of Section 8.2}. Each sampling day consisted of 24 consecutive one-hour p.s.u.ls, which I began at 9:00 AM on the specified date and ended at 9:00 AM on the follow-ing date. For each sampling day, the 24 samples from these p.s.u. IS were A I used to compute an estimate of the day's en~rainment, T(d j ), from formula (8.6) and two estimates of the variance of T{d j ) from (8.7) and {8.8}. These appear-in Table 8.2 for striped bass yolk-sac larvae (TLARl, VILARI from I (8.8), V2LARl from (8.7)) and juveniles (TJUV1, V1JUV1 from (8.8), V2JUV1 from (8.7)). The value ~2 = .0626 = 1/15.97 was used, based on 1982 replicate I sampling data from Roseton (Table 4.8) for which K+ = 15.97 was obtained for striped bass yolk-sac larvae. This will illustrate the variance reduction that results from using (8.7) rather than (8.8). Replicate data did not I allow calculation of K+ and thus ~2 for juveniles (see Section 4.3), so
~2 = .0626 was again used for illustrative purpose;.
I For yolk-sac larvae, the daily estimated variance V2LARl is often sub-stantially lower than V1LARl, in. particular when the latter takes on its largest values on June 20, 21, and 22. For juveniles, the estimated daily entrainment and the estimates of its variance are positive only on the last three sampling dates, which all have weights of 7.0 because sampling at this stage of the season is on a weekly basis. For yolk-sac larvae, the daily sampling information in Table 8.2 is good up to the June 20-22 entrainment peak, but is scanty after this peak. The lack of information about the pattern following the observed peak makes it possible that substantial bias is present in estimated seasonal entrain-ment, unless Assumption 8.1 or a similar assumption can be supported by other means. (These data provide no evidence for or against such an assumption.) Bias in the seasonal estimate is even more serious for the juveniles. Sampling only once per week during their presence raises a strong possi-bility that the daily entrainment level peaked at a level markedly higher than was reached on any of the three days when juveniles were observed. This could have occurred during one of the six-day sampling gaps. Fur-thermore, the data tell us nothing about entrainment after the last sampled day, July 26. This is especially important because entrainment is high (relative to the maximum daily level) on July 19, so there may be some uncertainty concerning whether the drop to zero entrainment on July 26 Signals the end of nonzero entrainment levels for the rest of the season.
I 8.18 Seasonal entrainment estimates appear in Table 8.3. In the rows with I A sampling ratio listed as Actual, T and the two variance estimates for each life stage have been computed from Table B.2 using (&12) and (B.1S), I with VI corresponding to (B.B) and V2 to (8.7). To examine the influence of the sampling ratio R; = Vi/vi' the estimation of T has been performed for constant sampling ratio of R; = 1,000 for every p.s.u. i. This was I achieved by changing the sample volume vi to V/l,OOO for each p.s.u. and adjusting the organism counts proportionally. This procedure has been I repeated for sampling ratios of 2,000, 4,000, and B,OOO. Two aspects of thE: estimated variances in Table 8.3 should be mentioned. I First, variances and standard deviations obtained from (B.7) are smaller than those obtained from (B.B). For yolk-sac larvae, the differences are sizeable; for instance, with the actual sampling ratios, I Std dev (B.3)/Std dev (B.4) = 103,77B/1B2,199 = 57.0%. I Second, variances and standard deviations increase with the sampling ratio, I but not proportionally to the change in R1.. For example, for yolk-sac larvae, as R. increases from 1,000 to 2,000,a change of 100% (which 1 reduces the sample volume in each p.s.u. by 50%), the standard deviation from (B.7) increases only 22.7%, from 84,794 to 104,001. Computations of this kind can be used to evaluate the precision attained by adopting various sampling ratios. The sampling plan at Roseton would be improved by eliminating sampling gaps of six days in length, except perhaps in those portions of the entrainment season when it can be documented that entrainment levels are virtually zero. This might be done by shifting from continuous sampling at mid-season (e.g., June 14 to 22) to gaps of one day, then possibly to gaps of two days. Longer gaps cannot be recommended except as a byproduct of sporadic sampling in portions of the entrainment season in which the density of organisms is zero or nearly zero (as noted near the end of Section 8.2). The cost in bias and variance of encountering entrainment above the near-zero level on a sampling day adjoining a long gap is very great. A further possibility for the Roseton Generating Station, noted in Section 1.3, would bE~ to supplement the sampling at Roseton by the concurrent and additional sampling that was done at Danskammer Point. Sampling data from Danskammer could be used in estimating the entrainment at Roseton during periods when sampling was not performed at Roseton. This approach has not been considered in this report.
I I I 9. SAMPLING DESIGN RECOMMENDATIONS FOR FUTURE ENTRAINMENT ABUNDANCE ESTIMATION I
9.0 INTRODUCTION
I This chapter draws on the material developed and presented in earlier chapters, using it to recommend sampling design features for future I entrainment abundance estimation. General conclusions are presented. Specific features of .recommended sampling designs are then treated. I
9.1 CONCLUSION
S ON SAMPLING DESIGNS It is useful to discuss briefly the characteristics of the sampling I designs used at Bowline Point, Indian Point, and Roseton in recent years. In the first design, a p.s.u. corresponds to a three-hour period and sampling is continuous, i.e., sampling occurs over 8 p.s.u. IS per day, I 7 days per week. In the second design, a p.s.u. corresponds to a one-hour period and a full day of p.s.u.ls is sampled on systematically chosen days during the entrainment season. I A basic issue concerning these designs is that the variability of entrainment estimates has not been properly addressed. More specifically, these designs provide pOint estimators of entrainment abundance and I entrainment reduction, but the estimated variances of these estimators are needed. These estimated variances have until now been calculated under the assumption of Poisson variation in the observed data from the I discharge channel, which gives unrealistically short confidence intervals when the data are not Poisson. In some cases, even this has not been done, and the subject of estimated variances of entrainment estimators has gone unmentioned. This is a key point, because a comparison of the I designs recommended here to other designs used previously can be judged in either of two ways: I (i) in terms of the estimated variances of estimators obta"ined in connection with previous designs, or I (ii) after incorporating methodology from this study to aid in the esti-mation of variances from data collected under previous designs. The first of these comparisons is easy: in entrainment studies prior to I this report, estimated variances and standard errors of estimates have been calculated either not at all or under the often unrealistic Poisson assumption. By this standard, the improvement is unbounded. I The second comparison requires more work. Calculating for entrainment estimators from earlier studies the estimated variances developed here I I
I 9.2 makes those studies useful in a way they had not been before, because I the estimated variance or standard error of any estimator provides essential information about the precision of that estimator. The estima-tors of entrainment we have dealt with, both the standard fixed-sample-I volumE~ estimator and the adaptive, more efficient sum-quota sampling estimator, are either exactly unbiased or very nearly so. Thus the criterion of low bias is satisfied by these estimators, making low I variance and low coefficient of variation the only remaining issues. Our conclusions cover six specific topics bearing on the sampling design: continuity of sampling, equality of the sampling fractions v/V i , I the sample volume's relationship to confidence interval width, replicate sampling, sample duration, and sum-quota versus fixed-volume sampling. I These will now be discussed.
- 1. Continuous sampling, or as nearly continuous sampling as possible, is an important property of a good sampling design. The reasons for this I
were discussed in Chapter 8. Electing not to sample on some days can lea~, except under assumptions that may be considered unrealistic, to bias in T, the estimator of total entrainment abundance. It also introduces a I major source of additional variability to entrainment estimation. Scheduling a gap of 9 days in the sampling plan increases the estimated variance of T by a little more than g2/2; thus, if there must be gaps in I the sampling, many small gaps (one day apiece, perhaps two days if this is unavoidable) are much better than a few large gaps. There is one exception to this guideline, which has to do with the fact that some I species-life stage combinations are more important than others for us to estimate accurately. It may happen that all of these important species-life stage combinations are known to have densities that are zero (or I virtually zero) in a part of the entrainment season. For instance, we may know in advance from long biological experience that all of the combinations we are strongly concerned about will register zero entrain- I ment during the first four weeks of the entrainment season. In this case, only sporadic sampling would be necessary during this period. Naturally, if our "knowledge" turns out to be incorrect and a sizeable level of entrainment takes place during one or more of our sampling gaps, then we I will fail to observE~ the evidence of this entrainment peak. This danger is present whenever gaps in the sampling schedule exist; it can be avoided only by ensuring that we have full coverage of the entrainment I season except for any time interval during which we are absolutely certain no entrainment of any consequence will occur. I
- 2. Equality or neal' equality of the sampling fractions v;lV; across all p.s.u.'s prov*ides important advantages, which were discussed in Section 5.9. Inequality of these sampling fractions can lead to inaccuracy in the computation of I
confidence intervals, that is, to confidence intervals whose nominal and true confidence levels differ substantially. This is a critical issue, as the quality of the confidence intervals obtained from entrainment I abundance sampling is a primary consideration; a design that produces confidence intervals we must be cautious in believing cannot be highly regarded. I I _=__.w __. - . .
I 9.3 I Substantial inequality among the sampling fractions Vi/V; is necessary I to erode seriously the accuracy of entrainment confidence intervals. Small to moderate inequalities, e.g., sampling ratios differing among p.s.u.1s by a factor of two, will cause only a minor loss of accuracy. I As the imbalance in vi/Vi increases, though, the accuracy of entrainment confidence intervals decreases; for example, sampling ratios differing I among p.s.u.*s by a factor of six can cause major degradation of confi-dence interval accuracy. Consequently, if equality of the sampling frac-tions cannot be accomplished exactly, it should be arranged to the great-est extent possible. I The equality of sampling fractions is for all p.s.u. IS at a given plant, not necessarily from one plant to another. To make the sampling fractions I in all p.s.u.ls equal, which means making the sample volume proportional to the p.s.u. volume (plant discharge flow volume during the p.s.u.), the sampling gear intake rate could be regulated by a servomechanism with feed-I back linked to discharge flow rate. The complexity of entrainment sampling prevents any attempt to improve the accuracy of estimation by using sampling fractions that differ from one p.s.u. to another. This complexity I encompasses the need to obtain entrainment estimates for many' different species and life stages; the relative costs of collecting, sorting, and identifying samples of various sizes and compositions; the year-to-year variation in entrainment patterns; and other similar factors. A related I consideration is that vi/Vi must be the same for any collecti~n of samples that we wish to combine into a single composite sample. I 3. The total volume of water sampled, LVi' must be adequate to yield acceptable confidence intervals for the entrainment-related quantities of I interest. When this volume is divided among the p.s.u.'s so that vi is proportional
~
to the V., 1 the approximate length of the confidence interval I for T can be found by applying these volumes or approximations of them in the formulas in Chapter 5 for obtaining confidence intervals. Historical or projected data for the Ails, the mean density of ichthyoplankton in I the sampling periods, may be used in these formulas. One may proceed in the reverse direction, instead: starting with a decision that the total volume of water sampled is determined (perhaps by cost considera-I .tions), it is possible to obtain the approximate length of the resulting confidence interval in this manner. I 4. Replicate sampling offers great potential for decreaSing the length of the confidence intervals. The quadratic variance, negative binomial mod~l for entrainment data is supported by a variety of empirical and theoretical considerations. By using replicate observations to estimate c2, the quadratic variance parameter of (5.B), and then using formula (5.14) instead of (5.13), we can often obtain dramatic reductions in S.
I 9.4 I Recall that c2 differs among species-life stage combinations, so it will be most beneficial to schedule replicate sampling ~t times when the species-life stagl~ combinations of primary importance are being entrained. I It is not possible to know at present what the optimal pattern of repli-cate sampling sholJld be. This is because our current knowledge of its properties comes from a relatively small number of sampling experiments. More information must be collected in order to gain a better understand-I ing of c 2 , the negative binomial model, and related aspects of the entrain-ment process. I A reasonable way to gather further information would be to schedule a limited amount of replicate sampling on a weekly basis, e.g., two conse- I cutive days per week, Tuesdays and Wednesdays say. Such a schedule would allow us to investigate whether c2 is constant over hours in the day, days within the week, and weeks within the season. Data analyzed I in this study have been consistent with the hypothesis that c 2 is constant for each species and life stage of ichthyoplankton, i.e., that c2 has the I same value for every p.s.u. in the sampling season. Additional replicate sampling data ma)' further support this hypothesis. If instead it appears that c 2 varies over the sampling season, these changes may occur in ways I we can take advantage of. Replicate sampling should receive special attention until the behavior of c 2 and related aspects of entrainment are more fully understood. I
- 5. Sample duration, or the time interval corresponding to a p.s.u.,
should be short; as shown in Section 4.3, shorter p.s.u.ls give better I performance. The difference between p.s.u.ls of one hour and three hours, for example, is a complex function of the entrainment pattern; this dif-ference does not appear to be great. Thus, while a sample duration of one I hour is preferable, durations of two or three hours also perform well. Sample durations of less than one hour are not recommended for three I reasons. First, short sample durations lead to great variability in the data, producing observed counts that are relatively high in noise and low in signal. Second, and more important, the justification of the negative binomial model of Section 4.2 has been largely empirical, based on I analysis of one-, two-, and three-hour samples. For sample durations shorter than one hour, it cannot be taken for granted that the negative binomial model will perform as well as it does with one- to three-hour samples. Third., short durations will increase the number of p.s.u. IS and samples in the season, which in turn will increase administrative and record-keeping costs.
I I 9.5 I 6. If a splitting device is able to divide a sample into subsamples whose organism cou~ts are a random division of the sample's ichthyoplank-ton contents, in the sense of Sections 7.0 and 7.1, then the three-stage adaptive sampling method of sum-quota sampling will outperform fixed-I volume sampling plans. Under these conditions, sum-quota sampling is a more efficient source of information per dollar spent on processing the samples than fixed-volume sampling, in which the entire sample of volume I vi or a fixed fraction of the sample is always analyzed. At present, splitting devices that successfully produce a random division of the ichthyoplankton in a sample do not appear to be available, so the use of I sum-quota sampling methods is not appropriate. When the problem of organism clumping in splits has been overcome, however, sum-quota sampling should improve greatly on fixed-volume methods. More remains to be done I in order to obtain recommendations of appropriate values of the para-meters (the quota Q, the number of splits, and so on) for optimizing the performance of sum-quota sampling. I 9.2 RECOMMENDATIONS FOR ENTRAINMENT ABUNDANCE SAMPLING DESIGNS I The specific features recommended for an entrainment abundance sampling design will now be listed. These were discussed in greater detail in I Section 9.1. An effective sampling design should have I 1. Continuous sampling, or as nearly continuous sampling as possible; I 2. Equality of the sampling fractions, or as close to equality as possible, in all p.s.u.1s; I 3. Adequate total volume sampled from the discharge flow to yield acceptable confidence intervals for the entrainment quantities of interest; I 4. Sufficient replicate sampling to allow precise estimation of the quadratic variance parameter c2, leading to shorter confidence I intervals;
- 5. P.s.u.1s of duration one to three hours.
I The techniques developed in Chapter 5 can be used to obtain estimates of I entrainment abundance and entrainment reduction from a design of this kind. If improved splitting devices become available,
- 6. Laboratory analysis of samples by sum-quota sampling should be I employed. The techniques developed in Chapter 7 will then be needed.
I
-p-"'.------
I 9.6 I 9.3 RECOMMENDATIONS FOR ADDITIONAL STUDIES RELATED TO ENTRAINMENT ESTIMATION I SevE!ral additional studies would provide information with far-reaching implications for entrainment sampling. These studies would examine the I assumptions currently adopted and the proper use of historical estimates of entrainment obtained from designs different from those recommended here. I 9.3.1 A STUDY OF REPLICATE SAMPLING The replicate sampling done to date has been analyzed under Assumption 3.1'. I This treats two simultaneously drawn samples from different locations in the discharge channel as a random sample of two pump or net volumes chosen from the much larger collection of pump or net volumes constituting I the discharge flow during the p.s.u. Two simultaneous samples can be thought of as two independent draws from a common underlying distribution of organism counts. It is important to assess whether this assumption is true. If it is, the position of the sampling gear in the discharge channel I cross-section should not influence the sample organism counts. This allows us to think of each sample count as having been randomly chosen from the set of all counts that could have been obtained from different loca-I tions of the sampling gear in the discharge channel cross-section; viewing sample counts as random samples of one or two observations from this set, we can make inferences about the set and hence about total entrainment o during the p.s.u. being considered. On the other hand, systematic spatial nonrandomness may be present, that I is~, the position of the sampler in the cross-section may influence the sample organism counts. For example, counts may exhibit a consistent tendency to be higher at some depths than at others. The presence of such nonrandomness would create difficulties, perhaps great ones, for entrain-I ment estimation. There are other reasons to suspect that systematic spatial nonrandomness may occur. In a closed discharge pipe, like those at Bowline Point and Roseton, the velocity of flow at a cross-sectional location is strongly influenced by the distance of that location from the pipe's wall. Velo-I city of flow at a point in the cross-section increases as the point moves away from the wall of the pipe toward the center of the pipe. This means that a greater volume of cooling water will pass by the location of a I sampling orifice as the orifice moves closer to the center of the pipe's Cil~cular cross-section. The effects of this situation on entrainment sampling are not clear, though, because ichthyoplankton density could be re"lated to flow velocity at a cross-sectional location, to the location I itself, or to both. These complex circumstances are potentially present in the open discharge channel of Indian Point as well, although perhaps to a lesser degree. I I I
I I 9.7 I Sampling gear presently in use draw all samples from one or two fixed locations in the discharge channel cross-section. This does not permit an investigation of the effect, if any, of this location. Sampling gear I that allow the intake to occur at any of a large number of cross-sectional locations would let us address the question of whether location has an effect on sample organism counts. Such a gear could consist of a fixed I horizontal beam with a vertical crossbeam mounted to it, capable of sliding from left to right and also of being raised and lowered. Mounting the sampler intake pipe to the bottom end of the crossbeam would give it access to a wide range of positions in the discharge channel. Repeating I this arrangement by having two vertical crossbeams with sampler intake pipes, to be used simultaneously, would make it possible to take replicate observations. A carefully designed and analyzed series of paired observa-I tions could reveal whether each pair is a true set of replicates, i.e., a random sample of size two from a common distribution, or a simultaneous pair with systematic spatial nonrandomness present. If the latter is the I case, it may be possible to determine the pattern of the nonrandomness and to use this information to improve entrainment estimates. I 9.3.2 A STUDY OF ENTRAINMENT' ABUNDANCE AND PLANT OPERATING FACTORS I The difficulties in relating flow rate to entrainment were discussed in Section 4.7. The simplest model, given by (4.17), is based on the hypo-thesis that entrainment during a time interval is directly proportional I to the flow rate~ or equivalently that ichthyoplankton density in the dis-charge channel is independent of the flow rate there. Thus, doubling the flow rate in a time interval would double the number of organisms entrained. It is, however, possible that a more complicated relationship, such as I (4.19), holds rather than (4.17). To determine the links connecting entrainment abundance, species abundance in the river, and flow rate, observe that the first of these is a function of the other two. To I examine the relationship between entrainment abundance and flow rate, it is necessary to hold species abundance constant. We cannot measure entrainment simultaneously at two or more different flow rates at a given I plant, which is ideally what we would like. Instead, we can sample plant entrainment at flow rate f1 for a brief I period of length t; then change the plant operating conditions to attain flow rate f2 and wait for any residual effects of the earl ieY' flow rate f1 and the changeover to pass; then sample plant entrainment at flow rate I f2 for a time interval of length t. The time interval between sampling . I I I
I 9.8 at flow rates f1 and f2 must be as short as possible, in order to keep I river conditions (species abundance, tide, etc.) as similar as possible in the two sampling periods. Likewise, t must be small enough to make I the sampling periods similar, thereby reducing the bias resulting from differences between the sampling periods to an acceptable level. How-ever, t must also be large enough to allow collection of sufficient sample volumes to provide acceptably accurate, that is, low-variance entrainment I estimates. If practical considerations prevent us from finding a value of t that is satisfactory on both of these criteria, the first is more important: we can keep t small to avoid bias, taking several replications I with this t rather than one with a larger t. This experiment should be repeated for a variety of river conditions, I plant operating conditions, and choices of f1 and f2" Several independent runs at each combination of conditions should be performed, with as much prespecification as possible of the times and conditions at which the I runs are to be made. If feasible, a more elaborate changeover design of the form f1' then f2' then f1' then f2 could be more informative. I A critical requirement is to make adjacent time intervals comparable by making both t and the changeover period between flow rates small, so bias is low. An analysis would proceed by testing the hypothesis that the I average density X is independent of the flow rate f. Suitable variables among the run conditions may be used as covariates. I 9.3.3 USE OF HISTORICAL ENTRAINMENT ESTIMATES I Many past entrainment studies lack the replicate sampling necessary for the estimation of the quadratic variance parameter c2. The classification I model of Section 4.5.1 leads to the bound c2 < 1. Unless one adopts this 2 - model, however, no bound can be put on c , and consequently no confidence I intervals can be derived for the entrainment abundance and entrainment reduction from these past data sets. While c2 is known to vary from one species and life stage of ichythoplankton I to another, it may well be the case that for a given species and life stage, c 2 is constant across p.s.u.*s within a particular entrainment season and also from one season to the next. Replicate sampling data I analyzed in this project are consistent with the hypothesis that c 2 remains unchanged throughout a season. Replicate sampling data from I another year or two will be necessary in order to assess whether c 2 for a given species and life stage remains the same over several years. If it does, the estimated value ~2 obtained from replicate sampling data spanning I I I
I I 9.9 I several years can logically be applied to studies from previous years without replicate sampling. This approach would result in confidence .1 intervals for entrainment estimates from these previous years. 9.3.4 A STUDY OF PLANT FLOW VOLUMES AND SAMPLE VOLUMES I Throughout this report, it has been assumed that the discharge flow volume I Vi and the sample volume vi in each p.s.u.,i=l, *** ,N, are known. As noted at the end of Section 5.9, if they are not known exactly and must be estimated, there will be a random amount of difference between I our data value for Vi (for example) and the underlying, unknown true value of Vi' The same is true for vi. The presence of such random errors could have serious statistical consequences for entrainment sampl-I ing methods. A study of the relationship between the values of Vi and vi determined by standard measurement methods and the true values of Vi and I vi would provide information on how much or how little error is included in the observed values recorded for Vi and vi. The magnitude of the I effect of these errors on entrainment estimation could then be investi-gated. I I I I I I I I I
I I REFERENCES I Anscombe, F. J. 1949. The statistical analysis of insect counts based on the negative binomial distribution. Biometrics 5, 165-173. I Bliss, C. I. 1953. Fitting the negative binomial distribution to biologi-cal data. Biometrics 9, 176-196. I Cassie, R. M. 1963. Microdistribution of Plankton. Oceanogr. Mar. Biol. 1, 223-252. I Central Hudson Gas and Electric Corporation (CHG&E). Roseton and Danskammer Point Generating Stations 316(a) Demonstration. Central Hudson Gas and Electric Corporation, Poughkeepsie, New York. I Clopper, C. J. and Pearson, E. S. 1934. The use of confidence or fiducial limits illustrated in the case of the binomial. Biometrika 26, I 404-413. Cochran, W. G. 1977. Sampling Technigues, 3rd edition. John Wiley & Sons, I Inc., New York. Collings, B. J. and Margolin, B. H. 1985. Testing goodness of fit for the Poisson assumption when observations are not identically distributed. I Journal of the American Statistical Association 80, 411-418. Consolidated Edison Company of New York, Inc. (Con Edison). 1977. I Indian Point Unit No.2, Indian Point Unit No.3. Near Field Effects of Once-Through Cooling System Operation on Hudson River Biota. Con-solidated Edison Company of New York, Inc., Power Authority of the I State of New York. July 1977. Ecological Analysts, Inc.' (EA). 1981. Indian Point Generating Station I Entrainment and Near Field River Studies: 1979 Annual Report. Pre-pared for Consolidated Edison Company of New York, Inc. Ecological Analysts. Inc .* Middletown. New York. . I EA. 1~83a. Bowline Point Generating Station Entrainment Abundance and Unit Outage Evaluation: 1982 Annual Report. Prepared for Orange I and Rockland Utilities, Inc. Ecological Analysts. Inc., Middle-town, New York. EA. 1983b. Roseton and Danskammer Point Generating Stations Entrainment I Abundance Studies: 1982 Annual Report. Prepared for Central Hudson Gas and Electric Corporation. Ecological Analysts, Inc., Middletown, New York. I EA. 1984a. Bowline Point Generating Station Entrainment Abundance and Outage Evaluation: 1983 Annual Report. Draft version. Prepared I for Orange and Rockland Utilities, Inc. Ecological Analysts, Inc., Middletown, New York. I
I REFERENCES/2 I EA. 1984b. Indian Point Generating Station Entrainment Abundance and Outage Evaluation: 1983 Annual Report. Prepared for Consolidated I Edison Company of New York, Inc. Ecological Analysts, Inc., Middletown, New York. I EA. 1984c. Roseton and Danskammer Point Generating Stations Entrain-ment Abundance Studies: 1983 Report. Prepared for Central Hudson Gas and Electric Corporation. Ecological Analysts, Inc., Middle- I town, New York. Edmondson, W. T. and Winberg, G. G., editors. 1971. A Manual on Methods I for the Assessment of Secondar Productivit in Fresh Waters. International Biological Programme I.B.P. Handbook No. 17. Black-well, Oxford, England. I Efron, B. 1982. The Jackknife, the Bootstrap, and Other Resampling Plans. Society for Industrial and Applied Mathematics, Philadelphia, I Pennsylvania. Evans, D. A. 1953. Experimental evidence concerning contagious distri- I butions. Biometrika 40, 186-21l. Fisher, R. A. 1953. Note on the efficient fitting of the negative binomial. Biometrics 9, 197-200. I Gart, J. J. 1964. in virology. The analysiS of Poisson regression with an application Biometrika 51~ 517-521. I Greenwood, M. and Yule, G. U. 1920. An enquiry into the nature of fre-quency distributions of multiple happenings, with particular reference I to the occurrence of multiple attacks of disease or repeated acci-dents. Journal of the Royal Statistical Society, Series A 83, 255-279~ I Johnson, N. L. and Kotz, S. 1969.
&Sons, Inc., New York.
Discrete Distributions. John Wiley I Madenjian, C. P. and Jude, D. J. 1983. Error bounds for estimates of entrained ichthyoplankton. Can. J. Fish. Aquat. Sci. 40, 10-16. I Martin Marietta Corporation, Environmental tenter and Virginia Institute of Marine Scil~nce. (Martin Marietta). 1981. Investigation of I Hydrodynamic and Behavioral Factors that Affect Zooplankton Sampling by Pumps. Prl=pared for Maryland Department of Natu*ral Resources, Power Plant Siting Program, by Environmental Center, Martin Marietta Corporation, Baltimore, Maryland. I I I
I I REFERENCES/3 I Mood, A. M., Graybill, F. A., and Boes, D. C. 1974. Introduction to the Theory of Statistics, 3rd edition. McGraw-Hill Book Company, New York. I Morris, C. N. 1982. Natural exponential families with quadratic variance functions. Annals of Statistics 10, 65-80. I Normandeau Associates, Inc. and Consolidated Edison Company of New York, Inc. (Normandeau and Con Edison). 1982.' Gear Comparability Study for Entrainment Sampling of Juvenile Fish at the Indian Point Station, I 1981. Consolidated Edison Company of New York, Inc. Pathak, P. K. 1976. Unbiased estimation in fixed cost sequential sampling I schemes. Annals of Statistics 4, 1012-1017. Pielou, E. C. 1977. Mathematical Ecology. John Wiley &Sons, Inc., New York. I Quenouille, M. H. 1949. A relation between the logarithmic, Poisson, and negative binomial series. Biometrics 5, 162-164. I Rubinstein. R. Y. 1981. Simulation and the Monte Carlo Method. John Wile,l &
. -Sons, Inc., New York.
I Snedecor, G. W. and Cochran, W. G. 1980. Statistical Methods, 7th edition. Iowa State University Press, Ames, Iowa. I Student. 1907. On the error of counting with a haemacytometer. metrika 5, 351-360. Bio-I Vaughan, D. S. and Kumar, K. D. 1981. Detectability and Precision of Estimates of Entrainment Mortality of Ichthyoplankton. Oak Ridge National Laboratories, NUREG/CR-1984 ORNL/NUREG/TM-435. I I I I I I I
I APPENDIX I I DERIVATION OF A THEORETICAL MODEL FOR POISSON I VARIATION IN THE DISCHARGE CHANNEL Define I A = cross-sectional area of water in the channel a = cross-sectional area of water in the sampler I W= velocity of water in the channel w = velocity of water in the sampler e(u) = mean cross-sectional density of ichthyoplankton in both the I channel and the sampler at time u. At any instant, du, the mean cross-sectional ichthyoplankton count in the I channel is Ae(u)du, and thus, the mean count during a time interval (uO'u 1) is I I I Similarly, in the sampler, the mean count is I I By definition, AW=F=flowrate in the channel, and aw=f=flowrate F sampler. Thus T{uO'u ) = 1t(uO'u1). 1 I Consider a cross-section, at any instant du, divided I ~ ~
/ /
I I \ area of subre~ion = 4 area of cross-section = a I \, ~ J I
~
I into m subregions, each with area o. Let Yi,i=l, .*. ,m denote the number of ichthyoplankton in the ith subregion. Make the following assumptions: I I
I 1.2 I
- 1. ,
P(Y.>l) = 0(0) The notation 0(0) denotes a function that satisfies I lim 0(0)/0=0. Thus, we are assuming that there will be no more o~ than one observation in any sufficiently small subregion. I
- 2. p(Y i=l) = 9 00+o( 0), for some constant 9 0 I This assumption can be interpreted as saying that, for small sub-regions, the probability that the region has an observation is I,
proportional to the area of the region. I
- 3. Y1"."Ym are independent for every m.
m I Let T = L Y.. Then P(one or more Yi >l) = 0(0) by assumption 1, so m i=l ' I m m
= p( I 1=1 Yi=t) = p{ I Y.=tIY 1 i=1 ' ~ l'Y 2 <
1, ... ,Y < m-
- 1) I I
I I where the last equality follows .from assumptions 1 and 3. To evaluate the conditional p"obability, note that if Yi ~ 1, then Yi is either I o or 1. Using assumptions 2 and 3, we can treat these conditional Y.IS , as binomial variables, and use the binomial probability formula to obtain I I Now let m-+<D,o-+O, and mo-+a. That is, take smaller and smaller subregions, but always cover the entire cross-section. Then, using Stirling's formula, I I I
I I.3 I I I I I I This is the Poisson distribution. I Therefore, the distribution in a cross-section of width du at time Uo is Poisson(0 0a). To get the distribution of the counts for a sample taken from time Uo to u1' we must integrate over all cross-sections. I Since the value 00a is a function of time, write 00a=a0(u)du. The above argument shows that the number of ichthyoplankton, T(u), in a cross-section I of width du is Poisson(a0(u)du). If sampling is done duri~g (uO,u 1), and the velocity of water is w, the total number of ichthytJplankton is a I Poisson variable with parameter I I The same argument applies in the channel, the only difference being that I the cross-sectional area is A and the velocity is W. Thus, the varia-tion in the channel is Poisson, with parameter I I I if assumptions 1 to 3 hold. I I
I I.4 The Poisson distribution is closely related to the concept of complete I spatial randomness. Consider an individual organism located somewhere in the cross-section of the discharge channel at which the sampling gear has been placed. It is plausible to hypothesize that the organism is I equally likely to appear at any position in the cross-section of the sampler. In other words, no point in this cross-section is more likely to be the organism's position than any other point. A slightly different I way to express this situation (which is also more correct in formal mathe-matical terms) is: the probability of the organism's appearing at some point in any specified subregion of the cross-section is directly propor-tional to the area of the subregion. Any two subregions of the same cross-I sectional area are equally likely to contain the organism no matter what their shapes and locations in the cross-section of the sampler are. I A second plausible hypothesis is that each organism's position in this cross-section is determined independently of all other organisms. In other words, information about the location of any organism(s) in the I cross-section at which the sampling gear is situated is not of any value or help in predicting where any other organism(s) will appear. If both of these assumptions are met, we can say that the distribution of I organisms is spatially random, or that this distribution is characterized by complete spatial randomness. Assume further that a fixed number of individual organisms is present in the discharge channel in a cross-I section of incremental width dt, from which the sampler draws a very small fraction of the cross-section's volume. Then a derivation similar to the one given earlier in this appendix establishes that the count of organisms I sampled has a Poisson distribution. This result can be extended from the incremental cross-section of width dt to a time interval (to,t 1) without difficulty. I I I I I I I I I
I APPENDIX II I DERIVATION OF A THEORETICAL MODEL FOR NEGATIVE BINOMIAL MODEL VARIATION IN THE DISCHARGE CHANNEL I If a particular ichthyoplankton count is observed with Poisson variation, I i.e., X - Poisson(Av) where v = volume of water sampled and A = mean density, and if in addition A itself is subject to variation according to the density function I I for A>O, I I which is a Gamma(K,S) distribution, we then have I P(Xsxl - Jp(X=XIAlf(AldA I SSCDe-AV(V),,)X SK AK-1e-S)"d). o xl r(K) I = vXSK (CD)"X+K-le-<S+v) Ad). X! r (K) Jo I I
- VXSK r(x+K) x 1 r (K) ( S+v) x+K I Using the fact that r(n)=(n-l)l, this last expression can be rearranged to give I
I for x=O,1,2, ... (ILl) I which is the negative binomial distribution. For convenience, let P=S/(S+v) and q=l-p=v/(a+v). It is well known that the negative binomial distribution I (11.1) has mean E(X)=~=Kq/p and variance Var{X)=Kq/p2. (See, for example, Mood, Graybill, and Boes, 1974, Appendix B). Routine algebra shows that I
I 11.2 Var{X) = Kq/p + (Kq/p2 Kq/p) I
= II + ll(l-p)/p - II + II (Kq/p) (I/K) (11.2) I = p + (l/K)1l2 I The density of the negative binomial distribution with parameters K and p is obtained from (11.1) by replacing S/(S+v) with p and replacing I v/(S+v) with q = I-p, giving P(X=X) = (K+X-l)pK qX for x=0,1,2, ... (11.3)
I
, x It is sometimes convenient to use the parameters c 2 of (4.30) and II = Kq/p I
instead of K and p. Comparing (4.30) and (11.2) shows that I c2 = I/K (11.4) K = l/c 2 ; I comparing the formulas for E(X) and Var(X) given above (11.2) and using (4.30), we obtain I Var{ X) = ll/p = II + c 2 2 II , (II.S) I from which it follows immediately that I p = 1/(C 211 + 1). (11.6) I A standard interpretation of the negative binomial (K,p) distribution is that it describes the number of failures occurring before the Kth success I in a series of independent Bernoulli (success-failure) trials in which the probability of success is p at every trial. Although this interpretation makes sense only when K is an integer, and some calculations for the nega-tive binomial distribution are easier when K is an integer, none of the I results in this study require K to be an integer. There are numerous methods for simulating a negative binomial random vari-I able, several of which appear in Rubinstein (1981, Sec. 3.7 and 3.7.4). The most routine of these is the general algorithm for generating random I I
I B.3 I values from any discrete density known as the inverse transform algorithm (Rubinstein's Algorithm IT-2). It begins by calculating Po = P(X=O), I PI = P(X=I), P2 = P(X=2), *** using (11.3). It then generates a uniform random number, U, from the unit interval (0,1), and transforms U to a nega-I tive binomial X by returning the value x = 0 if U < Po I x = 1 if Po 2 U < PI I I I etc. This algorithm was used in the simulation study of Chapter 6 to generate I negative binomial random values. The code for this simulation study appears in Appendix VIII. I I I I I I I I I I
I APPENDIX III I DERIVATION OF THE BETA-BINOMIAL DISTRIBUTION Given that we observe a pair of independent random variables (X,Y), where I both X and Y follow the negative binomial distribution given in Appendix II, the conditional distribution of X, conditioned on the fact that X+Y=t, is given by I I p(x=xlx+Y=t) = P(x=x)P(Y=t-x) P(X+Y=t) I S )K ( v ) x K+t-x-l (_S_)K (~) t-x
= ( S+v S+v t-x S+v S+v I
( 2K+t-l)(--L)2K(~ t S+v S+v
)t I
for x= 0 , 1 , 2 , . . 0 I I' I Since both X and Yare negative binomial with parameters K and B/(B+V), it follows that X+Y is also negative binomial with parameters 2K and I B/(6+v). Thus, I p(x=xlx+Y=t) = P(x=x,x+Y=t) P(X+Y=t)
= P(X=x,Y=t-x)
I P(X+Y=t)
= P(X=x)P(Y=t-x)
I P(X+Y-t) I which is the beta-binomial distribution. I I I
I APPENDIX IV I THE LIMIT OF NEGATIVE BINOMIAL SUMS I Let Xi be a negative binomial random variable with probability mass function I I I or, for short, write Xi ~ NB(Ki,p). I If X1, ... ,X n are independent random variables, and X. ~ NB{K.,p), then I it is true that T n
=.r Xi ~ NB(
n r 1 1 K. ,p). The probabil ity generating function of T is g;3e~ by ;=1 1 I n I ~(s) = E I i-l K.
~
(l-(l-p)s) - I n n I Now let n~ so that L K.~,
;=1 1 p~l, and (l-p) L K.~A. ;=1 1 I Then I
I I and passing to logarithms, it follows that I I I
I IV.2 I the probability generating function of the Poisson distribution. Thus, in this precise sense, the Poisson is the limit of negative bino-I n mials, or, for large n, L X.
. 1 1 1=
is approximately Poisson. I Translating this back to the case of sampling in the discharge channel will give some indication why it is not expected that the Poisson approxi-mation will work. For discharge sampling, p=S/(S+v) (see Appendix II), I where v = volume sampled and S is a parameter describing the variation in mean density. In order for the above limit argument to apply, S/(S+v) must be constant throughout the entire sampling period. While this may I be approximately true for v, it is almost certainly false for S. The other two limit conditions, that p+l and (l-p)tK i + constant are also suspect. Hence, although it is possible to model sums of negative binomials I as Poissons, this theory does not seem to apply here. I I I I I I I I I I I I
I APPENDIX V I CALCULATION OF BIAS IN SUBSAMPLING I A sample ;s randomly divided into L=8 subsamples, and these are processed in random order until Q=200 organisms have been counted, i.e., the pro-I cessing sequence terminates with the tth subsample if the total count, A Tt , in the t subsamples is at least Q=20~o The usual estimate, T, of the I total count in the L subsamples is then T = (L/t)Tto Let fi(x) denote the probability that the total in i subsamples, Ti , I equals x, i.e., I I Let fx(x) denote the probability that a single subsample c:ount equals x, i.e., I I and let EL( ) denote expectation over L subsamples. Then we have I I I I I I I I I
V.2 I I I where we use DL(Q-l) to denote the second two terms above. We now have II a recursion relation, which allows us to write L
= A+ I ° (Q-l) I £=2 £ I
Note that we have used the convolution formulas, I I I I' where F denotes the cumulative distribution function of f. of the Poisson distribution we have For the case II
-]..l x I fX(x) = ~¥- x. =p(x;]..l),
I* I
- 1 These equations together with the convolution equation imply I
I I I
- *
- i 1 _ i l... _ _ ,.ww _.
I V.3 I So in the Poisson case I I I N.B. If Xi = jLX lJ
.. is a sum of independent Poisson variables with means I E(X;j) = Pj~' ~Pj = 1, then Tt = ~Ttj and E(Ttj/t) = PjE(Tt/t).
J J I I I I I I I I I I I I I
I APPENDIX VI I TESTING FOR A QUADRATIC RELATIONSHIP BETWEEN VARIANCE AND MEAN USING PAIRED OBSERVATIONS Suppose that n paired observations (X 1,Y 1), (X 2,Y 2), ... ,(X n ,Y n) are col-I lected. These may be two replicate observations X.1 and Y.1 taken at each of n times indexed by i=l, 2, ... ,n. We wish to test the null hypothesis I that 0~ = ~i + c2~~ for each i. Assume that for every i, Xi and Yi are independent, identically distributed random variables with mean ~. and I 2 2 1 variance ~. + c ~ .. Let I z., = (X.1 ,
+ Y.) /2 I
s? = (X. - Vi) 2/2.
, , I Then I
EZ. = 1 , , (~. +~.)/2= ,
~.
I (1/2)[E(X?) + E(Y?) - 2E(X.)E(Y~)]
= E(X?) - E(X.)E(Y.)
1 " I
= Var(X.) + [E(X.)]2 - [E(X.)]2 = ~., + c2~?,
1 , , I Note that I*
= (1/4)[E(X?) + E(Y?) + 2E(X.)E(Y.)]
I
= (1/2)[E(X?) , + E(X.)E(Y.)] , 1 = Var(X.)/2 + [E(X.)]2 I , 1 = v./2 + (1 + c 2/2)~ 2.. , , I I
I I
I
,-VI.2 I
It follows routinely that I 2 I ES 21. = jl. 1
+ 2 2 = E [Z.(l C).1.
1 1 + C 1 + c 2/2 Z.)] 1, I so under appropriate assumptions, I I I If c2 is constant, or relatively constant, from sample to sample, we can sum over the n independent samples to obtain I 2/2) n S~(l + c
- 2 I L 2 Xn i=l Z.O 1
+ c z.)
1 I and I, (VI.l ) I I where ~2 is an estimate of c2* Each sample variance sf based on a paired sample (Xi,Y i ) is an unbiased estimator of the variance Jli+ c2Jl;, 2 so I I I I = E[X~1 + Y~1 - 2X.Y.] 1 1 (VI.2) I
I VI.3 I It is also true that (using the independence of X.1 and Y.) 1 I E[X. + Y. + 2X.Y.c 2] = E(X.) + E(Y.) + 2c 2E(X.)E(Y.) 1 1 11 1 1 1 1 I 2 2
= 2(1l.+ c ll.)
1 1 (VI.3) I It follows, with the help of these results, that I E[(X.+V.)(X.+Y.-1)] = E[X~+Y~+2X.Y.-(X.+V.)] 1111 111111 I 2 2 .-2X.V.] - E[X.+V.+2X.Y.c 2]
= E[X.+V 1111 1111 I + E[4X.Y.
1 1
+ 2X.V.c ]
1 1 2 I
= 2(lli+ c211~) - 2(lli+ c211~) + 2(c 2+2)E[X i Vi ] (VI.4)I Thus, if a number of independent pairs (X.,V.) 1 1 2
having a common c , although I not necessarily a common ll, are available,
.1 then a consistent estimator of c 2 is given by n
I (x.+v.)(X.+V.-1)
+ I "2 i=1 1 1 1 1 C =
2 n I x.v.
- 2 (VI.5)
I
. 111 1 =.
where [ ]+ is the positive part function: [w]+ = max(w,O). I (Consistency is an important property for an estimator to possess. Informally, it means that the estimator is certain to approach the true, unknown parameter as the number of observations increases. For a more I formal definition, see Mood, Graybill, and Boes (1974, p. 295).) Substituting the ~2 of formula (VI.5) into (VL1) gives a relationship 2 2 2 I that can be used to test Ho : cr*1 = ll*1 + C ll*1 for each i. I I
I APPENDIX VII I A TIME SERIES APPROACH TO ESTIMATING I ENTRAINMENT ABUNDANCE A recent paper (Madenjian and Jude, 1983) has proposed tha': time series I techniques can be incorporated directly into the estimation of entrain-ment at power plants. In assessing whether this approach is appropriate, it is important to distinguish between the actual entrainment during the year and the theoretical expected entrainment of the power plant. I The theoretical expected entrainment can be viewed as the average entrain-ment that would be observed in many replications of the saMpling year under identical plant and river conditions. These replications would I differ in observed entrainment because of random variation in the under-lying process. The actual entrainment during the year is a single replication from this conceptual system. I Madenjian and Jude have confused sampling error and process error. (Sampl-ing error is the random variability inherent in sampling from the discharge channel at a given time; process error is the random variability inherent I in the stochastic process of entrainment over the year.) Their replicate samples at a given instant provide a measure of the sampling error vari-I ance of N. - N., where N. is the actual number entrained during period j
... J J J and Nj is their estimate of this number. (In the paper they fail to make this notational distinction, referring to Nj as the estimated number I entrained.) The entrainment process (N j ) over the year is indeed a stochastic process that generates a time series when time is blocked into I periods; if the correct model for this process could be specified, then sampling error Nj - Nj and process error Nj - ~j might be separately I estimated, where ~j is the theoretical expected entrainment during period
- j. Since sampling errors on different sampling occasions are uncorrelated (Cov(N i - Ni' Nj - Nj ) = O}, an estimated serial correlation is I
Cov(N.-~.,
, , N"-~")
J J I I The authors calculate rl"l-J"I using Box and Jenkins techniques, and then I incorrectly multiply it by sampling standard errors to lIestimate the ll numerator, which in their notation is Cov(N i , Nj ), as I = r I"l-J* 'II Var(N"-N")Vard~.-N.) 1 1 J J I I
I VI 1. 2 Madenjian and Jude seem unaware of the statistical subtleties in estima-I tion inference when several components of estimation error are involved. If the estimation target is the actual entrainment during the specific year in question then process error is not an issue, since process error I refers to what might have been entrained rather than what was entrained (process error is the difference between actual entrainment and the theoretically expected entrainment). Actual entrainment is estimated by I a sampling process that yields internal estimates of sampling error variance and permits estimation of the sampling standard error of our estimate of the actual entrainment. I If the objective were, instead, to estimate the theoretical entrainment, then the estimate of the actual entrainment for the year would serve that purpose, but would include a second component of error I I
~ A Actual - Theoretical = (Actual Actual) I + (Actual - Theoretical).
I Estimation of the second component is model-dependent, and several years of data would be required to validate a model. Statistical separation of sampling error variance and process error variance would require more I sophisticated statistical methods than those used by the authors. I I I I I I I I I
I APPENDIX VI II I COMPUTER CODE (IN BASIC) FOR SIMULATION STUDY OF CHAPTER 6: I ONE DAY (8 P.S.U.'S), UNIFORM VARIATION OF DENSITY I I I I I I I I 1 I I I I I I .1
I VIII/2 I 10 ~SIMULATE5: SIMULATES VALUES FOR 1 DAY:S 3-HOUR PERIODS I 20 ~AND PRINTS OUT THE RESULTS. 30 40
!LAMBDA IS UNIFORM.
OPTION BASE 1 I 50 DIM A(16,4) ,C(16) 60 70 SO
!A CONTAINS THE RELEVANT PARAMETERS AND STATISTICS: !THE THE AVERAGE (OVER ALL SIMULATIONS) OF THE DAILY VALUES !C CONTAINS THE DAILY VALUES (TO UPDATE A)
I 90 !A1:TRUE MEAN A2:TRUE VAR A3:TRUE CV 100 110
!A4-AS:1-SAMPLE STATISTICS !A4:MEAN A5:VAR(CSQ=1) A6:CV I
120 !A7-A12:2-SAMPLE STATISTICS 130 !A7:MEAN AS:VAR(KNOWN CSQ) A9:VAR(EST CSQ) 140 Al0:CV(KNOWN CSQ) Al1:CV(EST CSQ) 150 A12:EST OF eSQ 160 170 lS0 DIM Vsample(4),Lambda(4),Csq(6) DIM X(S),Y(S),Meanden(S,100) DIM Labl$(16)[15J,Lab2$(16)[15J LABELS FOR NICE OUTPUT I 190 Lab1$(1)=" TRUE DAILY " 200 210 Lab2$ (1) =" TOTAL II Lab1$(2)=" TRUE STD DEV" I 220 Labl$(3)=" TRUE eOEFF. II 23(> 240 Lab2$(3)="OF VARIATION " Labl$(4)=" DAILY TOTAL" I 250 Lab2$(4)=" (one sample)" 260 270 2S0 Labl$(5)=" STANDARD DEV." Lab2$(5)=" (one sample)" Labl$(6)=" COEFF. OF VAR." I 290 Lab2$(6)=" (one sample)" 300 310 Labl$(7)=" DAILY TOTAL" Lab2$(7)=" (two sample)" I 32() Lab1$(S)=" STANDARD DEV." 330 340 Lab2$(S)=" (csq known)" Labl$(9)=" STANDARD DEV." I 35() Lab2$(9)="(csq estimated)" 360 370 3S0 Labl$(10)=" COEFF. OF VAR." Lab2$ (10) =" (csq known)" Lab1$(11)=" COEFF. OF VAR." I 390 Lab2$(11)=" (csq estimated) I II 400 Labl$(12)=" ESTIMATE OF" 410 Lab2$(12)=" C-SQUARED" 420 Nsim=100!********NUMBER OF SIMULATED DAYS************ 430 440 Nvar=12! NUMBER OF VARIABLES Nperiod=S !NUMBER OF SAMPLING PERIODS PER DAY I 450 READ Vsample(*) 460 470 4S0 DATA 100,150,200,300 READ Lambd~(*) DATA .01,.05,.1,.25 I 490 READ Csq(*) 500 DATA 0,.05,.1,.33333,.5,1 I I
I I VIII/3 I 510 520 FOR 11=1 TO 4!LOOP ON LAMBDA
! GENERATE PERTURBED LAMBDA VALUES WITH UNIFORM VARIATION I 530 540 550 FOR 1=1 TO Nperiod FOR J=l TO Nsim MeandenCI,J)=LambdaCIl)+LambdaCll)*(RND-.5)
S60 NEXT J I 570 580 NEXT I FOR Ic::::l TO 6 !LOOP ON VALUES OF CSQ S90 !INITIALIZE I 600 610 620 FOR 1::::1 TO Nvar FOR Iv=l TO 4 ACI,Iv)=O 630 NEXT Iv I 640 650 NEXT I FOR Iv=1 TO 4! LOOP ON VALUES OF SAMPLE VOLUME 660 !*****************START SIMULATION LOOP******************** I 670 680 FOR N=1 TO Nsim
! INITIALIZE 690 FOR 1=1 TO Nvar I 700 710 720 C (I) =0 NEXT I FOR 1=1 TO Nperiod ! 8 3-HR SAMPLING PERIODS C1 DAY)
I 730 740 750 Mu=Vsample(Iv)*Meanden(I,N) C C1 ) =C C1 ) +Mu C(2)=CC2)+Mu+Csq(Ic)*Mu A 2 760 IF Csq(Ic)=O THEN I 7"70 780 CALL PoisvarCMu,XCI>> CALL PoisvarCMu,Y(I>> 790 ELSE I 800 810 820 K=INTCCI/CsqClc>>+.5) P=l/CCsq(Ic)*Mu+l) CALL Negativbi CK,P,XCI>> 830 CALL Negativbi (K,P,YCI>> I 840 850 END IF NEXT I 860 C(3)=1 I 870 880 IF C(1>>0 THEN C(3)=SQRCC(2>>/C(1) FOR 1=1 TO Nperiod ONE SAMPLE STATISTICS 890 C(4)=C(4)+X(I) I 900 910 920 C(S)=C(S)+X(I)*C1+XCI>>/2 NEXT I C(6)=1 9:::;;0 IF C(4>>0 THEN I 940 950 C(6)=SQRCCCS>>/CC4) END IF 960 !************* CALCULATE ESTIMATE OF CSQ ************** I 970 980 TopeO Bot=O 990 FOR 1=1 TO Nperiod I 1000 Z=X(I)+Y(I) I
I VIII/4 I 1010 1020 IF Z>O THEN Top=Top+Z* (Z-1) I Bot=Bot+X(I)*Y(I) 1030 1040 1050 END IF NEXT I I 1060 IF Bot}O THEN 1070 1080 Csqhat=.5*Top/Bot-2 IF Csqhat(O THEN Csqhat=O I 1090 ELSE 1100 1110 1120 Csqhat=O END IF I
!*************TWO-SAMPLE STATISTICS*********************
1130 1140 1150 FOR 1=1 TO Nperiod Z=(X (I)+Y(!>> 12 C(7)=C(7)+Z I 1160 C(8)=C(8)+2*Z*(1+Csq(Ic)*Z)/(2+Csq(Ic>> 1170 1180 C(9)=C(9)+2*Z*(1+Csqhat*Z)/(2+Csqhat} NEXT I I 1190 C(10)=1 1200 1210 1220 C01}=1 IF C(7})0 THEN C(10}=SQR(C(8>>/C(7) I 1230 C(11}=SQR(CC9}}/C(7) 1240 1250 END IF C (12) =Csqhat I 1260 !***********UPDATE VARIABLES*************** 1270 1280 FOR 1=1 TO Nvar A(I,Iv)=<<N-l)*A(I,Iv)+C(I>>/N I 1290 NEXT I 1300 1310 1320 NEXT N
!****************END SIMULATION LoOP************************
NEXT Iv!END OF LOOP ON SAMPLE VOLUMES I 1330 1340 1350 FOR 1=1 TO 11 FOR Iv=l TO 4 IF 1=3 OR 1=6 OR 1>9 THEN I 1360 A(I,Iv}=A(I,Iv)*100 ! PUT CV ON PERCENT SCALE 1370 1380 END IF IF 1=2 OR 1=5 OR 1=8 OR 1=9 THEN I 1390 A(I,Iv)=100000*SQR(A(I,Iv>>/Vsample(lv} 1400 1410 1420 END IF IF 1=1 OR 1=4 OR 1=7 THEN ACI,Iv)=100000*A(I,Iv}/Vsample(Iv) o 1430 1440 1450 END IF NEXT Iv NEXT I I 1460 !****************PRINT OUT RESULTS*~************************ 1470 1480 PRINTER IS 701 ! TURN ON PRINTER PRINT USING 1490 I 1490 IMAGE 1,6X,"SIMULATIoN OF SAMPLING FOR ONE DAY- EIGHT 3-HOUR PERI ODS",I 1500 PRINT USING 1510 I I
I VII 1/5 I I 1510 IMAGE lOX," Channel volume=100,000 cubic meters per 3-hour perio
- d. II ,/
1520 PRINT USING 1530 1530 IMAGE 15X,IIMean density subjected to uniform perturbations. ",I I 1540 PRINT USING 1550;1000*LambdaCIl),Csq(Ic) 1550 IMAGE lX,"Mean Density of Organisms=II,3D.D,1I1 1000 cubic meters", 4X,"Value I of c-squared=",D.2D,11 1560 PRINT USING 1570 1570 IMAGE 32X,"Volume of Sample in cubic meters" 1580 PRINT USING 1590 I 1590 _ _ _ _ I. 1~)00 IMAGE 18X,"------------------------------------------------------ PRINT USING 1610;Vsample(1),VsampleC2),Vsample(3),Vsample(4) I 1610 1620 1630 IMAGE 15X,4CSX,5D,6X) PRINT USING 1590 FOR 1=1 TO Nvar 1640 PRINT USING 1650;Labl$(I),ACI,1),A(I,2),A(I,3),A(I,4) I 1650 1660 H,70 IMAGE 15A,4(10D.DD,3X) PRINT USING 1670;Lab2$(I) IMAGE 15A,1 I 1680 1690 1700 NEXT I PRINT USING 1700;Nsim IMAGE 1,15X,"The value given is the average over",4D," simulation S.1f I 1710 PRINT CHR$(12) ! FORM FEED 1720 PRINTER IS 1 ! TURN OFF PRINTER 1730 NEXT Ic!END OF LOOP ON CSQ VALUES I 1740 NEXT Il!END OF LOOP ON LAMBDA 1750 END 1760 SUB NegativbiCK,P,X) 1770 ! GENERATES A RANDOM VARIABLE X FROM A NEGATIVE BINOMIAL I 1780 ! DISTRIBUTION WITH PARAMETERS K AND P. 1790 RANDOMIZE 1800 U=RND 1810 x=o I 1820 Sum=O 1830 Prob=P ..... K 1840 Sum=Sum+Prob I 1850 IF Sum>U THEN GOTO Endit 1860 IF Sum(=U THEN 1870 Prob=(C1-P)*(K+X)/(X+l>>*Prob 1880 X=X+l I 1890 1900 END IF GOTO 1840 1910 Endit: SUBEND I 1920 SUB Poisvar(L,X) 1930 ! GENERATES A RANDOM VARIABLE X FROM A POISSON 1940 ! DISTRIBUTION WITH PARAMETER L. 1950 RANDOMIZE I 1960 U=RND 1970 X=O 1980 Sum=O I 1990 Prob=EXP(-L) 2000 Sum=Sum+Prob I
I I APPENDIX IX INSTRUCTIONS FOR I IMPLEMENTING :EROCEDURES GIVEN IN EVALUATION OF ENTRAINMENT ABUNDANCE I .SAMPLING DESIGNS I The analysis described here must be performed separately on each life stage c of each species of interest. Thus 2 should, in principle, be indexed or subscripted by the life stage and species being analyzed, but this nota-I tion will be omitted for convenience. The same is true for all of the computations detailed in these instructions.
- 1. The first step is to estimate c 2 by the estimator ~2:
I n +
~2 = i~1(Xil+Xi2)(Xil~X12-1)
I n 2i~lXilXi2 2 (1) I where [ J+ is the positive part function: [zJ+ = max(z,O). If the expression in square brackets in (1) is negative, then the estimator c2 becomes O. In formula (1), n is the number of p.s.u. 's in which I replicate sampling has been performed, and XiI and Xi2 are the adjusted counts of the life stage and species under consideration in the two replicate samples taken in p.s.u. i, i=l, *.. ,n. These adjusted counts I are obtained for each replicate sample as the product of the observed density (in organisms per cubic meter) in that sample and t~e average volume of the two replicate samples in the pair: I XiI = (XiIlvil)(vil+vi2)/2 (2) I X12 = (Xi~vi2)(vil+vi2)/2 where XiI and Xi2 are the actual organism counts in the two replicate I samples and 1 v' l and v. 2 are the corresponding 1 . volumes of thE! two samples in p.s.u. i. It is assumed that the two replicate samples in any p.s.u. I have volumes that differ by no more than 10%r in other wordS, the ratio of the larger sample volume to the smaller is at most 1.10. Sample vol-umes may vary without restriction from one p.s.u. to another. I The number of p.s.u. 's in which X +Xi2 (or equivalently X11 +Xi2) exceeds il o must be at least 24 in order for this procedure to be used without I further concern about its accuracy. It is important to note that the c entire estimation process described here depends on 2
- Replicate obser-vations in which XiI = Xi2 = 0 contain no information bearing on the I estimation of c2 for the life stage and species of ichthyopll9.nkton being I
IX. 2 I considered; there must be sufficiently many replicate observations (at I least 24 pairs) in which Xil +Xi2 exceeds 0, which means that Xil' Xi2 ' or both are greater than O. Unless this condition is met, the procedures presented here may estimate c 2 with insufficient accuracy. For example, I in the worst possible case, a particular life stage and species might have X = X = 0 for every p.s.u. in which replicate sampling occurs; il i2 there is no way to use these methods in such a case. If there are in-I sufficient replicate observations for the accurate estimation of c 2 , the life stage and species will require further consideration in order c to arrive at a value of 2 . Methods for doing this are being developed. I Methods for using triplicate data, in which three samples are taken in a given p.s.u., in the formula for 82 are also being developed. I
- 2. The next step is to treat the seasonal total number of organisms (of a specified life stage and species) entrained, T. Three cases must be considered: continuous sampling, continuous sampling with a small number of missing p.s.u. 's, and noncontinuous sampling.
I A. Continuous sampling I The estimate oi' T is
'" N I
T = LN T.l. L i=l (V./v.)X.
- l. l. l.
where V.l. = i=l voJ.ume of water in discharge channel in p. s. u. i (that is, I the volume of water flowing past the sampling location in p.s.u. i) I
- v. = volume of water sampled in p. s. u. i X.
1 1.
= number of organisms collected in sample in p. s. u. i I
N = number of p. s. u. 's in the entrainment season
'"T. = (V./v. )Xo = estimated number of organisms entrained in I
- 1. :L 1. l.
- p. S.u. i If replicate si~pling has been performed in p.s.u. i, then vi and Xi I
are totals over the replicates, that is, vi = vii + v i2 = volume Ofl.osample 1 + volume of sample 2 in I
- p. S.u.
Xi = Xil + X i2
= organism count of sample 1 sample 2 in p.s.u. i + organism count of I N will be greater than n (of step 1, formula (1)) unless replicate samp-ling is performed in every p.s.u.
I I I
..._,1____... 1_
I IX. 3 I A The estimated variance of T is I (4 ) where I (5) I (6) I The estimated coefficient of variation of T is eV(r) = [Var(r) ]1/ 2/T I Confidence intervals are of the form T.:!: b[Var(T)]1/2 (8) I where the coefficient b that gives an approximate 90% confidence inter-val is given by I N b 6 3.0 I 8 16 or more 2.5 2.0 I The value of b that gives 95~ confidence intervals is being calculated. B. Continuous sampling with a small number of missing p.s.u. 's I If the p.s.u. 's are ordered chronologically, assume that there is a gap in sampling between p.s.u. 's m and m'. For example, if a single p.s.u. is missing, then sampling occurs at p.s.u. 's m and m' = m+2 and there is I no sampling at p.s.u. m+l. If two consecutive p.s.u. 's are missing, then m' = m+3 and no sampling occurs at m+l and m+2. To adjust for the missing p.s.u. 's, the terms from i~+l to i=m'-l in the sum in formula I A (3) for T must be replaced. The replacement terms will be linear combina-tions of the counts X. from nearby p.s.u. 's in which sampling does occur.
~
I Three methods o~replacing the terms for p.s.u. 's in which there is no sampling will now be described. The first of these methods is the simplest to calculate, but this is because it does not utilize knowledge I of V. in missing p.s.u. 's; it is based on the procedure of finding the
~ "
estimate T. for each missing p.s.u. by linear interpolation between T A A ~ m and T ,. The second method improves on this by interpolating linearly I m between V.~mm(X Iv ) = (v.lv)x l:mm and Vi(Xm,Iv,) m = (v.lv
~m ,)Xm, to find T.~
for each missing p.s.u. The third method is much more general, allowing I I
IX. 4 I the estimate '"T. for each missing p.s.u. to be any linear combination of I l the observed X.'s in the p.s.u. 's during which sampling occurs. l practical reasons, the second of these three methods seems the best For I choice. Method 1. In the sum of formula (3) that gives T, leave the term T.l I unchanged unless p.s.u. i is adjacent to one or more missing p.s.u. 's (that is, p.s.u.'s in which sampling does not occur); when this happens at p. s. u. i, whic:h is sampled, replace '"T. in (3) by I l (a) when p.s.u. i is followed by missing p.s.u. 's up to (but not including) p.s.u. m', at which samp- I ling occurs, and p.s.u. i is not preceded direct-ly by missing p.s.u. 's; I (b) 2'1 ( i + 1 - m")'"Ti when p.s.u. i is preceded by missing p.s.u. 's back to (but not including) p. s. u. m", at which samp-ling occurs, and p.s.u. i is not followed direct-ly by missing p.s.u. 's; I (c) when p.s.u. i is preceded by missing p.s.u. 's back to (but not including) p.s.u. m" and is followed I by missing p.s.u. 's up to (but not including) p.s.u. m', at both of which sampling occurs; I remove '"T. from (3) for all p. s. u. 's i in which sampling does not occur. l F'or convenience of notation, let L:: denote l summation over all p. s. u. 's I in which sampling occurs. We can express '"T as
'"T = L."'<;"'+. a.,.
l l i'r' l I where 1 when p.s.u. 's i-l, i, and i+l are sampled I
- a. =
l
.!(m'+l-i) 2 when p.s.u. 's i-l, i, and m' are sampled but i+l, i+2, .... , m'-l are not I ~(:i+l-m") when p.s.u. 's mIt, i, and i+l are sampled but m"+l, m"+2, .... , i-l are not I .!(m' _mIt) when p.s.u. 's mIt, i, and m' are sampled but 2
m"+l, .... , i-l and i+l, . " ' , m'-l are not I 'I'he formula for Var(T) is, using the constants a. just defined,
... + .~.
2"" A
- a. Var (T. )
l (10) I l l l I I
I I Method 2. In the sum of formula (3) that gives T, leave the term T.
~
unchanged if p.s.u. i and the adjacent p.s.u. 's i-l and i+l are all I sampled. Omit the term '"T. from the sum if sampling does not occur in p.s.~ i.
~ .
If p.s.u. i is adjacent to one or more missing p.s.u. IS, replace "T. in (3) by a.T., A giving I ~ ~ J. (11) I where 1 when p. s. u. 's i-I, i, anc~ i+l are sampled I :i:l(::=~)(V~Vi) when p.s.u. 's i-l, i, and m' are sampled but 1+1, ... , m'-l are not I n a 1 = j=!"+l(t:;;)(VIVi ) when p.s.u. 's m , i, and i+l are sampled but m"+l, **. , i-I are :not I ~" j=m +1 (ij-m;;)(V!V.) + m~-l(m:-~)(v./v.)
-m 3 1. j=i+l m -1. J ~
I whenp.s.u.'sm", 1, andm' are sampled but m"+l, *.. ,1-1 and 1+1, ... , m'-l are not I A ..... The formula for Var(T) is (10), but using the constants a. defined for Method 2 just below (11). ~ I Method 3. Let!:; denote summation over all p.s.u. 's in which sampling occurs and zj summation over all p.s.u. 's in which sampling does not I occur. For any p.s.u. j in which sampling does not occur, we can estimate the number of organisms entrained during the p.s.u. by a linear combina-tion of the estimated entrainment T. in sampled p.s.u. IS: I 1. (12) I The constants b ij may be chosen to give the analysis of Method 1 or 2 above, or they may be chosen to give some other analysis deemed suitable. I Then A N A T':!: T.=Z.T. +I:.Tj
+" _"
i=l 1. 1. 1. J I
= Z.+J. a.T.
I A
]. ~
(13) where (14 ) I I
JX.6 I
'" is (10), but with the constants a. defined in
'J'he formula for "Var (T) J.. I (14) rather than those defined for Method 1. vlith the appropriate formulas-for '"T and '" '" for whichever of the three Var(T) I methods is used, the estimated coefficient of variation and confidence intervals are computed from formulas (7) and (8). I G. Noncontinuous sampling (of a systematic nature)
- ~t L be the number of days in the entrainment season and let h be the I number of these on which sampling is performed. A sampling day is de-fined to be a 24-hour period in which samples are drawn from the discharge channel; this period is sampl.ed as a sequence of N' p. s. u. 's (N' = 8 for 3-hour p. s. u. 's, N' = 24 for I-hour p. s. u. 's, etc.). The I
- remaining L - h unsampled days may form gaps of several different
- lengths. I
- Let d. be the da.y of the season on which the j th sampling day occurred.
J For example, if sampling occurred on days 1, 8, 12, ... then dl=l, d 2 =8, I (i =12, and so on. For each day d., the estimated entrainment on that 3 J day is given by formula (3) with N replaced by N', the number of p.s.u. 's in the day, where the volumes V. and v. and the counts X. are from the J..]. ]. I J? s. u. 's in that day. The estimated variance of this daily entrainment J
'" A estimate T., which is denoted by Var(T.), is given by formula (4) with J
I N again replaced. by N', using the volumes and counts from the p. s. u. 's in day d .. J I
'I'he season has now been divi£ed intQ a ~equence of days. For each day on which sampling occurred, T. and Var (T .) are known. All remaining days J J are days on which sampling did not occur. The analysis of the I
case of continuous sampling with missing p. s. u. 's is directly applicable to noncontinuolli: sampling by making one adjustment: days 1,2, ... , L must be used instead of p.s.u. 's 1,2, ... ,N of the entrainment season. With I this done and with '"T. and '"Var (T.) A known for each sampled day, we can use J J interpolation o:f the sampled days to estimate entrainment on the days when sampling dj.d not occur; this is exactly what was done for p. s. u. 's I in the case of eontinuous sampling with missing p.s.u. 's. Methods 1, 2, and 3 for continuous sampling with missing p.s.u. 's there-I fore carry over to the situation of noncontinuous sampling, except that the analysis here will be based on 24-hour sampling days rather than on shorter p.s.u. 's. The formulas already given for these three methods I are directly apJ)licable to noncontinuous sampling. An alternative ifaY to express Method 1, which is equivalent to the approach just described for noncontinuous sampling, is the following: I I I
I IX.7* I Define the weights I for j=l (d j +l - d j _l )/2 for j=2,3, ... ,h-l (15) I (1 + ~ - ~_1)/2 for j=h I The estimate of seasonal entrainment is then A h A T = .1: WjT * (16) I J=l and its estimated variance is J I A A Var(T) = 1: ~Var(Tj) h j:::l J A A (17) I A This method. is based on the use of linear inter:p~lation of Tj'S for days on which sampling occurred in order to obtain a T. for each day on which J I there was no sampling. It can therefore be improved in ways analogous to Methods 2 and 3 discussed for continuous sampling with a small number of missing p.s.u. 's. I 3. The next step is to treat the entrainment reduction due to outage and the potential entrainment under full flow conditions. For any p. s. u. i, let Vi denote the realized discharge volume and let Vi + Vi denote the I potential discharge volume under full flow conditions. Similarly, let Ti denote the realized entrainment in p. s. u. i and Ti +Ti the potential I entrainment when the discharge volume is V.1 + V!1 . The seasonal total reduction in entrainment, T', is estimated by I A T' = N 1:: 1=1 i A T' (18) where I A A T! 1
= (Vi/Vi)Ti (19)
For any p.s.u. in which sampling occurs, the estimated variance of T' I is obtained from the formula i (20) I A ~ A A Note that if X. = 0, then T. = T!1 = 0 and Var(T!) = O. I 1 1 1 Break the sum of formula (18) into sampled and unsamp1ed p.s.u. 's, if there are any of the latter, along the lines of formula (13): I AT'
= i=l N A, t T. = ~+ a,AT , (21) 1 ""1 1 1 I
IX.S I I
" A
~'he estimated variance of T' is then given by A A + 2" A (22) var(T') = r,.1 a! 1 var(T!) 1 When there are no missing p.s.u. 's, this reduces to I A A N A A var(T') = .~ Var(T!) (23) J.=l 1 'Nhen missing p. s. u. 's are present, formula (21) can be evaluated by I substituting specific expressions into the sum over missing p.s.u. 's
~j "'I'j. To do this, first find "Tj for each p. s. u. j contained in a sam-pling gap, that is, for each p.s.u. j in which no sampling data are ob-I tained. Let m and m' denote the (sampled) boundaries of the sampling gap containing p.S.l1. j, so sampling occurs in p.s.u. m and then in p.s.u. m' but not in any of" the p.s.u. 's between these, including j. For Method 1 I (with continuou:3 sampling and a small number of miss ing p. s. u. 's), "T j = (m' -j')T m' -m m + (- j-m m'-m- )'"Tm' (24 ) I and for Method :2 T.J = (m: -j )(V'/V)T + (j,-m )(v.lv ,)T , (25)
I m -m J m m m -m j' m m Then derive Tj "by using (19): 10'. = (V'./V.)T .. (19) and (25) this way gives J J J J For Method 2, combining I T'.J = (m;-j)(V/V)T m -In. J m m_
+ ( j,-m )CV'./V ,)10 ,
m -m J m m (25') I Substituting these expressions for each missing p.s.u. j into (21) yields explici.t terms a! in the right-hand side of this formula. The 1 I estimated variance of the resulting "T' is then found from formula (22). The estimated coefficient of variation of "T' is I (26 ) I The potential total entrainment over the entire season, T + T', is estimated by using T from formula (3) and T' from equation (lS): A I T + T' :: "T + "T' = i=l N"
~ (T.
1 A
+ T~)
1
= i=l N ~ -
T.-+T! 1 1 I For each sampled p. s. u. i, A ----............. Var(T.1+T!) 1
= Var(T.)
A 1
"I"\. + Var(T!) 1 + 2(V!/V.
11
)Var(T.)
1 (2S) I These estimated variances for individual p.s.u. 's can be combined to give the estimated variance of potential total entrainment over the entire season . When there are no missing p.s.u. 's, summing (2S) over I i=l, ... ,N gives
" -- ......... A ;'\, A.
Var(T + T') = var(T) + Var(T') + 2.~ (V:/V.)Var(T.) A N I"\. A I 1=1 1 1 1 (29) I w,z _ _ a_ilW i~i._""'.i
I IX. 9 I When missing p.s.u. 's are presen~, write
........-...... NA NA = i=l I T +T' 1: T. + 1: ~ i=l T! ~
I ~xpress A Tj for each missing p.s.u. j as a linear combination of values I Ti for sampled p.s.u. 's i; for example, formulas (24) and (25) do this A for Methods 1 and 2. Similarly, express T~ for each miSSing p.s.u. j J by applying formula (19) as described in the discussion of evaluating I formula (21) when there are missing p.s.u. 'so Adding the terms for actual and potential entrainment over both the sampled and the missing p.s.u. 's in formula (30) produces I + ,A,
+ 1:.~ a.T ~ i (31)
I A A It is important that only the T. and Tj terms from the right-hand side of (30) must be used in constr~ting the sum 1:~ a.Ti in (31), and that A ~ ~ I only the T! and Tj terms from the right-hand side of (30) must be used in constr~ting the sum 1:~ a!Ti in (30). This segregation of terms is
~ A A A A I necessary because of the difference between var(T i ) and var(Ti).
variance contribution corresponding to p.s.u. i is estimated by The A A I + ai2var (Ti) + A A 2a a ' (V!/V. )Var(T.) (32) i i ~ ~ ~ I Summing these over all p.s.u. 's i in which sampling occurs, we have the estimated variance over the season I (33) I The coefficient of variation of T +T' is estimated by CV(T:"?) = [var(~ )]1/2/ c;:?) (34 ) I 4. If mortality rather than entrainment is to be estimated, a mortality factor Fc must be included in the calculations. This mortality factor I is affected by plant operating conditions, such as the number of pumps operating, water temperature, and so on. Consequently, F changes from c one p.s.u. to another, and F during a given p.s.u. is affected by I c whether the plant is operating at full flow or at reduced flow. Accord-ingly, the mortality factor Fc must be indexed by the p.s.u. i and also I by the flow condition or volume. Let F c,~
. denote mortality in p.s.~ i I
IX. 10 I at actual flow conditions and F + . denote mortality in p.s.u. i at full c,~ flow conditions. These will be equal when the plant is operating at full flow during p. s. u. i. Assume that F . and F+ . are known constants for I c,~ c,~ each p. s. u. i. ~'he formulas here for entrainment become estimates of mortality when F . and F+ . are inserted into the estimation process in c,~ c,~ I a'[?propriate fashion. It would be more realistic to treat F . and F+ . as unknown parameters, which must be estimated, and to factor the variance and bias of this estimation process into the mortality analysis (Vaughan c,~ c,~ I and Kumar, 1981). Because of the complexity of this task, it will not be addressed here; F . and F+ . will be assumed to be known constants. c, ~ c, ~ I ~'o illustrate this, let M denote the realized mortality during the a.ctual entrainme:n.t season, and let M +M' denote the potential mortality d.uring the season when the discharge volume is V. + V~ (rather than the I
~ 1 realized discharge volume V.) in p.s.u. i, i=l, ... ,N. ~
difference between realized mortality and the potential mortality dur-Then M' is the I ing the season at discharge volume V. +V! in p. s. u. i. For continuous samp l i ng (*th e W1 .. no m..J.ss1ng p. s. u. , s ) , 1 ]. I A N A M::: ~ F .T. (35 ) i=l C,1 1 I M+M' N
= i=l~ F + A . (T. + T!)
c,].]. A
].
(36 ) I A N + A + A N A M' = i=l~ (F .T. + C,1]' F .T!) C,]. ].
~ Fc,]..T.].
i=l I (37 ) I All three of these estimates have estimated variances that fOllow directly from t:ne methods developed here, us ing (32) , (20) , and (5). A A N A A I Var(M) =.~ F 2 .Var(T.) (38) 1=1 c,]. ]. var(M:'M~) ~ F+ 0/[1 + 2VVV. ]Var (T.) + Var (T!)) I
= (39) i=l c,i~ ].]. ]. 1 A A Var(M' ) = ¥~F+ . - FC,1. )2VarCT.)]. + F+ ~var(T!) +
I* i=Jt C,1 C,l 1 2CF+ . -F C,~ C,~
.)F+ .(V! / V.)var(T.)
c,~ ~ 1 A A 1 J (40) I The adjustment~: for continuous sampling with a few missing p. s. u. 's or for noncontinuous sampling are straightforward. For example, I A + A A M= L
~
F c,~
.T. ~ + ~~ F J c,J J .T.
_ + (41) I
~ + + "- _ + A + + A A
M+M' =~. F .T. + ~. F .T. + ~. F .T! + ~. F .T'.* (42)
~ c,~ ~ J c,J J ~ C,11 J c, J J I
I IX. II I I multiplied by the AT., " 1. Equations (41) for M, (42) for M+ M', and the corresponding equation for "M' differ from equation (30) for T + T' only in having constants T.,"T!, and "Tj' terms. Carrying these constants J 1. through the analysis of equations (30) to (34) results in estimated I ,,--- variances and coefficients of variation of the mortality seasonal totals M, M+M', and '"M' when there are missing p.s.u. 's or sampling is noncontinuous. The constants used in these seasonal mortality estima-I " "At '" tors to multiply the Ti' Tj' Ti, and Tj terms are different for each of the three mortality quantities (M, M+ M', and M'). All three of these sets of constants differ from the constants in (30) to (33), I since they must now reflect the presence of the mortality f~ctors F . and F'!' . throughout. C,l. C,l. I 5. We now extend the work just done in Sections 3 and 4 of this appendix. Our goal is to obtain an estimate of each of the six quantities in the table I Actual Reduction Potential Entrainment Mortality T M I ~: I T+T' M+M' I The estimated variances of the estimates of these quantities are also of major interest. "Actual" refers to actual flow co~ditions, "Reduction" refers to the reduction in entrainment or mortality due to the plant's I operation at reduced flow rather than full flow, and "Potential" refers to the level that entrainment or mortality would have assumed if the plant had operated at full flow throughout the entrainment season. I Several complicating factors will be dealt with in the analysis of this section. These stem from the fact that each plant consists of two units I that operate independently of each other. At Bowline Point, each unit has a separate cooling water system. At Indian Point, each unit has a separate intake structure, but there is a common discharge channel. The two units at Roseton share a common intake and discharge pipe. (Units I 2 and 3 at Indian Point are considered here, but Unit 1, which has not operated for commercial production since OCtober 1974, is not. This simplifies the presentation, allowing us to restrict attention to the I two-unit case; however, handling three or more units in a plant would not be a problem, as we now note.) The treatment here of two units can be generalized in straightforward fashion to apply to a plano; consisting I of three or more units. The complications related to the two-unit structure of each plant are: I 1. Mortality rates may differ between the two units. Let F .. and Cl.J denote the mortality rate Fc in p.s.u. i at unit j under actual flow
~'j Cl.
I conditions and full flow conditions, respectively. It is possible that Fci1 F Fci2' F;il F F;i2' or both in any p.s.u. i. I I
IX. 12 I
- 2. The volumes of cooling water flowing through the two units during a I
specified p. s.u. may differ. Let V.. denote the volume of cooling lJ water that . a.etually flows through unit j in p.s.u. i, and let V. 1 I denote the volume of cooling water passing through the whole plant in p. s. u. i, so (44 ) I Then it is possible that Vii and V are unequal. In particular, if i2 I unit j is not in operation in p. s. u. i, then V.. = O. In this case, lJ the entire plant's cooling water flow for p.s.u. i passes through the other unit j' (j' 1= j), that is, V. = V.. , . I 1 lJ
- 3. Estimates both for the whole plant and for each unit within the plant are desired. I To address these issues within a unified framework, we consider the nature of the cooling 1Nater flow through a plant's two units and the discharge channel. The tw*o units may have either a cormnon intake structure or two I
separate intake structures. Similarly, they may have either a common discharge channel or two separate discharge channels (at the location of the entrainment sampling gear, which is the position of importance I in any discussion of sampling). All water flowing through the plant during a specified p.s.u. can be classified as belonging to either the portion of the plant flow from which the entrainment sample is drawn or I a portion of plant flow excluded from the volume from which this sample is drawn. Any volume of water that might have flowed through the plant but did not, because of reduced flow rather than full flow operation in I p.s.u. i, is excluded from the volume out of which the entrainment sample is drawn. For example, recall that Bowline Point has two separate cool-ing systems; if a sample is drawn in p.s.u. i from the unit 1 discharge channel, the srunple comes from the actual unit 1 discharge volume Vii' I while the actual unit 2 discharge volume V and unrealized discharge volumes Vii and Vi2 are unsampled. i2 (The unrealized discharge volume at I unit j in p. s. u . i, denoted by V!., is the difference between the full lJ flow volume that unit j could have in p. s. u. i and the actual flow V.. ' ) lJ I Roseton, on the other hand, has a common intake and discharge pipe, so both actual discharge volumes Vii and Vi2 are sampled; of course, the unrealized discharge volumes Yil and Vi2 are unsampled. I The key point of this discussion is that there is a crucial distinction, for entrainment and mortality estimation purposes, between the (actual) I discharge flow from which the entrainment sample is drawn in p.s.u. i and the discharge flow, either realized or unrealized, that is excluded from the sampling process in p. s. u. i. The sampled flow is analyzed I o A A A A like the T. terms in T and Var(T) in equations (3) to (6); the unsampled 1 A A....... /'\. flow is analyzed like the T~ terms in T' and Var(T') in equations (18) 1 I
I IX. 13 I to (20) and (23); combining these to get T +T' follows the pattern of equations (27) to (29). Starting from this set of principles, we can incorporate the necessary adjustments for mortality estimation and I for missing p.s.u. 'so We now introduce some needed notation. I For each p.s.u. i, define
= volume of discharge flow in p.s.u. i from which entrainment I sample is drawn U
ij
= volume of discharge flow in p.s.u. i at unit j from which I u'i entrainment sample is drawn = volume of discharge flow in p.s.~ i excluded from entrain-ment sampling process (both realized and unrealized flOW)
I U 1j = volume of discharge flow in p.s.u. i at unit j excluded from entrainment sampling process (both realized and unrealized flOW) I Vi = volume of actual discharge flow in p. s. u. i Vij = volume of actual_~isch~rge flow in p. s. u. i at unit j I V~
~ = volume of unrealized discharge flow in p. s. u. i I V1j = volume of unrealized discharge flow in p. s.u. i at unit j It follows immediately from these definitions that I Ui = Uil + Ui2 U'i = U11 + U12 I V.~ = ViI + Vi2 (45)
I V!1 = ViI + Vi2 Uij + Uij = Vij + V1j I Ui + U'i = V.1 + V' i I We further define T~ = (U~vi)Xi = estimated entrainment in the sampled portion of the actual discharge flow in p.s.u. i I AU Ti = ([V.-Ui ]/ Vi)X. = estimated entrainment in the unsampled portion 1 1 of the actual discharge flow in p.s.u. i I I A To;
= T* + Ti = (V. v.)X. = estimated entrainment in the actual dis- "8 1
AU J.:~1 ch arge fow l 'ln p.s.u. 1. AS Tij = (u ../ v.)X. = estimated entrainment in the sampled portion of I 1J 1 1 the actual discharge flow in p.s.u. i at unit j I
IX.l4 I T~. = ([V .. -u .. ]/v. )X. = estimated entrainment in the unsampled 'Oor-1,J 1.:' 1J 1 1 tion of the actual discharge flow in ~ I T .. = "s 1J' T .. ;. "u 1J T .. = (V 1J
./v. )x.
1 1 p.s.u. i at unit j 1
= estimated entrainment in the actual l 'ln p.s.u. 1. a t linl*t j d'1SC h arge fow I
The sample volw~ v. and the organism count X. are totals over the I replicates in p.s.u~ i, as specified below eqUation (3). To distinguish between entrainment at the whole plant and entrainment at an individual illlit over the entire season, we use the notation I
"T = estimated seasonal entrainment at whole plant I "T(unit j) = estimated seasonal entrainment at unit j "T' = estimated seasonal reduction in entrainment at whole I plant "T' (unit j) = estimated seasonal reduction in entrainment at unit j I
~:;ontinuous sampling: a.nd similar ly for T + T' and the corresponding mortality terms. entrainment estimation, whole plant I For the entire plant, including both units, I
" NAN "s AU T = Z T. = Z (T. + T.) (46 )
i=l 1 To evaluate this, observe that i=l 1 1 I A AS . Var(T.) = (U. v.) Var(X.) 1 1
/
1 21"0. 1 I I (48) I where "Var(X.) is given by equation (6). Both of the estimated variances 1 in (47) and (48) are 0 if X. = O. 1 Then the estimated variance of "T is I
= ]'if A L: var(T. )
i=:l A 1 o I (V./v.)2Var 11111 (X.) + ([V.-U.J/v.)(V';U.)X. 1 111 (50) I I
I IX. 15 I The reduction in entrainment for the whole plant, T', is estimated by
" N" = i=l I T' L T' i
where (51) and the variance of the estimate T!1 is estimated by I "A NA Var (T') = .Llvar (T! ) A (52) 1= 1 I A A where var(Ti) is given by equation (20) with Ti replaced by T *
" A S i
I Finally, we consider potential entrainment, T + T': A A N A A N
= T + T' = i=l L (T + T!)
i ~ J.=l [(U1:'/vi)X.1: + (U~/vl)XJ.']
= .L I = ~ [T S u + (T i + T !)]
(53) i=l i 1 I Using the fact that (U!/Vi)([Ui+U! J. J/U.1:1:
)X. + (U!/v. 1:J. )2 var (x:t ) (54 )
I J. A which follows from equation (20) with T. and Ti replaced by Ti and A AS A A J. I T~ + Ti' respectively, we see from (28) that A ---........ A A A A AS var(T.J. +T!) = var(Ti + T!) = [1 + 2(Ui'/u.)Jvar(T.) J. + I 1 1: J. A (AU A Var T.+T!) (55) 1: J. I and thus
'" ~ N",~-
Var l T + T') = L Var l Ti.. + T! '1 ) *(56) i=l 1: I An alternative form of equation (56) is given by I A ---..... Var(T+T') = iLl Nt A
= ([u.+U!J/v.)2var 1: J. 1:
(X i ) + (u!/v.)([U.+ui'J/u.)x. 1: J. J. ~ J.'t (57) Continuous sampling: entrainment estimation, one unit I We now present the formulas providing the results analogous to those developed in equations (46) to (57) but for only one of the plant's I two units. N A T(unit j) = L T.. (58) i=l I J.J (59) I I
IX. 16 I A AU Var(To 0) lJ I (60 ) I Var(TloJo) :;:: (Vo .IV. )2Var(x.) + lJ 1. 1. (VirUij~(Vij+Ui-Uij~x. V. U. 1 (61) I
- 1. 1.
A " N" Var(T(un:Lt j)) :;:: .!: Var(T .. ) 1.=1 A lJ (62) I A N A
.IV.1. )T.1.
A (v~ T' (unit ,j) = 1.=
.I:
l T~ 1.J 0 where T~ 1.J
. = =
1.J (V~ ./Vo )X. (63 ) I 1.J 1. 1 Var CT' (unit
" A j>> =
N" i=l A
!: Var (T~ .)
1.J (64 ) I A A A " where Var (T~ 0) is given by (20) with T~ replaced by T~ . and T. by T .. 1.J 1. 1.J A 1. AS 1 I ___ A A NA A T + T' ( unit j) = T ( unit j) + T' ( unit j) N
= I: ( T, . + T ~ .)
i=l 1.J 1.J I
= !: [(U, ./v.)X. + (U~ ./v.)X.] =
i=l N "S lJ 1 1 AU L: [T,. + (T .. + Tl~J')] A lJ 1 1 I i=l lJ 1.J A From (20) with T~ replaced throughout by T .. +T~ . = (U~ . v. )X., 1. AU A 1.J lJ
/
lJ 1. 1 I A Var(TAU A
.. +T! .) = (U! ./ v.) 2AVar(X.) + (U~ 0/ v. )([V.+U! .J / V, )X.
1.J lJ 1.J 1. 1. 1.J 1. " 1 1.J 1. 1. (66 ) I This leads to A _ __ Var(T+'r'(unit j)) = i=l N "{ ([u .. +U ' I: . .J/v.) 2'"Var(X.) + I lJ lJ 1. 1 (U~ .Iv. )([V.+U! .J/v. )x.} 1.J 1. 1. 1.J 1. 1. I Continuous sampling: mortality estimation, whole plant A N A I M = I: M. (68) i=l I 1 where
= ([FCl'lU*l+F 1 C1.'2U'2J/v.)X.
- 1. 1. 1. + ([FC1.'l(V'l-U, 1. 1. l ) +FC1.02(v.1 2 -U.1 2 )J/v.)x, 1. 1.
I (69) I with I
I IX. 17 I G~ = [Fci1Ui1 + Fci2Ui2J/Ui (70) I G~ = [Fci1(Vi1-Ui1) + FCi2(Vi2-Ui2)]/[Vi -uiJ (71)
'"Var(M.;) '" = G.s 2'"Var(T.) "'S + G.~'"Var(T.) "'U S U '" "'s ) (72)
I ""
* ~
N" '" 1 1 1
+ 2G.G. ~ 1
([V.-U. J/u.)var(T 111 i Var(M) = ~ Var(M.) (73) I i=l l. For potential mortality: I --" M+M' N
= i=l ~
M.+M! 1 1 (74 ) I I = + ([Fci1Uil +
+ Fci2Ui2 J/)
vi Xi + ([F;il (Vil+Vil- Uil ) + F;i2(V i2+Vi2~Ui2) J/vi)X i I (75) I with S Hi = (76) I (77) I var~) A ___ I Var(M +M') (79) I For reduction of mortality: A M'
= M+M' - M=
A N
~
SAS r:r T. U AU A
+ L(T. +T~)] (80) i=1 i l. ~ 1 1 I where I~ = HSi _ GS (81) ~ i I I~ = H~ - G~(Vi - Ui )/~i (82) ~"
I "" N s" "s . . . u ..... s u Var(M') = i~1[Ii2Var(Ti) + Ii Var(Ti+Ti) + 2I i I i (u.i/u i )var(Ti )](83)
..... "s In the above formulas, the squares of the coefficients G~, H~, I~, etc.
I are written without parentheses or brackets:
. 1 1 these squares appear as 1
G~e, H{2, I~2, etc. rather than as* (G~)2, (H~)2, (I~)2, etc. I
IX. 18 I Continuous sampling: mortality estimation, one unit I In the analysis of continuous sampling mortality estimation for the whole plant, formulas (68) to (83), omit every term in which the unit not of interest appears explicitly. For example, if unit 2 is of I interest, then terms involving unit 1 should be omitted, which causes (69), (70), and (71) to become
'" "S "u I
M.1 = FCl'2(T' 12 + T.1 2 ) (FCl~ 1 1 + (FCl'2[V'2-u'2J/V,)X,
")U.lZIv.)X. 1 1 1 1 I
G~T~ 1 1
+ G~~
1 1 (84 ) I with G~1 = FCleo'f")'J, IU, lZ 1 and G~1 = F '2(V'2-U'2)/(V, -u,) Cl 1 1 1 1 (85) I When these values of the coefficients (the G's, H's, and I's) are used, we obtain the mortality estimates for the single unit of interest. I Noncontinuous sampling: entrainment estimation Here we treat both sampling cases of concern: continuous sampling with I a small number of missing p.s.u. 's, and noncontinuous sampling of a systematic nature. The two situations of whole plant estimation and single unit estimation are, similarly, variations on the theme that will I now be discussed. When there are some p.s.u. 's in which sampling does not occur, estimates of entrainment in the missing p.s.u. 's must be constructed from the I data pertaining to the sampled p.s.u. 'so For example, equation (3) can no longer be used to determine T; it must be replaced by (9), (11),* or I (13). Extending this idea to the analyses of entrainment and mortality for the whole plant and for a single unit, analogous changes must be made in order to handle missing p.s.u. 'so For instance, equation (46) for continuous sampling, whole plant entrainment estimation must be I replaced by an equation of the form "T = ~.
+ a,T.
111
'" =~. + [a.T.
111 s"'s + a.T.]
~u 1 1 (86 )
I Correspondingl.y, equations (51) and (53) for '"T' and T +T' must be replaced by eq.uations of the form I
+ ",.
A T' =~. a.. T~ 111 I
---T+ T' = ~l',+ [a1,T u "'U + ")J sAs, + a,(T, T~ (88) 1 1 1 1.
The method of formulas (31) to (33) can then be used to obtain the I variance of any of these estimators. No conceptual problems arise with I I
IX. 19 this approach: we select a subset of the N p.s.u. 's consisting of those p.s.u. 's during which entrainment sampling occurred, then decide on weights to combine with these data in order to obtain estimates of entrainment over N p.s.u. 's from the smaller number of
- p. s. u. 's that were sampled, and finally compute variances a.long the lines of formulas (32) and (33).
Noncontinuous sampling: mortality estimation As with entrainment estimation under noncontinuous sampling, we treat simultaneously the case of a few random missing p.s.u. 's and systematic noncontinuous sampling, and also the cases of whole plant and single unit estimation. The general method is very similar to theLt just presented for entrainment estimation; the only difference j.s that the mortality factors Fc~01' F+.c~ 2
, and so on must be included in the calcu-lations, which affects the values taken on by the weights EL.. For example, with missing p.s.u. 's (68) becomes ~
In the same way,
---- N + AS AU A M+M' = i~l[FCil(Til+ Til + Til) + SAS U AU ..... = L.[a.T. ~ ~ ~ + ai(T. +T!)J ~ ~
The mortality factors are reflected in the values of the weights a.s and
~
u a . Once these weights have been determined, we proceed exactly as in i the estimation of entrainment.
I I I I I I I I I I I I I I I I I I I
I I ADDEBDUK I to EVALUATXOH OP EHTRAXNHEHT ABUHDAHCB SAKPLXHa DESXGNS I I I Prepared Under Contract with I Consolidated Edison Company of New York, Inc. 4 Irving Place New York, New York 10003 I Jointly FUnded By I Central Hudson Gas and Electric corporation Consolidated Edison Company of New York, Inc. I New York Power Authority Niagara Mohawk Power Corporation Orange and Rockland utilities, Inc. I I I Prepared by I steven J. Schwager, Ph.D. George Casella, Ph.D. Douglas S. Robson, Ph.D. I William D. Youngs, Ph.D. I I Ithaca, New York 14850 February 1990 I I
I I 1 I. INTRODUCTION This Addendum reviews selected aspects of entrainment abundance sampling performed I at Indian Point, Roseton, and Bowline Point Generating Stations since 1983. The emphasis is on the information obtained from these sampling programs about the pattern of variation I illl density of entrained ichthyoplankton in the discharge channel. For convenience, the term
"'discharge channel" will often be used to mean either a discharge channel or 21 discharge pipe, I whichever is present at the generating station. The sampling programs in specified years at these generating stations are discussed, with emphasis on the results as they pertain to I sampling issues. The concluding section considers what has been learned, wha.t information is still needed, and what directions would be fruitful for future work.
I Recent progress on the problem of estimating confidence intervals for entrainment abundance sampling programs is examined here. In the report to which this Addendum is I attached, the complexity of the statistical estimation problem is described in detail. presence of greater variability in ichthyoplankton counts than is consistent with a Poisson The I distribution is shown, and methodology for handling the variation is presented. 2 additional variation leads to the introduction of the parameter c in the qUII.dratic variance This I model, which is given by the variance formula (1) I The value of the parameter c 2 reflects the level of extra-Poisson variability present. This I variance formula appeared in Sections 4.2.2 and 4.5 of the report preceding this Addendum. Except for this Addendum, the report has been available in draft form since July, 1986; it has I been cited by several of the reports discussed in this Addendum as Casella et ai, 1986. I Estimating the level of variability in organism counts requires the performance of replicate sampling, in which two (or more) sampling intakes draw simultaneous samples from I each cooling water discharge under identical conditions from the discharge channel. from these replicate samples can be used to compute the estimate c 2 Data of th,e parameter c 2
- The phrase "identical conditions" implies that each set of replicate samples must be drawn at I locations within the discharge flow where exactly the same process is occurring during the sampling period. That is, the distribution of possible organism densities is identical at all I
I
I 2 I sampling locations. If the depths within the discharge flow, the distances to the channel wall, or other differences between the sampling locations result in nonidentical conditions, the I simultaneous samples will not be true replicates. The estimator (:2 is a method of moments estimator based on the variance formula (1), I and is therefore appliicable to any distribution satisfying this relationship. The negative binomial distribution is the most tractable of these distributions, and a rationale for using it I as the alternative model to the Poisson is detailed in Section 4.2.1. It is shown there that a gamma mixture of Poissons has the negative binomial distribution. On the basis of this I argument, the alternative to the Poisson is often referred to in this Addendum as the negative binomial model. However, there may also be other distributions suitable for I modeling the extra-Poisson variability in ichthyoplankton counts present when (1) holds with c2 > O. For example, one possibility is a lognormal mixture rather than a gamma mixture of Poissons. I The extra-Poisson variation results in wider confidence intervals than the standard I Poisson model. In oirder to determine how wide these confidence intervals must be, the degree of extra-Poisson variation, quantified by the parameter c 2 , must be estimated from the I data for each taxon/life stage of interest. This increases the difficulty of the sampling problem, because only nonzero replicate samples contribute information to the estimation of I c2
- Consequently, sampling must be performed in a way that .results in enough nonzero replicates to provide (!LCCurate estimation of c 2 for each taxon/life stage of interest. Unless I sufficiently many nom:ero replicate samples are available for these taxa/life stages, another approach applicable to these undersampled taxa/life stages must be taken. I The pattern of variability in the discharge channel is a major focus of this work. It is important to note that there are two distinct types of departure from the Poisson model of I
completely random di8tribution of organisms within the discharge flow are possible. are:. These I I. Stratification within the discharge channel. There may be relatively homogeneous zones I within the discharge channel, that is, zones in which simultaneously drawn observations are likely to be more similar than simultaneously drawn observations from the entire channel. I I I
I I 3 II. Clumping of organisms. Within each stratum, or within the entire discharge channel if I there is no stratification, organisms may form clumps or clusters rather than exhibiting a random Poisson distribution. I These two types of possible departure from the null hypothesis of Poisson (i.e., randomly) I distributed organisms lead to four combinations: I 1) No stratification and no clumping. These conditions satisfy the null hypothesis of Poisson distributed counts, so the parameter c 2 equals O. Any two simultaneous samples in the discharge channel will be true replicates, with a common mean Pl=P2' I
- 2) Clumping but no stratification. The distribution of clumps of organisms across the I discharge channel is random, so simultaneous samples will have a common mean Pl=P2; however, clumping produces greater variability than the Poisson distribution, 80 c2 > O.
I 3) Stratification but no clumping. Because the discharge channel cont.a.ins strata, or I relatively homogeneous zones that differ from one another, two sampling situations must be distinguished. I - If simultaneous sampling takes place within a single stratum, as occurs when two. (or more) samples are drawn from locations that are very close to one another in the discharge channel, the situation appears to reduce to 1) above, no st ratification and I no clumping. Drawing two (or more) samples from a single stratum makes it* impossible to detect the presence of other strata. I - If simultaneous samples are drawn from different strata, they can no longer be assumed to have a common mean, so Pl :FP2; this difference in means across strata I produces greater-than-Poisson variability between observed counts from different strata, so c 2 > O. I 4) Both stratification and clumping. As in 3), two sampling situations must be distinguished. I - If simultaneous sampling takes place within a single stratum, the situation appears. to reduce to 2) above, clumping but no stratification. As was just noted in 3), drawing the two (or more) samples from a single stratum makes it impossible to I detect the presence of other strata.
- If simultaneous samples are drawn from different strata, they do not have a I
I
I 4 I common mean, so J.'l =i=J.'2; both the clumping within strata and the difference in means across strata produce greater-than-Poisson variability between observed counts I from different strata, so c 2 > O. With the proper experimental set-up, the patterns outlined above for J."s and c 2 can be I detected through careful statistical analysis. among sets of randomly located replicate samples yields For example, if the magnitude of variability c2 ~ 0, then we might conclude that I c 2 ~:::: 0 and that there is no stratification. From the above categorizations, this would support the fact of the organisms being randomly distributed in the discharge channel. If, I instead, the statistical evidence supports c 2 > 0, then different situations may exist. For instance, stratification is a serious type of departure from randomness because it can lead to I biased entrainment estimates; if a carefully chosen experiment is performed under the right conditions and is analyzed under the right assumptions, stratification can be detected from I differences among mea.n densities from fixed sampling locations. Furthermore, situations other than those outlililed above can occur, affecting <;2 and tests of J.'l = J.'2' For example, an inflated value of c 2 could result from inherent errors in the measurement process (such as I extrusion, mutilation, non-detection, or misidentification). I In summary, the presence or absence of stratification, the presence or absence of clumping, and the sampling pattern of replication within or across strata (when strata are I present) must be considered in assessing the variability in the discharge channel. If statistical analyses support the <Claim that c 2 =0, then stronger conclusions can be drawn than if the I claim c 2 > 0 is supported. In this latter case determination of the exact cause of the "extra-Poisson" variability is quite important. Such determinations can only be made with carefully designed experiments. I The observations collected in entrainment abundance studies are dependent on the site I and conditions. Observations taken at different locations along the discharge channel or under different flow conditions can vary greatly. The same is true for the organism counts of I different taxa/life stages. Similarly, data collected at different generating stations can lead to very different conclusions. I Entrainment abundance sampling involves several major difficulties inherent in the I process of field collecti.on and laboratory analysis. One is that the locations of the sampling I I
I I 5 intakes in the discharge channel must be fixed; engineering constraints limit the possible I positions of the sampling intakes, and prevent moving the sampling intakes in order to randomize their locations during the entrainment season. The high velocity of the discharge I flow is another physical problem, made more complicated by its variability over time, location in the channel, and conditions. Bias in the test of randomness may be unavoidable. I The classification model in Section 4.5 of this report raises this issue. The dilJcussion later in this Addendum of Appendix F of the 1984 Bowline Point report and Young (1988) also bear I on the problem of bias. Sorting, identification, and extrusion of the ichthyoplankton in the samples collected in the field can introduce errors whose ramifications are substantial. A final sampling issue is the possibility of splitting some or all samples into subsamples, and counting I ichthyoplankton in only a subset of these according to a carefully specified rule. Splitting is possible, and may under some conditions be an effective technique for improving the accuracy I of estimation attainable for a given total cost; the reduced cost per sample achieved through splitting should allow increased replication. For instance, if dividing a sample into halves and I examining one of them provides almost as much information as the whole sample, then counts from two half-samples taken from two separate samples may provide much more I information than one count of a whole sample at little or no extra expense. 'Furthermore, it may also be true that counting and sorting precision in splits is subject to relatively smaller I measurement error than for full samples. This aspect of splitting is analogous to the improved quality of data seen by the Census Bureau in sample surveys as compared to the Census. More work must be done to establish the conditions that are favorable for sample I splitting. I This introductory section closes with discussions of three important methodological issues that arise in the study of entrainment abundance. I I A Justifiation (or the Logarithmic Tran(ormation o( ANOVA Data The rationale for the use of the logarithmic transformation for ANOVA purposes is I in.structive, so it will now be presented briefly. A fundamental assumption of ANOYA is that all observations have equal variances; performing an ANOYA on data th.at violate this I assumption may well lead to erroneous conclusions (for a thorough discussion, see Scheffe, 1959, Sec. 10.2 and 10.4). It has therefore become common to transform data in which the I I
I 6 I variances of observations are known to be unequal, in an attempt to equalize them. I The variance stabilizing transformation of an observed random variable (in this case, an observed count x) is defined as the transformation z = f(x) whose standard deviation is equal, at least approximately, to a specified constant (Tz. As pointed out by Scheffe (1959, I Sec:. 10.7), this transformation is obtained by solving the equation I f </J1~) f(x) = (Tz (2) I where the function <P expresses the standard deviation (Tx of an observed count x as a function of the mean J'J. For example, this transformation becomes f(x) = {X for data from I a Poisson distribution; 10 the case where per cent error is constant, it becomes the logarithmic transformation, f(x) = In(x), which results from substituting (Tx = </J(J-J) = CJ-J into (2). In the more complex situation of interest here, equation (1) gives I (3) I so setting the arbitrary constant (Tz equal to 1 yields I (4) I I Looking up this integral leads immediately to (5) I If J-J is sufficiently large, then x is also large with probability close to 1, so that ~ x+c 2X2 ::::: I cx, and therefore I f(x) ::::: ! In{2c(x + ~)} = ! In(2c) + ! In (x + ~). (6) 4c 4c I This shows that the transformation (5) is roughly equivalent to the logarithm of (count + constant) when J-J is large. This derivation applies to both natural logarithms (usually I denoted by In, as above, or loge) and logarithms taken with base 10 (usually denoted by I I
I I 7 logI0)' since the identity I In x = log1Ox . In 10 = 2.30259 log10x (7) I bolds for any positive number x. I This demonstration is a justification for using the logarithmic transformation with I ANOVA data. It is applicable when I' is large and the quadratic variance formula (1), or equivalently (3), holds. Under different conditions, such as a variance formula other than (1), the logarithmic transformation would not be a good approximation to the variance I stabilizing transformation. This makes it clear that using the logI0(x+l) tra.nsformation for ANOVA data requires attention to the conditions under which it is applied. It is not an I automatic device that always solves the problem of variance heterogeneity. I A final note of caution about variance stabilizing transformations is in order. Scheffe (1959, Sec. 10.7) mentioned the belief "that if a transformation equalizes the variances it is I also likely to reduce non normality and nonadditivity." He forcefully demonstrated that this belief can be incorrect by presenting an example, involving observations drawn from the I Poisson distribution, in which additivity and equality of variance are irreconcilable, that is, equality of variance can be obtained only by sacrificing additivity. I Sampling Gear and Abundance Estimation I The method of sample collection, i.e., net or pump, may have an effect on the I estimation of entrainment abundance in a flowing media. This is because the volume of discharge water that passes through the cross-section of the sampling device depends on I many factors. One of these is the velocity of the discharge flow, which differs from one location to another in the discharge channel, being slower near the walls than near the center. Another is the ratio of the sample volume to the discharge volume, which is represented as I v/V or v(t)/V(t), where t denotes the time at which a primary sampling unit occurs. This ratio may also depend on the sampling gear: net sampling results will be identical to pump I sampling results only if the pump mechanism is able to withdraw water from the sampling pnpe by matching the discharge flow exactly in local velocity, never drawing either faster or I I
I 8 I slower than the flow immediately surrounding the pipe. I As discussed earlier in this report (pp. 9.2-9.3), the sample volumes should be regulated to make the ratio v(t)/V(t) as nearly constant across time as possible for each plant. When replicate sampling ta.kes place, the same restriction applies equally to each individual I sampling unit. At ealch sampling station, the velocity of water through the sampling orifice should be equal to th,e local velocity of the adjacent water. The consequences of departing I from this pattern of eClual v IV ratios are of unknown magnitude. They could be investigated by simulation modeling, based on different assumptions about the nature of the discharge I flow and the behavior of sampling gear of various kinds. I This set of issuE:s needs further consideration; its resolution is beyond the scope of this Addendum. Current results on estimation of entrainment abundance could be modified to I reflect the results of such an investigation. I Random Variability, Uniform Variability, and Replicate Sampling I The need for replicate sampling is based on a simple but far-reaching observation: without replicate sampling, there is no way to model the variability in the discharge channel. I Without replication, the assumption of Poisson variability or some similar highly structured situation is unavoidable. Such an assumption mayor may not be justified, but one has no I way of knowing how consistent (or inconsistent) the assumption is with the data, unless some replicate sampling has been performed, allowing the analyst to test the fit of the data to the assumption. I It is a commonly stated misconception that Poisson variability in replicate sample I counts inevitably results from random variability in the channel. It is important to note that Poisson variability results from a very specific type of random variability, uniform variability, I which occurs when every location in the cross-section of the channel has the same (i.e., uniform) probability of containing an organism. When this is true, ichthyoplankton are said I to be randomly disper8ed within the channel cross section. Other types of random variability can lead to non-Poisson distributions, such as the negative binomial or lognormal, which may I occur when ichthyoplankton are clumped. Without replicate sampling, the exact type of I I
I I 9 variability cannot be determined. For example, it may be possible to have different levels of I uniform variability in different strata. This would lead to Poisson distributions within a stratum, but the discharge channel as a whole would not exhibit Poisson variability. This is I a possibility that can only be detected with replicate sampling. An adequate description of the variability in the channel is needed in order to place accurate confidence limits on the I entrainment estimates. I REVIEW OF SELECTED POST-l983 STUDIES I Several recent studies of entrainment abundance sampling will now be discussed. These are limited to those studies selected by Consolidated Edison becausf~ they involved I replicate sampling. Some of these studies are taken from the annual reports of Indian Point, Bowline Point, and Roseton Generating Stations. Others are articles relating to replicate I sa.mpling. I I"dian Point Generating Station Entrainment Abundance Program 1985 Annual Report. I Appendices G and I Appendix G is a small but well conceived study of the reliability of the sample splitting I procedure. It was conducted by counting all eight splits of each sample instead of only one split, and then examining the variability of ichthyoplankton and amphipod counts among the I eight splits. The study was repeated here on two sets of three samples, but under somewhat different protocols; the nonstandard precautions taken in the second set may make these data I unrepresentative of standard splits. Under both protocols, tests of splitting replicabiIity revealed higher variability among splits than expected from random chance. The observed I degree of variability among splits was described as comparable to the subsampling variability found in several earlier studies considered by Van Guelpen et al (1982). This source of error I in estimates of organism density appears to be relatively unimportant, because (1) high va,riability among splits seems most likely to occur with high counts, whi<t:h lead to low percent errors, and (2) the sampling error resulting from the high variability in the densities I is much greater. This study merits repetition and elabora.tion. What is missing in the design is duplicate counts of the same sample, by different people, to provide a measure of this I I
I 10 I source of variance. It is important to note that splitting is a form of subsampling, not replicate sampling: the analysis of two (or more) splits obtained from a single sample is quite I different from the analysis of two (or more) replicate samples. Appendix I trea.ts the estimation of the quadratic variance parameter c 2 , which is I needed for the calculation of entrainment confidence limits. These limits are required for 16 taxa/life stages, of which only four occurred in at least 40 pairs of nonzero replicate samples, I the number of pairs recommended for accurate estimation of c 2
- The Appendix considers the esti.mation of generali:led c 2 values, applicable even when there are fewer than 40 pairs of I
replicate samples for a given taxon/life stage. It treats this shortage of replicate observations by proposing that different taxa may have a common value of c 2 for each life stage. I Generalized values of c 2 , pooled over taxa for each life stage, are developed for eggs, post yolk-sac larvae, and juveniles; these values are consistent with data for individual taxa with I both large C~ 40) and small << 40) numbers of nonzero replicate pairs. Results of the study of split replicability indicated that at higher densities the splits I exhibit greater than equi-multinomial (Poisson) variation, even under the precautions taken with the second set of samples. For a test of given size 0, where the size is defined as the I I probability P(reject lIo: Poisson distribution lIo is true), the power of the test is an increasing function of the density of organisms. The magnitude of the excess in variance over I the Poisson model can be measured byc 2
- This requires use of the generalized formula for c 2
, for arbitrary sample size n (in the case here, n = 8) rather than simply n=2, given by I (8) I This formula is a mor4~ general version of the formula for c2 given in Chapter 4 of the report to which this Addendum is attached. In terms of Fisher's D2 statistic (see, for example, Haas I
and Heller, 1986) I n D2 = I: (x. - x)2 / x , (9) i=1 1 I the formula for c 2 in (8) can be written as I c 2 = [D2 - (n - 1)l/[(n - l)x - D 2/n] . (10) I I
I I 11 Under the beta-m'lllti'nomial model, the sampling distribution of this estimator is closely I approximated by assuming that I (11) I that is, the expression in (11) has a chi-squared distribution with n-l degrees of freedom, where c 2 is now a parameter of the distribution. The validity of this appro.ximation in the I beta-multinomial case has been illustrated by enumeration of the exact sampling distribution cf the expression in (11). As a consequence, we may replace n-l in (10) by the 5% and 95% percentiles of the chi-squared distribution to obtain 90% confidence limits for c2. I Note also that c2 is unbounded when there is only one nonzero count among the n I counts, and c2 is negative, and should be truncated to zero, when s2 < x (an occurrence that indicates low sample variability). I I I"dian Point Genel'llting Station Entrainment Abundance Program 1986 Annual Report. pages 91-110 I This report addresses many important issues raised in an earlier draft of Casella et al (1986). A reasonable method of gathering data is described, and the appropriate analyses are I performed to estimate c 2
- The estimate c2 is then used to construct confidence limits on the number of a given species entrained. (Using the assumption that the variance of the number I of a species observed in a sample satisfies the relationship (1), we see that the quantity c 2 is a variance parameter.)
I For a detailed description of sampling procedures at Indian Point, see Sections 1.2.2 I and 1.3 and Figure 1.3 in Chapter 1. There were eight taxa/life stages with enough replicate data to yield good c2 estimates, I based on guidelines set down by Casella et al (1986). In all, there were 688 replicate samples in the channel, which is a huge increase over the amount of replicate sampllng done in the I past. I I
I 12 I For eight key species, defined in the report (Table 4.4-1, page 96), the estimates of the parameter c 2 were very consistent. It was argued convincingly, based on the small variability I among these estimates, that considering c 2 to be constant among these species was very reasonable. For the other, non-key, species, c2 was more variable, so c 2 could not be considered constant. This higher variability could also be a function of a lower sample size, I as was noted in the re][>Ort. I Furthermore, for most of the key species, the estimated value of c 2 was quite close to zero. Since a c 2 value of zero indicates that the Poisson model is appropriate, the report I discusses the viability of the Poisson model, which was rejected by Casella et al (1986) in favor of the more variable negative binomial model. Although the 1986 Indian Point report I concludes that the Poisson model may be viable, it notes at least two other explanations. First, the sample SiZE: is much larger than before, so variability in the estimates of c 2 is reduced. Taking this smaller variance of c2 into account, smaller values of c2 would tend to I favor the Poisson model. Second, the two sampling intakes, which provide the replication, are close to each oth,er (one meter apart at the same depth). This may not provide an I adequate representation of the variability in the entire channel cross-section, making replicate samples appear less va.riable. The key issue in this discussion is that two different kinds of I variation are involved: variation in the whole discharge channel vs. the "local" variation in a sma.ll section (which could be a stratum) of the channel. The 1986 Indian Point report I correctly concludes that this study alone is insufficient evidence to prefer the Poisson model over the negative binomial model, and recommends continuing with the more conservative I negative binomial model. As noted above, this report is a careful and thorough treatment of the issues raised by I Casella et al (1986). The estimates of entrainment, including confidence limits based on the quadratic variance model, are presented (Tables 4.4-2 to 4.4-4), showing the estimated I number killed with and without mitigation. The quadratic variance model is retained, as it should be. There is sufficient evidence to support it, and the quadratic variance model I provides a more conservative picture than the more optimistic Poisson model. Although the estimates of c 2 are small, there is still enough variation to warrant continued estimation and I replicate sampling. I The number of replicate samples, 688, seems to be adequate, although there were a few I I
I I 13 taxa/life stages that had insufficient sample size to estimate the c 2 paramete~. The feasibility I of increasing the sample size to provide better estimates for these taxa/life stages should be considered. Keeping the sample size fixed, but changing the times at which r,eplicate samples I a.re taken, is not feasible: this cannot be done effectively for the many taJca/life stages of interest, especially when (as is the case here) these times must be selected fa.r in advance of I the actual sampling. However, the sample of 688 replicate observations did provide very good information on a large number of species. I The other major point of consideration is the proximity of the two sample intakes. As I stated in the report, even though the sample size was large, the proximity of the intakes makes it difficult to evaluate the variability in the channel. Previously, 1the sample size chosen was insufficient to provide a complete picture, although some observations are I possible. For example, for those taxa/life stages that had c2 ~ 0, this is evidence that there was no clumping within strata, and that sorting and identification errors are not much of a I problem. Then further studies could be aimed at testing for the presence of stratification and at calculating c2 from more widely separated samples. What is now needed are data of a I sufficient number (as in this 1986 Indian Point report) taken at replicate intakes that are far enough apart to provide some assurance that the variability in the channel is being I adequately described. The question of whether c 2 is greater than 0 in the quadratic variance model (1) is an I important one, as it can have a great effect on the size of the confidence limits (with those based on the Poisson model being smaller). Thus, from both an economic and an I environmental point of view, the decision about the model is crucial. For further discussion of the importance of an adequate description of channel variability, see the treatment of the I 1987 Bowline Point sampling program later in this Addendum. I Indian Point Genef'llting Station Entrainment Abundance Progf'llm 1987 Annual Report, pages 71-I 102 This report is very similar to the 1986 report, and therefore will not be described in I detail. The intakes are in the same position as 1986, one meter apart at the same depth. A few differences will be noted, however. I I
I 14 I Unit 3 was not operational during the entire sampling period, so data apply to conditions present when only Unit 2 is in operation. I The analyses described in the 1986 report were done again 10 this report, along with some additional ones. In particular, t-tests were performed to see if the mean density of ten taxa/life stages of ichthyoplankters differed between the two sample intakes. These tests were done on a logarithmic transformation of the data, and essentially showed no difference I in the intake densitie8. (There was one significant difference, but this can be attributed to chance because of the large number of t-tests performed. This is noted in the report.) These I t-tests provide an opportunity to confirm the null hypothesis of equal mean densities at the two intakes, if it is true. If the null hypothesis were rejected, the two intakes clearly would I not provide true replicates, that is, pairs of observations of exactly the same random process. In other words, rejection of the null hypothesis tells us that the two intakes are not close I enough together to constitute a true replicate measuring process. These t-tests, like the ANOVAs to be discussed shortly, are based on the assumption that all log-transformed observations have equa.l variance. I A common rationale for log-transforming the data has to do with additivity of effects. I That is, if the experimenters are concerned about understanding a treatment effect when underlying counts vary over 2-3 orders of magnitude, the situation may be remedied with a I logarithmic transforma.tion. The experimenters often feel that, in cases such as these, the log transform is reasonahle, resulting in an analysis that is easier to interpret. As discussed I earlier in this Addendum (see the subsection A Justification for the Logarithmic Transformation of ANOVA Data), though, concern over additivity alone is not sufficient justification for the logarithmic transforma.tion. The form of the variance function is a crucial component in the I decision to transform the data, and information on the form is attainable through the estimation of c 2
- Although the estimation of c 2 has not been a major focus of the I
entrainment sampling program, many studies (Indian Point, for example) have obtained data suitable for this estimation. Moreover, this estimation can be performed for different years, I and the consistency of c2 , and hence c 2 , can be evaluated. Armed with this additional information, a more fully informed decision about logarithmic transformations can be made. I The ANOVA o:f loglo(x+l)-transformed data is based on the implicit assumption that I the quadratic term in (1) dominates the linear term. This is true when either the density J.' is I I
I I 15 high or c 2 is large, but not when p is low and c 2 = 0, e.g., not for the Poisson distribution I with small mean p. When the logarithmic transformation is applied to counts made during relative peak abundance, with virtually no zero counts present, the largEl p criterion is I satisfied, and this reasoning seems relevant u a potential justification of analyzing log-transformed counts. This method is then likely to be justified on additivity grounds as well, I sance effects are more likely to result in percentage changes in counts rather than additive changes. I I Indian Point Generating Station Entrainment Survival Study 1988 Annual Report, pages 4-21 to 4-*33 I The purposes of the 1988 study at the Indian Point Generating Station were (1) to estimate the survival of ichthyoplankton entrained in the cooling water flow, (2) to determine I whether live and dead ichthyoplankton are distributed at random within the discharge canal at sampling station D2, and (3) to assess whether the thermal and mechanical components of I entrainment stress appear to be independent. The second objective addresses the issue of potential bias of put survival and abundance estimates, which were based on samples drawn I frllm mid-depth in the discharge channel. If live organisms are able to move vertically in the WItter column, then fixed location sampling might be biased by the movement of live organisms into or out of the sampled stratum. If directed vertical movement of live I organisms were to occur, then samples taken near the surface might have a different density and different proportion of live ichthyoplankton than samples taken near the bottom. Tests I of the null hypotheses of equal density and equal fractions alive at surfacl~ and bottom, conditions that would occur under random distribution within the channel, were included in I the program objectives. Due to the nature of survival sampling, it was not anticipated that sufficient paired surface and bottom samples would be available to examine the magnitude of I c2 , I Numbers of live and dead ichthyoplankton for surface and bottom samples were compared by chi-squared analysis of contingency tables. Results of these analyses showed significant differences in survival between surface and bottom samples for two of five I species/life stage combinations, post yolk-sac larvae for striped bass and white perch. For striped bass, survival was slightly better from bottom samples than from surface samples, I I
I 16 I 70.5% vs 66.1%. For white perch, survival was only 6.2% for bottom samples, but was 26.2% for surface samples. The other species/life stage combinations did not show a I tendency for live or dead organisms to be concentrated in bottom or surface samples. ANOVA was performed on the densities of these samples after application of the loglO(x+1) transformation; it indicated significant differences between bottom and surface for striped I bass yolk-sac larvae and white perch post yolk-sac larvae. I To obtain fUJrther information on stratification, ichthyoplankton at three depths, surface, middepth, and bottom. nets were used to sample The resulting densities I were transformed by IloglO(x+1), in order to compensate for their heterogeneity of variance, and ANOVA was then used to test the transformed densities for differences between means. No difference between mean densities on the basis of depth was found. I The objectives of this study, which were reported at the beginning of this section, were quite different from those of the 1986 and 1987 Indian Point studies of entrainment abundance. A large number of replicate samples were present in the 1986 and 1987 data; I that is not the case in 1988, so the information in the 1988 report bears on only a limited number of the issues: addressed m this Addendum. One such issue is whether there is I stratification of ichthyoplankton in the discharge flow. Because the number of replicate samples in 1988 was not large, the statistical analyses performed in the 1988 report do not I include the entrainment estimates and confidence limits based on the quadratic variance model that were present in the reports of the previous two years. The 1988 Indian Point I report thus does not utilize the statistical estimation procedures detailed in Casella et al (1986). I The major lssue addressed in the sections reviewed here is the vertical distribution of ichthyoplankton, i.e., whether stratification is present in the channel, not the estimation of I entrainment abundance. The ANOVAs in these sections, like the t-tests in 1987, provide an opportunity to disprove the null hypothesis of equal mean densities at the two intakes, if it is I not true. If the null hypothesis is rejected, the two intakes clearly do not provide true replicates, that is, paiirs of observations of exactly the same random process. In other words, I rejection of the null hypothesis tells us that the two intakes are not close enough together to constitute a true replicate measuring process, and that stratification is present. The one-way I and two-way ANOVAs were intended primarily to examine differences in means across I I
I I 17 different locations in the discharge, but do not provide any information about the variability I in the channel, e.g., the possibility of clumping. ANOVAs were performed for six species/life stage combinations; significant differences in mean density across different locations were I found only for striped bass yolk-sac larvae and white perch post yolk-sac larvae. The finding of higher survival of white perch post yolk-sac larvae at the surface raises the possibility of I stratification for this taxon. A suggestion is made (p. 4-32) that vertical distribution is dependent on flow in the discharge channel. This is presented as the reason for discrepancies I seen in vertical distribution for 1973, when only Unit 1 was operating, and subsequent years (1974-present), when Unit 1 was not operating but Unit 2 and/or Unit 3 was. I Since these ANOVAs were analyses on means, no conclusions could be reached about variances. Moreover, this shows that a conclusion in the report is incorrect, going beyond the I scope of the data analysis: it is stated (pg. 4-32) that "These results, togethflr with analyses conducted on data collected since Unit 2 began operation (1974), provide a fairly consistent I pattern supporting the contention that fish eggs and larvae tend to be randomly distributed within the discharge channel at Indian Point. The only exceptions to this pattern across the I 22 species/life stage evaluations made across the years were for striped bass post yolk-sac la.rvae in 1975 and for striped bass yolk-sac larvae and white perch post yolk-sac larvae in I flume samples from the current study." Although the majority of species and life stages examined showed no evidence of stratification, none of the statistical analyses done in this report 'can be used to justify a claim of random distribution in the discharge channel, as the I conclusions were not based on an appropriate analysis of replicate sampling data. I The report describes the sampling intakes as being at the top and the bottom of the channel. Since these sampling intakes are much further apart than the one meter separation I in the 1986 and 1987 studies, valuable information could be gained from these data. If an analysis similar to those done in 1986 and 1987 were performed for those taxon/life stage I combinations where a sufficient number of replicates exist, and values of c2 were estimated, then we could have a clearer picture of the variability in the channel. Furthermore, we could assess the validity of the confidence limits based on the quadratic variance model, which I re'flects extra-Poisson variation, and the Poisson model. I Using the 1988 data from the separated intakes, c2 could be calculated for various taxa/life stages. If c2 turned out to be much larger than 0, we would be led to reject the null I I
I 18 I hypothesis HO: c 2 = o. This hypothesis is satisfied by the Poisson distribution, which also supposes no clumping and no stratification. The departure from the null hypothesis could be due to either clumpin~: or stratification, or a combination of the two; however, with only two replicates, there is no way to tell from the data which of these is the underlying situation. (Details of this test CLre given in Casella et al (1986), Appendix VI.) A significant t-test I would indicate that ~ltratification is present, but would not indicate what portion of the departure from the null hypothesis is due to this stratification; the rest of the departure I would be due to the pl:esence of some degree of clumping. I Ro...eton and Danskammer Point Generating Stations Entrainment Abundance Studies 1986 I Annual Report, pages 5-77 to 5-103, 4-1 to 4-8 The specific objective of the 1986 entrainment abundance program at the Roseton Generating Station w;a.g to evaluate the effectiveness of outages in reducing entrainment at this station. Continuous sampling was done on 62 predetermined dates from May through I July, with samples retained as one-hour collections. Replicate samples were collected during two randomly determined days per week, resulting in 48 replicated one-hour sampling periods I per week, throughout the 13-week sampling period. I Primarysamplilllg at Roseton was at a fixed location midpoint in the discharge pipe, 1.8 meters from the bottom. (See Figure 1.4 in Chapter 1.) This has been the sampling I location for collections since 1980. Replicate samples were collected from a point next to the seal well wall, approximately 2.8 meters from the bottom. The replicate sampler was I approximately 4.3 m in front of the primary sampler. Sampling was done with a pump-net-barrel system. Sample volumes were standardized and replicate sampling was performed, with flow and volume used as criteria for acceptance or rejection of the samples for the 24-I hour collection period.. (For further details on sampling procedures at Roseton, see Sections 1.2.3 and 1.3 in Chapter 1.) I Sample volume!1 for primary and replicate collections were very similar, as were the I percent composition of key taxa/life stages. Comparison of hourly samples for the two collection sites throu~~hout the season were presented by a variety of graphs, charts, and I tables; this information showed similar patterns with little difference, in general, between I I
I I 19 primary and replicate samples. The report states that "[data] for certain taxa and life stages, I suggest a small but consistent difference ..." Differences in ichthyoplankton density between primary and replicate samples were treated statistically by a t-test using the difference of the I natural logarithms of density as the response variable. The null hypothesis implicit in this procedure is Ho: In D = 0, where D is defined by I D _ primary sample organism density + 1 (12)
- replicate sample organism density + 1" I Using this procedure, 4 out of 20 possible sets of density differences were found to be = 0.05 level.
I statistically significant at the a The location of the replicate sampling intake against the seal well wall would seem to I be a "worst case" location. The report acknowledges this possibility, noting that the selected slLmpling locations "provided the most severe test of homogeneous mixing ... " In spite of I this, the resulting counts and density estimates of ichthyoplankton were very similar to those collected at the primary sampling site. The observation of similar sample statistics, however, I does not necessarily imply equality of the distribution of organisms in the wa.ter from which the samples were drawn. The earlier subsection in this Addendum, entitled Sampling Gear I and Abundance Estimation, is relevant to this situation. The transformation of the data described by equation (12) above makes the validity of I the assumptions underlying the t-test questionable. Little is known about the error structure of the data, but a multiplicative error structure is implicitly assumed by the model used in I th.e report. The experimenters examined the transformed data and felt that the multiplicative error structure is justified, but we feel that statistical treatment of the data I might he supplemented by the use of distribution-free (non parametric) test statistics, rather than by attempting to normalize the data through transformation. I The paired samples from this study could he used to estimate the parameter c2 as I proposed in Casella. et al (1986). Although this was not included in the scope of work for the R()seton 1986 Annual Report, it would be straightforward to perform this analysis from the da.ta compiled for tha.t report. I I I
I 20 I Bowline Point Generating Station Entrainment Abundance and Unit Outage Evaluation 1984 Annual Report, Appendix F I This appendix ifI a discussion of sampling variation, emphasizing the Poisson model and the nature of alternative models of variability. Variation in the number of organisms I recorded as present ixll a sample results from variation in the number of organisms actually present per cubic meter of water, the main quantity of interest. It also results from variation I in many incidental factors, including the sampling duration, the pumping rate of the sampler, the fraction of organisms detected during sorting, and the fraction of organisms correctly I identified. Each of these (and other) incidental factors is a random variable with its own distribution; perhaps several of these variables are dependent, requiring the use of a joint I distribution. B The situation ill which the number of organisms present per cubic meter is Poisson distributed is consider,ed. The Appendix contains the claim that the recorded catch, C, then has a Poisson distribution if and only if the fractions of organisms detected during sorting I (f3) and correctly identified (f4) and two other similar fractions "are all constant and equal to unity." Actually, the distribution of C can be shown to be Poisson under the less restrictive I con.dition that the number of organisms present per cubic meter is Poisson distributed and all of the fractions fl to f4 are constants between 0 and 1. This result follows from the I distributional theorem: I Theorem: If the conditional distribution of Y given X=x IS binomial(x, p), and X has a Poisson(A) distribution, then the (marginal) distribution of Y is Poisson(Ap). Stated I equivalently, but less formally, if YIX "" binomial(X,p) and X '" Poisson( A), then Y '" Poisson( Ap), I Applying this result repeatedly to the equation C "" B(B(B(B(P(t.q.d),f1 ),f2 ),f3 ),f4) Appendix gives the distribution of C as Poisson(t.q.d *f1*f2 *f3 .f4). III the I In practice, the fractions f1 to f4 will be random variables, and f4 may, through I misclassification, be gJreater than 1, making the distribution of the recorded organism count more complex than the Poisson. (Section 4.5 of this report discusses the connection between I misclassification error~1 and the quadratic variance model.) Because of these extra sources of I
I I 21 variability, the variation in observed counts exceed!' the variation in the number of organisms I entering the sampler. I Replicate sampling data from 1983 and 1984 were analyzed to test the null hypothesis of a Poisson distribution. The test was based on the variance:mean ratios of replicate pairs; I under the null hypothesis, the variance and the mean should be equal. DatiL from 1984 did not depart as much as 1983 data from the null hypothesis, perhaps because of additional I controls imposed on the sampling process in 1984. The test used is the test recommended in Casella et al (1986, equation (4.3>>. I This analysis highlights the fact that the variability in observed organism counts equals the variability in the number of organisms entering the sampler plus the additional variability I due to experimental error, composed of factors like the ones mentioned above. It is therefore important to make the variation due to these additional sources of error SiS close to 0 as I possible, in order to avoid or limit the bias toward rejecting the null hypothesis of the Poisson distribution. This important point is not addressed further in this Bowline Point report. I However, it deserves more attention. I Bowline Point Generating Station Entrainment Abundance and Unit Outage Evaluation 1986 Annual Report, psges 3-9 to 3-11, 4-16 to 4-22 I The results of the 1986 entrainment sampling at Bowline Point Generating Station are I presented in terms of estimated daily and season total numbers of key taxa/life stages, with 90% confidence limits, a~d estimated percent reductions attributable to outages (without I confidence limits). The outages reduced cooling water flow by 34%, whilfl the estimated percent reduction in numbers ranged from 4.5% to 48.9% for the key taxa/life stages as a I result of the timing of the outages relative to the development of life stages. I This report utilized the estimation approach developed in Casella et al (1986), involving the calculation of c2 and its use to estimate the variances of the entrainment estimates and to obtain confidence limits. Details appear on pages 3-9 to 3-11 and in I Appendix B-1. I I
I 22 I Bowline Point Generalting Station Entrainment Abundance and Unit Outage Evaluation 1987 I Annual Report, pages ~J-11 to 4-28 The 1987 Bowline Point report describes two major categories of bias affecting I entrainment estimate!!. The first is differences between the true organism density in a sampled unit of water and the expected value of the organism density observed in a sample I dra.wn from that unit, where each unit corresponds to a given time interval and a given portion of the discharge channel cross-section. Some components of this category involve I nonrandom sampling <<~rror, e.g., gear avoidance. Others involve nonsampling error, which is defined as error not related to the selection of a sample, that would be present even in a I census, e.g., organism misidentification. The second category of bias is location effects, a type of nonrandom sampling error, resulting from systematic patterns in organism density I within the discharge channel. The report cites a recent critique (Versar, 1987) that concluded the distribution of organisms within the Bowline Point discharge pipe could and should be evaluated, using simultaneous sampling at multiple points within the discharge I pipe. The 1987 studien at Bowline Point investigated the existence of location effects. I The importance of an adequate description of channel variability is highlighted by the 1987 Bowline Point program, which included a study comparing four different sampling I stations or locations. Simultaneous replicate sampling was conducted in an extensive study of distribution patterns in the Unit 2 discharge pipe during June 2 through August 7, 1987. I Four locations within a cross-section of the discharge pipe were simultaneously sampled in eight 3-hour collection periods per day, using two automated systems (AUTOSAM) and two manually operated systems (MANOSAM). (Differences in device type were not expected to I influence the results of entrainment sampling.) These collection systems were switched among the different sa.mpling locations on a daily basis in a four-day-per-week cycle. Three I of the stations were near mid-depth of the discharge pipe, located in center, north, and south positions, while the fourth was located at the top of the discharge pipe at the entrance to the I standpipe. (See FigUJre 1.2 in Chapter 1. For further details on sampling procedures at Bowline Point, see Sections 1.2.1 and 1.3 in Chapter 1.) I Entrainment abundance of only striped bass and bay anchovy post yolk-sac larvae were I considered adequate to test for station and gear effects. Data from only their respective I I
I I 23 periods of high abundance (four weeks in June for striped bass and six weeks in late June to I early August for bay anchovy) were included in the analysis, in order to avoid the effect of a hLrge number of zero catches. Log transformed data, coded by the eight three-hour daily I collection periods and by week as well as by location and gear, were analyzed by SAS PROC GLM using a linear model including all four main effects and their first-order and second-I order interactions, except for week x location x gear. Sample counts were first converted to densities (per 1000 m 3 ) and then incremented by 1 before the loglo transformation was I performed to "minimize the effects of variance heterogeneity". This comment was the only r<<~ference in the report concerning a relationship between the variance ILnd the mean. I Implications of the transformation with respect to multiplicativity of factor effects on density were not discussed, and no test of the ANOVA's implied assumptions was repo*rted. I The ANOV A can provide information about the pattern of stratilication in the channel, but not about whether clumping is present. The considerations cited above in the I subsection on the logarithmic transformation of ANOVA data were not discllssed; however, they are quite relevant. The ANOV A of the log-transformed counts is based on the implicit I lUlsumption that the quadratic term in (1) dominates the linear term. This is true when either the density is high or c 2 is large, but not when the density is low and c 2 = 0, e.g., not I fo*r the Poisson distribution with small mean. Restriction of the periods used for the analysis to those of high abundance increases the probability that this assumption is satisfied. I The untransformed data are potentially useful for exploring the relationship of variance to mean. The two MANOSAM devices, in particular, provide numerous paired counts over a I range of densities for both species. I Interpretation of the analysis of the log-transformed data was complicated by the presence of a significant location x gear interaction for both species. The possibility was I mentioned that confounding due to some missing combinations in the four-factor factorial design might produce anomalous results; however, only 6 to 7% of more than 500 cells were I missing in each case, so this is not a very promising source of anomalies. Three-dimensional graphical displays of the 4 x 4 location x gear means failed to reveal any consi~tent pattern fOlr the two species; the significant location x gear interaction does not appea.r to be safely I interpretable. A comparison of bay anchovy length frequency distributions indicated a deficiency of smaller post yolk-sac larvae in AUTOSAM 1, at the top of the discharge pipe I I
24 location. There was an indication that excluding the data from this location and this sampling system would eliminate the difficulties, resulting in consistent homogeneity (i.e., I random distribution) a.t mid-depth for both species. I Two Related Articles o The following two articles contain information that bears upon the question of the randomness of spatial distribution in the discharge channel. Consequently, we were asked to o review them. Although a comprehensive search for material on this subject was not undertaken, the litera.ture that was examined did not reveal any other work on this topic. I I Young, J.R. 1988. "'Testing for Unbiased Sampling of Ichthyoplankton In a Power Plant Discharge Channel". Unpublished Manuscript. I This study considered the issues of bias in entrainment sampling and the randomness of dispersion of ichthyoplankton in the discharge flow. Empirical data from the Indian Point I 1986 sampling progra:m and a computer simulation of the entrainment sampling process were used to examine the possibility of bias in tests of randomness (i.e., no stratification and no I clumping of organism:,) due to processing errors. Observing that "the tedious and repetitive nature of sample sorting makers] it a likely source of additional variation in sample counts," I the paper notes that a sample judged to be acceptable, or not defective, can contain a substantial amount oJ detection and classification error. A sample is judged to be defective if, based on a second independent examination of its contents, either (i) less than 90% of the I organisms were foundl or (ii) any taxon count had an error rate of 10% or more; thus, a non defective sample eould have observed "counts" that are rather different from its true, I unobserved organism ,counts. I Indian Point replicate data from 1986 on each abundant taxon/life stage were used to test the hypotheses: underlying mean densities are equal across locations in the discharge I cross-section, and repllicate counts are consistent with a Poisson distribution. A paired t-test was performed on the differences between log-transformed counts, loge(count + 1), at the two sampling locatiollls. These locations were within 1 meter of each other, as the sampling I
I I 25 I program's original intention had been to measure sampling variation at a single location. Analysis of the data supported the view that there was no difference in mean density between the locations sampled. I In addition, Fisher's 0 2 was computed for each sample pair, given by 02 = I (n-I )s2 Ii, where i and s2 are the mean and variance of the replicate pair and n=2. Under the null hypothesis that each pair consisted of two independent Poisson distributed I observations with a common mean, the 0 2 for each pair has a chi-squared distribution with 1 dj~gree of freedom. Using the average value of 0 2 over all nonzero replicates as a test I statistic, the Poisson null hypothesis was decisively rejected. (This test statistic is equivalent to the statistic Sp defined in Section 4.1 of this report.) It is then demonstrated that the samples' true organism counts could be Poisson distributed, while errors arising during I sorting and classification could explain the departure of the observed data from the null hypothesis. This is exactly the point advanced previously in Section 4.5 of this report. I The computer simulation and an analytical description of the sampling process I examined the effect of sample processing errors on the expected value of O:l. Pairs of iid (independent, identically distributed) Poisson sample counts X were generated. Each count I X was reduced to a smaller detected count Y, reflecting sorting error, the failure to detect all organisms in the sample; Y followed a binomial distribution with parameters X and R, a I random detection probability having a beta distribution. Each detected count Y was then transformed further to an identified count Z, reflecting counting and identification error; Z followed a binomial distribution with parameters Y /q and q. The identified count Z could be I either higher or lower than Y, but the expected counting and identification error was O. The simulation and analytical results showed that sorting and identification errors can seriously I bias 0 2. This bias becomes greater as the mean density of organisms increases. I This article supports the position proposed in Section 4.5 of this report, that the true (but unknown) organism counts in the samples could follow a Poisson distribution, while I errors arising during sorting and classification could result in observed sample counts that do not follow a Poisson distribution. From a statistical viewpoint, the analysis of the observed data must follow the quadratic variance model of equation (1), which is detail.ed in Chapters I 4 and 5 of this report; regardless of whether the true (unobserved) organism counts are Poisson distributed, the observed data exhibit extra-Poisson variability and must be analyzed I I
I 26 I acc:ordingly. I Jude, Mansfield, DeBoe, and Tesar. 1986. "Spatial Distribution of Entrained Fish Larvae In a I Power Plant Discharge Canal". Can.J.Fish.Aquat.Sci. 43:1070-1074. An attempt to address the randomness and representativeness of larval fish I entrainment sampling was undertaken in a brief study at the Monroe Power Plant on western Lake Erie. The expressed motives and objectives were the same as those of the Hudson River I studies, to test for horizontal and vertical stratification of fish larvae in the discharge canal and to test the Poisson assumption in order to validate methodology for estimating I entrainment loss. Simultaneous samples were taken at three equally spaced locations across the discharge canal Illsing three 0.5-m-diameter plankton nets with attached flowmeters. I Each station was alternately sampled for IO-minute periods at three depths of 1, 3, or 5 m in the 6 m deep canal, ordered among the stations in a 3x3 Latin square design. Sampling was conducted over a two-*hour period around midnight on 7-8 June, 1982, when maximum catch I was expected, and again on 12-13 July, when juvenile fish would be present in larger numbers. On each date, the Latin square was repeated four times during the two-hour I period to provide replication in time. I Access to these stations was provided by a bridge spanning a bend in the discharge canal just before it widens into a channel into Lake Erie. The bend, unfortunately, created a I twofold difference in water velocity across the canal, resulting in slow current near the inside wall, potentially below detection of the net's flowmeter at this station. Estimated density (catch/measured volume) was twice as large at this station as at the middle and outer I stations, and the authors speculated that the propellor of this flowmeter was often not turning in the slow current so the sampled volume was grossly underestimated. Among all 72 I samples, the estimated volumes filtered per sample ranged from 2.4 to 80 cubic meters, even though all 12 circulating water pumps were pumping continuously at the same rate I throughout the two s;ampling periods. Subsequent measurements of water velocity at these stations and depths revealed large differences also between the middle and outer stations, as I well as station x depth interaction in velocity. Reliability of the estimated volume filtered is thus a critical and qlllestionable issue in the interpretation of the results of this sampling U study. I U
I I 27 The Latin square design carries with it implicit assumptions concerning lack of I interactions among the three controlled factors (location, depth, and time) and hence does not lend itself to testing the Poisson model. Such a test, though attempted here (in an I unspecified manner), is uninterpretable due to these other potential C2.UseS of model misspecification. (The Poisson goodness-of-fit test was said to be performed on density data; I if so, this would be analogous to performing a chi-squared test on percentage data rather than on count data.) Data analysis proceeded on the basis of the Poisson model, employing I ANOVA on the inappropriately square-root-transformed density data (rather than count data); with grossly varying sample volumes, the only appropriate Poisson-based analysis would employ likelihood ratio testing methods like those provided by the statistical package I GLIM. In view of the defects in design, measurement, and data analysis, the :reported results oj(' this study are to be considered of dubious value. The merit of the published report lies in I pointing out some of these defects so that others may avoid them in their sampling studies. I CONCLUSIONS I This concluding section has two distinct parts. The first is a summaory of what has I been observed from these recent entrainment abundance sampling studies. The second is a discussion of what further knowledge is needed for a fuller understanding of t he relationship bEltween observed ichthyoplankton counts and entrainment abundance estimates. I Observations I The quadratic variance formula (1) provides a better description of empirical I entrainment data than the Poisson model. As noted earlier in this Addendum~ c 2 reflects the quadratic term in equation (1); it provides a measure of the extra variation present in the I entrainment abundance data above the level consistent with the Poisson model. This extra v8.riation is potentially due to some combination of stratification, clumping, and inherent I sampling process error (such as extrusion, mutilation, non-detection, misidentification, and counting error). However, there are other possibilities: for instance, the entrainment sampling may have the structure of a gamma mixture over time of Poisson random I observations, which also leads to the extra-Poisson variation of equation (1). I I
I 28 I An important point concerns the relationship between the parameter c 2 , its estimator c2 , equation (1), and the negative binomial distribution. The estimator c2 is a method of I moments estimator of c 2 in the quadratic variance formula (1). It is therefore not tied to the negative binomial distribution, but only to the quadratic variance formula (1), which is satisfied by the negative binomial but also by other distributions. The negative binomial distributional model serves as a convenient, flexible, and mathematically tractable device for simulating distributions having the quadratic variance function of (1). It also has intuitive I appeal as a gamma mixture of Poisson distributions (see Appendix II). Our attention to the negative binomial model appears to have led to an emphasis on this distribution, rather than I on the more general equation (1), in the recent generating station entrainment reports described above. The estimator c2 may be tied to some restricted class of Poisson mixture I models, in which c 2 ii, the squared coefficient of variation of the mixing distribution of the Poisson parameter A: B Var(X) = E{Var(XIA)} + Var{E(XIA)}
= Jl A + (0"1/ Jl1)Jl1 = JlX = E(A) + Var(A) + (0"1/ Jl1)Jl5c = Jl A + 0"1 (13)
I where the conditional distribution of X given A is Poisson(A), and A is a random variable with I mean Jl A and variance O"~, so E(XIA) = Var(XIA) = A and JlX = E(X) = E(E(XIA) = Jl A' Therefore equation (1) holds, with c 2 = 0"1/ Jl1. I For example, in the case of a lognormal mixture of Poissons, I c2 = -1 + exp{Var(ln An (14) I From recent replicate sampling, we have gained some knowledge of the range of values assumed by the para,meter c 2 , based on the realized values of the estimator c2
- Useful I
information has come from the Indian Point 1986 and 1987 sampling programs, during which simultaneous samples were taken at intakes one meter apart at the same depth, and I potentially from the Bowline Point 1987 sampling program, during which pairs of MANOSAM samples were taken simultaneously at two mid-depth stations. From the data I collected during these sampling programs, we have at least a preliminary idea what range of values c 2 can assume. This is likely to change as further replicate sampling reveals more about which values are possible, and which values are likely, for c 2
- I I
I
I I 29 For taxa/life stag~ with small c2 , stratification a.nd clumping within strata are not I apparent. For taxa/life stages with larger c2 , more work is necessary to develop methods for determining the relative contributions of stratification, clumping, inherent sampling process I error, and other features of the entrainment process. Experiments can be designed to estimate these different phenomena, any or all of which could be responsible for making c 2 I greater than zero. Note that inherent sampling process error leads to biatl in testing the validity of the Poisson model via the null hypothesis that c2 equals O. I We have gained some experience with the estimator c2 , which has 'been calculated I using empirical data as mentioned earlier, in sections of this Addendum discussing specific pla.nt sampling programs. We also know a good deal about the behavior of (:2 under a wide rnnge of (hypothetical) conditions from the extensive computer simulations in Chapter 6 of I this report. Our knowledge about c2 translates immediately into corresponding knowledge about the entrainment abundance confidence intervals based on c2 , which provide more I realistic entrainment estimates than the intervals based on the overly optimistic model of Poisson variation in the discharge channel. I We have some indications of the role of stratification in the distribution of I ichthyoplankton in the discharge channel. The 1987 Bowline Point report concluded that its data analyses "provide no evidence of a non-random distribution of entrained organisms in the areas of the discharge pipe normally used for entrainment abundance" (p. 4-26). The I only anomalous sampling location, the top of the discharge pipe at the entrance to the standpipe, produced significantly higher results than the other locations for stJriped bass post I yolk-sac larvae but significantly lower results for bay anchovy post yolk-sac larvae. This location is not normally used in entrainment abundance studies. (Unexpectedly, there was I clear evidence of a sampling gear effect, due largely, perhaps primarily, to AUTOSAM 1.) The Indian Point 1988 report described an analysis of seven species/life stage combinations I sampled at three depth stations, surface, mid-depth, and bottom. There were no significant differences in density among the depth stations for any of the seven combinations. Based on I the results of 19 out of 22 species/life stage evaluations over the years, the report concluded that there is a fairly consistent pattern supporting the contention that fish eggB and larvae in the discharge channel at Indian Point show a fairly consistent pattern of no stratification (p. I 4-32). (This conclusion was worded in terms of fish eggs and larvae being randomly distributed; for a critique of this, see the I!!ubsection here on the Indian Point 1988 report.) In I I
I 30 I the remaining 3 evaluations, surface densities exceeded bottom densities. I The study of sample splitting described in Appendix G of the 1985 Indian Point report is a promising initial effort on assessing the reliability of splitting. If the difficulties in I achieving thorough uniform mixing of the sample before splitting can be resolved, the potential gains from slLmple splitting procedures are great, as suggested in Chapter 7 of the present report. I In summary, the statistical issues involved in the modeling process necessary for I accurate estimation of entrainment abundance are much more complex and difficult to address through data collection and analysis than had originally been thought. Hypotheses I about the distribution of organisms within the discharge channel are statements about the organisms at the point where they enter the sampling device. However, to test these I hypotheses, we observe those organisms that have entered the device, passed through the sampling pump (if applicable), and been retained in the net, detected in the sample, and identified (correctly or not) as being a particular taxon/life stage. The estimated c 2 values I reflect not only the va.riation in the distribution of organisms, but also the variability added by the remainder of the sampling process. I I Recommendations I We now consider what remains to be learned about the number of ichthyoplankton in the discharge channel and the estimation of this quantity. The main issues to be treated are: I (i) our understanding of the variability in the discharge channel and the roles of its potential sources, clumping and stratification; (ii) our understanding of the variability in the observed entrainment count data and the roles of its potential sources, which include inherent sampling I process error as well as the variability of the true (unknown) organism counts; (iii) our careful consideration of flow velocity; (iv) our understanding of the behavior and properties of I the estimator c2 , especially the effects of clumping, stratification, and inherent sampling process error on c2 ; (v) our knowledge of c 2 for various taxa/life stages; and (vi) the I prospects for sampling designs that address key questions connected with the pattern of entrainment in the discharge and the corresponding estimation techniques. I I I
I I 31 I (i) The first major need is to gain a better picture of ihe variability in entrainment abundance across the discharge channel. The two components of the "state of nature" in the discharge are the presence or absence of stratification in the discharge flow, and the presence I or absence of clustering of organisms (within each stratum if stratification is present). The pattern of variability reflects both of these; if neither stratification nor clumping is present, I the distribution of organism counts will be Poisson. I Stratification represents "large scale" variation across the channel, in which moving from one sector of the channel cross-section to another (e.g., moving 2 meters up and 1 meter I to the left) results in a different density of organisms, while replicate samples within a single 84~ctor (e.g., starting at the channel's geometric midpoint and moving 0.3 meters up and 0.2 meters to the right) constitute independent Poisson observations with a common density of I organisms. Clustering of organisms is the opposite, representing "small scale" or "local" variation in the channel, in which even very close replicate samples in the channel display I extra-Poisson variation; clustering does not involve any "large scale" variation or systematic changes across the channel. In many sampling studies, the "replicate" intakes have been I located relatively far apart in the discharge channel. If extra-Poisson variability is observed under these conditions, it could be due to either stratification or clumping. I If stratification is present, it is likely to lead to bias in entrainment estimates. I Unfortunately, the physical constraints on the sampling locations make it impossible to eliminate the bias by randomizing the location of the sampling station during the entrainment season. It therefore becomes critical to determine whether stratification occurs in the I discharge channel, and, if so, in what pattern and degree and under what conditions of flow, organism density, etc. Different taxa/life stages must be investigated .separately, although I the possibility of combining data for similar taxa should receive attention. This process must b<<~ undertaken separately for each generating station; it cannot be safely assumed that they I will display similar patterns. If stratification is present in the discharge flow, the ability to adjust suitably for its effects is very important in avoiding biased entrainment estimates. I More must be learned about the patterns of stratification present in the generating station discharge channels, in order to develop adjustment tech.niques. I If clumping or clustering is present in the distribution of organisms, bias will not be introduced, but the variance of estimation (including confidence limits) will be greater than I I
32 I under random Poisson distribution of organisms. Replicate sampling provides information about whether nonrandomness or clumping is present, and, if so, to what degree. Thus, if enough nonzero repli<:ate samples are available for a given taxon/life stage, an accurate confidence interval fof' entrainment can be found. replicate sampling data are insufficient. Problems in estimation arise only when Testing for the presence of clumping requires I eliminating the effects of possible stratification; this can be done by taking replicate samples very close together, to ensure that no stratification-based differences affect the observed I organism counts. Testing for the presence of clumping is discussed in Section 4.5 of the present report, in the article by Young, and in the Bowline Point 1984 Appendix F. I It is vitally important to be able to separate the phenomena of stratification and I clumping. To test the null hypothesis of a Poisson distribution of organisms (no stratification and no clumping) aga.inst the alternative that there is some departure from the Poisson (either some stratification or some clumping or both), data consisting of sets of two simultaneous observatiions are sufficient. To test for stratification and clumping separately, however, sets of thrlee (or more) simultaneous observations would be required. The I additional observation is needed because of the need to measure both the variation over the whole channel, i.e., stratification, and the variation at the "local" scale, i.e., clumping. To I accomplish both objectives at once, a sampling pattern combining the approaches taken at Indian Point in 1987 (close intakes) and 1988 (separated intakes) would be effective. With I three simultaneous intakes, two should be close enough together to be true replicates and the third farther away. It is important to know how close together two intakes must be to I produce true replicatei~;at Indian Point, based on the 1987 replicate study in which c2 ::::: 0 for most taxon/life sta.ge combinations (see p. 89, Table 4.3-2), it appears that a 1m distance may well be close enough to provide true replicates at a single location. For Bowline Point, it is possible that data @;athered in the 1987 studies may be similarly informative, but further analysis of these data is needed. The two close intakes would allow the estimation of the I degree of clumping, while the intake farther away would lead to an estimate of the degree of stratification. The Layout and analysis for this sampling experiment, which should be I implemented at each of the power stations, remain to be developed. I A set of simultaneous observations drawn from the discharge channel under identical conditions will appear to be true replicates only if the sampling gears at all sampling locations I perform equally well. In practice, equality of sampling gears has been difficult to achieve. I I
I I 33 Methods to evaluate this equality of sampling gears are also needed. Such methods should be I able to determine whether two or more apparently identical gears are actually performing equally well. I Exploratory analysis of the relationship between variance and mean should be I undertaken on existing data sets. For example, the mean square for sampling station within days at fixed segments of the diel period, which is the estimated error variance, may be I plotted against the corresponding mean densities to look for the anticipated quadratic relationship (1). Since density levels change over the season, such interactions should be I partitioned into short periods of relatively stable density. The counts should not be transformed in such explorations, since (1) refers to the original counts (for constant sample volume). I (ii) In addition to the variability in the true number of organisms in the discharge I channel, we must investigate the variability in the observed entrainment data. The true number of organisms is known rarely, if ever; the observed organism counts reflect both the I true (unknown) counts and the level of inherent sampling process error. ThiB error includes extrusion, mutilation, non-detection, misidentification, and counting error. The overall rate I of such problems may be small enough to be negiigible, or large enough to be quite serious, or somewhere between these extremes. Knowing the consequences of different levels of the various kinds of sampling process error is necessary in order to judge the seriousness of their I effects on c2 and the entrainment estimates. I (iii) Another need is a greater understanding of the role of the velocity profile in the discharge channel under various flow conditions. All work on estimation of entrainment I abundance at generating stations, in this report and elsewhere, where pumping is used to collect samples, is predicated on the assumption of a constant zone of withdrawal for the I sampling pipe. If the validity of this assumption can be documented, entrainment abundance estimation becomes a more manageable task. Conversely, failure to satisfy t:llis assumption I would greatly complicate further analysis. None of the work under review has addressed this area of concern. A study to evaluate velocities in the sampler orifice and u.djacent water under various flow conditions should be undertaken. I (iv) Much remains to be learned about estimating the quadratic variance parameter c2 I I
I 34 I and using the estimate in the estimation of entrainment abundance. The behavior and properties of the estimator c2 should be investigated further. In particular, the effects of clumping, stratification, and inherent sampling process error on the behavior of c2 and on the estimation of entrainment abundance are important to determine. I A related issue worth investigating is our understanding of the estimation of c 2 and the possible development 'c>f improvements in the estimation process. The sampling behavior of I the estimator c2 should be explored more fully, to determine whether current procedures should be revised; for example, perhaps the requirement of 40 nonzero replicate samples should be made less stringent. It was never claimed that c2 is the best possible estimator of c 2 , and attempts to find a better estimator of c 2 through different approaches should be I considered. (v) More detaHed knowledge of the values assumed by c 2 for the taxa/life stages of I interest would be useful in applying the model of extra-Poisson variation to the task of obtaining confidence intervals for entrainment abundance. It would be especially helpful to I know whether inform,ation about c 2 can be pooled across taxa and across years. Pooling across generating stations might also be possible, and pooling across different life stages is a I remote possibility. ][t would be very advantageous to establish that information from different taxa and years could be used to estimate a common value of c 2
- If it turns out that I one or more of these types of pooling is valid, this would decrease the amount of future sampling effort needed to obtain confidence intervals of a givep degree of accuracy. I The variability im observed organism counts has two components, the variability in the number of organisms entering the sampler and the additional variability due to experimental I
error (also referred to above as inherent sampling process error). The latter was discussed in the Addendum sections on Appendix F of the 1984 Bowline Point report and on Young I (1988). Its connection to the quadratic variance model was detailed in Section 4.5 of this report. More complete knowledge of the effect of the variability due to experimental error is necessary in order to judge whether current quality control practices for entrainment data are adequate. The cost of more stringent quality control should be compared to the savings I realized from the more: precise estimation possible with more accurate entrainment data. I (vi) A final recommendation IS for a renewed interest in choosing entrainment I I
I I 35 abundance sampling designs that make it possible to answer the critical questions about the I variation of organism counts in the discharge flow and about the entrainment abundance estimation process. As usual, the term "sampling design" is used here to i.nclude the field I collection, laboratory analysis, and statistical analysis of entrainment sampling data. I The reports from the last few years reveal great year-to-year and plant-to-plant differences in the sampling plans adopted at the generating stations and in the subsequent I statistical analyses of the resulting entrainment data. Some of these sampling designs have been extremely informative about variation in the discharge channel, while others have contained little or no information bearing on this issue. Moreover, the estimation of entrainment abundance, a major goal of this entire effort, has received variable emphasis through the years. In some years it is a primary aim, while in other years the emphasis has I been on addressing other issues, with very little being learned about channel variation and abundance estimation. From the perspective of these issues, it often seems that the three I plants operate separately, with little coordination of sampling designs. I The annual reports from the generating stations must address a variety of entrainment-related issues, of which the topic of this Addendum is only one among many. I Although there has been an effort to perform effective and informative entrainment abundance sampling over the years, this effort has not been uniform throughout plants or years. It would have benefited greatly from the presence of two features, both of which I should be incorporated without delay:
- More thorough data exploration should be included to supplement the standard analyses.
I Innovative data analyses suggested directly by the entrainment process and by the data themselves can be extremely informative. The quadratic variance model of equation (1) and I the c2 approach to entrainment abundance estimation are one such analysis; other analyses in the same spirit would improve our knowledge further. This is especially im~.ortant because I the standard analyses often depend on assumptions that are known not to be met, which makes them only approximately valid.
- Integration of the work done in previous years at the three plants is vital. The changes in I the three sampling programs over the years must reflect the past experience and findings of the programs, but there is no explicit evidence of this. The links between PILSt and present I s8.mpling studies are not reported. This leaves the impression that the plants' annual reports do not use and build further upon the good work done in previous years. To counter this I
I
I 36 I impression, there is an immediate need for an overall review of the sampling programs at the thn.'e plants. It should explain exactly what was done at each plant in each year and (to the I degree possible) why this choice was made, with special emphasis on the influence of results from studies done in previous years. Of course, as future annual reports are issued, they should include this information, which allows the reader to place each year's work properly in I the perspective of the overall sampling program. I We find that the entire body of work - the reports for all plants and all years - leaves several basic issues largely or wholely unresolved. There have been approximately ten I years for collection of data, and the total knowledge concerning variability in the discharge channel that has been ;1I.Ccumulated over this time is small compared to what could have been I learned with a more comprehensive approach. (For example, with a more coordinated sampling effort in th4~ discharge channel, we would know, by now, how to model the variability.) An overall statistical plan, encompassing sampling and analysis, building a body I of knowledge through the next few years, and coordinating the work at all three plants, is essential. I I Summary of Recommendations I To highlight the: points we consider important to concentrate on in the future, the following list reviews our recommendations. All points are described in detail in the above I conclusions. Although we are not familiar with the entire range of questions relating to the impact of power plants on Hudson River ichthyoplankton, the following needs are most important based on what we have seen: I o Better description of discharge channel variability, including quantification of I
- stratification -clumping I -replicability of sampling gear -comparisons between different sample collection methods, e.g., net vs pump I o Better description of observed count data variability, including -discharge channel variability factors, especially stratification and clumping I -measurement process errors (extrusion, mutilation, misidentification)
I I
I I 37 I
- Separation of the effects of stratification and clumping
-their effects lead to different conclusions -also tied to determination of placement of sampling gear I
- Further exploration of the relationship between mean and variance
-data analytic explorations needed I
- Better description of velocity profile
- Better knowledge of estimator c' I -behavior and properties of c'
-effects of stratification, clumping, measurement process errors on c' c' in estimation of c 2 I -possible improvements over
- Better knowledge of c' values for taxa/life stages of interest
-examine consistency across species I -examine consistency across years
- More exploration of effect of components of variance I -effect of variability of number of organisms in sampler
-effect of various components of experimental or measurement error I
- Execution of sampling designs providing answers to key questions involving
-variation of organism counts in discharge flow I -entrainment abundance estimation process I REFERENCES Casella, G., D.S. Robson, S.J. Schwager, and W.D. Youngs. July, 1986. Evaluation of I Entrainment Abundance Sampling Designs, Draft. Prepared for Consolidated Edison Company of New York, Inc.
I Haas, C.N. and B. Heller. 1986. "Statistics of Enumerating Total Coliforms in Water Su.mples by Membrane Filter Procedures." Wat.Res.20:525-530. I Sc:heffe, H. 1959. The Analysis of Variance. John Wiley & Sons, New York. VanGuelpen, L., D.F. Markle, and D.l. Duggan. 1982. "An Evaluation of Accuracy, Precision, and Speed of Several Zooplankton Subsampling Techniques." J.Cons.lnt.Explor.Mer. I 40:226-236. Versar, Inc., ESM Operations. 1987. Evaluating the Effectiveness of Outages: Phase //I Report. I Prepared for Consolidated Edison Company of New York, Inc. I I
I I I I I I I I I I I I I I I I I I I
I I I I I I I I I I I I I I I I I I I
.,j "'- i", -'I i
f.
'- ~ . \ I - :.~
(;-
;r *1 J "~I ~ .' .* I' .- .i~ ~.. " ~
- i 1..;
'!r l "\'. "'I ';',;",1 **I l ! .r* *.* 'i " .,,),
1--J'. _-::>t,
- ~: .L
;"' .* " -l'.' '-l" ; i: ., -.- ~=, ~. £ tV' ><-;1,"
_,1"
- j. t.
1.-- 1.'
.-~ ~ ,
i, L ._
..... l' * " ;.....
- ~ t -
I' f** j>:-i~-~.
". j ~ ;.1.' .J ~.;: _ ~ * ~-;~. "
- i. ..:;:?- ~~:. "
* 'f L .' '. ~ ,- I: .
_.' . 1 ' >,1. 1 :' .
-r.* '.' ~ j I ": ,I~, '. :-., ~"I' "'t
- ':;I*i,
.?-' .f: ,}}