after the third crash, i figured it wasn't a fluke, and tried again with gdb. Just before section 8.8.7, there are a pair of multiline equations. Go into the first, use the arrow keys to get out and in from of the first (starts with der_f2), and enter it & move. *splat* I've included the entire fille (207k), as i can't get it to happen in a partial file (and had trouble repeating one more time in the full file). (gdb) run Starting program: /usr/src/lyx-1_0_x/src/lyx.mm In MapColor [flcolor.c 816] ColormapFull. Using subsititutions LyX: Couldn't get color linen Program received signal SIGSEGV, Segmentation fault. 0x8112a44 in MathedXIter::ipop (this=0x822db44) at math_iter.C:803 803 crow = crow->next; (gsb) bt #0 0x8112a44 in MathedXIter::ipop (this=0x822db44) at math_iter.C:803 #1 0x8122952 in MathedCursor::SetPos (this=0x8255da8, x=-4, y=116) at math_cursor.C:313 #2 0x80e786b in InsetFormula::Edit (this=0x834f0e8, x=0, y=0) at formula.C:483 #3 0x80b5c1a in LyXFunc::Dispatch (this=0x8255218, ac=39, do_not_use_this_arg=0x0) at lyxfunc.C:2268 #4 0x80ac19c in LyXFunc::processKeyEvent (this=0x8255218, ev=0x4007f2ec) at lyxfunc.C:325 #5 0x8154eb5 in LyXView::KeyPressMask_raw_callback (fl=0x824d2a0, xev=0x4007f2ec) at LyXView.C:337 #6 0x400383bf in fl_register_raw_callback () #7 0x40037758 in fl_last_event () #8 0x40037ee2 in fl_treat_interaction_events () #9 0x40037f1c in fl_check_forms () #10 0x8057a57 in LyXGUI::runTime (this=0x82365e8) at lyx_gui.C:629 #11 0x804dfe5 in LyX::LyX (this=0xbffffbfc, argc=0xbffffca0, argv=0xbffffcb4) at ../src/lyx_main.C:128 #12 0x804d88c in main (argc=1, argv=0xbffffcb4) at ../src/main.C:51 (gdb) q The program is running. Exit anyway? (y or n) n Not confirmed. (gdb) c Continuing. lyx: SIGSEGV signal caught Sorry, you have found a bug in LyX. If possible, please read 'Known bugs' under the Help menu and then send us a full bug report. Thanks! lyx: Attempting to save document /home/hawk/Dissertation/dissertation.lyx as... 1) /home/hawk/Dissertation/dissertation.lyx.emergency Save seems successful. Phew. Bye. Program received signal SIGABRT, Aborted. 0x401d1601 in kill () (gdb) c Continuing. Program terminated with signal SIGABRT, Aborted. The program no longer exists. (gdb)
#This file was created by <hawk> Thu Apr 1 16:44:20 1999 #LyX 1.0 (C) 1995-1999 Matthias Ettrich and the LyX Team \lyxformat 2.15 \textclass report \begin_preamble \usepackage{isuthesis} \usepackage{verbatim} \usepackage[normalem]{ulem} \newcommand{\rh}{\uwave} \newcommand{\so}{\sout} \newcommand{\jd}{\uline} \newcommand{\wk}{\uuline}% \newcommand{\jim}{\textsf} %\renewcommand{\jim}{} %\renewcommand{\rh}{} %\renewcommand{\jd}{} %\renewcommand{\wk}{} %\renewcommand{\so}{} \end_preamble \language default \inputencoding default \fontscheme default \graphics default \paperfontsize default \spacing single \papersize letterpaper \paperpackage a4 \use_geometry 1 \use_amsmath 0 \paperorientation portrait \leftmargin 1in \topmargin 1in \rightmargin 2.25in \bottommargin 0.1in \secnumdepth 2 \tocdepth 2 \paragraph_separation indent \defskip medskip \quotes_language english \quotes_times 2 \papercolumns 1 \papersides 1 \paperpagestyle default \layout Title Numerical optimization of recursive systems of equations \layout Author Richard E. Hawkins \layout Abstract Investigation of numerical methods for the optimization of recursive systems, as found in breeding with partial genetic information. \layout Abstract \latex latex \backslash jim{Changes from Jim's suggestions look like this} \layout Abstract \latex latex \backslash wk{Changes from Wolfgang's suggestions look like this} \layout Abstract \latex latex \backslash jd{Changes from Jack's suggestions look like this} \layout Abstract \latex latex \backslash so{Removed text looks like this} \layout Abstract \latex latex \backslash rh{Rick's changes look like this} \layout Abstract Investigation of numerical methods for the optimization of recursive systems, as found in breeding with partial genetic information. \layout Standard \begin_inset LatexCommand \tableofcontents{} \end_inset \layout Chapter Introduction \layout Standard \latex latex \backslash pagenumbering{arabic} \layout Standard Recent years have seen rapid progress in computational technology, genetics, and the animal breeding industry, among others. While computer speed and storgage have increased, \latex latex \backslash jd{ \latex default advances in molecular genetics have made it \latex latex } \latex default possible to test individual animals for the presence of specific genes, and \latex latex \backslash so{ \latex default breeders have \latex latex } \backslash jd{ \latex default The breeding industry has \latex latex } \latex default become more concentrated. These advances can be combined to find more efficient methods of improving genetic progress within breeding herds. \begin_float margin \layout Standard jack: I don't know. where? \end_float Particularly, refinements of the long-established method of dynamic programming to search disjoint subspaces allow the use of genetic testing for individual genes to maximize genetic progress, even when such maximization is analytically impossible. \layout Standard Within living memory,swine were born, raised, lived, and slaughtered primarily on the family farm. They were mostly bred with the farmer's own herd, or with an impressive hog from another nearby farmer. Slaughter would take place on the farm, where the assorted parts of the carcass would be preserved for personal consumption. \bar under \latex latex \layout Standard At the turn of the century, the majority of hogs were sent by rail to a handful of regional slaughter and packing houses to feed the growing population of the cities \begin_inset LatexCommand \cite[ p. 4-5]{Vertical Coordination} \end_inset . Rather than feeding the family, raising hogs for market was a way to begin or expand a farm with relatively small amounts of capital. Still, though, a herd could be maintained and improved by avoiding obvious inbreeding and breeding the sows to the \latex latex \backslash sout{ \latex default easily recognizable superior \latex latex } \latex default sires \latex latex \backslash jd{ \latex default with superior phenotypic characteristics \latex latex } \latex default . \begin_float margin \layout Standard add: -- \begin_inset Quotes eld \end_inset the outwordly observable traits. \begin_inset Quotes erd \end_inset ? \end_float \layout Standard The world has changed since then, and hog production with it. Increasingly, additional traits of the hog were seen to have a genetic basis. Some were to be avoided by careful breeding, while others were to be sought out. Research has found better ways to breed the hog, and the predominant family operation of farrow to finish is being replaced by a system with separate operations for farrowing, nurseries, and finishing \begin_inset LatexCommand \cite[ p. 4]{Vertical Coordination} \end_inset . For some, it became profitable to buy sucklings to raise for market. \layout Standard While the raising of swine remained primarily a family operation, the production of breeding stock rapidly became a concentrated industry. \latex latex \backslash so{ \latex default Brands arose for these commercial breeders, and t \latex latex } \latex default Today the market is dominated by breeding companies, \latex latex \backslash jd{ \latex default each aggressively marketing their own brand of suckling \latex latex } \latex default . \bar under \bar default These companies are constantly positioning for market share, and can either command a better price, or take a larger share, or both, by a relatively modest improvement.While a tenth of a per cent of additional meat per hog may not have been noticeable to a farmer early in the century, commercial farmers see an effect similar to Rockefeller's reducing the number of tacks used in barrels: a fraction of a penny per hog can add up. Rockefeller saved $60,000 per year by reducing the number of tacks per barrel by one, and a commercial operation reaps a measurable amount by the slightest improvement--and losses for the slightest defect. \layout Standard The market has changed further since then. While artificial insemination of swine was not practical even ten years ago, today **% of hogs are conceived this way \begin_float margin \layout Standard Jawn Lawrence survey \end_float \begin_inset LatexCommand \cite{Christian} \end_inset . While sales in the past were based merely on animal weight, today price is adjusted for carcass quality. Statistical sampling and proxy measures such as the thickness of fat on the back of a hog create price premiums, whereas a excess fat will bring a penalty. \begin_float margin \layout Standard cite? \end_float In a multi-stage operation it is now necessary for each manager to be able to assess the quality of both inputs and outputs, creating further competitive pressure for the industries. \layout Standard While genetics have been indirectly recognized as affecting \latex latex \backslash jd{ \latex default the performance of \latex latex } \latex default animals \latex latex \backslash so{ \latex default since \latex latex } \latex default long before Mendel's age, resulting in selective breeding, the development of modern genetics and molecular biology has meant that an increasing number of genes that affect quality in various manners have been \latex latex \backslash so{ \latex default found. \latex latex } \latex default \latex latex \backslash jd{ \latex default identified. \latex latex } \latex default The estrogen receptor gene \begin_float margin \layout Standard xref to example? \end_float is known to increase litter size \begin_inset LatexCommand \cite{Rothschild} \end_inset , while a stress gene causing fainting \latex latex \backslash jd{ \latex default and reduced meat quality \latex latex } \latex default has also been identified.Not only are these genes recognized, but an individual animal may be tested \emph on prior \emph default to breeding to avoid less desirable progeny. \layout Standard While the famous example of Mendel's Peas concerned a \emph on qualitative trait, \emph default \begin_inset LatexCommand \index{qualitative trait} \end_inset or one that is either present or not present, most genes of interest \latex latex \backslash jd{ \latex default in livestock genetics \latex latex } \latex default concern \emph on quantitative traits \emph default \begin_inset LatexCommand \index{qualitative trait} \end_inset , or those which take a range of values, such as animal size. Rather than fully governing outcome, a gene may be one of thousands which \latex latex \backslash jd{ \latex default , along with the environment, \latex latex } \latex default contributes to the size of an animal, some with relatively minor individual effect, and others with strong effect. If these more important genes can be recognized, the possibility exists of identifying a better rule for selecting animals to breed, yielding an increase in quality without an accompanying increase in cost. \layout Standard Better selection decisions have a direct economic impact. An improvement in the current generation also improves all subsequent generatio ns--an extra 1% profit from improved genetics in this year's production raises all future years. which in turn increases the value of the farm by 1%. That is, the change is permanent--as \latex latex \backslash so{ \latex default well as \latex latex } \backslash jd{ \latex default is the increase in \latex latex } \latex default the value of the farm \latex latex \backslash so{ \latex default , by 1% \latex latex } \latex default . More importantly, with the improved genetics, the wealth of the operation is improved by a quantifiable amount. \layout Standard Current breeding methods are based on \emph on phenotypic \emph default \begin_inset LatexCommand \index{phenotype} \end_inset information, the observable traits of an animal. From this information, a \emph on breeding value \emph default \begin_inset LatexCommand \index{breeding value} \end_inset is estimated, or \latex latex \backslash jd{ \latex default the \latex latex \latex default statistical \latex latex } \latex default expected value of the \latex latex \backslash jd{ \latex default collective \latex latex } \latex default effect of the genes passed by the animal to its progeny. The goal is \latex latex \backslash jd{ \latex default to maximize rates of \latex latex } \latex default genetic improvement, an approach which is closely related methodologically to that of optimizing profits; in fact, it is a subset. In the simplest case, revenue is strictly a multiple of the quantity produced-- pounds of milk, for example--and the cost of testing is very low. In these cases, revenue is a simple multiple of the breeding value, production costs are taken as constant, and the optima for genetic improvement are also the economic optima. \layout Standard More complicated cases can be handled as well. Quality as well as quantity can be handled: leaner pork may fetch a higher price per hundredweight. Sales volume may depend upon quality: there is more demand, as well as a higher price, for superior semen for artificial insemination. Finally, there may be a \begin_inset Quotes eld \end_inset brand \begin_inset Quotes erd \end_inset premium in having fixed a gene as present in the breeder's animals that exceeds the direct value of the gene. As with revenues, costs may not be constant. \begin_float margin \layout Standard Jim: not just in this example \end_float It will frequently be the case that testing for the presence of genes in individual animals imposes a significant cost, and that the number of generatio ns to test becomes a variable to optimize. The quality of the animal may increase or lower costs, as well: a sow with larger litters may have increased veterinary costs, partially offsetting the gains, while optimizing for disease resistance could be expected to reduce the costs of raising the breeding herd and commercial animals. \layout Section The Problem \layout Standard While Mendel created a discipline \begin_float margin \layout Standard Jim questions \begin_inset Quotes eld \end_inset discipline \begin_inset Quotes erd \end_inset \end_float with wrinkled and unwrinkled peas, today's geneticist faces \latex latex \backslash so{ \latex default harder \latex latex } \latex default \latex latex \backslash jim{ \latex default more \latex latex complex} \latex default problems. Creatures have many traits, most of which are influenced by large numbers of genes--enough that they may frequently be treated as having infinite count. In recent years an increasing number of these genes have been identified. \latex latex \backslash so{ \latex default but usually \latex latex } \backslash jim{ \latex default However, they still \latex latex } \latex default work in concert with a large number of as yet undiscovered genes \begin_inset LatexCommand \cite{need cite} \end_inset . \layout Standard Given that a gene contributing significantly to a quantitative trait of interest can be detected, it seems likely that this information can be used to improve the herd. Particularly, the best possible improvement \emph on using \emph default the information will be \emph on at least \emph default as good as without the information. The question is then \emph on how \emph default to use the genetic information. The question as to \emph on how \emph default to use the information is not easy; the first proposed rules, using \begin_inset Quotes eld \end_inset genotypic selection, \begin_float margin \layout Standard More explanation? \end_float \begin_inset Quotes erd \end_inset have found \emph on lower \emph default long-term performance using the genetic information \begin_inset LatexCommand \cite{Gibson} \end_inset . \layout Standard If a gene is designated as \begin_inset Formula \( B \) \end_inset \latex latex \backslash jim{ \latex default when \latex latex } \latex default present, and \begin_inset Formula \( b \) \end_inset \latex latex \backslash jim{ \latex default when \latex latex } \latex default not present, there are three \latex latex \backslash jim{ \latex default \begin_inset Quotes eld \end_inset genotypes, \begin_inset Quotes erd \end_inset or \latex latex } \begin_float margin \layout Standard quotes or \emph on italics \emph default for terms? \end_float types of animals: \begin_inset Formula \( bb \) \end_inset , \begin_inset Formula \( bB \) \end_inset , and \begin_inset Formula \( BB \) \end_inset . A naive approach would be to breed only the \begin_inset Formula \( BB \) \end_inset 's, assuming that the gene is desirable. However, this is far from optimal. Suppose that each \begin_inset Formula \( B \) \end_inset is worth \begin_inset Formula \( 1 \) \end_inset , and that the unknown genes yield a standard-normal distribution for the trait. Approximately 5% of the \begin_inset Formula \( bb \) \end_inset 's will draw a value greater than \begin_inset Formula \( 2 \) \end_inset from the distribution, while half of the \begin_inset Formula \( BB \) \end_inset 's will draw a negative number. That 5% is clearly more desirable than the lower half; it is desirable to keep some of each. \layout Standard The question remains, however, as to the optimal combination. By selectively breeding, and with a litter size of 10, a gene can be brought from a frequency of 5% to 99% in about 5 generations \begin_inset LatexCommand \ref{Results} \end_inset . This means that if the program were to last for ten generations, with only the last generation of concern, the first five could be spent merely looking for high values from the normal distribution, with the last five spent increasing the frequency of the \begin_inset Formula \( BB \) \end_inset gene. This is not the optimal pattern, but is offered to show that the animals without the favored gene remain of value in maximizing the trait in question. Further, a choice must be made as to which \begin_inset Formula \( BB \) \end_inset 's to breed. \layout Standard While many classes of breeding rules exist, those to be considered here will select animals by \emph on truncation selection \begin_inset LatexCommand \index{truncation selection} \end_inset \emph default within each of the \emph on genotypes \begin_inset LatexCommand \index{genotype} \end_inset \emph default , such as \begin_inset Formula \( bB \) \end_inset : All creatures above the threshold quality, or \emph on estimated breeding value \emph default \begin_inset LatexCommand \index{breeding value} \end_inset \begin_inset LatexCommand \index{estimated breeding value} \end_inset , within that genotype breed, and are mated randomly amongst all breeding creatures. \begin_float footnote \layout Standard Faster progress could be made by selecting mates. However, this would introduce concerns about inbreeding, making the problem far more complicated. It is prudent to first solve the simple problem, and then approach the more complicated problem with the knowledge gained. \end_float Further, it will usually be assumed that the herd is arbitrarily large. As such, the mean breeding value \begin_inset LatexCommand \index{breeding value} \end_inset in the following generation, as well as all other variables of concern, are degenerate random variables. \layout Standard Preliminary work has already been done in this area. Dekkers, and van Arendonk, consider the case of a single \latex latex \backslash jim{ \emph on \latex default quantitative trait locus, \emph default or \latex latex } \latex default QTL, \latex latex \backslash jim{ \latex default at which a detectable gene is located, \latex latex } \latex default and use optimal control to find the maximum improvement in the final generation of an infinite herd \begin_inset LatexCommand \cite{Dekkers} \end_inset . \layout Standard Two basic scenarios, a small finite number of periods, and an infinite horizon using consider the case of a single QTL, and use optimal control to find the maximum improvement in the final generation of an infinite herd \begin_inset LatexCommand \cite{Dekkers} \end_inset . \layout Standard Two basic scenarios, a small finite number of periods, and an infinite horizon using present (discounted) value \begin_inset LatexCommand \index{present value} \end_inset , will be used, and each will be examined both with optimal control and numeric methods. In the simplest model, only the genetic improvement over time is considered. This problem is computationally harder, due to stiff Hessian matrices, and is of value in testing the robustness of the algorithms. Furthermore, an analytic solution is known from the work of Dekkers and van Arendonk \begin_inset LatexCommand \cite{Dekkers} \end_inset . \layout Subsection The Breeding Industry \layout Standard \latex latex \backslash jim{ \latex default add something about this \latex latex } \layout Subsection Economic Model \layout Standard \latex latex \backslash jim{ \latex default The breeding model translates directly to an \latex latex } \latex default economic model by \latex latex \backslash so{ \latex default can be made by \latex latex } \latex default using the present value for a finite number of generations. Revenues are the present discounted value of some function, possibly linear, of the trait in question plus any brand premium, \begin_inset Formula \begin{equation} \sum ^{\infty }_{t=0}\rho ^{t}\left[ R(\bar{A}_{t})+B\left( \bar{A}_{t},p_{t}\right) \right] \end{equation} \end_inset \begin_float margin \layout Standard make \begin_inset Formula \( \bar{A} \) \end_inset a function? \end_float , where the revenue \begin_inset Formula \( R \) \end_inset reflects the value of animals of the grade indicated by the breeding value, \begin_inset Formula \( \bar{A}_{t} \) \end_inset \latex latex \backslash jim{ \latex default in generation \begin_inset Formula \( t \) \end_inset which depends upon gene frequency \begin_inset Formula \( p_{t} \) \end_inset and other cators \latex latex } \latex default , and the brand premium \begin_inset Formula \( B \) \end_inset is an amount, possibly zero, paid beyond the revenue at that same level when the gene is fixed, i.e., gene frequency \begin_inset Formula \( p_{t}=1 \) \end_inset . \layout Standard Costs may be divided into three significant pieces: fixed costs, costs based on the quality of the herd, and testing costs. As it is assumed that the size of the herd is already chosen, and that a full herd will be kept, fixed costs may be ignored in all cases \latex latex \backslash jim{ \latex default , as they are the same \latex latex } \latex default . Cost is discounted in the same manner as revenue. \layout Standard \latex latex \backslash so{ \latex default Quality cost is a marginal cost, measuring from the fixed cost,that may be either positive or negative. If the objective of breeding is to increase litter size, this cost will likely be greater than zero, due to increased veterinary costs for the extra offspring. However, if the object is disease resistance, veterinary costs should drop as the breed becomes stronger. \latex latex } \layout Standard \latex latex \backslash jim{ \latex default Quality cost is a marginal cost, measuring from the fixed cost,that may be either positive or negative. Disease resistance, for example will reduce vetrinary costs, yielding a negative quantity cost. On the other hand, increasing litter size will increase costs, as there are more animcals to feed and house. This increased cost, however, should be more than offset by the increased revenue from the extra animals. Similarly, breeding for increased milk yield or animal size may cause the animals to eat more, raising costs while producing aditional salable product. \latex latex } \layout Standard The final type of cost is that of testing. \latex latex \backslash so{ \latex default In some cases, testing may be continued forever, and treated as a fixed cost. However, in many real-world cases, the cost is significant. \latex latex \latex default It is likely that testing may be valuable for several generations, after which the gene is to be fixed in the population. This should happen once future gains from testing exceed future costs. \latex latex } \latex default \latex latex \backslash jim{ \latex default If testing were to continue forever, it could be treated as a fixed cost of the enterprise. However, this is unlikely: unless testing can be done by a casual visual inspection, such as a differnt color of animal, it will have at least \emph on some \emph default cost. Once the gains from use of this genetic information, as compared to the program which would be used without the information, exceed the cost of testing, testing will cease. \latex latex } \layout Standard Combining these \latex latex \backslash jim{ \latex default revenue streams and costs \latex latex } \latex default yields the value of the enterprise, \begin_inset Formula \begin{equation} \sum ^{\infty }_{t=0}\rho ^{t}\left[ R(\bar{A}_{t})+B\left( \bar{A}_{t},p_{t}\right) -F_{t}-Q\left( \bar{A}_{t},p_{t}\right) -T\left( t\right) \right] \end{equation} \end_inset which is to be maximized \latex latex \backslash jim{ \latex default , where \begin_inset Formula \( \rho \) \end_inset is the discount factor, \begin_inset Formula \( F_{t} \) \end_inset is the fixed cost in that time period, \begin_inset Formula \( Q \) \end_inset the quality cost, and \begin_inset Formula \( T \) \end_inset the cost of testing. Only models in which \begin_inset Formula \( T \) \end_inset does not change until testing is halted, after which it becomes \begin_inset Formula \( 0 \) \end_inset , will be considered; strategies such as testing alternate generations will be ignored. \latex latex } \layout Subsection Analytic Solutions \layout Standard \begin_float margin \layout Standard Jim: i don't know. Delete section? \end_float In the simplest case, with a finite horizon, Dekkers and van Arendonk \begin_inset LatexCommand \cite{Dekkers} \end_inset have used optimal control theory to show an analytic solution, solving recursively from the final period. Chapter [ \begin_inset LatexCommand \ref{sec:analyticSolutions} \end_inset ] finds that an analytic solution does not exist for even the simplest case with an infinite horizon--while the solution exists, it has one too many variables to solve. \layout Subsection Computational Solutions \layout Standard In the case of a finite number of generations, simple second order methods are inadequate to solve the problem ( \latex latex \backslash S \latex default \begin_inset LatexCommand \ref{NR/SFG} \end_inset , \emph on infra) \emph default . \begin_float margin \layout Standard Jim: ?? \end_float Choices made in the early generations have very little weight in the final generation, and the Hessian matrix becomes stiff. \begin_inset LatexCommand \index{matrices, stiff} \end_inset This can be solved by augmented Newton methods \begin_inset LatexCommand \index{Newton Methods, Augmented} \end_inset , a genetic algorithm \begin_inset LatexCommand \index{genetic algorithm} \end_inset , or both. \layout Standard The infinite horizon presents a simpler problem in some ways. The discounting of the future reduces the stiffness found in the simple model--but will make the \begin_inset Quotes eld \end_inset far off \begin_inset Quotes erd \end_inset generations nearly irrelevant, rather than early generations. The key is in determining how many generations are necessary to be a \begin_inset Quotes eld \end_inset large number. \begin_inset Quotes erd \end_inset That is, is it fifteen, a hundred, or five hundred generations that must be considered to approximate infinity? \layout Standard Two broad categories of computational solutions are sought: conclusions to partial analytic solutions, and \begin_inset Quotes eld \end_inset brute force \begin_inset Quotes erd \end_inset methods which can handle methods not at all amenable to analytic solution. The first category is preferable when possible, but will probably require analytic work for each variation of the problem proposed. However, these methods will use known and established methods to complete problems, and will be certain of achieving optimal solutions. \latex latex \backslash so{ \latex default However the \latex latex } \latex default \latex latex \backslash jim{ \latex default Unfortuneately \latex latex } \latex default , such partial solutions generally do not exist for problems of economic interest. Accordingly, finding methods for the second class are also desirable. In some cases, it may be possible to provide a proof of convergence for a class of problems, and in others it may not. However, the methods likely have value even when proof is impossible: while it may not be possible to guarantee optimality, it remains simple to compare the proposed solution to the best \emph on existing \emph default solution--for a \latex latex \backslash so{ \latex default real \latex latex } \latex default breeder, an improvement upon current output is valuable, even if there is an unknowable better improvement. \layout Section Objectives \layout Standard \latex latex \backslash so{ \latex default A series of results are \latex latex } \latex default \latex latex \backslash jim{ \latex default The results \latex latex } \latex default expected \latex latex \latex default from this line of research \latex latex \backslash jim{ \latex default are \latex latex } \latex default : \layout Enumerate An evaluation of the economic value of genetic information \layout Enumerate The devlopment of an approach to quantiatively evaluate gentic improvements in a herd. \layout Enumerate Application of the program to one or more specific cases of identified pork genes. \layout Standard To achieve these objectives, the following sub-objectives must be accomplished: \layout Enumerate A concise formulation of the recursive optimization problem. \layout Enumerate An algorithm or methodology which can find the optimal values for the control variables. \layout Enumerate An implementation of this algorithm for the specific case of maximizing the present discounted value of a herd in which one or more QTL's have been identified, considering price premiums for quality and the cost of testing. The program code for this algorithm should be as modular as possible to allow re-use for other problems. \layout Enumerate \latex latex \backslash so{ \latex default Application of the program to one or more specific cases of identified pork genes. \latex latex } \layout Chapter \latex latex \backslash so{ \latex default Literature Review \latex latex } \layout Section \latex latex \backslash so{ \latex default Genetics \latex latex } \layout Section \latex latex \backslash so{ \latex default Pork Industry \latex latex } \layout Section \latex latex \backslash so{ \latex default Analytic Method \latex latex } \layout Standard \begin_float margin \layout Standard should this even be here? \end_float \layout Section \latex latex \backslash so{ \latex default Computational Methods \latex latex } \layout Standard \latex latex \backslash so{ \latex default The literature is nearly empty of applicable models, essentially stopping with linear-quadratic problems. \latex latex } \layout Subsubsection \latex latex \backslash so{ \latex default Linear-Quadratic \begin_inset LatexCommand \index{linear-quadratic} \end_inset \latex latex } \layout Standard \latex latex \backslash so{ \latex default refholding spot \latex latex } \layout Subsubsection \latex latex \backslash so{ \latex default Quasi-Newton \begin_inset LatexCommand \index{quasi-Newton} \end_inset \latex latex } \layout Subsubsection \latex latex \backslash so{ \latex default Genetic Algorithms \begin_inset LatexCommand \index{Genetic Algorithm} \end_inset \latex latex } \layout Chapter Genetics \layout Standard Some standard assumptions are made while working with theoretical genetics. These assumptions will generally apply to very large groups as well. Fundamentally, they are large sample results from the Central Limit Theorem, at such a sample size that variance of the population mean.has dropped to zero. \layout Section Structure of the Model \layout Standard Given an initial population in period \begin_inset Formula \( 0 \) \end_inset , it is sought to maximize the average phenotypic value \begin_inset Formula \( B \) \end_inset of the animals \begin_inset Formula \( T \) \end_inset generations later, where the phenotypic value is the total effect from genotype \begin_inset Formula \( g \) \end_inset , polygenic value \begin_inset Formula \( A \) \end_inset , and environment \begin_inset Formula \( E \) \end_inset : \begin_inset Formula \begin{equation} B=g+A+E \end{equation} \end_inset \begin_inset Formula \( E \) \end_inset will be assumed to be \begin_inset Formula \( 0 \) \end_inset in all cases. \layout Section The Breeding Value and Total Genetic Value \layout Standard There are two contributors to the breeding value: the major genes, and the polygenes. As the population is arbitrarily large, each range of values is present in its statistically expected value. It is assumed that it is the mean of the phenotypic value for the entire population that is of interest, rather than the traits of individuals. In a large population, the average breeding value and phenotypic value are the same. For example, it is the milk production of a \latex latex \backslash jim{ \latex default dairy \latex latex } \latex default herd that is most relevant, rather than how much a particular cow produces. \begin_float footnote \layout Standard Taken largely from \begin_inset LatexCommand \cite{Dekkers} \end_inset , pp **. \end_float The discussion that follows considers only a single locus with two alleles, for the sake of simplicity. Except as noted, the concepts carry over directly to the case of multiple loci. \layout Standard These genotypes are tagged by the variable \begin_inset Formula \( m \) \end_inset , taking the values 0, 1, and 2, referring to the number of times the favorable allele is present. \begin_inset Formula \( g_{m} \) \end_inset refers to the genotypic value of the gene, and takes the values \begin_inset Formula \( \left\{ -a,0,a\right\} \) \end_inset for \begin_inset Formula \( m=\{0,1,2\} \) \end_inset \latex latex \backslash jim{ \latex default in the case of additive major genes, which will be the primary case considered. An asymmetric responce, such as \begin_inset Formula \( \left\{ -b,0,a\right\} \) \end_inset is also possible. \latex latex } \begin_float margin \layout Standard switch order of b,a? \end_float \layout Standard \latex latex \backslash jim{ \latex default Allowing \begin_inset Formula \( i \) \end_inset to index individual animals, \latex latex } \latex default the polygenic breeding values of \latex latex \backslash jim{ \latex default an \latex latex } \latex default animal can be estimated by \begin_inset Formula \( \hat{A}_{imt} \) \end_inset , the heritability of the trait not accounting for the major gene. In the absence of a stochastic contribution, this estimate is the actual value. \begin_inset Formula \( \hat{A}_{imt} \) \end_inset is represented as a deviation from \begin_inset Formula \( \bar{A}_{t} \) \end_inset . \begin_inset Formula \[ \hat{A}_{imt}=h^{2}\left( P-g\right) \] \end_inset \latex latex \backslash jim{ \latex default where \begin_inset Formula \( P \) \end_inset is the observed phenotypic value of the animal, and \begin_inset Formula \( h^{2} \) \end_inset is the \emph on heritability \emph default \begin_inset LatexCommand \index{heritability} \end_inset , or the portion of polygenic value which is passed to the next generation \latex latex .} \emph on \latex default \emph default Letting \begin_inset Formula \( b_{mt} \) \end_inset be a weight for genotype \begin_inset Formula \( m \) \end_inset in generation \begin_inset Formula \( t \) \end_inset , the animal has a selection value \begin_inset Formula \begin{equation} \label{simpleselectionvalue} I_{imt}=b_{mt}g_{m}+\hat{A}_{imt} \end{equation} \end_inset For genotypic selection, \begin_inset Formula \( b_{mt}=1 \) \end_inset . In the absence of gametic phase disequilibrium \begin_inset LatexCommand \index{disequilibrium, gametic phase} \end_inset , or correllation between polygenic value and the major gene, the variance of the total genetic value is the same for each of the three groups, while the mean is different, resulting in distributions as seen in Figure \begin_inset LatexCommand \ref{simpleselectionvalue} \end_inset \layout Standard \begin_float fig \layout Standard \align center \begin_inset Figure size 208 174 file graphics/threedists.ps width 4 70 angle -90 flags 9 \end_inset \layout Caption Three Distributions With Same Variance \begin_inset LatexCommand \label{threedists} \end_inset \end_float \layout Standard In each generation, the frequency of the major gene is expressed by \begin_inset Formula \( p_{t} \) \end_inset . A value of 0 would mean that the entire population were homozygotes without the favorable allele, and a value of 1 would mean that the entire population had it twice. In equilibrium, there will be \begin_inset Formula \( p_{t}^{2} \) \end_inset homozygotes with the favorable allele twice \begin_inset Formula \( (AA) \) \end_inset , \begin_inset Formula \( 2p_{t}(1-p_{t}) \) \end_inset heterozygotes \begin_inset Formula \( (Aa) \) \end_inset , and \begin_inset Formula \( (1-p_{t})^{2} \) \end_inset homozygotes without it \begin_inset Formula \( (aa) \) \end_inset . Expressing the average polygenic breeding value of the population as \begin_inset Formula \( \bar{A}_{t} \) \end_inset , the average breeding value can be expressed as \begin_inset Formula \begin{equation} \bar{G}_{t}=a(2p_{t}-1)+\bar{A}_{t} \end{equation} \end_inset Also, \begin_inset Formula \( \bar{A}_{mt} \) \end_inset refers to the average polygenic value of animals with major genotype \begin_inset Formula \( m \) \end_inset in generation \begin_inset Formula \( t \) \end_inset . \layout Standard Combining, the state of the system at time \begin_inset Formula \( t \) \end_inset is fully described by the values \begin_inset Formula \( \{\bar{A}_{t},p_{t},\sigma ^{2}_{pt},\mu _{pt}\} \) \end_inset where \begin_inset Formula \( \mu _{pt} \) \end_inset and \begin_inset Formula \( \sigma ^{2}_{pt} \) \end_inset are the mean and variance of the polygenic distribution at time \begin_inset Formula \( t \) \end_inset . If disequilibrium is considered, then this set must be expanded to account for the fact that the polygenic distribution is different for each of the three genotypes, and the set becomes \begin_inset Formula \( \{\overline{A}_{t},p_{t},\sigma _{mpt},\mu _{mpt}\} \) \end_inset . Finally, if optimization is being done with a finite time horizon, the number of remaining generations becomes important, and \begin_inset Formula \( T-t \) \end_inset is needed as well. \layout Standard The problem is then to choose values for \begin_inset Formula \( b_{mt} \) \end_inset to maximize \begin_inset Formula \( A_{T} \) \end_inset , which is homomorphic with choosing \begin_inset Formula \( x_{mt} \) \end_inset or \begin_inset Formula \( f_{mt} \) \end_inset . \layout Section Standard Types of Breeding \layout Subsubsection Truncation \layout Standard Selection is by truncation of the breeding intensity \begin_inset Formula \( I \) \end_inset . For each \begin_inset Formula \( m \) \end_inset , a cutoff point \begin_inset Formula \( x_{mt} \) \end_inset is chosen. Animals with \begin_inset Formula \( I_{imt}>x_{mt} \) \end_inset are selected to breed for the next generation, while the rest do not breed. The breeding animals then randomly choose mates. \begin_float footnote \layout Standard Choosing mates for them increases problems with inbreeding of other genes. \end_float This cutoff point can also be expressed as a fraction, \begin_inset Formula \( f_{mt} \) \end_inset , which describes the portion of animals of that type bred. Then \begin_inset Formula \begin{equation} f_{mt}=1-F_{mt}(x_{mt}) \end{equation} \end_inset where \begin_inset Formula \( F_{mt} \) \end_inset is the cumulative distribution function for \begin_inset Formula \( I_{fmt} \) \end_inset . \layout Subsubsection Mass Selection \layout Standard In mass selection, \begin_inset Formula \( b_{mt}=h^{2} \) \end_inset for all \begin_inset Formula \( m \) \end_inset . Thus \begin_inset Formula \begin{equation} I=h^{2}g_{m}+h^{2}\left( P-g_{m}\right) =h^{2}P \end{equation} \end_inset The genotypic information is simply ignored, and the same truncation point is used for each genotype.. This is the simplest selection method. \layout Subsubsection Genotypic Selection \layout Standard For genotypic selection, \begin_inset Formula \( b_{mt}=1 \) \end_inset for all \begin_inset Formula \( m \) \end_inset . This is an initial attempt to take the genotypes into account. Thus \begin_inset Formula \begin{equation} I=1g_{m}+h^{2}\left( P-g_{m}\right) =\left( 1-h^{2}\right) g_{m}+h^{2}P \end{equation} \end_inset \layout Subsection disequilibrium \layout Standard After a selection with different cutoff points for the different genotypes, a negative correlation between the polygenic values and the major genotypes of parents will exist. This disequilibrium is not currently addressed, and needs to be added. \layout Subsection Variance \layout Standard Simple models assume that changes in the mean breeding value are small, and that therefore changes in the variance of the breeding orders are of second order smallness, and need not be considered. However, the models considered here attempt to maximize the change that can be made, and this would appear to no longer be a reasonable assumption. The changes in the mean are a function of the variance. If the variance drops significantly due to selection, gains from selection will be overstated \latex latex \backslash jim{ \latex default , as polygenic improvement is proportonal to polygenic variance. \latex latex } \layout Standard There are two sources of change to the variance of the polygenic distribution. The first is the Bulmer effect \begin_inset LatexCommand \index{Bulmer effect} \end_inset \begin_inset LatexCommand \cite[pp. 126-131]{Bulmer} \end_inset , which is a reduction in variance due to gametic phase disequilibrium. Breaking this down, \begin_float margin \layout Standard I expect we strike or totally rewrite this. Only discuss changeif we do 5-d model. \end_float \begin_inset Formula \begin{equation} \sigma _{gt}^{2}=p_{t}\left( 1-p_{t}\right) \end{equation} \end_inset \begin_inset Formula \begin{eqnarray} \sigma ^{2}_{pt} & = & E\left[ P^{2}_{t}-E\left[ P_{t}\right] ^{2}\right] \\ E\left[ P_{t}\right] & = & p_{t}^{2}E\left[ P_{t}^{2}|m=2\right] +2p_{t}\left( 1-p_{t}\right) E\left[ P_{t}^{2}|m=1\right] +\left( 1-p_{t}\right) ^{2}E\left[ P_{t}^{2}|m=0\right] \end{eqnarray} \end_inset where \begin_inset Formula \( E\left[ \right] \) \end_inset is the expectation operator. The total variance can be expressed as \begin_inset Formula \begin{equation} \sigma ^{2}_{At}=\sigma ^{2}_{gt}+\sigma ^{2}_{pt}+2\rho _{t}\sigma _{gt}\sigma _{pt} \end{equation} \end_inset where the subscripts \begin_inset Formula \( A \) \end_inset , \begin_inset Formula \( g \) \end_inset , and \begin_inset Formula \( p \) \end_inset indicated the breeding, genotypic, and polygenic distributions. While this can be taken further, it suffices for the present to observe that the variance is changing due to selection, and that the simple model does not account for this. \layout Chapter The Pork Industry \layout Chapter Dynamic Programming \layout Standard \begin_float margin \layout Standard absorb portions of Methods into this chapter \end_float Computational Methods \layout Standard The literature is nearly empty of applicable models, essentially stopping with linear-quadratic problems. \layout Subsubsection Linear-Quadratic \begin_inset LatexCommand \index{linear-quadratic} \end_inset \layout Standard refholding spot \begin_inset LatexCommand \label{scantlit} \end_inset \layout Subsubsection Quasi-Newton \begin_inset LatexCommand \index{quasi-Newton} \end_inset \layout Subsubsection Genetic Algorithms \begin_inset LatexCommand \index{Genetic Algorithm} \end_inset \layout Chapter \latex latex \backslash jim{ \latex default The Model \latex latex } \layout Section \latex latex \backslash jim{ \latex default The Analytic Problem \latex latex } \layout Standard Mathematically, these problems \latex latex \backslash so{ \latex default of interest \latex latex } \latex default present uncommon problems for both analytic and numerical optimization. Particularly, the recursive \begin_inset LatexCommand \index{recursive} \end_inset nature of the choice space twists the meanings and relevance of the commonly used jacobian and Hessian matrices, as well as making the choice space more difficult to define. In fact, there is almost no literature for numerical methods unless the entire system can be well approximated as linear-quadratic (see \begin_inset LatexCommand \ref{scantlit} \end_inset ). \begin_float margin \layout Standard fix this ref--to dp literature \end_float \layout Section General Description of \latex latex \backslash so{ \latex default Covered \latex latex } \latex default Models \layout Standard A single class of models will be considered, although the definition of this class is large enough to cover topics of interest in several fields. Within this class of model, application will be made to genetic selection rules. \layout Enumerate A known initial state, \begin_inset Formula \( S_{0} \) \end_inset , fully describing the system at the initial time, \begin_inset Formula \( t_{0} \) \end_inset . \layout Enumerate A choice set, \begin_inset Formula \( \mathcal{A}_{t} \) \end_inset , for all time periods, which can be expressed as a function of \begin_inset Formula \( S_{t} \) \end_inset . \layout Enumerate Determinism. Given the prior state and choice made, it must be possible to calculate the current state. \begin_float footnote \layout Standard There is no reason that the state cannot include informations from prior state \end_float \layout Enumerate A single-valued fitness function for the entire model. This function should be at least piecewise continuous. \begin_float margin \layout Standard where should discussion of this go? \end_float \layout Subsection* The State \layout Standard The system must have a fully defined state for each time period. This state includes all of the variables for that time period, and if relevant, those from prior time periods. Among these variables are those used to determine the state of the following generation, and one or more which are used for that period's contribution to the total fitness. Note that for a model with a finite number of generations, the state will generally include the number of generations remaining, and thus such models always lack the Markov property \begin_inset LatexCommand \index{Markov property} \end_inset . \layout Subsection* The Choice Space \layout Standard In each model, there is a choice set in each time period, which itself is determined by choices made in prior periods. That is, if \begin_inset Formula \( \mathcal{C}_{t} \) \end_inset is the set of choices, or action set \begin_inset LatexCommand \index{action set} \end_inset , available at time \begin_inset Formula \( t \) \end_inset , and \begin_inset Formula \( c_{t} \) \end_inset is the choice actually made, \begin_inset Formula \( \mathcal{C}_{t} \) \end_inset itself is determined by \begin_inset Formula \( \left\{ c_{s}:s<t\right\} \) \end_inset , \latex latex \backslash jim{ \latex default as in Figure [ \begin_inset LatexCommand \ref{choicespacefig1} \end_inset ] \latex latex } \latex default . That is, \layout Standard \begin_inset Formula \begin{equation} \mathcal{C}_{t}=\mathcal{C}_{t}\left( c_{t-1}|\mathcal{C}_{t-1}\left( c_{t-2}\ldots c_{0}|\mathcal{C}_{0}\right) \right) \end{equation} \end_inset \begin_float fig \layout Standard \begin_inset Figure size 129 132 file choices.eps flags 1 \end_inset \layout Caption \begin_inset LatexCommand \label{choicespacefig1} \end_inset The choice taken determines the next choice space \end_float \layout Standard This \latex latex \backslash so{ \latex default gives the difficulty \latex latex that} \latex default \latex latex \backslash jim{ \latex default complicates matters, as \latex latex } \latex default a change at time \begin_inset Formula \( s \) \end_inset means that there is not only an interaction between \begin_inset Formula \( c_{t} \) \end_inset and \begin_inset Formula \( c_{s} \) \end_inset , which can be handled by considering cross partial derivatives, but that \begin_inset Formula \( c_{s} \) \end_inset actually \emph on determines \emph default \begin_inset Formula \( \mathcal{C}_{t} \) \end_inset , and thus to speak coherently about effects from changing \begin_inset Formula \( c_{s} \) \end_inset require calculating the changes in \begin_inset Formula \( \left\{ c_{t},\mathcal{C}_{t}:t>s\right\} \) \end_inset that result. \layout Standard \begin_float margin \layout Standard move these 2 paragraphs? \end_float The \emph on state \emph default of the system can be fully described by the number of periods remaining in the program (if finite), the current frequency of the gene identified, or \emph on major gene \emph default \begin_inset LatexCommand \index{major gene} \end_inset , the mean of the unknown genes, or \emph on polygenes \emph default \begin_inset LatexCommand \index{polygene} \end_inset , the polygenic variance, and the correlation between the genotype and polygenic distribution. \layout Standard Each period, a choice is made as to the proportion of each group to be bred must be made. An additional choice of whether or not to test can be made, as well: it may well be the case that at some level of improvement, the benefits of the information no longer outweigh the cost. \layout Subsection* Determinism \layout Standard While a stochastic model would certainly have interest, it is first necessary to find appropriate tools for a deterministic system. \layout Subsection* Objective Function \layout Standard It is necessary to have a single objective function for the entire model to optimize. This function could be a value for a single period, or it can be some weighted combination. While continuity is desirable, it is not strictly necessary. However, it must be finite valued at all locations. Genetic algorithms generally have, within limits, some ability to move across discontinuities and to escape from local optima. \begin_float margin \layout Standard need ref \end_float \layout Section Formulation \layout Standard At this time, a simplified version of the genetic model will be used, including some strong assumptions. Particularly, an infinitely large herd with an infinite number of genes is assumed, yielding a normal distribution of the breeding values of the animals. It is further assumed that genetic progress does not cause deviations from normality. Initially, the variance of the distribution will be assumed to be fixed, though this restriction will eventually be relaxed. \begin_float margin \layout Standard will it? \end_float \layout Standard For the case of optimization of genetic improvement with a QTL, the general model can be further specified. The time periods are the generations of breeding, and the state variables are the frequency of the major genes (including covariance), mean and variance of the polygenic distribution, the covariance of the polygenic and major genes (if included), and the breeding value (which is actually a function of the others). The major genes and polygenic effect will be assumed to interact only in a linear manner. \begin_float margin \layout Standard include a discussion of this? \end_float \layout Subsection* Simple Case: One Locus \layout Standard In each generation, there will be the same number of \begin_inset Quotes eld \end_inset kinds \begin_inset Quotes erd \end_inset of creature, as determined by the combinations of major genes. With single locus with only two possible values, there are three types of creatures, namely those with 0, 1, or 2 of the gene. Generally, the number of types is the product of the number of permutations for each locus. \layout Standard The choice set for each generation is the fraction of each kind of creature to be bred to produce the following generation. The sum of the products of these fractions with the frequency of that type of creature must equal \latex latex \backslash jim{ \latex default \begin_inset Formula \( Q \) \end_inset \latex latex } \latex default the fraction of the entire population needed to produce the next generation: \begin_inset Formula \begin{equation} \label{genericqcons} \sum _{m}p_{mt}f_{mt}=Q \end{equation} \end_inset Note that these choices are not fully independent; if there are \begin_inset Formula \( n \) \end_inset types, the first \begin_inset Formula \( n-1 \) \end_inset choices also determine the final choice. Additionally, while the fractions are necessarily in the \begin_inset Formula \( \left[ 0,1\right] \) \end_inset range, not all choices are necessarily possible. For example, if \begin_inset Formula \( .2 \) \end_inset of the population is needed to breed the following generation, and type \begin_inset Formula \( m \) \end_inset has a frequency of \begin_inset Formula \( .4 \) \end_inset , \begin_inset Formula \( f_{mt} \) \end_inset must be chosen from \begin_inset Formula \( \left[ 0,.5\right] \) \end_inset . \layout Standard Given the multivariate distribution of the major and polygenes, \begin_float margin \layout Standard \begin_inset Quotes eld \end_inset major and polygenes \begin_inset Quotes erd \end_inset or \begin_inset Quotes eld \end_inset major and poly- genes \begin_inset Quotes erd \end_inset or ?? \end_float and the frequencies of the kinds, the average polygenic breeding value may be calculated. \begin_float margin \layout Standard include formula here? \end_float Alternatively, and more easily, it can be calculated from it's prior value. For example, with a single major gene, and ignoring gametic phase disequilibriu m \begin_inset LatexCommand \index{disequilibrium, gametic phase} \end_inset , the result is \begin_inset Formula \begin{equation} \bar{A}_{t+1}=\bar{A}_{t}+\frac{\sigma _{t}}{Q}\sum q_{mt}z_{mt} \end{equation} \end_inset \latex latex \backslash jim{ \latex default where \begin_inset Formula \( z_{mt} \) \end_inset is the height of the standard normal distribution at the truncation point. \latex latex } \begin_float margin \layout Standard is \begin_inset Formula \( \sigma _{t} \) \end_inset correct? \begin_inset Formula \( \sigma _{mt} \) \end_inset ? \begin_inset Formula \( h^{2} \) \end_inset \begin_inset Formula \( \sigma \) \end_inset \end_float Similarly, the total breeding value \begin_inset LatexCommand \index{breeding value, total} \end_inset , \begin_inset Formula \( \bar{G}_{t} \) \end_inset can be calculated from \begin_inset Formula \( \bar{A}_{t} \) \end_inset and the gene frequencies at time \begin_inset Formula \( t \) \end_inset . \layout Standard Finally, the objective function will typically be one of two forms: a function, perhaps equality, of \begin_inset Formula \( \bar{G}_{T} \) \end_inset , the value after the final breeding, or a sum of a discounted profit function of the values of the state variables in all generations. The first is appropriate when measuring the maximum genetic progress in a given number of generations. The latter calculates the economic value of a breeding program. Note that it is not necessarily the breeding value alone which is used, but more likely a function of the breeding value. One such case could be a price premium for animals which exceed a certain quality. In this case, the breeding value might represent the amount of meat produced, which could be sold at different prices depending upon how lean it is. Another example would be a fixed price premium for the fixation of a gene in the population, which would introduce a discontinuity into the objective function. \layout Subsection* Simplified Finite Number of Generations \layout Standard In this simplified version, only the mean breeding value of the herd after the final breeding generation is considered. This is a test of the maximum rate of progress over a finite period, but is not economically reasonable: it neglects both the sale of the animals during most of the program, and the residual value of the operation. Further, the present value of the early generations should count for \emph on more \emph default than the final generation, rather than nothing. \layout Standard Nonetheless, this model is useful in developing the numerical methods, and ferrets out potential problems in the methods. It also shows the maximum rate at which genetic progress \emph on could \emph default be made in a fixed number of generations, which is almost surely not the same solution as the economic problem. Additionally, this is the problem for which analytical solution has been demonstrated, and is therefore useful as a check on the accuracy of the methods. \begin_inset LatexCommand \cite{Dekkers} \end_inset \begin_float margin \layout Standard should the solution be included? \end_float \layout Subsection* Discounted Finite Generations \layout Standard The objective function is then the net present value of all future profits, discounted for all periods considered in the model. \layout Standard It should be noted that the discounted finite generations problem is not in itself of interest; it is developed as a tool for the infinite horizon problem. \begin_float margin \layout Standard cross reference this? \end_float \layout Standard The simplest form is to simply discount the revenues of each generation, assuming that costs are fixed and that revenues are a linear function of revenue, e.g., that the breeding value represents milk produced. In this case, the problem is to maximize total profit \begin_inset Formula \begin{equation} \pi =\sum _{t=0}^{T}\left( 1-r\right) ^{t}\bar{G}_{t} \end{equation} \end_inset \latex latex \backslash jim{ \latex default where \begin_inset Formula \( r \) \end_inset \latex latex \latex default is the discount rate, and is equal to \begin_inset Formula \( 1-\rho \) \end_inset \latex latex } \latex default Note that the first generation can be left out of the summation, as profits during that generation are predetermined by the initial state. However, as more complicated cases may have variable costs from choices made, this generation will be left in for the sake of consistency. \layout Standard This formulation, however, still only accounts for a very simple case of a single major gene affecting a single trait. Further, it does not take into account economic factor such as premiums for fixed gene lines, profits, or the ability to terminate testing as a choice variable. To account for such abilities, profitability should be considered. For example, consider a simple case in which a hog has both meat yield \begin_inset Formula \( Y \) \end_inset and leanness \begin_inset Formula \( Z \) \end_inset . At a fixed leanness, or quality, \begin_inset Formula \( Z \) \end_inset , revenues are presumably linear in \begin_inset Formula \( Y \) \end_inset . However, there is no \emph on a priori \emph default reason to believe that revenue is linear in leanness, though it would presumabl y be increasing within some range of interest, and then likely decreasing. Assuming that the desirability of leanness is unimodal, or that there is a single most desirable value, with desirability decreasing above and below this value, weak quasiconcavity should apply to revenues as a function of \begin_inset Formula \( Z \) \end_inset . Finally, it is possible that for a given set of genes for yield, that total meat produced may be different for different levels of leanness. Revenue then becomes a function of leanness, and the problem expands to be that of maximizing \begin_inset Formula \begin{equation} \pi =\sum ^{T}_{t=0}\left( 1-t\right) ^{t}\pi _{t}\left( Y_{t},Z_{t}\right) \end{equation} \end_inset Note that this formulation includes the possibility that costs for change with the size or leanness of the animal. Letting \begin_inset Formula \( \gamma \) \end_inset denote the vector of total genetic and breeding values, this becomes \begin_inset Formula \begin{equation} \pi =\sum ^{T}_{t=0}\left( 1-t\right) ^{t}\pi \left( \gamma _{t}\right) \end{equation} \end_inset \layout Standard Finally, some models will include a price premium or penalty based on the presence of a gene. To remain general, let \begin_inset Formula \( \theta \) \end_inset denote the vector of all gene frequencies and their covariances, and the problem becomes \begin_inset Formula \begin{equation} \label{finsumstate} \pi =\sum ^{T}_{t=0}\left( 1-t\right) ^{t}\pi \left( \gamma _{t},\theta _{t}\right) \end{equation} \end_inset \begin_inset LatexCommand \label{finprofofstat} \end_inset \layout Standard While these optimization problems express the profit function \begin_inset Formula \( \pi \) \end_inset as a function of the state variables, the determinism of the model means that the state variables themselves are functions of the choice variables, \begin_inset Formula \( f_{mt} \) \end_inset . Letting \begin_inset Formula \( \phi \) \end_inset be the vector of fractions selected, equation \begin_inset LatexCommand \ref{finsumstate} \end_inset becomes, \begin_inset Formula \begin{equation} \label{finsumphi} \pi =\sum ^{T}_{t=0}\left( 1-t\right) ^{t}\pi \left( \phi _{t}\right) \end{equation} \end_inset \layout Standard The final modification is to note that the cessation of testing for one or more genes may be included in the model. That is, it is entirely possible that the a point may be reached at which the value of the information from continued testing is less than the cost. Consideration will be limited to cases in which testing occurs in every generation until terminated. As such, the testing choice variable for a gene takes a whole number value. Letting \begin_inset Formula \( \tau _{m} \) \end_inset indicate the the first generation without testing for gene \begin_float margin \layout Standard don't use m. It indicates type. What variable to index loci with? \end_float \begin_inset Formula \( m \) \end_inset , and \begin_inset Formula \( \tau \) \end_inset itself be the entire vector, \begin_inset LatexCommand \ref{finsumphi} \end_inset becomes \begin_inset Formula \begin{equation} \label{finsumtest} \pi =\sum ^{T}_{t=0}\left( 1-t\right) ^{t}\pi \left( \phi _{t},\tau \right) \end{equation} \end_inset It should be noted that selecting \begin_inset Formula \( f_{mt} \) \end_inset after testing stops is nonsensical; there is only one type after this point, from which all must be chosen. \layout Subsection* Infinite Horizon \layout Standard This is the actual economic problem of interest. As an economic model, the herd should be assumed to continue forever--even though the farmer will eventually retire, the discounted value of the remaining infinite horizon reflects the value for which he can sell the herd. This problem is actually easier to solve analytically than the finite horizon-- in the cases in which it \emph on is \emph default soluble. However, unless the problem can be analytically reduced to a single equation or function, it is not possible to solve for an infinite number of generations; a rule must be found for approximation of this horizon. \layout Standard The problem is not as futile as it sounds. With the introduction of discounting, far off \begin_float margin \layout Standard \begin_inset Quotes eld \end_inset far off \begin_inset Quotes erd \end_inset ? colloquial \end_float generations have an increasingly diminished impact. Thus it can be expected that a convergence \begin_inset LatexCommand \index{convergence} \end_inset theorem can be written to the effect that for any desired \begin_inset Formula \( \epsilon \) \end_inset , and for any generation \begin_inset Formula \( T \) \end_inset , that \begin_inset Formula \( N_{\epsilon ,T} \) \end_inset can be chosen sufficiently large that \begin_inset Formula \begin{equation} \left| f_{mt}^{N_{\epsilon ,T}}-f_{mt}\right| <\epsilon \forall t\leq T \end{equation} \end_inset \layout Standard \begin_float footnote \layout Standard \latex latex \backslash so{ \latex default old paragraph: Even in the non-discounted finite case, solutions for adjacent numbers of generations strongly resemble each other. The early generations for ten generations are quite similar to those for fifteen \begin_inset LatexCommand \ref{Results} \end_inset ,which will already be quite close to infinite. \latex latex } \end_float Furthermore, the introduction of a generation in which to cease testing as a choice variable simplifies, rather than complicates, the problem. Once testing ends, selection is by mass selection, in which the animals to breed are selected solely on the observable value of the trait. The genetic progress under mass selection is well known, and thus once the gene is fixed (or even not fixed, but testing ended), there is nothing new to the problem. A \begin_inset Quotes eld \end_inset canned \begin_inset Quotes erd \end_inset function can be written for the value of the entire future after the cutoff generation, as discussed at [] \begin_float margin \layout Standard ref \end_float . This changes the problem from \begin_inset Quotes eld \end_inset find choices for all generations forever, \begin_inset Quotes erd \end_inset with an infinite number of choice variables, to \begin_inset Quotes eld \end_inset choose a finite number of generations, and choices for those generations. \begin_inset Quotes erd \end_inset \layout Standard Even without such a cutoff, the problem remains tractable. Gene frequencies That is, after a very small number of generations, the gene frequency becomes very close to one, and remains so permanently, which closely approximates mass selection, given that essentially all of the herd have become homozygotes with the gene. Accordingly, an infinite horizon may be simulated by considering \begin_inset Quotes eld \end_inset enough \begin_inset Quotes erd \end_inset generations \layout Section Genetic Models \layout Standard Very little consideration will be given to purely genetic models without economic consequence. In fact, there is only one possible genetic question that can be answered, namely \begin_inset Quotes eld \end_inset What is the greatest genetic progress possible in \begin_inset Formula \( N \) \end_inset generations? \begin_inset Quotes erd \end_inset where \begin_inset Formula \( N \) \end_inset is a fixed number. Any other question must involve a state from at least two generations after the initial state, which means that these must be weighted. Barring Divine Revelation, the question of how to weight generations is inherently a question of relative economic value. \layout Section Analytic Methods for Soluble Cases \layout Standard \begin_inset LatexCommand \label{sec:analyticSolutions} \end_inset There are categories of cases for which it is possible to partially solve analytically, such as the case considered in \begin_inset LatexCommand \cite{Dekkers} \end_inset . However, even in this case, analytic methods fail to yield a complete solution, but instead take the problem to a point at which iterative methods can solve for the truncation points. This is as far as such a method can get. \layout Standard However, Dekkers et al solved for a finite case. The actual economic problem is for the infinite horizon, in which the business continues indefinitely. The problem can easily be reformulated to include discounting and an infinite horizon. Consider again the Dekkers model. The genetic value \begin_inset Formula \( G_{t} \) \end_inset is defined as the sum of the effects of the major gene and the polygenic value \begin_inset Formula \begin{equation} \label{siminfobjfn} \bar{G}_{t}=a\left( 2p_{t}-1\right) +\bar{A}_{t} \end{equation} \end_inset where \begin_inset Formula \( p_{t} \) \end_inset is the frequency of the major gene \begin_inset LatexCommand \index{gene frequency} \end_inset , or the portion of loci in the population that actually have this gene, and \begin_inset Formula \( a \) \end_inset is the of \emph on each copy \emph default of the gene. \begin_inset Formula \( \bar{A}_{t} \) \end_inset \emph on \emph default is the mean of the polygenic value, which is assumed to be normally distributed with standard deviation \begin_inset Formula \( \sigma \) \end_inset \emph on . \emph default The available choice variables are \begin_inset Formula \( \left\{ f_{mt}:m\in \{0,1,t\},t\in \{0,1,...T-1\right\} \) \end_inset , the fraction of type \begin_inset Formula \( m \) \end_inset that will be bred in generation \begin_inset Formula \( t \) \end_inset to produce the next generation. These are isomorphic with the truncation points \begin_inset Formula \( x_{mt} \) \end_inset and density \begin_inset Formula \( z_{mt} \) \end_inset of the standard normal distribution. Breeding is by \begin_inset Quotes eld \end_inset truncation \begin_inset Quotes erd \end_inset : all animals better than the truncation point breed, and are randomly assigned to another breeder. \begin_inset Formula \( Q \) \end_inset is the fraction of the entire population that must be bred to produce another population of the same size. The initial values of \begin_inset Formula \( p_{0} \) \end_inset and \begin_inset Formula \( \bar{A}_{0} \) \end_inset are known. \layout Standard Dekkers problem was to maximize the total genetic progress by the final generation \begin_inset Formula \( T \) \end_inset : \begin_inset Formula \begin{equation} \label{finobjfn} Max_{f_{mt}}\left\{ L|\bar{A}_{0},p_{0},Q\right\} \end{equation} \end_inset where \begin_inset Formula \begin{eqnarray} L & = & \sum ^{T-1}_{t=0}\left\{ H_{t}-\lambda _{t}p_{t}-\gamma _{t}\bar{A}_{t}\right\} -\lambda _{T}p_{T}+\lambda _{0}p_{0}-\gamma _{T}\bar{A}_{0}+a\left( 2p_{T}-1\right) +\bar{A}_{T}\\ H_{t} & = & \frac{\lambda _{t+1}}{Q}\left\{ f_{1t}p_{t}^{2}+f_{2t}p_{t}\left( 1-p_{t}\right) \right\} \nonumber \\ & & +\gamma _{t+1}\left\{ \bar{A}_{t}+\frac{\sigma }{Q}\left[ p_{t}z_{1t}+2p_{t}\left( 1-p_{t}\right) z_{2t}+\left( 1-p_{t}\right) ^{2}z_{3t}\right] \right\} \nonumber \\ & & +\epsilon _{t}\left\{ Q-f_{1t}p_{t}^{2}-2f_{2t}p_{t}\left( 1-p_{t}\right) -f_{3t}\left( 1-p_{t}\right) ^{2}\right\} \end{eqnarray} \end_inset \begin_inset LatexCommand \cite[eq 6-9]{dekkers} \end_inset . \layout Standard The extension to an infinite horizon with discounting is straightforward. Using a constant discount value \begin_inset Formula \( r \) \end_inset , note that the total genetic value in generation \begin_inset Formula \( t \) \end_inset is with present value at time \begin_inset Formula \( t=0 \) \end_inset is \begin_inset Formula \begin{equation} \left( 1-r\right) ^{t}\bar{G}_{t}=\left( 1-r\right) ^{t}a\left( 2p_{t}-1\right) +\bar{A}_{t} \end{equation} \end_inset and the objective function to maximize becomes \begin_inset Formula \begin{equation} \label{simpleinfhoriz} G=\sum ^{\infty }_{t=0}\left( 1-r\right) ^{t}\left[ a\left( 2p_{t}-1\right) +\bar{A}_{t}\right] \end{equation} \end_inset subject to \begin_float margin \layout Standard the folowing equations were broken \end_float \begin_inset Formula \begin{eqnarray} Q & = & f_{1t}p_{t}^{2}+f_{2t}2p_{t}\left( 1-p_{t}\right) +f_{3t}\left( 1-p_{t}^{2}\right) \label{siminfqcns} \\ p_{t+1} & = & \frac{1}{Q}\left\{ f_{1t}p^{2}_{t}+f_{2t}p_{t}\left( 1-p_{t}\right) \right\} \label{siminfprule} \\ \bar{A}_{t+1} & = & \bar{A}_{t}+\frac{\sigma }{Q}\left\{ p^{2}_{t}z_{1t}+2p_{t}\left( 1-p_{t}\right) z_{2t}+\left( 1-p_{t}\right) ^{2}z_{3t}\right\} \label{siminfarule} \end{eqnarray} \end_inset where ( \begin_inset LatexCommand \ref{siminfqcns} \end_inset ) is the constraint keeping population size constant, and ( \begin_inset LatexCommand \ref{siminfprule} \end_inset ) and ( \begin_inset LatexCommand \ref{siminfarule} \end_inset ) describe the progression of \begin_inset Formula \( p \) \end_inset and \begin_inset Formula \( \bar{A} \) \end_inset given the choices made and the current state. This formulation has the same constraints as the Dekkers formulation, save only that the Lagrange multipliers are required for an infinite number of time periods. \layout Standard However, it is useful to rewrite this exclusively in terms of \begin_inset Formula \( x_{mt} \) \end_inset . Letting \begin_inset Formula \( \phi \) \end_inset and \begin_inset Formula \( \Phi \) \end_inset represent the pdf and cdf, respectively, of the standard normal distribution, \begin_inset Formula \begin{eqnarray} f_{mt} & = & 1-\Phi \left( x_{mt}\right) \\ z_{mt} & = & \phi \left( x_{mt}\right) \end{eqnarray} \end_inset Equations ( \begin_inset LatexCommand \ref{siminfqcns} \end_inset - \begin_inset LatexCommand \ref{siminfarule} \end_inset ) become \begin_inset Formula \begin{eqnarray} Q & = & \left( 1-\Phi \left( x_{1t}\right) \right) p_{t}^{2}+\left( 1-\Phi \left( x_{2t}\right) \right) 2p_{t}\left( 1-p_{t}\right) +\left( 1-\Phi \left( x_{3t}\right) \right) \left( 1-p_{t}^{2}\right) \label{siminfxqcons} \\ p_{t+1} & = & \frac{1}{Q}\left\{ \left( 1-\Phi \left( x_{1t}\right) \right) p_{t}^{2}+\left( 1-\Phi \left( x_{2t}\right) \right) _{2t}p_{t}\left( 1-p_{t}\right) \right\} \\ \bar{A}_{t+1} & = & \bar{A}_{t}+\frac{\sigma }{Q}\left\{ p_{t}^{2}\phi \left( x_{1t}\right) +2p_{t}\left( 1-p_{t}\right) \phi \left( x_{2t}\right) +\left( 1-p_{t}\right) ^{2}\phi \left( x_{3t}\right) \right\} \\ & \forall & 0\leq t\leq \infty \end{eqnarray} \end_inset The Hamiltonian remains \begin_inset Formula \begin{eqnarray} H_{t} & = & \frac{\lambda _{t+1}}{Q}\left\{ \left( 1-\Phi \left( x_{1t}\right) \right) p_{t}^{2}+\left( 1-\Phi \left( x_{2t}\right) \right) p_{t}\left( 1-p_{t}\right) \right\} \nonumber \\ & & +\gamma _{t+1}\left\{ \bar{A}_{t}+\frac{\sigma }{Q}\left[ p_{t}^{2}\phi \left( x_{1t}\right) +2p_{t}\left( 1-p_{t}\right) \phi \left( x_{2t}\right) +\left( 1-p_{t}\right) ^{2}\phi \left( x_{3t}\right) \right] \right\} \nonumber \\ & & +\epsilon _{t}\left\{ Q-\left( 1-\Phi \left( x_{1t}\right) \right) p_{t}^{2}-2\left( 1-\Phi \left( x_{2t}\right) \right) p_{t}\left( 1-p_{t}\right) -\left( 1-\Phi \left( x_{3t}\right) \right) \left( 1-p_{t}\right) ^{2}\right\} \end{eqnarray} \end_inset and \begin_inset Formula \begin{equation} L=G+\sum _{t=0}^{\infty }\left\{ H_{t}-\lambda _{t}p_{t}-\gamma _{t}\bar{A}_{t}\right\} +\lambda _{0}p_{0}+\gamma _{0}\bar{A}_{0} \end{equation} \end_inset \begin_float margin \layout Standard Up to here, I think I've made no changes. \end_float which differs from Dekkers' in the lack of a final period, and inclusion of the discounted values of all generations in G. \layout Standard The partial derivatives of \begin_inset Formula \( L \) \end_inset are taken with respect to the choice variables \begin_inset Formula \( x_{mt} \) \end_inset , the state variables \begin_inset Formula \( p_{t} \) \end_inset and \begin_inset Formula \( \bar{A}_{t} \) \end_inset , and the Lagrangian multipliers, all of which derivatives must be equal to zero. \layout Standard Taking the first partials of \begin_inset Formula \( L \) \end_inset yields \begin_inset Formula \[ \nabla _{x_{t}}L=\nabla _{x_{t}}H\] \end_inset \begin_inset Formula \begin{eqnarray} \nabla _{x_{t}}L & = & \nabla _{x_{t}}H\nonumber \\ & = & 0\nonumber \\ & = & \frac{\lambda _{t+1}}{Q}\left[ \begin{array}{c} -p_{t}^{2}\phi \left( x_{1t}\right) \\ -p_{t}\left( 1-p_{t}\right) \phi \left( x_{2t}\right) \\ 0 \end{array}\right] \nonumber \\ & & -\frac{\gamma _{t+1}\sigma }{Q}\left[ \begin{array}{c} p_{t}^{2}x_{1t}\phi \left( x_{1t}\right) \\ 2p_{t}\left( 1-p_{t}\right) x_{2t}\phi \left( x_{2t}\right) \\ \left( 1-p_{t}^{2}\right) x_{3t}\phi \left( x_{3t}\right) \end{array}\right] +\epsilon _{t}\left[ \begin{array}{c} p_{t}^{2}\phi \left( x_{1t}\right) \\ 2p_{t}\left( 1-p_{t}\right) \phi \left( x_{2t}\right) \\ \left( 1-p_{t}^{2}\right) \phi \left( x_{3t}\right) \end{array}\right] \end{eqnarray} \end_inset \layout Standard For \begin_inset Formula \( p_{t} \) \end_inset , \begin_inset Formula \begin{eqnarray} \frac{\partial L}{\partial p_{t}} & = & 2\left( 1-r\right) ^{t}a-\lambda _{t}\nonumber \\ & & +\frac{\lambda _{t+1}}{Q}\left\{ 2\left( 1-\Phi \left( x_{1t}\right) \right) p_{t}+\left( 1-\Phi \left( x_{2t}\right) \right) \left( 1-2p_{t}\right) \right\} \nonumber \\ & & +2\frac{\gamma _{t+1}\sigma }{Q}\left\{ p_{t}\phi \left( x_{1t}\right) +\left( 1-2p_{t}\right) \phi \left( x_{2t}\right) -\left( 1-p_{t}\right) \phi \left( x_{3t}\right) \right\} \label{siminfpt} \\ & & +2\epsilon _{t}\left\{ -\left( 1-\Phi \left( x_{1t}\right) \right) p_{t}-\left( 1-\Phi \left( x_{2t}\right) \right) \left( 1-2p_{t}\right) +\left( 1-\Phi \left( x_{3t}\right) \right) \left( 1-p_{t}\right) \right\} \nonumber \end{eqnarray} \end_inset The partial with respect to \begin_inset Formula \( \bar{A}_{t} \) \end_inset yields information about \begin_inset Formula \( \gamma _{t} \) \end_inset , its shadow value \begin_inset Formula \begin{equation} \label{simgameq} \frac{\partial L}{\partial \bar{A}_{t}}=\left( 1-r\right) ^{t}+\gamma _{t+1}-\gamma _{t} \end{equation} \end_inset while \begin_inset Formula \( \lambda \) \end_inset and \begin_inset Formula \( \epsilon \) \end_inset yield only the dynamic behavior of \begin_inset Formula \( p_{t} \) \end_inset and the breeding constraint: \begin_inset Formula \begin{eqnarray} L_{\lambda _{t+1}} & = & \frac{1}{Q}\left\{ f_{1t}p_{t}^{2}+f_{2t}p_{t}\left( 1-p_{t}\right) \right\} -p_{t+1}\label{siminflam} \\ L_{\epsilon _{t}} & = & Q-f_{1t}p_{t}^{2}-f_{2t}2p_{t}\left( 1-p_{t}\right) -f_{3t}\left( 1-p_{t}\right) ^{2}\label{siminfes} \\ L_{\gamma _{t+1}} & = & \bar{A}_{t}+\frac{\sigma }{Q}\left[ p_{t}^{2}\phi \left( x_{1t}\right) +2p_{t}\left( 1-p_{t}\right) \phi \left( x_{2t}\right) +\left( 1-p_{t}\right) ^{2}\phi \left( x_{3t}\right) \right] -\bar{A}_{t+1} \end{eqnarray} \end_inset Unlike the Dekkers formulation, there is no final period, and thus no derivativ es are calculated for that special case. \layout Standard As all of these equations are also equal to 0,[ \begin_inset LatexCommand \ref{simgameq} \end_inset ] means that \begin_inset Formula \begin{equation} \label{siminfgamnext} \gamma _{t+1}=\gamma _{t}-\left( 1-r\right) ^{t} \end{equation} \end_inset This result is drastically different than Dekkers', which found that \begin_inset Formula \( \gamma _{t}=1\forall t \) \end_inset . As \begin_inset Formula \( \gamma _{t} \) \end_inset is the shadow value for \begin_inset Formula \( \bar{A}_{t} \) \end_inset ,and the initial value \begin_inset Formula \( \bar{A}_{0} \) \end_inset is known, it follows that \begin_inset Formula \( \gamma _{t} \) \end_inset is a known value that can be directly calculated. As the growth is additive, \begin_inset Formula \( \bar{A}_{t} \) \end_inset contributes its own value in the current and each subsequent generation. This value can be discounted to time \begin_inset Formula \( 0 \) \end_inset , yielding \begin_inset Formula \begin{equation} \frac{\partial G}{\partial \bar{A}_{t}}=\frac{\left( 1-r\right) ^{t}}{r} \end{equation} \end_inset or that \begin_inset Formula \begin{equation} \label{siminfgamval} \gamma _{t}=\frac{\left( 1-r\right) ^{t}}{r} \end{equation} \end_inset This relation can also be found by starting with the limiting value at infinity of zero, and writing \begin_inset Formula \( \gamma _{0} \) \end_inset as an infinite sum of the later values. \layout Standard From [ \begin_inset LatexCommand \ref{siminff1} \end_inset ] and [ \begin_inset LatexCommand \pageref{siminfgamval} \end_inset ] come the equations \begin_inset Formula \begin{equation} \label{siminfepssln} -\frac{\lambda _{t+1}}{Q}\left[ \begin{array}{c} 1\\ .5\\ 0 \end{array}\right] +\frac{\left( 1-r\right) ^{t}\sigma }{rQ}\left[ \begin{array}{c} x_{1t}\\ x_{2t}\\ x_{3t} \end{array}\right] =\epsilon _{t}\left[ \begin{array}{c} 1\\ 1\\ 1 \end{array}\right] \end{equation} \end_inset which may be solved \begin_inset Formula \begin{eqnarray} x_{3t}-x_{2t} & = & \frac{r}{\left( 1-r\right) ^{t}}\frac{\lambda _{t+1}}{2\sigma }\label{siminfx3x2} \\ x_{2t}-x_{1t} & = & \frac{r}{\left( 1-r\right) ^{t}}\frac{\lambda _{t+1}}{2\sigma }\label{siminfx2x1} \end{eqnarray} \end_inset Which gives the result that \begin_inset Formula \begin{equation} x_{3t}-x_{2t}=x_{2t}-x_{1t} \end{equation} \end_inset or that the truncation points are equidistant, as Dekkers found in the finite model. Considering the limiting case of steep discounting, this yield for the first generation the same result as for Dekkers' final generation, as expected. \layout Standard The relations of [ \begin_inset LatexCommand \ref{siminfepssln} \end_inset ] can be used to remove the \begin_inset Formula \( \epsilon _{t} \) \end_inset from [ \begin_inset LatexCommand \ref{siminfpt} \end_inset ], which becomes \begin_inset Formula \begin{eqnarray} \lambda _{t} & = & 2\left( 1-r\right) ^{t}a\nonumber \\ & & +\frac{\lambda _{t+1}}{Q}\left\{ 2\left( 1-\Phi \left( x_{1t}\right) \right) p_{t}+\left( 1-\Phi \left( x_{2t}\right) \right) \left( 1-2p_{t}\right) \right\} \nonumber \\ & & +2\frac{\left( 1-r\right) ^{t}\sigma }{rQ}\left\{ p_{t}\phi \left( x_{1t}\right) +\left( 1-2p_{t}\right) \phi \left( x_{2t}\right) -\left( 1-p_{t}\right) \phi \left( x_{3t}\right) \right\} \nonumber \\ & & -\frac{\lambda _{t+1}}{Q}\left\{ 2\left( 1-\Phi \left( x_{1t}\right) \right) p_{t}+\left( 1-\Phi \left( x_{2t}\right) \right) \left( 1-2p_{t}\right) \right\} \nonumber \\ & & -2\frac{\left( 1-r\right) ^{t}\sigma }{rQ}\nonumber \\ & & \, \, \, \, \left\{ \left( 1-\Phi \left( x_{1t}\right) \right) x_{1t}p_{t}+\left( 1-\Phi \left( x_{2t}\right) \right) x_{2t}\left( 1-2p_{t}\right) -\left( 1-\Phi \left( x_{3t}\right) \right) x_{3t}\left( 1-p_{t}\right) \right\} \nonumber \\ & = & 2\left( 1-r\right) ^{t}a+2\frac{\left( 1-r\right) ^{t}\sigma }{rQ}\left\{ \left[ \phi \left( x_{1t}\right) -\left( 1-\Phi \left( x_{1t}\right) \right) x_{1t}\right] p_{t}\right. \nonumber \\ & & \, \, \, \, \, \left. +\left[ \phi \left( x_{2t}\right) -\left( 1-\Phi \left( x_{2t}\right) \right) x_{2t}\right] \left( 1-2p_{t}\right) -\left[ \phi \left( x_{3t}\right) -\left( 1-\Phi \left( x_{3t}\right) \right) x_{3t}\right] \left( 1-p_{t}\right) \right\} \label{siminfliso} \end{eqnarray} \end_inset Which is an expression, if not pretty, for \begin_inset Formula \( x_{mt} \) \end_inset and \begin_inset Formula \( \lambda _{t} \) \end_inset . \layout Standard Unfortunately, this expression is not amenable to analytic or numeric solution. For any given time period, equations [ \begin_inset LatexCommand \ref{siminfqcns} \end_inset ], [ \begin_inset LatexCommand \ref{siminfx3x2} \end_inset ], [ \begin_inset LatexCommand \ref{siminfx2x1} \end_inset ], and [ \begin_inset LatexCommand \ref{siminfliso} \end_inset ] produce five equations in only four unknowns for any given time period: the three \begin_inset Formula \( x_{mt} \) \end_inset , and the multipliers \begin_inset Formula \( \lambda _{t} \) \end_inset and \begin_inset Formula \( \lambda _{t+1} \) \end_inset . If \begin_inset Formula \( \lambda _{t} \) \end_inset were known for any finite period, the problem would be soluble for all periods. While the limiting value for large time is \begin_inset Formula \( 0 \) \end_inset , namely the result that there is very little value of something that far away, it is not possible to do an infinite summation as was done to produce ( \begin_inset LatexCommand \ref{siminfgamval} \end_inset ), as the remaining terms for each period are combinations of the CDF and PDF of the standard normal distribution. \layout Standard The only remaining possibility would be to find a value for \begin_inset Formula \( \lambda _{0} \) \end_inset . While the expression \begin_inset Formula \begin{equation} \lambda _{0}=\frac{d}{dp_{0}}G \end{equation} \end_inset this is of little value, as calculating this value requires knowledge of the solution. \layout Standard As such, there appears to be no solution available for the pure infinite horizon problem. \layout Section Infinite Horizon With Testing Costs \layout Standard While the simple problem of the infinite horizon cannot be treated analytically, due to the inability to find an initial Lagrangian multiplier for some finite time period, the introduction of a cost for genetic testing makes the problem tractable. \layout Standard Animals do not come with labels on their foreheads indicating their genetic makeup; if so, the polygenic distribution will be known as well. Instead, there is some finite costs for testing an animal, and testing should not occur if the gains are outweighed by the cost. \layout Standard Conveniently, it is possible to put a ceiling on the value of testing. The greatest possible gain from the major gene is for it to become fully present in the population. That is, for \begin_inset Formula \( p_{t} \) \end_inset to change from its present value to \begin_inset Formula \( 1 \) \end_inset . Consider first the extreme case of taking this entire gain in a single generation. If no selection at all were to occur, the gene would keep its current frequency. Thus the change for the next generation is \begin_inset Formula \( 1-p_{t} \) \end_inset , which has a value of \begin_inset Formula \( 2a\left( 1-p_{t}\right) \) \end_inset in the subsequent generation. Further, compared to no selection, it has that value in all future generations. Thus the present value of the change is \begin_inset Formula \begin{eqnarray} PV & = & \sum _{i=1}^{\infty }\left( 1-r\right) ^{i}2a\left( 1-p_{t}\right) \nonumber \\ & = & 2a\left( 1-p_{t}\right) \frac{1-r}{r} \end{eqnarray} \end_inset Thus, in no case is it worth testing the animals if the cost of testing is greater than this value. \layout Standard However, even if selection by major gene is not profitable, mass selection would still presumably be used, allowing a tighter limit to be drawn. Mass selection will continue to cause the frequency of the major gene to change; animals with this gene will be selected at lower polygenic values, and thus become relatively more common in the population. Without testing, the observed or phenotypic truncation point will be the same for all groups, with the result that the polygenic truncation points are separated by exactly \begin_inset Formula \( a \) \end_inset , or that \begin_inset Formula \begin{equation} x_{1t}+a=x_{2t}=x_{3t}-a \end{equation} \end_inset The frequency in the next generation as a function of the current frequency can then be calculated. First, the overall truncation point is \begin_inset Formula \begin{equation} x_{t}=\Phi ^{-1}\left( 1-Q\right) \end{equation} \end_inset That is, all creatures in the upper fraction \begin_inset Formula \( Q \) \end_inset of the population are kept. For creatures that don't have the gene at all, the phenotypic and polygenic value are the same, and thus \begin_inset Formula \begin{equation} x_{3t}=\Phi ^{-1}\left( 1-Q\right) \end{equation} \end_inset \begin_inset Formula \[ p_{t+1}(p_{t})=\frac{1}{Q}\left\{ \left( 1-\Phi \left( x_{2t}\right) \right) \right\} \] \end_inset \layout Standard break \layout Standard Given that the program of breeding for mass selection is well known, it is possible to write a value function for the current value of the future gains from mass selection, \begin_inset Formula \[ PV_{m}\left( p_{t}\right) =\sum _{i=1}^{\infty }\left( 1-r\right) ^{i}\left[ 2a\left( p_{t+i}^{m}-p_{t}\right) +\left( \bar{A}^{m}_{t+i}-\bar{A}_{t}\right) \right] \] \end_inset \layout Standard The amount of work to be invested in finding a better bound will depend upon how much computation the bound saves; a loose bound does no harm, but merely requires additional computation. \layout Subsection* Purpose of the Bound \layout Standard The bound has only one real purpose: if the maximu \begin_float margin \layout Standard as of 2/20/99, bound has not been used \end_float m gain from all future testing exceeds this generation's cost of testing, there is no reason to test, and mass selection will be used forevermore. \layout Chapter Genetic Algorithms \layout Standard Add brief discussion \layout Chapter Dynamic Programming \layout Standard Dynamic programming can only be applied to discrete problems. However, in many cases, it is possible to discretize a continuous problem, and find a reasonable approximation. \layout Section History of Dynamic Programming \layout Standard Dynamic programming, in and of itself, is nothing new. It has been used as a computational method since the 1950's, but its ability to solve problems is highly dependent upon available processing power and fast storage (cache or core memory). \layout Standard Dynamic programming is used in programs that have a fixed number of available states, and in which the \begin_inset Quotes eld \end_inset history \begin_inset Quotes erd \end_inset of the problem, or the path by which the state was reached, is irrelevant. That is, the states all must have the Markov property \begin_inset LatexCommand \index{Markov property} \end_inset . \latex latex \backslash jim{ \latex default A state is "Markov" if the state contains all relevant information, regardless of how it was reached. If the state has the Markov property, it does not matter in which generation the state was reache, nor does the path by which it was reached matter. \latex latex } \begin_float margin \layout Standard more on Markov? \end_float \layout Standard Dynamic programming can thus be used to solve policy questions, such as the order of applying treatment to fields, in which both \emph on which \emph default steps to take as well as which \emph on order \emph default to take them in are considered. The value of each state is known, and therefore once both steps of the tentative solution \begin_inset Quotes eld \end_inset apply forty tons of phosphor, then plant corn \begin_inset Quotes erd \end_inset is calculated, considering the solution, \begin_inset Quotes eld \end_inset apply twenty tons of herbicide, then forty tons of phosphor, then plant corn \begin_inset Quotes erd \end_inset requires only calculating the effect of the first step, and then using the already calculated value for corn. \layout Standard While the problems soluble by dynamic programming are inherently discrete, there is a long history of its use in solving continuous problems. This is done by the expedient of discretizing the problem, allowing only certain states, a procedure that dates to **. \begin_float margin \layout Standard get this \end_float In some cases, this approximate solution is sufficient. For example, if the problem involves a \begin_inset Formula \( [0,1] \) \end_inset choice variable, and knowledge of the solution to within \begin_inset Formula \( .01 \) \end_inset is sufficient, then only \begin_inset Formula \( 100 \) \end_inset states need be considered. If such an answer is not precise enough, the first solution can be used as a center for a search over a smaller area with a finer grid. \layout Standard However, published numeric solutions almost universally consider compact search spaces. The few examples of non-compact spaces \begin_float margin \layout Standard ref \end_float have all used contiguous spaces, which are not appropriate for the breeding problem. \layout Section The Breeding Problem and Dynamic Programming \layout Standard The naive approach to the breeding problem would be to look at the solution spaces as \begin_inset Formula \( \left\{ f_{mt}\right\} \) \end_inset . However, this is far too complicated a space to use as a state space for dynamic programming: at even a resolution of \begin_inset Formula \( .01 \) \end_inset , there are \begin_inset Formula \( 100 \) \end_inset possible values for each choice, or \begin_inset Formula \( 10^{4} \) \end_inset per generation with only one choice gene. Then for \begin_inset Formula \( T \) \end_inset generations, there are \begin_inset Formula \( 10^{4T} \) \end_inset possible states to consider. \layout Standard However, for a finite horizon problem, or an infinite horizon problem with a known \begin_inset Formula \( \hat{p} \) \end_inset bound such as [ \begin_float margin \layout Standard ref \end_float ], \begin_inset Formula \( p_{t} \) \end_inset alone may be used as the state. \begin_inset Formula \( p \) \end_inset may be divided into as many states as desired, and and the optimal choice for each value of \begin_inset Formula \( p \) \end_inset can be calculated. \layout Standard Consider again the nature of the general problem: once a change has been made to \begin_inset Formula \( \bar{A}_{t} \) \end_inset , the change is permanent. The only choice to be made depends upon the present value of \begin_inset Formula \( p_{t} \) \end_inset . \begin_float margin \layout Standard Will non-constant variance destroy this? \end_float The optimal behavior for \begin_inset Formula \( p_{t}>\hat{p} \) \end_inset is known, namely to switch to mass selection. This is entered into the action space as a beginning. Letting \begin_inset Formula \begin{equation} \Delta =\frac{1}{S} \end{equation} \end_inset where \begin_inset Formula \( \Delta \) \end_inset is the spacing between potential values of \begin_inset Formula \( p_{t} \) \end_inset for a number of states \begin_inset Formula \( S \) \end_inset , the first action considered is for \begin_inset Formula \( p_{t}=\hat{p}-\Delta \) \end_inset . There are only two choice variables to consider, \begin_inset Formula \( f_{1t} \) \end_inset and \begin_inset Formula \( f_{2t} \) \end_inset . As \begin_inset Formula \( p_{t+1}\geq p_{t} \) \end_inset , for each value of of the fractions considered, the future path is known. That is, for each trial value of \begin_inset Formula \( f_{t} \) \end_inset considered, \begin_inset Formula \( p_{t+1}\left( f_{t}\right) \) \end_inset is calculated, and the future value of this choice is selected from the table of known results. If this value is less than testing costs, the decision not to test is stored for this variable, a is the present value of mass selection at this frequency. Additionally, a tighter bound has been found, and \begin_inset Formula \( \hat{p} \) \end_inset takes on this new value. If testing was worthwhile, this fact is stored, as testing will be worthwhile at all smaller frequencies, \begin_float margin \layout Standard prove this? \end_float as are the optimal fractions and the net present value of this frequency. \layout Standard This step is repeated until a frequency is found which finds value in testing, at which point the comparison to testing cost is skipped for all smaller values, and \begin_inset Formula \( \hat{p} \) \end_inset is reset to \begin_inset Formula \( \Delta \) \end_inset greater than this frequency. Actions for \begin_inset Formula \( p_{t} \) \end_inset are then calculated for all smaller values. \layout Standard Eventually, values of \begin_inset Formula \( p_{t} \) \end_inset will be considered which find \begin_inset Formula \( p_{t+1}<\hat{p} \) \end_inset for the optimal values of \begin_inset Formula \( f_{t} \) \end_inset . This is easily handled; for each trial value of \begin_inset Formula \( f_{t} \) \end_inset , the value of the resultant \begin_inset Formula \( p_{t+1} \) \end_inset has already been saved, and need not be recalculated. \layout Standard For a single state variable \begin_inset Formula \( p_{t} \) \end_inset , this appears to be an efficient search: rather than checking all possible values of \begin_inset Formula \( p_{t} \) \end_inset , only a portion are checked. While this is convenient for the single state variable, it will be critical as the number of state variables increases. \layout Section An Initial Algorithm \layout Standard Consider first the simple problem of one gene with fixed variance. For any given \begin_inset Formula \( p_{t} \) \end_inset and \begin_inset Formula \( \bar{A}_{t} \) \end_inset , there is an associated maximal present value \begin_inset Formula \( G\left( p_{t}\right) \) \end_inset . First note that the starting value \begin_inset Formula \( \bar{A}_{t} \) \end_inset affects the present value by only a known offset \begin_inset Formula \begin{equation} G\left( p_{t},\bar{A}_{t}\right) =G\left( p_{t},0\right) +\frac{\bar{A}_{t}}{r} \end{equation} \end_inset Then break \begin_inset Formula \( G \) \end_inset into three pieces: the initial fixed effects, which choices in \begin_inset Formula \( t \) \end_inset cannot affect, the initial effects in the subsequent generation, and all time after that: \begin_inset Formula \begin{eqnarray*} G\left( p_{t}\right) & = & \bar{A}_{t}+a\left( 2p_{t}-1\right) \\ & & +\left( 1-r\right) \left[ \frac{\sigma }{Qr}\left\{ 2p_{t+1}^{2}z_{1t+1}+2p_{t+1}\left( 1-p_{t+1}\right) z_{2t+2}+\left( 1-p_{t+1}\right) ^{2}z_{3t+1}\right\} +\bar{A}_{t}\right] \\ & & +\left( 1-r\right) a\frac{1}{Q}\left\{ 2f_{1t}p_{t}^{2}+f_{2t}p_{t}\left( 1-2p_{t}\right) \right\} \\ & & +\sum _{s=t+2}^{\infty }\left( 1-r\right) ^{s}\left[ a\left( 2p_{t}-1\right) +\bar{A}_{s}\right] \end{eqnarray*} \end_inset Note that part of the value of \begin_inset Formula \( \bar{A}_{s} \) \end_inset in the summation can be regrouped into the first term, as can the appearance of \begin_inset Formula \( \bar{A}_{t} \) \end_inset in the period \begin_inset Formula \( t+1 \) \end_inset . However, another approach will be more fruitful, namely redefining the objective function. Consider \begin_inset Formula \begin{eqnarray} F\left( p_{t}\right) & \equiv & +\frac{\sigma }{Qr}\left\{ 2p_{t}^{2}z_{1t}+2p_{t}\left( 1-p_{t}\right) z_{2t}+\left( 1-p_{t}\right) ^{2}z_{3t}\right\} \nonumber \\ & & +\frac{a}{Q}\left\{ 2f_{1t}p_{t}^{2}+f_{2t}p_{t}\left( 1-2p_{t}\right) \right\} \nonumber \\ & & +\left( 1-r\right) F\left( p_{t+1}\right) \end{eqnarray} \end_inset As \begin_inset Formula \begin{equation} G\left( p_{t}\right) =\frac{\bar{A}_{t}}{r}+a\left( 2p_{t}-1\right) +\left( 1-r\right) F\left( p_{t+1}\left( p_{t}\right) \right) \end{equation} \end_inset maximizing \begin_inset Formula \( F \) \end_inset homomorphous with maximizing \begin_inset Formula \( G \) \end_inset . \layout Standard The problem is now in a state to which dynamic programming can be applied. Using \begin_inset Formula \( S \) \end_inset possible states for \begin_inset Formula \( p_{t+1} \) \end_inset , let \begin_inset Formula \( p_{t+1}\left( p_{t},f_{t}\right) \) \end_inset be the closest allowed state to the actual computed value. Finding optimal values for \begin_inset Formula \( f_{t}\left( p_{t}\right) \) \end_inset becomes almost trivial. \layout Standard The existence of \begin_inset Formula \( F(p_{t+1>t}) \) \end_inset is taken as a given; it will be called recursively if necessary. This reduces the problem to finding the best set of \begin_inset Formula \( f_{t} \) \end_inset for the current generation. An initial trial solution is considered, from whatever source, yielding a tentative \begin_inset Formula \( p^{t}_{t+1} \) \end_inset . \begin_inset Formula \( F\left( p_{t+1}^{t}\right) \) \end_inset is treated as known, and the remainder of \begin_inset Formula \( F\left( p_{t}\right) \) \end_inset is optimized over \begin_inset Formula \( \left\{ f_{t}:p_{t+1}\left( p_{t},f_{t}\right) =p_{t+1}^{t}\right\} \) \end_inset , for which standard optimization methods suffice. This is compared to the maximum over \begin_inset Formula \( \left\{ f_{t}:p_{t+1}\left( p_{t},f_{t}\right) =p_{t+1}^{t}+\delta \right\} \) \end_inset , and the better value is stored. \layout Standard The optimal value and behavior with granularity \begin_inset Formula \( \delta \) \end_inset can now be found by simply calling this optimal value function for \begin_inset Formula \( F\left( p_{0}\right) \) \end_inset , which calls itself recursively to find any other needed values for other values of \begin_inset Formula \( p_{t} \) \end_inset . Many possible states will likely be passed over, saving computational time. These solutions can then be used as starting values for another run with a finer grain, until the desired level of resolution is found. \layout Standard As a side effect of this method, the stiff matrix problem found with the genetic algorithm in section [] is avoided entirely: there is never an attempt to directly determine the present value of the effect of a change in a current variable on a state several generations away; all such calculation s find the optimal value by looking ahead only one generation. \layout Section Application to Multiple State Variables \layout Standard The method does not translate directly to multiple variables, although adaptatio ns are possible. Similar to the problem at [], the search space explodes as additional states are considered. For example, with a grain of \begin_inset Formula \( .01 \) \end_inset , only \begin_inset Formula \( 101 \) \end_inset states exist for the simple model of section []. However, adding only a second gene, still keeping the variance fixed, increases this toe \begin_inset Formula \( 101^{2} \) \end_inset . Adding variance for each gene, and covariance, reaches \begin_inset Formula \( 101^{5} \) \end_inset , or in excess of ten billion states to consider. Storing a single double precision floating point variable for each of these would take on the order of eighty gigabytes of temporary storage. \layout Standard The above method, however, allowed the possibility of all possible combinations of allowed values. However, if there is a \begin_inset Quotes eld \end_inset reasonably good \begin_inset Quotes erd \end_inset starting point, this is not necessary. Instead, from this starting point, only a small range of values need be considered. \layout Standard For example, suppose that consideration of of the mass selection solution allows placing a bound of fifteen generations, and provides initial values for \begin_inset Formula \( \left\{ p_{t}^{1},p_{t}^{2},\sigma _{t}^{1},\sigma _{t}^{2},\sigma _{t}^{12}:0\leq t\leq 15\right\} \) \end_inset . Suppose further that the researcher is experienced with the problem, perhaps from prior runs, and is thus able to set likely bounds for the values in each generation. While it is not possible to store or even allocate space for all possible values in the domain, it may be possible to create \begin_inset Quotes eld \end_inset disjoint bubbles \begin_inset Quotes erd \end_inset within the space for examination. \layout Standard For graphical simplicity, consider a simple case of two probabilities and their correlation. Starting with low values for each of the three, it seems reasonable to expect that each of the probabilities and the correlation will approach \begin_inset Formula \( 1 \) \end_inset in the face of selection. With an initial trial solution as in figure ( \begin_inset LatexCommand \ref{bubblelines} \end_inset ), \begin_float fig \layout Standard \align center \begin_inset Figure size 133 184 file graphics/bubblelines.eps width 4 45 angle 270 flags 9 \end_inset \layout Caption \begin_inset LatexCommand \label{bubblelines} \end_inset Initial Trial Solution \end_float and a grain of \begin_inset Formula \( .01 \) \end_inset for each of the three variables, there are \begin_inset Formula \( 10^{6} \) \end_inset possible states, a manageable number. However, the researcher is likely to desire a greater granularity, \begin_inset Formula \( .0001 \) \end_inset , or even \begin_inset Formula \( .000001 \) \end_inset , requiring \begin_inset Formula \( 10^{12} \) \end_inset or \begin_inset Formula \( 10^{18} \) \end_inset states, respectively. \layout Standard \pextra_type 3 However, the changes in each variable from period to period are typically far larger than the grain; there are large ranges in each variable which are known ahead of time to be unlikely to be reached. Instead, some region, a cube in \begin_inset Formula \( \Re ^{3} \) \end_inset around each point of the trial solution is most likely to be reached figure ( \begin_inset LatexCommand \ref{bubbles1} \end_inset ). \layout Standard \begin_float fig \layout Standard \align center \begin_inset Figure size 133 184 file graphics/bubbles1.eps width 4 45 angle 270 flags 9 \end_inset \layout Caption \begin_inset LatexCommand \label{bubbles1} \end_inset Partition of space near trial solution into disjoint bubblettes. \end_float While there is no \emph on a priori \emph default reason to assume a cube (or hypercube in higher dimensions), this cube can be chosen so as to include any size and shape of region. \layout Standard \pextra_type 3 \pextra_widthp 60 If reasonable \emph on a priori \emph default bounds can be placed on the size of these cubes for each step, and this region is divided into ten possible values for each of the three variables, each such bubble contains only \begin_inset Formula \( 10^{3} \) \end_inset states, which is manageable for even large number of generations. Figure ( \begin_inset LatexCommand \ref{bubbles2} \end_inset )shows five possible states for each state variable. If the trial solution is \begin_inset Quotes eld \end_inset close enough, \begin_inset Quotes erd \end_inset or if the bubbles are large enough, the solution at each point will be within one of the divisions of the cube, and the dynamic programming approach would then seem to be successive steps with finer grain and smaller bubbles in each period. For example, if the solution shown in figure( \begin_inset LatexCommand \ref{bubbles3} \end_inset )is found to be the best, smaller bubbles and bubblettes may be found around this new tentative solution, as in figure ( \begin_inset LatexCommand \ref{bubbles4} \end_inset )This process would be repeated until with sufficiently fine grain the solution does not move. \begin_float fig \layout Standard \align center \begin_inset Figure size 133 184 file graphics/bubbles2.eps width 4 45 angle 270 flags 9 \end_inset \layout Caption \begin_inset LatexCommand \label{bubbles2} \end_inset Division of bubbles into bubblettes \end_float \begin_float fig \layout Standard \align center \begin_inset Figure size 133 184 file graphics/bubbles3.eps width 4 45 angle 270 flags 9 \end_inset \layout Caption \begin_inset LatexCommand \label{bubbles3} \end_inset Possible new trajectory \end_float \begin_float fig \layout Standard \align center \begin_inset Figure size 133 184 file graphics/bubbles4.eps width 4 45 angle 270 flags 9 \end_inset \layout Caption \begin_inset LatexCommand \label{bubbles4} \end_inset New disjoint bubbles and bubblettes \end_float \layout Subsection Missing the Bubbles \layout Standard The foregoing example has a critical implicit assumption: that the number of steps to the solution is known. This is important as the bubbles are identified with with a specific step. It is entirely possible that these bubbles overlap; this is not a problem. The problem, rather, is that that a tentative solution steps outside of the \begin_inset Quotes eld \end_inset next \begin_inset Quotes erd \end_inset bubble, as in figure ( \begin_inset LatexCommand \ref{bubblemiss} \end_inset ). \begin_float fig \layout Standard \align center \begin_inset Figure size 133 191 file graphics/bubblemiss.eps width 4 45 angle 270 flags 9 \end_inset \layout Caption \begin_inset LatexCommand \label{bubblemiss} \end_inset Missing the bubbles entirely \end_float To this point, it has been assumed that the each step will land in a known neighborhood, that of the bubble previously associated with that step. However, the dashed line lands in the \begin_inset Quotes eld \end_inset wrong \begin_inset Quotes erd \end_inset bubble, while the dotted line fails to land within \emph on any \emph default bubble. \layout Standard This is not a trivial problem. The essence of dynamic programming is to reuse previously calculated states. However, the bubbles have been used because the number of potential states is too great to store, let alone calculate. If a step is made outside a bubble, it would seem likely that the next step is also outside the bubble. Each of these will require calculations of multiple possible states for comparison. Some manner must be found to index or search through the bubbles, so that once a new bubble has been created, later steps can find it and take advantage of its calculations. \layout Standard Another source of misstep can come with a change in the number of generations in the solution. For example, if the initial solution takes fifteen generations before reaching a state in which testing is not profitable, it may be that a path is found which reaches this level in fourteen generations. This may mean that the thirteenth step proceeds to the bubble previously associated with the fifteenth, as with the dashed line in figure ( \begin_inset LatexCommand \ref{bubblemiss} \end_inset ). \layout Section A Structure for the Bubblettes \layout Standard Before turning to the Bubbles, it is necessary to consider the nature of the bubblettes. \layout Standard Most fundamentally, each bubblettes has three possible states: fully calculated, fully uncalculated, and uncalculated with hints of some nature. \layout Standard The fully calculated bubblette must, at a minimum, contain the following information: the fact that it is calculated, all state variables for the bubblette, and the \begin_inset Quotes eld \end_inset next \begin_inset Quotes erd \end_inset bubblette--the bubblette reached in the next generation. However, more information is desirable, and eases calculation. Particularly, it is desirable to store the choice variables, such as the \begin_inset Formula \( f_{t} \) \end_inset , which lead to the next bubblette. Additionally, the value of the objective function for the bubblette is desirable--it allows a later bubblette which considers this state to do so by making a single stop, rather than considering the successor states. Finally, some \begin_inset Quotes eld \end_inset hint \begin_inset Quotes erd \end_inset as to the location of the next bubblette may be desirable; it is not clear that knowledge of the next bubblette necessarily means where it is located within the machine. \layout Standard If a bubble is not yet calculated, it may yet have a \begin_inset Quotes eld \end_inset hint \begin_inset Quotes erd \end_inset left from a prior run or some other information. This hint, perhaps the result for this or an adjacent location in a prior run, can be used as a starting point when the bubblette searches for its values. However, storing the hint does take storage space. As such, it may be desirable to add a pointer variable to bubblettes for hints, allowing them to point to another bubblette for a hint, rather than storing information itself. \layout Section A structure for the bubbles \layout Standard As the bubble is divided into bubblettes, it clearly must, at a minimum, contain storage space for it's constituent bubblettes. While it is conventional to think of \begin_inset Quotes eld \end_inset balls \begin_inset Quotes erd \end_inset in n-space for such a region, the hypercube is more natural to the computer: a \begin_inset Formula \( 5\times 5\times \ldots \times 5 \) \end_inset region is well defined in memory, but the set of all regions within distance \begin_inset Formula \( 5 \) \end_inset is not. The result is probably wasted states, but it may be possible to mitigate this by storing not bubblettes, but pointers to bubblettes, and only allocating space for bubblettes as needed. For eleven possible values for each of five state variables, this means \begin_inset Formula \( 161051 \) \end_inset pointers, which will take \begin_inset Formula \( 629 \) \end_inset kilobytes of storage on a thirty-two bit machine. While this is a massive amount of storage, and will be largely unused, this is still less than ten megabytes for fifteen bubbles, leaving the balance for calculated states. \layout Standard It will generally be assumed that there are an odd number of levels for each variable: the center of the bubble comes from some prior knowledge or solution, and a symmetric number of levels on each side results in an odd number. \layout Standard Merely containing the bubblettes, however, is not enough for the bubble structure. It is necessary to search the bubbles, so that they are not needlessly duplicated. Therefore, each bubble should store the domain for each state variable. To facilitate a rapid search, it may be desirable to have these actually be pointers elsewhere; arrays of minima and maxima for each state variable can be kept elsewhere, with a pointer stored in the bubble structure. A stranded step could then search for an existing bubble rapidly with a construct such as \layout LyX-Code where ( (mystate1.ge.state1min) .and. (mystate1 .le.state1max) .and. (mystate2.ge.state2min) ...) \layout LyX-Code \protected_separator \protected_separator candidateBubbles=.true. \layout LyX-Code elsewhere \layout LyX-Code \protected_separator \protected_separator candidateBubbles=.false. \layout LyX-Code endwhere \layout Standard thereby gaining a list of candidate bubble instantly. By comparison, if it had to reference the variables within structures to do comparisons, the search would be slowed by orders of magnitude. If a bubble is found, the bubblette is either calculated or referenced. If not, a new bubble is created, with the state in question at its center. \layout Standard Another useful feature in a bubble would be reference hints for uncalculated bubblettes. After a run of the algorithm, bubblettes will exist for some or all of the bubblettes, many of which will be contained within the corresponding bubble on the next run. These values can be saved as hints for the corresponding bubblettes within the smaller bubbles of the next pass. \layout Section An initial strategy \layout Standard At this stage, it is possible to make an initial attempt to use dynamic programming to solve a problem. From the performance of the attempt, information will be gained for later refinement. \layout Subsection Replacing choice variables with state variables \layout Standard \begin_inset LatexCommand \label{sec:replacechoicevars} \end_inset One final obstacle must be addressed before attempting a dynamic programming solution: the discretization of the choice space. \layout Standard While the state space, and permitted tentative state spaces, have already been defined in terms of bubbles in [], these states are reached as result of tentative selection of choice variables. Furthermore, the literature on discretized problems has generally assumed the discretization of the choice variables. However, discretizing the \begin_inset Formula \( f_{t} \) \end_inset variables in this case will not result in a cleanly indexable space for the state variables, and the success of any dynamic programming solution relies on the ability to easily look up prior solved situations. Finally, there is no \emph on a priori \emph default reason to believe that such a discretization of the choice variables would result in a state space in which values would be repeated. \begin_float margin \layout Standard Maybe define a \begin_inset Quotes eld \end_inset close enough \begin_inset Quotes erd \end_inset in state space as an alternative approach? But the search costs will be staggering. \end_float \layout Standard Another approach would lie \begin_float margin \layout Standard it is \begin_inset Quotes eld \end_inset lie, \begin_inset Quotes erd \end_inset isn't it? \end_float in keeping the discretization of state space, while fixing \begin_inset Quotes eld \end_inset canonical \begin_inset Quotes erd \end_inset values for allowed choices. As search algorithms generally use some type of Newton method to select trial solutions, it is necessary to be able to define the distance between points. Consider, for example, the function \begin_inset Formula \begin{equation} y=-\left( x-3\right) ^{4} \end{equation} \end_inset ignoring for the moment that a trivial inverse exists. This function takes a maximum value of \begin_inset Formula \( 0 \) \end_inset at \begin_inset Formula \( x=3 \) \end_inset . Suppose that the search range has been reduced to \begin_inset Formula \( \left[ -0.750,.250\right] \) \end_inset , with a grain of \begin_inset Formula \( .001 \) \end_inset for \begin_inset Formula \( x \) \end_inset , centered around the initial guess of \layout Standard \begin_inset Formula \begin{eqnarray} \hat{x} & = & 3.3\nonumber \\ \hat{y} & = & -.008 \end{eqnarray} \end_inset Calculating numerical derivatives becomes problematic. The actual derivative is \begin_inset Formula \begin{equation} \frac{dy}{dx}=-4\left( x-3\right) ^{3} \end{equation} \end_inset so that to change \begin_inset Formula \( y \) \end_inset by \begin_inset Formula \( .001 \) \end_inset requires a change of approximately \begin_inset Formula \( .00926 \) \end_inset . However, changes of \begin_inset Formula \( .007 \) \end_inset and \begin_inset Formula \( .014 \) \end_inset will both result in a calculated \begin_inset Formula \( y \) \end_inset of \begin_inset Formula \( .007 \) \end_inset , though the numerical derivative of the former is twice the size of the latter. Numerical calculation of the Hessian would result in even worse exaggerations. \layout Standard To solve this problem would require limiting the allowed values of \begin_inset Formula \( x \) \end_inset , so that for all \begin_inset Formula \( \left\{ x:\hat{f}\left( x\right) =\hat{y}\right\} \) \end_inset , \begin_inset Formula \( f\left( \hat{x}(x)\right) = \) \end_inset \begin_inset Formula \( \hat{y} \) \end_inset , and to work only with \begin_inset Formula \( \hat{x} \) \end_inset and not \begin_inset Formula \( x \) \end_inset itself. \layout Standard However, as dimensions increase, or as functions which are not invertible inexpensively are used, this will become an impossible problem. \layout Standard These difficulties suggest that transforming the problem into one of choosing state spaces transitions from those permitted, and afterwards transforming the solution back into the choices that reach them, may be the easiest way to solve the problem. Given rules on state space transitions, such as Equations [], the back-conversi on should run in linear time in the number of generations used. \begin_float margin \layout Standard more on this? \end_float \layout Subsection Simple Dynamic Programming Model \layout Standard For an initial model to demonstrate the concept, consider again the simple model of [ \begin_inset LatexCommand \ref{finobjfn} \end_inset ]. As this model has a known solution, it will be the first to be solved. However, in designing structures and methods, more attention will be paid to the usefulness of the model in solving the general case than the problem at hand. \layout Standard Using equations [ \begin_inset LatexCommand \ref{siminfprule} \end_inset \begin_inset LatexCommand \ref{siminfarule} \end_inset \begin_inset LatexCommand \ref{siminfqcns} \end_inset ], \begin_inset Formula \begin{eqnarray} p_{t+1} & = & \frac{1}{Q}\left\{ f_{1t}p_{t}^{2}+f_{2t}p_{t}\left( 1-p_{t}\right) \right\} \label{dp2prule} \\ \bar{A}_{t+1} & = & \bar{A}_{t}+\frac{\sigma }{Q}\left\{ p_{t}^{2}z_{1t}+2p_{t}\left( 1-p_{t}\right) z_{2t}+\left( 1-p_{t}^{2}\right) z_{3t}\right\} \label{dp2abarrule} \\ Q & = & f_{1t}p_{t}^{2}+f_{2t}2p_{t}\left( 1-p_{t}\right) +f_{3t}\left( 1-p_{t}^{2}\right) \label{dp2qcons} \end{eqnarray} \end_inset [ \begin_inset LatexCommand \ref{siminfprule} \end_inset ] can be manipulated into \begin_inset Formula \begin{equation} \label{dpsimf2off1} f_{2t}\left( f_{1t}|p_{t+1}\right) =\frac{Qp_{t+1}-f_{1t}p_{t}^{2}}{p_{t}\left( 1-p_{t}\right) } \end{equation} \end_inset [ \begin_inset LatexCommand \ref{siminfqcns} \end_inset ] becomes \begin_inset Formula \begin{eqnarray} f_{3t}\left( f_{1t}|p_{t+1}\right) & = & \frac{Q-f_{1t}p_{t}^{2}-f_{2t}\left( f_{1t}|p_{t+1}\right) 2p_{t}\left( 1-p_{t}\right) }{\left( 1-p_{t}\right) ^{2}}\nonumber \\ & = & \frac{Q-f_{1t}p_{t}^{2}-\frac{Qp_{t+1}-f_{1t}p_{t}^{2}}{p_{t}\left( 1-p_{t}\right) }2p_{t}\left( 1-p_{t}\right) }{\left( 1-p_{t}\right) ^{2}}\nonumber \label{dpsimf3off1} \\ & = & \frac{Q+f_{1t}p_{t}^{2}-2Qp_{t+1}}{\left( 1-p_{t}\right) ^{2}}\label{dpsimf3off1} \end{eqnarray} \end_inset and [ \begin_inset LatexCommand \ref{siminfarule} \end_inset ] becomes \begin_inset Formula \begin{equation} \label{dpsiminfanextoff1} \bar{A}_{t+1}=\bar{A}_{t}+\frac{\sigma }{Q}\left\{ \begin{array}{c} p_{t}^{2}\phi \left( \Phi ^{-1}\left( 1-f_{1t}\right) \right) \\ +2p_{t}\left( 1-p_{t}\right) \phi \left( \Phi ^{-1}\left( +1-\frac{Qp_{t+1}-f_{1t}p_{t}^{2}}{p_{t}\left( 1-p_{t}\right) }\right) \right) \\ +\left( 1-p_{t}^{2}\right) \phi \left( \Phi ^{-1}\left( +1-\frac{Q+f_{1t}p_{t}^{2}-2Qp_{t+1}}{\left( 1-p_{t}\right) ^{2}}\right) \right) \end{array}\right\} \end{equation} \end_inset \layout Standard This allows the translation of the problem into a one-dimensional problem for purposes of dynamic programming. The sole variable is \begin_inset Formula \( p_{t+1} \) \end_inset , and [ \begin_inset LatexCommand \ref{dpsiminfanextoff1} \end_inset ] is used to find the maximal \begin_inset Formula \( \bar{A}_{t+1} \) \end_inset for the chosen \begin_inset Formula \( p_{t+1} \) \end_inset . To make the calculation useful for dynamic programming, it is actually the change, the maximal value of \begin_inset Formula \( \bar{A}_{t+1}-\bar{A}_{t} \) \end_inset that is calculated, so that the information is useful regardless of the prior state. Also, given that the available libraries minimize rather than maximize, and that \begin_inset Formula \( \bar{A}_{t+1} \) \end_inset is linear in \begin_inset Formula \( Q \) \end_inset and \begin_inset Formula \( p \) \end_inset , the new function \begin_inset Formula \( y \) \end_inset is defined as \layout Standard \begin_inset Formula \begin{eqnarray} y & = & -\frac{Q}{p}\left( \bar{A}_{t+1}-\bar{A}_{t}\right) \nonumber \\ & = & -p_{t}^{2}z_{1t}-2p_{t}\left( 1-p_{t}\right) z_{2t}-\left( 1-p_{t}\right) ^{2}z_{3t}\nonumber \\ & = & -p_{t}^{2}z_{1t}-2p_{t}\left( 1-p_{t}\right) z\left( \frac{Qp_{t+1}-f_{1t}p_{t}^{2}}{p_{t}\left( 1-p_{t}\right) }\right) \nonumber \\ & & -\left( 1-p_{t}\right) ^{2}z\left( \frac{Q+f_{1t}p_{t}^{2}-2Qp_{t+1}}{\left( 1-p_{t}\right) ^{2}}\right) \end{eqnarray} \end_inset with the derivative \begin_inset Formula \begin{eqnarray} \frac{dy}{df_{1t}} & = & -p_{t}^{2}x\left( f_{1t}\right) -2p_{t}\left( 1-p_{t}\right) x\left( f_{2t}\left( f_{1t}\right) \right) \frac{df_{2t}}{df_{1t}}\nonumber \\ & & -\left( 1-p_{t}\right) ^{2}x\left( f_{3t}\left( f_{1t}\right) \right) \frac{df_{3t}}{df_{1t}}\nonumber \\ & = & -p_{t}^{2}\Phi ^{-1}\left( 1-f_{1t}\right) -2p_{t}\left( 1-p_{t}\right) \Phi ^{-1}\left( 1-f_{2t}\left( f_{1t}\right) \right) \frac{-p_{t}^{2}}{p_{t}\left( 1-p_{t}\right) }\nonumber \\ & & -\left( 1-p_{t}\right) ^{2}\Phi ^{-1}\left( 1-f_{3t}\left( f_{1t}\right) \right) \frac{p_{t}^{2}}{\left( 1-p_{t}\right) ^{2}}\nonumber \\ & = & -p_{t}^{2}\Phi ^{-1}\left( 1-f_{1t}\right) +2p_{t}^{2}\Phi ^{-1}\left( 1-f_{2t}\right) \nonumber \\ & & -p_{t}^{2}\Phi ^{-1}\left( 1-f_{3t}\right) \end{eqnarray} \end_inset \layout Standard or? \begin_inset Formula \begin{eqnarray} \frac{dy}{df_{1t}} & = & -p_{t}^{2}x_{mt}\left( f_{1t}\right) -2p_{t}\left( 1-p_{t}\right) x\left( \frac{Qp_{t+1}-f_{1t}p_{t}^{2}}{p_{t}\left( 1-p_{t}\right) }\right) \frac{df_{2t}}{df_{1t}}\nonumber \\ & & -\left( 1-p_{t}\right) ^{2}x\left( \frac{Q-f_{1t}p_{1t}^{2}-\frac{Qp_{t+1}-f_{1t}p_{t}^{2}}{p_{t}\left( 1-p_{t}\right) }}{\left( 1-p_{t}\right) ^{2}}\right) \frac{df_{3t}}{df_{1t}}\nonumber \\ & = & -p_{t}^{2}\Phi ^{-1}\left( f_{1t}\right) -2p_{t}^{2}\Phi ^{-1}\left( \frac{Qp_{t+1}-f_{1t}p_{t}^{2}}{p_{t}\left( 1-p_{t}\right) }\right) \frac{-p_{t}^{2}}{p\left( 1-p_{t}\right) }\nonumber \\ & & -\left( 1-p_{t}\right) ^{2}\Phi ^{-1}\left( \frac{Q-f_{1t}p_{1t}^{2}-\frac{Qp_{t+1}-f_{1t}p_{t}^{2}}{p_{t}\left( 1-p_{t}\right) }}{\left( 1-p_{t}\right) ^{2}}\right) \frac{-p_{t}^{2}\left( 1-\frac{1}{p_{t}\left( 1-p_{t}\right) }\right) }{\left( 1-p_{t}\right) ^{2}}\nonumber \\ & = & -p_{t}^{2}\Phi ^{-1}\left( f_{1t}\right) +2\frac{p_{t}^{3}}{1-p_{t}}\Phi ^{-1}\left( \frac{Qp_{t+1}-f_{1t}p_{t}^{2}}{p_{t}\left( 1-p_{t}\right) }\right) \nonumber \\ & & +p_{t}^{2}\left( 1-\frac{1}{p_{t}\left( 1-p_{t}\right) }\right) \Phi ^{-1}\left( \frac{Q-f_{1t}p_{1t}^{2}-\frac{Qp_{t+1}-f_{1t}p_{t}^{2}}{p_{t}\left( 1-p_{t}\right) }}{\left( 1-p_{t}\right) ^{2}}\right) \end{eqnarray} \end_inset When \begin_inset Formula \( y \) \end_inset is minimized, the increase in \begin_inset Formula \( \bar{A}_{t+1} \) \end_inset is maximal. \layout Standard Note that while the derivative of \begin_inset Formula \( y \) \end_inset has been calculated, this will not always be possible. however, the optimization routines in standard libraries are both faster and more efficient when this derivative is available. \layout Standard Also note that the conversion yields a different starting point than the actual mass selection solution. While the frequencies are those reached by mass selection, the \begin_inset Formula \( f_{t} \) \end_inset are actually different, as they are selected to maximize \begin_inset Formula \( \bar{A}_{t+1} \) \end_inset in the subsequent generation. As such, the starting point is actually a superior solution to the mass selection solution. \layout Subsection Solving the Model \layout Standard Program \family typewriter dp2 \family default is the initial solution to the problem. At this stage, the goal is a clean and easily understood routine, and no attempts have been made at optimization of the algorithm. \layout Standard The mass selection solution is taken as the starting value. An array of bubbles is created with indices from \begin_inset Formula \( 0 \) \end_inset to the total number of generations considered, \family typewriter totGenerations \family default . Each of these bubbles is centered around the corresponding mass selection gene frequency. Bubble \begin_inset Formula \( 0 \) \end_inset has a single bubblette, as the starting frequency is a parameter of the problem. Each of the other bubbles receive bubblettes to reach from \begin_inset Formula \( .05 \) \end_inset beneath the mass selection value to as close as possible to \begin_inset Formula \( 1.0 \) \end_inset without exceeding it, and a grain of \begin_inset Formula \( .05 \) \end_inset is used for all generations. This is far larger than is needed, but by extending beyond all conceivable values, issues related to stepping past boundaries are avoided for the moment. \layout Standard The function \family typewriter bestVal() \family default is defined, which returns the best possible increase in the value of the objective function that can be reached from a specified bubble. This is the change in value from that particular choice of frequency at that generation, \emph on including the value from all subsequent choices. \emph default \layout Standard The function works by sequentially considering a subset of the possible choices for the \emph on next \emph default period's major gene frequency. However, before it does so, it checks to see if this bubblette has already been considered. If so, it returns the previously calculated value. Failing this, an initial guess, which will for the moment and for illustrative purposes be assumed to be \begin_inset Formula \( 0 \) \end_inset , as to the offset from the center of the next bubble is made. The corresponding bubblette in the next generation is queried for its value, to which the gain \begin_inset Formula \( \Delta \bar{A}_{t+1} \) \end_inset resulting from this tentative choice is added, and the result placed in \family typewriter testVal \family default . \family typewriter testVal \family default is also copied to the local variable \family typewriter thisBest \family default , which stores the best value of the best step found to date. \layout Standard The routine then tries an offset one greater than the first considered, and calculates its value in the same way. If the result is better than that stored in \family typewriter thisBest \family default , \family typewriter thisBest \family default is updated, and the process repeated for the next larger offset. \layout Standard If the result of the first step is inferior to \family typewriter thisBest \family default , the direction is switched; the offset one less than the stating value is considered. \layout Standard The process is repeated until the next value considered declines. At this time, the prior step is recognized as the best, the step stored in \family typewriter nextp \family default , and the value returned. \layout Standard It is important to note at this time that the bubblette with the highest value will not necessarily be chosen. \begin_float footnote \layout Standard In fact, it seems that this is rarely what happens. \end_float It is not only the value of the bubblette itself that matters, but also the value \emph on added \emph default in stepping to that bubblette. Thus adding \begin_inset Formula \( .5 \) \end_inset while reaching a bubblette with value \begin_inset Formula \( .3 \) \end_inset is more valuable than adding \begin_inset Formula \( .2 \) \end_inset while reaching a bubblette with value \begin_inset Formula \( .4 \) \end_inset . Additionally, though the value of the bubblette is fixed regardless of the step taken to reach it, the value added reaching it will vary depending upon the prior state. Thus different bubblettes will be chosen from different prior states. In non-trivial problems, it is expected that there will be a tradeoff between the stepping gain and the value of the bubblette; if this were not the case, both could be maximized. \layout Standard \family typewriter bestVal() \family default does have a \begin_inset Quotes eld \end_inset special \begin_inset Quotes erd \end_inset case for the final period. The last choice is made in the penultimate generation. \family typewriter bestVal() \family default is called again, but recognizes that it is called for the final generations, and simply returns \begin_inset Formula \( \left( 2p-1\right) a \) \end_inset , the contribution of the major gene in the final period. \layout Standard The function operates recursively. As such, querying the sole bubblette in generation \begin_inset Formula \( 0 \) \end_inset for its value results in calculation of needed values for all subsequent generations. \layout Standard When a solution is reached, the bubbles are \begin_inset Quotes eld \end_inset collapsed. \begin_inset Quotes erd \end_inset New bubbles are created centered about the path chosen, and with the grain reduced. Additionally, hints are stored regarding the \begin_inset Quotes eld \end_inset expected \begin_inset Quotes erd \end_inset path from bubblette to bubblette. For the center bubblettes, this is simply the next centered bubblette. For other bubblettes \emph on in the region calculated in the prior iteration, \emph default the hint is the bubblette at the same point in space which was previously reached. Due to the collapsing, there will be somewhat arbitrary choices made as to which bubblettes correspond to which. Rather then spend effort and computation on the matter, this is simply left to the default rounding performed by Fortran: at most, it will be off by a single state in any dimension, and adjacent states will be checked in any event. \layout Standard A mininum of two bubblettes above and below the center will be created within each bubble. However, a check is made to determine how many states were actually checked in that direction for that bubble, and if larger than two, this quantity is used instead. Furthermore, a check is made to insure that no boundary states were chosen in the final solution--this could indicate that a better state existed after the boundary. In this case, the boundary is doubled, and the iteration repeated with the same grain. \begin_inset LatexCommand \label{boundarydoubling} \end_inset \begin_float margin \layout Standard is the repetition really needed? discuss? \end_float \layout Standard The process is repeated until the grain of the probability space is satisfactori ly small, with the final iteration yielding the reported solution. \layout Standard Using the equations from \begin_inset LatexCommand \cite{Dekkers} \end_inset , it is possible to use standard iterative methods to find the optimal values. However, this does not solve the problem, but rather uses iteration to reach an already found solution. Dekkers' method yields the following value for fifteen generations, \layout Standard qbegin include \begin_inset Include \verbatiminput{/home/hawk/Dissertation/sumres.15} \end_inset \layout LyX-Code \size footnotesize end include \layout LyX-Code \size footnotesize \protected_separator \layout LyX-Code \size footnotesize \protected_separator h2=0.300 \protected_separator sigp= 1.000 \protected_separator a=0.250 \protected_separator d=0.000 Q=0.200 \protected_separator \protected_separator NO disequilibrium \layout LyX-Code \size footnotesize \protected_separator \layout LyX-Code \size footnotesize \protected_separator OPTIMUM SELECTION (convergence in 290 iterations) \layout LyX-Code \size footnotesize Gen \protected_separator \protected_separator p \protected_separator \protected_separator \protected_separator \protected_separator \protected_separator \protected_separator \protected_separator \protected_separator \protected_separator \protected_separator G \protected_separator \protected_separator \protected_separator \protected_separator \protected_separator \protected_separator \protected_separator \protected_separator \protected_separator \protected_separator \protected_separator \protected_separator \protected_separator Abar \protected_separator \protected_separator \protected_separator \protected_separator f1 \protected_separator \protected_separator \protected_separator \protected_separator \protected_separator \protected_separator f2 \protected_separator \protected_separator \protected_separator \protected_separator \protected_separator \protected_separator f3 \layout LyX-Code \size footnotesize \protected_separator 1 0.05000 -0.224999994039536 \protected_separator 0.00000 0.500295 0.327620 0.185734 \layout LyX-Code \size footnotesize \protected_separator 2 0.08406 \protected_separator 0.207228153944016 \protected_separator 0.41520 0.428348 0.292564 0.181085 \layout LyX-Code \size footnotesize \protected_separator 3 0.12777 \protected_separator 0.644088208675385 \protected_separator 0.83020 0.380323 0.268445 0.176079 \layout LyX-Code \size footnotesize \protected_separator 4 0.18062 \protected_separator 1.085383772850037 \protected_separator 1.24507 0.346372 0.250374 0.170677 \layout LyX-Code \size footnotesize \protected_separator 5 0.24178 \protected_separator 1.530712127685547 \protected_separator 1.65982 0.321152 0.235861 0.164811 \layout LyX-Code \size footnotesize \protected_separator 6 0.31006 \protected_separator 1.979507923126221 \protected_separator 2.07448 0.301616 0.223486 0.158368 \layout LyX-Code \size footnotesize \protected_separator 7 0.38402 \protected_separator 2.431057214736938 \protected_separator 2.48904 0.285942 0.212349 0.151197 \layout LyX-Code \size footnotesize \protected_separator 8 0.46200 \protected_separator 2.884529113769531 \protected_separator 2.90353 0.272983 0.201804 0.143082 \layout LyX-Code \size footnotesize \protected_separator 9 0.54213 \protected_separator 3.338995933532715 \protected_separator 3.31793 0.261977 0.191297 0.133716 \layout LyX-Code \size footnotesize 10 0.62241 \protected_separator 3.793446540832520 \protected_separator 3.73224 0.252411 0.180264 0.122658 \layout LyX-Code \size footnotesize 11 0.70074 \protected_separator 4.246827125549316 \protected_separator 4.14646 0.243907 0.167976 0.109237 \layout LyX-Code \size footnotesize 12 0.77496 \protected_separator 4.698039531707764 \protected_separator 4.56056 0.236185 0.153314 0.092416 \layout LyX-Code \size footnotesize 13 0.84291 \protected_separator 5.145965099334717 \protected_separator 4.97451 0.229023 0.134196 0.070581 \layout LyX-Code \size footnotesize 14 0.90244 \protected_separator 5.589468955993652 \protected_separator 5.38825 0.222212 0.105829 0.041521 \layout LyX-Code \size footnotesize 15 0.95144 \protected_separator 6.027318954467773 \protected_separator 5.80160 0.215555 0.052549 0.007065 \layout LyX-Code \size footnotesize 16 0.98777 \protected_separator 6.457773208618164 \protected_separator 6.21389 0.000000 0.000000 0.000000 \layout LyX-Code \size footnotesize \protected_separator \layout LyX-Code \size footnotesize \protected_separator MASS SELECTION \layout LyX-Code \size footnotesize Gen \protected_separator \protected_separator p \protected_separator \protected_separator \protected_separator \protected_separator \protected_separator \protected_separator \protected_separator \protected_separator \protected_separator \protected_separator G \protected_separator \protected_separator \protected_separator \protected_separator \protected_separator \protected_separator \protected_separator \protected_separator \protected_separator \protected_separator \protected_separator \protected_separator \protected_separator Abar \protected_separator \protected_separator \protected_separator \protected_separator f1 \protected_separator \protected_separator \protected_separator \protected_separator \protected_separator \protected_separator f2 \protected_separator \protected_separator \protected_separator \protected_separator \protected_separator \protected_separator f3 \layout LyX-Code \size footnotesize \protected_separator 1 0.05000 -0.224999994039536 \protected_separator 0.00000 0.356053 0.267948 0.192415 \layout LyX-Code \size footnotesize \protected_separator 2 0.06809 \protected_separator 0.202594995498657 \protected_separator 0.41855 0.352384 0.264710 0.189730 \layout LyX-Code \size footnotesize \protected_separator 3 0.09215 \protected_separator 0.632719278335571 \protected_separator 0.83664 0.347544 0.260449 0.186207 \layout LyX-Code \size footnotesize \protected_separator 4 0.12370 \protected_separator 1.066016197204590 \protected_separator 1.25417 0.341266 0.254944 0.181672 \layout LyX-Code \size footnotesize \protected_separator 5 0.16429 \protected_separator 1.503146409988403 \protected_separator 1.67100 0.333305 0.247997 0.175977 \layout LyX-Code \size footnotesize \protected_separator 6 0.21523 \protected_separator 1.944672703742981 \protected_separator 2.08706 0.323501 0.239494 0.169049 \layout LyX-Code \size footnotesize \protected_separator 7 0.27719 \protected_separator 2.390885829925537 \protected_separator 2.50229 0.311860 0.229472 0.160944 \layout LyX-Code \size footnotesize \protected_separator 8 0.34968 \protected_separator 2.841590642929077 \protected_separator 2.91675 0.298649 0.218199 0.151907 \layout LyX-Code \size footnotesize \protected_separator 9 0.43069 \protected_separator 3.295939683914185 \protected_separator 3.33060 0.284411 0.206169 0.142358 \layout LyX-Code \size footnotesize 10 0.51654 \protected_separator 3.752383232116699 \protected_separator 3.74411 0.269933 0.194065 0.132851 \layout LyX-Code \size footnotesize 11 0.60243 \protected_separator 4.208878993988037 \protected_separator 4.15767 0.256081 0.182607 0.123948 \layout LyX-Code \size footnotesize 12 0.68336 \protected_separator 4.663304328918457 \protected_separator 4.57162 0.243602 0.172390 0.116090 \layout LyX-Code \size footnotesize 13 0.75530 \protected_separator 5.113923549652100 \protected_separator 4.98628 0.232973 0.163768 0.109519 \layout LyX-Code \size footnotesize 14 0.81586 \protected_separator 5.559716701507568 \protected_separator 5.40178 0.224359 0.156834 0.104277 \layout LyX-Code \size footnotesize 15 0.86451 \protected_separator 6.000436305999756 \protected_separator 5.81818 0.217659 0.151476 0.100251 \layout LyX-Code \size footnotesize 16 0.90209 \protected_separator 6.436436176300049 \protected_separator 6.23539 0.000000 0.000000 0.000000 \layout Standard Without using any of the optimal control equations, dp2 yields essentally the same results:q \layout Subsection Mechanics of Conversion to Frequency Space \layout Standard As discussed in \begin_inset LatexCommand \ref{sec:replacechoicevars} \end_inset , a fundamental change to the problem has been made while converting to frequency space, and the reduction from a two-dimensional search space to a single-dimensional space. While the same answer will be reached at the optimal solution, the starting point is not the same as the mass selection solution. \layout Paragraph choosing f1 & f2 \layout Standard With \begin_inset Formula \( p_{t+1} \) \end_inset chosen, \begin_inset Formula \( f_{t} \) \end_inset should be chosen to maximize \begin_inset Formula \( \Delta \bar{A}_{t+1} \) \end_inset . The optimization problem is \begin_inset Formula \begin{eqnarray} max_{f_{t}}L & = & \Delta \bar{A}_{t+1}\nonumber \\ & & +\lambda \left[ f_{1t}p_{t}^{2}+f_{2t}p_{t}\left( 1-p_{t}\right) -Qp_{t+1}\right] +\mu \left[ f_{1t}p_{t}^{2}+f_{2t}2p_{t}\left( 1-p_{t}\right) +f_{3t}\left( 1-p_{t}^{2}\right) \right] \nonumber \\ & = & \frac{\sigma }{Q}\left\{ p_{t}^{2}z_{1t}+2p_{t}\left( 1-p_{t}\right) z_{2t}+\left( 1-p_{t}^{2}z_{3t}\right) \right\} \nonumber \\ & & +\lambda \left[ f_{1t}p_{t}^{2}+f_{2t}p_{t}\left( 1-p_{t}\right) -Qp_{t+1}\right] +\mu \left[ f_{1t}p_{t}^{2}+f_{2t}2p_{t}\left( 1-p_{t}\right) +f_{3t}\left( 1-p_{t}^{2}\right) \right] \end{eqnarray} \end_inset which would require an iterative solution, due to the presence of inverses of the normal cumulative distribution function. As such, it is simpler to use a routine from a standard library. Still, such routines require boundaries for the search. \layout Standard \begin_inset Formula \( f_{1t} \) \end_inset will be used as the choice variable, with \begin_inset Formula \( f_{2t} \) \end_inset determined by this choice. In any \begin_inset Quotes eld \end_inset sane \begin_inset Quotes erd \end_inset choice, \begin_inset Formula \begin{equation} f_{1t}>f_{2t} \end{equation} \end_inset This will generally be used as the lower bound, and can be calculated as \begin_inset Formula \begin{equation} f^{0}_{1tmin}=Q\frac{p_{t+1}}{p_{t}} \end{equation} \end_inset \layout Standard In the special case where setting \begin_inset Formula \( f_{1t} \) \end_inset and \begin_inset Formula \( f_{2t} \) \end_inset equal causes these portions to exceed \begin_inset Formula \( Q \) \end_inset , \begin_inset Formula \( f_{1t} \) \end_inset must be increased; the floor has been understated. The new floor will still allocate nothing, or as close to nothing as allocated by the algorithm, to \begin_inset Formula \( f_{3t} \) \end_inset . As such, [ \begin_inset LatexCommand \ref{dp2qcons} \end_inset ] reduces to \begin_inset Formula \begin{equation} f_{1t}p_{t}^{2}+f_{2t}2p_{t}\left( 1-p_{t}\right) =Q \end{equation} \end_inset which combined with [ \begin_inset LatexCommand \ref{dp2prule} \end_inset ] yields two equations in two unknowns for the minimal value of \begin_inset Formula \( f_{1t} \) \end_inset . As \begin_inset Formula \( f_{1t} \) \end_inset rises from this value, \begin_inset Formula \( f_{2t} \) \end_inset must diminish, and \begin_inset Formula \( f_{3t} \) \end_inset will rise. Solving the equations, \begin_inset Formula \begin{eqnarray} \left[ \begin{array}{cc} p_{t}^{2} & 2p_{t}\left( 1-p_{t}\right) \\ p_{t}^{2} & p_{t}\left( 1-p_{t}\right) \end{array}\right] \left[ \begin{array}{c} f_{1t}\\ f_{2t} \end{array}\right] & = & \left[ \begin{array}{c} Q\\ Qp_{t+1} \end{array}\right] \nonumber \\ \left[ \begin{array}{cc} p_{t} & 2\left( 1-p_{t}\right) \\ p_{t} & \left( 1-p_{t}\right) \end{array}\right] \left[ \begin{array}{c} f_{1t}\\ f_{2t} \end{array}\right] & = & \frac{Q}{p_{t}}\left[ \begin{array}{c} 1\\ p_{t+1} \end{array}\right] \nonumber \\ \left[ \begin{array}{c} f_{1t}\\ f_{2t} \end{array}\right] & = & \frac{Q}{p_{t}}\frac{\left[ \begin{array}{cc} \left( 1-p_{t}\right) & -2\left( 1-p_{t}\right) \\ -p_{t} & p_{t} \end{array}\right] }{p_{t}\left( 1-p_{t}\right) -p_{t}2\left( 1-p_{t}\right) }\left[ \begin{array}{c} 1\\ p_{t+1} \end{array}\right] \nonumber \\ & = & -\frac{Q}{p_{t}}\frac{\left[ \begin{array}{cc} \left( 1-p_{t}\right) & -2\left( 1-p_{t}\right) \\ -p_{t} & p_{t} \end{array}\right] }{p_{t}\left( 1-p_{t}\right) }\left[ \begin{array}{c} 1\\ p_{t+1} \end{array}\right] \nonumber \\ & = & -\frac{Q}{p_{t}^{2}\left( 1-p_{t}\right) }\left[ \begin{array}{c} \left( 1-p_{t}\right) -2\left( 1-p_{t}\right) p_{t+1}\\ p_{t}+p_{t}p_{t+1} \end{array}\right] \nonumber \\ & = & -\frac{Q}{p_{t}^{2}\left( 1-p_{t}\right) }\left[ \begin{array}{c} \left( 1-p_{t}\right) \left( 1-2p_{t+1}\right) \\ p_{t}\left( 1+p_{t+1}\right) \end{array}\right] \nonumber \\ & = & \left[ \begin{array}{c} \frac{Q\left( 2p_{t+1}-1\right) }{p_{t}^{2}}\\ \frac{Q\left( 1+p_{t+1}\right) }{p_{t}\left( 1-p_{t}\right) } \end{array}\right] \end{eqnarray} \end_inset of which only the first is used. \layout Standard An upper bound must similarly be found. For \begin_inset Formula \begin{equation} p_{t}<\sqrt{Q} \end{equation} \end_inset it is possible to use all of the homozygotes, and \begin_inset Formula \begin{equation} f_{1tmax}^{0}=1 \end{equation} \end_inset For larger values of \begin_inset Formula \( p_{t} \) \end_inset , \begin_inset Formula \begin{equation} f_{1tmax}^{1}=\frac{Q}{p_{t}^{2}} \end{equation} \end_inset selects entirely from this type, and is used as the maximum. \layout Standard For this finite model, each generation has its own bubble; the next bubble is known with certainty. For the initial pass, sufficient bubblettes are used such that the full range \begin_inset Formula \( \left( 0,1\right) \) \end_inset is available in each choice generation, save that a single bubblette is available for generation \begin_inset Formula \( 0 \) \end_inset , as its gene frequency is predetermined as part of the problem. \layout Standard An initial grain of \begin_inset Formula \( .05 \) \end_inset is set for \begin_inset Formula \( p_{t} \) \end_inset in all generations, and the bubblette of generation \begin_inset Formula \( 0 \) \end_inset is asked for its value. \layout Standard When a bubblette is asked for its value, it firsts checks to see if it has been calculated already. If so, it returns its value. If not, it checks itself for a hint from prior generations, choosing a hint of \begin_inset Formula \( 0 \) \end_inset if none is found. Having established the hint, the a \begin_inset Quotes eld \end_inset sanity check \begin_inset Quotes erd \end_inset \begin_float margin \layout Standard discuss danger of check & missing optima \end_float is made on the hint: the gene frequency will never decrease from generation to generation; any such step is wrong. As such, if the hint suggests reducing the next generation's frequency below the present level, it is rejected, and increase to a sane level. Similarly, a sanity check is made to insure that the \begin_inset Formula \( p_{t+1} \) \end_inset considered is actually a possible transition. \layout Standard This achieved, the value of the hint is calculated. The bubblette from the next generation corresponding to the hint is queried, and \begin_inset Formula \( \Delta \bar{A}_{t+1} \) \end_inset is added to this value. This is stored as the best tentative value, and the next higher bubblette in the next generation is checked. If that bubblette is better, it becomes the new hint, and the search proceeds in that direction as long as progress is made. If inferior, the next lower bubblette is checked, and the search proceeds in that direction as long as it is successful. However, if the initial hint is less than \begin_inset Formula \( 0 \) \end_inset , either from a sanity check or the initial hint, the search starts first in the negative direction, and switches to positive if appropriate. \layout Standard Once a bubblette selects a value in this manner, it stores the final hint, indicating the next state chosen, its value, and the fact of calculation. It then returns. \layout Standard After the initial bubblette returns a value, the optimal path can be found by stepping from bubble to bubble, which is the method used by \family typewriter candidateFromBubbles \family default to form a reportable solution. \layout Standard With a solution found, another iteration is made. If any of the steps chosen as optimal were on the boundary of a bubblette, it is not clear that a further step was not desirable. That boundary is doubled, and the process repeated with the same grain. \layout Standard If no boundaries were used, the optimal solution becomes the base frequency of the next solution. \layout Standard Initially, the boundaries were set by doubling one more than the highest state actually used in that direction (above or below the base), with a minimum of ten. This resulted in a solution time of \begin_inset Formula \( 40.1 \) \end_inset seconds (also using zero rather than the actual value for hints). Changing this to one larger than was actually used, with a minimum of two, significantly improved performance, to \begin_inset Formula \( 27.7 \) \end_inset seconds, or a reduction in processing of about one third. Enabling the hints further reduced processing time to \begin_inset Formula \( 16.7 \) \end_inset seconds, only a third of the initial amount. Interestingly, storing \begin_inset Formula \( f_{1t} \) \end_inset to use as a hint in subsequent iterations actually \emph on increased \emph default the execution time by a marginal amount, to \begin_inset Formula \( 16.9 \) \end_inset seconds. \layout Subsection Adding Discounting \layout Standard To this point, the only concern has been the maximum amount of \emph on progress \emph default that can be made, and only the final generation has been considered. It is not difficult, however, to modify the work done to this point to allow for an infinite horizon. \layout Standard Previously, the program considered the effect of the major gene only in the final generation, and added the gain in polygenic value from each generatio n, creating a sum equal to the value in the final generation. \layout Standard It is a property of the genetic model that changes in polygenic value are permanent. As such, a change in generation \begin_inset Formula \( t \) \end_inset increases all generations by the same amount. The discounted value of an increase \begin_inset Formula \( b \) \end_inset , with a discounted value \begin_inset Formula \begin{equation} d=1-r \end{equation} \end_inset is not difficult to calculate for a finite horizon: \layout Standard \begin_inset Formula \begin{eqnarray} \sum _{i=0}^{T}bd^{i} & = & \sum _{i=0}^{\infty }bd^{i}-\sum _{i=T+1}^{\infty }bd^{i}\nonumber \\ & = & \sum _{i=0}^{\infty }bd^{i}-d^{T+1}\sum _{i=0}^{\infty }bd^{i}\nonumber \\ & = & b\frac{1-d^{T+1}}{1-d}\label{identfinitediscount} \end{eqnarray} \end_inset \layout Standard To calculate a discounted value, then, \family typewriter bestVal() \family default need only be modified to apply the identity in [ \begin_inset LatexCommand \ref{identfinitediscount} \end_inset ] to the calculated value for \begin_inset Formula \( \Delta \bar{A}_{t+1} \) \end_inset , discount this value and that returned from \family typewriter nextVal() \family default , and add the current value of the major gene.This is accomplished by a simple if/then structure in \family typewriter bestVal() \family default . To stay with a single code base, these actions are taken only if the variable discount has a non-zero value. The only other change required is to change the print routine for tentative solutions such that the present value of each generation and its future is displayed. The present value \emph on of the choice made \emph default is reported for each generation. This is not the same as the present value of the generation; the value of the current state is not included. By calculating in this manner, it is easier to compare the relative value of choices later when choosing whether or not to test. \layout Subsection An Infinite Horizon \layout Standard Extending the problem to an infinite horizon is also straightforward, and can be done in a number of different ways. \latex latex \backslash sout{Heading: simplistic approach} \layout Standard The simplest approach, which will be used for the moment, is simply to use the solution for the \begin_inset Formula \( n \) \end_inset -generation problem as a starting point for the \begin_inset Formula \( n+1 \) \end_inset -generation solution. The starting value for final \begin_inset Formula \( p_{t} \) \end_inset is taken as increasing by half as much as in the prior generation, but no more than half of the distance to \begin_inset Formula \( 1.0 \) \end_inset . \layout Standard Due to steep discounting, future generations eventually have very little present value. Furthermore, the \emph on difference \emph default between the present value of subsequent far off generations declines. As such, for any arbitrarily small \begin_inset Formula \( \epsilon \) \end_inset , there exists an \begin_inset Formula \( n \) \end_inset such that the difference between the present values of proceeding for \begin_inset Formula \( n \) \end_inset and for \begin_inset Formula \( n+1 \) \end_inset generations is less than \begin_inset Formula \( \epsilon \) \end_inset . This is the first approach taken, and requires minimal modifications such that the prior version of the program is placed in a loop, which exits when the gain between subsequent generations is less than the specified convergence criterion. \layout Subsubsection Using Mass Selection \layout Standard While the simplistic approach could work, it is not guaranteed to without significant work, as the routines used have an upper bound on the \begin_inset Formula \( p_{t} \) \end_inset for which inverses of distribution functions may be calculated. However, a better method exists. In the simple approach, the entire value of the added generation is an increase, though it is possibly offset by different actions in prior generation s (the steps that are optimal for \begin_inset Formula \( n \) \end_inset and \begin_inset Formula \( n+1 \) \end_inset generations are not the same, and thus the first \begin_inset Formula \( n \) \end_inset steps of the \begin_inset Formula \( n+1 \) \end_inset generation solution are worth less than the \begin_inset Formula \( n \) \end_inset generation solution). A more efficient solution is to instead of ignoring generations after \begin_inset Formula \( n \) \end_inset to switch to mass selection, and calculate the present value in that manner. This simplifies the calculations in [ \begin_inset LatexCommand \ref{identfinitediscount} \end_inset ], which becomes \begin_inset Formula \begin{equation} \label{identinfdiscount} \sum _{i=0}^{\infty }bd^{i}=\frac{b}{1-d} \end{equation} \end_inset \layout Standard The only further modification required is to change \family typewriter bestVal() \family default such that in the final generation, it returns the present discounted value of future generations under mass selection. \layout Paragraph The Value of Mass Selection \layout Standard Starting with \begin_inset Formula \begin{equation} \left[ \begin{array}{ccccc} x_{1t} & -x_{2t} & & & -\frac{a}{\sigma }\\ x_{1t} & & -x_{3t} & & -2\frac{a}{\sigma }\\ & & & \Phi \left( -x_{1t}\right) p_{t}^{2}+\Phi \left( -x_{2t}\right) p_{t}\left( 1-p_{t}\right) +\Phi \left( -x_{3t}\right) \left( 1-p_{t}\right) ^{2} & -Q \end{array}\right] =0 \end{equation} \end_inset yields \begin_inset Formula \begin{equation} \Phi \left( -x_{1t}\right) p_{t}^{2}+\Phi \left( -x_{1t}+\frac{a}{\sigma }\right) p_{t}\left( 1-p_{t}\right) +\Phi \left( -x_{1t}+2\frac{a}{\sigma }\right) \left( 1-p_{t}\right) ^{2}-Q \end{equation} \end_inset or \begin_inset Formula \[ L=\Phi \left( -x_{1t}\right) p_{t}^{2}+\Phi \left( -x_{2t}\right) p_{t}\left( 1-p_{t}\right) +\Phi \left( -x_{3t}\right) \left( 1-p_{t}\right) ^{2}-Q\] \end_inset and \begin_inset Formula \begin{eqnarray} L' & = & x_{1t}\phi ^{2}\left( -x_{1t}\right) p_{t}^{2}+\left( x_{1t}-\frac{a}{\sigma }\right) \phi ^{2}\left( -x_{1t}+\frac{a}{\sigma }\right) p_{t}\left( 1-p_{t}\right) \nonumber \\ & & +\left( x_{1t}-2\frac{a}{\sigma }\right) \phi ^{2}\left( -x_{1t}+2\frac{a}{\sigma }\right) \left( 1-p_{t}\right) ^{2} \end{eqnarray} \end_inset These equations are fed to a numeric routine for solution; reducing to the single variable \begin_inset Formula \( x_{1t} \) \end_inset speeds calculation. The resultant values are calculated and discounted until \begin_inset Formula \( p_{t+1} \) \end_inset is within the constant \family typewriter peps \family default , the limit of resolution for gene frequency, of \family typewriter \begin_inset Formula \( 1.0 \) \end_inset . \begin_inset Formula \( p_{t} \) \end_inset \family default is then accepted as being equal to one, and the gain in polygenic value becomes fixed for all later generations, and [ \begin_inset LatexCommand \ref{identinfdiscount} \end_inset ] is used to add the value of all future gain. \begin_float margin \layout Standard The \family typewriter pcrit \family default limit doesn't make sense in the finite horizon, as state isn't Markov \end_float \layout Subsection Testing Costs \layout Standard The results so far consider only the revenue from the breeding program, and not the costs. Realistically, a cost should be imposed when animals are tested, and the breeding program should only continue when the benefits exceed the cost, the benefit being the gain \emph on in excess of \emph default the gain from mass selection. The algorithm changes only slightly to handle this variation: the best possible breeding choice is still found, but its value is compared to the value of switching to mass selection. If it does not beat mass selection by the testing cost \begin_inset Formula \( c \) \end_inset , the switch is made. Using a discount rate of \begin_inset Formula \( 8\% \) \end_inset and a test cost of \begin_inset Formula \( .1 \) \end_inset with Dekkers' parameters, the transition to mass selection occurs after generation \begin_inset Formula \( 7 \) \end_inset . \layout Subsection Changing the Horizon \layout Standard The largest computational cost is not in calculating the values of the states, but in preparing the bubblettes for this computation. Once it is known that states beyond a given generation are not used, there is no reason to continue calculating these states. Similarly, if a breeding program has not switched to mass selection, a longer program may be desirable. The control variable \family typewriter smartShrinkGens \family default is added to handle this situation. \layout Standard With \family typewriter smartShrinkGens \family default set, if mass selection is not chosen in the final choice generation, the time horizon is increased by one. The solution of the current iteration, augmented by mass selection for the final generation, is taken as the center, and the next iteration is run with the same grain. \layout Standard Conversely, if the switch to mass selection occurs before the final generation, the generation in which the switch occurs becomes the final generation. However, the grain is reduced, as the available states are a subset of the states already considered. \layout Standard This final model can be expressed as \begin_inset Formula \begin{eqnarray} \max _{T,\left\{ f_{t}:0\leq <T-1\right\} }L & = & \sum _{t=0}^{T-1}\left( 1-d\right) ^{t+1}\left[ a\left( 2p_{t+1}\left( f_{t}\right) -1\right) +\frac{\Delta \bar{A}_{t+1}\left( f_{t}\right) }{d}-c\right] \nonumber \\ & & +\left( 1-d\right) ^{T}\left[ \frac{a}{d}+\frac{\Delta \bar{A}}{d^{2}}\right] \label{dp2finalobj} \end{eqnarray} \end_inset \begin_float margin \layout Standard discuss 20% penalty for \begin_inset Formula \( \pm 3 \) \end_inset rather than \begin_inset Formula \( \pm 2 \) \end_inset \end_float \layout Standard \latex latex \backslash pagebreak \layout Section Two Dimensions: Adding Disequilibrium \layout Standard The model given in [ \begin_inset LatexCommand \ref{dp2finalobj} \end_inset ] is incomplete in several ways. The first to be considered is that of \emph on gametic phase disequilibrium \emph default \latex latex \backslash jd{ \latex default between the major gene and polygenes \latex latex } \latex default . \emph on \begin_inset LatexCommand \index{gametic phase disequilibrium} \end_inset \emph default While not considered to this point, the selection intensity on the polygenes is weaker for type BB than for Bb, which is in turn weaker than for bb. This is a simple consequence of the higher fractions selected from BB and Bb; animals with a lower polygenic value than in bb survive selection. \layout Standard As with the first model, the actual choice variables, \begin_inset Formula \( f_{t} \) \end_inset , will not be used. Rather than a single state variable and an internal optimization given that variable, a pair of state variables will now be used: \begin_inset Formula \( p_{t} \) \end_inset , and the difference between the \latex latex \backslash jd{ \latex default average \latex latex } \latex default polygenic values \latex latex \backslash jd{ \latex default associated with the B and b gamets, \latex latex } \latex default \begin_inset Formula \( \bar{A}_{B,t}-\bar{A}_{b,t} \) \end_inset . \layout Subsection Gametic Phase Disequilibrium \layout Standard The \begin_float margin \layout Standard should this go in an earlier section? \end_float single dimensional approache ignores the effects of \begin_inset Quotes eld \end_inset gametic phase disequilibrium, \begin_inset Quotes erd \end_inset \begin_inset LatexCommand \index{disequilibrium, gametic phase} \end_inset , a well known consequence of selection. \begin_inset LatexCommand \cite{IQG} \end_inset . Under selection, the superior homozygotes, BB, are subject to a lesser selection intensity for polygenic effects than the other types, \begin_inset LatexCommand \cite{Dekkers} \end_inset and as a consequence, have a lower polygenic value. Rather than a single \latex latex \backslash jd{ \latex default average \latex latex \latex default polygenic \latex latex } \latex default value \begin_inset Formula \( \bar{A}_{t} \) \end_inset , \latex latex \backslash so{ \latex default there are \latex latex } \latex default now two values, \begin_inset Formula \( \bar{A}_{B,t} \) \end_inset and \begin_inset Formula \( \bar{A}_{b,t} \) \end_inset \latex latex \backslash jd{ \latex default must be distiguished \latex latex } \latex default , reflecting \latex latex \backslash jd{ \latex default average \latex latex \latex default polygenic values for gametes carrying the B and b allelles, respectively. \latex latex } \latex default As each individual gets two gametes, one from each parent, the average polygenic values for BB, Bb, and bb are then, respectively, \begin_inset Formula \( 2\bar{A}_{B,t} \) \end_inset , \begin_inset Formula \( \bar{A}_{B,t}+\bar{A}_{b,t} \) \end_inset , and \begin_inset Formula \( 2\bar{A}_{b,t} \) \end_inset . The overall value is then a weighted average, \begin_inset Formula \begin{equation} \bar{A}_{t}=2p_{t}\bar{A}_{B,t}+2\left( 1-p_{t}\right) \bar{A}_{b,t} \end{equation} \end_inset \layout Standard Similarly \begin_float footnote \layout Standard similar or similarly? does it modify \begin_inset Quotes eld \end_inset problem \begin_inset Quotes erd \end_inset or \begin_inset Quotes eld \end_inset written \begin_inset Quotes erd \end_inset ? \end_float to the single dimensional problem, these average values can be written as recursive equations: \begin_inset Formula \begin{equation} \label{diseqABt1} \bar{A}_{B,t+1}=\frac{f_{1t}p_{t}^{2}\left( \bar{A}_{B,t}+\frac{1}{2}i_{1t}\sigma \right) +f_{2t}p_{t}\left( 1-p_{t}\right) \frac{1}{2}\left( \bar{A}_{B,t}+\bar{A}_{b,t}+i_{2t}\sigma \right) }{f_{1t}p_{t}^{2}+f_{2t}p_{t}\left( 1-p_{t}\right) } \end{equation} \end_inset and \begin_inset Formula \begin{equation} \label{diseqAbt1} \bar{A}_{b,t+1}=\frac{f_{2t}p_{t}\left( 1-p_{t}\right) \frac{1}{2}\left( \bar{A}_{B,t}+\bar{A}_{b,t}+i_{2t}\sigma \right) +f_{3t}\left( 1-p_{t}\right) ^{2}\left( \bar{A}_{b,t}+\frac{1}{2}i_{3t}\sigma \right) }{1-f_{1t}p_{t}^{2}-f_{2t}p_{t}\left( 1-p_{t}\right) } \end{equation} \end_inset \begin_float margin \layout Standard jack: is \begin_inset Formula \( \bar{A}_{b,t+1} \) \end_inset correct? \end_float the first of which is Dekkers' equation 10. Note that this usage differs from Dekkers' introduction of \begin_inset Formula \( W_{B,t}=p_{t}\bar{A}_{B,t} \) \end_inset and \begin_inset Formula \( W_{b,t}=\left( 1-p_{t}\right) \bar{A}_{b,t} \) \end_inset . While this change was advantageous for the use of optimal control, it would introduce complications for the methods below. Particularly, the raw polygenic values have the same units, which allows their difference to be defined, while the \begin_inset Formula \( W \) \end_inset are weighted to reflect their contributions to the overal average polygenic value. \layout Subsection Finding the State Variables \layout Standard As before, there are more choice variables than state variables, and non-lineari ty makes the actual choice variables impractical for dynamic programming. Two \latex latex \backslash sout{ \latex default choice \latex latex } \backslash rh{ \latex default state \latex latex } \latex default variables are now needed, and the mechanism for maximising [ \begin_inset LatexCommand \ref{dpsiminfanextoff1} \end_inset ] is now irrelevant. The choice of \begin_inset Formula \( p_{t+1} \) \end_inset and \latex latex \backslash rh{ \latex default a single additional variable for period \begin_inset Formula \( t+1 \) \end_inset \latex latex } \latex default d \latex latex \backslash so{ \latex default such other state variable as is chosen \latex latex } \latex default will \emph on exactly \emph default define \begin_inset Formula \( f_{mt} \) \end_inset , \latex latex \backslash jd{ \latex default as [ \begin_inset LatexCommand \ref{siminfqcns} \end_inset ], [ \begin_inset LatexCommand \ref{diseqABt1} \end_inset ], and [ \begin_inset LatexCommand \ref{diseqAbt1} \end_inset ] create three equations in the three unknowns \begin_inset Formula \( f_{mt} \) \end_inset \latex latex } \latex default . \layout Standard The new fitness function for a given generation, comparable to [ \begin_inset LatexCommand \ref{siminfobjfn} \end_inset ] is now \begin_inset Formula \begin{equation} \label{diseqrawobjfn} \bar{G}_{t}=a\left( 2p_{t}-1\right) +2p_{t}\bar{A}_{B,t}+2\left( 1-p_{t}\right) \bar{A}_{b,t} \end{equation} \end_inset However, subsituting [ \begin_inset LatexCommand \ref{diseqABt1} \end_inset ] and [ \begin_inset LatexCommand \ref{diseqAbt1} \end_inset ] into[ \begin_inset LatexCommand \ref{diseqrawobjfn} \end_inset ] does not yield a simple result such as [ \begin_inset LatexCommand \ref{siminfarule} \end_inset ], in which the change \latex latex \backslash jd{ \latex default in \begin_inset Formula \( \bar{A} \) \end_inset \latex latex } \latex default is easily isolated. Instead, a dependence upon two variables of the prior generation \latex latex \backslash jd{ \latex default , \begin_inset Formula \( \bar{A}_{B,t} \) \end_inset and \begin_inset Formula \( \bar{A}_{b,t} \) \end_inset , \latex latex } \latex default remains. In order to create a useful algorithm, it is necessary to completely isolate the effects of the past \latex latex \backslash jd{ \latex default states \latex latex } \latex default and the \latex latex \backslash so{ \latex default changes made \latex latex } \latex default \latex latex \backslash jd{ \latex default choices made; it is necessary to have an expression such as [**] which describes the effect of the choice on the objective function \latex latex } \latex default . \layout Standard One way of doing this is to find an expression for \begin_inset Formula \( \bar{A}_{B,t+1}-\bar{A}_{b,t+1} \) \end_inset \latex latex \backslash rh{ \latex default in terms of \begin_inset Formula \( \bar{A}_{B,t}-\bar{A}_{b,t} \) \end_inset , \begin_inset Formula \( p_{t} \) \end_inset , \begin_inset Formula \( p_{t+1} \) \end_inset , and \begin_inset Formula \( f_{t} \) \end_inset . Using the relation that \latex latex } \latex default \begin_inset Formula \begin{equation} i=\frac{z}{f} \end{equation} \end_inset \begin_inset LatexCommand \cite[equation 11.5]{iqg} \end_inset \latex latex \backslash rh{ \latex default , noting that the denominator of \begin_inset LatexCommand \ref{diseqABt1} \end_inset is equal to \begin_inset Formula \( Qp_{t+1}, \) \end_inset and rearranging terms, \latex latex } \layout Standard \begin_inset Formula \begin{eqnarray} \bar{A}_{B,t+1} & = & \frac{f_{1t}p_{t}^{2}\bar{A}_{B,t}+f_{2t}p_{t}\left( 1-p_{t}\right) \frac{1}{2}\left( \bar{A}_{B,t}+\bar{A}_{b,t}\right) +p_{t}^{2}\frac{1}{2}z_{1t}\sigma +p_{t}\left( 1-p_{t}\right) \frac{1}{2}z_{2t}\sigma }{Qp_{t+1}}\nonumber \\ & = & \frac{\left[ f_{1t}p_{t}^{2}+f_{2}p_{t}\left( 1-p_{t}\right) \right] \bar{A}_{B,t}-f_{2t}p_{t}\left( 1-p_{t}\right) \frac{1}{2}\left( \bar{A}_{B,t}-\bar{A}_{b,t}\right) +p_{t}^{2}\frac{1}{2}z_{1t}\sigma +p_{t}\left( 1-p_{t}\right) \frac{1}{2}z_{2t}\sigma }{Qp_{t+1}}\nonumber \\ & = & \frac{Qp_{t+1}\bar{A}_{B,t}-f_{2t}p_{t}\left( 1-p_{t}\right) \frac{1}{2}d_{t}+p_{t}^{2}\frac{1}{2}z_{1t}\sigma +p_{t}\left( 1-p_{t}\right) \frac{1}{2}z_{2t}\sigma }{Qp_{t+1}}\\ & = & \bar{A}_{B,t}-\frac{f_{2t}p_{t}\left( 1-p_{t}\right) }{2Qp_{t+1}}d_{t}+\frac{p_{t}^{2}z_{1t}+p_{t}\left( 1-p_{t}\right) z_{2t}}{2Qp_{t+1}}\sigma \end{eqnarray} \end_inset similarly, \layout Standard \begin_inset Formula \begin{eqnarray} \bar{A}_{b,t+1} & = & \frac{f_{2t}p_{t}\left( 1-p_{t}\right) \frac{1}{2}\left( \bar{A}_{B,t}+\bar{A}_{b,t}\right) +p_{t}\left( 1-p_{t}\right) \frac{1}{2}z_{2t}\sigma +f_{3t}\left( 1-p_{t}\right) ^{2}\bar{A}_{b,t}+\left( 1-p_{t}\right) ^{2}\frac{1}{2}z_{3t}\sigma }{Q-Qp_{t+1}}\nonumber \\ & = & \frac{f_{2t}p_{t}\left( 1-p_{t}\right) \frac{1}{2}\left( \bar{A}_{B,t}-\bar{A}_{b,t}\right) +\left[ f_{3t}\left( 1-p_{t}\right) ^{2}+f_{2t}p_{t}\left( 1-p_{t}\right) \right] \bar{A}_{b,t}}{Q\left( 1-p_{t+1}\right) }\nonumber \\ & & +\frac{p_{t}\left( 1-p_{t}\right) \frac{1}{2}z_{2t}+\left( 1-p_{t}\right) ^{2}\frac{1}{2}z_{3t}}{Q\left( 1-p_{t+1}\right) }\sigma \nonumber \\ & = & \frac{\left[ f_{3t}\left( 1-p_{t}\right) ^{2}+f_{2t}p_{t}\left( 1-p_{t}\right) \right] }{Q\left( 1-p_{t+1}\right) }\bar{A}_{b,t}+\frac{1}{2}\frac{f_{2t}p_{t}\left( 1-p_{t}\right) }{Q\left( 1-p_{t+1}\right) }d_{t}\nonumber \\ & & +\frac{1}{2}\frac{p_{t}\left( 1-p_{t}\right) z_{2t}+\left( 1-p_{t}\right) ^{2}z_{3t}}{Q\left( 1-p_{t+1}\right) }\sigma \nonumber \\ & = & \frac{\left[ Q-f_{1t}p_{t}^{2}-f_{2t}2p_{t}\left( 1-p_{t}\right) +f_{2t}p_{t}\left( 1-p_{t}\right) \right] }{Q\left( 1-p_{t+1}\right) }\bar{A}_{b,t}+\frac{1}{2}\frac{f_{2t}p_{t}\left( 1-p_{t}\right) }{Q\left( 1-p_{t+1}\right) }d_{t}\nonumber \\ & & +\frac{1}{2}\frac{p_{t}\left( 1-p_{t}\right) z_{2t}+\left( 1-p_{t}\right) ^{2}z_{3t}}{Q\left( 1-p_{t+1}\right) }\sigma \nonumber \\ & = & \bar{A}_{b,t}+\frac{1}{2}\frac{f_{2t}p_{t}\left( 1-p_{t}\right) }{Q\left( 1-p_{t+1}\right) }d_{t}+\frac{1}{2}\frac{p_{t}\left( 1-p_{t}\right) z_{2t}+\left( 1-p_{t}\right) ^{2}z_{3t}}{Q\left( 1-p_{t+1}\right) }\sigma \end{eqnarray} \end_inset yielding \begin_inset Formula \begin{eqnarray} d_{t+1} & \equiv & \bar{A}_{B,t+1}-\bar{A}_{b,t+1}\nonumber \\ & = & \bar{A}_{B,t}-\frac{1}{2Q}\frac{f_{2t}p_{t}\left( 1-p_{t}\right) }{p_{t+1}}d_{t}+\frac{1}{2Q}\frac{p_{t}^{2}z_{1t}+p_{t}\left( 1-p_{t}\right) z_{2t}}{p_{t+1}}\sigma \nonumber \\ & & -\bar{A}_{b,t}-\frac{1}{2Q}\frac{f_{2t}p_{t}\left( 1-p_{t}\right) }{\left( 1-p_{t+1}\right) }d_{t}-\frac{1}{2Q}\frac{p_{t}\left( 1-p_{t}\right) z_{2t}+\left( 1-p_{t}\right) ^{2}z_{3t}}{\left( 1-p_{t+1}\right) }\sigma \nonumber \\ & = & d_{t}-\frac{1}{2Q}\left[ \frac{f_{2t}p_{t}\left( 1-p_{t}\right) }{p_{t+1}}+\frac{f_{2t}p_{t}\left( 1-p_{t}\right) }{\left( 1-p_{t+1}\right) }\right] d_{t}\nonumber \\ & & +\frac{1}{2Q}\left[ \frac{p_{t}^{2}z_{1t}+p_{t}\left( 1-p_{t}\right) z_{2t}}{p_{t+1}}-\frac{p_{t}\left( 1-p_{t}\right) z_{2t}+\left( 1-p_{t}\right) ^{2}z_{3t}}{\left( 1-p_{t+1}\right) }\right] \sigma \nonumber \\ & = & d_{t}-\frac{1}{2Q}f_{2t}p_{t}\left( 1-p_{t}\right) \left[ \frac{1-p_{t+1}+p_{t+1}}{p_{t+1}\left( 1-p_{t+1}\right) }\right] d_{t}\nonumber \\ & & +\frac{1}{2Q}\left[ \frac{p_{t}^{2}}{p_{t+1}}z_{1t}+p_{t}\left( 1-p_{t}\right) \left[ \frac{1}{p_{t+1}}-\frac{1}{1-p_{t+1}}\right] z_{2t}-\frac{\left( 1-p_{t}\right) ^{2}}{\left( 1-p_{t+1}\right) }z_{3t}\right] \sigma \nonumber \\ & = & d_{t}-\frac{1}{2Q}f_{2t}\frac{p_{t}\left( 1-p_{t}\right) }{p_{t+1}\left( 1-p_{t+1}\right) }d_{t}\nonumber \\ & & +\frac{1}{2Q}\left[ \frac{p_{t}^{2}}{p_{t+1}}z_{1t}+p_{t}\left( 1-p_{t}\right) \left[ \frac{1}{p_{t+1}}-\frac{1}{1-p_{t+1}}\right] z_{2t}-\frac{\left( 1-p_{t}\right) ^{2}}{\left( 1-p_{t+1}\right) }z_{3t}\right] \sigma \label{diseqdrule} \end{eqnarray} \end_inset \layout Standard \begin_inset Formula \begin{equation} \label{diseqABrule} \bar{A}_{B,t+1}-\bar{A}_{B,t}\jd {-}\frac{f_{2t}p_{t}\left( 1-p_{t}\right) }{2Qp_{t+1}}\left( \bar{A}_{B,t}-\bar{A}_{b,t}\right) +\frac{f_{1t}p_{t}^{2}i_{1t}\sigma +f_{2t}p_{t}\left( 1-p_{t}\right) i_{2t}\sigma }{2Qp_{t+1}} \end{equation} \end_inset \layout Standard Before proceeding to \begin_inset Formula \( \bar{A}_{b,t+1} \) \end_inset , it is useful to define \begin_inset Formula \begin{equation} d_{t}\equiv \bar{A}_{B,t}-\bar{A}_{b,t} \end{equation} \end_inset and derive \begin_inset Formula \begin{eqnarray} 1-p_{t+1} & = & 1-\frac{1}{Q}\left\{ f_{1t}p_{t}^{2}+f_{2t}p_{t}\left( 1-p_{t}\right) \right\} \nonumber \\ & = & \frac{Q-f_{1t}p_{t}^{2}-f_{2t}p_{t}\left( 1-p_{t}\right) }{Q}\nonumber \\ & = & \frac{f_{1}p_{t}^{2}+f_{2t}2p_{t}\left( 1-p_{t}\right) +f_{3t}\left( 1-p_{t}^{2}\right) -f_{1t}p_{t}^{2}-f_{2t}p_{t}\left( 1-p_{t}\right) }{Q}\nonumber \\ & = & \frac{1}{Q}\left\{ f_{2t}p_{t}\left( 1-p_{t}\right) +f_{3t}\left( 1-p_{t}^{2}\right) \right\} \end{eqnarray} \end_inset \begin_inset Formula \begin{eqnarray} Q\left( 1-p_{t+1}\right) \bar{A}_{b,t+1} & = & f_{2t}p_{t}\left( 1-p_{t}\right) \frac{1}{2}\left( \bar{A}_{B,t}+\bar{A}_{b,t}+i_{2t}\sigma \right) \nonumber \\ & & +f_{3t}\left( 1-p_{t}\right) ^{2}\left( \bar{A}_{b,t}+\frac{1}{2}i_{3t}\sigma \right) \nonumber \\ & = & \left[ f_{2t}p_{t}\left( 1-p_{t}\right) +f_{3t}\left( 1-p_{t}^{2}\right) \right] \bar{A}_{b,t}+f_{2t}p_{t}\left( 1-p_{t}\right) \frac{1}{2}\left( \bar{A}_{B,t}-\bar{A}_{b,t}\right) \nonumber \\ & & +f_{2t}p_{t}\left( 1-p_{t}\right) \frac{1}{2}i_{3t}\sigma +f_{3t}\left( 1-p_{t}\right) ^{2}\frac{1}{2}i_{3t}\sigma \nonumber \\ & = & Q\left( 1-p_{t+1}\right) \bar{A}_{b,t}+f_{2t}p_{t}\left( 1-p_{t}\right) \frac{1}{2}\left( \bar{A}_{B,t}-\bar{A}_{b,t}\right) \nonumber \\ & & +f_{2t}p_{t}\left( 1-p_{t}\right) \frac{1}{2}i_{3t}\sigma +f_{3t}\left( 1-p_{t}\right) ^{2}\frac{1}{2}i_{3t}\sigma \end{eqnarray} \end_inset \layout Standard \begin_inset Formula \begin{equation} \label{diseqAbrule} \bar{A}_{b,t+1}-\bar{A}_{b,t}=\frac{f_{2t}p_{t}\left( 1-p_{t}\right) }{2Q\left( 1-p_{t+1}\right) }\left( \bar{A}_{B,t}-\bar{A}_{b,t}\right) +\frac{f_{2t}p_{t}\left( 1-p_{t}\right) i_{2t}\sigma +f_{3t}\left( 1-p_{t}\right) ^{2}i_{3t}\sigma }{2Q\left( 1-p_{t+1}\right) } \end{equation} \end_inset allowing the calculation, \begin_inset Formula \begin{eqnarray} \Delta d_{t+1} & = & \left( \bar{A}_{B,t+1}-\bar{A}_{b,t+1}\right) -\left( \bar{A}_{B,t}-\bar{A}_{b,t}\right) \nonumber \\ & = & \frac{f_{2t}p_{t}\left( 1-p_{t}\right) }{2Qp_{t+1}}\left( \bar{A}_{B,t}-\bar{A}_{b,t}\right) +\frac{f_{1t}p_{t}^{2}i_{1t}\sigma +f_{2t}p_{t}\left( 1-p_{t}\right) i_{2t}\sigma }{2Qp_{t+1}}\nonumber \\ & & -\frac{f_{2t}p_{t}\left( 1-p_{t}\right) }{2Q\left( 1-p_{t+1}\right) }d_{t}-\frac{f_{2t}p_{t}\left( 1-p_{t}\right) i_{2t}\sigma +f_{3t}\left( 1-p_{t}\right) ^{2}i_{3t}\sigma }{2Q\left( 1-p_{t+1}\right) }\nonumber \\ & = & \frac{f_{2}p_{t}\left( 1-p_{t}\right) }{2Qp_{t+1}\left( 1-p_{t+1}\right) }d_{t}\nonumber \\ & & +\frac{p_{t}^{2}\left( 1-p_{t+1}\right) z_{1t}+p_{t}\left( 1-p_{t}\right) z_{2t}+\left( 1-p_{t}\right) ^{2}p_{t+1}z_{3t}}{2Qp_{t+1}\left( 1-p_{t+1}\right) }\sigma \label{dp3dt1} \end{eqnarray} \end_inset \layout Standard Also note that [ \begin_inset LatexCommand \ref{diseqABrule} \end_inset ] and [ \begin_inset LatexCommand \ref{diseqABrule} \end_inset ] can be written as \layout Standard \begin_inset Formula \begin{eqnarray} \bar{A}_{B,T} & = & \bar{A}_{B,0}+\sum _{t=0}^{T-1}\frac{f_{2t}p_{t}\left( 1-p_{t}\right) }{2Qp_{t+1}}\left( \bar{A}_{B,t}-\bar{A}_{b,t}\right) +\frac{f_{1t}p_{t}^{2}i_{1t}\sigma +f_{2t}p_{t}\left( 1-p_{t}\right) i_{2t}\sigma }{2Qp_{t+1}}\nonumber \\ & = & \bar{A}_{B,0}+\sum ^{T-1}_{t=0}\Delta \bar{A}_{B,t+1}\label{diseqABassum} \end{eqnarray} \end_inset and \begin_inset Formula \begin{equation} \label{diseqAbassum} \bar{A}_{b,T}=\bar{A}_{b,0}+\sum _{t=0}^{T-1}\Delta \bar{A}_{b,t+1} \end{equation} \end_inset Which means that as in the prior case, changes \latex latex \backslash jd{ \latex default in the average polygenic value \latex latex } \latex default are permanent. However, as [ \begin_inset LatexCommand \ref{diseqrawobjfn} \end_inset ] depends upon \begin_inset Formula \( p_{t} \) \end_inset , the value of \begin_inset Formula \( \Delta \bar{A}_{B,t} \) \end_inset is not the same in all future \latex latex \backslash so{ \latex default equations \latex latex } \backslash jd{ \latex default generations \latex latex } \latex default , making calculations such as [**], which calculates the present value of a change, impossible. \begin_float margin \layout Standard write prior expresson above \end_float \layout Standard Instead, consider \begin_inset Formula \begin{eqnarray} \bar{A}_{t+1}-\bar{A}_{t} & = & 2p_{t+1}\bar{A}_{Bt+1}+2\left( 1-p_{t+1}\right) \bar{A}_{b,t+1}-2p_{t}\bar{A}_{B,t}-2\left( 1-p_{t}\right) \bar{A}_{b,t}\nonumber \\ & = & 2\bar{A}_{b,t+1}+2p_{t+1}\left( \bar{A}_{B,t+1}-\bar{A}_{b,t+1}\right) -2\bar{A}_{b,t}-2p_{t}\left( \bar{A}_{B,t}-\bar{A}_{b,t}\right) \nonumber \\ & = & 2\left( \bar{A}_{b,t+1}-\bar{A}_{b,t}\right) +2p_{t+1}d_{t+1}-2p_{t}d_{t}\nonumber \\ & = & 2\left( \frac{f_{2t}p_{t}\left( 1-p_{t}\right) }{2Q\left( 1-p_{t+1}\right) }d_{t}+\frac{f_{2t}p_{t}\left( 1-p_{t}\right) i_{2t}\sigma +f_{3t}\left( 1-p_{t}\right) ^{2}i_{3t}\sigma }{2Q\left( 1-p_{t+1}\right) }\right) \nonumber \\ & & +2p_{t+1}d_{t+1}-2p_{t}d_{t}\nonumber \\ & = & \frac{f_{2t}p_{t}\left( 1-p_{t}\right) \left[ d_{t}+i_{2t}\sigma \right] +f_{3t}\left( 1-p_{t}\right) ^{2}i_{3t}\sigma }{Q\left( 1-p_{t+1}\right) }+2p_{t+1}d_{t+1}-2p_{t}d_{t}\nonumber \\ & = & \frac{p_{t}\left( 1-p_{t}\right) \left[ f_{2t}d_{t}+z_{2t}\sigma \right] +\left( 1-p_{t}\right) ^{2}z_{3t}\sigma }{Q\left( 1-p_{t+1}\right) }+2p_{t+1}d_{t+1}-2p_{t}d_{t}\label{diseqArule} \end{eqnarray} \end_inset which \latex latex \backslash so{ \latex default , while not pretty, \latex latex } \latex default is written entirely in terms of the state variables \begin_inset Formula \( p_{t} \) \end_inset and \begin_inset Formula \( d_{t} \) \end_inset , and fractions which are functions of these choice variables. Thus [ \begin_inset LatexCommand \ref{diseqArule} \end_inset ] can be used to write the present value of \latex latex \backslash so{ \latex default a change \latex latex } \latex default \latex latex \backslash jd{ \latex default the change in state \latex latex } \latex default as \begin_inset Formula \begin{equation} pv=\frac{2\left( 1-r\right) }{r}\left[ \bar{A}_{t+1}-\bar{A}_{t}\right] \end{equation} \end_inset \layout Subsection The State \layout Standard The state at any time will be described by the pair \begin_inset Formula \( \left( p_{t},d_{t}\right) \) \end_inset , where \begin_inset Formula \( d_{t} \) \end_inset is the disequilibrium at \latex latex \backslash so{ \latex default the \latex latex } \latex default time \latex latex \backslash jd{ \latex default \begin_inset Formula \( t \) \end_inset \latex latex } \latex default , \begin_inset Formula \begin{equation} d_{t}=\bar{A}_{B,t}-\bar{A}_{b,t} \end{equation} \end_inset Rather than calculating \begin_inset Formula \( f_{1t} \) \end_inset to maximize the polygenic gain, the successor states \latex latex \backslash jd{ \latex default \begin_inset Formula \( p_{t+1} \) \end_inset and \begin_inset Formula \( d_{t+1} \) \end_inset \latex latex } \latex default are chosen, and the choice made fully dictates \begin_inset Formula \( f_{t} \) \end_inset . Equations [ \begin_inset LatexCommand \ref{dp2prule} \end_inset ], [ \begin_inset LatexCommand \ref{dp3dt1} \end_inset ], and [ \begin_inset LatexCommand \ref{dp2qcons} \end_inset ] constitute a system of three equations in the three unknowns \begin_inset Formula \( f_{t} \) \end_inset , which can be solved by a library routine. As a practical matter, [ \begin_inset LatexCommand \ref{dp2qcons} \end_inset ] is used to eliminate \begin_inset Formula \( f_{3t} \) \end_inset from the system to reduce computation. \layout Subsection Mass selection With Gametic Phase Disequilibrium \layout Standard \latex latex \backslash so{ \latex default While \latex latex } \backslash jd{ \latex default When \latex latex } \latex default igoring disequilibrium, all genotypes had the same polygenic distribution, and mass selection truncation points \begin_inset Formula \( x_{t} \) \end_inset were calculated by setting \begin_inset Formula \begin{equation} x_{1t}-\frac{a}{\sigma }=x_{2t}=x_{3t}+\frac{a}{\sigma } \end{equation} \end_inset This is no longer the case under disequilibrium; the new rule is \begin_inset Formula \begin{equation} x_{1t}+\left( 2\bar{A}_{B,t}-\frac{a}{\sigma }\right) =x_{2t}+\left( \bar{A}_{B,t}+\bar{A}_{b,t}\right) =x_{3t}+2\bar{A}_{b,t}+\frac{a}{\sigma } \end{equation} \end_inset which can be rewritten as \begin_inset Formula \begin{equation} x_{1t}+\left( 2\bar{A}_{b,t}+2d_{t}-\frac{a}{\sigma }\right) =x_{2t}+\left( 2\bar{A}_{b,t}+d_{t}\right) =x_{3t}+\left( 2\bar{A}_{b,t}+\frac{a}{\sigma }\right) \end{equation} \end_inset or \begin_inset Formula \begin{equation} x_{1t}+\left( d_{t}-\frac{a}{\sigma }\right) =x_{2t}=x_{3t}-\left( d_{t}-\frac{a}{\sigma }\right) \end{equation} \end_inset \layout Subsection Translation from state to choice variables \layout Standard Given that the problem is to be solved with state variables rather than choice variables, it is necessary to have a mechanism allowing translation from state transitions to the corresponding choice variables, both to report the results and to check for violations of possible states. \layout Standard Given that \begin_inset Formula \( f_{1t} \) \end_inset has already been eliminated from [ \begin_inset LatexCommand \ref{diseqArule} \end_inset ], it is useful to solve [ \begin_inset LatexCommand \ref{dp2prule} \end_inset ] for \begin_inset Formula \( f_{1t} \) \end_inset , \begin_inset Formula \begin{equation} f_{1t}=\frac{Qp_{t+1}-f_{2t}p_{t}\left( 1-p_{t}\right) }{p_{t}^{2}} \end{equation} \end_inset which can be used to solve [ \begin_inset LatexCommand \ref{dp2qcons} \end_inset ] for \begin_inset Formula \( f_{3t} \) \end_inset , \begin_inset Formula \begin{eqnarray} f_{3t} & = & \frac{Q-\frac{Qp_{t+1}-f_{2t}p_{t}\left( 1-p_{t}\right) }{p_{t}^{2}}p_{t}^{2}-2p_{t}\left( 1-p_{t}\right) f_{2t}}{\left( 1-p_{t}\right) ^{2}}\nonumber \\ & = & \frac{Q-Qp_{t+1}+f_{2t}p_{t}\left( 1-p_{t}\right) -2p_{t}\left( 1-p_{t}\right) f_{2t}}{\left( 1-p_{t}\right) ^{2}}\nonumber \\ & = & \frac{Q\left( 1-p_{t+1}\right) -p_{t}\left( 1-p_{t}\right) f_{2t}}{\left( 1-p_{t}\right) ^{2}} \end{eqnarray} \end_inset which is in turn csubstituted into [ \begin_inset LatexCommand \ref{diseqArule} \end_inset ] \begin_inset Formula \begin{eqnarray} 0 & = & f_{2}p_{t}\left( 1-p_{t}\right) \frac{d_{t}}{\sigma }\nonumber \\ & & +p_{t}^{2}\left( 1-p_{t+1}\right) z\left( \frac{Qp_{t+1}-f_{2t}p_{t}\left( 1-p_{t}\right) }{p_{t}^{2}}\right) \nonumber \\ & & +p_{t}\left( 1-p_{t}\right) z\left( f_{2t}\right) \nonumber \\ & & +\left( 1-p_{t}\right) ^{2}p_{t+1}z\left( \frac{Q\left( 1-p_{t+1}\right) -p_{t}\left( 1-p_{t}\right) f_{2t}}{\left( 1-p_{t}\right) ^{2}}\right) \nonumber \\ & & -\frac{\left( d_{t+1}-d_{t}\right) }{\sigma }\left( 2Qp_{t+1}\left( 1-p_{t+1}\right) \right) \end{eqnarray} \end_inset which can be solved by a fast numeric routine \layout Subsection Translation from state to choice variables (new) \layout Standard While working with the state variables \begin_inset Formula \( d \) \end_inset and \begin_inset Formula \( p \) \end_inset is sufficient to describe the \emph on behavior \emph default of the system, it is not enough to quantitatively \emph on evaluate \emph default a path: the original \begin_inset Formula \( f_{t} \) \end_inset , \begin_inset Formula \( x_{t} \) \end_inset and \begin_inset Formula \( z_{t} \) \end_inset are needed in equations such as [ \begin_inset LatexCommand \ref{diseqArule} \end_inset ] to determine the value of the proposed state to the breeder. Equations [ \begin_inset LatexCommand \ref{diseqArule} \end_inset ], [ \begin_inset LatexCommand \ref{diseqdrule} \end_inset ], and [ \begin_inset LatexCommand \ref{dp2qcons} \end_inset ] are three equations in the three choice variables, determining them exactly. While it is not possible to write an explicit, closed form solution, it is possible to reduce the equaitons to a single variable in a single unknown, and then solve with a fast numeric routine. \layout Standard As \begin_inset Formula \( f_{1t} \) \end_inset has already been eliminated from [ \begin_inset LatexCommand \ref{diseqArule} \end_inset ], and \begin_inset Formula \( f_{3t} \) \end_inset has generally been treated as an artifact of the choice of \begin_inset Formula \( f_{1t} \) \end_inset and \begin_inset Formula \( f_{2t} \) \end_inset , \begin_inset Formula \( f_{2t} \) \end_inset seems to be the natural choice. Manipulating [ \begin_inset LatexCommand \ref{dp2prule} \end_inset ] yields \begin_inset Formula \begin{equation} f_{1t}=\frac{Qp_{t+1}-f_{2t}p_{t}\left( 1-p_{t}\right) }{p_{t}^{2}} \end{equation} \end_inset which can in turn be used in [ \begin_inset LatexCommand \ref{dp2qcons} \end_inset ] to get \begin_inset Formula \begin{eqnarray} f_{3t} & = & \frac{Q-\frac{Qp_{t+1}-f_{2t}p_{t}\left( 1-p_{t}\right) }{p_{t}^{2}}p_{t}^{2}-2p_{t}\left( 1-p_{t}\right) f_{2t}}{\left( 1-p_{t}\right) ^{2}}\nonumber \\ & = & \frac{Q-Qp_{t+1}+f_{2t}p_{t}\left( 1-p_{t}\right) -2p_{t}\left( 1-p_{t}\right) f_{2t}}{\left( 1-p_{t}\right) ^{2}}\nonumber \\ & = & \frac{Q\left( 1-p_{t+1}\right) -p_{t}\left( 1-p_{t}\right) f_{2t}}{\left( 1-p_{t}\right) ^{2}} \end{eqnarray} \end_inset both of which are placed into [ \begin_inset LatexCommand \ref{diseqdrule} \end_inset ], [ \begin_inset LatexCommand \ref{diseqdrule} \end_inset ], \begin_inset Formula \begin{eqnarray} 0 & = & d_{t}-d_{t+1}-\frac{1}{2Q}f_{2t}\frac{p_{t}\left( 1-p_{t}\right) }{p_{t+1}\left( 1-p_{t+1}\right) }d_{t}\nonumber \\ & & +\frac{1}{2Q}\left[ \frac{p_{t}^{2}}{p_{t+1}}z\left( \frac{Qp_{t+1}-f_{2t}p_{t}\left( 1-p_{t}\right) }{p_{t}^{2}}\right) +p_{t}\left( 1-p_{t}\right) \left[ \frac{1}{p_{t+1}}-\frac{1}{1-p_{t+1}}\right] z\left( f_{2t}\right) \right. \nonumber \\ & & \left. -\frac{\left( 1-p_{t}\right) ^{2}}{\left( 1-p_{t+1}\right) }z\left( \frac{Q\left( 1-p_{t+1}\right) -p_{t}\left( 1-p_{t}\right) f_{2t}}{\left( 1-p_{t}\right) ^{2}}\right) \right] \sigma \end{eqnarray} \end_inset which has a derivative of \begin_inset Formula \begin{eqnarray*} der_{f_{2}} & = & -\frac{1}{2Q}\frac{p_{t}\left( 1-p_{t}\right) }{p_{t+1}\left( 1-p_{t+1}\right) }d_{t}\\ & & +\frac{1}{2Q}\left[ \frac{p_{t}^{2}}{p_{t+1}}x\left( \frac{Qp_{t+1}-f_{2t}p_{t}\left( 1-p_{t}\right) }{p_{t}^{2}}\right) \frac{-\left( 1-p_{t}\right) }{p_{t}}\right. \\ & & +p_{t}\left( 1-p_{t}\right) \left[ \frac{1}{p_{t+1}}-\frac{1}{1-p_{t+1}}\right] x\left( f_{2t}\right) \\ & & \left. -\frac{\left( 1-p_{t}\right) ^{2}}{\left( 1-p_{t+1}\right) }x\left( \frac{Q\left( 1-p_{t+1}\right) -p_{t}\left( 1-p_{t}\right) f_{2t}}{\left( 1-p_{t}\right) ^{2}}\right) \frac{-p_{t}}{1-p_{t}}\right] \sigma \\ & = & -\frac{\sigma }{2Q}\frac{p_{t}\left( 1-p_{t}\right) }{p_{t+1}\left( 1-p_{t+1}\right) }\left[ \frac{d_{t}}{\sigma }+\left( 1-p_{t+1}\right) x_{1}-\left( 1-2p_{t+1}\right) x\left( f_{2t}\right) -p_{t+1}x_{3}\right] \end{eqnarray*} \end_inset \layout Standard \begin_inset Formula \begin{eqnarray} & = & \frac{1}{2Q}\frac{p_{t}\left( 1-p_{t}\right) }{p_{t+1}\left( 1-p_{t+1}\right) }d_{t}\nonumber \\ & & -\frac{1}{2Q}\left[ \frac{p_{t}^{2}}{p_{t+1}}x\left( \frac{Qp_{t+1}-f_{2t}p_{t}\left( 1-p_{t}\right) }{p_{t}^{2}}\right) \frac{1-p_{t}}{p_{t}}\right. \nonumber \\ & & +p_{t}\left( 1-p_{t}\right) \left[ \frac{1}{p_{t+1}}-\frac{1}{1-p_{t+1}}\right] x\left( f_{2t}\right) \nonumber \\ & & \left. -\frac{\left( 1-p_{t}\right) ^{2}}{\left( 1-p_{t+1}\right) }x\left( \frac{Q\left( 1-p_{t+1}\right) -p_{t}\left( 1-p_{t}\right) f_{2t}}{\left( 1-p_{t}\right) ^{2}}\right) \frac{p_{t}}{1-p_{t}}\right] \sigma \nonumber \\ & = & \frac{1}{2Q}\frac{p_{t}\left( 1-p_{t}\right) }{p_{t+1}\left( 1-p_{t+1}\right) }d_{t}\nonumber \\ & & -\frac{1}{2Q}\left[ \frac{p_{t}\left( 1-p_{t}\right) }{p_{t+1}}x\left( \frac{Qp_{t+1}-f_{2t}p_{t}\left( 1-p_{t}\right) }{p_{t}^{2}}\right) \right. \nonumber \\ & & +p_{t}\left( 1-p_{t}\right) \left[ \frac{1}{p_{t+1}}-\frac{1}{1-p_{t+1}}\right] x\left( f_{2t}\right) \nonumber \\ & & \left. -\frac{p_{t}\left( 1-p_{t}\right) }{\left( 1-p_{t+1}\right) }x\left( \frac{Q\left( 1-p_{t+1}\right) -p_{t}\left( 1-p_{t}\right) f_{2t}}{\left( 1-p_{t}\right) ^{2}}\right) \right] \sigma \nonumber \\ & = & \frac{p_{t}\left( 1-p_{t}\right) }{2Q}\sigma \left[ \frac{\frac{d_{t}}{\sigma }}{p_{t+1}\left( 1-p_{t+1}\right) }-\frac{x\left( \frac{Qp_{t+1}-f_{2t}p_{t}\left( 1-p_{t}\right) }{p_{t}^{2}}\right) }{p_{t+1}}\right. \nonumber \\ & & \left. -\left[ \frac{1}{p_{t+1}}-\frac{1}{1-p_{t+1}}\right] x\left( f_{2t}\right) +\frac{x\left( \frac{Q\left( 1-p_{t+1}\right) -p_{t}\left( 1-p_{t}\right) f_{2t}}{\left( 1-p_{t}\right) ^{2}}\right) }{1-p_{t+1}}\right] \nonumber \\ & = & \frac{p_{t}\left( 1-p_{t}\right) }{p_{t+1}\left( 1-p_{t+1}\right) }\frac{\sigma }{2Q}\left[ \frac{d_{t}}{\sigma }-\left( 1-p_{t+1}\right) x\left( \frac{Qp_{t+1}-f_{2t}p_{t}\left( 1-p_{t}\right) }{p_{t}^{2}}\right) \right. \nonumber \\ & & \left. -x\left( f_{2t}\right) +p_{t+1}x\left( \frac{Q\left( 1-p_{t+1}\right) -p_{t}\left( 1-p_{t}\right) f_{2t}}{\left( 1-p_{t}\right) ^{2}}\right) \right] \end{eqnarray} \end_inset \layout Subsection Is \begin_inset Formula \( \Delta \bar{A}_{t+1} \) \end_inset monotone in \begin_inset Formula \( d_{t+1} \) \end_inset ? \layout Standard Differentiating [ \begin_inset LatexCommand \ref{diseqArule} \end_inset ], \begin_inset Formula \begin{eqnarray*} \frac{\partial \Delta \bar{A}_{t+1}}{\partial d_{t+1}} & = & \frac{p_{t}\left( 1-p_{t}\right) d_{t}}{Q\left( 1-p_{t+1}\right) }\frac{\partial f_{2t}}{\partial d_{t+1}}+\frac{\left( 1-p_{t}\right) ^{2}\sigma }{Q\left( 1-p_{t+1}\right) }\frac{\partial z_{3t}}{\partial d_{t+1}}+2p_{t+1}\\ & = & \frac{p_{t}\left( 1-p_{t}\right) d_{t}}{Q\left( 1-p_{t+1}\right) }\frac{1}{\frac{\partial d_{t+1}}{\partial f_{2t}}}+\frac{\left( 1-p_{t}\right) ^{2}\sigma }{Q\left( 1-p_{t+1}\right) }\frac{\partial z_{3t}}{\partial f_{3t}}\frac{\partial f_{3t}}{\partial f_{2t}}\frac{1}{\frac{\partial d_{t+1}}{\partial f_{2t}}}+2p_{t+1}\\ & = & \frac{p_{t}\left( 1-p_{t}\right) d_{t}}{Q\left( 1-p_{t+1}\right) }\frac{1}{\frac{\partial d_{t+1}}{\partial f_{2t}}}+\frac{\left( 1-p_{t}\right) ^{2}\sigma }{Q\left( 1-p_{t+1}\right) }\frac{\partial z_{3t}}{\partial f_{3t}}\frac{\partial f_{3t}}{\partial f_{2t}}\frac{1}{\frac{\partial d_{t+1}}{\partial f_{2t}}}+2p_{t+1} \end{eqnarray*} \end_inset \layout Standard \latex latex \backslash pagebreak \layout Subsection N-dimensional search rules \layout Standard An algorithm that can solve \begin_inset Formula \( n \) \end_inset dimensions rather than a fixed number is more valuable and easily maintained than a set of algorithms for assorted dimensions. Furthermore, it is easier to write than an algorithm for any fixed \begin_inset Formula \( n \) \end_inset larger than two. Accordingly, the focus will be on the algorithm of arbitrary dimension. \layout Standard To solve [ \begin_inset LatexCommand \pageref{dp2finalobj} \end_inset ], steps were made in one direction or the other until a gain was no longer found. This process may be used iteratively to solve multiple dimensions. \family typewriter bestval() \family default is initially called with no location information, causing it to start at the first variable. It evaluates this variable as before, but rather than directly calling the next generation, it calls the same generation, but with the current iteration value. \layout Standard When \family typewriter bestVal() \family default receives an array of positions of rank \begin_inset Formula \( n \) \end_inset , it queries the next generation. \layout Section Four Dimensions: Differential Selection by Gender \layout Section Unknown Infinite Horizon \layout Section Variable variance \layout Standard The actual variance is binomial, for large enough \begin_inset Formula \( n \) \end_inset that it is adequately approximated by the normal. \layout Standard Redefine model. \begin_inset Formula \( n \) \end_inset is the number of minor gene sites, and \begin_inset Formula \( q \) \end_inset is the minor gene frequency. Letting \begin_inset Formula \( \pi \) \end_inset and \begin_inset Formula \( \theta \) \end_inset be the draw for each, \begin_inset Formula \begin{eqnarray*} \pi & \in & \left\{ 0,1,2\right\} \\ \theta & \in & \left\{ 0,1,\ldots 2n\right\} \end{eqnarray*} \end_inset and the value of an animal is \begin_inset Formula \[ \pi a+\theta b-\left( a+nq_{0}b\right) \] \end_inset \layout Chapter \latex latex \backslash sout{moved to appendix} \layout Chapter Results \begin_inset LatexCommand \label{Results} \end_inset \layout Section Simple Finite Generations \layout Subsubsection Simple Newton Raphson \begin_inset LatexCommand \label{NR/SFG} \end_inset \layout Section Discounted Finite Generations \layout Section Infinite Horizon \layout Chapter Conclusions \layout Standard \latex latex \backslash appendix \layout Chapter Glossary \layout Section Terms \layout Subsubsection* Breeding Value \layout Standard The average effect of all genes that a parent passes on to offspring.. \layout Subsubsection* Diploid \layout Standard Having two chromosomes. All animals, and many plants, are diploid. \layout Subsubsection* Genetic Phase Disequilibrium \layout Standard When different truncation points are used for the genotypes, the result is a different truncation point for the polygenes in each group. In the next generation, the polygenes will have different means and variances in the different group. This creates a negative correlation between the major and polygenes known as Gametic Phase Disequilibrium. \layout Subsubsection* Genotype \layout Standard Classification by the presence of the gene. E.g., aa, aA, and AA. \layout Subsubsection* Genotype \layout Standard The actual genetic status of the organism for a given locus. For example, \begin_inset Formula \( Bb \) \end_inset . \layout Subsubsection* Heterozygote \layout Standard In a diploid organism, a heterozygote for a given locus has a two different alleles at the locus in question. \layout Subsubsection* Heritability \layout Standard The fraction of phenotypic variation in a trait that is due to genetics. \layout Subsubsection* Homozygote \layout Standard An organism with two of the same allele at the locus in question. \layout Subsubsection* Locus \layout Standard A point on a chromosome where a gene is located. \layout Subsubsection* Major Gene \layout Standard A gene with a large effect. It is assumed that the major gene can be identified, by QTL or other methods. \layout Subsubsection* Mass Selection \layout Standard Also phenotypic selection. Organisms are selected to reproduce based solely upon their own phenotypic value for the trait. \layout Subsubsection* Phenotype \layout Standard The observed trait. For example, the weight of an animal. \layout Subsubsection* Polygenes \layout Standard Polygenes cannot be identified, but are seen only by their combined effect on phenotype. It is assumed that each of the polygenes has a small effect compared to the whole and to the major gene. \layout Subsubsection* Qualitative trait \layout Standard A trait that takes on only one of a finite set of possibilities. \layout Subsubsection* Quantitative Trait \layout Standard A quantitative trait is one which takes a quantitative rather than qualitative value. Mendel's peas were qualitatively either wrinkled or not; they were one of the two types. A quantitative trait would instead be a measure of the height of the peas: a value in the range of three to six inches, for example, with the intermediate values being possible. \layout Subsubsection* QTL \layout Standard Quantitative trait locus, the locus that controls or affects a quantitative trait. \layout Subsubsection* Genotypic Selection \layout Standard Genotypic selection considers both the phenotypic and genotypic values. a value of \begin_inset Formula \( I=g+h^{2}(P-g) \) \end_inset is used, where \begin_inset Formula \( P \) \end_inset is the phenotypic value, \begin_inset Formula \( g \) \end_inset the genotypic value, and \begin_inset Formula \( h^{2} \) \end_inset the heritability. \layout Subsubsection* Truncation Point. \layout Standard A cutoff point on the selection criterion for selection. All creatures above this point breed, and those below do not. \layout Section Variable Names and Definitions \layout Standard The following variables have the following meaning unless otherwise specified:d \layout Standard \added_space_top 0.3cm \added_space_bottom 0.3cm \align center \LyXTable multicol5 12 2 0 0 -1 -1 -1 -1 1 1 0 0 1 0 0 0 1 0 0 0 1 0 0 0 1 0 0 0 1 0 0 0 1 0 0 0 1 0 0 0 1 0 0 0 1 0 0 0 1 0 0 0 1 1 0 0 8 1 0 "" "" 8 1 1 "" "" 0 8 1 0 0 0 0 "" "" 0 8 1 0 0 0 0 "" "" 0 8 1 0 0 0 0 "" "" 0 8 1 0 0 0 0 "" "" 0 8 1 0 0 0 0 "" "" 0 8 1 0 0 0 0 "" "" 0 8 1 0 0 0 0 "" "" 0 8 1 0 0 0 0 "" "" 0 8 1 0 0 0 0 "" "" 0 8 1 0 0 0 0 "" "" 0 8 1 0 0 0 0 "" "" 0 8 1 0 0 0 0 "" "" 0 8 1 0 0 0 0 "" "" 0 8 1 0 0 0 0 "" "" 0 8 1 0 0 0 0 "" "" 0 8 1 0 0 0 0 "" "" 0 8 1 0 0 0 0 "" "" 0 8 1 0 0 0 0 "" "" 0 8 1 0 0 0 0 "" "" 0 8 1 0 0 0 0 "" "" 0 8 1 0 0 0 0 "" "" 0 8 1 0 0 0 0 "" "" 0 8 1 0 0 0 0 "" "" 0 8 1 0 0 0 0 "" "" Variable \newline Meaning \newline \begin_inset Formula \( g \) \end_inset \newline genotypic value for the identified major gene \newline \begin_inset Formula \( h^{2} \) \end_inset \newline heritability of the trait \newline \begin_inset Formula \( P \) \end_inset \newline the phenotypic value \newline \begin_inset Formula \( m \) \end_inset \newline an indicator for genotype \newline \begin_inset Formula \( A \) \end_inset \newline polygenic breeding value \newline \begin_inset Formula \( \hat{A} \) \end_inset \newline estimated polygenic breeding value \newline \begin_inset Formula \( b_{mt} \) \end_inset \newline weight used for genotype \begin_inset Formula \( m \) \end_inset selected in generation \begin_inset Formula \( t \) \end_inset \newline \begin_inset Formula \( a \) \end_inset \newline the value of each allele of the major gene \newline \begin_inset Formula \( p_{t} \) \end_inset \newline the frequency of the major gene in generation \begin_inset Formula \( t \) \end_inset \newline \begin_inset Formula \( q_{mt} \) \end_inset \newline the fraction of the population in generation \begin_inset Formula \( t \) \end_inset of type \begin_inset Formula \( m \) \end_inset \newline \begin_inset Formula \( x_{mt} \) \end_inset \newline the truncation point for genotype \begin_inset Formula \( m \) \end_inset in generation \begin_inset Formula \( t \) \end_inset \layout Section The Assumptions \layout Standard This work is done primarily in the limiting case of a very large population, with a vary large number of genes. It is assumed that the breeding value can be written as an additive function of the major genes and of the sum of the interactions of the polygenes, and the statistical distribution of the polygenes is known. For the baseline case, there is a single major gene, and each copy of the gene adds directly to a normally distributed value for the polygenes. A heterozygote receives a value of \begin_inset Formula \( 0 \) \end_inset , while the homozygotes receive \begin_inset Formula \( \pm a \) \end_inset . The variance of the polygenic breeding value is known, the distribution is normal, and it is initially normalized at 0, and equal fractions of males and females are chosen. \begin_float footnote \layout Standard If this were not done, the solution would be to take few enough males to impregnate all the females, and the effects of inbreeding would have to be considered. \end_float \layout Standard The decrease in variance of the polygenic breeding value due to selection and phase disequilibrium are disregarded in the simplest case. Also, due to random mating of those chosen to reproduce, the major gene will always be in equilibrium: all genotypes have the same \begin_inset Formula \( p_{t} \) \end_inset , and so an organism has the same chance of getting the major gene from a parent regardless of the genotype to which the parent belonged. \layout Chapter dp2 program \layout Bibliography \bibitem [Dekkers 98]{Dekkers} Dekkers paper. \layout Bibliography \bibitem [Varian]{varian} Varian \layout Bibliography \bibitem [Vertical Coordination]{Vertical Coordination} Vertical Coordination and Consumer Welfare: The Case of the Pork Industry, USDA AER #753, August, 1997 \layout Bibliography \bibitem [Bulmer]{Bulmer} Bulmer, The Mathematical Theory of Quantitative Genetics, Clarendon Press, 1980 \layout Bibliography \bibitem [Rothschild]{Rothschild} Rothschild,*** \layout Bibliography \bibitem [Gibsen]{Gibsen} Gibsen, **, 1994 \layout Bibliography \bibitem [IQG]{iqg} Falconer & Mackay,Introduction to Quantitative Genetics, Fourth Edition,Longman, 1966. \layout Standard \bibitem {4} \begin_inset LatexCommand \printindex{} \end_inset \the_end