Experimental tests of the quantitative model. To test the model of the previous section, we performed several experiments to genotype DNA for the presence of a homozygous or heterozygous SNP, 187C>G, which is found in the hemochromatosis gene (HFE). This mutation we analyzed 5'-TCA-3' -> 5'-TGA-3' 3'-AGT-5' 3'-ACT-5' has the nearest-neighbor symmetry described above which results in a melting curve for the homozygous case which is theoretically identical to and experimentally indistinguishable from that of wild-type DNA. (See figure ? for the complete sequence with SNP and primers highlighted. H63D_sequence_031010.doc) We chose to examine a range of 21 different ratios of wild-type spike to spike plus unknown, from 1/28 to 14/28 by increments of 1/28, and from 15/28 to 27/28 in steps of 2/28. This allowed us to include the theoretically optimal value, 1/7, and observe the behavior of the process in some detail over a wide range of interest for pooled samples as well. The ratio j/28 of spike to spike plus unknown corresponds to the ratio j/(28-j) of spike to unknown, so for instance, the optimal value of 4/28 is the same as 4/24=1/6 spike to unknown. We spiked three replicates of each of the three genotypes (denoted WT, MUT, and HET) before PCR with each fraction of additional wild-type DNA. Samples with a common spike fraction as well as two control samples containing unspiked heterozygous DNA were amplified in the presence of a high-resolution fluorescent dye and analyzed simultaneously. The PCR protocols may be found in appendix ? (Michael) Following amplification, an additional melting was performed to denature the perfectly complementary post-extension duplexes after which the temperature was rapidly decreased to re-anneal strands independent of the presence or absence of a single mismatched base-pair. We then performed high-resolution melting analysis on all of the resulting samples to produce actual fluorescence vs. temperature melting curves corresponding to the model of the previous section. This is a closed-tube process which avoids risk of contamination and leaves the sample undisturbed for further types of analysis. It provides a fast, economical, and accurate method of genotyping and mutation scanning which has been described and studied in a variety of contexts ([ ],[ ],...,[ ]) (Carl) After high-resolution melting analysis, we performed temperature-gradient capillary electrophoresis (TGCE) on each sample. In this technique, we detect the arrival of duplexes in a sample after they are drawn through a gel. Each species of duplex has a characteristic arrival frequency distribution depending on its spatial conformation. In particular, the center of a heteroduplex arrival frequency peak is significantly delayed due to the "bubble" formed by mismatched base pairs. While the two species of homoduplex have arrival frequency peaks which superpose indistinguishably, the two heteroduplex peaks are easily separated from the homoduplex peak and from each other. The peaks exhibit simple mathematical behavior which makes it possible to separate and quantify the relative contributions of the heteroduplexes. This provides an independent and direct validation of our theoretical model of melting curve separation, which was based upon relative concentration of heteroduplexes in the samples. The mathematical analysis of the data from these two methods is discussed in the next section. -------------------------------- Analysis of experimental data and comparison with the theory. (Outline) -------------------------------- Analysis of melting curve data: Background removal (line or new method - not much difference, so should we stick with line method or describe new method?) Temperature shift uses features of the background removed curves themselves to compensate for small variations in temperature control - reported vs. actual temperature. Difference plots highlight relative variation between genotypes According to the theory, location of maximum difference is constant, and magnitude of maximum difference, and area under difference are directly proportional to the heteroduplex concentration of the samples. Figure ? and Table ? in the appendix show the calculated values of the maximum difference and area between the average of three replicate spiked wild-type melting curves and the individual spiked homozygous and heterozygous SNP melting curves as a function of the proportion of the spike in the total mixture. The values are normalized to make the value of an unspiked heterozygote equal to .5, to correspond to the concentration of heteroduplexes in the theoretical model, which is superimposed on the figures. (The differences between the homozygous and heterozygous replicates is implicit, and can be obtained by taking the difference of their individual differences with the mean wild-type curve, as in the theoretical analysis.) The location of the maximum differences is reported in Table ? --------------------------------- Analysis of TGCE data Individual TGCE arrival frequency peaks may be approximated by exponential distributions of the form F(t)=Ae^{-kt}, t >=t_0; F(t)=0, t