| 1 | <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 3.2 Final//EN"> |
|---|
| 2 | <HTML> |
|---|
| 3 | <HEAD> |
|---|
| 4 | <TITLE>dnamove</TITLE> |
|---|
| 5 | <META NAME="description" CONTENT="dnamove"> |
|---|
| 6 | <META NAME="keywords" CONTENT="dnamove"> |
|---|
| 7 | <META NAME="resource-type" CONTENT="document"> |
|---|
| 8 | <META NAME="distribution" CONTENT="global"> |
|---|
| 9 | <META HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=iso-8859-1"> |
|---|
| 10 | </HEAD> |
|---|
| 11 | <BODY BGCOLOR="#ccffff"> |
|---|
| 12 | <DIV ALIGN=RIGHT> |
|---|
| 13 | version 3.6 |
|---|
| 14 | </DIV> |
|---|
| 15 | <P> |
|---|
| 16 | <DIV ALIGN=CENTER> |
|---|
| 17 | <H1>DNAMOVE - Interactive DNA parsimony</H1> |
|---|
| 18 | </DIV> |
|---|
| 19 | <P> |
|---|
| 20 | © Copyright 1986-2002 by the University of |
|---|
| 21 | Washington. Written by Joseph Felsenstein. Permission is granted to copy |
|---|
| 22 | this document provided that no fee is charged for it and that this copyright |
|---|
| 23 | notice is not removed. |
|---|
| 24 | <P> |
|---|
| 25 | DNAMOVE is an interactive DNA parsimony program, inspired by Wayne Maddison and |
|---|
| 26 | David and Wayne Maddison's marvellous program MacClade, which is written for |
|---|
| 27 | Macintosh computers. DNAMOVE reads in a data set which is prepared in almost |
|---|
| 28 | the same format as one for the DNA parsimony program DNAPARS. It allows |
|---|
| 29 | the user to choose an initial tree, and displays this tree on the screen. The |
|---|
| 30 | user can look at different sites and the way the nucleotide states are |
|---|
| 31 | distributed on that tree, given the most parsimonious reconstruction of state |
|---|
| 32 | changes for that particular tree. The user then can specify how the tree is to |
|---|
| 33 | be rearraranged, rerooted or written out to a file. By looking at different |
|---|
| 34 | rearrangements of the tree the user can manually search for the most |
|---|
| 35 | parsimonious tree, and can get a feel for how different sites are affected |
|---|
| 36 | by changes in the tree topology. |
|---|
| 37 | <P> |
|---|
| 38 | This program uses graphic characters that show the tree to best |
|---|
| 39 | advantage on some computer systems. |
|---|
| 40 | Its graphic characters will work best on MSDOS systems or MSDOS windows in |
|---|
| 41 | Windows, and to |
|---|
| 42 | any system whose screen or terminals emulate ANSI standard terminals |
|---|
| 43 | such as old Digital VT100 terminals, |
|---|
| 44 | Telnet programs, |
|---|
| 45 | or VT100-compatible windows in the X windowing system. |
|---|
| 46 | For any other screen types, (such as Macintosh windows) there is a generic |
|---|
| 47 | option which does |
|---|
| 48 | not make use of screen graphics characters. The program will work well |
|---|
| 49 | in those cases, but the tree it displays will look a bit uglier. |
|---|
| 50 | <P> |
|---|
| 51 | The input data file is set up almost identically to the data files for |
|---|
| 52 | DNAPARS. The code for nucleotide sequences is the standard one, as |
|---|
| 53 | described in the molecular sequence programs document. |
|---|
| 54 | The user trees are contained in the input tree file |
|---|
| 55 | which is used for input of the starting tree (if desired). The |
|---|
| 56 | output tree file is used for the final tree. |
|---|
| 57 | <P> |
|---|
| 58 | The user interaction starts with the program presenting a menu. The |
|---|
| 59 | menu looks like this: |
|---|
| 60 | <P> |
|---|
| 61 | <TABLE><TR><TD BGCOLOR=white> |
|---|
| 62 | <PRE> |
|---|
| 63 | |
|---|
| 64 | Interactive DNA parsimony, version 3.6a3 |
|---|
| 65 | |
|---|
| 66 | Settings for this run: |
|---|
| 67 | O Outgroup root? No, use as outgroup species 1 |
|---|
| 68 | W Sites weighted? No |
|---|
| 69 | T Use Threshold parsimony? No, use ordinary parsimony |
|---|
| 70 | I Input sequences interleaved? Yes |
|---|
| 71 | U Initial tree (arbitrary, user, specify)? Arbitrary |
|---|
| 72 | 0 Graphics type (IBM PC, ANSI, none)? (none) |
|---|
| 73 | S Width of terminal screen? 80 |
|---|
| 74 | L Number of lines on screen? 24 |
|---|
| 75 | |
|---|
| 76 | Are these settings correct? (type Y or the letter for one to change) |
|---|
| 77 | |
|---|
| 78 | </PRE> |
|---|
| 79 | </TD></TR></TABLE> |
|---|
| 80 | <P> |
|---|
| 81 | The O (Outgroup), W (Weights), T (Threshold), and 0 (Graphics type) options |
|---|
| 82 | are the usual |
|---|
| 83 | ones and are described in the main documentation file. The I |
|---|
| 84 | (Interleaved) option is the usual one and is described in the main |
|---|
| 85 | documentation file and the molecular sequences programs documentation file. |
|---|
| 86 | The U (initial tree) option allows the user to choose whether |
|---|
| 87 | the initial tree is to be arbitrary, interactively specified by the user, or |
|---|
| 88 | read from a tree file. Typing U causes the program to change among the |
|---|
| 89 | three possibilities in turn. I |
|---|
| 90 | would recommend that for a first run, you allow the tree to be set up |
|---|
| 91 | arbitrarily (the default), as the "specify" choice is difficult |
|---|
| 92 | to use and the "user tree" choice requires that you have available a tree file |
|---|
| 93 | with the tree topology of the initial tree, which must be a rooted tree. |
|---|
| 94 | Its default name is <TT>intree</TT>. The program will ask you for its name if |
|---|
| 95 | it looks for the input tree file and does not find one of this name. |
|---|
| 96 | If you wish to set up some |
|---|
| 97 | particular tree you can also do that by the rearrangement commands specified |
|---|
| 98 | below. |
|---|
| 99 | <P> |
|---|
| 100 | The W (Weights) option allows only weights of 0 or 1. |
|---|
| 101 | <P> |
|---|
| 102 | The T (threshold) option allows a continuum of methods between parsimony and |
|---|
| 103 | compatibility. Thresholds less than or equal to 1.0 do not have any |
|---|
| 104 | meaning and should not be used: they will result in a tree dependent only on |
|---|
| 105 | the input order of species and not at all on the data! |
|---|
| 106 | <P> |
|---|
| 107 | The L (screen Lines) option allows the user to change the height of the |
|---|
| 108 | screen (in lines of characters) that is assumed to be available on the |
|---|
| 109 | display. This may be particularly helpful when displaying large trees |
|---|
| 110 | on terminals that have more than 24 lines per screen, or on workstation |
|---|
| 111 | or X-terminal screens that can emulate the ANSI terminals with more than |
|---|
| 112 | 24 lines. |
|---|
| 113 | <P> |
|---|
| 114 | After the initial menu is displayed and the choices are made, |
|---|
| 115 | the program then sets up an initial tree and displays it. Below it will be a |
|---|
| 116 | one-line menu of possible commands, which looks like this: |
|---|
| 117 | <P> |
|---|
| 118 | <PRE> |
|---|
| 119 | NEXT? (Options: R # + - S . T U W O F C H ? X Q) (H or ? for Help) |
|---|
| 120 | </PRE> |
|---|
| 121 | <P> |
|---|
| 122 | If you type H or ? you will get a single screen showing a description of each |
|---|
| 123 | of these commands in a few words. Here are slightly more detailed |
|---|
| 124 | descriptions: |
|---|
| 125 | <P> |
|---|
| 126 | <DL> |
|---|
| 127 | <DT>R ("Rearrange")</DT> <DD> This command asks for the number of a node which is to be |
|---|
| 128 | removed from the tree. It and everything to the right of it on the tree is to |
|---|
| 129 | be removed (by breaking the branch immediately below it). The command also |
|---|
| 130 | asks for the number of a node below which that group is to be inserted. If an |
|---|
| 131 | impossible number is given, the program refuses to carry out the rearrangement |
|---|
| 132 | and asks for a new command. The rearranged tree is displayed: it will often |
|---|
| 133 | have a different number of steps than the original. If you wish to undo a |
|---|
| 134 | rearrangement, use the Undo command, for which see below.</DD> |
|---|
| 135 | <DT>#</DT> <DD>This command, and the +, - and S commands described below, determine |
|---|
| 136 | which site has its states displayed on the branches of |
|---|
| 137 | the trees. The initial tree displayed by the program does not show |
|---|
| 138 | states of sites. When # is typed, the program does not ask the user which |
|---|
| 139 | site is to be shown but automatically shows the states of the next |
|---|
| 140 | site that is not compatible with the tree (the next site that does not |
|---|
| 141 | perfectly fit the current tree). The search for this site "wraps around" |
|---|
| 142 | so that if it reaches the last site without finding one that is not |
|---|
| 143 | compatible with the tree, the search continues at the first site; if no |
|---|
| 144 | incompatible site is found the current site is shown again, and if no current |
|---|
| 145 | site is being shown then the first site is shown. The display takes the form of |
|---|
| 146 | different symbols or textures on the branches of the tree. The state of each |
|---|
| 147 | branch is actually the state of the node above it. A key of the symbols or |
|---|
| 148 | shadings used for states A, C, G, T (U) and ? are shown next to the |
|---|
| 149 | tree. State ? means that more than one possible nucleotide could exist at |
|---|
| 150 | that point |
|---|
| 151 | on the tree, and that the user may want to consider the different |
|---|
| 152 | possibilities, which are usually apparent by inspection.</DD> |
|---|
| 153 | <DT>+</DT> <DD>This command is the same as \# except that it goes forward one site, |
|---|
| 154 | showing the states of the next site. If no site has been shown, using + will |
|---|
| 155 | cause the first site to be shown. Once the last site has been |
|---|
| 156 | reached, using + again will show the first site.</DT> |
|---|
| 157 | <P> |
|---|
| 158 | <DT>-</DT> <DD>This command is the same as + except that it goes backwards, showing the |
|---|
| 159 | states of the previous site. If no site has been shown, using - will |
|---|
| 160 | cause the last site to be shown. Once site number 1 has been |
|---|
| 161 | reached, using - again will show the last site.</DD> |
|---|
| 162 | <DT>S ("Show").</DT> <DD> This command is the same as + and - except that it causes |
|---|
| 163 | the program to ask you for the number of a site. That site is |
|---|
| 164 | the one whose states will be displayed. If you give the site number as 0, |
|---|
| 165 | the program will go back to not showing the states of the sites.</DD> |
|---|
| 166 | <DT>. (dot)</DT> <DD>This command simply causes the current tree to be redisplayed. It is of |
|---|
| 167 | use when the tree has partly disappeared off of the top of the screen owing to |
|---|
| 168 | too many responses to commands being printed out at the bottom of the screen. |
|---|
| 169 | </DD> |
|---|
| 170 | <P> |
|---|
| 171 | <DT>T ("Try rearrangements").</DT> <DD>This command asks for the name of a node. The |
|---|
| 172 | part of the tree at and above that node is removed from the tree. The program |
|---|
| 173 | tries to re-insert it in each possible location on the tree (this may take some |
|---|
| 174 | time, and the program reminds you to wait). Then it prints out a summary. For |
|---|
| 175 | each possible location the program prints out the number of the node to the |
|---|
| 176 | right of the |
|---|
| 177 | place of insertion and the number of steps required in each case. These are |
|---|
| 178 | divided into those that are better then or tied with the current tree. Once |
|---|
| 179 | this summary is printed out, the group that was removed is reinserted into its |
|---|
| 180 | original position. It is up to you to use the R command to actually carry out |
|---|
| 181 | any of the arrangements that have been tried. </DD> |
|---|
| 182 | <DT>U ("Undo").</DT> <DD>This command reverses the effect of the most recent |
|---|
| 183 | rearrangement, outgroup re-rooting, or flipping of branches. It returns to the |
|---|
| 184 | previous tree topology. It will be of great use when rearranging the tree and |
|---|
| 185 | when a rearrangement proves worse than the preceding one -- it permits you to |
|---|
| 186 | abandon the new one and return to the previous one without remembering its |
|---|
| 187 | topology in detail.</DD> |
|---|
| 188 | <DT>W ("Write").</DT> <DD>This command writes out the current tree onto a tree output |
|---|
| 189 | file. If the file already has been written to by this run of DNAMOVE, it will |
|---|
| 190 | ask you whether you want to replace the contents of the file, add the tree to |
|---|
| 191 | the end of the file, or not write out the tree to the file. The tree |
|---|
| 192 | is written in the standard format used by PHYLIP (a subset of the |
|---|
| 193 | Newick standard). It is in the proper format to serve as the |
|---|
| 194 | User-Defined Tree for setting up the initial tree in a subsequent run of the |
|---|
| 195 | program. Note that if you provided the initial tree topology in a tree file |
|---|
| 196 | and replace its contents, that initial tree will be lost.</DD> |
|---|
| 197 | <DT>O ("Outgroup").</DT> <DD>This asks for the number of a node which is to be the |
|---|
| 198 | outgroup. The tree will be redisplayed with that node |
|---|
| 199 | as the left descendant of the bottom fork. Note that it is possible to |
|---|
| 200 | use this to make a multi-species group the outgroup (i.e., you can give the |
|---|
| 201 | number of an interior node of the tree as the outgroup, and the program will |
|---|
| 202 | re-root the tree properly with that on the left of the bottom fork.</DD> |
|---|
| 203 | <DT>F ("Flip").</DT> <DD>This asks for a node number and then flips the two branches at |
|---|
| 204 | that node, so that the left-right order of branches at that node is |
|---|
| 205 | changed. This does not actually change the tree topology (or the number of |
|---|
| 206 | steps on that tree) but it does change the appearance of the tree.</DD> |
|---|
| 207 | <DT>C ("Clade").</DT> <DD>When the data consist of more than 12 species (or more than |
|---|
| 208 | half the number of lines on the screen if this is not 24), it may be |
|---|
| 209 | difficult to display the tree on one screen. In that case the tree |
|---|
| 210 | will be squeezed down to |
|---|
| 211 | one line per species. This is too small to see all the interior states of the |
|---|
| 212 | tree. The C command instructs the program to print out only that part of the |
|---|
| 213 | tree (the "clade") from a certain node on up. The program will prompt you for |
|---|
| 214 | the number of this node. Remember that thereafter you are not looking at the |
|---|
| 215 | whole tree. To go back to looking at the whole tree give the C command again |
|---|
| 216 | and enter "0" for the node number when asked. Most users will not want to use |
|---|
| 217 | this option unless forced to.</DD> |
|---|
| 218 | <DT>H ("Help").</DT> <DD>Prints a one-screen summary of what the commands do, a few |
|---|
| 219 | words for each command.</DD> |
|---|
| 220 | <DT>? ("huh?").</DT> <DD>A synonym for H. Same as Help command.</DD> |
|---|
| 221 | <DT>X ("Exit").</DT> <DD>Exit from program. If the current tree has not yet been saved |
|---|
| 222 | into a file, the program will first ask you whether it should be saved.</DD> |
|---|
| 223 | <DT>Q ("Quit").</DT> <DD>A synonym for X. Same as the eXit command.</DD> |
|---|
| 224 | </DL> |
|---|
| 225 | <P> |
|---|
| 226 | <H3>ADAPTING THE PROGRAM TO YOUR COMPUTER AND TO YOUR TERMINAL</H3> |
|---|
| 227 | <P> |
|---|
| 228 | As we have seen, the initial menu of the program allows you to choose |
|---|
| 229 | among three screen types (PCDOS, Ansi, and none). We have tried to |
|---|
| 230 | have the default values be correct for PC, Macintosh, and Unix |
|---|
| 231 | screens. If the setting is "none" (which is necessary on |
|---|
| 232 | Macintosh screens), the special graphics |
|---|
| 233 | characters will not be used to indicate nucleotide states, but only letters |
|---|
| 234 | will be used for the four nucleotides. This is less easy to look at. |
|---|
| 235 | <P> |
|---|
| 236 | <H2>MORE ABOUT THE PARSIMONY CRITERION</H2> |
|---|
| 237 | <P> |
|---|
| 238 | This program carries out unrooted parsimony (analogous to Wagner |
|---|
| 239 | trees) (Eck and Dayhoff, 1966; Kluge and Farris, 1969) on DNA |
|---|
| 240 | sequences. The method of Fitch (1971) is used to count the number of |
|---|
| 241 | changes of base needed on a given tree. The assumptions of this |
|---|
| 242 | method are exactly analogous to those of MIX: |
|---|
| 243 | <P> |
|---|
| 244 | <OL> |
|---|
| 245 | <LI>Each site evolves independently. |
|---|
| 246 | <LI>Different lineages evolve independently. |
|---|
| 247 | <LI>The probability of a base substitution at a given site is |
|---|
| 248 | small over the lengths of time involved in |
|---|
| 249 | a branch of the phylogeny. |
|---|
| 250 | <LI>The expected amounts of change in different branches of the phylogeny |
|---|
| 251 | do not vary by so much that two changes in a high-rate branch |
|---|
| 252 | are more probable than one change in a low-rate branch. |
|---|
| 253 | <LI>The expected amounts of change do not vary enough among sites that two |
|---|
| 254 | changes in one site are more probable than one change in another. |
|---|
| 255 | </OL> |
|---|
| 256 | <P> |
|---|
| 257 | That these are the assumptions of parsimony methods has been documented |
|---|
| 258 | in a series of papers of mine: (1973a, 1978b, 1979, 1981b, |
|---|
| 259 | 1983b, 1988b). For an opposing view arguing that the parsimony methods |
|---|
| 260 | make no substantive |
|---|
| 261 | assumptions such as these, see the papers by Farris (1983) and Sober (1983a, |
|---|
| 262 | 1983b), but also read the exchange between Felsenstein and Sober (1986). |
|---|
| 263 | <P> |
|---|
| 264 | Change from an occupied site to a deletion is counted as one |
|---|
| 265 | change. Reversion from a deletion to an occupied site is allowed and is also |
|---|
| 266 | counted as one change. |
|---|
| 267 | <P> |
|---|
| 268 | Below is a test data set, but we cannot show the |
|---|
| 269 | output it generates because of the interactive nature of the program. |
|---|
| 270 | <P> |
|---|
| 271 | <HR> |
|---|
| 272 | <P> |
|---|
| 273 | <H3>DATA SET</H3> |
|---|
| 274 | <P> |
|---|
| 275 | <TABLE><TR><TD BGCOLOR=white> |
|---|
| 276 | <PRE> |
|---|
| 277 | 5 13 |
|---|
| 278 | Alpha AACGUGGCCA AAU |
|---|
| 279 | Beta AAGGUCGCCA AAC |
|---|
| 280 | Gamma CAUUUCGUCA CAA |
|---|
| 281 | Delta GGUAUUUCGG CCU |
|---|
| 282 | Epsilon GGGAUCUCGG CCC |
|---|
| 283 | </PRE> |
|---|
| 284 | </TD></TR></TABLE> |
|---|
| 285 | </BODY> |
|---|
| 286 | </HTML> |
|---|