1 | <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 3.2 Final//EN"> |
---|
2 | <HTML> |
---|
3 | <HEAD> |
---|
4 | <TITLE>dnamove</TITLE> |
---|
5 | <META NAME="description" CONTENT="dnamove"> |
---|
6 | <META NAME="keywords" CONTENT="dnamove"> |
---|
7 | <META NAME="resource-type" CONTENT="document"> |
---|
8 | <META NAME="distribution" CONTENT="global"> |
---|
9 | <META HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=iso-8859-1"> |
---|
10 | </HEAD> |
---|
11 | <BODY BGCOLOR="#ccffff"> |
---|
12 | <DIV ALIGN=RIGHT> |
---|
13 | version 3.6 |
---|
14 | </DIV> |
---|
15 | <P> |
---|
16 | <DIV ALIGN=CENTER> |
---|
17 | <H1>DNAMOVE - Interactive DNA parsimony</H1> |
---|
18 | </DIV> |
---|
19 | <P> |
---|
20 | © Copyright 1986-2002 by the University of |
---|
21 | Washington. Written by Joseph Felsenstein. Permission is granted to copy |
---|
22 | this document provided that no fee is charged for it and that this copyright |
---|
23 | notice is not removed. |
---|
24 | <P> |
---|
25 | DNAMOVE is an interactive DNA parsimony program, inspired by Wayne Maddison and |
---|
26 | David and Wayne Maddison's marvellous program MacClade, which is written for |
---|
27 | Macintosh computers. DNAMOVE reads in a data set which is prepared in almost |
---|
28 | the same format as one for the DNA parsimony program DNAPARS. It allows |
---|
29 | the user to choose an initial tree, and displays this tree on the screen. The |
---|
30 | user can look at different sites and the way the nucleotide states are |
---|
31 | distributed on that tree, given the most parsimonious reconstruction of state |
---|
32 | changes for that particular tree. The user then can specify how the tree is to |
---|
33 | be rearraranged, rerooted or written out to a file. By looking at different |
---|
34 | rearrangements of the tree the user can manually search for the most |
---|
35 | parsimonious tree, and can get a feel for how different sites are affected |
---|
36 | by changes in the tree topology. |
---|
37 | <P> |
---|
38 | This program uses graphic characters that show the tree to best |
---|
39 | advantage on some computer systems. |
---|
40 | Its graphic characters will work best on MSDOS systems or MSDOS windows in |
---|
41 | Windows, and to |
---|
42 | any system whose screen or terminals emulate ANSI standard terminals |
---|
43 | such as old Digital VT100 terminals, |
---|
44 | Telnet programs, |
---|
45 | or VT100-compatible windows in the X windowing system. |
---|
46 | For any other screen types, (such as Macintosh windows) there is a generic |
---|
47 | option which does |
---|
48 | not make use of screen graphics characters. The program will work well |
---|
49 | in those cases, but the tree it displays will look a bit uglier. |
---|
50 | <P> |
---|
51 | The input data file is set up almost identically to the data files for |
---|
52 | DNAPARS. The code for nucleotide sequences is the standard one, as |
---|
53 | described in the molecular sequence programs document. |
---|
54 | The user trees are contained in the input tree file |
---|
55 | which is used for input of the starting tree (if desired). The |
---|
56 | output tree file is used for the final tree. |
---|
57 | <P> |
---|
58 | The user interaction starts with the program presenting a menu. The |
---|
59 | menu looks like this: |
---|
60 | <P> |
---|
61 | <TABLE><TR><TD BGCOLOR=white> |
---|
62 | <PRE> |
---|
63 | |
---|
64 | Interactive DNA parsimony, version 3.6a3 |
---|
65 | |
---|
66 | Settings for this run: |
---|
67 | O Outgroup root? No, use as outgroup species 1 |
---|
68 | W Sites weighted? No |
---|
69 | T Use Threshold parsimony? No, use ordinary parsimony |
---|
70 | I Input sequences interleaved? Yes |
---|
71 | U Initial tree (arbitrary, user, specify)? Arbitrary |
---|
72 | 0 Graphics type (IBM PC, ANSI, none)? (none) |
---|
73 | S Width of terminal screen? 80 |
---|
74 | L Number of lines on screen? 24 |
---|
75 | |
---|
76 | Are these settings correct? (type Y or the letter for one to change) |
---|
77 | |
---|
78 | </PRE> |
---|
79 | </TD></TR></TABLE> |
---|
80 | <P> |
---|
81 | The O (Outgroup), W (Weights), T (Threshold), and 0 (Graphics type) options |
---|
82 | are the usual |
---|
83 | ones and are described in the main documentation file. The I |
---|
84 | (Interleaved) option is the usual one and is described in the main |
---|
85 | documentation file and the molecular sequences programs documentation file. |
---|
86 | The U (initial tree) option allows the user to choose whether |
---|
87 | the initial tree is to be arbitrary, interactively specified by the user, or |
---|
88 | read from a tree file. Typing U causes the program to change among the |
---|
89 | three possibilities in turn. I |
---|
90 | would recommend that for a first run, you allow the tree to be set up |
---|
91 | arbitrarily (the default), as the "specify" choice is difficult |
---|
92 | to use and the "user tree" choice requires that you have available a tree file |
---|
93 | with the tree topology of the initial tree, which must be a rooted tree. |
---|
94 | Its default name is <TT>intree</TT>. The program will ask you for its name if |
---|
95 | it looks for the input tree file and does not find one of this name. |
---|
96 | If you wish to set up some |
---|
97 | particular tree you can also do that by the rearrangement commands specified |
---|
98 | below. |
---|
99 | <P> |
---|
100 | The W (Weights) option allows only weights of 0 or 1. |
---|
101 | <P> |
---|
102 | The T (threshold) option allows a continuum of methods between parsimony and |
---|
103 | compatibility. Thresholds less than or equal to 1.0 do not have any |
---|
104 | meaning and should not be used: they will result in a tree dependent only on |
---|
105 | the input order of species and not at all on the data! |
---|
106 | <P> |
---|
107 | The L (screen Lines) option allows the user to change the height of the |
---|
108 | screen (in lines of characters) that is assumed to be available on the |
---|
109 | display. This may be particularly helpful when displaying large trees |
---|
110 | on terminals that have more than 24 lines per screen, or on workstation |
---|
111 | or X-terminal screens that can emulate the ANSI terminals with more than |
---|
112 | 24 lines. |
---|
113 | <P> |
---|
114 | After the initial menu is displayed and the choices are made, |
---|
115 | the program then sets up an initial tree and displays it. Below it will be a |
---|
116 | one-line menu of possible commands, which looks like this: |
---|
117 | <P> |
---|
118 | <PRE> |
---|
119 | NEXT? (Options: R # + - S . T U W O F C H ? X Q) (H or ? for Help) |
---|
120 | </PRE> |
---|
121 | <P> |
---|
122 | If you type H or ? you will get a single screen showing a description of each |
---|
123 | of these commands in a few words. Here are slightly more detailed |
---|
124 | descriptions: |
---|
125 | <P> |
---|
126 | <DL> |
---|
127 | <DT>R ("Rearrange")</DT> <DD> This command asks for the number of a node which is to be |
---|
128 | removed from the tree. It and everything to the right of it on the tree is to |
---|
129 | be removed (by breaking the branch immediately below it). The command also |
---|
130 | asks for the number of a node below which that group is to be inserted. If an |
---|
131 | impossible number is given, the program refuses to carry out the rearrangement |
---|
132 | and asks for a new command. The rearranged tree is displayed: it will often |
---|
133 | have a different number of steps than the original. If you wish to undo a |
---|
134 | rearrangement, use the Undo command, for which see below.</DD> |
---|
135 | <DT>#</DT> <DD>This command, and the +, - and S commands described below, determine |
---|
136 | which site has its states displayed on the branches of |
---|
137 | the trees. The initial tree displayed by the program does not show |
---|
138 | states of sites. When # is typed, the program does not ask the user which |
---|
139 | site is to be shown but automatically shows the states of the next |
---|
140 | site that is not compatible with the tree (the next site that does not |
---|
141 | perfectly fit the current tree). The search for this site "wraps around" |
---|
142 | so that if it reaches the last site without finding one that is not |
---|
143 | compatible with the tree, the search continues at the first site; if no |
---|
144 | incompatible site is found the current site is shown again, and if no current |
---|
145 | site is being shown then the first site is shown. The display takes the form of |
---|
146 | different symbols or textures on the branches of the tree. The state of each |
---|
147 | branch is actually the state of the node above it. A key of the symbols or |
---|
148 | shadings used for states A, C, G, T (U) and ? are shown next to the |
---|
149 | tree. State ? means that more than one possible nucleotide could exist at |
---|
150 | that point |
---|
151 | on the tree, and that the user may want to consider the different |
---|
152 | possibilities, which are usually apparent by inspection.</DD> |
---|
153 | <DT>+</DT> <DD>This command is the same as \# except that it goes forward one site, |
---|
154 | showing the states of the next site. If no site has been shown, using + will |
---|
155 | cause the first site to be shown. Once the last site has been |
---|
156 | reached, using + again will show the first site.</DT> |
---|
157 | <P> |
---|
158 | <DT>-</DT> <DD>This command is the same as + except that it goes backwards, showing the |
---|
159 | states of the previous site. If no site has been shown, using - will |
---|
160 | cause the last site to be shown. Once site number 1 has been |
---|
161 | reached, using - again will show the last site.</DD> |
---|
162 | <DT>S ("Show").</DT> <DD> This command is the same as + and - except that it causes |
---|
163 | the program to ask you for the number of a site. That site is |
---|
164 | the one whose states will be displayed. If you give the site number as 0, |
---|
165 | the program will go back to not showing the states of the sites.</DD> |
---|
166 | <DT>. (dot)</DT> <DD>This command simply causes the current tree to be redisplayed. It is of |
---|
167 | use when the tree has partly disappeared off of the top of the screen owing to |
---|
168 | too many responses to commands being printed out at the bottom of the screen. |
---|
169 | </DD> |
---|
170 | <P> |
---|
171 | <DT>T ("Try rearrangements").</DT> <DD>This command asks for the name of a node. The |
---|
172 | part of the tree at and above that node is removed from the tree. The program |
---|
173 | tries to re-insert it in each possible location on the tree (this may take some |
---|
174 | time, and the program reminds you to wait). Then it prints out a summary. For |
---|
175 | each possible location the program prints out the number of the node to the |
---|
176 | right of the |
---|
177 | place of insertion and the number of steps required in each case. These are |
---|
178 | divided into those that are better then or tied with the current tree. Once |
---|
179 | this summary is printed out, the group that was removed is reinserted into its |
---|
180 | original position. It is up to you to use the R command to actually carry out |
---|
181 | any of the arrangements that have been tried. </DD> |
---|
182 | <DT>U ("Undo").</DT> <DD>This command reverses the effect of the most recent |
---|
183 | rearrangement, outgroup re-rooting, or flipping of branches. It returns to the |
---|
184 | previous tree topology. It will be of great use when rearranging the tree and |
---|
185 | when a rearrangement proves worse than the preceding one -- it permits you to |
---|
186 | abandon the new one and return to the previous one without remembering its |
---|
187 | topology in detail.</DD> |
---|
188 | <DT>W ("Write").</DT> <DD>This command writes out the current tree onto a tree output |
---|
189 | file. If the file already has been written to by this run of DNAMOVE, it will |
---|
190 | ask you whether you want to replace the contents of the file, add the tree to |
---|
191 | the end of the file, or not write out the tree to the file. The tree |
---|
192 | is written in the standard format used by PHYLIP (a subset of the |
---|
193 | Newick standard). It is in the proper format to serve as the |
---|
194 | User-Defined Tree for setting up the initial tree in a subsequent run of the |
---|
195 | program. Note that if you provided the initial tree topology in a tree file |
---|
196 | and replace its contents, that initial tree will be lost.</DD> |
---|
197 | <DT>O ("Outgroup").</DT> <DD>This asks for the number of a node which is to be the |
---|
198 | outgroup. The tree will be redisplayed with that node |
---|
199 | as the left descendant of the bottom fork. Note that it is possible to |
---|
200 | use this to make a multi-species group the outgroup (i.e., you can give the |
---|
201 | number of an interior node of the tree as the outgroup, and the program will |
---|
202 | re-root the tree properly with that on the left of the bottom fork.</DD> |
---|
203 | <DT>F ("Flip").</DT> <DD>This asks for a node number and then flips the two branches at |
---|
204 | that node, so that the left-right order of branches at that node is |
---|
205 | changed. This does not actually change the tree topology (or the number of |
---|
206 | steps on that tree) but it does change the appearance of the tree.</DD> |
---|
207 | <DT>C ("Clade").</DT> <DD>When the data consist of more than 12 species (or more than |
---|
208 | half the number of lines on the screen if this is not 24), it may be |
---|
209 | difficult to display the tree on one screen. In that case the tree |
---|
210 | will be squeezed down to |
---|
211 | one line per species. This is too small to see all the interior states of the |
---|
212 | tree. The C command instructs the program to print out only that part of the |
---|
213 | tree (the "clade") from a certain node on up. The program will prompt you for |
---|
214 | the number of this node. Remember that thereafter you are not looking at the |
---|
215 | whole tree. To go back to looking at the whole tree give the C command again |
---|
216 | and enter "0" for the node number when asked. Most users will not want to use |
---|
217 | this option unless forced to.</DD> |
---|
218 | <DT>H ("Help").</DT> <DD>Prints a one-screen summary of what the commands do, a few |
---|
219 | words for each command.</DD> |
---|
220 | <DT>? ("huh?").</DT> <DD>A synonym for H. Same as Help command.</DD> |
---|
221 | <DT>X ("Exit").</DT> <DD>Exit from program. If the current tree has not yet been saved |
---|
222 | into a file, the program will first ask you whether it should be saved.</DD> |
---|
223 | <DT>Q ("Quit").</DT> <DD>A synonym for X. Same as the eXit command.</DD> |
---|
224 | </DL> |
---|
225 | <P> |
---|
226 | <H3>ADAPTING THE PROGRAM TO YOUR COMPUTER AND TO YOUR TERMINAL</H3> |
---|
227 | <P> |
---|
228 | As we have seen, the initial menu of the program allows you to choose |
---|
229 | among three screen types (PCDOS, Ansi, and none). We have tried to |
---|
230 | have the default values be correct for PC, Macintosh, and Unix |
---|
231 | screens. If the setting is "none" (which is necessary on |
---|
232 | Macintosh screens), the special graphics |
---|
233 | characters will not be used to indicate nucleotide states, but only letters |
---|
234 | will be used for the four nucleotides. This is less easy to look at. |
---|
235 | <P> |
---|
236 | <H2>MORE ABOUT THE PARSIMONY CRITERION</H2> |
---|
237 | <P> |
---|
238 | This program carries out unrooted parsimony (analogous to Wagner |
---|
239 | trees) (Eck and Dayhoff, 1966; Kluge and Farris, 1969) on DNA |
---|
240 | sequences. The method of Fitch (1971) is used to count the number of |
---|
241 | changes of base needed on a given tree. The assumptions of this |
---|
242 | method are exactly analogous to those of MIX: |
---|
243 | <P> |
---|
244 | <OL> |
---|
245 | <LI>Each site evolves independently. |
---|
246 | <LI>Different lineages evolve independently. |
---|
247 | <LI>The probability of a base substitution at a given site is |
---|
248 | small over the lengths of time involved in |
---|
249 | a branch of the phylogeny. |
---|
250 | <LI>The expected amounts of change in different branches of the phylogeny |
---|
251 | do not vary by so much that two changes in a high-rate branch |
---|
252 | are more probable than one change in a low-rate branch. |
---|
253 | <LI>The expected amounts of change do not vary enough among sites that two |
---|
254 | changes in one site are more probable than one change in another. |
---|
255 | </OL> |
---|
256 | <P> |
---|
257 | That these are the assumptions of parsimony methods has been documented |
---|
258 | in a series of papers of mine: (1973a, 1978b, 1979, 1981b, |
---|
259 | 1983b, 1988b). For an opposing view arguing that the parsimony methods |
---|
260 | make no substantive |
---|
261 | assumptions such as these, see the papers by Farris (1983) and Sober (1983a, |
---|
262 | 1983b), but also read the exchange between Felsenstein and Sober (1986). |
---|
263 | <P> |
---|
264 | Change from an occupied site to a deletion is counted as one |
---|
265 | change. Reversion from a deletion to an occupied site is allowed and is also |
---|
266 | counted as one change. |
---|
267 | <P> |
---|
268 | Below is a test data set, but we cannot show the |
---|
269 | output it generates because of the interactive nature of the program. |
---|
270 | <P> |
---|
271 | <HR> |
---|
272 | <P> |
---|
273 | <H3>DATA SET</H3> |
---|
274 | <P> |
---|
275 | <TABLE><TR><TD BGCOLOR=white> |
---|
276 | <PRE> |
---|
277 | 5 13 |
---|
278 | Alpha AACGUGGCCA AAU |
---|
279 | Beta AAGGUCGCCA AAC |
---|
280 | Gamma CAUUUCGUCA CAA |
---|
281 | Delta GGUAUUUCGG CCU |
---|
282 | Epsilon GGGAUCUCGG CCC |
---|
283 | </PRE> |
---|
284 | </TD></TR></TABLE> |
---|
285 | </BODY> |
---|
286 | </HTML> |
---|