Context Navigation

dnamove.html

Visit:

Last change on this file was 2176, checked in by westram, 22 years ago
* empty log message *
Property svn:eol-style set to `native` Property svn:keywords set to `Author Date Id Revision`
File size: 14.4 KB

Line
1	<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 3.2 Final//EN">
2	<HTML>
3	<HEAD>
4	<TITLE>dnamove</TITLE>
5	<META NAME="description" CONTENT="dnamove">
6	<META NAME="keywords" CONTENT="dnamove">
7	<META NAME="resource-type" CONTENT="document">
8	<META NAME="distribution" CONTENT="global">
9	<META HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=iso-8859-1">
10	</HEAD>
11	<BODY BGCOLOR="#ccffff">
12	<DIV ALIGN=RIGHT>
13	version 3.6
14	</DIV>
15	<P>
16	<DIV ALIGN=CENTER>
17	<H1>DNAMOVE - Interactive DNA parsimony</H1>
18	</DIV>
19	<P>
20	© Copyright 1986-2002 by the University of
21	Washington. Written by Joseph Felsenstein. Permission is granted to copy
22	this document provided that no fee is charged for it and that this copyright
23	notice is not removed.
24	<P>
25	DNAMOVE is an interactive DNA parsimony program, inspired by Wayne Maddison and
26	David and Wayne Maddison's marvellous program MacClade, which is written for
27	Macintosh computers. DNAMOVE reads in a data set which is prepared in almost
28	the same format as one for the DNA parsimony program DNAPARS. It allows
29	the user to choose an initial tree, and displays this tree on the screen. The
30	user can look at different sites and the way the nucleotide states are
31	distributed on that tree, given the most parsimonious reconstruction of state
32	changes for that particular tree. The user then can specify how the tree is to
33	be rearraranged, rerooted or written out to a file. By looking at different
34	rearrangements of the tree the user can manually search for the most
35	parsimonious tree, and can get a feel for how different sites are affected
36	by changes in the tree topology.
37	<P>
38	This program uses graphic characters that show the tree to best
39	advantage on some computer systems.
40	Its graphic characters will work best on MSDOS systems or MSDOS windows in
41	Windows, and to
42	any system whose screen or terminals emulate ANSI standard terminals
43	such as old Digital VT100 terminals,
44	Telnet programs,
45	or VT100-compatible windows in the X windowing system.
46	For any other screen types, (such as Macintosh windows) there is a generic
47	option which does
48	not make use of screen graphics characters. The program will work well
49	in those cases, but the tree it displays will look a bit uglier.
50	<P>
51	The input data file is set up almost identically to the data files for
52	DNAPARS. The code for nucleotide sequences is the standard one, as
53	described in the molecular sequence programs document.
54	The user trees are contained in the input tree file
55	which is used for input of the starting tree (if desired). The
56	output tree file is used for the final tree.
57	<P>
58	The user interaction starts with the program presenting a menu. The
59	menu looks like this:
60	<P>
61	<TABLE><TR><TD BGCOLOR=white>
62	<PRE>
63
64	Interactive DNA parsimony, version 3.6a3
65
66	Settings for this run:
67	O Outgroup root? No, use as outgroup species 1
68	W Sites weighted? No
69	T Use Threshold parsimony? No, use ordinary parsimony
70	I Input sequences interleaved? Yes
71	U Initial tree (arbitrary, user, specify)? Arbitrary
72	0 Graphics type (IBM PC, ANSI, none)? (none)
73	S Width of terminal screen? 80
74	L Number of lines on screen? 24
75
76	Are these settings correct? (type Y or the letter for one to change)
77
78	</PRE>
79	</TD></TR></TABLE>
80	<P>
81	The O (Outgroup), W (Weights), T (Threshold), and 0 (Graphics type) options
82	are the usual
83	ones and are described in the main documentation file. The I
84	(Interleaved) option is the usual one and is described in the main
85	documentation file and the molecular sequences programs documentation file.
86	The U (initial tree) option allows the user to choose whether
87	the initial tree is to be arbitrary, interactively specified by the user, or
88	read from a tree file. Typing U causes the program to change among the
89	three possibilities in turn. I
90	would recommend that for a first run, you allow the tree to be set up
91	arbitrarily (the default), as the "specify" choice is difficult
92	to use and the "user tree" choice requires that you have available a tree file
93	with the tree topology of the initial tree, which must be a rooted tree.
94	Its default name is <TT>intree</TT>. The program will ask you for its name if
95	it looks for the input tree file and does not find one of this name.
96	If you wish to set up some
97	particular tree you can also do that by the rearrangement commands specified
98	below.
99	<P>
100	The W (Weights) option allows only weights of 0 or 1.
101	<P>
102	The T (threshold) option allows a continuum of methods between parsimony and
103	compatibility. Thresholds less than or equal to 1.0 do not have any
104	meaning and should not be used: they will result in a tree dependent only on
105	the input order of species and not at all on the data!
106	<P>
107	The L (screen Lines) option allows the user to change the height of the
108	screen (in lines of characters) that is assumed to be available on the
109	display. This may be particularly helpful when displaying large trees
110	on terminals that have more than 24 lines per screen, or on workstation
111	or X-terminal screens that can emulate the ANSI terminals with more than
112	24 lines.
113	<P>
114	After the initial menu is displayed and the choices are made,
115	the program then sets up an initial tree and displays it. Below it will be a
116	one-line menu of possible commands, which looks like this:
117	<P>
118	<PRE>
119	NEXT? (Options: R # + - S . T U W O F C H ? X Q) (H or ? for Help)
120	</PRE>
121	<P>
122	If you type H or ? you will get a single screen showing a description of each
123	of these commands in a few words. Here are slightly more detailed
124	descriptions:
125	<P>
126	<DL>
127	<DT>R ("Rearrange")</DT> <DD> This command asks for the number of a node which is to be
128	removed from the tree. It and everything to the right of it on the tree is to
129	be removed (by breaking the branch immediately below it). The command also
130	asks for the number of a node below which that group is to be inserted. If an
131	impossible number is given, the program refuses to carry out the rearrangement
132	and asks for a new command. The rearranged tree is displayed: it will often
133	have a different number of steps than the original. If you wish to undo a
134	rearrangement, use the Undo command, for which see below.</DD>
135	<DT>#</DT> <DD>This command, and the +, - and S commands described below, determine
136	which site has its states displayed on the branches of
137	the trees. The initial tree displayed by the program does not show
138	states of sites. When # is typed, the program does not ask the user which
139	site is to be shown but automatically shows the states of the next
140	site that is not compatible with the tree (the next site that does not
141	perfectly fit the current tree). The search for this site "wraps around"
142	so that if it reaches the last site without finding one that is not
143	compatible with the tree, the search continues at the first site; if no
144	incompatible site is found the current site is shown again, and if no current
145	site is being shown then the first site is shown. The display takes the form of
146	different symbols or textures on the branches of the tree. The state of each
147	branch is actually the state of the node above it. A key of the symbols or
148	shadings used for states A, C, G, T (U) and ? are shown next to the
149	tree. State ? means that more than one possible nucleotide could exist at
150	that point
151	on the tree, and that the user may want to consider the different
152	possibilities, which are usually apparent by inspection.</DD>
153	<DT>+</DT> <DD>This command is the same as \# except that it goes forward one site,
154	showing the states of the next site. If no site has been shown, using + will
155	cause the first site to be shown. Once the last site has been
156	reached, using + again will show the first site.</DT>
157	<P>
158	<DT>-</DT> <DD>This command is the same as + except that it goes backwards, showing the
159	states of the previous site. If no site has been shown, using - will
160	cause the last site to be shown. Once site number 1 has been
161	reached, using - again will show the last site.</DD>
162	<DT>S ("Show").</DT> <DD> This command is the same as + and - except that it causes
163	the program to ask you for the number of a site. That site is
164	the one whose states will be displayed. If you give the site number as 0,
165	the program will go back to not showing the states of the sites.</DD>
166	<DT>. (dot)</DT> <DD>This command simply causes the current tree to be redisplayed. It is of
167	use when the tree has partly disappeared off of the top of the screen owing to
168	too many responses to commands being printed out at the bottom of the screen.
169	</DD>
170	<P>
171	<DT>T ("Try rearrangements").</DT> <DD>This command asks for the name of a node. The
172	part of the tree at and above that node is removed from the tree. The program
173	tries to re-insert it in each possible location on the tree (this may take some
174	time, and the program reminds you to wait). Then it prints out a summary. For
175	each possible location the program prints out the number of the node to the
176	right of the
177	place of insertion and the number of steps required in each case. These are
178	divided into those that are better then or tied with the current tree. Once
179	this summary is printed out, the group that was removed is reinserted into its
180	original position. It is up to you to use the R command to actually carry out
181	any of the arrangements that have been tried. </DD>
182	<DT>U ("Undo").</DT> <DD>This command reverses the effect of the most recent
183	rearrangement, outgroup re-rooting, or flipping of branches. It returns to the
184	previous tree topology. It will be of great use when rearranging the tree and
185	when a rearrangement proves worse than the preceding one -- it permits you to
186	abandon the new one and return to the previous one without remembering its
187	topology in detail.</DD>
188	<DT>W ("Write").</DT> <DD>This command writes out the current tree onto a tree output
189	file. If the file already has been written to by this run of DNAMOVE, it will
190	ask you whether you want to replace the contents of the file, add the tree to
191	the end of the file, or not write out the tree to the file. The tree
192	is written in the standard format used by PHYLIP (a subset of the
193	Newick standard). It is in the proper format to serve as the
194	User-Defined Tree for setting up the initial tree in a subsequent run of the
195	program. Note that if you provided the initial tree topology in a tree file
196	and replace its contents, that initial tree will be lost.</DD>
197	<DT>O ("Outgroup").</DT> <DD>This asks for the number of a node which is to be the
198	outgroup. The tree will be redisplayed with that node
199	as the left descendant of the bottom fork. Note that it is possible to
200	use this to make a multi-species group the outgroup (i.e., you can give the
201	number of an interior node of the tree as the outgroup, and the program will
202	re-root the tree properly with that on the left of the bottom fork.</DD>
203	<DT>F ("Flip").</DT> <DD>This asks for a node number and then flips the two branches at
204	that node, so that the left-right order of branches at that node is
205	changed. This does not actually change the tree topology (or the number of
206	steps on that tree) but it does change the appearance of the tree.</DD>
207	<DT>C ("Clade").</DT> <DD>When the data consist of more than 12 species (or more than
208	half the number of lines on the screen if this is not 24), it may be
209	difficult to display the tree on one screen. In that case the tree
210	will be squeezed down to
211	one line per species. This is too small to see all the interior states of the
212	tree. The C command instructs the program to print out only that part of the
213	tree (the "clade") from a certain node on up. The program will prompt you for
214	the number of this node. Remember that thereafter you are not looking at the
215	whole tree. To go back to looking at the whole tree give the C command again
216	and enter "0" for the node number when asked. Most users will not want to use
217	this option unless forced to.</DD>
218	<DT>H ("Help").</DT> <DD>Prints a one-screen summary of what the commands do, a few
219	words for each command.</DD>
220	<DT>? ("huh?").</DT> <DD>A synonym for H. Same as Help command.</DD>
221	<DT>X ("Exit").</DT> <DD>Exit from program. If the current tree has not yet been saved
222	into a file, the program will first ask you whether it should be saved.</DD>
223	<DT>Q ("Quit").</DT> <DD>A synonym for X. Same as the eXit command.</DD>
224	</DL>
225	<P>
226	<H3>ADAPTING THE PROGRAM TO YOUR COMPUTER AND TO YOUR TERMINAL</H3>
227	<P>
228	As we have seen, the initial menu of the program allows you to choose
229	among three screen types (PCDOS, Ansi, and none). We have tried to
230	have the default values be correct for PC, Macintosh, and Unix
231	screens. If the setting is "none" (which is necessary on
232	Macintosh screens), the special graphics
233	characters will not be used to indicate nucleotide states, but only letters
234	will be used for the four nucleotides. This is less easy to look at.
235	<P>
236	<H2>MORE ABOUT THE PARSIMONY CRITERION</H2>
237	<P>
238	This program carries out unrooted parsimony (analogous to Wagner
239	trees) (Eck and Dayhoff, 1966; Kluge and Farris, 1969) on DNA
240	sequences. The method of Fitch (1971) is used to count the number of
241	changes of base needed on a given tree. The assumptions of this
242	method are exactly analogous to those of MIX:
243	<P>
244	<OL>
245	<LI>Each site evolves independently.
246	<LI>Different lineages evolve independently.
247	<LI>The probability of a base substitution at a given site is
248	small over the lengths of time involved in
249	a branch of the phylogeny.
250	<LI>The expected amounts of change in different branches of the phylogeny
251	do not vary by so much that two changes in a high-rate branch
252	are more probable than one change in a low-rate branch.
253	<LI>The expected amounts of change do not vary enough among sites that two
254	changes in one site are more probable than one change in another.
255	</OL>
256	<P>
257	That these are the assumptions of parsimony methods has been documented
258	in a series of papers of mine: (1973a, 1978b, 1979, 1981b,
259	1983b, 1988b). For an opposing view arguing that the parsimony methods
260	make no substantive
261	assumptions such as these, see the papers by Farris (1983) and Sober (1983a,
262	1983b), but also read the exchange between Felsenstein and Sober (1986).
263	<P>
264	Change from an occupied site to a deletion is counted as one
265	change. Reversion from a deletion to an occupied site is allowed and is also
266	counted as one change.
267	<P>
268	Below is a test data set, but we cannot show the
269	output it generates because of the interactive nature of the program.
270	<P>
271	<HR>
272	<P>
273	<H3>DATA SET</H3>
274	<P>
275	<TABLE><TR><TD BGCOLOR=white>
276	<PRE>
277	5 13
278	Alpha AACGUGGCCA AAU
279	Beta AAGGUCGCCA AAC
280	Gamma CAUUUCGUCA CAA
281	Delta GGUAUUUCGG CCU
282	Epsilon GGGAUCUCGG CCC
283	</PRE>
284	</TD></TR></TABLE>
285	</BODY>
286	</HTML>

Note: See TracBrowser for help on using the repository browser.

Context Navigation

source: trunk/GDE/PHYLIP/doc/dnamove.html

Download in other formats: