1 | #Please insert up references in the next lines (line starts with keyword UP) |
---|
2 | UP arb.hlp |
---|
3 | UP glossary.hlp |
---|
4 | |
---|
5 | #Please insert subtopic references (line starts with keyword SUB) |
---|
6 | SUB partial_sequences.hlp |
---|
7 | SUB pa_quick.hlp |
---|
8 | SUB pa_add.hlp |
---|
9 | |
---|
10 | # Hypertext links in helptext can be added like this: LINK{ref.hlp|http://add|bla@domain} |
---|
11 | |
---|
12 | #************* Title of helpfile !! and start of real helpfile ******** |
---|
13 | TITLE Add marked partial species |
---|
14 | |
---|
15 | OCCURRENCE ARB_PARSIMONY/Tree/Add Species to Tree/Add Marked Partial Species |
---|
16 | |
---|
17 | DESCRIPTION Use this to add sequences with partial sequence data into an existing tree. |
---|
18 | The current tree topology will not be optimized after insertion. |
---|
19 | |
---|
20 | The branchlengths of the added partial sequences represent the |
---|
21 | number of (weighted) mutations against the full sequence. |
---|
22 | |
---|
23 | Only the overlapping part of both sequences is taken into account and |
---|
24 | the distance to the FLS will be weighted by the length of the overlap. |
---|
25 | |
---|
26 | Different partial sequences will never attract each other, simply because |
---|
27 | they are not considered as 'possible neighbours' (see also section about |
---|
28 | partials in LINK{pa_quick.hlp}). |
---|
29 | |
---|
30 | As for non-partial sequences, when using a filter only the unfiltered |
---|
31 | positions will be taken into account to calculate the number of mutations. |
---|
32 | |
---|
33 | The insertion order has no effect on the resulting placement of the |
---|
34 | partial sequences. For each sequence, the best matching |
---|
35 | full-length sequence (FLS) is searched and afterwards all partial |
---|
36 | sequences are placed next to their detected FLS. |
---|
37 | |
---|
38 | Often partial sequences have equal distances to more than one FLS. This happens |
---|
39 | whenever two FLS only differ in alignment regions, where the partial sequence has |
---|
40 | no data. In that case a warning is printed ("Insertion of '<name>' is ambiguous") |
---|
41 | and one of the ambiguous insertion possibilities is chosen. |
---|
42 | This is more likely to happen for low amounts of sequence data (i.e. very short sequences). |
---|
43 | Consider to remove these species from the tree, as their placement might be |
---|
44 | meaningless or misleading. |
---|
45 | |
---|
46 | NOTES Adding species using this function marks them as "partial sequence" if |
---|
47 | they have no 'ARB_partial' entry yet. |
---|
48 | If they were already marked as FLS, the insertion is aborted with an error. |
---|
49 | |
---|
50 | All species already in tree, when calling this function, will be marked |
---|
51 | as FLS, if they have no 'ARB_partial' entry yet. |
---|
52 | |
---|
53 | Species with partial sequences have the field "ARB_partial" set to 1, species |
---|
54 | with FLS have the field "ARB_partial" set to 0. |
---|
55 | |
---|
56 | EXAMPLES None |
---|
57 | |
---|
58 | WARNINGS Calculating branch length after adding partial sequences |
---|
59 | will lead to wrong lengths. |
---|
60 | |
---|
61 | BUGS No bugs known |
---|