| 1 | #Please insert up references in the next lines (line starts with keyword UP) |
|---|
| 2 | UP arb.hlp |
|---|
| 3 | UP glossary.hlp |
|---|
| 4 | |
|---|
| 5 | #Please insert subtopic references (line starts with keyword SUB) |
|---|
| 6 | SUB partial_sequences.hlp |
|---|
| 7 | SUB pa_quick.hlp |
|---|
| 8 | SUB pa_add.hlp |
|---|
| 9 | |
|---|
| 10 | # Hypertext links in helptext can be added like this: LINK{ref.hlp|http://add|bla@domain} |
|---|
| 11 | |
|---|
| 12 | #************* Title of helpfile !! and start of real helpfile ******** |
|---|
| 13 | TITLE Add marked partial species |
|---|
| 14 | |
|---|
| 15 | OCCURRENCE ARB_PARSIMONY/Tree/Add Species to Tree/Add Marked Partial Species |
|---|
| 16 | |
|---|
| 17 | DESCRIPTION Use this to add sequences with partial sequence data into an existing tree. |
|---|
| 18 | The current tree topology will not be optimized after insertion. |
|---|
| 19 | |
|---|
| 20 | The branchlengths of the added partial sequences represent the |
|---|
| 21 | number of (weighted) mutations against the full sequence. |
|---|
| 22 | |
|---|
| 23 | Only the overlapping part of both sequences is taken into account and |
|---|
| 24 | the distance to the FLS will be weighted by the length of the overlap. |
|---|
| 25 | |
|---|
| 26 | Different partial sequences will never attract each other, simply because |
|---|
| 27 | they are not considered as 'possible neighbours' (see also section about |
|---|
| 28 | partials in LINK{pa_quick.hlp}). |
|---|
| 29 | |
|---|
| 30 | As for non-partial sequences, when using a filter only the unfiltered |
|---|
| 31 | positions will be taken into account to calculate the number of mutations. |
|---|
| 32 | |
|---|
| 33 | The insertion order has no effect on the resulting placement of the |
|---|
| 34 | partial sequences. For each sequence, the best matching |
|---|
| 35 | full-length sequence (FLS) is searched and afterwards all partial |
|---|
| 36 | sequences are placed next to their detected FLS. |
|---|
| 37 | |
|---|
| 38 | Often partial sequences have equal distances to more than one FLS. This happens |
|---|
| 39 | whenever two FLS only differ in alignment regions, where the partial sequence has |
|---|
| 40 | no data. In that case a warning is printed ("Insertion of '<name>' is ambiguous") |
|---|
| 41 | and one of the ambiguous insertion possibilities is chosen. |
|---|
| 42 | This is more likely to happen for low amounts of sequence data (i.e. very short sequences). |
|---|
| 43 | Consider to remove these species from the tree, as their placement might be |
|---|
| 44 | meaningless or misleading. |
|---|
| 45 | |
|---|
| 46 | NOTES Adding species using this function marks them as "partial sequence" if |
|---|
| 47 | they have no 'ARB_partial' entry yet. |
|---|
| 48 | If they were already marked as FLS, the insertion is aborted with an error. |
|---|
| 49 | |
|---|
| 50 | All species already in tree, when calling this function, will be marked |
|---|
| 51 | as FLS, if they have no 'ARB_partial' entry yet. |
|---|
| 52 | |
|---|
| 53 | Species with partial sequences have the field "ARB_partial" set to 1, species |
|---|
| 54 | with FLS have the field "ARB_partial" set to 0. |
|---|
| 55 | |
|---|
| 56 | EXAMPLES None |
|---|
| 57 | |
|---|
| 58 | WARNINGS Calculating branch length after adding partial sequences |
|---|
| 59 | will lead to wrong lengths. |
|---|
| 60 | |
|---|
| 61 | BUGS No bugs known |
|---|