| 1 | # main topics: |
|---|
| 2 | UP arb.hlp |
|---|
| 3 | UP glossary.hlp |
|---|
| 4 | |
|---|
| 5 | # sub topics: |
|---|
| 6 | #SUB subtopic.hlp |
|---|
| 7 | |
|---|
| 8 | # format described in ../help.readme |
|---|
| 9 | |
|---|
| 10 | |
|---|
| 11 | TITLE ARB_PHYLO: Create filter by base frequency |
|---|
| 12 | |
|---|
| 13 | OCCURRENCE ARB_NT/SAI/Create Filter/by Base Frequency |
|---|
| 14 | |
|---|
| 15 | DESCRIPTION Calculate base frequencies and/or a base frequency filter. |
|---|
| 16 | |
|---|
| 17 | Species scope: |
|---|
| 18 | |
|---|
| 19 | The base frequencies will be calculated upon those species |
|---|
| 20 | that are marked when ARB_PHYLO is started up (changing marks |
|---|
| 21 | later has no effect). |
|---|
| 22 | |
|---|
| 23 | Open Config/Filter to define how the filter will be created: |
|---|
| 24 | |
|---|
| 25 | Define the alignment range, for which the filter shall |
|---|
| 26 | be generated using the input fields 'Start at column' |
|---|
| 27 | and 'Stop at column'. Columns outside that range will get |
|---|
| 28 | filtered, i.e. the generated filter will contain '0'. |
|---|
| 29 | |
|---|
| 30 | Define the similarity range, which will lead to a '1' in the |
|---|
| 31 | resulting base frequency filter, by specifying the lower and |
|---|
| 32 | upper percentage in the input field 'Minimal similarity' |
|---|
| 33 | and 'Maximal similarity'. Similarity values outside that range |
|---|
| 34 | will lead to '0' in the generated base frequency filter and will |
|---|
| 35 | as well not be exported as base frequencies. (Note: It's not necessary to |
|---|
| 36 | re-calculate, when you change this value) |
|---|
| 37 | |
|---|
| 38 | For non-regular sequence characters (gaps, ambiguity codes and lowercase characters) |
|---|
| 39 | you may define how the frequency calculation acts, by setting their toggles to one of |
|---|
| 40 | the following values: |
|---|
| 41 | |
|---|
| 42 | don't count (ignore) |
|---|
| 43 | |
|---|
| 44 | The corresponding characters are ignored (as if they were not present). |
|---|
| 45 | |
|---|
| 46 | if occurs most often => forget whole column |
|---|
| 47 | |
|---|
| 48 | If the corresponding character(s) occur(s) more often than any other |
|---|
| 49 | regular sequence character, then skip column, i.e. write '0' to filter. |
|---|
| 50 | |
|---|
| 51 | if occurs => forget whole column |
|---|
| 52 | |
|---|
| 53 | If (one of) the corresponding character(s) occurs, then skip the column, |
|---|
| 54 | i.e. write '0' to filter. |
|---|
| 55 | |
|---|
| 56 | count, but do NOT use as maximum |
|---|
| 57 | |
|---|
| 58 | The corresponding characters are counted like regular sequence characters, |
|---|
| 59 | but never interpreted as "being the most homologous character". |
|---|
| 60 | Compared with 'don't count' it will result in lower base frequencies, cause |
|---|
| 61 | the overall number of characters occurring per column will be higher. |
|---|
| 62 | |
|---|
| 63 | treat as regular character |
|---|
| 64 | |
|---|
| 65 | This is only applicable to 'ambiguity codes'. Treats ambiguity codes like |
|---|
| 66 | regular sequence characters. (Note: 'treat as uppercase char' setting for |
|---|
| 67 | lowercase characters affects ambiguity codes as well, if treated as regular |
|---|
| 68 | characters) |
|---|
| 69 | |
|---|
| 70 | treat as uppercase char |
|---|
| 71 | |
|---|
| 72 | This is only applicable to 'lowercase chars'. Simply treat regular lowercase |
|---|
| 73 | sequence characters like uppercase characters. |
|---|
| 74 | |
|---|
| 75 | Press 'Calculate/Column Filter' to calculate the base frequencies. |
|---|
| 76 | |
|---|
| 77 | When calculated, three lines will appear below your sequences in the main window. |
|---|
| 78 | Reading the columns of these 3 lines from top to bottom, they mark completely |
|---|
| 79 | filtered columns by 'XXX' and else show the frequency of the most frequent base in percent. |
|---|
| 80 | |
|---|
| 81 | The color of each column indicates whether the frequency will be exported and respectively |
|---|
| 82 | whether the column will be '0' or '1' in the base frequency filter. |
|---|
| 83 | |
|---|
| 84 | If pleased with the results, either export detailed base frequencies |
|---|
| 85 | via 'File/Export frequencies' or export a base frequency filter via 'File/Export filter'. |
|---|
| 86 | |
|---|
| 87 | NOTES None |
|---|
| 88 | |
|---|
| 89 | EXAMPLES To create a filter which hits all columns containing sequence data, use the following settings: |
|---|
| 90 | |
|---|
| 91 | - all alignment columns |
|---|
| 92 | - similarity 0 to 100 |
|---|
| 93 | - both gaps: 'don't count (ignore)' |
|---|
| 94 | - ambiguity codes: 'treat as regular character' |
|---|
| 95 | - lowercase chars: 'treat as uppercase char' |
|---|
| 96 | |
|---|
| 97 | WARNINGS None |
|---|
| 98 | |
|---|
| 99 | BUGS No bugs known |
|---|