1 | #Please insert up references in the next lines (line starts with keyword UP) |
---|
2 | UP arb.hlp |
---|
3 | UP glossary.hlp |
---|
4 | |
---|
5 | #Please insert subtopic references (line starts with keyword SUB) |
---|
6 | #SUB subtopic.hlp |
---|
7 | |
---|
8 | # Hypertext links in helptext can be added like this: LINK{ref.hlp|http://add|bla@domain} |
---|
9 | |
---|
10 | #************* Title of helpfile !! and start of real helpfile ******** |
---|
11 | TITLE ARB_PHYLO - Create filters by base frequency |
---|
12 | |
---|
13 | OCCURRENCE ARB_NT/SAI/Create Filter/by Base Frequency |
---|
14 | |
---|
15 | DESCRIPTION Calculate base frequencies and/or a base frequency filter. |
---|
16 | |
---|
17 | Species scope: |
---|
18 | |
---|
19 | The base frequencies will be calculated upon those species |
---|
20 | that are marked when ARB_PHYLO is started up (changing marks |
---|
21 | later has no effect). |
---|
22 | |
---|
23 | Open Config/Filter to define how the filter will be created: |
---|
24 | |
---|
25 | Define the alignment range, for which the filter shall |
---|
26 | be generated using the input fields 'Start at column' |
---|
27 | and 'Stop at column'. Columns outside that range will get |
---|
28 | filtered, i.e. the generated filter will contain '0'. |
---|
29 | |
---|
30 | Define the similarity range, which will lead to a '1' in the |
---|
31 | resulting base frequency filter, by specifying the lower and |
---|
32 | upper percentage in the input field 'Minimal similarity' |
---|
33 | and 'Maximal similarity'. Similarity values outside that range |
---|
34 | will lead to '0' in the generated base frequency filter and will |
---|
35 | as well not be exported as base frequencies. (Note: It's not necessary to |
---|
36 | re-calculate, when you change this value) |
---|
37 | |
---|
38 | For non-regular sequence characters (gaps, ambiguity codes and lowercase characters) |
---|
39 | you may define how the frequency calculation acts, by setting their toggles to one of |
---|
40 | the following values: |
---|
41 | |
---|
42 | don't count (ignore) |
---|
43 | |
---|
44 | The corresponding characters are ignored (as if they were not present). |
---|
45 | |
---|
46 | if occurs most often => forget whole column |
---|
47 | |
---|
48 | If the corresponding character(s) occur(s) more often than any other |
---|
49 | regular sequence character, then skip column, i.e. write '0' to filter. |
---|
50 | |
---|
51 | if occurs => forget whole column |
---|
52 | |
---|
53 | If (one of) the corresponding character(s) occurs, then skip the column, |
---|
54 | i.e. write '0' to filter. |
---|
55 | |
---|
56 | count, but do NOT use as maximum |
---|
57 | |
---|
58 | The corresponding characters are counted like regular sequence characters, |
---|
59 | but never interpreted as "being the most homologous character". |
---|
60 | Compared with 'don't count' it will result in lower base frequencies, cause |
---|
61 | the overall number of characters occurring per column will be higher. |
---|
62 | |
---|
63 | treat as regular character |
---|
64 | |
---|
65 | This is only applicable to 'ambiguity codes'. Treats ambiguity codes like |
---|
66 | regular sequence characters. (Note: 'treat as uppercase char' setting for |
---|
67 | lowercase characters affects ambiguity codes as well, if treated as regular |
---|
68 | characters) |
---|
69 | |
---|
70 | treat as uppercase char |
---|
71 | |
---|
72 | This is only applicable to 'lowercase chars'. Simply treat regular lowercase |
---|
73 | sequence characters like uppercase characters. |
---|
74 | |
---|
75 | Press 'Calculate/Column Filter' to calculate the base frequencies. |
---|
76 | |
---|
77 | When calculated, three lines will appear below your sequences in the main window. |
---|
78 | Reading the columns of these 3 lines from top to bottom, they mark completely |
---|
79 | filtered columns by 'XXX' and else show the frequency of the most frequent base in percent. |
---|
80 | |
---|
81 | The color of each column indicates whether the frequency will be exported and respectively |
---|
82 | whether the column will be '0' or '1' in the base frequency filter. |
---|
83 | |
---|
84 | If pleased with the results, either export detailed base frequencies |
---|
85 | via 'File/Export frequencies' or export a base frequency filter via 'File/Export filter'. |
---|
86 | |
---|
87 | NOTES None |
---|
88 | |
---|
89 | EXAMPLES To create a filter which hits all columns containing sequence data, use the following settings: |
---|
90 | |
---|
91 | - all alignment columns |
---|
92 | - similarity 0 to 100 |
---|
93 | - both gaps: 'don't count (ignore)' |
---|
94 | - ambiguity codes: 'treat as regular character' |
---|
95 | - lowercase chars: 'treat as uppercase char' |
---|
96 | |
---|
97 | WARNINGS None |
---|
98 | |
---|
99 | BUGS No bugs known |
---|