source: tags/ms_r16q4/HELP_SOURCE/oldhelp/seq_quality.hlp

Last change on this file was 10842, checked in by westram, 11 years ago
  • Property svn:eol-style set to native
  • Property svn:keywords set to Author Date Id Revision
File size: 3.2 KB
Line 
1#Please insert up references in the next lines (line starts with keyword UP)
2UP      arb.hlp
3UP      glossary.hlp
4
5#Please insert subtopic references  (line starts with keyword SUB)
6#SUB    subtopic.hlp
7
8# Hypertext links in helptext can be added like this: LINK{ref.hlp|http://add|bla@domain}
9
10#************* Title of helpfile !! and start of real helpfile ********
11TITLE           Calculate sequence quality
12
13OCCURRENCE      ARB_NT/Sequence/Calculate sequence quality
14
15DESCRIPTION     'Calculate sequence quality' tries to measure the quality of sequences and
16                the quality their alignment.
17
18        HANDLING:
19
20                Fill in the values you think are appropriate.
21                The default values are the values that worked best in the first test runs.
22                Many criteria are evaluated (see 'THE VALUES' below for details).
23
24                A final "quality-value" (percentage) for each sequence is calculated
25                and all sequences below the given threshold may get marked.
26               
27        HOW IT WORKS:
28
29                In the section "weights" you have quite a few options to fill in.
30
31                These are some of the criteria used to evaluate the quality of the sequences..
32                The values represent the share of the criteria in the final evaluation-formula.
33                All values represent percentages, therefore all values together should sum up to 100.
34
35        THE VALUES:
36
37                Base analysis:
38
39                        This is the number of bases that are stored in the sequence. "-" and "." are
40                        not counted.
41
42                Deviation:
43
44                        This is the deviation of the number of bases from a sequence to the average number
45                        of bases in a group.
46
47                No Helices:
48
49                        This is the number of positions in a sequence where no helix structure can be built.
50
51                Consensus:
52
53                        For each named group found in the tree (selected below)
54                        a consensus sequence is calculated.
55
56                        Every species' sequence is compared against the consensus sequences
57                        of all groups of which the species is a member.
58
59                        That comparison uses conformity with and deviation from the consensus sequence.
60
61#                       A consensus is computed from sequences in one group and then from subgroups to groups.
62#                        So "multilevel" consensi are generated.
63#                        The value consists of two analysis: Every sequence is tested against every level of the consensus.
64#                        Conformity and deviation from the consensus are measured.
65
66                IUPAC:
67
68                        This is the number of iupac-codes stored in a sequence.
69
70                GC proportion:
71
72                        This is the deviation in GC proportion from a sequence to group.
73
74NOTES           Generally speaking the consensus is the mightiest tool to evaluate the quality. So keep the
75                percentage high unless you know what you're doing or you want to evaluate with just one or
76                two values.
77
78                Be aware that the computation is very complex and can easily take hours to finish.
79                So if you don't see the statusbar moving in the first ten minutes it just means
80                that you are analyzing a huge database.
81
82EXAMPLES        None
83
84WARNINGS        None
85
86BUGS            No bugs known
Note: See TracBrowser for help on using the repository browser.