1 | #Please insert up references in the next lines (line starts with keyword UP) |
---|
2 | UP arb.hlp |
---|
3 | UP glossary.hlp |
---|
4 | |
---|
5 | #Please insert subtopic references (line starts with keyword SUB) |
---|
6 | #SUB group_rename.hlp |
---|
7 | |
---|
8 | # Hypertext links in helptext can be added like this: LINK{ref.hlp|http://add|bla@domain} |
---|
9 | |
---|
10 | #************* Title of helpfile !! and start of real helpfile ******** |
---|
11 | TITLE Search taxonomic groups |
---|
12 | |
---|
13 | OCCURRENCE ARB_NT/Tree/Search groups |
---|
14 | |
---|
15 | DESCRIPTION Allows to find taxonomic groups in trees. |
---|
16 | |
---|
17 | First select which trees shall be searched: |
---|
18 | - current tree (displayed in main window) |
---|
19 | - selected trees (use the '(select)' button to LINK{group_trees.hlp}) |
---|
20 | - all trees |
---|
21 | |
---|
22 | Standard search mode is to 'list' all groups that 'match' the query. |
---|
23 | Alternatives are: |
---|
24 | - selecting 'dont match' instead of match will invert the overall query |
---|
25 | - selecting 'add', 'keep' or 'remove' instead of 'list' will allow to |
---|
26 | combine the results of multiple consecutive searches. |
---|
27 | |
---|
28 | Query expressions are handled similar to those in LINK{sp_search.hlp}. |
---|
29 | While species-search operates on database fields, |
---|
30 | groups-search mainly operates on values which are calculated on-the-fly. |
---|
31 | Please refer to LINK{searching.hlp} for general information about |
---|
32 | query expression syntax and about the combination of multiple query |
---|
33 | expressions. |
---|
34 | For details about the search criteria available for group search |
---|
35 | see section below. |
---|
36 | |
---|
37 | Press ENTER or click 'Search' to start the search. |
---|
38 | |
---|
39 | The HITLIST will display all matching groups. |
---|
40 | The number of hits is shown above the HITLIST. |
---|
41 | |
---|
42 | Click onto a result to select the group in the main window. |
---|
43 | |
---|
44 | Double click or press ENTER on a result to expand or collaps the selected group. |
---|
45 | |
---|
46 | Below the HITLIST is a radio button which allows to |
---|
47 | choose the order (and the content) of the displayed results. |
---|
48 | |
---|
49 | The following criteria are available for sorting: |
---|
50 | |
---|
51 | - by name: sort alphabetically by name of group |
---|
52 | - by nesting: sort numerically by level of group nesting (top level groups |
---|
53 | like 'Bacteria' have level 0, their direct child group have level 1, etc.) |
---|
54 | - by size: sort numerically by size (number of group-members) |
---|
55 | - by marked: sort numerically by number of marked species (will not automatically |
---|
56 | update if you change marks; rerun 'Search' to do so) |
---|
57 | - by marked%: similar to mark, but uses percentage of marked |
---|
58 | - by treename: sort alphabetically by name of tree |
---|
59 | - by treeorder: sort by tree (in order defined by LINK{treeadm.hlp}) |
---|
60 | - by hit: sort by hit-description (the hit description tells you |
---|
61 | why an item was hit by query) |
---|
62 | - by cluster: only has effect for duplicate search (see section below) |
---|
63 | - by AID (see below) |
---|
64 | - by keeled: keeled groups at top (see LINK{keeled_groups.hlp}) |
---|
65 | - reverse: reverts previously selected sort order |
---|
66 | |
---|
67 | The most recently selected criterion will be most authoritative, |
---|
68 | but previously selected criteria still remain active, i.e. when you first sort |
---|
69 | by name and afterwards by treename, the results will group by trees, but inside |
---|
70 | each tree-section groups are sorted by name. |
---|
71 | |
---|
72 | The 2 buttons above the HITLIST allow to modify it: |
---|
73 | |
---|
74 | * 'Remove' allows to manually remove unwanted hits from the |
---|
75 | result list (useful before applying one of the actions |
---|
76 | listed in the next section). |
---|
77 | * 'Clear' empties the result list. |
---|
78 | |
---|
79 | SECTION Search criteria |
---|
80 | |
---|
81 | Group search supports the following search expressions: |
---|
82 | |
---|
83 | - 'groupname' matches against the name of each group |
---|
84 | - 'parent' matches against the name of the direct parent group of |
---|
85 | the queried group |
---|
86 | - 'parent (any)' is true if any parent group of the queried group matches |
---|
87 | the given expression |
---|
88 | - 'parent (all)' is true if all parent groups of the queried group match |
---|
89 | the given expression (a common use of this is to check for exclusion. |
---|
90 | To e.g. check that no parent group matches '*bacteria*', |
---|
91 | check that all parents mismatch '*bacteria*') |
---|
92 | - 'nesting' allows to query the nesting-level of groups (top-level-groups |
---|
93 | like 'Bacteria' have a nesting level of 0 (zero), their direct child-groups |
---|
94 | have a nesting level of 1, ...) |
---|
95 | - 'folded' allows to query for folded/unfolded groups (possible values are 0 and 1) |
---|
96 | - 'size' matches against the groupsize (i.e. against the number of species inside a |
---|
97 | group and its subgroups) |
---|
98 | - 'marked' queries against the number of marked species inside a group |
---|
99 | - 'marked%' queries against the percentage of marked species inside a group |
---|
100 | - 'zombies' queries against the number of zombie species inside a group |
---|
101 | - 'AID' queries against the average ingroup distance (see following section) |
---|
102 | - 'keeled' queries against the keeled state (0=normal,1=keeled upper |
---|
103 | son,2=keeled lower son; see LINK{keeled_groups.hlp}) |
---|
104 | |
---|
105 | SECTION Average ingroup distance (AID) |
---|
106 | |
---|
107 | For each group the average distance of all possible pairs of species |
---|
108 | inside that group is retrieved from the tree structure and is provided |
---|
109 | as criterion for group-search and to order results. |
---|
110 | |
---|
111 | The distance of two species is defined as the sum of the lengths |
---|
112 | of all branches connecting the two species. |
---|
113 | |
---|
114 | SECTION Duplicate search |
---|
115 | |
---|
116 | Next to the 'Search' button is a selector which allows to restrict the listed |
---|
117 | groups by analysing whether they are duplicates or not. |
---|
118 | The different available modes are: |
---|
119 | |
---|
120 | - 'no' = deactivate duplicate search |
---|
121 | - 'duplicate groups only' = activate duplicate search |
---|
122 | - 'unique groups only' = list all group not reported by duplicate search |
---|
123 | |
---|
124 | The 'Configure' button provides detailed settings for duplicate search: |
---|
125 | |
---|
126 | - Min. size of duplicate cluster |
---|
127 | |
---|
128 | The minimum number of groups, which have to be strictly consistent |
---|
129 | with any given duplication criteria (=core of cluster). |
---|
130 | If that min. size isn't found, these groups will not be |
---|
131 | listed in results. |
---|
132 | |
---|
133 | - Search duplicates |
---|
134 | |
---|
135 | Defines where duplicates are expected to occur. |
---|
136 | |
---|
137 | - inside same tree |
---|
138 | |
---|
139 | All groups of a cluster have to be members of the same tree. |
---|
140 | Duplicates in other trees probably form their own cluster. |
---|
141 | |
---|
142 | - in different trees |
---|
143 | |
---|
144 | The core of the cluster will only consist of groups from |
---|
145 | different trees (one hit per tree) and the whole cluster |
---|
146 | will be discarded, if the required minimum size isn't reached. |
---|
147 | For the final result all other duplicate will be added, |
---|
148 | i.e. there may be more than one hit per tree. |
---|
149 | |
---|
150 | - anywhere |
---|
151 | |
---|
152 | - Ignore case? |
---|
153 | |
---|
154 | Define whether to ignore case when matching group names, |
---|
155 | words or when checking against the list of ignored words. |
---|
156 | |
---|
157 | - Duplicates are names that |
---|
158 | |
---|
159 | Defines how duplicate groups are detected. Either |
---|
160 | |
---|
161 | - by matching the whole name or |
---|
162 | - by matching single/multiple words. |
---|
163 | |
---|
164 | - Min. number of matching words |
---|
165 | |
---|
166 | If fewer words match between two compared groupnames, |
---|
167 | they are counted as mismatch. |
---|
168 | Hits with more matching words are preferred over such with fewer words. |
---|
169 | |
---|
170 | - Word separators |
---|
171 | |
---|
172 | Defines characters which separate words. |
---|
173 | Should normally contain a SPACE character. |
---|
174 | |
---|
175 | - Ignored words |
---|
176 | |
---|
177 | Specifes a list of words that will be completely ignored when |
---|
178 | matching wordwise. |
---|
179 | |
---|
180 | Sorting results 'by cluster' will list related duplicate-groups |
---|
181 | next to each other. It will also add a new column showing the |
---|
182 | unique IDs of each cluster of groups. |
---|
183 | |
---|
184 | SECTION Working with listed groups |
---|
185 | |
---|
186 | Right of the HITLIST are several buttons allowing to work with the found results: |
---|
187 | |
---|
188 | * 'Rename ...' allows to LINK{group_rename.hlp}. |
---|
189 | * 'Expand listed' will expand all listed groups and their parent groups. |
---|
190 | * 'Expand listed collapse rest' does the same and additionally folds all other groups. |
---|
191 | * 'Expand parents' will expand the parents of all listed groups, i.e. all listed groups |
---|
192 | will become visible. |
---|
193 | * 'Collapse listed' will collapse all listed groups. |
---|
194 | * The button 'Mark', 'Unmark' and 'Inv' allow to change marks of species |
---|
195 | contained in listed groups. Use the option-menu below these buttons and select |
---|
196 | - 'selected' to operate on all species contained in the currently selected group, |
---|
197 | - 'any listed' to operate on all species contained in ANY of the listed groups, |
---|
198 | - 'all listed' to operate on all species contained in ALL of the listed groups or |
---|
199 | - 'database' to operate on all species in the database. |
---|
200 | * 'Destroy selected group' will delete that group from the tree. |
---|
201 | Be aware that this action is currently irreversible (see BUGS below). |
---|
202 | * 'Destroy all listed groups' will delete all groups currently listed in |
---|
203 | the HITLIST. |
---|
204 | |
---|
205 | NOTES Please configure auto-focus options in LINK{nt_tree_settings.hlp} (esp. auto-unfold) |
---|
206 | to improve the usability of the group search. |
---|
207 | |
---|
208 | Use the config manager icon (see LINK{prop_configs.hlp}) to |
---|
209 | store/restore group search and rename settings. |
---|
210 | |
---|
211 | EXAMPLES Common combinations of expanding and collapsing groups: |
---|
212 | |
---|
213 | * to collapse ALL groups of ALL searched trees press |
---|
214 | - 'Clear' to empty the HITLIST and |
---|
215 | - 'Expand listed collapse rest'. |
---|
216 | * to expand parent groups of listed and fold the rest press |
---|
217 | - 'Expand listed collapse rest', |
---|
218 | - 'Collapse listed' and |
---|
219 | - 'Expand parents'. |
---|
220 | |
---|
221 | Common combinations for marking specific group members: |
---|
222 | |
---|
223 | * to mark all species NOT contained in all listed groups use |
---|
224 | - 'Mark' + 'any listed' followed by |
---|
225 | - 'Unmark' + 'all listed' |
---|
226 | |
---|
227 | WARNINGS Searching for duplicates with wordwise mode and expecting only a few words (e.g. 2 words for |
---|
228 | tree in SSURef_NR99_128_SILVA), may take very, very long. Expecting more words will speed up the search. |
---|
229 | |
---|
230 | BUGS UNDO does not work for deleting groups (http://bugs.arb-home.de/ticket/480) |
---|