| 1 | # main topics: |
|---|
| 2 | UP arb.hlp |
|---|
| 3 | UP glossary.hlp |
|---|
| 4 | |
|---|
| 5 | # sub topics: |
|---|
| 6 | #SUB subtopic.hlp |
|---|
| 7 | |
|---|
| 8 | # format described in ../help.readme |
|---|
| 9 | |
|---|
| 10 | |
|---|
| 11 | TITLE Search taxonomic groups |
|---|
| 12 | |
|---|
| 13 | OCCURRENCE ARB_NT/Tree/Search groups |
|---|
| 14 | |
|---|
| 15 | DESCRIPTION Allows to find taxonomic groups in trees. |
|---|
| 16 | |
|---|
| 17 | First select which trees shall be searched: |
|---|
| 18 | - current tree (displayed in main window) |
|---|
| 19 | - selected trees (use the '(select)' button to LINK{group_trees.hlp}) |
|---|
| 20 | - all trees |
|---|
| 21 | |
|---|
| 22 | Standard search mode is to 'list' all groups that 'match' the query. |
|---|
| 23 | Alternatives are: |
|---|
| 24 | - selecting 'dont match' instead of match will invert the overall query |
|---|
| 25 | - selecting 'add', 'keep' or 'remove' instead of 'list' will allow to |
|---|
| 26 | combine the results of multiple consecutive searches. |
|---|
| 27 | |
|---|
| 28 | Query expressions are handled similar to those in LINK{sp_search.hlp}. |
|---|
| 29 | While species-search operates on database fields, |
|---|
| 30 | groups-search mainly operates on values which are calculated on-the-fly. |
|---|
| 31 | Please refer to LINK{searching.hlp} for general information about |
|---|
| 32 | query expression syntax and about the combination of multiple query |
|---|
| 33 | expressions. |
|---|
| 34 | For details about the search criteria available for group search |
|---|
| 35 | see section below. |
|---|
| 36 | |
|---|
| 37 | Press ENTER or click 'Search' to start the search. |
|---|
| 38 | |
|---|
| 39 | The HITLIST will display all matching groups. |
|---|
| 40 | The number of hits is shown above the HITLIST. |
|---|
| 41 | |
|---|
| 42 | Click onto a result to select the group in the main window. |
|---|
| 43 | |
|---|
| 44 | Double click or press ENTER on a result to expand or collaps the selected group. |
|---|
| 45 | |
|---|
| 46 | Below the HITLIST is a radio button which allows to |
|---|
| 47 | choose the order (and the content) of the displayed results. |
|---|
| 48 | |
|---|
| 49 | The following criteria are available for sorting: |
|---|
| 50 | |
|---|
| 51 | - by name: sort alphabetically by name of group |
|---|
| 52 | - by nesting: sort numerically by level of group nesting (top level groups |
|---|
| 53 | like 'Bacteria' have level 0, their direct child group have level 1, etc.) |
|---|
| 54 | - by size: sort numerically by size (number of group-members) |
|---|
| 55 | - by marked: sort numerically by number of marked species (will not automatically |
|---|
| 56 | update if you change marks; rerun 'Search' to do so) |
|---|
| 57 | - by marked%: similar to mark, but uses percentage of marked |
|---|
| 58 | - by treename: sort alphabetically by name of tree |
|---|
| 59 | - by treeorder: sort by tree (in order defined by LINK{treeadm.hlp}) |
|---|
| 60 | - by hit: sort by hit-description (the hit description tells you |
|---|
| 61 | why an item was hit by query) |
|---|
| 62 | - by cluster: only has effect for duplicate search (see section below) |
|---|
| 63 | - by AID (see below) |
|---|
| 64 | - by keeled: keeled groups at top (see LINK{keeled_groups.hlp}) |
|---|
| 65 | - reverse: reverts previously selected sort order |
|---|
| 66 | |
|---|
| 67 | The most recently selected criterion will be most authoritative, |
|---|
| 68 | but previously selected criteria still remain active, i.e. when you first sort |
|---|
| 69 | by name and afterwards by treename, the results will group by trees, but inside |
|---|
| 70 | each tree-section groups are sorted by name. |
|---|
| 71 | |
|---|
| 72 | The 2 buttons above the HITLIST allow to modify it: |
|---|
| 73 | |
|---|
| 74 | * 'Remove' allows to manually remove unwanted hits from the |
|---|
| 75 | result list (useful before applying one of the actions |
|---|
| 76 | listed in the next section). |
|---|
| 77 | * 'Clear' empties the result list. |
|---|
| 78 | |
|---|
| 79 | SECTION Search criteria |
|---|
| 80 | |
|---|
| 81 | Group search supports the following search expressions: |
|---|
| 82 | |
|---|
| 83 | - 'groupname' matches against the name of each group |
|---|
| 84 | - 'parent' matches against the name of the direct parent group of |
|---|
| 85 | the queried group |
|---|
| 86 | - 'parent (any)' is true if any parent group of the queried group matches |
|---|
| 87 | the given expression |
|---|
| 88 | - 'parent (all)' is true if all parent groups of the queried group match |
|---|
| 89 | the given expression (a common use of this is to check for exclusion. |
|---|
| 90 | To e.g. check that no parent group matches '*bacteria*', |
|---|
| 91 | check that all parents mismatch '*bacteria*') |
|---|
| 92 | - 'nesting' allows to query the nesting-level of groups (top-level-groups |
|---|
| 93 | like 'Bacteria' have a nesting level of 0 (zero), their direct child-groups |
|---|
| 94 | have a nesting level of 1, ...) |
|---|
| 95 | - 'folded' allows to query for folded/unfolded groups (possible values are 0 and 1) |
|---|
| 96 | - 'size' matches against the groupsize (i.e. against the number of species inside a |
|---|
| 97 | group and its subgroups) |
|---|
| 98 | - 'marked' queries against the number of marked species inside a group |
|---|
| 99 | - 'marked%' queries against the percentage of marked species inside a group |
|---|
| 100 | - 'zombies' queries against the number of zombie species inside a group |
|---|
| 101 | - 'AID' queries against the average ingroup distance (see following section) |
|---|
| 102 | - 'keeled' queries against the keeled state (0=normal,1=keeled upper |
|---|
| 103 | son,2=keeled lower son; see LINK{keeled_groups.hlp}) |
|---|
| 104 | |
|---|
| 105 | SECTION Average ingroup distance (AID) |
|---|
| 106 | |
|---|
| 107 | For each group the average distance of all possible pairs of species |
|---|
| 108 | inside that group is retrieved from the tree structure and is provided |
|---|
| 109 | as criterion for group-search and to order results. |
|---|
| 110 | |
|---|
| 111 | The distance of two species is defined as the sum of the lengths |
|---|
| 112 | of all branches connecting the two species. |
|---|
| 113 | |
|---|
| 114 | SECTION Duplicate search |
|---|
| 115 | |
|---|
| 116 | Next to the 'Search' button is a selector which allows to restrict the listed |
|---|
| 117 | groups by analysing whether they are duplicates or not. |
|---|
| 118 | The different available modes are: |
|---|
| 119 | |
|---|
| 120 | - 'no' = deactivate duplicate search |
|---|
| 121 | - 'duplicate groups only' = activate duplicate search |
|---|
| 122 | - 'unique groups only' = list all group not reported by duplicate search |
|---|
| 123 | |
|---|
| 124 | The 'Configure' button provides detailed settings for duplicate search: |
|---|
| 125 | |
|---|
| 126 | - Min. size of duplicate cluster |
|---|
| 127 | |
|---|
| 128 | The minimum number of groups, which have to be strictly consistent |
|---|
| 129 | with any given duplication criteria (=core of cluster). |
|---|
| 130 | If that min. size isn't found, these groups will not be |
|---|
| 131 | listed in results. |
|---|
| 132 | |
|---|
| 133 | - Search duplicates |
|---|
| 134 | |
|---|
| 135 | Defines where duplicates are expected to occur. |
|---|
| 136 | |
|---|
| 137 | - inside same tree |
|---|
| 138 | |
|---|
| 139 | All groups of a cluster have to be members of the same tree. |
|---|
| 140 | Duplicates in other trees probably form their own cluster. |
|---|
| 141 | |
|---|
| 142 | - in different trees |
|---|
| 143 | |
|---|
| 144 | The core of the cluster will only consist of groups from |
|---|
| 145 | different trees (one hit per tree) and the whole cluster |
|---|
| 146 | will be discarded, if the required minimum size isn't reached. |
|---|
| 147 | For the final result all other duplicate will be added, |
|---|
| 148 | i.e. there may be more than one hit per tree. |
|---|
| 149 | |
|---|
| 150 | - anywhere |
|---|
| 151 | |
|---|
| 152 | - Ignore case? |
|---|
| 153 | |
|---|
| 154 | Define whether to ignore case when matching group names, |
|---|
| 155 | words or when checking against the list of ignored words. |
|---|
| 156 | |
|---|
| 157 | - Duplicates are names that |
|---|
| 158 | |
|---|
| 159 | Defines how duplicate groups are detected. Either |
|---|
| 160 | |
|---|
| 161 | - by matching the whole name or |
|---|
| 162 | - by matching single/multiple words. |
|---|
| 163 | |
|---|
| 164 | - Min. number of matching words |
|---|
| 165 | |
|---|
| 166 | If fewer words match between two compared groupnames, |
|---|
| 167 | they are counted as mismatch. |
|---|
| 168 | Hits with more matching words are preferred over such with fewer words. |
|---|
| 169 | |
|---|
| 170 | - Word separators |
|---|
| 171 | |
|---|
| 172 | Defines characters which separate words. |
|---|
| 173 | Should normally contain a SPACE character. |
|---|
| 174 | |
|---|
| 175 | - Ignored words |
|---|
| 176 | |
|---|
| 177 | Specifes a list of words that will be completely ignored when |
|---|
| 178 | matching wordwise. |
|---|
| 179 | |
|---|
| 180 | Sorting results 'by cluster' will list related duplicate-groups |
|---|
| 181 | next to each other. It will also add a new column showing the |
|---|
| 182 | unique IDs of each cluster of groups. |
|---|
| 183 | |
|---|
| 184 | SECTION Working with listed groups |
|---|
| 185 | |
|---|
| 186 | Right of the HITLIST are several buttons allowing to work with the found results: |
|---|
| 187 | |
|---|
| 188 | * 'Rename ...' allows to LINK{group_rename.hlp}. |
|---|
| 189 | * 'Expand listed' will expand all listed groups and their parent groups. |
|---|
| 190 | * 'Expand listed collapse rest' does the same and additionally folds all other groups. |
|---|
| 191 | * 'Expand parents' will expand the parents of all listed groups, i.e. all listed groups |
|---|
| 192 | will become visible. |
|---|
| 193 | * 'Collapse listed' will collapse all listed groups. |
|---|
| 194 | * The button 'Mark', 'Unmark' and 'Inv' allow to change marks of species |
|---|
| 195 | contained in listed groups. Use the option-menu below these buttons and select |
|---|
| 196 | - 'selected' to operate on all species contained in the currently selected group, |
|---|
| 197 | - 'any listed' to operate on all species contained in ANY of the listed groups, |
|---|
| 198 | - 'all listed' to operate on all species contained in ALL of the listed groups or |
|---|
| 199 | - 'database' to operate on all species in the database. |
|---|
| 200 | * 'Destroy selected group' will delete that group from the tree. |
|---|
| 201 | Be aware that this action is currently irreversible (see BUGS below). |
|---|
| 202 | * 'Destroy all listed groups' will delete all groups currently listed in |
|---|
| 203 | the HITLIST. |
|---|
| 204 | |
|---|
| 205 | NOTES Please configure auto-focus options in LINK{nt_tree_settings.hlp} (esp. auto-unfold) |
|---|
| 206 | to improve the usability of the group search. |
|---|
| 207 | |
|---|
| 208 | Use the config manager icon (see LINK{prop_configs.hlp}) to |
|---|
| 209 | store/restore group search and rename settings. |
|---|
| 210 | |
|---|
| 211 | EXAMPLES Common combinations of expanding and collapsing groups: |
|---|
| 212 | |
|---|
| 213 | * to collapse ALL groups of ALL searched trees press |
|---|
| 214 | - 'Clear' to empty the HITLIST and |
|---|
| 215 | - 'Expand listed collapse rest'. |
|---|
| 216 | * to expand parent groups of listed and fold the rest press |
|---|
| 217 | - 'Expand listed collapse rest', |
|---|
| 218 | - 'Collapse listed' and |
|---|
| 219 | - 'Expand parents'. |
|---|
| 220 | |
|---|
| 221 | Common combinations for marking specific group members: |
|---|
| 222 | |
|---|
| 223 | * to mark all species NOT contained in all listed groups use |
|---|
| 224 | - 'Mark' + 'any listed' followed by |
|---|
| 225 | - 'Unmark' + 'all listed' |
|---|
| 226 | |
|---|
| 227 | WARNINGS Searching for duplicates with wordwise mode and expecting only a few words (e.g. 2 words for |
|---|
| 228 | tree in SSURef_NR99_128_SILVA), may take very, very long. Expecting more words will speed up the search. |
|---|
| 229 | |
|---|
| 230 | BUGS UNDO does not work for deleting groups (http://bugs.arb-home.de/ticket/480) |
|---|