source: branches/sina/HELP_SOURCE/source/pt_server.hlp

Last change on this file was 19708, checked in by westram, 3 months ago
  • update doc+gui:
    • avoid terms "ARB_NT" (=former name of main arb window), "ARB_NTREE" and similar
      • instead talk about "ARB main window"
    • correct refs to 'ARB/Probes/PT_SERVER Admin'.
    • update 'What are marked species?'.
    • update protection level documentation (parts already done with previous commit).
  • Property svn:eol-style set to native
  • Property svn:keywords set to Author Date Id Revision
File size: 5.3 KB
Line 
1#       main topics:
2UP      arb.hlp
3UP      glossary.hlp
4
5#       sub topics:
6
7SUB     probeadmin.hlp
8SUB     probematch.hlp
9SUB     probedesign.hlp
10SUB     multiprobe.hlp
11
12# format described in ../help.readme
13
14
15TITLE           PT_SERVER: What Why and How
16
17OCCURRENCE      ARB/Probes/*
18                ARB_EDIT4/Edit/<misc aligners>
19
20DESCRIPTION     Probe design, probe matching, and searching the nearest relatives require a
21                lot of database searching.
22                Simply scanning the whole database for a specific target
23                takes several tens of seconds. Probe design and searching for
24                nearest relatives start thousands of search
25                operations. Waiting several weeks to get a result from the computer
26                makes every user hate the program, so better algorithms have
27                to be developed.
28
29        The basic ARB SEARCH ALGORITHM:
30
31        - When any (calling) program wants to scan a database, it does not do the job itself
32          but calls a special SEARCH_PATTERNS_IN_A_BIG_DATABASE_PROGRAM.
33
34              We named this program PT_SERVER ('Prefix tree server' or
35              synonymously 'Positional tree server'). The PT_SERVER searches for patterns
36              in special database files and sends all matches back to the calling program.
37
38        - Different databases have different PT_SERVERs.
39          You must choose a PT_SERVER to match the database you wish to search.
40          The file $ARBHOME/lib/arb_tcp.dat defines all possible choices.
41
42        - If there is no PT_SERVER running, one is automatically started.
43          A PT_SERVER does not scan the database of the calling program, but the database in $ARBHOME/lib/pts/*.arb.
44
45                  That means:
46
47                    If you have just entered a new sequence this sequence
48                    will not be found by the the nearest-relative search.
49
50                         This is normally very useful, as you only want to use old
51                         sequences as a reference in the aligning process.
52
53                    For generating probes it isn't useful, because new sequences have to
54                    be added to the PT_SERVER before designing or matching probes.
55                    Remember: By adding 1 new sequence to the database, any previously validated
56                    probe may become invalid.
57
58                          Re-build your PT-server frequently and re-evaluate any probes
59                          you've made before.
60
61# @@@ move down section 'create template'. this is information for experts.
62
63        Create a new PT_SERVER template:
64
65        - Edit the file $ARBHOME/lib/arb_tcp.dat using <ARB/Probes/PT_SERVER Admin/Configure>
66
67          Note: Make a copy of your changes because a new ARB installation may reinstall and overwrite your changed $ARBHOME/lib/arb_tcp.dat.
68
69        - Restart arb
70
71        Create a database for an existing template:
72
73        - Start ARB with the database you want to send to the PT_SERVER.
74        - Open the PT_SERVER ADMIN Window <ARB/Probes/PT_SERVER Admin>.
75        - Select a template
76        - Press <BUILD SERVER>: ARB will save the DB into $ARBHOME/lib/pts/name.arb
77        - Wait
78
79                ARB generates an index file $ARBHOME/lib/pts/name.arb.pt
80
81                If any '*.arb' file in $ARBHOME/lib/pts is newer than the
82                corresponding '*.arb.pt' ( == prefix tree) file the
83                '*.arb.pt' file will be updated as soon as the
84                PT_SERVER is started.
85
86        Updating a server:
87
88                 see 'Create a database for an existing template'.
89
90NOTES      Once started a PT_SERVER never stops. The only ways to stop a
91           PT_SERVER are:
92
93# @@@ check if it's still true that a PT_SERVER never stops
94
95                        - SOFT KILL (everybody) (only idle PT_SERVERS):
96
97                                - <ARB/Probes/PT_SERVER Admin/Stop server>
98
99                        - HARD KILL (supervisor) (all servers):
100
101                                - become superuser
102                                - enter 'ps -auxww |grep pt_server' at any shell
103                                - enter 'kill -9 PID'
104                                  ( to get help enter 'man kill' or 'man ps' )
105
106
107           WHY DOESN'T ARB USE BLAST?
108
109                The basic idea of blast is to create a finite automaton for all
110                search patterns and do the database search only once.
111                If someone wants to start a fuzzy search (allowing mismatches)
112                the size of this automata increases exponentially.
113
114                        For example:
115                        Three mismatches in a 20-base pattern would yield
116                        about 100,000 states in the automaton per search
117                        pattern. If probe design creates 1000 search
118                        patterns, and every state needs 10 bytes of
119                        computer memory, the search operation will require
120                        1 gigabyte RAM.
121
122
123WARNINGS        Do not modify any databases in $ARBHOME/lib/pts/ except
124                by the <PT_SERVER Admin> tools.
125                The reason is to keep the write protections of those files
126                consistent.
127
128                You will need a lot of swap space to run multiple PT_SERVER.
129
130BUGS            No bugs known
Note: See TracBrowser for help on using the repository browser.