| 1 | # main topics: |
|---|
| 2 | UP arb.hlp |
|---|
| 3 | UP glossary.hlp |
|---|
| 4 | UP save.hlp |
|---|
| 5 | UP arb_intro.hlp |
|---|
| 6 | |
|---|
| 7 | # sub topics: |
|---|
| 8 | |
|---|
| 9 | SUB importift.hlp |
|---|
| 10 | |
|---|
| 11 | SUB universal_ift.hlp |
|---|
| 12 | SUB dssp_ift.hlp |
|---|
| 13 | SUB gde_flat_ift.hlp |
|---|
| 14 | |
|---|
| 15 | SUB arb_export.hlp |
|---|
| 16 | |
|---|
| 17 | # format described in ../help.readme |
|---|
| 18 | |
|---|
| 19 | |
|---|
| 20 | TITLE Import Foreign Data(bases) |
|---|
| 21 | |
|---|
| 22 | OCCURRENCE ARB_INTRO <Create and Import>, |
|---|
| 23 | ARB_NT/File/Import/Import from external format |
|---|
| 24 | |
|---|
| 25 | DESCRIPTION Reads foreign data(base) formats, creates a new ARB database, |
|---|
| 26 | and imports the foreign data. A selection of commonly used |
|---|
| 27 | foreign formats can be automatically identified. Data can be |
|---|
| 28 | imported from single or multiple files. |
|---|
| 29 | |
|---|
| 30 | Type a source file name to the 'Enter file name of foreign |
|---|
| 31 | database' subwindow. Use '*' and '?' as multiple and single |
|---|
| 32 | character wildcards to load a set of files, respectively. |
|---|
| 33 | Alternatively you may select a file from the directories and |
|---|
| 34 | files subwindow. |
|---|
| 35 | |
|---|
| 36 | Make a selection whether you want to import |
|---|
| 37 | |
|---|
| 38 | - a full genome flatfile (in GENBANK or EMBL format) or |
|---|
| 39 | - normal sequence files. |
|---|
| 40 | |
|---|
| 41 | In the second case select the file format from the |
|---|
| 42 | 'Select foreign database format' subwindow |
|---|
| 43 | or press the 'AUTO DETECT' button. |
|---|
| 44 | |
|---|
| 45 | If your file type is not in the list and you are only |
|---|
| 46 | interested in the sequence, try 'universal'. |
|---|
| 47 | |
|---|
| 48 | Enter the name and type of the destination alignment (see LINK{alignment.hlp}). |
|---|
| 49 | |
|---|
| 50 | Use different alignment names for different genes |
|---|
| 51 | to be able to store them in the same datebase while |
|---|
| 52 | still being able to distinguish them. |
|---|
| 53 | |
|---|
| 54 | Choose the default protection used for the imported data. |
|---|
| 55 | |
|---|
| 56 | Check "Create selection?" to store the names of all imported |
|---|
| 57 | species in a LINK{species_configs.hlp}. |
|---|
| 58 | |
|---|
| 59 | Click the 'FTS' selection button to define or select LINK{xferset.hlp}. |
|---|
| 60 | |
|---|
| 61 | Press the 'GO' button. |
|---|
| 62 | |
|---|
| 63 | SECTION Custom import filters |
|---|
| 64 | |
|---|
| 65 | You may create and store your own private import filters in |
|---|
| 66 | directory '~/.arb_prop/filter'. See LINK{importift.hlp} for information |
|---|
| 67 | about the import filter definition language. |
|---|
| 68 | |
|---|
| 69 | Press the 'Test' button to modify and test the selected filter |
|---|
| 70 | (see LINK{import_test.hlp}). |
|---|
| 71 | |
|---|
| 72 | If you want to import only parts of the data provided by an |
|---|
| 73 | existing import filter, use LINK{xferset.hlp}. |
|---|
| 74 | |
|---|
| 75 | NOTES Following file formats currently can be detected and loaded: |
|---|
| 76 | GENBANK, RDP: GENBANK, EBI and FastA |
|---|
| 77 | |
|---|
| 78 | Several uncommon file formats (including AE2, GCG and DSSP) |
|---|
| 79 | are kept in directory '$ARBHOME/lib/import/older'. To |
|---|
| 80 | make them available, copy or symlink them into ''$ARBHOME/lib/import' or into |
|---|
| 81 | your local filter directory '~/.arb_prop/filter/'. |
|---|
| 82 | |
|---|
| 83 | To import big new databases into an existing ARB database, convert it |
|---|
| 84 | to the ARB format first, save and merge it with the |
|---|
| 85 | ARB_MERGE tool. |
|---|
| 86 | |
|---|
| 87 | For importing other formats such as PHYLIP or PAUP into an |
|---|
| 88 | existing ARB database use the 'Import sequences using Readseq' |
|---|
| 89 | function accessible via the 'File' menu of the 'ARB_NT' main menu. |
|---|
| 90 | See LINK{agde_readseq.hlp}. |
|---|
| 91 | |
|---|
| 92 | If 'AUTO DETECT' does not find any format, selecting a format |
|---|
| 93 | by hand most likely wont help you (exception: universal format). |
|---|
| 94 | |
|---|
| 95 | WARNINGS When using 'AUTO DETECT', check if the correct format is detected. |
|---|
| 96 | RDP files may for instance be identified as GenBank. In such case |
|---|
| 97 | choose 'rdp.ift' manually. |
|---|
| 98 | |
|---|
| 99 | BUGS 'AUTO DETECT' looks for certain key-words in the files. If it can't |
|---|
| 100 | find these words, it does not accept the file, even if the file has |
|---|
| 101 | the correct format. This is especially true for the gcg format. |
|---|
| 102 | |
|---|