source: trunk/GDE/SINA/builddir/doc/source/CHANGELOG.rst

Last change on this file was 19170, checked in by westram, 2 years ago
  • sina source
    • unpack + remove tarball
    • no longer ignore sina builddir.
File size: 8.5 KB

Changelog

?
.. program:: sina

Version 1.7.2:

Version 1.7.1:

Version 1.7.0:

Version 1.6.1:

Version 1.6.0:

  • make internal kmer engine the default (:issue:`23`)

    ?
  • add pretty progress monitor

  • run search stage in parallel (:issue:`32`)

    ?
  • :option:`--num-pts` defaults to number of cores available (previous: 1)

    ?
  • add :option:`--search-engine` setting search engine for search module

    ?
  • always run internal engine without thread limit

  • split num pt servers evently between search and align

  • use fixed point format for logging (instead of scientific format)

  • rewrote family selection (use :option:`--fs-oldmatch` for old implementation)

    ?
  • replace boost::mutex with std::mutex (c++11)

  • fix :option:`--show-dist` if alignment width don't match

    ?
  • fix race starting pt servers (library code not threadsafe)

  • fix engine type not shown in :option:`--show-conf`

    ?
  • fix writing to ARB sequence cache not threadsafe

  • use lock free map for ARB sequence cache (speedup)

  • add pod buffer to replace std::vector (speedup)

  • add FIFO cache for kmer search results (speedup for :option:`--search` and :option:`--turn`)

    ??

Version 1.5.0:

  • update documentation (:issue:`20`)

    ?
  • reinstate :option:`--show-dist`

    ?
  • reinstate :option:`--fs-msc-max`

    ?
  • add choice exact to :option:`--search-iupac`

    ?
  • change default for :option:`--search-kmer-len` to match :option:`--fs-kmer-len`

    ??
  • parallelize launch of background PT servers

  • lower memory usage: - avoid redundant sequence caching by libARBDB - use compact aligned base (50% on internal sequence cache)

  • improve internal kmer search performace - add caching of kmer index on disk - parallelize kmer index construction - add presence/absence optimization

  • fix field align_ident_slv added for 100% matches even when not enabled

  • fix crash on overhang past alignment edge

  • fix libARBDB writing to stdout, clobbering sequence output

  • fix out-of-bounds access on iterator in NAST implementation

  • remove dependency on boost serialization library

  • build release binaries with GCC 7 and C++11 ABI

  • add integration tests watching for accuracy regressions (:issue:`25`)

    ?

Version 1.4.0:

Version 1.3.5:

  • report number of references discarded due to configured constraints

  • fix crash if no acceptable references found for a query

  • fix :option:`--search` causes a program option error (:issue:`28`)

    ??
  • fix race condition in terminating PT server

Version 1.3.4:

Version 1.3.3:

Version 1.3.2:

Version 1.3.1:

  • add OSX support
  • change license to GPL
  • remove limitation on ARB integration mode
  • move revisioning to git
  • fix compilation with CLANG

Version 1.3.0:

  • dropped support for ARB 5.x

Version 1.2.13:

  • uppercase aligned bases if lowercase=unaligned
  • fix manual typos (thx to Mohamed El-hadidi)
  • search-db defaults to pt-db
  • search-port defaults to pt-port if search/align DBs are identical fixes unnecessary start of two PT servers (thx to Christian Wurzbacher)
  • change default for lca-quorum to 0.7
  • change default for search-min-sim to 0.7
  • be smarter about recoginizing FASTA format files and creating output FASTA name (".frn", ".fna", ".fas", "/dev/stdin" as input, ".fasta.aligned" and "/dev/stdout" as output)
  • write sequence ID in first column of CSV output
  • add fasta-block and fasta-idx options allowing to process only specific smaller blocks of larger fasta files (for parallelization)

Version 1.2.12:

  • use same ARB field type for align_ident_slv as SILVA uses
  • skip sequences with non-IUPAC characters when building reference and when loading sequences to be aligned from ARB file (complaint is issued on stderr)

Version 1.2.11:

Version 1.2.10:

  • added option :option:`--fs-no-graph` Uses a column profile with PSP score as template (instead of the POA method) This feature is merely for completeness sake and evaluation. With SILVA SSU the POA based method is much more accurate.

    ?
  • changed default for :option:`--fs-cover-gene` to 0 (faster) The cover-gene feature only makes sense if :option:--gene-start` and :option:`--gene-end` are set such that the reference actually contains sequences touching these boundaries. If this is not the case, the reference selection algorithm wastes time with a futile search.

    ??
  • use unix socket as default for :option:`--ptport` and :option:`--search-port` Using "/tmp/sina_<PID>.socket" is a more suitable default than "localhost:4040", as it runs less risk of accessing a different PT server than intended.

    ??
  • fix inconsistencies in generated meta data fields and log output

  • updated ARB components to SVN revision 8225

  • added option :option:`--write-used-rels` The field used_rels is interpreted by ARB as the field containing the reference sequences that were used during alignment.

    ?
  • no longer write full_name content when exporting meta data encoded in the FASTA header

  • re-add clamped align_quality_slv

  • fix score normalization (scores > 1 were possible when fs-weight > 0)

  • fix calculation of bp score when orig-db no set (default ptdb)

  • added option :option:`--fs-req-gaps` n Ignores reference sequences having less than n gaps before the last base. I.e.: Ignores "unaligned" sequences. This is useful when running SINA out of ARB to prevent accidental alignment against unaligned sequences.

    ?
  • added options :option:`--search-iupac`, :option:`--search-correction` and :option:`--search-cover` These options configure how the "distance" (identity, similarity, ...) is calculated.

    ???
  • skip FASTA input sequences that contain invalid characters (i.e. not IUPAC encoded bases, '.', '-' or white space)

Version 1.2.9:

  • fixed sequence not filled with gap characters after copying full alignment

Version 1.2.8:

  • made --extra-fields actually load multiple fields from arb file
  • fixed sequence not filled with gap characters after copying subalignment
  • updated ARB components to SVN revision 7985
  • added changelog :)
Note: See TracBrowser for help on using the repository browser.