Skip to content

protr 1.6-1

Compare
Choose a tag to compare
@nanxstats nanxstats released this 24 Feb 19:18
· 114 commits to master since this release

Improvements

  • Added a new argument batches to parSeqSim(). The new argument supports breaking down the pairwise similarity computation into smaller batches. This is useful when you have a large number of protein sequences, enough number of CPU cores, but not enough RAM to compute and hold all the pairwise similarities in a single batch. Also, use the other new argument verbose to track the computation progress.

New Features

  • Added a new function parSeqSimDisk(). Compared to the in-memory version parSeqSim(), this new function caches the partial results in each batch to the hard drive and merges the results together in the end. This could further reduce the memory usage for parallel similarity computations involving a large number of protein sequences.

Bug Fixes

  • Fixed an issue in parGOSim() that will create minor numerical inconsistencies in results due to argument matching.