Bio-MUST-Apps-FortyTwo

 view release on metacpan or  search on metacpan

bin/prune-outliers.pl  view on Meta::CPAN

        for (my $t = $ARGV_min_ident; $t <= $ARGV_max_ident; $t += 0.1) {

            my @ids
                = grep { ($count_for{$_} / keys %count_for) >= $t }
                  grep {  $count_for{$_} >= $ARGV_min_hits } keys %count_for
            ;

            ### threshold: $t . ' - ' . scalar @ids . ' seqs kept out of ' . scalar keys %count_for

            my $ali = Ali->load($infile);
            $ali->dont_guess if $ARGV_noguessing;

            my $list = IdList->new( ids => \@ids );
            my $new_ali = $list->filtered_ali($ali);

            # create output dirs named after input dir and identity threshold
            my $subdir = dir( $outdir, $t )->relative;
            $subdir->mkpath();

            # store Ali in corresponding dir
            my ($filename) = fileparse($infile);

bin/prune-outliers.pl  view on Meta::CPAN

    n.default: 0.8

=item --min-hits=<n> | --min_hits=<n>

Minimum number of hits in the all-versus-all BLAST searches required for a
sequence to be retained in the output file [default: n.default].

=for Euclid: n.type: num
    n.default: 10

=item --[no]guessing

[Don't] guess whether sequences are aligned or not [default: yes].

=item --version

=item --usage

=item --help

=item --man

Print the usual program information



( run in 3.959 seconds using v1.01-cache-2.11-cpan-748bfb374f4 )