Acme-CPANModules-Similarity

lib/Acme/CPANModules/Similarity.pm

** Between strings/texts

Similarity between two texts can be calculated using Levenshtein edit distance.
There are several Levenshtein modules on CPAN, among others:
<pm:Text::Levenshtein>, <pm:Text::Levenshtein::XS>,
<pm:Text::Levenshtein::Flexible>, <pm:Text::LevenshteinXS>, <pm:Text::Fuzzy>.
For more details, see <pm:Bencher::Scenario::LevenshteinModules>.

Soundex can also be used. Some example Soundex modules: <pm:Text::Soundex>,
<pm:Text::Phonetic::Soundex>.

<pm:Algorithm::HowSimilar> uses Algorithm::Diff to calculate similarity between
two strings. It's roughly similar in speed to pure-Perl Levenshtein modules, and
tends to be faster for longer strings. It can also calculate similarity between
two arrays.

<pm:String::Similarity>

<pm:String::Similarity::Group>

<pm:Text::Similarity>

<pm:String::Simrank>

<pm:String::Similex>


** Between vectors

<pm:Data::CosineSimilarity>


** Between words (semantic similarity)

<pm:WordNet::Similarity>

<pm:WordNet::SenseRelate::AllWords>


** Others

<pm:Cluster::Similarity>

_

our $LIST = {
    summary => 'List of modules for finding similarity between things',
    description => $text,
    tags => ['task'],
};

Acme::CPANModulesUtil::Misc::populate_entries_from_module_links_in_description;

1;
# ABSTRACT: List of modules for finding similarity between things

__END__

=pod

=encoding UTF-8

=head1 NAME

Acme::CPANModules::Similarity - List of modules for finding similarity between things

=head1 VERSION

This document describes version 0.001 of Acme::CPANModules::Similarity (from Perl distribution Acme-CPANModules-Similarity), released on 2024-01-17.

=head1 DESCRIPTION

** Between arrays/bags/sets

L<Algorithm::HowSimilar> uses Algorithm::Diff to calculate similarity between
two arrays. It can also calculate similarity between two strings.

L<Bag::Similarity>

L<Set::Jaccard::SimilarityCoefficient>

L<Set::Partitions::Similarity>

L<Set::Similarity> provides several algorithms.

** Between source code

L<School::Code::Compare>

** Between colors

L<Color::Similarity>

L<Color::RGB::Util> provides C<rgb_diff()> and C<rgb_distance()> to calculate
the difference between two RGB colors using one of several algorithms.

** Between files

L<File::FindSimilars> uses file size and a modified soundex algorithm on the
filename to determine similarity.

** Between graphs

L<Graph::Similarity>

** Between HTML/XML documents

L<HTML::Similarity> calculates the structural similarity between two HTML
documents.

L<XML::Similarity>

** Between images

L<Image::Similar>

** Between strings/texts

Similarity between two texts can be calculated using Levenshtein edit distance.
There are several Levenshtein modules on CPAN, among others:
L<Text::Levenshtein>, L<Text::Levenshtein::XS>,


