CCCP-Encode
lib/CCCP/Encode.pm
$CCCP::Encode::CharMap = {
"\x{2014}" => '-',
"\x{2015}" => 'foo'
};
=head3 $CCCP::Encode::Regexp
The default value is C<[^\p{Cyrillic}|\p{IsLatin}|\p{InBasic_Latin}]>: any character that is not in the Cyrillic or Latin character sets is replaced.
You can override this expression.
For more on Unicode properties in regular expressions, see C<http://www.regular-expressions.info/unicode.html>
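To make the effect of the default expression concrete, here is a minimal standalone sketch in plain Perl. It does not call CCCP::Encode and is not the module's own implementation: the names C<%char_map>, C<$outside> and C<replace_outside> are hypothetical, and the C<?> fallback is an assumption chosen only for illustration.

```perl
use strict;
use warnings;

# Hypothetical stand-ins for $CCCP::Encode::CharMap and $CCCP::Encode::Regexp.
my %char_map = ("\x{2014}" => '-', "\x{2015}" => 'foo');

# The default pattern quoted above, copied verbatim. Inside a bracketed class
# the "|" is a literal character, not alternation; it already belongs to
# Basic Latin, so it is redundant here but harmless.
my $outside = qr/[^\p{Cyrillic}|\p{IsLatin}|\p{InBasic_Latin}]/;

# Replace each character matched by $outside: use the map when it has an
# entry, otherwise fall back to "?" (an assumption made for this sketch).
sub replace_outside {
    my ($text) = @_;
    $text =~ s/($outside)/exists $char_map{$1} ? $char_map{$1} : '?'/ge;
    return $text;
}

# Input: Latin "a", EM DASH, Cyrillic "be", WHITE SMILING FACE.
my $in  = "a \x{2014} \x{431} \x{263A}";
my $out = replace_outside($in);

binmode(STDOUT, ':encoding(UTF-8)');
print "$out\n";
```

In this sketch the em dash is mapped to C<->, the Cyrillic letter and the ASCII characters pass through untouched, and the smiley, which is neither Cyrillic nor Latin and has no map entry, becomes C<?>.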
=head1 OVERHEAD
CCCP::Encode with $CCCP::Encode::Entities eq "html":
2 wallclock secs ( 1.63 usr + 0.01 sys = 1.64 CPU) @ 60975.61/s (n=100000)
CCCP::Encode with $CCCP::Encode::Entities eq "xml":
3 wallclock secs ( 2.49 usr + 0.00 sys = 2.49 CPU) @ 40160.64/s (n=100000)
CCCP::Encode with $CCCP::Encode::ToText eq "1":