App-optex-textconv
view release on metacpan or search on metacpan
- Use Command::Run::Tmpfile (raw mode) instead of the bundled
App::optex::Tmpfile (requires Command::Run 1.01)
1.07 2023-12-31T06:04:43Z
- add plutil to default rule
- return empty string when the msdoc file is empty
1.06 2023-06-27T01:52:00Z
- Support .gpg files.
- Support footnote reference in docx file.
- Now xslt.pm file is not loaded by default.
1.05 2023-04-14T05:03:38Z
- Support forced line breaks in paragraphs.
1.04 2023-03-09T03:50:37Z
- Introduce textconv::git module to handle git file object.
- jpeg
JPEG file is converted to its exif information (.jpeg, .jpg).
- http
Name start with `http://` or `https://` is converted to text data
translated by [w3c(1)](http://man.he.net/man1/w3c) command.
- gpg
Invoke [gpg(1)](http://man.he.net/man1/gpg) command to decrypt encrypted files with `.gpg`
extention.
- pandoc
Use [pandoc](https://pandoc.org/) command to translate Microsoft
office document in XML format.
See [App::optex::textconv::pandoc](https://metacpan.org/pod/App%3A%3Aoptex%3A%3Atextconv%3A%3Apandoc).
- tika
lib/App/optex/textconv.pm view on Meta::CPAN
=item jpeg
JPEG file is converted to its exif information (.jpeg, .jpg).
=item http
Name start with C<http://> or C<https://> is converted to text data
translated by L<w3c(1)> command.
=item gpg
Invoke L<gpg(1)> command to decrypt encrypted files with C<.gpg>
extention.
=item pandoc
Use L<pandoc|https://pandoc.org/> command to translate Microsoft
office document in XML format.
See L<App::optex::textconv::pandoc>.
=item tika
lib/App/optex/textconv/default.pm view on Meta::CPAN
package App::optex::textconv::default;
use strict;
use warnings;
use App::optex::textconv::Converter 'import';
our @CONVERTER = (
[ qr/\.jpe?g$/i => 'exif "%s"' ],
[ qr[^https?://] => 'w3m -dump "%s"' ],
[ qr/\.gpg$/i => 'gpg --quiet --no-mdc-warning --decrypt "%s"' ],
[ qr/\.(plist|strings)$/ => 'plutil -p "%s"' ],
);
1;
( run in 0.815 second using v1.01-cache-2.11-cpan-df04353d9ac )