Alvis-NLPPlatform


examples/InputDocument.xml

                <item>Form: source+resources</item> 
                <item>Obtain: links and instructions at <ulink url="http://www-tsujii.is.s.u-tokyo.ac.jp/~genia/postagger/geniatagger-2.0.1.tar.gz">http://www-tsujii.is.s.u-tokyo.ac.jp/~genia/postagger/geniatagger-2.0.1.tar.gz</ulink></item> 
                <item>Install: untar geniatagger-2.0.1.tar.gz in a directory; cd tools/geniatagger-2.0.1; run make</item> 
                <item>Licence: free for research only (and Wordnet licence for the dictionary)</item> 
                <item>Version number required: 2.0.1</item></list></section>
            <section title="Parser">
              <section>Parser</section> 
              <section>Link Grammar Parser:</section> 
              <list>
                <item>Form: sources + resources</item> 
                <item>Obtain: <ulink url="http://www.link.cs.cmu.edu/link/ftp-site/link-grammar/link-4.1b/unix/link-4.1b.tar.gz">http://www.link.cs.cmu.edu/link/ftp-site/link-grammar/link-4.1b/unix/link-4.1b.tar.gz</ulink></item> 
                <item>Install: untar link-4.1b.tar.gz; see the Makefile for configuration; run make. Apply the additional patch for the Link Grammar parser (lib/Alvis/NLPPlatform/patches): cd link-4.1b; patch -p0 &lt; lib/Alvis/NLPPlatform/patches/link-4.1...
                <item>Licence: Compatible with GPL</item> 
                <item>Version number required: 4.1a or 4.1b</item></list></section>
            <section title="Parser specialized for biological texts">
              <section>Parser specialized for biological texts</section> 
              <section>BioLG:</section>  
              <section>Form: sources + resources</section> 
              <section>Obtain: <ulink url="http://www.it.utu.fi/biolg/">http://www.it.utu.fi/biolg/</ulink></section> 
              <section>Install: untar; see the Makefile for configuration; run make</section> 
              <section>Licence: Compatible with GPL</section> 

examples/InputDocument.xml

          <section title="LICENSE">
            <section>LICENSE</section> 
            <section>Copyright (C) 2005 by Thierry Hamon and Julien Deriviere</section> 
            <section>This program is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.8.6 or, at your option, any later version of Perl 5 you may have available.</section></section></s...
      <metaData>
        <meta name="title">title</meta>
      </metaData>
      <links>
        <outlinks>
          <link type="a">
            <anchorText>http://www.link.cs.cmu.edu/link/ftp-site/link-grammar/link-4.1b/unix/link-4.1b.tar.gz</anchorText>
            <location>http://www.link.cs.cmu.edu/link/ftp-site/link-grammar/link-4.1b/unix/link-4.1b.tar.gz</location>
          </link>
          <link type="a">
            <anchorText>http://www-tsujii.is.s.u-tokyo.ac.jp/~genia/postagger/geniatagger-2.0.1.tar.gz</anchorText>
            <location>http://www-tsujii.is.s.u-tokyo.ac.jp/~genia/postagger/geniatagger-2.0.1.tar.gz</location>
          </link>
          <link type="a">
            <anchorText>http://www.alvis.info</anchorText>
            <location>http://www.alvis.info</location>
          </link>
          <link type="a">

lib/Alvis/NLPPlatform.pm



=over

=item * Form:

 sources + resources

=item * Obtain:

 http://www.link.cs.cmu.edu/link/ftp-site/link-grammar/link-4.1b/unix/link-4.1b.tar.gz

=item * Install: 

    untar link-4.1b.tar.gz

    See the Makefile for configuration

    run make

    Apply the additional patch for the Link Grammar parser (lib/Alvis/NLPPlatform/patches).

lib/Alvis/NLPPlatform/Convert.pm

    # Match a literal dot so that names merely ending in "ppt" or "xls" are not misclassified.
    if ($file =~ /\.ppt$/i) {
	$type = "application/powerpoint";
	warn "Getting the type thanks to the extension\n";
    }
    if ($file =~ /\.xls$/i) {
	$type = "application/vnd.ms-excel";
	warn "Getting the type thanks to the extension\n";
    }
    # For msword, checking the extension as well might help to determine the type more precisely.
    # Strip any parameters that follow the first ';' in the detected type.
    $type =~ s/;.*//;
    if (($type eq "message/rfc822") || ($file =~ /^x-system\/x-unix;/)) {
	if ($file =~ /\.tex$/i) {
	    $type = "text/x-tex";
	    warn "Getting the type thanks to the extension\n";
	}
    }
    print STDERR "Type file: $type\n";
    return($type);

}
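
The extension checks in the snippet above could also be written as a table lookup. The following is only a minimal sketch under stated assumptions: `refine_type` and `%TYPE_BY_EXTENSION` are hypothetical names that do not appear in Alvis::NLPPlatform::Convert, and the sketch covers only the `.ppt`, `.xls` and `.tex` cases visible in the snippet. It anchors each extension on a literal dot, so a file named `appt` is not taken for a PowerPoint file.

```perl
use strict;
use warnings;

# Hypothetical lookup table: lower-case extension => MIME type.
# The two values are the ones used in the snippet above.
my %TYPE_BY_EXTENSION = (
    ppt => 'application/powerpoint',
    xls => 'application/vnd.ms-excel',
);

# Hypothetical helper (not part of the module): refine an already
# detected MIME type using the file name.
sub refine_type {
    my ($file, $type) = @_;

    # Capture the text after the last literal dot and look it up.
    if ($file =~ /\.([^.\/]+)$/ && exists $TYPE_BY_EXTENSION{lc $1}) {
        $type = $TYPE_BY_EXTENSION{lc $1};
    }

    # Drop any parameters that follow the first ';'.
    $type =~ s/;.*//;

    # Mirror the snippet's special case: a .tex file reported as message/rfc822.
    if ($type eq 'message/rfc822' && $file =~ /\.tex$/i) {
        $type = 'text/x-tex';
    }

    return $type;
}

print refine_type('slides.PPT', 'application/octet-stream'), "\n";
print refine_type('paper.tex', 'message/rfc822; charset=us-ascii'), "\n";
print refine_type('appt', 'text/plain; charset=us-ascii'), "\n";
```

Keeping the mapping in a hash means a new extension needs one new entry rather than another `if` block. Whether that suits the module depends on how many special cases, such as the `.tex` one, it has to handle.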


