Lingua-Norms-SUBTLEX
view release on metacpan or search on metacpan
lib/Lingua/Norms/SUBTLEX.pm view on Meta::CPAN
=head1 DIAGNOSTICS
=over 4
=item * Need a valid <lang> attribute
When constructing the class object with L<new|Lingua::Norms::SUBTLEX/new>, the B<lang> argument must have a valid value, as indicated in the table above. Also, the module needs to read in the contents of a file named "specs.csv" which should be locat...
=item * Value given to argument 'dir' (VALUE) in new() is not a directory
Croaked from L<new|Lingua::Norms::SUBTLEX/new> if called with a value for the argument B<dir>, and this value is not actually a directory/folder. This is the directory/folder in which the actual SUBTLEX datafiles should be located.
=item * Cannot find required database for language ...
Croaked from L<new|Lingua::Norms::SUBTLEX/new> if none of the given values to arguments B<lang>, B<dir> or B<path> are valid, and even the default site/lib directory and US database are not accessible. Check that your have indeed a file with the give...
=item * Cannot determine fields for given language
Croaked upon construction if no fields are recognized for the given language. The value given to B<lang> must be one of DE, NL, UK or US.
=item * The requested value is not defined for the ... SUBTLEX corpus
Croaked when calling for a value for a statistic that is not defined for a given language, e.g., when requesting a value for the Zipf frequency in the NL corpus.
=item * No string to test; pass a value for <string> to FUNCTION()
Croaked by several methods that expect a value for the named argument B<string>, and when no such value is given. These methods require the letter-string to be passed to it as a I<key> => I<value> pair, with the key B<string> followed by the value of...
=item * No string(s) to test; pass one or more letter-strings named \'strings\' as a referenced array
Same as above but specifically croaked by L<frq_hash|Lingua::Norms::SUBTLEX/frq_hash> which accepts more than one string in a single call.
=item * Need to install and have access to module File::RandomLine
Croaked by method L<random_string|Lingua::Norms::SUBTLEX/random_string> if the module it depends on (File::RandomLine) is not installed or accessible. This should have been installed (if not already) upon installation of the present module. See L<CPA...
=back
=head1 DEPENDENCIES
L<File::RandomLine|File::RandomLine> : for L<random_string|Lingua::Norms::SUBTLEX/random_string>
L<Lingua::Orthon|Lingua::Orthon> : for C<set_eq> method
L<List::AllUtils|List::AllUtils> : C<all>, C<any>, C<none>, C<uniq> and other functions
L<Number::Misc|Number::Misc> : C<is_numeric>
L<Path::Tiny|Path::Tiny> : for directory reading when calling L<new|Lingua::Norms::SUBTLEX/new>
L<Statistics::Lite|Statistics::Lite> : for various statistical methods
L<String::Trim|String::Trim> : C<trim>
L<String::Util|String::Util> : for determining valid string values
L<Text::CSV::Hashify|Text::CSV::Hashify> : reads in the specs file
L<Text::CSV::Separator|Text::CSV::Separator> : for determining the field delimiter within the datafiles
L<Text::Unidecode|Text::Unidecode> : for plain ASCII transliterations of Unicode text
=head1 REFERENCES
Brysbaert, M., Buchmeier, M., Conrad, M., Jacobs, A.M., Boelte, J., & Boehl, A. (2011). The word frequency effect: A review of recent developments and implications for the choice of frequency estimates in German. I<Experimental Psychology>, I<58>, 41...
Brysbaert, M., & New, B. (2009). Moving beyond Kucera and Francis: A critical evaluation of current word frequency norms and the introduction of a new and improved word frequency measure for American English. I<Behavior Research Methods>, I<41>, 977-...
Brysbaert, M., New, B., & Keuleers,E. (2012). Adding part-of-speech information to the SUBTLEX-US word frequencies. I<Behavior Research Methods>, I<44>, 991-997. doi: L<10.3758/s13428-012-0190-4|http://dx.doi.org/10.3758/s13428-012-0190-4>
Herdagdelen, A., & Marelli, M. (2017). Social media and language processing: How Facebook and Twitter provide the best frequency estimates for studying word recognition. I<Cognitive Science>, I<41>, 976-995. doi:L<10.1111/cogs.12392|http://dx.doi.org...
Keuleers, E., Brysbaert, M., & New, B. (2010). SUBTLEX-NL: A new frequency measure for Dutch words based on film subtitles. I<Behavior Research Methods>, I<42>, 643-650. doi: L<10.3758/BRM.42.3.643|http://dx.doi.org/10.3758/BRM.42.3.643>
New, B., Brysbaert, M., Veronis, J., & Pallier, C. (2007). The use of film subtitles to estimate word frequencies. I<Applied Psycholinguistics>, I<28>, 661-677.
Soares, A. P., Machado, J., Costa, A., Comesaña, M., & Perea, M. (in press). On the advantages of frequency measures extracted from subtitles: The case of Portuguese. I<Quarterly Journal of Experimental Psychology>.
Van Heuven, W. J. B., Mandera, P., Keuleers, E., & Brysbaert, M. (2014). SUBTLEX-UK: A new and improved word frequency database for British English. I<Quarterly Journal of Experimental Psychology>, I<67>, 1176-1190. doi: L<10.1080/17470218.2013.85052...
=head1 AUTHOR
Roderick Garton, C<< <rgarton at cpan.org> >>
=head1 BUGS AND LIMITATIONS
Please report any bugs or feature requests to C<bug-lingua-norms-subtlfreq-0.07 at rt.cpan.org>, or through
the web interface at L<http://rt.cpan.org/NoAuth/ReportBug.html?Queue=Lingua-Norms-SUBTLEX-0.07>. I will be notified, and then you'll automatically be notified of progress on your bug as I make changes.
=head1 SUPPORT
You can find documentation for this module with the perldoc command.
perldoc Lingua::Norms::SUBTLEX
You can also look for information at:
=over 4
=item * RT: CPAN's request tracker (report bugs here)
L<http://rt.cpan.org/NoAuth/Bugs.html?Dist=Lingua-Norms-SUBTLEX-0.07>
=item * AnnoCPAN: Annotated CPAN documentation
L<http://annocpan.org/dist/Lingua-Norms-SUBTLEX-0.07>
=item * CPAN Ratings
L<http://cpanratings.perl.org/d/Lingua-Norms-SUBTLEX-0.07>
=item * Search CPAN
L<http://search.cpan.org/dist/Lingua-Norms-SUBTLEX-0.07/>
=back
=head1 LICENSE AND COPYRIGHT
Copyright 2014-2018 Roderick Garton.
( run in 1.615 second using v1.01-cache-2.11-cpan-2398b32b56e )