DTA-CAB

 view release on metacpan or  search on metacpan

CAB/XmlRpcProtocol.pod  view on Meta::CPAN


(boolean) whether input token text is losslessly encodable in
the latin-1 (ISO-8859-1) character set.
If this is true, the token's L<xlit.latin1Text|/token.xlit.latin1Text> field should
be identical to its L<text|/token.text> field.

=item token.xlit.isLatinExt

(boolean) whether input token text is losslessly encodable
in the Unicode "latin-extended" character set.

=back

=item token.morph

Morphological analyses, an array of structs,
each element of which represents a single analysis,
and has fields:

=over 4

=item token.morph[i].w

Analysis weight (float)

=item token.morph[i].hi

Analysis output string

=back

=item token.msafe

int (boolean) representing heuristic analysis of whether
morphological analyses are considered "safe" (proper names
e.g. are not considered "safe" in this sense).

=item token.lts

Letter-to-sound transduction results as an array
of structs.  See L</token.morph> for the element format.


=item token.eqpho

Array of analysis structs a la L</token.morph>
representing the set of phonetically equivalent indexed word types.
Note that the notion of "phonetic equivalence" implicit in this analysis
may or may not be consistent with actual identity of phonetic strings
as returned in the L</token.lts> field.

=item token.eqphox

Array of strings representing the k-best phonetically equivalent word types known to
the underlying (intensional) lexicon.  Differs from L</token.eqpho> in the set from
which the phonetic equivalents are drawn.

=item token.rw

Array of analysis structs a la L</token.morph>, where weights
and analyses are determined by a (canonicalizing) rewrite cascade.
Each analysis struct may additionally have "lts" and/or "morph"
fields of its own, representing the respective analyses of the
rewrite I<target>.

=item token.eqrw

Array of analysis structs a la L</token.morph> representing the indexed word types
which are "rewrite-equivalent"
to the current token; i.e. which were rewritten to the same string as the
current token.

=item token.eqlemma

Array of analysis structs a la L</token.morph> representing the indexed word types
which are lemma-equivalent to the current token; i.e. which were assigned the same
lemma as the current token during the most recent indexing run.
See L</token.moot.lemma>.

=item token.dmoot

Canonicalization-disambiguator output, a struct containing sub-fields:

=over 4

=item token.dmoot.tag

(required): string representing the optimal canonicalization of the token
as returned by the HMM disambiguator.

=item token.dmoot.morph

(optional): morphological analyses for L</token.dmoot.tag>, in the same
format as L</token.morph>.

=item token.dmoot.analyses

(optional): disambiguator input analyses, an array of structs representing the
canonicalization candidates passed to the disambiguator.  Each element is a struct
of the form

=over 4

=item token.dmoot.analyses[i].tag

Canonicalization candidate for this analysis (string).

=item token.dmoot.analyses[i].details

Details for this canonicalizaion candidate (string).

=item token.dmoot.analyses[i].cost

Heuristic cost of this canonicalization candidate (float).

=back

=back


=item token.moot



( run in 0.440 second using v1.01-cache-2.11-cpan-0bb4e1dffa6 )