Fuzzy Word Meanings as Semantic Granules.
Emergent constraints for self-organizing tree
structures in SCIP systems.
Burghard B. Rieger
Department
of Computational Linguistics, FB II: LDV/CL
University of Trier, D-54286 TRIER, Germany
Abstract
The notion of semiotic cognitive information
processing (SCIP) is concerned with the situated employment of
natural language expressions for communicative purposes. Natural
languages (NL) provide not only linguistic structures
representational for processes of understanding, but also crucial
hints on the operational constitution of their processing. These
allow for the decomposition of wholes into their constituents or
parts (granulation ), for the composition or integration of
their parts into wholes (organization ), and for the
association of semiotic causes with effects (meaning ).
Thus, information granulation 1 can algorithmically be modeled and realized both,
in its crisp as well as fuzzy modes of
representation and processing, by exploiting the structuredness of
pragmatically homogeneous NL text samples (PHT corpora).
1 Introduction
Based upon the notion of Computing with Words (CW), the concept of fuzzy and/or crisp
granulation - once their process-result ambiguity is
solved - lends itself easily to a unifying view of the way
structural linguists used to and still categorize (segment and
classify) observable natural language phenomena (tokens like
phones, morphs, lexes, utterances , etc.) to constitute abstract
linguistic entities (types like phonemes, morphemes, lexemes,
sentences , etc.). These may either be derived as soft
linguistic categories or fuzzy granules represented as
vectors (fuzzy sets ), or they may be postulated as
abstractions to form crisp categories representable by
symbols (signs ) whose linear compositions in well-formed
strings, in turn, give rise to the notion of correctness .
Whereas the latter may formally be characterized by rules ,
the derivation of the former can be determined procedurally by
algorithms operating on language data.
Their twofold process -analytical and
result -representational function render these algorithms
semiotic.
Table 1:
Formalizing (
syntagmatic/paradigmatic )
constraints by consecutive (a- and d-) abstractions over usage regularities of items xi and entities yj respectively.
2 FL and CS
In fuzzy linguistic (FL) models and computational
semiotic (CS) realizations of sign processes,
analytical procedures are derived detecting and, at the same time,
operating on intrinsic (or structural) information that
constitutes understanding as (intermediate) representation of the
phenomena concerned. Based upon the assumption the structuredness
of natural language discourse, its organizing functions,
i.e. integration of parts into wholes (sign formation), as well as
the causative functions, i.e. semiotic association of
causes with effects (meaning constitution), is realized and
accessible in PHT corpora, these may be analyzed for inherent
regularities which may be explored in order to re-construct
(crisp and fuzzy) semantic granules. Tied
to the empirically well founded and testable observations and
rigorous mathematical description of results, entity formation in
natural language discourse can be shown to constitute (different
levels of) processes and/or their representational results. On
word level these are viewed as enactment of universal principles
which are realized in and detectable from pragmatically
homogeneous texts (PHT) of either performed or intended
communicative interaction in actual situations.
The semantic meaning functions have been modeled and
computed earlier as results of those same (semiotic)
procedures by way of which (representational) structures emerge.
Their actual interpretation could be simulated by analyzing the
possibilistic constraints found to be imposed upon the linear
ordering (syntagmatics ) and the selective combination
(paradigmatics ) of natural language entities (word-types)
in discourse. In a FL/CS approach to lexical
semantics this is tantamount to (re-)construct an entity's
semiotic potential (meaning function) by a weighted graph
(fuzzy distributional pattern ) representing
a particular state of the modeled system's lexical state space
rather than by a single symbol whose interpretation would
have to be extrinsic to that system. In this view,
the emergence of semantic structure can be represented and studied
as a self-organizing process based upon word usage regularities in
natural language discourse. In its course, the linearly
agglomerative (or syntagmatic ) as well as the
distributionally selective (or paradigmatic ) constraints
are exploited by text analyzing algorithms which accept natural
language text corpora as input and produce-via levels of
intermediate processing and representation-a vector space
structure as output . As semantic hyperspace (SHS)
it may be interpreted as an internal (endo ) representation
of the SCIP system's states of adaptation to the external (
exo ) structures of its environment as mediated by the discourse
processed. The degree of correspondence between these two is
determined by the granularity that the texts provide in depicting
an exo- view, and the resolution that the SCIP system is
able to acquire as its endo- view in the course of that
discourse' processing.
3 Empirical Reconstruction
Following the procedural approach in FL/CS, the reconstruction of
linguistic functions or meanings of words is based upon a
fundamental analytical as well as representational formalism. It
can be characterized as a two-level process of abstraction (called
a- and d-abstraction) on the set of fuzzy
subsets of the vocabulary-providing the word-types' usage
regularities or corpus points -and on the set of
fuzzy subsets of these-providing the corresponding
meaning points (Tab. 1). These may be understood
to interpret semantically (by way of the meaning function) those
word-types which are being instantiated by word-tokens as employed
in natural language PHT corpora.
The basically descriptive statistics used to grasp these relations
on the level of words in discourse is centered around a
correlational measure () to specify intensities of
co-occurring lexical items in texts, and a measure of similarity
(or rather, dissimilarity) () to specify these
correlation value distributions' differences. Simultaneously,
these two measures may also be interpreted semiotically as
providing for the set theoretical constraints or formal mappings
a and d which model the meanings of words as a
function of these words' differences of usage regularities as
produced in discourse and analyzed in the PHT corpus.
The coefficient ai,j (1) measures pairwise
relatedness of word-types (xi,xj) Î V ×V
where eit=[(Hi)/L] lt and
ejt=[(Hj)/L] lt,
the PHT corpus of texts
K={ kt } ; t=1,¼,T
has the length
L=åt=1T lt; 1 £ lt £ L
measured by the number of word-tokens per text, and a vocabulary
V={ xn } ; n=1,¼,i,j,¼,N whose type frequencies
are denoted by
Hi=åt=1Thit ; 0 £ hit £ Hi.
Figure 1:
The semantic inference procedure
is a parallel process activated from start nodes (
premises ) generating DDS graphs and stopped by first node
common to all (conclusion ). Subtrees constitute
perspectively determined information granules of differing
connotative, resolutional, and dependency structure.
The lexical items' usage regularities detected are represented by
tuples of a(xi,xn) -values which-interpreted as
coordinates ai(xn)- can be represented by points in
a vector space C spanned by the number of axes each of which
corresponds to an entry in the vocabulary xi Î V.
Their similarities and/or dissimilarities are calculated by a
distance measure d () as the Euclidian
metric on C
whose pairwise representations as tuples of
d(yi,yn) -values determine-interpreted as
coordinates di(yn) again-meaning points
zn Î S or vectors in the hyperstructure or semantic
space áS,zñ spanned by the number of axes
corresponding to vocabulary entries xn Î V and a
Euclidian metric z.
Thus, the two-stage mapping corresponds to a category-type
morphism or composition [(d)\tilde] | yn ° [(a)\tilde] | xn (Fig. 2), resulting in
the format of semantic hyperspace (SHS) which constitutes
a system of meaning points as an empirically founded and
functionally derived representation of a lexically labeled
knowledge structure.
4 SHS
The semantic hyperspace (SHS) structure resulting from the
performance oriented approach allows to reconstruct
formally and model procedurally both, the significance of
entities and the meanings of signs as a function of a
first and second order semiotic embedding relation of
language games (or cotexts2) and of situations (or contexts). As this function corresponds to the two-level
actualization of cognitive processes in language understanding,
SHS provides the format and structural information for an
intermediate representational (tree-graph) structures to be
generated as semantic or dispositional
dependencies (DDS) introduced elsewhere. Their
property of situational (co- and contextual) sensitivity gave rise
to the algorithmic derivation and diagrammatical emulation of
(perspective and relevance driven) information granulation
and semantic inferencing which operate on DDS-structures.
4.1 Granular structure and constraints
Dispositional dependency structures (DDS) (Fig.
1) can be viewed as an alternative procedural format
of fuzzy information granulation which extends the
rule-based frame as introduced by the concept of generalized
constraint and exemplified in as
unconditional constraints .
According to Zadeh's (1997) theory of fuzzy information
granulation (TFIG), a generalized constraint on values of X is
expressed as X isr R, where X is a variable which takes
values in a universe of discourse U, isr is a parametric
copula with r being a discrete variable whose values define the
way in which R constrains X, and R is the constraining
relation. For r different values may be defined as
equality, possibility, verity, probability, random set, and
fuzzy graph , and their (definitional, operational,
procedural, computational) interpretations can be given.
From our perspective it is important to observe that r is a
means to extend the copula's interpretations in a controlled and
operationally defined way which relates to R in a predicative
sense, i.e. specifying the interpretation of R (generally a
distribution of grades of membership) as being possibilities,
truth values, probabilities or composites thereof. As these
functional types of r needed to be specified for rule-based
mechanisms in order to determine their different interpretations
of R, this necessity may be relaxed or even become obsolete when
the rule-based inference mechanism is replaced by an algorithmic
procedure, operating on a well-defined structure like SHS as
specified numerically by the value distributions which constitute
the meaning points' interpretations.
4.2 Deriving semantic granules
Taking the concept of generalized constraints being applicable
likewise for sentences (propositions) as well as for words (DDS),
then the TFIG notational format translates to X @ {xn}
where X is a variable which takes values-via a- and
d-abstraction-of zn Î áS ñ with S Í U. A semiotically generalized constraint on values of
X is expressed by X ddsi S where dds relates
xi via zi to S by restricting SHS procedurally in
generating the tree structure from meaning point zi as its
root, and zn as its discrete variables whose values determine
different structures (dependency paths ) which constrain
the topology of S in a semantically perspective way.
Thus, dependency path is a structural representation for a
dynamic concept of granular word meaning which induces a
reflexive, symmetric, and weakly transitive relation between
relevant meaning points as its components, allowing for the
procedural definition and computational enactment of
semantic inferencing on the word level,
very much like the rule-based models of inferencing in
granular fuzzy information processing based on fuzzy
rules , or the syntagmatically defined propositional
formats of symbolic processing in (cognitive linguistic) sentence
semantics based on crisp logic calculi.
In Fig. 1 the semantic hyperspace áS,zñ was computed from a corpus of Reuters 1987
newswire articles3. Two
vocabulary items xi= administration, xj=
deposit, corresponding to meaning points zi, zj were
chosen as premises for the semantic inference process. It
restricts áS ñ simultaneously by generating the
graphs DDSi, DDSj in parallel. The inferred
conclusion is the first common node zk= estate whose
different dependency paths depi(zk),depj(zk) are given (center column). Depending on the
semantic perspectives, however, as determined by the root node
zi, zj respectively, the subtrees or information
granules igi(k), igj(k), headed by zk=
estate (left and right column) demonstrate the i and j
induced differences both, in connotative meaning and in semantic
resolution of these fuzzy information granules.
5 Conclusion
The dynamics of semiotic knowledge structures and the
processes operating on them essentially consist in their
recursively applied mappings of multi-level
representations resulting in a multi-resolutional
granularity of fuzzy word meanings which emerge from and are
modified by such text processing. Computational tests and
experiments with different PHT corpora have produced promising
evidence on the SCIP system's granular meaning acquisition and
language understanding capacity without any explicit initial
morphological, lexical, syntactic, or semantic knowledge.
References
- [1]
-
A. Meystel: Semiotic Modeling and Situation Analysis.
Bala Cynwyd, PA (AdRem Inc), 1995.
- [2]
- B. Rieger: Unscharfe Semantik. Frankfurt/ Bern/ Paris
(Lang), 1989.
- [3]
- B. B. Rieger: Distributed Semantic Representation of Word
Meanings.
In: Becker/Eisele/Mündemann (eds): Parallelism, Learning,
Evolution. Berlin/ Heidelberg/ New York
(Springer), 1991, pp. 243-273.
- [4]
- B. B. Rieger: Meaning Acquisition by SCIPS. In: Ayyub (ed):
ISUMA/NAFIPS-95-Proceedings, Los Alamitos, CA (IEEE), 1995,
pp. 390-395.
- [5]
- B. B. Rieger: Situations, Language Games, and SCIPS. In:
Meystel/Nerode (eds): Semiotic Modeling and Situation
Analysis: 10th IEEE Symposium on Intelligent Control.
Bala Cynwyd, PA (AdRem), 1995, pp. 130-138.
- [6]
- B. B. Rieger: Situation Semantics and Computational Linguistics:
towards
Informational Ecology. In: Kornwachs/Jacoby (eds): Information. New
Questions to a Multidisciplinary Concept , Berlin (Akademie), 1996,
pp. 285-315.
- [7]
- B. B. Rieger: Computational Semiotics and Fuzzy Linguistics.
In: Meystel (ed): A Learning
Perspective, ISAS-97-Proceedings,
Washington, DC (US Gov.
Printing), 1997, pp. 541-551.
- [8]
- B. B. Rieger: Tree-like Dispositional Dependency Structures
for non-propositional Semantic Inferencing. In:
Bouchon-Meunier/Yager (eds): IPMU-98-Proceedings, Paris
(EKD), 1998, pp. 351-358.
- [9]
- B. B. Rieger: Semiotics and Computational Linguistics. In:
Zadeh/Kacprzyk (ed): Computing with Words in Information/
Intelligent Systems I , Heidelberg/ New York (Physica), 1999,
pp. 93-118.
- [10]
- B. B. Rieger: Computing Granular Word Meanings. In:
Wang/Meystel/Albus (eds): Computing with Words , New York,
NY (Wiley), 2000, [in print].
- [11]
- B. B. Rieger/C. Thiopoulos: Semiotic Dynamics: a self-organizing
lexical system in hypertext. In: Köhler/Rieger (eds):
QUALICO-91-Proceedings, Dordrecht (Kluwer), 1993, pp. 67-78.
- [12]
- L. A. Zadeh: Outline of a computational approach to meaning and
knowledge representation. In: Thoma/Wyner (eds): AI and
Man-Machine Systems, Heidelberg (Springer), 1986, pp. 198-211.
- [13]
- L. A. Zadeh: Fuzzy logic = Computing with words.
IEEE-Transactions on Fuzzy Systems, 4: 103-111, 1996.
- [14]
- L. A. Zadeh: Toward a Theory of Fuzzy Information Granulation and
its Centrality in Human Reasoning and Fuzzy Logic. Fuzzy
Sets and Systems , 90(3): 111-127, 1997.
Footnotes:
1According to
Zadeh (1997), all processes of human cognition are
structured by granulation, organization and
causation .
2The text-linguistic term
refers to the language environment (cotext ) of an
expression embedded in its discourse situation (
context ).
3Reuters-21578 (1.0) Text Categorization
Test Collection, prepared by D.D.Lewis (AT&T Labs) and thankfully
acknowledged here (
www.research.att.com/ ~ lewis/reuters21578.html).