The Semantic, IEML-powered tag cloud
A tag cloud is a list of words in different sizes and colors, with or without a sense of depth (3D), meant to represent the statistical importance of keywords mentioned in a particular document base (a blog, a website, twitter,…). It serves as an indicator of the relative importance of the use of certain ideas in the document base at hand. It is a bottom-up, very fuzzy method for the synthesis of knowledge from an arbitrarily big aggregate of (text) data. Because it rests entirely on statistics, very often there is absolutely no relationships between the keywords of a tag cloud. Worse even, if they existed (by pure chance), there is absolutely no way of finding out about the meaning of those relationships.
On the other hand, ontologies propose a top-down approach to the same problem. They require first that some expert, or committee of experts, agree on a common description of a particular domain. Given an ontology, it is possible to map a document base to the concepts it describes: every concept would act as a filter to retrieve documents that match it. The clear advantage of this method is that it is possible to use the semantics expressed in the ontology to search for a specific combination of concepts. A tag cloud rendition of an ontology would, however, carry two major drawbacks with it:
- you need an ontology per domain you wish to search for, hence ontology-based navigation is always going to be extremely specific;
- no semantic value added is generated (note that there is indeed a minor semantic value added in the statistical tag cloud, albeit of a purely quantitative nature)
An IEML tag cloud would combine the advantages of both approaches, while overcoming their limitations: it would generate significant semantic value added through the discovery of the underlying conceptual network of a given document base; and it would allow for navigating in semantics. As for the statistical tag cloud, the conceptual network thus discovered would remain quantitative in nature to a certain degree. Indeed the salience of the semantic relationships is going to vary depending on the precise contents of the document base, as is going to be necessary to reflect in the rendition of the resulting tag cloud.
Another pending issue as far as rendition of an IEML-based tag cloud goes is deciding upon the relative positioning of concepts on the user’s display, in 2D and/or 3D.