Models of collaborative tagging

Last updated

Collaborative tagging, also known as social tagging or folksonomy, allows users to apply public tags to online items, typically to make those items easier for themselves or others to find later. It has been argued that these tagging systems can provide navigational cues or "way-finders" for other users to explore information. [1] [2] The notion is that given that social tags are labels users create to represent topics extracted from online documents, the interpretation of these tags should allow other users to predict the contents of different documents efficiently. Social tags are arguably more important in exploratory search, in which the users may engage in iterative cycles of goal refinement and exploration of new information (as opposed to simple fact-retrievals), and interpretation of information contents by others will provide useful cues for people to discover topics that are relevant.

Contents

One significant challenge that arises in social tagging systems is the rapid increase in the number and diversity of tags. As opposed to structured annotation systems, tags provide users an unstructured, open-ended mechanism to annotate and organize web content. As users are free to create any tag to describe any resource, it leads to what is referred to as the vocabulary problem. [3] Because users may use different words to describe the same document or extract different topics from the same document based on their own background knowledge, the lack of any top-down mediation may lead to an increase in the use of incoherent tags to represent the information resources in the system. In other words, the lack of structure inherent in social tags may hinder their potential as navigational cues for searchers because the diversities of users and their motivation may lead to diminishing tag-topic relations as the system grows. However, a number of studies have shown that structures do emerge at the semantic level – indicating that there are cohesive forces driving the emergent structures in a social tagging system. [4]

Distinction between descriptive and predictive models

Just like any social phenomena, behavioral patterns in social tagging systems can be characterized by either a descriptive or predictive model. While descriptive models ask the question of "what", predictive models go deeper to also ask the question of "why" by attempting to provide explanations for the aggregate behavioral patterns. [5] While there may be no general agreement on what an acceptable explanation should be like, many believe that a good explanation should have a certain level of predictive accuracy.

Descriptive models typically are not concerned with explaining the actions of individuals. Instead, they focus on describing the patterns that emerge as individual behavior is aggregated in a large social information system. Predictive models, however, attempt to explain aggregate patterns by analyzing how individuals interact and link to each other in ways that bring about similar or different emergent patterns of social behavior. In particular, a mechanism-based predictive model assumes a certain set of rules governing how individuals interact with each other, and understand how these interactions could produce aggregate patterns as observed and characterized by descriptive models. Predictive models can therefore provide explanations to why different system characteristics may lead to different aggregate patterns, and can therefore potentially provide information on how systems should be designed to achieve different social purposes.

Descriptive models

Information theory models

For most tagging systems, the total number of objects being tagged far exceeds the total number of tags in the collective vocabulary. If a single tag in this system is specified, many documents would match, so that using single tags cannot effectively isolate any one document. However, some documents are more popular or important than others, which is reflected in the number of bookmarks per document. Thus, the focus should be on how well the mapping of tags to documents retains information about the distribution of the documents. Information theory provides a framework to understand the amount of shared information between two random variables. The conditional entropy measures the amount of entropy remaining in one random variable when the value of a second random variable is known.

A 2008 paper by Ed Chi and Todd Mytkowicz showed that the entropy of documents conditional on tags, H(D|T), is increasing rapidly. [6] This suggests that, even after knowing completely the value of a tag, the entropy of the set of documents is increasing over time. Conditional entropy asks the question: "Given that a set of tags is known, how much uncertainty remains regarding the document set referenced by those tags?" This curve is strictly increasing, which suggests that the specificity of any given tag is decreasing. As a navigation aid, tags are becoming harder and harder to use, and a single tag will gradually reference too many documents to be considered useful.

Another approach is through mutual information, a measure of independence between two variables. Full independence is reached when I(D;T) = 0.[ clarification needed ] Chi and Mytkowicz's research shows that as a measure of usefulness of tags and their encoding, there is a worsening trend in the ability of users to specify and find tags and documents when they are engaged in simple fact retrieval. [6] This suggests that search and recommendation systems should be built to help users sift through resources in social tagging systems, especially when they are engaged in activities beyond fact retrieval, as characterized by information theory. Although the number of documents associated with any given tag is increasing, there are many ways contextual information can help users to look for relevant information. This is one of the major weakness of the simple information theory in explaining usefulness of tags – it ignores how humans can extract meanings from a set of tags assigned to a document. For example, a 2007 paper showed that while the number of tags is increasing, the general growth pattern is scale-free – the general distribution of tag-tag co-occurrences follows a power law. [7]

The same paper also found that the characteristics of this scale-free distribution depend on the semantics of the tag – tags that are semantically general (e.g., blogs) tend to co-occur with many tags, while semantically narrow tags (e.g., Ajax) tend to co-occur with few tags across a wide set of documents in a social tagging system. [7] This suggests that the assumption of the information theory approach is too simple – when taking into account the semantics of the set of tags assigned to documents, the predictive value of tags on contents of documents is relatively stable. This finding is important for development of recommender systems – discovering these higher-level semantic patterns is important in helping people find relevant information.

Tag convergence

Despite this potential vocabulary problem, research has found that at the aggregate level, tagging behavior seemed relatively stable, and that the tag choice proportions seemed to be converging rather than diverging. While these observations provided evidence against the proposed vocabulary problem, they also initiated research investigating how and why tag proportions tended to converge over time.

One explanation for the stability was that there was an inherent propensity for users to "imitate" word use of others as they create tags. This propensity may act as a form of social cohesion that fosters the coherence of tag-topic relations in the system, and leads to stability in the system. [8] It was shown that the stochastic urn model created in 1923 [9] was useful in explaining how simple imitation behavior at the individual level could explain the converging usage patterns of tags. [8] Specifically, the convergence of tag choices was simulated by a process in which a colored ball was randomly selected from an urn, then replaced in the urn along with an additional ball of the same color, simulating the probabilistic nature of tag reuse. This simple model, however, does not explain why certain tags would be "imitated" more often than others, and therefore cannot provide a realistic mechanism for tag choices and how social tags could be used as navigational cues during exploratory search.

Complex systems dynamics and emergent vocabularies

Research based on data from the social bookmarking website Del.icio.us has shown that collaborative tagging systems exhibit a form of complex systems (or self-organizing) dynamics. [10] Furthermore, although there is no central, controlled vocabulary to constrain the actions of individual users, the distributions of tags that describe different resources has been shown to converge over time to a stable, power-law distribution. [10] Once such stable distributions form, the correlations between different tags can be used to construct simple folksonomy graphs, which can be partitioned to obtain a form of community or shared vocabularies. [11] Such vocabularies can be seen as emerging from the decentralized actions of many users – a form of crowdsourcing.

Tag choice by stochastic process

The memory-based Yule-Simon (MBYS) model [7] attempts to explain tag choices by a stochastic process. It was found that the temporal order of tag assignment influences users' tag choices. Similar to the stochastic urn model, the MBYS model assumes that at each step, a tag would be randomly sampled: with probability that the sampled tag was new, and with probability 1- that the sampled tag was copied from existing tags. When copying, the probability of selecting a tag was assumed to decay with time, and this decay function was found to follow a power-law distribution. Thus, tags that were more recently used had a higher probability of being reused.

One major finding was that semantically general tags (e.g., "blog") generally co-occurred more frequently with other tags than semantically narrower tags (e.g., "Ajax"), and this difference could be captured by the decay function of tag reuse in their model. [7] Specifically, it was found that a slower decay parameter (when the tag is reused more often) could explain the phenomenon that semantically general tags tended to co-occur with a larger set of tags. In other words, it was argued that the "semantic breadth" of a tag could be modeled by a memory decay function, which could lead to different emergent behavioral patterns in a tagging system. [7]

Predictive models

Semantic imitation model

Descriptive models were based on analyses of word-word relations as revealed by the various statistical structures in the organization of tags (e.g., how likely one tag would co-occur with other tags or how likely each tag was reused over time). Thus, these models are descriptive models at the aggregate level, and have little to offer about predictions at the level of an individual's interface interactions and cognitive processes.

Rather than imitating other users at the word level, one possible explanation for this kind of social cohesion could be grounded on the natural tendency for people to process tags at the semantic level, and it was at this level of processing that most of the imitation occurred. This explanation was supported by research in the area of reading comprehension, which showed that during comprehension, people tended to be influenced by meanings of words rather than the words themselves. [12] Assuming that people in the same culture tend to have shared structures – such as using similar vocabularies and their corresponding meanings to conform and communicate, users of the same social tagging system may also share similar semantic representations of words and concepts, even when the use of tags may vary across individuals at the word level. As such, part of the reason for the stability of social tagging systems can be attributed to the shared semantic representations among the users, such that users may have relatively stable and coherent interpretation of information contents and tags as they interact with the system. Based on this assumption, the semantic imitation model predicts how different semantic representations may lead to differences in individual tag choices and eventually different emergent properties at the aggregate behavioral level. [13] [14] The model also predicts that the folksonomies in the system reflect the shared semantic representations of the users.

Semantic imitation has important implications to the general vocabulary problem in information retrieval and human–computer interaction – the creation of a large number of diverse tags to describe the same set of information resources. Semantic imitation implies that the unit of communication among users is more likely at the semantic level rather than the word level. Thus, although there may not be strong coherence in the choice of words in describing a resource, at the semantic level, there seems to be a stronger coherence force that guides the convergence of descriptive indices. This is in sharp contrast to conclusions derived based on a purely information-theoretical approach, which assumes that humans search and evaluate information at the word level. Instead, the process of semantic imitation in social tagging implies that the information-theoretic approach is at most incomplete, as it does not take into account the basic unit of human information processing. Similar to the fact that human communication occurs at the semantic level, the fact that people may use different words or syntax does not affect the effectiveness of communication, so long as the underlying "common ground" between two people is the same. [15]

In the social tagging case, as long as users share a similar understanding of the contents of the information resources, the fact that the information value of tag-document decreases (that humans have more words in their languages) does not imply that it will always be harder to find relevant information (similarly, the fact that there are an increasing number words in human languages does not mean that communication becomes less effective). However, it does point to the notion that one needs to effectively present these semantic structures in the information system so that people can effectively interpret the semantics of the tagged documents. Intelligent techniques based on statistical models of language, such as latent semantic analysis and the probabilistic topics model [ clarification needed ], could potentially overcome this vocabulary problem.[ citation needed ]

See also

Related Research Articles

<span class="mw-page-title-main">Semantic Web</span> Extension of the Web to facilitate data exchange

The Semantic Web, sometimes known as Web 3.0, is an extension of the World Wide Web through standards set by the World Wide Web Consortium (W3C). The goal of the Semantic Web is to make Internet data machine-readable.

Personal information management (PIM) is the study and implementation of the activities that people perform in order to acquire or create, store, organize, maintain, retrieve, and use informational items such as documents, web pages, and email messages for everyday use to complete tasks and fulfill a person's various roles ; it is information management with intrapersonal scope.

Information foraging is a theory that applies the ideas from optimal foraging theory to understand how human users search for information. The theory is based on the assumption that, when searching for information, humans use "built-in" foraging mechanisms that evolved to help our animal ancestors find food. Importantly, a better understanding of human search behavior can improve the usability of websites or any other user interface.

<span class="mw-page-title-main">Collaborative filtering</span> Algorithm

Collaborative filtering (CF) is a technique used by recommender systems. Collaborative filtering has two senses, a narrow one and a more general one.

An annotation is extra information associated with a particular point in a document or other piece of information. It can be a note that includes a comment or explanation. Annotations are sometimes presented in the margin of book pages. For annotations of different digital media, see web annotation and text annotation.

Social bookmarking is an online service which allows users to add, annotate, edit, and share bookmarks of web documents. Many online bookmark management services have launched since 1996; Delicious, founded in 2003, popularized the terms "social bookmarking" and "tagging". Tagging is a significant feature of social bookmarking systems, allowing users to organize their bookmarks and develop shared vocabularies known as folksonomies.

Semantic similarity is a metric defined over a set of documents or terms, where the idea of distance between items is based on the likeness of their meaning or semantic content as opposed to lexicographical similarity. These are mathematical tools used to estimate the strength of the semantic relationship between units of language, concepts or instances, through a numerical description obtained according to the comparison of information supporting their meaning or describing their nature. The term semantic similarity is often confused with semantic relatedness. Semantic relatedness includes any relation between two terms, while semantic similarity only includes "is a" relations. For example, "car" is similar to "bus", but is also related to "road" and "driving".

<span class="mw-page-title-main">Tag (metadata)</span> Keyword assigned to information

In information systems, a tag is a keyword or term assigned to a piece of information. This kind of metadata helps describe an item and allows it to be found again by browsing or searching. Tags are generally chosen informally and personally by the item's creator or by its viewer, depending on the system, although they may also be chosen from a controlled vocabulary.

In human–computer interaction, the keystroke-level model (KLM) predicts how long it will take an expert user to accomplish a routine task without errors using an interactive computer system. It was proposed by Stuart K. Card, Thomas P. Moran and Allen Newell in 1980 in the Communications of the ACM and published in their book The Psychology of Human-Computer Interaction in 1983, which is considered as a classic in the HCI field. The foundations were laid in 1974, when Card and Moran joined the Palo Alto Research Center (PARC) and created a group named Applied Information-Processing Psychology Project (AIP) with Newell as a consultant aiming to create an applied psychology of human-computer interaction. The keystroke-level model is still relevant today, which is shown by the recent research about mobile phones and touchscreens.

<span class="mw-page-title-main">Automatic image annotation</span>

Automatic image annotation is the process by which a computer system automatically assigns metadata in the form of captioning or keywords to a digital image. This application of computer vision techniques is used in image retrieval systems to organize and locate images of interest from a database.

Exploratory search is a specialization of information exploration which represents the activities carried out by searchers who are:

Sentiment analysis is the use of natural language processing, text analysis, computational linguistics, and biometrics to systematically identify, extract, quantify, and study affective states and subjective information. Sentiment analysis is widely applied to voice of the customer materials such as reviews and survey responses, online and social media, and healthcare materials for applications that range from marketing to customer service to clinical medicine. With the rise of deep language models, such as RoBERTa, also more difficult data domains can be analyzed, e.g., news texts where authors typically express their opinion/sentiment less explicitly.

Social information processing is "an activity through which collective human actions organize knowledge." It is the creation and processing of information by a group of people. As an academic field Social Information Processing studies the information processing power of networked social systems.

Wikipedia has been studied extensively. Between 2001 and 2010, researchers published at least 1,746 peer-reviewed articles about the online encyclopedia. Such studies are greatly facilitated by the fact that Wikipedia's database can be downloaded without help from the site owner.

Collaborative search engines (CSE) are Web search engines and enterprise searches within company intranets that let users combine their efforts in information retrieval (IR) activities, share information resources collaboratively using knowledge tags, and allow experts to guide less experienced people through their searches. Collaboration partners do so by providing query terms, collective tagging, adding comments or opinions, rating search results, and links clicked of former (successful) IR activities to users having the same or a related information need.

Folksonomy is a classification system in which end users apply public tags to online items, typically to make those items easier for themselves or others to find later. Over time, this can give rise to a classification system based on those tags and how often they are applied or searched for, in contrast to a taxonomic classification designed by the owners of the content and specified when it is published. This practice is also known as collaborative tagging, social classification, social indexing, and social tagging. Folksonomy was originally "the result of personal free tagging of information [...] for one's own retrieval", but online sharing and interaction expanded it into collaborative forms. Social tagging is the application of tags in an open online environment where the tags of other users are available to others. Collaborative tagging is tagging performed by a group of users. This type of folksonomy is commonly used in cooperative and collaborative projects such as research, content repositories, and social bookmarking.

<span class="mw-page-title-main">Human–computer interaction</span> Academic discipline studying the relationship between computer systems and their users

Human–computer interaction (HCI) is research in the design and the use of computer technology, which focuses on the interfaces between people (users) and computers. HCI researchers observe the ways humans interact with computers and design technologies that allow humans to interact with computers in novel ways. A device that allows interaction between human being and a computer is known as a "Human-computer Interface (HCI)".

In statistics and natural language processing, a topic model is a type of statistical model for discovering the abstract "topics" that occur in a collection of documents. Topic modeling is a frequently used text-mining tool for discovery of hidden semantic structures in a text body. Intuitively, given that a document is about a particular topic, one would expect particular words to appear in the document more or less frequently: "dog" and "bone" will appear more often in documents about dogs, "cat" and "meow" will appear in documents about cats, and "the" and "is" will appear approximately equally in both. A document typically concerns multiple topics in different proportions; thus, in a document that is 10% about cats and 90% about dogs, there would probably be about 9 times more dog words than cat words. The "topics" produced by topic modeling techniques are clusters of similar words. A topic model captures this intuition in a mathematical framework, which allows examining a set of documents and discovering, based on the statistics of the words in each, what the topics might be and what each document's balance of topics is.

Social navigation is a form of social computing introduced by Paul Dourish and Matthew Chalmers in 1994, who defined it as when "movement from one item to another is provoked as an artifact of the activity of another or a group of others". According to later research in 2002, "social navigation exploits the knowledge and experience of peer users of information resources" to guide users in the information space, and that it is becoming more difficult to navigate and search efficiently with all the digital information available from the World Wide Web and other sources. Studying others' navigational trails and understanding their behavior can help improve one's own search strategy by guiding them to make more informed decisions based on the actions of others.

<span class="mw-page-title-main">Aude Billard</span> Swiss physicist

Aude G. Billard is a Swiss physicist in the fields of machine learning and human-robot interactions. As a full professor at the School of Engineering at Swiss Federal Institute of Technology in Lausanne (EPFL), Billard’s research focuses on applying machine learning to support robot learning through human guidance. Billard’s work on human-robot interactions has been recognized numerous times by the Institute of Electrical and Electronics Engineers (IEEE) and she currently holds a leadership position on the executive committee of the IEEE Robotics and Automation Society (RAS) as the vice president of publication activities.

References

  1. Kang, Ruogu; Fu, Wai-Tat; Kannampallil, Thomas George (2010). "Exploiting knowledge-in-the-head and knowledge-in-the-social-web". Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. Atlanta, Georgia, USA: ACM Press. pp. 393–402. doi:10.1145/1753326.1753386. ISBN   9781605589299. S2CID   5261530.
  2. Furnas, George W.; Fake, Caterina; von Ahn, Luis; Schachter, Joshua; Golder, Scott; Fox, Kevin; Davis, Marc; Marlow, Cameron; Naaman, Mor (2006). "Why do tagging systems work?". CHI '06 Extended Abstracts on Human Factors in Computing Systems. Chi Ea '06. Montréal, Québec, Canada: ACM Press. pp. 36–39. doi:10.1145/1125451.1125462. ISBN   9781595932983. S2CID   33400901.
  3. Furnas, G. W.; Landauer, T. K.; Gomez, L. M.; Dumais, S. T. (1987-11-01). "The vocabulary problem in human-system communication". Communications of the ACM. 30 (11): 964–971. doi:10.1145/32206.32212. S2CID   3002280.
  4. Fu, Wai-Tat; Kannampallil, Thomas; Kang, Ruogu; He, Jibo (2010-07-01). "Semantic imitation in social tagging". ACM Transactions on Computer-Human Interaction. 17 (3): 1–37. doi:10.1145/1806923.1806926. S2CID   6964273.
  5. Hedström, Peter (December 12, 2005). Dissecting the social: on the principles of analytical sociology. Cambridge: Cambridge University Press. ISBN   0511136919. OCLC   62868580.
  6. 1 2 Chi, Ed H.; Mytkowicz, Todd (2008). "Understanding the efficiency of social tagging systems using information theory". Proceedings of the nineteenth ACM conference on Hypertext and hypermedia. Pittsburgh, PA, USA: ACM Press. pp. 81–88. doi:10.1145/1379092.1379110. ISBN   9781595939852. S2CID   14008770.
  7. 1 2 3 4 5 Cattuto, C.; Loreto, V.; Pietronero, L. (2007-01-30). "Semiotic dynamics and collaborative tagging". Proceedings of the National Academy of Sciences. 104 (5): 1461–1464. arXiv: cs/0605015 . doi: 10.1073/pnas.0610487104 . ISSN   0027-8424. PMC   1785269 . PMID   17244704.
  8. 1 2 Golder, Scott A.; Huberman, Bernardo A. (2006). "Usage patterns of collaborative tagging systems". Journal of Information Science. 32 (2): 198–208. doi:10.1177/0165551506062337. ISSN   0165-5515. S2CID   1946917.
  9. Eggenberger, F.; Pólya, G. (1923). "Über die Statistik verketteter Vorgänge". ZAMM - Zeitschrift für Angewandte Mathematik und Mechanik (in German). 3 (4): 279–289. Bibcode:1923ZaMM....3..279E. doi:10.1002/zamm.19230030407.
  10. 1 2 Halpin, Harry; Robu, Valentin; Shepherd, Hana (2007). "The complex dynamics of collaborative tagging". Proceedings of the 16th international conference on World Wide Web. Banff, Alberta, Canada: ACM Press. pp. 211–220. doi:10.1145/1242572.1242602. ISBN   9781595936547. S2CID   13935265.
  11. Robu, Valentin; Halpin, Harry; Shepherd, Hana (September 2009). "Emergence of consensus and shared vocabularies in collaborative tagging systems" (PDF). ACM Transactions on the Web. 3 (4): 1–34. doi:10.1145/1594173.1594176. S2CID   3330929.
  12. Kintsch, Walter (1988). "The role of knowledge in discourse comprehension: A construction-integration model". Psychological Review. 95 (2): 163–182. doi:10.1037/0033-295X.95.2.163. ISSN   1939-1471. PMID   3375398. S2CID   15246663.
  13. Fu, Wai-Tat (2008). "The microstructures of social tagging". Proceedings of the 2008 ACM conference on Computer supported cooperative work. San Diego, CA, USA: ACM Press. pp. 229–238. doi:10.1145/1460563.1460600. ISBN   9781605580074. S2CID   2202814.
  14. Fu, Wai-Tat (Aug 2009), "A Semantic Imitation Model of Social Tagging." (PDF), Proceedings of the IEEE Conference on Social Computing: 66–72, archived from the original (PDF) on 2009-12-29
  15. Clark, Herbert H. (1996). Using language . Cambridge, England. ISBN   0521561582. OCLC   33078546.{{cite book}}: CS1 maint: location missing publisher (link)