Beyond cardsorting: Free-listing methods to explore user categorizations
by Rashmi Sinha on 2003/02/24 | [8 Comments]
So what is free-listing good for?
Free-listing can be used to understand the contents of a domain. For example, a practitioner designing the information architecture for an online book vendor might need to generate a list of book genres and subgenres. Or the practitioner might already possess a list of genres, but need to verify that the list is exhaustive. Or the goal of free-listing might be to arrange the genres according to centrality or salience in the user’s mind.
Free-listing can also serve as a way to gain familiarity with user vocabulary for the domain. As a precursor to cardsorting, it allows you to define and limit the domain in question, and frame card items in the user?s own language. Apart from helping in all these situations, free-listing can also serve as a rough proxy for similarity methods, such as cardsorting.
Free-listing might seem similar to open-ended questions about subjective preferences, such as “What cars do you like?,” but there is an important difference when free-listing is used to explore semantic domains. The assumption with semantic domains is that there is something common to people’s understanding of that domain (i.e., user understanding is not completely idiosyncratic). Free-listing is a good way to explore that common understanding of the domain.
How to conduct free-listing exercises
Free-listing is a semi-structured method. It can be conducted as part of an interview, or as a written exercise (and can be done online as well). Simply ask the respondent, “Name all the x’s you know.” Give them sufficient time to do so. If they stop after very few items, encourage them by saying, “Can you think of any more?”
Of course, the first question is: How many respondents does one need? And, expectedly, the answer is: It depends. Note that if there was perfect agreement about the domain, then one respondent would be sufficient for the free-listing exercise (the same principle holds true for cardsorting). While you will never encounter perfect agreement about a domain, the logical corollary to this principle is that the more the agreement about a domain, the fewer respondents you need. For many domains, 20 to 30 respondents are enough to get a clear picture (Weller & Romney, 1988).
The best way to judge how many respondents you need is to look at your own data after you have run five to six respondents. Create a list of all items, sorted by their average rank (of being listed by a respondent). Examine how that rank order changes with the addition of each new respondent. If the ranks are relatively stable, then you can stop adding new respondents.
Salience, frequency and rank
Across respondents, some items will occur more frequently than others. This is because some items are more central (or prototypical) to the concept than others. For example, in a list of everyday animals, “cat” will be mentioned more frequently than “rat.” An interesting property of free-lists is that frequency of item occurrence across people is related to item rank (Bousfield & Barclay, 1950). Thus, the higher the likelihood of “cat” being mentioned in a list of everyday animals, the higher the position of “cat,” or the earlier it will be mentioned by an individual respondent. This indicates that frequency and rank are manifestations of the same underlying property: salience of an item (Romney & Dandrade, 1984). Thus, free-lists provide two related indices of the psychological salience of an item in a domain.
![]() |
| Figure 1: Chart of item frequency |
One way to decide what items to keep for the next stage of analysis is to identify any natural break points in the data. Or, use all items that are mentioned by more than one user.
All roads lead to similarity
Understanding the similarity matrix is crucial to understanding methods such as free-listing or cardsorting. The similarity matrix is the key data structure in going from user data to information architecture. Data-gathering methods can be direct similarity (rate the similarity of these two items or put all similar cards into the same group) or indirect methods (list all the items in a domain). Data from such exercises can be converted into a similarity matrix, which forms the input for analysis such as cluster analysis.
![]() |
| Table 1: Similarity matrix |
Free-listing is not the best way to understand similarity between items, but there are two possible ways to compute similarity between items using free-listing data. Both methods should be considered an indirect index of similarity, inferior to other more direct measures such as cardsorting.
![]() |
| Table 2a: Item by respondent matrix |
![]() |
| Table 2b: Item co-occurrence (a type of similarity matrix) |
Either of these measuresco-occurrence or average rank distancewill give you a similarity matrix (see Table 2b above), which can then be examined through cluster analysis in the same way that cardsorting data can be examined.
What else can free-lists tell you?
This is a general introduction to free-listing. There are many other ways to use free-listing data. Free-lists can be used to identify whether some respondents have more domain familiarity than others. It can also help isolate respondents who, for some reason, might perceive the domain in a completely idiosyncratic fashion. Such a respondent could make it difficult for you to identify trends and patterns shared by other respondents (especially in a small sample). A follow-up method that asks respondents to generate multiple free-lists (each item in the first free-list is used as starting point for others) can be used to further understand the structure of the content domain. Another usage is to compare the content and boundaries of two or more domains.
Summary
Free-listing exercises are easy to conduct. Researchers have noted that people find it a natural task. Because of this, it can be used with all sorts of populations, including children and illiterate people. Simple spreadsheet-based analysis can be used to answer many different questions. This method is useful both as a precursor to cardsorting, or as an independent method to create understanding of the content domain.
![]()
- Borgatti, S. (1998). Elicitation Techniques for Cultural Domain Analysis. In Ethnographer’s Toolkit, edited by J. Schensul. Newbury Park: Sage.
- Bousfield, W.A. & Barclay, W.D. (1950). The Relationship Between Order and Frequency of Occurrence of Restricted Associative Responses. Journal of Experimental Psychology.
- Henley, N.M. (1969). A Psychological Study of the Semantics of Animal Terms. Journal of Verbal Learning and Verbal Behavior 8:176-184.
- Romney, A.K. & DAndrade, R.G. (1964). Cognitive Aspects of English Kin Terms. American Anthropologist 66(3, Part 2):146-170.
- Weller, S.C. & Romney, A.K. (1988). Systematic Data Collection, Thousand Oaks, CA: Sage.








Readers' Comments (8)
Reputation points
Posted 2003/02/26 @ 00:50AM with
Reputation points
Posted 2003/03/07 @ 11:00AM with
Reputation points
Posted 2003/03/07 @ 13:38PM with
Reputation points
Posted 2003/03/12 @ 05:33AM with
Reputation points
Posted 2003/03/13 @ 07:02AM with
Reputation points
Posted 2003/03/28 @ 00:54AM with
Reputation points
Posted 2003/04/30 @ 06:48AM with
B. Raj Giri
0 Reputation points
Posted 2007/07/19 @ 16:28PM with