Menu Close

DocuBurst: Visualizing Document Content using Language Structure


Christopher Collins, Gerald Penn, Sheelagh Carpendale, Brittany Kondo, Bradley Chicoine

DocuBurst is the first visualization of document content that takes advantage of the human-created structure in lexical databases. We use an accepted design paradigm to generate visualizations that improve the usability and utility of WordNet as the backbone for document content visualization. A radial, space-filling layout of hyponymy (IS-A relation) is presented with interactive techniques of zoom, filter, and details-on-demand for the task of document visualization. The techniques can be generalized to multiple documents.

Check out the live demo here.

Media Coverage



The code for displaying and interacting with radial, space-filling trees in prefuse is open source and is available for download. The code is distributed as a zip file and can be imported into Eclipse. It is dependent on the prefuse information visualization toolkit and, unfortunately, is minimally documented at this time:


    [pods name="publication" id="4443" template="Publication Template (list item)" shortcodes=1]


WordNet Visualization


Christopher Collins

Interface designs for lexical databases in NLP have suffered from not following design principles developed in the information visualization research community. We present a design paradigm and show it can be used to generate visualizations that maximize the usability and utility of WordNet. The techniques can be generally applied to other lexical databases used in NLP research.


    [pods name="publication" id="4464" template="Publication Template (list item)" shortcodes=1]