Tools

Tools AlphabeticallyTools by DisplayTools by Scope
  • Bubblelines (word distribution visualization)
  • Bubbles (text reading visualization)
  • Cirrus (word cloud A visual presentation of keywords drawn from a text, visually differentiated based on their position and frequency of use in that text. More Info on the TAPoR WordCloud Tool A good discussion on Tag Clouds Return to Glossary. visualization)
  • Corpus Grid (table of texts in a corpus)
  • Corpus Summary (corpus overview)
  • Corpus Term Frequencies (table of term frequencies by corpus)
  • Collocate Term Frequencies (table of term frequencies in proximity to keyword)
  • Document Term Frequencies (table of term frequencies by document)
  • Document KWICs (concordance A concordance is a gathering of passages that "concord" or agree. Usually it is a gathering of passages with a sought for word. Concordances are a form of reading tool that go back to the Middle Ages. They are typically lists of words with their appearances. A concordance for the bible, for example, would have entries for all the content words of the bible in alphabetical order. Each entry would include information about where the word appears and some context. Searching for words on a computer now typically returns a concordance called a Key Word in Context (KWIC) with the sought word down the center and a few words of context on either side. Google returns a type of concordance when you search for a word with an example of the word in context for each page it recommends. See the Wikipedia entry on Concordance (Publishing) Return to Glossary. or table of keywords in contextIn text analysis, context refers to the text surrounding a string of characters, which may be as short as a word or as long as a paragraph. Context is particularly important when generating a concordance for a string. Return to Glossary.)
  • Entities Browser (named entities visualization)
  • Knots (term occurrence visualization)
  • Lava (keyword in contextA concordance or keyword in context (KWIC) is usually represented as a list of occurrences of a word with some limited context shown (words to the left and words to the right). Here is an example that shows the occurrences of the word "dream" in A Midsummer Night's Dream in TACTweb: I.1/577.1 | Four nights will quickly dream away the time; | And I.1/578.2 Swift as a shadow, short as any dream; | Brief as the II.2/585.1 | Ay me, for pity! what a dream was here! | Lysander, III.2/591.1 this derision | Shall seem a dream and fruitless vision, | IV.1/593.1 as the fierce vexation of a dream. | But first I will IV.1/594.2 to me | That yet we sleep, we dream. Do not you think | The IV.1/594.2 rare | vision. I have had a dream, past the wit of man to IV.1/594.2 the wit of man to | say what dream it was: man is but an IV.1/594.2 he go | about to expound this dream. Methought I was--there IV.1/594.2 his heart to report, what my dream | was. I will get Peter IV.1/594.2 to write a ballad of | this dream: it shall be called IV.1/594.2 it shall be called Bottom's dream, | because it hath no V.1/599.1 | Following darkness like a dream, | Now are frolic: not a V.1/599.2 theme, | No more yielding but a dream, | Gentles, do not See also the definition at Wikipedia. Return to Glossary. visualization)
  • Links (term frequencies in proximity to keyword visualization)
  • Mandala (term browsing visualization)
  • Reader (large-scale document reader)
  • ScatterPlot (term distribution visualization)
  • Term Frequencies Chart (term distribution visualization)
  • Term Fountain (term frequencies visualization)
Bubblelines

Bubblelines is a visualization tool that helps to understand patterns of word repetition in one or more documents. Each document is represented as a horizontal lineA line is the string of text limited by the width of a page. Lines are often used in tokenization, and may contain parts of one or more sentences. For example "The quick brown fox jumps over the lazy dog." is a complete sentence and occurs on one line. By contrast, "Hard by a great forest dwelt a poor wood-cutter with his wife and his two children. The boy was called Hansel and the girl Gretel. He had little to bite and to break, and once when great dearth fell on the land, he could no longer procure even daily bread." spans three sentences and four lines. Return to Glossary. and each seach term is represented as a bubble – the bubble represents the frequency of the term in the corresponding segment of text (the text is divided into segments of equal length). The larger the bubble, the more frequent the term.

Bubbles

Bubbles reads the words in a document (or corpus) and displays the highest frequency words within proportionately large bubbles.

Cirrus

Cirrus is a visualization tool that displays a word cloud A visual presentation of keywords drawn from a text, visually differentiated based on their position and frequency of use in that text. More Info on the TAPoR WordCloud Tool A good discussion on Tag Clouds Return to Glossary. relating to the frequency of words appearing in one or more documents. One can click on any word appearing in the cloud to obtain detailed information about its relativity. The larger the word, the more frequent the term.

Corpus Grid

Corpus Grid shows an overview of the corpus, including each document's title, number of word tokens (total words), number or word types (unique words), and lexical density (the ratio of tokens to types).

Corpus Summary

Corpus Summary is a tool that provides a simple, textual overview of the current corpus. Features of this tool include number of words, number of unique words, longest documents, highest vocabulary density, most frequent words, notable peaks in frequency, and distinctive words. Users can click within these features for more detailed information of the analysis.

Corpus Term Frequencies

Corpus Term Frequencies shows overall word frequencies for the entire corpus as well as information about how word frequencies are spread out over documents within the corpus. Hover over column headers and buttons for more information.


 
Document Term Frequencies

Document Term Frequencies shows word frequencies for each document in the corpus. You can see the selected word at the top of the window highlighted in yellow. Its relevance to the documents is shown in the table below. Hover over the column headers or toolbar buttons for more information.

Document KWICs

Document KWICs shows a table of keywords in their contextIn text analysis, context refers to the text surrounding a string of characters, which may be as short as a word or as long as a paragraph. Context is particularly important when generating a concordance for a string. Return to Glossary.. In other words, it provides a list of certain keywords and their occurrence within a corpus or document.

Entities Browser (named entities visualization)
Knots

Knots is a visualization tool that helps to understand patterns of word relevance in one or more documents. Each term is represented as a twisted lineA line is the string of text limited by the width of a page. Lines are often used in tokenization, and may contain parts of one or more sentences. For example "The quick brown fox jumps over the lazy dog." is a complete sentence and occurs on one line. By contrast, "Hard by a great forest dwelt a poor wood-cutter with his wife and his two children. The boy was called Hansel and the girl Gretel. He had little to bite and to break, and once when great dearth fell on the land, he could no longer procure even daily bread." spans three sentences and four lines. Return to Glossary. – when the lines overlap it means a relevance or linkage within the terms.

Lava

Lava allows you to view multiple levels of a corpus in a three-dimensional environment. Clicking on certain documents within the corpus expands the Lava visualization in a ring to explore further. By clicking on certain parts of the visualization, you are able to explore terms within their contextIn text analysis, context refers to the text surrounding a string of characters, which may be as short as a word or as long as a paragraph. Context is particularly important when generating a concordance for a string. Return to Glossary..

Links

Links finds collocates for words and displays links between them using a force directed graph. It shows term frequencies in proximity to keyword. It is a visualization and shows a web of terms.

Collocate Term Frequencies (table of term frequencies in proximity to keyword)
Mandala

Mandala is a visualization tool that imports “textual” files to perform analysis on the frequency and linkage of words. For example, you may import a play and find the linkage and frequency between a word and its speaker.

Reader

Reader acts as a method of reading all documents within a specified corpus. It does not provide text analysis but rather a method of viewing the contents of a corpus.

ScatterPlot

ScatterPlot creates a scatter plot graph of terms, spaced by their variation from one another.

Term Frequencies Chart

Term Frequencies Chart shows how terms are distributed across document(s) in a corpus (documents are shown in the order in which they were added).

  
Term Fountain

Term Fountain visualizes word frequencies as a fountain.

Cirrus

 Cirrus is a visualization tool that displays a word cloud A visual presentation of keywords drawn from a text, visually differentiated based on their position and frequency of use in that text. More Info on the TAPoR WordCloud Tool A good discussion on Tag Clouds Return to Glossary. relating to the frequency of words appearing in one or more documents. One can click on any word appearing in the cloud to obtain detailed information about its relativity. The larger the word, the more frequent the term.

Getting Started

When you first arrive to the Cirrus tool you will see one of two possible screens:

Cirrus without a pre-loaded corpus. See loading texts into Voyeur for help on how to proceed.

Cirrus with a pre-loaded corpus. You were probably given a URLA URL (Uniform Resource Locator), sometimes called a web address, is used to locate and identify web content. For more information, see the Wikipedia. Return to Glossary. that included the corpus, or you're viewing a page that has an embedded Voyeur tool in it. If you prefer, you can also start without a Corpus.

Interface Elements

Cirrus includes the standard set of interface elements (see image to the right). For more help with these see the Voyeur Tools Standard Interface Elements page.

Standard UI Elements

Hovering over a word will cause a box to appear that displays the frequency count for that term.

Hovering over a word.

If a word is clicked, you are taken to an analysis of the word within the document(s). On the right side, Voyeur displays the relativity of the word in each of the document(s). (This is the Document Term Frequencies tool.)

Word relativity

Exporting

Like all Voyeur tools, Cirrus can be reused in a variety of ways:

  • create a link that is specific to the corpus and options that are currently being used
  • embed the current corpus and options as a tool in an external page

For more information see exporting and reusing Voyeur Tools.

Bubblelines

 Bubblelines is a visualization tool that helps to understand patterns of word repetition in one or more documents. Each document is represented as a horizontal lineA line is the string of text limited by the width of a page. Lines are often used in tokenization, and may contain parts of one or more sentences. For example "The quick brown fox jumps over the lazy dog." is a complete sentence and occurs on one line. By contrast, "Hard by a great forest dwelt a poor wood-cutter with his wife and his two children. The boy was called Hansel and the girl Gretel. He had little to bite and to break, and once when great dearth fell on the land, he could no longer procure even daily bread." spans three sentences and four lines. Return to Glossary. and each seach term is represented as a bubble – the bubble represents the frequency of the term in the corresponding segment of text (the text is divided into segments of equal length). The larger the bubble, the more frequent the term.

Getting Started

When you first arrive to the Bubblelines tool you will see one of two possible screens:

Bubblelines without a pre-loaded corpus. See loading texts into Voyeur for help on how to proceed.

Bubblelines with a pre-loaded corpus. You were probably given a URLA URL (Uniform Resource Locator), sometimes called a web address, is used to locate and identify web content. For more information, see the Wikipedia. Return to Glossary. that included the corpus, or you're viewing a page that has an embedded Voyeur tool in it. If you prefer, you can also start without a corpus.

Interface Elements

Bubblelines includes the standard set of interface elements (see image to the right). For more help with these see the Voyeur Tools Standard Interface Elements page.

Standard UI Elements

Hovering over a bubble, or set of bubbles, will cause a box to appear that displays the frequency counts for that segment of text.

Hovering over a Bubble

Similarly, hovering over the number at the end of the lineA line is the string of text limited by the width of a page. Lines are often used in tokenization, and may contain parts of one or more sentences. For example "The quick brown fox jumps over the lazy dog." is a complete sentence and occurs on one line. By contrast, "Hard by a great forest dwelt a poor wood-cutter with his wife and his two children. The boy was called Hansel and the girl Gretel. He had little to bite and to break, and once when great dearth fell on the land, he could no longer procure even daily bread." spans three sentences and four lines. Return to Glossary. will cause a box to appear that summarizes the frequency for the entire document.

Hovering over a Line

When Bubblelines first loads a corpus, you may see terms that have been pre-selected and included in the URLA URL (Uniform Resource Locator), sometimes called a web address, is used to locate and identify web content. For more information, see the Wikipedia. Return to Glossary. or embedded page. If no terms are specified, Bubblelines automatically fetches the five most frequent terms and displays bubbles based on those.

You can remove the default terms by clicking on the "Clear Terms" button.

Clear Terms

You can add additional terms to be displayed using the "Find Term" box. Note that available terms will appear as you type and you can pick an item from the list to have it added.

Clear Terms

In addition to adding and removing terms, you can toggle the display of the terms that have been loaded. To do so simply click on the term (active terms are underlined).

Clear Terms

Exporting

Like all Voyeur tools, Bubblelines can be reused in a variety of ways:

  • create a link that is specific to the corpus and options that are currently being used
  • embed the current corpus and options as a tool in an external page

For more information see exporting and reusing Voyeur Tools.

Corpus Summary

 Corpus Summary is a tool that provides a simple, textual overview of the current corpus. Features of this tool include number of words, number of unique words, longest documents, highest vocabulary density, most frequent words, notable peaks in frequency, and distinctive words. Users can click within these features for more detailed information of the analysis.

Getting Started

When you first arrive to the Corpus Summary tool you will see one of two possible screens:

Corpus Summary without a pre-loaded corpus. See loading texts into Voyeur for help on how to proceed.

Corpus Summary with a pre-loaded corpus. You were probably given a URLA URL (Uniform Resource Locator), sometimes called a web address, is used to locate and identify web content. For more information, see the Wikipedia. Return to Glossary. that included the corpus, or you're viewing a page that has an embedded Voyeur tool in it. If you prefer, you can also start without a corpus.

Interface Elements

Corpus Summary includes the standard set of interface elements (see image to the right). For more help with these see the Voyeur Tools Standard Interface Elements page.

Standard UI Elements

Once the analysis is complete, links to words and individual documents appear. Words appear highlighted in yellow. You may click on any of these links, and it will provide detailed information pertaining to the term or document.

If we click on a specific word, we see a chart explaining its appearance and relativity in different documents within the corpus. (This is the Document Term Frequencies tool.)

Exporting

Like all Voyeur tools, Corpus Summary can be reused in a variety of ways:

  • create a link that is specific to the corpus and options that are currently being used
  • embed the current corpus and options as a tool in an external page

For more information see exporting and reusing Voyeur Tools.

Corpus Term Frequencies

 Corpus Term Frequencies shows overall word frequencies for the entire corpus as well as information about how word frequencies are spread out over documents within the corpus. Hover over column headers and buttons for more information.

Getting Started

When you first arrive to the Corpus Term Frequencies tool you will see one of two possible screens:

Corpus Term Frequencies without a pre-loaded corpus. See loading texts into Voyeur for help on how to proceed.

Corpus Term Frequencies with a pre-loaded corpus. You were probably given a URLA URL (Uniform Resource Locator), sometimes called a web address, is used to locate and identify web content. For more information, see the Wikipedia. Return to Glossary. that included the corpus, or you're viewing a page that has an embedded Voyeur tool in it. If you prefer, you can also start without a corpus.

Interface Elements

Corpus Term Frequencies includes the standard set of interface elements (see image to the right). For more help with these see the Voyeur Tools Standard Interface Elements page.

Standard UI Elements

On the left you will see the most frequent terms highlighted in yellow, and in the other columns. In the columns on the right you will see how many times the word appears in the document(s), and a miniature graph showing its trend within the document(s).

Overview of Corpus Term Frequencies

At the bottom, the traditional Voyeur favourite buttons and search can be found. Click on the boxes next to any terms and click on the Favourite button to add them to your favourite terms.

Voyeur favourite buttons

Double-click on any word within the corpus to access the Document Term Frequencies tool.

Exporting

Like all Voyeur tools, Corpus Term Frequencies can be reused in a variety of ways:

  • create a link that is specific to the corpus and options that are currently being used
  • embed the current corpus and options as a tool in an external page

For more information see exporting and reusing Voyeur Tools.

Bubbles

 Bubbles reads the words in a document (or corpus) and displays the highest frequency words within proportionately large bubbles.

Getting Started

When you first arrive to the Bubbles tool you will see one of two possible screens:

Bubbles without a pre-loaded corpus. See loading texts into Voyeur for help on how to proceed.

Bubbles with a pre-loaded corpus. You were probably given a URLA URL (Uniform Resource Locator), sometimes called a web address, is used to locate and identify web content. For more information, see the Wikipedia. Return to Glossary. that included the corpus, or you're viewing a page that has an embedded Voyeur tool in it. If you prefer, you can also start without a corpus.

Interface Elements

Bubbles includes the standard set of interface elements (see image to the right). For more help with these see the Voyeur Tools Standard Interface Elements page.

Standard UI Elements

As more Bubbles load, you will see a list of terms. The higher the term is on the list, the higher frequency it has within the corpus. The same applies for the bubbles – the larger the bubble, the higher frequency it has within the corpus.

Different sizes of bubbles

At the bottom left of the screen, the tool displays the number of bubbles displayed out of the possible terms in the corpus. If the number is increasing, this means bubbles is still loading in more terms.

Number of terms in the corpus

Exporting

Like all Voyeur tools, Bubbles can be reused in a variety of ways:

  • create a link that is specific to the corpus and options that are currently being used
  • embed the current corpus and options as a tool in an external page

For more information see exporting and reusing Voyeur Tools.

Mandala

Mandala

Mandala is a visualization tool that imports “textual” files to perform analysis on the frequency and linkage of words. For example, you may import a play and find the linkage and frequency between a word and its speaker.

Getting Started 

Once you’ve launched Mandala, the interface should be blank. On the top left corner of Mandala, you have a few options for how to proceed with loading a file. You may click Open File to proceed to open a file of your choosing (preferably .xmlXML, or Extensible Markup Language, is a language used in web development to make a text readable by web browsers and/or store data. Like HTML, XML is primarily formed of paired elements. Unlike HTML, the elements are defined by the user, rather than predefined. For example, both < book >< /book > and < murfle >< /murfle > are valid element pairs. These elements apply characteristics and metadata to the text within them. One pair of elements may be nested inside another: < book >< title >< /title >< /book > Elements may also be modified by attributes and attribute values: < book format="hardcover" > In this case, the book element has the attribute 'format' and the attribute value 'hardcover'. In addition to storing metadata about the text, attribute/attribute value pairs are frequently used in combination with CSS to apply formatting to the text within the element. Return to Glossary. , .txt, or .zip), or you may choose any file listed in the dropdown menu. Dots Represent allows you to choose which tags in your file you would like to visualize. More often than not, Mandala will choose the option that will produce the most useful visualization. Finally, once you’ve made your selection, click Load to load the file into view. Additionally, you may load a new file into the current visualization with Merge.

File palette

If you’ve loaded RomeoJuliet.xmlXML, or Extensible Markup Language, is a language used in web development to make a text readable by web browsers and/or store data. Like HTML, XML is primarily formed of paired elements. Unlike HTML, the elements are defined by the user, rather than predefined. For example, both < book >< /book > and < murfle >< /murfle > are valid element pairs. These elements apply characteristics and metadata to the text within them. One pair of elements may be nested inside another: < book >< title >< /title >< /book > Elements may also be modified by attributes and attribute values: < book format="hardcover" > In this case, the book element has the attribute 'format' and the attribute value 'hardcover'. In addition to storing metadata about the text, attribute/attribute value pairs are frequently used in combination with CSS to apply formatting to the text within the element. Return to Glossary. , Mandala should look similar to this:

Example visualization

For this example, Mandala automatically visualizes the most frequent elementAn element, also called a tag, is characteristically used within HTML and XML to apply characteristics (such as headings, paragraphs or user-defined categories) or metadata to a document, usually a text. Elements generally appear in matching pairs of an opening element and a closing element, with text in between. All text within an element pair is modified by that element, and one element pair may be nested inside another. In the case of HTML, elements are used to format a text directly, or as a delimiter for CSS formatting to the text within that element. An HTML paragraph element: < p >< /p > In the case of XML, elements may be also be used as a delimiter for CSS formatting to the text within that element, but its primary purpose is to apply metadata to that text. Ex: < book format="hardcover" >< /book > Both HTML and XML elements may be modified with attribute/value pairs. In the above example, format="hardcover" is the attribute/value pair modifying the element < book >. Return to Glossary. in the represented tagA tag, also called an element, is characteristically used within HTML and XML to apply characteristics (such as headings, paragraphs or user-defined categories) or metadata to a document, usually a text. HTML and XML tags generally appear in matching pairs of an opening and a closing tag, with text in between. All text within a tag pair is modified by that tag, and one tag pair may be nested inside another. In the case of HTML, tags are used to format a text directly, or as a delimiter for CSS formatting to the text within that tag. An HTML paragraph tag: < p >< /p > In the case of XML, tags may be also be used as a delimiter for CSS formatting to the text within that tag, but its primary purpose is to apply metadata to that text. Ex: < book format="hardcover" >< /book > Both HTML and XML tags may be modified with attribute/value pairs. In the above example, format="hardcover" is the attribute/value pair modifying the tag < book >. Return to Glossary. (in this case, the most frequent speaker of a speech is Romeo.) It will then visualize one of the most frequent words within all of the speeches, and also visualize the matches between the speaker and the word. In this case, love is visualized and we see the matches between love and Romeo visualized with a blue/green magnet. Magnets are Mandala’s method of grouping differing elements on screen. If we click on any of the dots surrounding a magnet, we will see a speech displayed in the right-hand display called the reader panelWeb frameworks like the TAPoR Portal organize information into panels (sometimes called portlets or coplets.) These can me minimized, maximized and closed using the three buttons in the upper left-hand corner of the panel. With Voyant you can export panels of results and place them into other web sites. Return to Glossary.. We can also double-click on a magnet for all of its speeches, or even click-and-hold to lasso dots and display their speeches.

If you’d like to add a new search term, navigate to the Search palette and click Add New Magnet. Then, type in your new search criteria in the text box. Then click Go to add this new search term to our visualization:

Search palette

If we click on the dance magnet, we can see that we are unable to see who is actually speaking. If we’d like to see whose speaking, we need to click on the Display palette and on the Fields always displayed dropdown, click speech—speaker. If you return to the dance magnet, we will now be able to see who is speaking. 

Display palette

We can also add new magnets representing speakers. Click Add New Magnet and under Field, select speech—speaker. You may type in Juliet’s name, or whichever character you would like to visualize. Click Go. We now can see the subsets of where Juliet says the words love and dance

New speaker magnet

If you would like to see the finer details of your Mandala visualization, you may zoom in / out and adjust the screen with the Display palette. Click on the Display palette and you may zoom in / out with the ‘+’ and ‘-‘ signs. Another way to zoom is with a mousewheel / two-finger scroll. You may also navigate the screen with the ←, →, ↓, ↑ arrows. If you’d like more control, you can click on the hand tool to drag the screen around. We may also export our Mandala visualization by clicking Export on the bottom-right side of the window. If you choose Text, you can save a copy of the text displayed in the reader panelWeb frameworks like the TAPoR Portal organize information into panels (sometimes called portlets or coplets.) These can me minimized, maximized and closed using the three buttons in the upper left-hand corner of the panel. With Voyant you can export panels of results and place them into other web sites. Return to Glossary.. If you choose Screenshot, you can save a visual copy of the Mandala visualization.

Export panel

 

Advanced Features

Mandala allows you to perform specific queries pertaining to certain dots. You may, as mentioned earlier click-and-drag to select many dots, or you can use custom lasso tools that may assist you in highlighting certain dots:

Tools palette

To reset any selected dots, just choose Reset selection state from the Tools palette. Mandala also includes something called the Microtext PanelWeb frameworks like the TAPoR Portal organize information into panels (sometimes called portlets or coplets.) These can me minimized, maximized and closed using the three buttons in the upper left-hand corner of the panel. With Voyant you can export panels of results and place them into other web sites. Return to Glossary. which allows you to walk sequentially through the magnets. If you click on any of the gray bars, you will see the specified dots displayed in the right-hand panelWeb frameworks like the TAPoR Portal organize information into panels (sometimes called portlets or coplets.) These can me minimized, maximized and closed using the three buttons in the upper left-hand corner of the panel. With Voyant you can export panels of results and place them into other web sites. Return to Glossary.. The Microtext PanelWeb frameworks like the TAPoR Portal organize information into panels (sometimes called portlets or coplets.) These can me minimized, maximized and closed using the three buttons in the upper left-hand corner of the panel. With Voyant you can export panels of results and place them into other web sites. Return to Glossary. divides the number of dots evenly. You may also navigate this panelWeb frameworks like the TAPoR Portal organize information into panels (sometimes called portlets or coplets.) These can me minimized, maximized and closed using the three buttons in the upper left-hand corner of the panel. With Voyant you can export panels of results and place them into other web sites. Return to Glossary. by clicking Prev Page or Next Page on the bottom-right of the screen.

Microtext panel

If you would like to establish your Mandala window with every speaker in RomeoJuliet.xmlXML, or Extensible Markup Language, is a language used in web development to make a text readable by web browsers and/or store data. Like HTML, XML is primarily formed of paired elements. Unlike HTML, the elements are defined by the user, rather than predefined. For example, both < book >< /book > and < murfle >< /murfle > are valid element pairs. These elements apply characteristics and metadata to the text within them. One pair of elements may be nested inside another: < book >< title >< /title >< /book > Elements may also be modified by attributes and attribute values: < book format="hardcover" > In this case, the book element has the attribute 'format' and the attribute value 'hardcover'. In addition to storing metadata about the text, attribute/attribute value pairs are frequently used in combination with CSS to apply formatting to the text within the element. Return to Glossary. , under Search set Field to speech-speaker, then click on the text field and choose [All terms] from the top of the dropdown. This will then warn you about creating a multitude of magnets, click Yes to continue. You will then see a bunch of new magnets representing speakers in the play. You may double-click on them to preview their speeches.

Search palette (again)

It should also be noted that clicking Randomize in the Display palette will generate random magnets. This could potentially reveal some interesting trends within the text.

In addition to the sample files, you may load your own XMLXML, or Extensible Markup Language, is a language used in web development to make a text readable by web browsers and/or store data. Like HTML, XML is primarily formed of paired elements. Unlike HTML, the elements are defined by the user, rather than predefined. For example, both < book >< /book > and < murfle >< /murfle > are valid element pairs. These elements apply characteristics and metadata to the text within them. One pair of elements may be nested inside another: < book >< title >< /title >< /book > Elements may also be modified by attributes and attribute values: < book format="hardcover" > In this case, the book element has the attribute 'format' and the attribute value 'hardcover'. In addition to storing metadata about the text, attribute/attribute value pairs are frequently used in combination with CSS to apply formatting to the text within the element. Return to Glossary. / text files as well. Again, Mandala will choose the most suitable tags to use as fields, but you may specify your own when you are loading the file. (This can be done by either clicking on the Dots represent field or you may choose Custom and define your own XPath.) If you choose to load a text file, sections will be divided by paragraph. Also, if you have multiple files with similar structures that you’d like to analyse, you may import them all as a .zip file.

When you import your own files to be analyzed, Mandala may not automatically display magnets. In the Search palette click Add new magnet and you may customize what kind of Field and Match type that the magnet should represent.

 

Document KWICs

 Document KWICs shows a table of keywords in their contextIn text analysis, context refers to the text surrounding a string of characters, which may be as short as a word or as long as a paragraph. Context is particularly important when generating a concordance for a string. Return to Glossary.. In other words, it provides a list of certain keywords and their occurrence within a corpus or document.

Getting Started

When you first arrive to the Document KWICs tool you will see one of two possible screens:

Document KWICs without a pre-loaded corpus. See loading texts into Voyeur for help on how to proceed.

Document KWICs with a pre-loaded corpus. You were probably given a URLA URL (Uniform Resource Locator), sometimes called a web address, is used to locate and identify web content. For more information, see the Wikipedia. Return to Glossary. that included the corpus, or you're viewing a page that has an embedded Voyeur tool in it. If you prefer, you can also start without a corpus.

Interface Elements

Document KWICsA concordance or keyword in context (KWIC) is usually represented as a list of occurrences of a word with some limited context shown (words to the left and words to the right). Here is an example that shows the occurrences of the word "dream" in A Midsummer Night's Dream in TACTweb: I.1/577.1 | Four nights will quickly dream away the time; | And I.1/578.2 Swift as a shadow, short as any dream; | Brief as the II.2/585.1 | Ay me, for pity! what a dream was here! | Lysander, III.2/591.1 this derision | Shall seem a dream and fruitless vision, | IV.1/593.1 as the fierce vexation of a dream. | But first I will IV.1/594.2 to me | That yet we sleep, we dream. Do not you think | The IV.1/594.2 rare | vision. I have had a dream, past the wit of man to IV.1/594.2 the wit of man to | say what dream it was: man is but an IV.1/594.2 he go | about to expound this dream. Methought I was--there IV.1/594.2 his heart to report, what my dream | was. I will get Peter IV.1/594.2 to write a ballad of | this dream: it shall be called IV.1/594.2 it shall be called Bottom's dream, | because it hath no V.1/599.1 | Following darkness like a dream, | Now are frolic: not a V.1/599.2 theme, | No more yielding but a dream, | Gentles, do not See also the definition at Wikipedia. Return to Glossary. includes the standard set of interface elements (see image to the right). For more help with these see the Voyeur Tools Standard Interface Elements page.

Standard UI Elements

Document KWICsA concordance or keyword in context (KWIC) is usually represented as a list of occurrences of a word with some limited context shown (words to the left and words to the right). Here is an example that shows the occurrences of the word "dream" in A Midsummer Night's Dream in TACTweb: I.1/577.1 | Four nights will quickly dream away the time; | And I.1/578.2 Swift as a shadow, short as any dream; | Brief as the II.2/585.1 | Ay me, for pity! what a dream was here! | Lysander, III.2/591.1 this derision | Shall seem a dream and fruitless vision, | IV.1/593.1 as the fierce vexation of a dream. | But first I will IV.1/594.2 to me | That yet we sleep, we dream. Do not you think | The IV.1/594.2 rare | vision. I have had a dream, past the wit of man to IV.1/594.2 the wit of man to | say what dream it was: man is but an IV.1/594.2 he go | about to expound this dream. Methought I was--there IV.1/594.2 his heart to report, what my dream | was. I will get Peter IV.1/594.2 to write a ballad of | this dream: it shall be called IV.1/594.2 it shall be called Bottom's dream, | because it hath no V.1/599.1 | Following darkness like a dream, | Now are frolic: not a V.1/599.2 theme, | No more yielding but a dream, | Gentles, do not See also the definition at Wikipedia. Return to Glossary. requires not only input, but a specified term to perform the analysis. For example, in the image below, we see the analysis looking at all of the documents in relation to the word 'lord'. The document earliest in the corpus is shown at the top, and so forth. We can see on the left and right of the term (highlighted in yellow) the contextIn text analysis, context refers to the text surrounding a string of characters, which may be as short as a word or as long as a paragraph. Context is particularly important when generating a concordance for a string. Return to Glossary. it resides in.

Document KWICs standard screen

To expand the contextIn text analysis, context refers to the text surrounding a string of characters, which may be as short as a word or as long as a paragraph. Context is particularly important when generating a concordance for a string. Return to Glossary. for a preview of where it resides in the document, press the '+' button beside the listing.

Expanding the KWIC to see a preview

At the bottom, the traditional Voyeur favourite buttons and search can be found, along with Document KWICsA concordance or keyword in context (KWIC) is usually represented as a list of occurrences of a word with some limited context shown (words to the left and words to the right). Here is an example that shows the occurrences of the word "dream" in A Midsummer Night's Dream in TACTweb: I.1/577.1 | Four nights will quickly dream away the time; | And I.1/578.2 Swift as a shadow, short as any dream; | Brief as the II.2/585.1 | Ay me, for pity! what a dream was here! | Lysander, III.2/591.1 this derision | Shall seem a dream and fruitless vision, | IV.1/593.1 as the fierce vexation of a dream. | But first I will IV.1/594.2 to me | That yet we sleep, we dream. Do not you think | The IV.1/594.2 rare | vision. I have had a dream, past the wit of man to IV.1/594.2 the wit of man to | say what dream it was: man is but an IV.1/594.2 he go | about to expound this dream. Methought I was--there IV.1/594.2 his heart to report, what my dream | was. I will get Peter IV.1/594.2 to write a ballad of | this dream: it shall be called IV.1/594.2 it shall be called Bottom's dream, | because it hath no V.1/599.1 | Following darkness like a dream, | Now are frolic: not a V.1/599.2 theme, | No more yielding but a dream, | Gentles, do not See also the definition at Wikipedia. Return to Glossary. specific options. To cycle through the pages of KWICsA concordance or keyword in context (KWIC) is usually represented as a list of occurrences of a word with some limited context shown (words to the left and words to the right). Here is an example that shows the occurrences of the word "dream" in A Midsummer Night's Dream in TACTweb: I.1/577.1 | Four nights will quickly dream away the time; | And I.1/578.2 Swift as a shadow, short as any dream; | Brief as the II.2/585.1 | Ay me, for pity! what a dream was here! | Lysander, III.2/591.1 this derision | Shall seem a dream and fruitless vision, | IV.1/593.1 as the fierce vexation of a dream. | But first I will IV.1/594.2 to me | That yet we sleep, we dream. Do not you think | The IV.1/594.2 rare | vision. I have had a dream, past the wit of man to IV.1/594.2 the wit of man to | say what dream it was: man is but an IV.1/594.2 he go | about to expound this dream. Methought I was--there IV.1/594.2 his heart to report, what my dream | was. I will get Peter IV.1/594.2 to write a ballad of | this dream: it shall be called IV.1/594.2 it shall be called Bottom's dream, | because it hath no V.1/599.1 | Following darkness like a dream, | Now are frolic: not a V.1/599.2 theme, | No more yielding but a dream, | Gentles, do not See also the definition at Wikipedia. Return to Glossary., press the forward and back buttons by the 'page' display. To change the number of words on either side of the keyword, choose an option from the 'ContextIn text analysis, context refers to the text surrounding a string of characters, which may be as short as a word or as long as a paragraph. Context is particularly important when generating a concordance for a string. Return to Glossary.' menu. To change the number of words of the expanded preview, choose an option from the 'Preview' menu. Creating a new search will change the keyword within Document KWICsA concordance or keyword in context (KWIC) is usually represented as a list of occurrences of a word with some limited context shown (words to the left and words to the right). Here is an example that shows the occurrences of the word "dream" in A Midsummer Night's Dream in TACTweb: I.1/577.1 | Four nights will quickly dream away the time; | And I.1/578.2 Swift as a shadow, short as any dream; | Brief as the II.2/585.1 | Ay me, for pity! what a dream was here! | Lysander, III.2/591.1 this derision | Shall seem a dream and fruitless vision, | IV.1/593.1 as the fierce vexation of a dream. | But first I will IV.1/594.2 to me | That yet we sleep, we dream. Do not you think | The IV.1/594.2 rare | vision. I have had a dream, past the wit of man to IV.1/594.2 the wit of man to | say what dream it was: man is but an IV.1/594.2 he go | about to expound this dream. Methought I was--there IV.1/594.2 his heart to report, what my dream | was. I will get Peter IV.1/594.2 to write a ballad of | this dream: it shall be called IV.1/594.2 it shall be called Bottom's dream, | because it hath no V.1/599.1 | Following darkness like a dream, | Now are frolic: not a V.1/599.2 theme, | No more yielding but a dream, | Gentles, do not See also the definition at Wikipedia. Return to Glossary.. Click on any term listing and click on the Favourite button to add them to your favourite terms.

Document KWICs bottom toolbar

Exporting

Like all Voyeur tools, Document KWICsA concordance or keyword in context (KWIC) is usually represented as a list of occurrences of a word with some limited context shown (words to the left and words to the right). Here is an example that shows the occurrences of the word "dream" in A Midsummer Night's Dream in TACTweb: I.1/577.1 | Four nights will quickly dream away the time; | And I.1/578.2 Swift as a shadow, short as any dream; | Brief as the II.2/585.1 | Ay me, for pity! what a dream was here! | Lysander, III.2/591.1 this derision | Shall seem a dream and fruitless vision, | IV.1/593.1 as the fierce vexation of a dream. | But first I will IV.1/594.2 to me | That yet we sleep, we dream. Do not you think | The IV.1/594.2 rare | vision. I have had a dream, past the wit of man to IV.1/594.2 the wit of man to | say what dream it was: man is but an IV.1/594.2 he go | about to expound this dream. Methought I was--there IV.1/594.2 his heart to report, what my dream | was. I will get Peter IV.1/594.2 to write a ballad of | this dream: it shall be called IV.1/594.2 it shall be called Bottom's dream, | because it hath no V.1/599.1 | Following darkness like a dream, | Now are frolic: not a V.1/599.2 theme, | No more yielding but a dream, | Gentles, do not See also the definition at Wikipedia. Return to Glossary. can be reused in a variety of ways:

  • create a link that is specific to the corpus and options that are currently being used
  • embed the current corpus and options as a tool in an external page

For more information see exporting and reusing Voyeur Tools.

Document Term Frequencies

 Document Term Frequencies shows word frequencies for each document in the corpus. You can see the selected word at the top of the window highlighted in yellow. Its relevance to the documents is shown in the table below. Hover over the column headers or toolbar buttons for more information.

Getting Started

When you first arrive to the Document Term Frequencies tool you will see one of two possible screens:

Document Term Frequencies without a pre-loaded corpus. See loading texts into Voyeur for help on how to proceed.

Document Term Frequencies with a pre-loaded corpus. You were probably given a URLA URL (Uniform Resource Locator), sometimes called a web address, is used to locate and identify web content. For more information, see the Wikipedia. Return to Glossary. that included the corpus, or you're viewing a page that has an embedded Voyeur tool in it. If you prefer, you can also start without a corpus.

Interface Elements

Document Term Frequencies includes the standard set of interface elements (see image to the right). For more help with these see the Voyeur Tools Standard Interface Elements page.

Standard UI Elements

Document Term Frequencies requires not only input, but a specified term to perform the analysis. For example, in the image below, we see the analysis looking at all of the documents in relation to the word 'thou'. The document with the highest count is shown at the top, and so forth. We also have the documents' relativity in the columns in the right.

Document Term Frequencies overview

Document Term Frequencies is commonly accessed from other tools by clicking on individual terms. For example, if using the tool Cirrus, clicking on one of the words will bring you to a Document Term Frequencies analysis.

At the bottom, the traditional Voyeur favourite buttons and search can be found. Click on the boxes next to any terms and click on the Favourite button to add them to your favourite terms.

Favourite buttons

Exporting

Like all Voyeur tools, Document Term Frequencies can be reused in a variety of ways:

  • create a link that is specific to the corpus and options that are currently being used
  • embed the current corpus and options as a tool in an external page

For more information see exporting and reusing Voyeur Tools.

Corpus Grid

 Corpus Grid shows an overview of the corpus, including each document's title, number of word tokens (total words), number or word types (unique words), and lexical density (the ratio of tokens to types).

Getting Started

When you first arrive to the Corpus Grid tool you will see one of two possible screens:

Corpus Grid without a pre-loaded corpus. See loading texts into Voyeur for help on how to proceed.

Corpus Grid with a pre-loaded corpus. You were probably given a URLA URL (Uniform Resource Locator), sometimes called a web address, is used to locate and identify web content. For more information, see the Wikipedia. Return to Glossary. that included the corpus, or you're viewing a page that has an embedded Voyeur tool in it. If you prefer, you can also start without a corpus.

Interface Elements

Corpus Grid includes the standard set of interface elements (see image to the right). For more help with these see the Voyeur Tools Standard Interface Elements page.

Standard UI Elements

Once the analysis is complete, a grid will appear with information pertaining to all of the documents within the corpus. If you click on the column headers, you will find sorting information.

If you click on 'Group By This Field' when clicking on the column header, it will change the layout of the grid and thus sort by that field.

Exporting

Like all Voyeur tools, Corpus Grid can be reused in a variety of ways:

  • create a link that is specific to the corpus and options that are currently being used
  • embed the current corpus and options as a tool in an external page

For more information see exporting and reusing Voyeur Tools.

Knots

 Knots is a visualization tool that helps to understand patterns of word relevance in one or more documents. Each term is represented as a twisted lineA line is the string of text limited by the width of a page. Lines are often used in tokenization, and may contain parts of one or more sentences. For example "The quick brown fox jumps over the lazy dog." is a complete sentence and occurs on one line. By contrast, "Hard by a great forest dwelt a poor wood-cutter with his wife and his two children. The boy was called Hansel and the girl Gretel. He had little to bite and to break, and once when great dearth fell on the land, he could no longer procure even daily bread." spans three sentences and four lines. Return to Glossary. – when the lines overlap it means a relevance or linkage within the terms.

Getting Started

When you first arrive to the Knots tool you will see one of two possible screens:

Knots without a pre-loaded corpus. See loading texts into Voyeur for help on how to proceed.

Knots with a pre-loaded corpus. You were probably given a URLA URL (Uniform Resource Locator), sometimes called a web address, is used to locate and identify web content. For more information, see the Wikipedia. Return to Glossary. that included the corpus, or you're viewing a page that has an embedded Voyeur tool in it. If you prefer, you can also start without a corpus.

Interface Elements

Knots includes the standard set of interface elements (see image to the right). For more help with these see the Voyeur Tools Standard Interface Elements page.

Standard UI Elements

As you hover over segments of lines, you will see them become highlighted. If you click on them, you will be taken to the Document KWICs tool for an analysis of the word in contextIn text analysis, context refers to the text surrounding a string of characters, which may be as short as a word or as long as a paragraph. Context is particularly important when generating a concordance for a string. Return to Glossary.. You also have the option to drag the Knots visualization by clicking and dragging the centre point.

When Knots first loads a corpus, you may see terms that have been pre-selected and included in the URLA URL (Uniform Resource Locator), sometimes called a web address, is used to locate and identify web content. For more information, see the Wikipedia. Return to Glossary. or embedded page. If no terms are specified, Knots automatically fetches the five most frequent terms and displays lines based on those.

Initial loading of Knots

When Knots first loads a corpus, you may see terms that have been pre-selected and included in the URLA URL (Uniform Resource Locator), sometimes called a web address, is used to locate and identify web content. For more information, see the Wikipedia. Return to Glossary. or embedded page. If no terms are specified, Knots automatically fetches the five most frequent terms and displays bubbles based on those.

You can remove the default terms by clicking on the "Clear Terms" button.

Clear Terms

You can add additional terms to be displayed using the "Find Term" box. Note that available terms will appear as you type and you can pick an item from the list to have it added.

Find term box

In addition to adding and removing terms, you can toggle the display of the terms that have been loaded. To do so simply click on the term (active terms are underlined).

Toggle terms

Some options included in Knots are 'Build Speed', 'Starting Angle', and 'Tangles'. Build speed affects how quickly the visualization is performed. Starting angle adjusts the angle at which the lines expand and develop. Tangles affects how many twists there are within the visualization.

Knots options

Exporting

Like all Voyeur tools, Knots can be reused in a variety of ways:

  • create a link that is specific to the corpus and options that are currently being used
  • embed the current corpus and options as a tool in an external page

For more information see exporting and reusing Voyeur Tools.

Links

 Links finds collocates for words and displays links between them using a force directed graph. It shows term frequencies in proximity to keyword. It is a visualization and shows a web of terms.

Getting Started

When you first arrive to the Links tool you will see one of two possible screens:

Links without a pre-loaded corpus. See loading texts into Voyeur for help on how to proceed.

Links with a pre-loaded corpus. You were probably given a URLA URL (Uniform Resource Locator), sometimes called a web address, is used to locate and identify web content. For more information, see the Wikipedia. Return to Glossary. that included the corpus, or you're viewing a page that has an embedded Voyeur tool in it. If you prefer, you can also start without a corpus.

Interface Elements

Links includes the standard set of interface elements (see image to the right). For more help with these see the Voyeur Tools Standard Interface Elements page.

Standard UI Elements

When the visualization appears, you will see a list of the included documents within the corpus. They are each associated with a certain colour, and this colour is shown as term links within the visualization. A term's colour is determined by which document it appears in most frequently. You may add a word to the visualization by typing it into the 'Add word' box and hitting return.

Documents list

If you hover over a term, Voyeur will tell you it's linkage within the corpus documents. You may also drag and drop terms to organize them as you please.

Hovering over a term

Clicking on the options button (the button that looks like a gear) will launch a dialog box with various options pertaining to the Links tool. Stop words list is if you would like to exclude words from the visualization. (Usually words such as 'a', 'the', and 'and'.) 'NodeA node in a graph is the basic unit of data from which a graph can be constructed. In text analysis using a hypergraph, nodes connect to other nodes. Each node represents a word, and nodes touching where words are found in conjunction with one another in the source text. For more information on nodes, see the Wikipedia. Return to Glossary. size determined by type frequency' is the default, and will result in sorting by how often the term appears in the documents. Sorting by 'NodeA node in a graph is the basic unit of data from which a graph can be constructed. In text analysis using a hypergraph, nodes connect to other nodes. Each node represents a word, and nodes touching where words are found in conjunction with one another in the source text. For more information on nodes, see the Wikipedia. Return to Glossary. links' will result in terms appearing larger if they are heavily linked with other terms. 'Autofit graph on screen' sizes the graph depending on the size of your browser window. 'Remove orphans' will remove terms which are not linked to any other term in the visualization.

Links options

If you would like to manipulate the visualization, right-click on any of the terms and choose 'Stick/unstick' or 'Remove'. 'Stick/unstick' puts the term in place, and is not moved when other terms are moved. 'Remove' simply removes the term from the visualization.

Sticking a term

Exporting

Like all Voyeur tools, Links can be reused in a variety of ways:

  • create a link that is specific to the corpus and options that are currently being used
  • embed the current corpus and options as a tool in an external page

For more information see exporting and reusing Voyeur Tools.

Lava

 Lava allows you to view multiple levels of a corpus in a three-dimensional environment. Clicking on certain documents within the corpus expands the Lava visualization in a ring to explore further. By clicking on certain parts of the visualization, you are able to explore terms within their contextIn text analysis, context refers to the text surrounding a string of characters, which may be as short as a word or as long as a paragraph. Context is particularly important when generating a concordance for a string. Return to Glossary..

Getting Started

When you first arrive to the Lava tool you will see one of two possible screens:

Lava without a pre-loaded corpus. See loading texts into Voyeur for help on how to proceed.

Lava with a pre-loaded corpus. You were probably given a URLA URL (Uniform Resource Locator), sometimes called a web address, is used to locate and identify web content. For more information, see the Wikipedia. Return to Glossary. that included the corpus, or you're viewing a page that has an embedded Voyeur tool in it. If you prefer, you can also start without a corpus.

Interface Elements

Lava includes the standard set of interface elements (see image to the right). For more help with these see the Voyeur Tools Standard Interface Elements page.

Standard UI Elements

When you first launch Lava (with a predefined corpus and multiple documents), you will see what looks like a black pole. (If you launch with one document, Lava will look like a black disc.) If you hover on sections of this pole you will see it illuminate with the names of different documents within the corpus.

Hovering over a document

If you click on an illuminated part of the pole, Lava will expand as a colourful ring – each colour representing a high frequency term. If you hover over any of the coloured sections, it will illuminate and show the term that it represents.

Hovering over the ring

If you click on the term, it will expand as a coloured arm out from the centre. There are different sections of this arm as well. When you hover over them, it will show the term in the contextIn text analysis, context refers to the text surrounding a string of characters, which may be as short as a word or as long as a paragraph. Context is particularly important when generating a concordance for a string. Return to Glossary. of the document. Clicking this term will take you to a more in-depth analysis.

Hovering over the arm

Clicking other sections of the pole will expand other documents within the corpus so you may visualize and interact with them as well.

Expanding other documents

Exporting

Like all Voyeur tools, Lava can be reused in a variety of ways:

  • create a link that is specific to the corpus and options that are currently being used
  • embed the current corpus and options as a tool in an external page

For more information see exporting and reusing Voyeur Tools.

Reader

 Reader acts as a method of reading all documents within a specified corpus. It does not provide text analysis but rather a method of viewing the contents of a corpus.

Getting Started

When you first arrive to the Reader tool you will see one of two possible screens:

Reader without a pre-loaded corpus. See loading texts into Voyeur for help on how to proceed.

Reader with a pre-loaded corpus. You were probably given a URLA URL (Uniform Resource Locator), sometimes called a web address, is used to locate and identify web content. For more information, see the Wikipedia. Return to Glossary. that included the corpus, or you're viewing a page that has an embedded Voyeur tool in it. If you prefer, you can also start without a corpus.

Interface Elements

Reader includes the standard set of interface elements (see image to the right). For more help with these see the Voyeur Tools Standard Interface Elements page.

Standard UI Elements

When you load a collection of documents, you will see a panelWeb frameworks like the TAPoR Portal organize information into panels (sometimes called portlets or coplets.) These can me minimized, maximized and closed using the three buttons in the upper left-hand corner of the panel. With Voyant you can export panels of results and place them into other web sites. Return to Glossary. of coloured bars on the left of the screen. These bars represent the different documents within the corpus, and if you hover over them they will display the document title. Click on any of these documents to view it in Reader.

Documents list within Reader

At the bottom left of the screen you will see a search panelWeb frameworks like the TAPoR Portal organize information into panels (sometimes called portlets or coplets.) These can me minimized, maximized and closed using the three buttons in the upper left-hand corner of the panel. With Voyant you can export panels of results and place them into other web sites. Return to Glossary.. Type in any term you would like to find within the corpus and Reader will highlight it within the documents.

Reader search box

Once you begin typing, Reader will give you suggestions for terms as you type. As soon as you search for a term, certain coloured bars will change depending on if they properly fit the search.

Finding specific words in Reader

Exporting

Like all Voyeur tools, Reader can be reused in a variety of ways:

  • create a link that is specific to the corpus and options that are currently being used
  • embed the current corpus and options as a tool in an external page

For more information see exporting and reusing Voyeur Tools.

Term Fountain

 Term Fountain visualizes word frequencies as a fountain.

Getting Started

When you first arrive to the Term Fountain tool you will see one of two possible screens:

Term Fountain without a pre-loaded corpus. See loading texts into Voyeur for help on how to proceed.

Term Fountain with a pre-loaded corpus. You were probably given a URLA URL (Uniform Resource Locator), sometimes called a web address, is used to locate and identify web content. For more information, see the Wikipedia. Return to Glossary. that included the corpus, or you're viewing a page that has an embedded Voyeur tool in it. If you prefer, you can also start without a corpus.

Interface Elements

Term Fountain includes the standard set of interface elements (see image to the right). For more help with these see the Voyeur Tools Standard Interface Elements page.

Standard UI Elements

Once you load a corpus into Term Fountain, it will begin to generate a fountain. Hover over certain streams to see which word the stream represents. The higher the stream, the higher the term frequency.

Hovering over a Term

Exporting

Like all Voyeur tools, Term Fountain can be reused in a variety of ways:

  • create a link that is specific to the corpus and options that are currently being used
  • embed the current corpus and options as a tool in an external page

For more information see exporting and reusing Voyeur Tools.

Term Frequencies Chart

 Term Frequencies Chart shows how terms are distributed across document(s) in a corpus (documents are shown in the order in which they were added). The example below uses the play Macbeth and not the standard “entire plays of Shakespeare” corpus.

Getting Started

When you first arrive to the Term Frequencies Chart tool you will see one of two possible screens:

Term Frequencies Chart without a pre-loaded corpus. See loading texts into Voyeur for help on how to proceed.

Term Frequencies Chart with a pre-loaded corpus. You were probably given a URLA URL (Uniform Resource Locator), sometimes called a web address, is used to locate and identify web content. For more information, see the Wikipedia. Return to Glossary. that included the corpus, or you're viewing a page that has an embedded Voyeur tool in it. If you prefer, you can also start without a corpus.

Interface Elements

Term Frequencies Chart includes the standard set of interface elements (see image to the right). For more help with these see the Voyeur Tools Standard Interface Elements page.

Standard UI Elements

When you add analyze a corpus with Term Frequencies Grid, you will initially have common words at the top of the chart with colour codes. You will see lines within the graph which are coloured accordingly to those words. If you click on one of the terms at the top, it will omit that term from the graph.

Terms at the top of the chart

When we hover over the segment points, we can see the frequency of that term in that segment. So, in the example below the word “Macbeth” appears in the last tenth of the play Macbeth. If you click on the point, Voyeur will open a new window with detailed information of that segment and term within its Document KWICs tool.

Clicking on a segment point

If you click and drag on a section of the chart it will zoom in to that section. To reset the chart to its original state, click on “reset zoom”.

Zooming in on the chart

If you would like to see less or more segments on the chart, simply click on “Segments” at the bottom left of the chart to choose the desired segments.

Choosing segment amount

Exporting

Like all Voyeur tools, Term Frequencies Chart can be reused in a variety of ways:

  • create a link that is specific to the corpus and options that are currently being used
  • embed the current corpus and options as a tool in an external page

For more information see exporting and reusing Voyeur Tools.

Scatter Plot

 ScatterPlot creates a scatter plot graph of terms, spaced by their variation from one another.

Getting Started

When you first arrive to the ScatterPlot tool you will see one of two possible screens:

ScatterPlot without a pre-loaded corpus. See loading texts into Voyeur for help on how to proceed.

ScatterPlot with a pre-loaded corpus. You were probably given a URLA URL (Uniform Resource Locator), sometimes called a web address, is used to locate and identify web content. For more information, see the Wikipedia. Return to Glossary. that included the corpus, or you're viewing a page that has an embedded Voyeur tool in it. If you prefer, you can also start without a corpus.

Interface Elements

ScatterPlot includes the standard set of interface elements (see image to the right). For more help with these see the Voyeur Tools Standard Interface Elements page.

Standard UI Elements

When you first load ScatterPlot, you will see a variety of terms plotted on a graph. If you hover over the terms, you will see their variation explained by each component on the x and y axis. If you click on any of these terms, it will bring you to the Document KWICsA concordance or keyword in context (KWIC) is usually represented as a list of occurrences of a word with some limited context shown (words to the left and words to the right). Here is an example that shows the occurrences of the word "dream" in A Midsummer Night's Dream in TACTweb: I.1/577.1 | Four nights will quickly dream away the time; | And I.1/578.2 Swift as a shadow, short as any dream; | Brief as the II.2/585.1 | Ay me, for pity! what a dream was here! | Lysander, III.2/591.1 this derision | Shall seem a dream and fruitless vision, | IV.1/593.1 as the fierce vexation of a dream. | But first I will IV.1/594.2 to me | That yet we sleep, we dream. Do not you think | The IV.1/594.2 rare | vision. I have had a dream, past the wit of man to IV.1/594.2 the wit of man to | say what dream it was: man is but an IV.1/594.2 he go | about to expound this dream. Methought I was--there IV.1/594.2 his heart to report, what my dream | was. I will get Peter IV.1/594.2 to write a ballad of | this dream: it shall be called IV.1/594.2 it shall be called Bottom's dream, | because it hath no V.1/599.1 | Following darkness like a dream, | Now are frolic: not a V.1/599.2 theme, | No more yielding but a dream, | Gentles, do not See also the definition at Wikipedia. Return to Glossary. tool for further analysis.

Clicking on a term

ScatterPlot offers options for changing the plot. The terms button allows you choose how many terms should be displayed. The dimensions button lets you switch between a two or three dimensional graph. Toggle labels simply removes or adds labels for the terms on the graph.

Scatterplot's options

Exporting

Like all Voyeur tools, ScatterPlot can be reused in a variety of ways:

  • create a link that is specific to the corpus and options that are currently being used
  • embed the current corpus and options as a tool in an external page

For more information see exporting and reusing Voyeur Tools.