Work within Internet Explorer Web Browser.
Log onto Crivella West Knowledge Kiosk Site, using user ID and password.
Open one or more tabs and load Self-Directed Analyzer Windows, using supplied url.
Four Columns - adding a term within a column creates an "or" statement; going across a column creates an "and" statement; using the fourth column creates a "not" statement.
Create and run algorithm.
Review algorithm, eliminate garbage words by using fourth column, re-run algorithm.
Inspect summary results.
Inspect saliency, 2gram and 3gram results.
Modify algortihm, repeat.
Inspect summary results and select sample to read in context. Add relevant terms to algorithm.
Reiterate process.
For “inner experience” you can broaden the search by using
inner /3 exper*
internal /3 exper*
To expose the constructed analyzers, add ?admin to the supplied Self-Directed Analyzer url.
This feature can be used to identify and eliminate garbage collected by a wildcard (*) search, simply add terms to be excluded to the fourth column of the analyzer.
The Self-Directed Analyzer has been temporarily locked to use the rationalized Lonergan corpus as the target set and the reference set. A new feature is being developed that will allow the user to set the target and reference sets independently. Until this feature is introduced, there is no need or opportunity to set the Doc Set ID.
The Crivella West team is working on a feature that will reveal the metadata for a KEID number, such as title, author and date of publication. A link to an excel file with a list of the KEID numbers for the rationalized Lonergan corpus is provided at the bottom of this page.
Current Lonergan Locator Code
37b5eaa1-50ec-4dec-b988-470970c014d1
Current Newman Locator Code
03e491c6-38f0-411c-95f1-4953ba94b1c0
Definition of Terms
Reference Set - set of texts used to define the expected frequency of terms.
Target Set - set of texts that are analyzed.
Result Set - a set of excerpts selected from the target set by an algorithm.
Rationalized Corpus - set of an author's available works that has eliminated duplications due to multiple printings or editions.
The saliency function in the Self-Directed Analyzer analyzes the frequency of terms found in the result set against the frequency of terms found in the reference set to determine which occur with significantly varied frequency.
Crivella West Inc., Pittsburgh PA
Arthur Crivella
Wayne West
Rich Ekstrom
Monty Crivella
Zakkiyya Asad
John Murphy
John M. Kelly Library, Toronto ON
Jonathan Bengston
Michael Bramah
University of Toronto Press
Lynn Fisher
Gordon Rixon, S.J.
gordon.rixon@utoronto.ca
416-922-5474 x225
www.RegisCollege.ca/faculty/gordon-rixon
| Attachment | Size |
|---|---|
| lonergan_keid_list.xls | 88.5 KB |
| 2011_12_07_-_keyword_lists_.xls | 87 KB |