Advanced Analysis Pilot Research Project

Advanced Analysis of Bernard Lonergan's Digitized Corpus

Basics for Using the Self-Directed Analyzer

Work within Internet Explorer Web Browser.

Log onto Crivella West Knowledge Kiosk Site, using user ID and password.

Open one or more tabs and load Self-Directed Analyzer Windows, using supplied url.

Four Columns - adding a term within a column creates an "or" statement; going across a column creates an "and" statement; using the fourth column creates a "not" statement.

Create and run algorithm.

Review algorithm, eliminate garbage words by using fourth column, re-run algorithm.

Inspect summary results.

Inspect saliency, 2gram and 3gram results.

Modify algortihm, repeat.

Inspect summary results and select sample to read in context. Add relevant terms to algorithm.

Reiterate process.

Tips for Creating Algorithms

For “inner experience” you can broaden the search by using
inner /3 exper*
internal /3 exper*

To expose the constructed analyzers, add ?admin to the supplied Self-Directed Analyzer url.

This feature can be used to identify and eliminate garbage collected by a wildcard (*) search, simply add terms to be excluded to the fourth column of the analyzer.

Technical Reference Information - Updated 7 December 2011

The Self-Directed Analyzer has been temporarily locked to use the rationalized Lonergan corpus as the target set and the reference set. A new feature is being developed that will allow the user to set the target and reference sets independently. Until this feature is introduced, there is no need or opportunity to set the Doc Set ID.

The Crivella West team is working on a feature that will reveal the metadata for a KEID number, such as title, author and date of publication. A link to an excel file with a list of the KEID numbers for the rationalized Lonergan corpus is provided at the bottom of this page.

Current Lonergan Locator Code
37b5eaa1-50ec-4dec-b988-470970c014d1

Current Newman Locator Code
03e491c6-38f0-411c-95f1-4953ba94b1c0

Definition of Terms

Reference Set - set of texts used to define the expected frequency of terms.

Target Set - set of texts that are analyzed.

Result Set - a set of excerpts selected from the target set by an algorithm.

Rationalized Corpus - set of an author's available works that has eliminated duplications due to multiple printings or editions.

The saliency function in the Self-Directed Analyzer analyzes the frequency of terms found in the result set against the frequency of terms found in the reference set to determine which occur with significantly varied frequency.

Appreciation List - Gratitude is Extended to the Following:

Crivella West Inc., Pittsburgh PA
Arthur Crivella
Wayne West
Rich Ekstrom
Monty Crivella
Zakkiyya Asad
John Murphy

John M. Kelly Library, Toronto ON
Jonathan Bengston
Michael Bramah

University of Toronto Press
Lynn Fisher

Moderator Contact Information

Gordon Rixon, S.J.
gordon.rixon@utoronto.ca
416-922-5474 x225
www.RegisCollege.ca/faculty/gordon-rixon

 

AttachmentSize
lonergan_keid_list.xls88.5 KB
2011_12_07_-_keyword_lists_.xls87 KB

Lonergan Working Group Activities

Upcoming Events

Friday, December 16, 2011, 11:00: Lab Tutorial Session

Friday, February 3, 2012, 11:00: Lab Tutoral Session

Past Events

Wednesday, December 7, 2011: Lonergan KEID File Posted

Wednesday, December 7, 2011: Rationalized Lonergan Corpus set as Reference Set in Self-Directed Analyzer

Wednesday, December 7, 2011: Word Sets for Presets posted

Friday, November 25, 2011: Introductory Lab Tutorial Session

Friday, November 11, 2011: Information Session for Invited Participants