|
|
All user interaction and system responses are logged in detail by "okapi" for potential analysis by researchers. The information logged for each search is stored in three files:
|
A complete, chronological history of the search. This includes all user-entered and RF terms at each iteration. |
|
The complete set of user-entered and RF terms at the last iteration. |
|
The complete set of
relevance judgements made by the searcher. |
The last two files are used by the application while running; the information contained in these two files is also contained in the history file. For this reason, only the history file will be documented here.
The commands and results returned by them are described in the following section.
Where:
command_no |
A sequential number allocated to each command issued. Results that
are generated by a given command will have the same command_no as
the command entry. e.g. A "search" command generates a new
document set and a new hitlist. Thus the "docset", "hl_title",
"hl_info" and "hl_terms" entries will have the same command_no as
the "search" entry.
"open_database" will always be command_no 0. |
topic_no | A number assigned to the search. |
elapsed_time | The time in seconds from the beginning of the search at which the entry was written to the history file. This will be the time the command was issued or the time at which the results were generated. |
The next (fourth) field is always the command/result name
corresponding to an entry in the name column in the above
History File Entry Types table.
These four fields are followed by zero, one or more fields
depending on the command or result. E.g. a full
open_database command might look like:
open_ database |
<database_name>:<success>
open_database:trec23_95:OK |
define |
<term>:[<operation>]
define:stock market:A |
query |
<termset_no>:<term_no>:<bss_set>:<np>:
<r>:<wgt>:<rsv>: <source>:<parsed>:<operation> query:0:0:2:14575:0:72:58:U:S:computerisation:computer:N |
search |
<termset_no>:<bss_set>:<weight>:<op_code>
where <op_code;gt IN [ ABSGN ]. G is a GSL phrase, N is a single indexed term, and [ABS] describe the different types of user-entered phrases recognised by the system. search:4:5:198:10:105:12:72:6:99:3:74:2:72:4:28769 |
docset |
<bss-set>:<np>:<maxwt>:<nmaxwt>:
<ngw>:<mpw>:<nmpw>
docset:4:28769:450:1:28769:1206:0 |
hl_title |
<iteration_no>:<set_recno>: <internal_recno>:
<docid>:<weight>: <passage_offset>:<passage_length>: <fulldoc_offset>:<fulldoc_length> hl_title:0:3:78668:FT931-11306:21.133:12:1569:12:7483 |
hl_info |
<hl_info>:<iteration_no>:<set_recno>:
<line_no>:<line>
Entries for one document might be:
hl_info:0:3:0:FT 17 JUL 92 / Fraud trials to come
More than two dozen |
hl_terms |
<hl_terms>:<iteration_no>:<set_recno>
<term source>:<document_tf>
Entries for one document might be:
hl_terms:0:3:0:Fraud:4 |
show |
<iteration_no>:<set_recno>:<docid>:
<weight>:<rel_length>:<relj>
Note: <rel_length> is the length of the full document. show:0:1:FT921-3159:21.821:2929:F |
expand |
<termset_no>
expand:5 |
remove |
<termset_no>:<term_no>:<source>:<opcode>
remove:5:3:Bloggs:n |
restore |
<termset_no>:<term_no>:<source>:<opcode>
remove:5:3:Bloggs:n |
Okapi-Pack Main Menu | Mail Okapi Support | Registration |