| Content validation | Although the Access database was clearly of great value to its
users as a way of locating documents, the data suffers from a lack
of on-entry validation of field values, and enforcement of
referential integrity among its logical relationships. The
following points were noted:
Unmatched codes
Many records in the Documents table contain either null values or unmatched codes in fields which should join to a corresponding lookup table: the table below summarises this. Some of the missing codes have been established and are recorded in the Field Descriptions.
| Field |
Lookup table |
Unmatched values |
Total unmatched rows |
| DocGroupID |
DocGroupID |
MBlink, KOLD, MBlin |
304 (143 null) |
| CompanyID | CompanyID | AP, BL, BREL, BRI, BRM, BRPB, BRT, BYCN, DA, DC, Dee, Doc, FA, FE, GM, GU, HW, JA, JP, LA, LCC, LG, LI, LR, NCC, NU, OS, RCL, RDD, RES, RZ, SCC, SD, SM, TES, TF3, TRI, TT, WO | 27461 (26302 null) | | BoxNumber | Boxes | | 169 (169 null) | | BROriginatorDepartment | DeptCodes | BRB, BRML, BT, FP, OF, PBA, VEN. | 58503 (57381 null) |
Row numbers and Record IDs
The UniqueID field in the Documents table is a row number which
was automatically generated by Access as each new record was
created. However, the highest value in UniqueID (91270) does not
match the number of records in the Documents table (91259), because
there are no records with the UniqueID 90331 through 90341
(inclusive). This is the only break in the UniqueID sequence.
The last record in the Documents table (UniqueID = 91270) should
be considered blank: all fields contain only the default values
specified in the Access table definition. This is common in Access
tables where the only field specified as mandatory is an
automatically-generated primary key.
In three records in the Documents table where ParentID has been
specified, miskeying by the person entering the data has resulted
in clearly erroneous references to non-existent Document records.
For these records, NDAD has identified with reasonable certainty
what the real ParentID should be:
| UniqueID |
ParentID |
ParentID should be |
| 76334 |
94239 |
74239 |
| 78969 |
744451 |
74445 |
| 83605 |
741484 |
74148 |
Missing Values
The following is a summary for each field in the Documents table of the number of rows with blank, null or spaces entered in that field (zeros are not, for the purpose of this analysis, interpreted as missing values):
| Field |
Total missing |
| Barcode
|
85955
|
| OCR |
86
|
| Image |
86 |
| PartNo | 76273 | | ParentID | 924 | | Text1 | 22887
| | Text2 | 81003 | | Text3 | 87085 | | Text4 | 2487
| | CompanyID | 26302 | | BROriginatorDept | 57381 | | BROriginatorFunction | 89262
| | BROriginatorPerson | 83689 | | BROriginatorUserReference | 49980 | | StartDate | 89118 | | EndDate | 57396 | |
DocumentGroupID | 143 | | BoxNumber | 169
| | LocationImage | 1952
| | LocationText | 1952 | | LinkedDocument | 14361
|
Derived tables
Of the records in the Documents table, 20075 have related entries in the Images table. The Image records with UniqueID 90331 through 90342 inclusive (11 in all) have no corresponding record in the Documents table. These records also occur in the MultiSection table. |
|---|