Below are the first 10 and last 10 pages of uncorrected machine-read text (when available) of this chapter, followed by the top 30 algorithmically extracted key phrases from the chapter as a whole.
Intended to provide our own search engines and external engines with highly rich, chapter-representative searchable text on the opening pages of each chapter.
Because it is UNCORRECTED material, please consider the following text as a useful but insufficient proxy for the authoritative book pages.
Do not use for reproduction, copying, pasting, or reading; exclusively for search engines.
OCR for page 25
26
Why Create a Knowledge Base? This knowledge base staff can perform other important
duties in conjunction with the release or analysis of data.
The best argument in favor of a select group of professionals These duties include calculating quality control measures,
to support linkages and analyses is that it will provide consis- processing data dictionaries and coding manuals, creating
tent and trained service for decision makers by delivering data basic standardized reports, updating contact information, and
and analytic support. This type of argument would require documenting the availability of the data. All of these prod-
that multiple agencies compare the cost of their analytic sup- ucts support the authorized users, including the custodial
port and data sharing efforts at this time and estimate what agencies.
they could relinquish if a single knowledge-based staff took
on selected tasks. By virtue of their experience working with each of the
data sources, the staff acquires knowledge of the contents
A likely impetus for creation of such a knowledge base and limitations of each database. They become ideally suited
comes from the inability of agencies to meet the rising to serve as an analytic resource to assist other users, and they
demand for access to data and requests for assistance in using can communicate the caveats about each database and help
the data. In an era of budget constraints and contraction of users formulate questions in a manner that can be reliably
agency services to only the "core" business missions, a cen- answered using the data. When users need help conducting
tralized knowledge base is one way to gain efficiency and pro- an analysis, the staff can explain the coding and structure of
vide a necessary service that normally would be outside of each file or they can conduct analyses for the users.
any single agency's purview. It is also a way to eliminate the
nonconstructive arguments between agencies about the "offi- Ancillary Benefits of a Knowledge Base
cial" numbers on traffic-related injuries, deaths, and costs
because everyone would be working from the same set of A knowledge base, whether through a data clearinghouse
numbers. Communication about the data and how to use the method or through a specialized staff method, brings the pos-
data correctly is also simplified when there is a single source. sibility of meeting users' needs in a highly visible manner. It
is assumed that better customer service will support expanded
use of the data by a larger number of people drawn from a
How a Knowledge Base Works
broader spectrum of users. To the extent that these new users
see themselves as stakeholders in the traffic records system,
On a periodic basis agreed to by each agency that acts as cus-
they can in turn support improvements to all the components
todian of a component of the traffic records system, a copy
of the traffic records system. One important possible benefit
of their data is forwarded to the data clearinghouse or the
of a traffic records knowledge base is the building of a coali-
staff resources assigned to be the knowledge base for safety
tion that will help to support expansion and improvement
linkage and analysis. The agency will also provide current
of the system that supports them. With this in mind, it is sug-
documentation, a data dictionary, and contact information.
gested that the knowledge base include customer service
Ideally, the data submitted are as complete as possible in that measurements and that the staff maintain a customer contact
personal identifiers are left in the file at this point to assist in database.
linkages. However, the staff would be responsible for remov-
ing any identifiers based on the custodial agency's require-
ments before releasing any information. Because files may SUMMARY
contain information that cannot be released to the public,
archival copies are generally not made available to anyone Data collection, data management, and data usage are the
except those authorized by the custodial agency and only for three areas that define the success of a crash records system.
limited purposes. There is currently no single system in the United States that
would meet any reasonable definition of a best practice
The staff will generate an extract for general release that approach to all three areas simultaneously. There are, how-
has all personal identifying information redacted in accor- ever, examples of systems that are successful in one area or
dance with applicable privacy laws. At that point, the data another. Based on these examples, and using the literature
will be available for general release to authorized users. The and consultant experience with traffic records systems, some
staff can then use the original data again to create one or more overall descriptions of systems that are possible with today's
linked databases as needed. Because linkages among traffic technology and could serve the needs of stakeholders at all
records components often are most reliable when using per- levels of the traffic safety community were developed. These
descriptions are summarized here.
sonal identifiers as linking variables, the personal informa-
tion is left in the files until the linkage is performed. When
this is not possible, or where that linkage fails, a probabilis- Crash Data Collection
tic matching process may be needed. The resulting linked
datasets can then be purged of personal identifiers and made The most promising approach to crash data collection is an
available to authorized users. automated field data collection tool that is used to capture
OCR for page 26
27
information as close to the event as possible. Field data col- data collection and electronic data transfer. The savings in
lection hardware can include a portable or in-vehicle com- reduced data entry, along with improvements in data quality
puter, GPS unit, magnetic strip and/or bar code reader, and and timeliness, will benefit all stakeholders.
other technology as desired. The officer using this tool would
be able to link with the state driver and vehicle data to com-
plete sections of the crash report without having to reenter Data Linkages for Reporting and Analysis
information that already exists electronically. Officers may
Crash data alone do not serve as the sole basis upon which to
also scan information directly from vehicle identification
make highway and traffic safety decisions. A comprehensive
numbers and/or registration documents, license plates, or driv-
traffic records system is required with linkable components
er's licenses to obtain information for their reports.
to support analyses of all types of data. In most states, a full
traffic records system could not exist in a single agency and
The field software tool can include edit checks that match
have it fit well with the core business of that agency. For
those in the statewide crash report system and prompt offi-
example, an agency that is responsible for issuing licenses to
cers to complete all required fields, including supplemental
drivers and titles to vehicles may not have the resources to
reports. A supervisor could then automatically review the
support other components of a traffic records system that do
resulting crash report. Once accepted, the report can be sent
not assist them in completing their agency's primary mis-
to the agency's local records management system, if desired,
sion. A traffic records knowledge base, either through a data
as well as to the statewide crash records system. This paper-
clearinghouse or through resources dedicated specifically for
less process would also support generation of a graphic
ad hoc data linkages and analyses, is a method for a state to
image of the form suitable for printing and archival storage.
achieve the goal of serving the needs of all highway and traf-
The primary advantage of automated field data collection
fic safety stakeholders.
software is a reduction in the time spent in records manage-
ment and supervisory review. The improvement in quality
A knowledge base supports all or most components of the
and timeliness of the crash data benefits all stakeholders in
traffic records system readily available to the users for analy-
the traffic safety community.
sis and reporting. Data sources are linked directly with the
crash data or linked indirectly through probabilistic matching.
Crash Data Management This type of knowledge base is one way to increase the util-
ity of crash data for less experienced users and to help build
Crash records systems must have the capability to accept data strong advocates for traffic records improvement throughout
electronically. Adding this capability often results in major the state. The Missouri DOT is an example of a directly
updates to the structure and processing of a statewide crash linked data system that primarily supports only that agency's
database. However, the document management and archival users. The Massachusetts data warehouse is an example of a
storage of crash reports must accommodate both electronic university-based system with Internet access for analysis and
and paper forms if the system does not create an electronic reporting given to all approved users. Although the number
image of the crash report form. Some manual post-processing of traffic records data clearinghouses is increasing, most
of crash information, especially for quality control of loca- states conduct data linkages on an ad hoc basis, often using
tion coding, is advisable even with automation of the field university-based staff.