Skip to main content

Currently Skimming:


Pages 108-121

The Chapter Skim interface presents what we've algorithmically identified as the most significant single chunk of text within every page in the chapter.
Select key terms on the right to highlight them within pages of the chapter.


From page 108...
... NCHRP Web-Only Document 279: Information Findability Implementation Pilots at 101 State Transportation Agencies 4.0 Iowa DOT Findability Tests 4.1 Planning and Scoping Iowa has an Electronic Records Management System (ERMS) that has long served as the agency's official system of record for construction project records.
From page 109...
... NCHRP Web-Only Document 279: Information Findability Implementation Pilots at 102 State Transportation Agencies each project. If a project number is available, then a data service can be used to identify the location.
From page 110...
... NCHRP Web-Only Document 279: Information Findability Implementation Pilots at 103 State Transportation Agencies "Slow" PDF to text conversion Pdfminer.six also extracts text data from PDFs, obtaining the exact location of characters on the page. This makes it take around 30 times longer than the "fast" method on average, but with greater fidelity allowing for more accurate text extraction.
From page 111...
... NCHRP Web-Only Document 279: Information Findability Implementation Pilots at 104 State Transportation Agencies 4.3 Solution Development and Testing Auto-classification A rule-based approach was applied to classify the documents. Following the rule-based analysis, a machine learning model was created to provide a comparison of effort as well as accuracy.
From page 112...
... NCHRP Web-Only Document 279: Information Findability Implementation Pilots at 105 State Transportation Agencies Figure 28. IADOT Plan example.
From page 113...
... NCHRP Web-Only Document 279: Information Findability Implementation Pilots at 106 State Transportation Agencies Recall and Precision Testing Results of the recall and precision testing are shown in Table 30. The methodology for these tests and analysis of results are provided below.
From page 114...
... NCHRP Web-Only Document 279: Information Findability Implementation Pilots at 107 State Transportation Agencies Precision We chose a random subsample of 100 documents categorized as each document type (for both the 2-category and 3-category classification runs)
From page 115...
... NCHRP Web-Only Document 279: Information Findability Implementation Pilots at 108 State Transportation Agencies the number of documents identified by the rule-based approach. Precision was computed as the ratio of the number of correctly classified documents to the total number of classified documents (including false positives)
From page 116...
... NCHRP Web-Only Document 279: Information Findability Implementation Pilots at 109 State Transportation Agencies number. Some 11 project numbers had to be adjusted manually.
From page 117...
... NCHRP Web-Only Document 279: Information Findability Implementation Pilots at 110 State Transportation Agencies classifying project proposals and plans, and extracting key metadata elements from the documents including project number, PIN and work type. The purpose of the implementation plan is to provide IADOT with a roadmap for future development and application of the techniques demonstrated in the NCHRP 20-97 test.
From page 118...
... NCHRP Web-Only Document 279: Information Findability Implementation Pilots at 111 State Transportation Agencies Task Explanation 10. Implement processes and tools for continued application of autoclassification and entity extraction techniques Once a solution is selected (commercial or open source tools)
From page 119...
... NCHRP Web-Only Document 279: Information Findability Implementation Pilots at 112 State Transportation Agencies 5. Establish document intake procedures for content management systems.
From page 120...
... NCHRP Web-Only Document 279: Information Findability Implementation Pilots at 113 State Transportation Agencies e. Modify and apply the auto-classification rules Modify the auto-classification rules from the pilot to reflect the condition(s)
From page 121...
... NCHRP Web-Only Document 279: Information Findability Implementation Pilots at 114 State Transportation Agencies • identifying a manager who will be responsible for this function, • identifying staff and/or external contract resources who will perform the work, • deploying the selected solution (if applicable) , • conducting training necessary to get staff up to speed with the selected solution, and • developing an initial work plan that defines activities and responsibilities for application of the techniques, and ongoing maintenance and refinement.

Key Terms



This material may be derived from roughly machine-read images, and so is provided only to facilitate research.
More information on Chapter Skim is available.