Below are the first 10 and last 10 pages of uncorrected machine-read text (when available) of this chapter, followed by the top 30 algorithmically extracted key phrases from the chapter as a whole.
Intended to provide our own search engines and external engines with highly rich, chapter-representative searchable text on the opening pages of each chapter.
Because it is UNCORRECTED material, please consider the following text as a useful but insufficient proxy for the authoritative book pages.
Do not use for reproduction, copying, pasting, or reading; exclusively for search engines.
OCR for page 1
Summary
This report responds to a request from the U.S. Department of Defense
(DOD) to identify engineering practices that have proved successful for
system development and testing in industrial environments. It is the latest
in a series of studies by the National Research Council (NRC), through
the Committee on National Statistics, on the acquisition, testing, and
evaluation of defense systems. The previous studies have been concerned
with the role of statistical methods in testing and evaluation, reliability
practices, software methods, combining information, and evolutionary
acquisition. This study was sponsored by DOD’s Director of Operational
Test and Evaluation (DOT&E) and the Under Secretary of Defense for
Acquisition, Technology, and Logistics (USD-AT&L). It was conducted by
the Panel on Industrial Methods for the Effective Test and Development
of Defense Systems.
The study panel’s charge was to plan and conduct a workshop to
explore how developmental and operational testing, modeling and simu -
lation, and related techniques can improve the development and perfor-
mance of defense systems, particularly techniques that have been shown
to be effective in industrial applications and are likely to be useful in
defense system development. This workshop was the panel’s main fact-
finding activity, which featured speakers who described practices from
software and hardware industries.
We emphasize that we could not, and did not, carry out a compre-
hensive literature review or examination of industrial and engineering
methods for system development. Rather, drawing on information from
1
OCR for page 2
2 INDUSTRIAL METHODS FOR EFFECTIVE DEVELOPMENT AND TESTING
the workshop and the experience and expertise of the panel’s members,
we focused on the techniques that have been found to be useful in indus-
trial system development and their applicability to the DOD environment,
while acknowledging the differences in the two environments. To that
end, we also considered the availability and access to data (especially
test data), the availability of engineering and modeling expertise, and the
organizational structure of defense acquisition.
Many, perhaps even most, of the industrial practices we discuss and
recommend are or have been used in DOD, but they are not systemati-
cally followed. We do not offer new policy or procedural recommenda-
tions when (1) the techniques are already represented in DOD acquisi-
tion policies and procedures, (2) DOD has been trying to implement the
desirable practices, or (3) the desirable practices have previously been
recommended in other NRC reports or by other advisory bodies. In these
cases we reiterate the benefits of and the need to fully adopt and follow
the relevant policies, procedures, and practices. We do offer recommenda-
tions to determine if the defense acquisition community is moving in the
wrong direction by reviewing policies, procedures, and practices that are
new or have elements that are new.
REQUIREMENTS SETTING
Conclusion 1: It is critical that there is early and clear communica -
tions and collaboration with users about requirements. In particu-
lar, it is extremely beneficial to get users, developers, and testers
to collaborate on initial estimates of feasibility and for users to
then categorize their requirements into a list of “must haves” and a
“wish list” with some prioritization that can be used to trade off at
later stages of system development if necessary.
Although communication with users is common in defense acquisi-
tion, the emphasis at the workshop was on a continuous exchange with
and involvement of users in the development of requirements. In addi -
tion, the industrial practice of asking customers to separate their needs
into a list of “must haves” and a “wish list” forces customers to carefully
examine a system’s needs and capabilities and any discrepancies between
them and thus make decisions early in the development process. It is also
important to use input from the test and evaluation community in the
setting of initial requirements.
Conclusion 2: Changes to requirements that necessitate a substan -
tial revision of a system’s architecture should be avoided as they
OCR for page 3
3
SUMMARY
can result in considerable cost increases, delays in development,
and even the introduction of other defects.
Having stable requirements during development allows the system
architecture to be optimized for a specific set of specifications, rather than
be modified in a suboptimal manner to try and accommodate various
updates and changes over time. However, there must also be some flex-
ibility that allows for modifications that are responsive to users’ needs
and changing environments. Although existing DOD regulations mandate
that changes in requirements must go through a rigorous engineering
assessment before they are approved, these regulations do not appear to
be strictly enforced.
Conclusion 3: Model-based design tools are very useful in pro-
viding a systematic and rigorous approach to requirements set-
ting. There are also benefits from applying them during the test
generation stage. These tools are increasingly gaining attention in
industry, including among defense contractors. Providing a com-
mon representation of the system under development will also
enhance interactions with defense contractors.
The term “model-based design tools’’ relates to formal methods used
to translate and quantify requirements from high-level system and sub -
system specifications, assess the feasibility of proposed requirements,
and help examine the implications of trading off various performance
capabilities (including various aspects of effectiveness and suitability,
including durability and maintainability). It has also been called model-
based engineering. In addition to rigorously assessing the feasibility of
proposed requirements and helping assess the results of “lowering” some
requirements while “raising” others, model-based design tools are known
to provide a range of benefits: a formal specification of the actual intent of
the functionality, they document the requirements; the model is execut -
able, so any ambiguities can be identified; the model can be used to auto -
matically generate test suites; and, possibly most importantly, the model
captures knowledge that can be preserved.
DOD should have expertise in these tools and technologies and use
them with contractors and users. More broadly, DOD should actively
participate, if not lead, in the development of model-based design tools.
Recommendation 1: The Office of the Undersecretary of Defense
for Acquisition, Technology, and Logistics and the Office of the
Director of Operational Test and Evaluation of the U.S. Department
of Defense and their service equivalents should acquire expertise
OCR for page 4
4 INDUSTRIAL METHODS FOR EFFECTIVE DEVELOPMENT AND TESTING
and appropriate tools related to model-based approaches for the
requirements setting process and for test case and scenario genera -
tion for validation.
DESIGN AND DEVELOPMENT
Technological Maturity and Assessment
Conclusion 4: The maturity of technologies at the initiation of an
acquisition program is a critical determinant of the program’s suc -
cess as measured by cost, schedule, and performance. The U.S.
Department of Defense (DOD) continues to be plagued by prob-
lems caused by the insertion of immature technology into the criti-
cal path of major programs. Since there are DOD directives that are
intended to ensure technological readiness, the problem appears
to be caused by lack of strict enforcement of existing procedures.
Technological immaturity is known to be a primary cause of schedule
slippage and cost growth in DOD program acquisition. Many studies,
including those of the National Research Council (2011), the Defense
Science Board (1990), and the U.S. General Accounting Office (1992) and
its successor, the U.S. Government Accountability Office (2004), have
discussed the dangers associated with inserting insufficiently mature
technologies in the critical path of DOD design and development.
Recommendation 2: The Under Secretary for Acquisition, Technol-
ogy, and Logistics of the U.S. Department of Defense (USD-AT&L)
should require that all technologies to be included in a formal
acquisition program have sufficient technological maturity, consis -
tent with TRL (technology readiness level) 7, before the acquisition
program is approved at Milestone B (or earlier) or before the tech-
nology is inserted in a later increment if evolutionary acquisition
procedures are being used. In addition, the USD-AT&L or the ser-
vice acquisition executive should request the Director of Defense
Research and Engineering (the DOD’s most senior technologist) to
certify or refuse to certify sufficient technological maturity before
a Milestone B decision is made. The acquisition executive should
also
• eview the analysis of alternatives assessment of technological
r
risk and maturity;
• btain an independent evaluation of that assessment as required
o
in DOD instruction (DODI) 5000.02; and
OCR for page 5
5
SUMMARY
• nsure, during developmental test and evaluation, that the
e
materiel developer shall assess technical progress and maturity
against critical technical parameters that are documented in the
Test and Evaluation Master Plan (TEMP).
A substantial part of the above recommendation is currently required
by law or by DOD instructions. Moreover, earlier NRC reports have also
made similar recommendations. DOD has been moving in the wrong
direction regarding the enforcement of an important and reasonable
policy as stated in DODI 5000.02.
Conclusion 5: The performance of a defense system early in devel-
opment is often not rigorously assessed, and in some cases the
results of assessments are ignored; this is especially so for suit -
ability assessments. This lack of rigorous assessment occurs in the
generation of system requirements; in the timing of the delivery of
prototype components, subsystems, and systems from the devel -
oper to the government for developmental testing; and in the deliv-
ery of production-representative system prototypes for operational
testing. As a result, throughout early development, systems are
allowed to advance to later stages of development when substantial
design problems remain. Instead, there should be clear-cut decision
making during milestones based on the application of objective
metrics. Adequate metrics do exist (e.g., contractual design speci -
fications, key performance parameters, reliability criteria, critical
operational issues). However, the primary problem appears to be a
lack of enforcement.
Defense systems should not pass milestones unless there is objec-
tive quantitative evidence that major design thresholds, key performance
parameters, and reliability criteria have been met or can be achieved with
minor product improvements.
Staged Development
Conclusion 6: There are substantial benefits to the use of staged
development, with multiple releases, of large complex systems,
especially in the case of software systems and software-intensive
systems. Staged development allows for feedback from customers
that can be used to guide subsequent releases.
The “agile development” process for software systems (discussed at
the workshop) is a disciplined framework that ensures that best practices
OCR for page 6
6 INDUSTRIAL METHODS FOR EFFECTIVE DEVELOPMENT AND TESTING
are consistently used throughout system development. A staged develop-
ment appears to be natural for large-scale complex software systems, and
it may also be appropriate for some hardware systems. Each of the stages
must retain the functionality of its predecessor systems, at the very least
to satisfy the natural expectations of the customer over time.
TESTING METHODS
The panel supports the recommendations on testing that have
appeared in previous reports on this topic by the NRC. These recom -
mendations have addressed the following issues:
• the importance of comprehensive test planning (National Research
Council, 1998)
• the benefits from use of state-of-the-art experimental design prin-
ciples and practices (National Research Council, 1998)
• the potential benefits from combining information for operational
assessment (National Research Council, 1998)
• that testing should be carried out with an operational perspective
(National Research Council, 2006)
• that testing should give greater emphasis to suitability (National
Research Council, 1998)
• the benefits from the use of accelerated reliability testing methods
(National Research Council, 1998)
COMMUNICATION, RESOURCES, AND INFRASTRUCTURE
Conclusion 1 highlights the need for early and clear communications
about requirements. In addition, industry representatives at the workshop
stressed the importance of collaboration and communication among cus -
tomers and program developers, as well as participants across all aspects
of system development and testing to avoid long, costly, and unsuccess -
ful product development programs. Leading industrial companies have
established programs to promote higher levels of collaboration among
suppliers, manufacturers, customers, service organizations, and the ulti -
mate users of the product.
A Data Archive
Conclusion 7: A data archive with information on developmental
and operational test and field data will provide a common frame-
work for discussions on requirements and priorities for develop-
ment. In addition, it can be used to expedite the identification of
OCR for page 7
7
SUMMARY
and correction of design flaws. Given the expenses and complexity
of developing such an archive, it is important that the benefits of a
data archive be adequately demonstrated to support development .
The collection and analysis of data on test and field performance,
including warranty data, is a standard feature in commercial industries.
The development of a data archive has been discussed in previous NRC
reports, and we repeat its importance here. One possible reason for DOD’s
failure to establish a data archive is the lack of an incentive to support
this and any other central activity. DOD needs to be convinced of the
advantages of building and maintaining such a database and then to
commission an appropriate group of people with experience in program
development to develop a concrete proposal on how the data archive
should be structured.
Recommendation 3: The U.S. Department of Defense should create
a defense system data archive containing developmental test, opera-
tional test, and field performance data from both contractors and
the government. Such an archive would achieve several important
objectives in the development of defense systems:
• ubstantially increase DOD’s ability to produce more feasible
s
requirements,
• upport early detection of system defects,
s
• mprove developmental and operational test design, and
i
• mprove modeling and simulation through better model validation.
i
As DOD initiates plans to begin creation of a defense system data
archive, at least three issues need immediate resolution: (1) whether the
archive should be DOD-wide or should be stratified by type of system
to limit its size, (2) what data are to be included and how the data ele -
ments should be represented to facilitate linkages of related systems, and
(3) what data-based management structure is used. A flexible architec-
ture should be used so that if the archive is initially limited to a subset
of the data sources recommended here due to budgetary considerations,
the archive can be readily expanded over time to include the remaining
sources.
Feedback Loops
Conclusion 8: Feedback loops can significantly improve system
development by improving both developmental and operational
test design and the use of modeling and simulation. Feedback
OCR for page 8
8 INDUSTRIAL METHODS FOR EFFECTIVE DEVELOPMENT AND TESTING
systems can function similarly to warranty management systems
that have proved essential to the automotive industry. To develop
feasible requirements, understanding how components installed
in related systems have performed when fielded is extremely use -
ful in understanding their limitations for possibly more stressful
use in a proposed system. To support such feedback loops, data
on field performance, test data, and results from modeling and
simulation must be easily accessible, which highlights the neces -
sity for a test and field data archive.
Field performance data are the ultimate indicators of how well a sys -
tem is functioning in operational conditions. By field performance data,
we also mean data on all the circumstances that can have an impact on the
quality of the components, subsystems, and systems. These data include
all relevant pre- and postdeployment activities, including transportation,
maintenance, implementation, and storage. They could also include train-
ing data, if such data were collected objectively. Such information can
and should be used to better understand the strengths and weaknesses
of newly fielded systems in undertaking various missions, including such
tactical information as identifying the scenarios in which the current sys -
tem should and should not be used. Unfortunately, these data are rarely
archived in a way that facilitates analysis.
Recommendation 4: After a test and field data archive has been
established, the Under Secretary of Defense for Acquisition, Tech-
nology, and Logistics (USD-AT&L) and the acquisition executives
in the military services should lead a U.S. Department of Defense
(DOD) effort to develop feedback loops on improving fielded sys-
tems and on better understanding tactics of use of fielded systems.
The DOD acquisition and testing communities should also learn to
use feedback loops to improve the process of system development,
to improve developmental and operational test schemes, and to
improve any modeling and simulation used to assess operational
performance.
SYSTEMS ENGINEERING EXPERTISE
Conclusion 9: In recent years, the U.S. Department of Defense has
lost much of its expertise in all the key areas of systems engineer-
ing. It is important to regain in-house capability in areas relating
to the design, development and operation of major systems and
OCR for page 9
9
SUMMARY
subsystems. One such area is expertise in the model-based design
tools as discussed earlier.
Commercial companies place a great deal of importance on systems
engineering expertise. This is key for system development as well as for
requirements setting, model development, and testing. Unfortunately,
DOD’s expertise in systems engineering was decimated by congressio -
nally mandated manpower reductions in the late 1990s and additional
reductions by the services. DOD has recognized this problem and is
taking steps to rectify it. However, given the time it will take to build up
that expertise in house, the DOD should examine the short-term use of
contractors, academics, employees of national laboratories, and others.
MANAGEMENT ISSUES
Enforcement of DOD Directives and Procedures
Conclusion 10: Many of the critical problems in the U.S. Depart -
ment of Defense acquisition system can be attributed to the lack
of enforcement of existing directives and procedures rather than to
deficiencies in them or the need for new ones.
As workshop participants noted, there are many studies, documents,
and DOD procedures relating to best practices. The problem is that they
are not systematically followed in practice.
Role of a DOD Program Manager
The role of program manager is noticeably different in industry than
in DOD. In industry, the program manager’s tenure covers the entire
product realization process, from planning, design, development, and
manufacturing to even initial phases of sales and field support, and the
program manager is fully responsible and accountable for all of these
activities. This tenure ensures a smooth transition across the different
phases of acquisition, as well as transfer of knowledge. In contrast, in
DOD the tenure of a program manager rarely covers more than one phase
of a project, and there is little accountability. Moreover, there is little incen-
tive for a DOD program manager to take a comprehensive approach to
seek and discover system defects or design flaws.
Recommendation 5: The Under Secretary of Defense for Acquisi-
tion, Technology, and Logistics should provide for an indepen-
OCR for page 10
10 INDUSTRIAL METHODS FOR EFFECTIVE DEVELOPMENT AND TESTING
dent evaluation of the progress of ACAT I systems in development
when there is a change in program manager. This evaluation should
include a review by the Office of the Secretary of Defense (OSD),
complemented by independent scientific expertise as needed, to
address outstanding technical manufacturing and capability issues,
to assess the progress of a defense system under the previous pro -
gram manager, and to ensure that the new program manager is
fully informed of and calibrated to present and likely future OSD
concerns.
Clearly, there are many details and challenges associated with devel-
oping and implementing this recommendation that are beyond the panel’s
scope and expertise. However, we emphasize that there are systemic prob-
lems with the current system of program management and that they are
serious obstacles to the implementation of efficient practices.