Below are the first 10 and last 10 pages of uncorrected machine-read text (when available) of this chapter, followed by the top 30 algorithmically extracted key phrases from the chapter as a whole.
Intended to provide our own search engines and external engines with highly rich, chapter-representative searchable text on the opening pages of each chapter.
Because it is UNCORRECTED material, please consider the following text as a useful but insufficient proxy for the authoritative book pages.
Do not use for reproduction, copying, pasting, or reading; exclusively for search engines.
OCR for page 51
2
Elements of a Computer Science
Research Agenda for Sustainability
The discussion of sustainability challenges in Chapter 1 shows that
there are numerous opportunities for information technology (IT) to have
an impact on these global challenges. A chief goal of computer science
(CS) in sustainability can be viewed as that of informing, supporting,
facilitating, and sometimes automating decision making—decision mak-
ing which leads to actions that will have significant impacts on achieving
sustainability objectives. The committee uses the term “decision making”
in a broad sense—encompassing individual behaviors, organizational
activities, and policy making. Informed decisions and their associated
actions are at the root of all of these activities.
A key to enabling information-driven decision making is to estab-
lish models and feed them with measurement data. Various algorithmic
approaches, such as optimization or triggers, can be used to support
and automate decisions and to drive action. Sensing—that is, taking
and collecting measurements—is a core component of this approach. In
many cases, models are established on the basis of previous work in the
various natural sciences. However, in many cases such models have yet
to be developed, or existing models are insufficient to support decision
making and need to be refined. To discover models, multiple dimensions
of data need to be analyzed, either for the testing of a hypothesis or the
establishing of a hypothesis through the identification of relationships
among various dimensions of measured data. Data-analysis and data-
mining tools—some existing and some to be developed—can assist with
this task.
51
OCR for page 52
52 COMPUTING RESEARCH FOR SUSTAINABILITY
Once a model is established, “what-if” scenarios can be simulated,
evaluated, and used as input for decision making. Modeling and simula-
tion tools vary widely, from spreadsheets to highly sophisticated model-
ing environments. When a model reaches a certain maturity and trust
level, algorithms, such as optimizations or triggers, can be deployed to
automate the decision making if automation is appropriate (for example,
in terms of actuation). Alternatively, information can be distilled and
presented in visual, interactive, or otherwise usable ways so that other
agents—individuals, organizations and businesses, and policy mak-
ers and governments—can deliberate, coordinate, and ultimately make
appropriate, better-optimized choices and, ultimately, actions.
All of the steps described above must be done in an iterative fashion.
Given that most sustainability challenges involve complex, interacting
systems of systems undergoing constant change, all aspects of sens-
ing, modeling, and action need to be refined, revised, or transformed
as new information and deeper understandings are gained. A strong
approach is to deploy technology in the field using reasonably well
understood techniques to explore the space and to map where there are
gaps needing work. Existing data and models then help provide context
for developing qualitatively new techniques and technologies for even
better solutions.
FINDING: Enabling and informing actions and decision making by
both machines and humans are key components of what CS and IT
contribute to sustainability objectives, and they demand advances
in a number of topics related to human-computer interaction. Such
topics include the presentation of complex and uncertain informa-
tion in useful, actionable ways; the improvement of interfaces for
interacting with very complex systems; and ongoing advances in
understanding how such systems interact with individuals, orga-
nizations, and existing practices.
Many aspects of computer science and computer science research
are relevant to these challenges. In this chapter, the committee describes
four broad research areas, listed below, that can be viewed as organizing
themes for research programs and that have the potential for significant
positive impact on sustainability. The list is not prioritized. Efforts in all
of the areas will be needed, often in tandem.
• Measurement and instrumentation;
• Information-intensive systems;
• Analysis, modeling, simulation, and optimization; and
• Human-centered systems.
OCR for page 53
ELEMENTS OF A COMPUTER SCIENCE RESEARCH AGENDA 53
For each area, examples of research problems focused on sustain-
ability opportunities are given. The discussions do not provide a com-
prehensive list of problems to be solved, but do provide exemplars of the
type of work that both advances computer science and has the potential
to advance sustainability objectives significantly. In examining oppor-
tunities for research in CS and sustainability, questions that one should
attempt to answer include these: What is the potential impact for sustain-
ability? What is the level of CS innovation needed to make meaningful
progress?
As discussed in Chapter 1, complete solutions to global sustainability
challenges will require deep economic, political, and cultural changes.
With regard to those changes, the potential role for CS and IT research
discussed in this chapter is often indirect, but it is still important. For
example, CS research could focus on innovative ways for citizens to delib-
erate over and to engage with government and with one another about
these issues, with the deliberations closely grounded in data and scientific
theory. For some critical sustainability challenges, such as the anticipated
effects of global population growth, the potential CS research contribution
is almost entirely of this indirect character. For instance, there is potential
for using the results of modeling and visualization research toward the
aim of improved education and better understanding of population and
related issues. In addition, advances in IT in the areas of remote sensing,
network connectivity services, adaptive architectures, and approaches for
enhanced health diagnosis and care delivery—especially in rural areas—
also have a bearing on population concerns. Other contributions from CS
and IT research toward meeting such challenges could be aimed at devel-
oping tools to support thoughtful deliberation, with particular emphasis
on encompassing widely differing views and perspectives.
The research areas described in this chapter correspond well with
the broader topics of measurement, data mining, modeling, control,
and human-computer interaction, which are, of course, well-established
research areas in computer science. This overlap with established research
areas has positive implications—in particular, the fact that research
communities are already established making it unnecessary to develop
entirely new areas of investigation. At the same time, the committee
believes that there is real opportunity in these areas for significant impacts
on global sustainability challenges. Finding a way to achieve such impacts
effectively may require new approaches to these problems and almost
certainly new ways of conducting research.
In terms of a broad research program, an important question is how
to structure a portfolio that spans a range of fundamental questions, pilot
efforts, and deployed technologies while maintaining focus on sustain-
ability objectives. For any given research area in the sustainability space,
OCR for page 54
54 COMPUTING RESEARCH FOR SUSTAINABILITY
efforts can have an impact in a spectrum of ways. First, one can explore
the immediate applicability of known techniques: What things do we
know how to do already with computational techniques and tools, and
how can we immediately apply them to a given sustainability challenge?
Second, one can seek opportunities to apply known techniques in inno-
vative ways: Where are the opportunities in which the straightorward f
application of a known technique will not work but where it seems prom-
ising to transform or translate a known technique into the domain of a
particular sustainability challenge? This process tends to transform the
techniques themselves into new forms. Finally, one can search for the
areas in which innovation and the development of fundamentally new
computer science techniques, tools, and methodologies are needed to
meet sustainability challenges. While endorsing approaches across this
spectrum, the committee urges emphasis on solutions that have the poten-
tial for significant impacts and urges the avoidance of simply developing
or improving technology for its own sake.
The advancing of sustainability objectives is central to the research
agenda outlined in this report. As in any solution-oriented research space,
there is a tension between solving a substantive domain problem, perhaps
creating tools, techniques, and methods that are particularly germane
to the domain, and tackling generalized problems, perhaps motivated
by the domain, for which solutions advance the broader field. (Chap-
ter 3 discusses this challenge in more detail and provides the commit-
tee’s recommendations on structuring research programs and developing
research communities in ways that constructively address these issues.)
When focusing on the challenges presented in a particular domain, it is
often essential that the details are right in order for the work to have mean-
ingful impact. For the work to have broader impact, it must be possible
to transcend the details of a particular problem and setting. Much of the
power in computer science derives from the development of appropriate
abstractions that capture essential characteristics, hide unnecessary detail,
and permit solutions to subproblems to be composed into solutions to
larger problems. A focus on getting the abstraction right for large impact,
appropriability, and generalizability is important. Simultaneously, it is
important to characterize aspects of the solution that are not generalizable.
FINDING: Although current technologies can and should be put
to immediate use, CS research and IT innovation will be critical to
meeting sustainability challenges. Effectively realizing the poten-
tial of CS to address sustainability challenges will require sus-
tained and appropriately structured and tailored investments in
CS research.
OCR for page 55
ELEMENTS OF A COMPUTER SCIENCE RESEARCH AGENDA 55
PRINCIPLE: A CS research agenda to address sustainability should
incorporate sustained effort in measurement and instrumentation;
information-intensive systems; analysis, modeling, simulation, and
optimization; and human-centered systems.
MEASUREMENT AND INSTRUMENTATION
Historically, sensors, meters, gauges, and instruments have been
deployed and used within the vertically integrated context of a single
task or system. For example, a zone thermostat triggers the inflow of cold
or hot air into specific rooms when the measured air temperature devi-
ates from the target set point by an amount in excess of the guard band;
the manifold pressure sensor in a car dictates the engine ignition timing
adjustment; the household electric meter is the basis for the monthly
utility bill; water temperature, salinity, and turbidity sensors are placed
at particular junctures in a river to determine the effects of mixing and
runoff; and so on. Examples of specific scenarios are innumerable and
incredibly diverse, but they have in common the following: the selection
of the measurement device, its placement and role in the encompassing
system or process, and the interpretation of the readings it produces are
all determined a priori, at design time, and the resulting system is essen-
tially closed—sensor readings are not used outside the system.1
This situation has changed dramatically over the past couple of
decades owing to the following key factors:
• Embedded computing. Until the 1990s, the electronics associated with
the analog-to-digital conversion, the rescaling to engineering units, and
the associated storage and the data processing dwarfed the size and cost
of the transducer used to convert the physical phenomenon to an electri-
cal signal. Consequently, these electronics were shared resources wired to
remote sensors. Over the past 20 years, digital electronics have shrunk to a
small fraction of their former size and cost, have been integrated directly
into the sensor or actuator, and have expanded in function to include
quite general processing, storage, and communication capabilities. The
1In settings in which the transducer is physically and logically distinct from the enclos-
ing system, typified by the Highway Addressable Remote Transducer (HART) for process
control and Building Automation and Control Networks (BACnet) for building automation,
readings are obtained over a standardized protocol, but their interpretation remains entirely
determined by the context, placement, and role of the device in the larger process. The use
of the information produced by the physical measurement, and hence its semantics, are
contained within the enclosing system.
OCR for page 56
56 COMPUTING RESEARCH FOR SUSTAINABILITY
configurable, self-contained nature of modern instrumentation reduces
the costs of deployment and enables broader use.
• Information-rich operation. The primary control loop of opera-
tional processes (typically represented in manufacturing as plant-sensor-
controller-actuator-plant) is usually augmented with substantial second-
ary instrumentation to permit optimization. For example, in refinery
process control, such additional instrumentation streams help to tune
controllers to increase yield or reduce harmful by-products. In semicon-
ductor manufacturing, they are employed in conjunction with small-
scale process perturbation and large-scale statistical analysis in order to
shorten the learning curve and reach a final configuration more quickly.
In environmental conditioning for buildings, multiple sensing points are
aggregated into zone controllers. Automotive instruments are fused to
present real-time mileage information to the driver.
• Cross-system integration. Measurements designed for one sys-
tem are increasingly being exploited to improve the quality or perfor-
mance of others. For example, light and motion sensors are installed to
modulate the amount of artificially supplied lighting in many “green
buildings.” But those motion detectors are then also available to serve
as occupancy indicators in sophisticated heating, ventilation, and air
conditioning (HVAC) controls. Rather than simply isolating indoor cli-
mate from external factors, modern design practice may seek to exploit
passive ventilation, heating, and cooling; to do so requires the instru-
mentation of building configuration (such as open and closed window
and door states) and of external and internal environmental proper-
ties (temperature, humidity, wind speed, etc.). All of these sources of
information may also be exploited for longitudinal analysis, to drive
recommissioning, retrofitting, and refining operations. Interval util-
ity meter readings are used not just for time-of-use pricing but also to
guide energy-efficiency measures. Traffic measurements and content-
condition instrumentation are applied to optimize logistics operations.
The factors described above have changed the role of instrumentation
and measurement from a subsidiary element of the system design process
to an integrative, largely independent process of design and provision-
ing of physical information services. For many sustainability challenges,
methodologies are needed that can start with an initial model that is based
on modest amounts of data collected during the design process; those
methodologies would then include the development of an incremental
plan for deploying sensors that progressively improves the model and
exploits the improvements to achieve the goals of the system. In many
sustainability applications, such as climate modeling and building mod-
eling, the most effective approach may involve combining mechanistic
OCR for page 57
ELEMENTS OF A COMPUTER SCIENCE RESEARCH AGENDA 57
modeling with data-driven modeling. In these applications, mechanistic
models can capture (approximately) the main behaviors of the system,
which can then be refined by data-driven modeling. Classically, models
may be developed from first principles based on the behavior laws of the
system of interest, given sufficiently complete knowledge of the design
and implementation of the system. Such approaches are reflected not just
in the instrumentation plan, but in simulation tools and analysis tech-
niques. However, for most aspects of sustainability, the system may not be
rigorously defined or carefully engineered to operate under a narrow set
of well-defined behaviors. Examples include watersheds, forests, fisher-
ies, transportation networks, power networks, and cities. New technical
opportunities for addressing the challenges presented by such systems as
well as opportunities in instrumentation and measurement are emerging,
several of which are discussed below.
Coping with Self-Defining Physical Information
Rather than simply drawing its semantics and interpretation from
its embedding in a particular system, each physical information service
could be used for a variety of purposes outside the context of a particu-
lar system and hence should have an unambiguous meaning. The most
basic part of this problem is the conversion from readings to physical
units and the associated calibration coefficients and correction function.2
The much more significant part of the problem is capturing the context of
the observation that determines its meaning.3 For example, in a building
environment, supply air, return air, chilled water supply, chilled water
return, outside air, mixing valve inputs, economizer points, zone set point,
guard band, compressor oil, and refrigerated measurement all have physi-
cal units of temperature, but these measurements all have completely
2These aspects have been examined and partially solved over the years with electronic
data sheets, such as the IEEE [Institute of Electrical and Electronics Engineers] P1451 family,
ISA [Instrumentation Systems and Automation Society] 104 Electronic Device Description
Language, or Open Geospatial Consortium Sensor Model Language (SensorML). However,
many variations exist within distinct industrial segments and scientific disciplines; the stan-
dards tend to be very complex, and adoption is far from universal.
3One example of this problem is a stream-water temperature sensor that is normally sub-
merged but under low-water conditions becomes an air-temperature sensor instead. How
should this contextual change in semantics be captured? One possibility might be a subse-
quent data-cleaning step that determines in what “mode” the sensor-context combination
was (in this case, perhaps by using a stream-flow sensor or by correlating with a nearby air-
temperature sensor). Another example is a soil-moisture sensor whose accuracy can increase
with time when more is known about the soil composition—the parameterized equations
used by the sensors can be tuned to the soil-type details.
OCR for page 58
58 COMPUTING RESEARCH FOR SUSTAINABILITY
different meanings. The same applies to the collection of measurements
across many scientific experiments. Typically, contextual factors are cap-
tured on an ad hoc basis in naming conventions for the sense points, the
presentation screens for operations consoles, or the labels in data-analysis
reports.
The straightforward application of known techniques can be employed
to collect the diverse instrumentation sources and deposit readings into a
database for a specific setting or experiment. Similarly, electronic records
can be made of the contextual information to permit an analysis of the
data. The collection, storage, and query-processing infrastructure can be
made to scale arbitrarily; processes can be run to validate data integrity
and to ensure availability; and visualization tools can be introduced to
guide various stakeholders.
To provide these capabilities in general rather than as a result of a
design and engineering process for each specific domain or setting, how-
ever, requires either significant innovation in the techniques deployed
or the development of new techniques. There are, for instance, well-
developed techniques for defining the meaning, context, and interpre-
tation of information directly affected by human actions, where these
aspects are inherently related to the generation process.4 To cope with
many large-scale sustainability challenges, similar capabilities need to be
developed for physical or non-human-generated information.
Closely related to this definitional problem is the family of problems
related to registration, lookup, classification, and taxonomy, much as for
human-generated information, as one moves from physical documents to
interconnected electronic representations. When an application or system
is to be constructed on the basis of a certain body of physical information,
how is the set of information services discovered? How are they named?
If such information is to be stored and retrieved, how should it be classi-
fied? If physical information is to be accessed through means outside such
classifications, how is it to be searched? Keyword search can potentially
apply to the metadata that capture context, type, and role, but what about
features of the data stream itself?
Today one addresses these problems by implicitly relying on the
enclosing system for which the instrumentation is collected. As physi-
cal information is applied more generally, it becomes necessary to rep-
resent the model of the enclosing system explicitly if it is to be used to
4For example, the inventory of products in a retail outlet is quite diverse, but schemas are
in place to capture the taxonomy of possible items, locations in the supply chain or in the
store, prices, suppliers, and other information. Actions of ordering, shipping, stocking, sell-
ing, and so on cause specific changes to be made in the inventory database.
OCR for page 59
ELEMENTS OF A COMPUTER SCIENCE RESEARCH AGENDA 59
give meaning to the physical instrumentation. However, general model
description languages and the like are still in their infancy.
The Design and Capacity Planning of Physical Information Services
Once the physical deployment of the instrumentation capability is
decoupled from the design and implementation of the enclosing system,
many new research questions arise. Each consumer of physical informa-
tion may require that information at different timescales and levels of
resolution. Furthermore, the necessary level of resolution can change
dynamically depending on the purpose of the measurements. In prin-
ciple, one could measure everything at the finest possible resolution, but
this is rarely practical because of limitations in power consumption, local
memory, processing capacity, and network bandwidth. What is needed for
many sustainability-related challenges is a distributed system by which
information needs can be routed to relevant sensors—for the purposes of
this discussion, comparatively high bandwidth sensors are meant—and
those sensors can then modulate their sampling rates and resolution as
necessary.5
Recent advances in compressed sensing (to help conserve bandwidth
and power) and network coding (to take advantage of network topolo-
gies for increasing throughput) add to the complexity of such a distrib-
uted system. One can imagine tools that take as input a collection of
information consumers, a set of available sensors, and an understood
network topology and produce as output a set of sensing and routing
procedures that incorporate compressed sensing and network coding.
However, this perspective assumes that the locations of the sensors and
the network topology are already known. In virtually all practical situ-
ations, determining the number, location, and capabilities of individual
sensors is an important design step. To support these design decisions,
algorithms are needed for sensor placement and sizing. These algorithms
require models of the phenomena being measured and of the information
needs of each consumer. How will such models be provided and in what
representation?
As mentioned above, system architecture has traditionally been orga-
nized as a cycle: plant-sensor-controller-actuator-plant. In this model, sen-
sor readings are centralized and aggregated to produce an estimate of the
5Consider, for example, a thermometer in a freshwater stream. For purposes of hydro-
logical analysis, it might suffice to measure only the daily maximum and minimum tem-
peratures and report them once per week. But suppose that one seeks to detect sudden
temperature changes that might indicate the dumping of industrial wastewater. Then the
thermometer may need to measure and report temperatures at 15-second intervals.
OCR for page 60
60 COMPUTING RESEARCH FOR SUSTAINABILITY
state of the plant. The controller then determines the appropriate control
decisions, which are transmitted to the actuators. However, as the “plant”
becomes a large, spatially distributed system (e.g., a city, a power grid, an
ecosystem) and the volume of data becomes overwhelming, it is no longer
feasible to integrate centrally the state estimation and decision making.
A recent International Data Corporation study6 suggests that there will
be more than 35 zettabytes of data stored in 2020. Distributed algorithms
are needed that can push as much computation and decision making
as possible out to the sensors and actuators so that a smaller amount of
data needs to be moved and stored. At the same time, these algorithms
will need to avoid losing the advantages of data integration (reduction of
uncertainty and improved forecasting and decision making).
Finally, tools are needed to support the planning and design process.
These tools need to provide the designer with feedback on such things as
the marginal benefit of additional sensing and additional network links,
the robustness of the design to future information needs, and so forth. In
summary, all aspects of capacity planning present in highly engineered
systems, such as data centers and massive Internet services, arise in the
context of the physical information service infrastructure.
Software Stacks for Physical Infrastructures
Potential solutions to the problems delineated above suggest that
sophisticated model-driven predictive control and integrated cross-system
optimization will become commonplace rather than rare. As discussed in
Chapter 1, on the electric grid today, the independent service operator
attempts to predict future demand and to schedule supply and transmis-
sion resources to meet it, with possibly coarse-grained time-of-use rates
or, in rare cases, critical peak notification to influence the demand shape.
In the future, environmental control systems for buildings may be able
to adapt to the availability of non-dispatchable renewable supplies on
the grid, using the thermal storage inherent in a building to “green” the
electricity blend and ease the demand profile. Distributed generation and
storage may become more common in such a cooperative grid. Various
analysis, forecasting, and planning algorithms may operate over the phys-
ical information and human-generated information associated with the
grid, the building, the retail facility, the manufacturing plant. It becomes
important to ask what the execution environment is for such control algo-
rithms and analytical applications. What are the convenient abstractions
6International Data Corporation, “The 2011 Digital Universe Study: Extracting Value from
Chaos” (June 2011), available at http://www.emc.com/collateral/demos/microsites/emc-
digital-universe-2011/index.htm.
OCR for page 61
ELEMENTS OF A COMPUTER SCIENCE RESEARCH AGENDA 61
of physical resources that ease the development of such algorithms, and
how is access to shared-resource protected and managed? In effect, what
is the operating system of the building, of the grid, of the plant, of the
fleet, of the watershed? Today these operating systems are rudimentary
and consist of ad hoc ensembles of mostly proprietary enterprise resource
planning systems, building management systems, databases, communica-
tion structures, operations manuals, and manual procedures. An impor-
tant challenge for computer science research is to develop the systems and
design tools that can support effective and flexible management of these
complex systems.
INFORMATION-INTENSIVE SYSTEMS
Sustainability problems raise many research questions for informa-
tion-intensive systems because of the nature of the data sources and
the sheer amount of data that will be generated.7 Computer science has
applied itself broadly to problems related to discrete forms of human-
generated information, including transaction processing, communica-
tions, design simulation, scheduling, logistics tracking and optimization,
document analysis, financial modeling, and social network structure.
Many of these processes result in vast bodies of information, not just from
explicit data entry but through implicit information collection as goods
and services move through various aspects of the supply chain or as a
result of analyses performed on such underlying data. The proliferation
of mobile computing devices adds not just new quantities of data, but
new kinds of data as well. Some data-intensive processes are extremely
high bandwidth event streams, such as clickstreams from millions of web
users. In addition, computer science is widely applied to discretized forms
of continuous processes, including computational science simulation and
modeling, multimedia, and human-computer interfaces. In both regimes,
substantial data mining, inference, and machine learning are employed
to extract specific insights from a vast body of often low-grade, partially
related information.
All of the techniques described above can and must be applied to
problems associated with sustainability. Nonetheless, several aspects of
sustainability, even in addition to the vast quantities of data that will
7Given the vast amounts of data expected to be generated in the near future, traditional ap-
proaches to managing such amounts of data will not themselves be sustainable. Bandwidth
will become a significant barrier, meaning that approaches to computation (such as moving
computational resources to the data, or computing on data in real time and then discarding
them, or other new techniques), different from simply storing the data and computing on
them when necessary, will need to become more widespread.
OCR for page 75
ELEMENTS OF A COMPUTER SCIENCE RESEARCH AGENDA 75
ing to some underlying statistical model. Unfortunately, experience with
ecological modeling and environmental policy suggests that there are
many “unknown unknowns”—phenomena that are unknown to the
modelers and decision makers and therefore not accounted for in the
models.19 One possible safeguard is robust optimization.20 Rather than
treating model parameters as known, this approach assumes instead that
the parameters lie within some uncertainty set and optimizes against
the worst-case realization within these sets. The size of these uncertainty
sets can be varied to measure the loss in the objective function that must
be sustained in order to achieve a given degree of robustness. Robust-
optimization approaches can greatly improve the ability to sustain signifi-
cant departures from conditions in the nominal model. Existing robust-
optimization methods generally assume that the decision model can be
expressed as an optimization problem with a convex structure (e.g., linear
or quadratic programs). Robust optimization is sometimes considered
overly conservative. Convex constraints over multiple uncertainty sets
can be introduced to rule out simultaneous extreme events and reduce
the over-conservatism of first-generation robust-optimization methods.21
An open theoretical question is that of determining the best ways to use
data in optimization problems. In some problems in which there are insuf-
ficient data, the question becomes one of how to properly incorporate
subjective opinion about the data and what the best way is to characterize
uncertainty. Another research challenge is to develop robust-optimization
methods that are applicable to the kinds of complex nonlinear models that
arise in sustainability applications.
Optimal Sequential Decision Making
Most sustainability challenges will not be addressed by a decision
made at a single point in time. Instead, decisions must be made iteratively
over a long time horizon since a system is not sustainable unless it can be
operated indefinitely into the future. For example, in problems involving
natural resource management, every year provides a decision-making
opportunity. In fisheries, the annual allowable catch for each species must
be determined. In forests, the location and method for tree harvesting
19D.F. Doak et al., Understanding and predicting ecological dynamics: Are major surprises
inevitable? Ecology 89(4):952-961 (2008).
20A. Ben-Tal, L. El Ghaoui, and A. Nemirovski, Robust Optimization, Princeton, N.J.: Princ-
eton University Press (2009).
21D. Bertsimas and A. Thiele, Robust and data-driven optimization: Modern decision-
making under uncertainty, INFORMS Tutorials in Operations Research: Models, Methods, and
Applications for Innovative Decision Making, pp. 1-39 (2006).
OCR for page 76
76 COMPUTING RESEARCH FOR SUSTAINABILITY
must be specified, as well as other actions such as mechanical thinning to
reduce fire risk. In energy generation and distribution, the location of new
generation facilities and transmission lines must be chosen. In managing
global climate change, the amount of required reduction in greenhouse
gas emissions each year must be determined. The state of the art for solv-
ing sequential decision problems is to formalize them as Markov deci-
sion problems and solve them by means of stochastic dynamic program-
ming. However, an exact solution through these methods is only feasible
for processes whose state space is relatively small (tens of thousands of
states). Recently, approximate dynamic programming methods have been
developed in the fields of machine learning and operations research. 22
These methods typically employ linear function approximation methods
to provide a compact representation of the quantities required for stochas-
tic dynamic programming.
An important aspect of sustainability problems is that they often
involve optimization over time and space. For example, consider the
problem of designing biological reserves to protect threatened and endan-
gered species and ecosystems. Many conflicting factors operate in this
problem. Large, contiguous reserves tend to protect many species and
preserve biodiversity. However, such reserves are also more vulnerable
to spatially autocorrelated threats such as fire, disease, invasive species,
and climate change. The optimal design may thus involve a collection of
smaller reserves that lie along environmental gradients (elevation, pre-
cipitation, etc.). The purchase or preservation of land for reserves costs
money, and so a good design should also minimize cost. Another factor
is that reserves typically cannot be designed and purchased in a single
year. Instead, money becomes available (through government budgets
and private donations) and parcels are offered for sale over a period of
many years. Finally, the scientific understanding and the effectiveness of
previous land purchase decisions can be reassessed each year, and that
should be taken into account when making decisions.
The solution of large spatiotemporal sequential decision problems
such as those described above is far beyond the state of the art. Striking
the right balance between complexity and accuracy, especially in the con-
text of complex networked systems, is critical. New research is needed to
develop methods that can capture the spatial structure of the state each
year and the spatial transitions (e.g., fire, disease) that occur. There are
sustainability problems in which all three of these factors—uncertainty,
robustness, and sequential decision making—combine. For example, in
22W. Powell, Approximate Dynamic Programming: Solving the Curses of Dimensionality (2nd
Ed.), New York, N.Y.: Wiley (2011).
OCR for page 77
ELEMENTS OF A COMPUTER SCIENCE RESEARCH AGENDA 77
reserve design, models of suitable habitat for threatened and endan-
gered species are required. These are typically constructed by means
of machine learning methods and hence are inherently uncertain. This
uncertainty needs to be captured and incorporated into the sequential
decision-making process. Finally, existing stochastic dynamic program-
ming methods are designed to maximize expected utility. These methods
need to be extended in order to apply robust-optimization methods. A
research opportunity is to integrate the training of the machine learning
models—which can itself be formulated as a robust-optimization prob-
lem—with the robust optimization of the sequential decision problem.
This integration would allow the machine learning methods to tailor their
predictive accuracy to those regions of time and space that are of greatest
importance to the optimization process and could lead to large improve-
ments in the quality of the resulting decisions.
Formulating problems in terms of sequential decision making can
sometimes make the problems more tractable. For example, Roe and
Baker23 show that structure inherent in the sensitivity of the climate sys-
tem makes it extremely difficult to reduce the uncertainties in the esti-
mates of global warming. However, by formulating the problem as a
sequential decision-making problem, Allen and Frame24 show that it is
possible to control global warming adaptively without ever precisely
determining the level of climate sensitivity.
HUMAN-CENTERED SYSTEMS
It is critical, for real-world applicability, to situate technology inno-
vation and practice within the context-specific needs of the people ben-
efiting from or otherwise affected by that technology. For example, in
the context of introducing intelligence into the electric grid in order to
increase sustainability, the essential measures and relevant information
are very different when considered from the differing perspectives of the
utility, supplier, and customer. The utility may be interested in introduc-
ing payment schedules that influence customer behavior in a manner that
reduces the need to build plants that run for only a tiny fraction of the
time (to serve just the diminishing tail of the demand curve). Avoiding
such construction does reduce overall GHG emissions, but the primary
goal is to avoid capital investment. Trimming the peak does little to reduce
overall energy use, but it reduces the use of the most costly supplies. A
consumer-centric perspective is likely to focus on overall energy savings
23G.H. Roe and M.B. Baker, Why is climate sensitivity so unpredictable? Science 318
(5850):629-632 (2007)
24M.R. Allen and D.J. Frame, Call off the quest,” Science 318(5850):582-583 (2007).
OCR for page 78
78 COMPUTING RESEARCH FOR SUSTAINABILITY
and efficiency measures, not just on critical-peak usage. Thus, greater
emphasis may be placed on visualizing usage, understanding demand,
reducing waste, curbing energy consumption and less important usage,
and (if there is dynamic variable pricing) helping to move easily resched-
uled uses (e.g., water heating) to off-peak times. A grid-centric perspec-
tive, by contrast, may focus on the matching of supply and demand, as
well as on the utilization of the key bottlenecks in the transmission and
distribution infrastructure. All of these stakeholders need to be consid-
ered, and ideally involved, to substantially increase the penetration limit
of non-dispatchable renewable supplies, because of the need to match
consumption to supply. And, all stakeholders have substantial needs
for monitoring usage data, determining causal relationships between
activities and usage, and managing those activities to optimize usage. In
addition to the needs and values of these direct stakeholders in the tech-
nology, the indirect stakeholders should also be considered—that is, those
who are affected by the technology but do not use it. In the smart grid
example, the set of indirect stakeholders is broad indeed, since everyone
is (for example) affected by climate change. The ability to understand
such needs and to guide the development of technology on that basis
constitutes a natural application of techniques developed in the area of
human-computer interaction (HCI).
More generally, a human-centered approach can and should be inte-
grated with each of the topics discussed above. Issues such as human-in-
the-loop training of machine learning systems, the interpretability of model
results, and the possible use (or abuse) of large volumes of sensed data
become particularly salient with a human-centered viewpoint. Indeed,
with the vast quantities of data to be generated and used as described
earlier, privacy becomes a first-order concern. The role of computer sci-
ence in sustainability is predicated on the ability to capture and analyze
data at a scale without precedent. The understanding and mitigating of
privacy implications constitute an area in which fundamental CS research
can play a role—in both formalizing the questions in an appropriate way
(and indeed this is research well underway) and potentially in providing
solutions that can help mitigate the loss of privacy that is, to some extent,
inherent in taking full advantage of the power of information-gathering
at a global scale with high resolution. It is essential that a human-centered
approach be integrated with more traditional security approaches: not
only should the techniques for preserving privacy be technically sound,
but they should also be accessible, understandable, and convincing to the
users of these systems.
Historically, much of the research on sustainability in HCI has focused
on individual change. Perhaps one of the best-recognized examples is eco-
OCR for page 79
ELEMENTS OF A COMPUTER SCIENCE RESEARCH AGENDA 79
feedback technology, which leverages persuasive interface techniques 25
and focuses primarily on residential settings. Reduced individual use
can socialize people to the issues at hand, and can, at scale, have a direct
if limited effect on overall energy use.26 However, population growth
alone may outstrip the gains realized by such approaches. In response,
the committee notes the importance of significantly increased attention to
social, institutional, governmental, and policy issues in addition to issues
of individual change. A challenging public policy question, for example,
is how to verify compliance with GHG emissions requirements. Reliably
validated carbon reductions, for instance, are important not just to global
progress; they would be also invaluable for guiding sustainability efforts
at a macro level.27
This report emphasizes opportunities for research, in addition to the
data and privacy challenges mentioned earlier, on human-centered sys-
tems both at the individual level and beyond (at the organizational and
societal levels). Examples of such research areas include visualization and
user-interaction design for comprehensibility, transparency, legitimation,
deliberation, and participation; devices and dashboards for individu-
als and institutions; expanding the understanding of human behaviors,
empowering people to measure, argue for, and change what is happening;
and education. Following are brief discussions of each of these.
Supporting Deliberation, Civic Engagement,
Education, and Community Action
As noted in Chapter 1, moving toward a more sustainable society will
require massive cultural, social, political, and economic changes—and
today’s technologies are deeply intertwined with many of these changes.
Technology can help to support an informed and engaged citizenry. Cur-
rently, civic engagement is uneven at best, and thoughtful public delibera-
tion about major issues is often challenging to accomplish. However, the
ease of information access, the existence of community-based knowledge
25For example, see J. Froehlich, L. Findlater, and J. Landay, The design of eco-feedback
technology, in Proceedings of the 28th International Conference on Human Factors in Computing
Systems, New York, N.Y.: Association for Computing Machinery, pp. 1998-2008 (2010).
26For a provocative essay on this issue, see P. Dourish, Print This Paper, Kill a Tree: Environ-
mental Sustainability as a Research Topic for HCI, LUCI-2009-004, Laboratory for Ubiquitous
Computing and Interaction, University of California, Irvine (2009), and a related article: P.
Dourish, HCI and environmental sustainability: The politics of design and the design of
politics, in Proceedings of the 2010 ACM Conference on Designing Interactive Systems, Aarhus,
Denmark, pp. 1-10 (2010).
27See National Research Council, Verifying Greenhouse Gas Emissions: Methods to Support
International Climate Agreements, Washington, D.C.: The National Academies Press (2010).
OCR for page 80
80 COMPUTING RESEARCH FOR SUSTAINABILITY
repositories, and the search and social networking capabilities online
are transforming the manner in which humans learn, make decisions,
and interact. These techniques can be adapted for a research program on
designing, deploying, and testing innovative ways for citizens to deliber-
ate and to engage with government and one another, particularly with
those who may hold very different views in the context of sustainability.
These deliberations should be closely coupled to data (gathered both by
professional scientists and citizen-scientists) and simulation results—affor-
dances should be provided to help ground the discussion in the scientific
evidence. Similarly, online curricula for students in kindergarten through
grade 12 and for adults can explore, for instance, ongoing scientific and
policy discussions related to sustainability; and educational initiatives can
contribute to societal changes needed to meet sustainability goals.
In addition to opportunities with respect to tools for engaged citizens
generally, there are also promising areas of research in helping scientists
provide more effective input into these broader discussions and debates
on sustainability and potential initiatives. The intellectual merit of this
research would center on the issues of how to facilitate large-scale online
deliberation about contentious issues; the broader impacts would be in
making the results of scientific inquiry more widely seen and discussed.
As an example, suppose that there was a network supporting online
deliberation among scientists concerned with sustainability for develop-
ing key points, areas of strong consensus, areas of disagreement, and
supporting evidence. Those deliberations would produce a sustainability
action agenda that could be introduced to the public by means of interest-
ing interactive environments designed to appeal to those of all ages. These
sites could feed information by means of different media outlets (both
traditional and emerging) as well as providing interactive scenarios that
people could use to answer questions and debate solutions. One highlight
of this system would be a series of consensus news stories, perhaps on
a weekly basis. These stories could be based on agenda items created by
scientists and rated by public interest.
A core component of such a public education system could be a
forum for discussing scientific data, for voicing views on which stories
to present and when, and for suggesting how to frame them (delibera-
tive forums for the science community for building consensus positions).
A key research issue here is the development of technologies that help
organize the discussion, both for long-standing participants and for
people who are interested in entering into a long-running discussion but
could use help in understanding it and in making useful contributions.
The forum should include affordances that make it easy and natural to
classify suggestions, pro and con arguments, and so on, to keep this
type of exchange from degenerating into just a free-form discussion
OCR for page 81
ELEMENTS OF A COMPUTER SCIENCE RESEARCH AGENDA 81
board. Another important kind of affordance would be hooks for giving
sources for assertions (tools to encourage grounding arguments in the
scientific data).
Another project might be a highly visible forum for exchanges between
groups with quite divergent views in a deliberative setting. Again, the
system should include affordances that make it easy and natural to clas-
sify arguments and perspectives and tools that encourage the grounding
of arguments in the scientific data.
Basic research in educational technology is also crucial to increasing
the relevance and effectiveness of tools for a culturally and economi-
cally diverse population. Arguably, better support for deliberation and
engagement will not be enough. Supporting community action is also
essential. In recent years technology has become more and more salient
as an enabler of successful social change.28 In another example, on a local
scale, citizen sensing of environmental indicators (e.g., pollutants) has
influenced the ability of individuals to advocate for change. As the cost
of sophisticated sensors comes down, one can expect to see more and
more of them employed by end users. A citizenry that engages with and
helps to track this information is important to progress on the issues at
stake, and this engagement leads to increased education and engagement
in addition to increasing the amount of information available in crucial
areas. However, this raises fundamental research problems ranging from
the creation of these sensors to our ability to use the data effectively
despite the inherent uncertainties that arise from its production.
Design for Sustainability
Techniques developed to design for manufacturing, design for mass
customization, and user-centric design can expand on the understanding
of what it means to design for sustainability. Techniques such as ENERGY
STAR ratings for appliances and Leadership in Energy and Environmental
Design (LEED) ratings for buildings have had some success in reorienting
industry providers and consumers alike toward more sustainable prac-
tices. These efforts can be substantially informed by the measurement,
information-collection, and model-development techniques described
earlier, but can also use HCI techniques for appropriation, reuse, and end-
to-end design for technology products. This research can be expanded
to shed light on process, distribution, middleware, and other aspects of
the production and distribution of products. Technological advances can
28T. Hirsch and J. Henry, TXTmob: Text messaging for protest swarms, in Extended Ab-
stracts on Human Factors in Computing Systems, New York, N.Y.: Association for Computing
Machinery, pp. 1455-1458 (2005). DOI: http://doi.acm.org/10.1145/1056808.1056940.
OCR for page 82
82 COMPUTING RESEARCH FOR SUSTAINABILITY
also contribute to the tracking, monitoring, and analysis of the source
materials, production processes, distribution, and eventual disposal of
products. This information in turn can help to inform purchase decisions,
provide better accounting, and otherwise improve the sustainability of the
consumer economy.
Human Understanding of Sensing, Modeling, and Simulation
As the availability of sophisticated sensors, information collection,
modeling, and dissemination increase, techniques need to be developed
to provide in meaningful forms rich, highly disaggregate information to
households, small groups, and organizations regarding resource usage
(e.g., for electricity or water consumption). In addition to supporting
improved decision making about energy use at the organizational and
individual levels, this information could provide civil and environmen-
tal engineers with a picture at a new level of detail about how and why
these resources are being consumed, allowing their science and practice
to advance. At the same time, this possibility raises challenging research
questions regarding appropriate amounts of information, how to deal
with the inherent uncertainties in the data, techniques for evaluating
such systems, coupling with other systems on the supply side (e.g., the
smart grid), and important value questions regarding fairness, represen-
tativeness, security, and privacy. Better data can also drive modeling and
simulation, which can help with such activities as predicting important
trends, assessing how well proposed policies would meet objectives,
and optimizing resource use. Modeling climate change is an obvious
example, but there are many others, including a simulation of the evolu-
tion of urban areas, freight transport, and natural environments such as
forests or rivers. However, to be effective and relevant to policy making
and decision making, such modeling work must include careful consid-
eration of how it integrates with deliberation and the political process.
This raises issues of design for transparency, legitimation, appropriability,
and participation.
Tools to Help Organizations and
Individuals Engage in More Sustainable Behaviors
Another area for research concerns tools that make it easier and per-
haps even enjoyable for people to engage in more sustainable behaviors.
Some of the many examples in this area are the providing of real-time pub-
lic transit arrival and route information (particularly on mobile devices),
online ride-share matching, geowikis for bicycling, Zipcar, Freecycle, and
the like. Another class of tools provides eco-feedback: targeted informa-
OCR for page 83
ELEMENTS OF A COMPUTER SCIENCE RESEARCH AGENDA 83
tion about resource consumption (perhaps in real time), integrated with
suitable visualization techniques and appropriate persuasive technol-
ogy, for example to show progress toward personal or group goals. This
area is also related to the previous opportunity regarding the use of
information from resource-usage sensing. It is important to recognize
the limits of these technologies: better transit information is great if a
good underlying transit system exists, but it is not so useful without
that. Similarly, eco-feedback regarding energy use can be helpful, but it
does not address the more fundamental, underlying energy challenges in
some situations—such as low-income households in which comparatively
expensive upgrades would be a financial hardship, or homes that contain
inefficient appliances or poor insulation. For such challenges, alternative
solutions would be needed.
Many of the techniques described here are relevant to organizations as
well. For example, a large organization might similarly provide targeted
information about resource consumption, in real time, to show progress
toward goals for different branches of the organization.
Mitigation, Adaptation, and Disaster Response
Even under optimistic climate change scenarios, weather disasters
are likely to increase in number and severity, resulting in both the need
for immediate disaster relief and likely the need to assist large numbers
of refugees (e.g., from low-lying regions).29 Also, unfortunately, human
actions are likely to continue to contribute directly to environmental
disasters such as oil spills. There are research challenges with respect to
developing plans that can be revised rapidly under conditions of great
uncertainty, making use of vast numbers of citizen observations (such as
micro-content posted from disaster areas by individuals), coordinating
supply efforts, and others. One challenge for this line of work is recog-
nizing that there are huge uncertainties about the future and thus also
in developing tools and infrastructure that are flexible, adaptable, and
appropriate.
Using Information from Resource-Usage Sensing
Recent work has opened the possibility of providing rich, highly
disaggregate information to households, small groups, and organizations
regarding resource usage. For example, immediate feedback can now be
29NationalResearch Council, Adapting to the Impacts of Climate Change, Washington, D.C.:
The National Academies Press (2010).
OCR for page 84
84 COMPUTING RESEARCH FOR SUSTAINABILITY
provided on electrical energy use at the appliance or individual lighting
circuit level. A number of possibilities arise as a result, including detailed
eco-feedback about usage and tighter coupling with smart grid technol-
ogy on the supply side. Similar feedback is possible for other resources
such as water and natural gas.
This possibility does, however, raise a number of challenging research
questions. For example, what is the appropriate amount of information to
provide to households? Clearly there is the possibility of overwhelming
them with information. How are the inherent uncertainties in the data to
be dealt with? How are such systems to be evaluated? The traditional HCI
evaluation techniques of laboratory studies and small-scale deployments
are inadequate, but massive deployments over long periods are slow and
expensive, implying that one can only try a small number of alternatives
(in tension with the need for rapid prototyping and iteration). How can
these systems be coupled with smart grid technology on the supply side?
For example, the grid could signal to the household that the system was
close to capacity and that lowering energy use for the next hour would be
very helpful (or perhaps would result in a lower bill); or, conversely, the
household could be signaled that this would be an opportunity for some
non-time-critical activity. This arrangement would be a combination of
automated actions, with the scripts under the household’s control, and
explicit actions.
Another set of issues concerns fairness and representativeness. For
example, the majority of households in the United States are low-income
and many households rent, although most work in this area focuses on
relatively affluent homeowners. Can systems and policies be designed
that do not unfairly disadvantage some households, particularly ones
that can least afford additional charges? Another set of challenges con-
cerns security and privacy. Such systems offer the potential for reducing
resource consumption and making better use of resources, but there are
clear security and privacy risks if the system is compromised. Related
to that issue are questions of responsibility and power around available
infrastructure that must be addressed. Not everyone owns a home or pays
for energy use, and the relationships between landlords, residents, laws
(incentives, disincentives, and so on), available services (green contrac-
tors), and other factors influence energy use outcomes and may bear on
the design of technology (for example, in terms of authenticating who has
access to what data).
It is difficult to get good information about the fine-grained use of
energy right now. Buildings are not generally instrumented to produce
these data, yet a true understanding of the forces driving energy use is
impossible without better data. Better information about which appliances
are in use and when they are in use can help in developing a more complete
OCR for page 85
ELEMENTS OF A COMPUTER SCIENCE RESEARCH AGENDA 85
understanding of human behavior, and perhaps in identifing interven-
tions that can have an impact on energy use. Even a modest advance such
as analysis based on the segmentation of a building’s energy use among
HVAC, lighting, and plug load could yield useful results. Although this
may seem like a pure sensing problem, the process of deploying sensors,
labeling data, and interpreting the results involves people, and computer
science researchers are at the forefront of some of the innovations in this
area.30 Despite these advances, the problem of labeling data and interpret-
ing the results is one that requires more attention.
CONCLUSION
This chapter provides examples of important technical research areas
and outlines a broad research agenda for computer science and sus-
tainability. Although there are numerous opportunities to apply well-
understood technologies and techniques to sustainability, there are also
hard problems—such as mitigating climate change—for which current
methods offer at-best partial solutions, and rapid innovation is essential
in light of the pressing nature of the challenges. The areas highlighted in
this chapter—measurement and instrumentation; information-intensive
systems; analysis, modeling, and simulation; optimization; and human-
centered systems—are counterparts to well-established research areas in
computer science. This overlap has clear positive implications. However,
finding a way to have a significant impact may require new approaches
to these problems and almost certainly new ways of conducting and
managing research. Chapter 3 explores ways of conducting and manag-
ing research so that computer science research can have an even greater
impact on sustainability challenges.
30For example, Patel and others have developed comparatively lightweight methods to
acquire reasonably fine-grained data in homes; see J. Froehlich, E. Larson, S. Gupta, G. Cohn,
M. Reynolds, and S.N. Patel, Disaggregated end-use energy sensing for the smart grid, IEEE
Pervasive Computing, Special Issue on Smart Energy Systems, January-March (2011).