Below are the first 10 and last 10 pages of uncorrected machine-read text (when available) of this chapter, followed by the top 30 algorithmically extracted key phrases from the chapter as a whole.
Intended to provide our own search engines and external engines with highly rich, chapter-representative searchable text on the opening pages of each chapter.
Because it is UNCORRECTED material, please consider the following text as a useful but insufficient proxy for the authoritative book pages.
Do not use for reproduction, copying, pasting, or reading; exclusively for search engines.
OCR for page 1
Statistical Analysis of Massive Data Streams: Proceedings of a Workshop
Committee on Applied and Theoretical Statistics
STATISTICAL ANALYSIS OF MASSIVE DATA STREAMS
National Research Council
Washington, D.C.
December 13 and 14, 2002
December 13
Welcome and Overview of Sessions
Sallie Keller-McNulty, Los Alamos National Laboratory
Chair, Committee on Applied and Theoretical Statistics
James Schatz, National Security Agency
Session 1. Atmospheric and Meteorological Data
Douglas Nychka, Session Chair, National Center for Atmospheric Research
Introduction
John Bates, National Climatic Data Center
Exploratory Climate Analysis Tools for Environmental Satellite and Weather Radar Data
Amy Braverman, Jet Propulsion Laboratory
Statistical Challenges in the Production and Analysis of Remote Sensing Earth Science Data at the Jet Propulsion Laboratory
Ralph Milliff, Colorado Research Associates
Global and Regional Surface Wind Field Inferences from Spaceborne Scatterometer Data
Report from Breakout Group
OCR for page 2
Statistical Analysis of Massive Data Streams: Proceedings of a Workshop
Session 2. High-Energy Physics
David Scott, Session Chair, Rice University
Introduction
Robert Jacobsen, Lawrence Berkeley National Laboratory
Statistical Analysis of High Energy Physics Data
Paul Padley, Rice University
Some Challenges in Experimental Particle Physics Data Streams
Miron Livny, University of Wisconsin-Madison
Data Grids (or A Distributed Computing View of High Energy Physics)
Report from Breakout Group
Luncheon Keynote Address
Daryl Pregibon, AT&T Shannon Research Laboratories
Keynote Address: Graph Mining—Discovery in Large Networks
Session 3. Integrated Data Systems
Sallie Keller-McNulty, Session Chair, Los Alamos National Laboratory
Introduction
J.Douglas Reason, Los Alamos National Laboratory
Global Situational Awareness
Kevin Vixie, Los Alamos National Laboratory
Incorporating Invariants in Mahalanobis Distance-Based Classifiers: Applications to Face Recognition
John Elder, Elder Research
Ensembles of Models: Simplicity (of Function) Through Complexity (of Form)
Report from Breakout Group
After-Dinner Address
Mark Hansen, Bell Laboratories
Announcement from the Whitney Museum of American Art Untitled Presentation
OCR for page 3
Statistical Analysis of Massive Data Streams: Proceedings of a Workshop
December 14
Session 4. Network Traffic
Wendy Martinez, Session Chair, Office of Naval Research
Introduction
William Cleveland, Bell Laboratories
FSD Models for Open-Loop Generation of Internet Packet Traffic
Johannes Gehrke, Cornell University
Processing Aggregate Queries over Continuous Data Streams
Edward Wegman, George Mason University
Visualization of Internet Packet Headers
Paul Whitney, Pacific Northwest National Laboratory
Toward the Routine Analysis of Moderate to Large-Size Data
Report from Breakout Group
Session 5. Mining Commercial Streams of Data
Leland Wilkinson, Session Chair, SPSS, Inc.
Introduction
Lee Rhodes, Hewlett-Packard Laboratories
A Stream Processor for Extracting Usage Intelligence from High-Momentum Internet Data
Pedro Domingos, University of Washington
A General Framework for Mining Massive Data Streams
Andrew Moore, Carnegie Mellon University
Kd-R-Ball-and Ad-Trees: Scalable Massive Science Data Analysis
Report from Breakout Group
Representative terms from entire chapter:
massive data