The National Academies | 500 Fifth St. N.W. | Washington, D.C. 20001
Copyright © National Academy of Sciences. All rights reserved.
Terms of Use and Privacy Statement



Below are the first 10 and last 10 pages of uncorrected machine-read text (when available) of this chapter, followed by the top 30 algorithmically extracted key phrases from the chapter as a whole.
Intended to provide our own search engines and external engines with highly rich, chapter-representative searchable text on the opening pages of each chapter. Because it is UNCORRECTED material, please consider the following text as a useful but insufficient proxy for the authoritative book pages.

Do not use for reproduction, copying, pasting, or reading; exclusively for search engines.

OCR for page 1
Statistical Analysis of Massive Data Streams: Proceedings of a Workshop Committee on Applied and Theoretical Statistics STATISTICAL ANALYSIS OF MASSIVE DATA STREAMS National Research Council Washington, D.C. December 13 and 14, 2002     December 13         Welcome and Overview of Sessions         Sallie Keller-McNulty, Los Alamos National Laboratory Chair, Committee on Applied and Theoretical Statistics         James Schatz, National Security Agency         Session 1. Atmospheric and Meteorological Data         Douglas Nychka, Session Chair, National Center for Atmospheric Research Introduction         John Bates, National Climatic Data Center Exploratory Climate Analysis Tools for Environmental Satellite and Weather Radar Data         Amy Braverman, Jet Propulsion Laboratory Statistical Challenges in the Production and Analysis of Remote Sensing Earth Science Data at the Jet Propulsion Laboratory         Ralph Milliff, Colorado Research Associates Global and Regional Surface Wind Field Inferences from Spaceborne Scatterometer Data         Report from Breakout Group    

OCR for page 1
Statistical Analysis of Massive Data Streams: Proceedings of a Workshop     Session 2. High-Energy Physics         David Scott, Session Chair, Rice University Introduction         Robert Jacobsen, Lawrence Berkeley National Laboratory Statistical Analysis of High Energy Physics Data         Paul Padley, Rice University Some Challenges in Experimental Particle Physics Data Streams         Miron Livny, University of Wisconsin-Madison Data Grids (or A Distributed Computing View of High Energy Physics)         Report from Breakout Group         Luncheon Keynote Address         Daryl Pregibon, AT&T Shannon Research Laboratories Keynote Address: Graph Mining—Discovery in Large Networks         Session 3. Integrated Data Systems         Sallie Keller-McNulty, Session Chair, Los Alamos National Laboratory Introduction         J.Douglas Reason, Los Alamos National Laboratory Global Situational Awareness         Kevin Vixie, Los Alamos National Laboratory Incorporating Invariants in Mahalanobis Distance-Based Classifiers: Applications to Face Recognition         John Elder, Elder Research Ensembles of Models: Simplicity (of Function) Through Complexity (of Form)         Report from Breakout Group         After-Dinner Address         Mark Hansen, Bell Laboratories Announcement from the Whitney Museum of American Art Untitled Presentation    

OCR for page 1
Statistical Analysis of Massive Data Streams: Proceedings of a Workshop     December 14         Session 4. Network Traffic         Wendy Martinez, Session Chair, Office of Naval Research Introduction         William Cleveland, Bell Laboratories FSD Models for Open-Loop Generation of Internet Packet Traffic         Johannes Gehrke, Cornell University Processing Aggregate Queries over Continuous Data Streams         Edward Wegman, George Mason University Visualization of Internet Packet Headers         Paul Whitney, Pacific Northwest National Laboratory Toward the Routine Analysis of Moderate to Large-Size Data         Report from Breakout Group         Session 5. Mining Commercial Streams of Data         Leland Wilkinson, Session Chair, SPSS, Inc. Introduction         Lee Rhodes, Hewlett-Packard Laboratories A Stream Processor for Extracting Usage Intelligence from High-Momentum Internet Data         Pedro Domingos, University of Washington A General Framework for Mining Massive Data Streams         Andrew Moore, Carnegie Mellon University Kd-R-Ball-and Ad-Trees: Scalable Massive Science Data Analysis         Report from Breakout Group