A
Workshop Agenda

NATIONAL ACADEMIES

WASHINGTON, D.C.


Wednesday, August 19, 2009

8:00 am

Continental breakfast available

8:30

Chair’s opening remarks

Michael Stonebraker, MIT

Data Integration Stretch Goals, Technical Needs, and Policy Issues—Views From Various Domains

9:00

Geospatial data

Keith Clarke, University of California, Santa Barbara

9:20

Life sciences

Carl Kesselman, University of Southern California

9:40

Physics

Tim Frazier, Lawrence Livermore National Laboratory

10:00

Astronomy

Alex Szalay, Johns Hopkins University



The National Academies | 500 Fifth St. N.W. | Washington, D.C. 20001
Copyright © National Academy of Sciences. All rights reserved.
Terms of Use and Privacy Statement



Below are the first 10 and last 10 pages of uncorrected machine-read text (when available) of this chapter, followed by the top 30 algorithmically extracted key phrases from the chapter as a whole.
Intended to provide our own search engines and external engines with highly rich, chapter-representative searchable text on the opening pages of each chapter. Because it is UNCORRECTED material, please consider the following text as a useful but insufficient proxy for the authoritative book pages.

Do not use for reproduction, copying, pasting, or reading; exclusively for search engines.

OCR for page 43
A Workshop Agenda NATIONAL ACADEMIES WASHINgTON, D.C. Wednesday, August 19, 2009 8:00 am Continental breakfast available 8:30 Chair’s opening remarks Michael Stonebraker, MIT Data Integration Stretch Goals, Technical Needs, and Policy Issues—Views From Various Domains 9:00 Geospatial data Keith Clarke, University of California, Santa Barbara 9:20 Life sciences Carl Kesselman, University of Southern California 9:40 Physics Tim Frazier, Lawrence Livermore National Laboratory 10:00 Astronomy Alex Szalay, Johns Hopkins University 

OCR for page 43
 STEPS TOWARD LARGE-SCALE DATA INTEGRATION IN THE SCIENCES 10:20 Earth sciences Tom Karl, National Oceanic and Atmospheric Administration 10:40 Research libraries Clifford Lynch, Coalition for Networked Information 11:00 Break 11:20 Open discussion Working Lunch: Agency Perspecties Noon 10-15 minutes apiece from sponsors and other agencies James St. Pierre, National Institute of Standards and Technology Michael Marron, National Institutes of Health Ed Seidel, National Science Foundation State of the Art in Data Integration—Structured Data 1:20 pm Data federations Laura Haas, IBM Almaden Research Center 1:40 Data type conversion and ETL technology Lee Scheffler, IBM Information Integration Solutions 2:00 Automatic conversion Michael Siegel, MIT 2:20 The SciDB approach David Maier, Portland State University 2:40 Linked open data Orri Erling, OpenLink 3:00 Microsoft approach to data conversion Phil Bernstein, Microsoft 3:20 Break

OCR for page 43
 APPENDIX A State of the Art Data Integration Solutions—Semistructured Data 3:40 Google Approach Alon Halevy, Google 4:00 Yahoo! Approach Raghu Ramakrishnan, Yahoo! 4:20 MapReduce/Hive/Pig paradigm Amr Awadallah, Cloudera Policy Perspecties 4:40 Policy perspective Michael Nelson, Georgetown University 5:00 Policy perspective Christopher Greer, National Coordination Office for Networking and InformationTechnology R&D 5:20 Lessons from a large-scale information integration ecosystem Michael Brodie, Verizon 5:40 Business perspective Josephine Cheng, IBM Almaden Research Center 6:00 Develop organization for Day Two 6:30 Working dinner Thursday, August 20, 2009 Open Brainstorming 8:30 am Reflections on Day One 9:00 Identify three areas for detailed and moderated discussion Topic Area One 9:15 Open discussion 10:00 Break

OCR for page 43
 STEPS TOWARD LARGE-SCALE DATA INTEGRATION IN THE SCIENCES Topic Area Two 10:30 Open discussion Topic Area Three 11:15 Open discussion Noon Adjourn public workshop 12:30 pm Planning committee lunch/executive session 3:00 Planning committee adjourns