Skip to main content

Frontiers in Massive Data Analysis

View Cover

Purchase Options
Purchase Options MyNAP members save 10% online. Login or Register



Data mining of massive data sets is transforming the way we think about crisis response, marketing, entertainment, cybersecurity and national intelligence. Collections of documents, images, videos, and networks are being thought of not merely as bit strings to be stored, indexed, and retrieved, but as potential sources of discovery and knowledge, requiring sophisticated analysis techniques that go far beyond classical indexing and keyword counting, aiming to find relational and semantic interpretations of the phenomena underlying the data.

Frontiers in Massive Data Analysis examines the frontier of analyzing massive amounts of data, whether in a static database or streaming through a system. Data at that scale--terabytes and petabytes--is increasingly common in science (e.g., particle physics, remote sensing, genomics), Internet commerce, business analytics, national security, communications, and elsewhere. The tools that work to infer knowledge from data at smaller scales do not necessarily work, or work well, at such massive scale. New tools, skills, and approaches are necessary, and this report identifies many of them, plus promising research directions to explore. Frontiers in Massive Data Analysis discusses pitfalls in trying to infer knowledge from massive data, and it characterizes seven major classes of computation that are common in the analysis of massive data. Overall, this report illustrates the cross-disciplinary knowledge--from computer science, statistics, machine learning, and application disciplines--that must be brought to bear to make useful inferences from massive data.


Suggested Citation

National Research Council. 2013. Frontiers in Massive Data Analysis. Washington, DC: The National Academies Press.

Import this citation to:

Publication Info

190 pages | 6 x 9
  • Paperback: 978-0-309-28778-4
  • Ebook: 978-0-309-28781-4



Scott Weidman, director of the Board on Mathematical Science and their Applications at the NRC, explains the charge and key recommendation of the report along with the challenges and opportunties the Massive Data presents.


Copyright Information

The National Academies Press and the Transportation Research Board have partnered with Copyright Clearance Center to offer a variety of options for reusing our content. You may request permission to:

  • Republish or display in another publication, presentation, or other media
  • Use in print or electronic course materials and dissertations
  • Share electronically via secure intranet or extranet
  • And more

For most Academic and Educational uses no royalties will be charged although you are required to obtain a license and comply with the license terms and conditions.

Click here to obtain permission for Frontiers in Massive Data Analysis.

Translation and Other Rights

For information on how to request permission to translate our work and for any other rights related query please click here. Customer Service

For questions about using the service, please contact:

Copyright Clearance Center
22 Rosewood Drive
Danvers, MA 01923
Tel (toll free): 855/239-3415 (select option 1)

Loading stats for Frontiers in Massive Data Analysis...