Skip to main content

Currently Skimming:


Pages 16-38

The Chapter Skim interface presents what we've algorithmically identified as the most significant single chunk of text within every page in the chapter.
Select key terms on the right to highlight them within pages of the chapter.


From page 16...
... 16 By 1996, digital storage became a more cost-effective option for storing data than paper (SB 2016)
From page 17...
... State of the Practice of Big Data 17 Big Data datasets often are characterized using five attributes, referred to as the "five Vs": volume, variety, velocity, veracity, and value. 3.1.1 Volume Volume characterizes the main aspect of a Big Data dataset.
From page 18...
... 18 Leveraging Big Data to Improve Traffic Incident Management Emergency Medical Services Information System (NEMSIS) -- consisting of 30.2 million records from 15,000 EMS agencies -- makes up about 40 gigabytes (GB)
From page 19...
... State of the Practice of Big Data 19 rapidly changing Big Data dataset that needs to be processed quickly to catch suspicious transactions and deny payments. 3.1.4 Veracity Veracity refers to the trustworthiness of the data in Big Data datasets.
From page 20...
... 20 Leveraging Big Data to Improve Traffic Incident Management corrective or augmentative action(s)
From page 21...
... State of the Practice of Big Data 21 3. The tool had to be capable of handling component failure (e.g., failure of the CPU, or memory, or of the network)
From page 22...
... 22 Leveraging Big Data to Improve Traffic Incident Management programming model. They can scale up from single servers to thousands of machines, each offering local computation and storage.
From page 23...
... State of the Practice of Big Data 23 work with random access memory (RAM) , solid-state drives, or disk drives.
From page 24...
... 24 Leveraging Big Data to Improve Traffic Incident Management to deliver high availability and performance to NoSQL databases without sacrificing the robust consistency requirements and transaction capabilities found in relational databases. NewSQL databases also support the standard relational database language, SQL, to access and modify their data.
From page 25...
... State of the Practice of Big Data 25 has already been organized into an easily manageable format that facilitates sorting, merging, aggregating, and calculating. It should be noted that a traditional data warehouse -- which is a complex analytical system composed of one or more relational databases -- is not the same as a Big Data store.
From page 26...
... 26 Leveraging Big Data to Improve Traffic Incident Management is composed of a distributed storage system where data is archived indefinitely. A server cluster processes the stored data using various Big Data frameworks and databases to allow users to explore and query the data, create visualizations and dashboards, classify data, identify patterns or trends, and create rules or predictive models.
From page 27...
... State of the Practice of Big Data 27 and use of Big Data analytics arise from the absence of theory to drive the analytics and critical judgment in interpreting the analytics. These shortcomings are of particular concern for evolving social systems.
From page 28...
... 28 Leveraging Big Data to Improve Traffic Incident Management of clusters in which each observation belongs to the cluster with the nearest mean. Examples of software programs capable of performing clustering analysis include Apache Mahout, Apache Spark, and Revolution R Enterprise.
From page 29...
... State of the Practice of Big Data 29 be information about the text (e.g., its author, title, date, edition) and/or information that has been extracted using the text mining libraries (e.g., all names or locations mentioned in the text)
From page 30...
... 30 Leveraging Big Data to Improve Traffic Incident Management a face)
From page 31...
... State of the Practice of Big Data 31 enhance brand awareness and adoption. Figure 3-8 shows an example of the results of a graph analysis called betweenness centrality.
From page 32...
... 32 Leveraging Big Data to Improve Traffic Incident Management Example Application: Graph analysis often is used in fraud detection. In 2016, the International Consortium of Investigative Journalists (ICIJ)
From page 33...
... State of the Practice of Big Data 33 One benefit Big Data holds for transportation planners is the ability to track movements of vehicles and people on a scale never before imagined. Recent advances in crowd modeling systems have led to more focus on modeling complex locations; however, accurate data collection is one of the biggest limitations that crowd specialists face today (Alvarez 2015)
From page 34...
... 34 Leveraging Big Data to Improve Traffic Incident Management 3.3.4 Public Transportation Many city administrations recognize the value of using Big Data for public transportation, particularly for improving the management of bus fleets and optimizing maintenance and operations. In Sao Paulo, Brazil, Big Data collected in real time provides a more accurate picture of how many people ride the buses, which routes are on time, how drivers respond to changing conditions, and many other factors.
From page 35...
... State of the Practice of Big Data 35 • Develop data standards, especially if transportation agencies are not collecting and managing the data themselves; • Consider approaches to reduce the volume of connected vehicle and traveler data so that it is more manageable while ensuring that all valuable data is collected; • Utilize specific technologies and techniques like crowdsourcing, cloud computing, and federated database systems that have come to characterize the state-of-the-practice in Big Data and which will facilitate transportation operators or private sector data service providers in extracting value from connected-vehicle and traveler data; • Develop connected vehicle Big Data use cases that incorporate Big Data analytics approaches and the operational strategies that could derive from the knowledge gained through those approaches; and • Further investigate the potential cost and other resource implications of adopting Big Data approaches based on the outcome of the use-case investigation. Shi and Abdel-Aty (2015)
From page 36...
... 36 Leveraging Big Data to Improve Traffic Incident Management tunnel operations, chain stations, maintenance, and GIS shape files) in a cloud-based data lake.
From page 37...
... State of the Practice of Big Data 37 • Big Data in transport will lead to improved multi-source traffic and travel data availability and processing as well as to tools that improve multi-source traffic and travel data fusion. Combining big, open, and linked data will foster innovation and economic benefits.
From page 38...
... 38 Leveraging Big Data to Improve Traffic Incident Management data, researchers are developing the TIMELI system (Traffic Incident Management Enabled by Large-data Innovations) , which will make use of emerging large-scale data analytics to reduce the number of incidents and improve incident detection.

Key Terms



This material may be derived from roughly machine-read images, and so is provided only to facilitate research.
More information on Chapter Skim is available.