Obtaining Meaningful Answers from Hidden Data of the Public Web

Austin, TX (Scicasts) — Much of the data of the World Wide Web hides like an iceberg below the surface. The so-called ‘deep web’ has been estimated to be 500 times bigger than the ‘surface web’ seen through search engines like Google.

For scientists and others, the deep web holds important computer code and its licensing agreements. Nestled further inside the deep web, one finds the ‘dark web,’ a place where images and video are used by traders in illicit drugs, weapons, and human trafficking. A new data-intensive supercomputer called Wrangler is helping researchers obtain meaningful answers from the hidden data of the public web.

The Wrangler supercomputer got its start in response to the question, can a computer be built to handle massive amounts of I/O (input and output)? The National Science Foundation (NSF) in 2013 got behind this effort and awarded the Texas Advanced Computing Center (TACC), Indiana University, and the University of Chicago $11.2 million to build a first-of-its-kind data-intensive supercomputer. Wrangler’s 600 terabytes of lightning-fast flash storage enabled the speedy reads and writes of files needed to fly past big data bottlenecks that can slow down even the fastest computers. It was built to work in tandem with



Article source: https://scicasts.com/big-data/2007-scientific-computing/12265-obtain-meaningful-answers-from-hidden-data-of-the-public-web/

Bitcoin Watch Shop