DIMACS Workshop on Big Data Integration
Month: June 2013
Date: June 20--21
Name: DIMACS Workshop on Big Data Integration
Location: DIMACS Center, CoRE Building, Rutgers University, Piscataway, New Jersey.
The Big Data era is upon us: data is being generated, collected and analyzed at an unprecedented scale, and data-driven decision making is sweeping through all aspects of technology and society. Since the value of data increases exponentially when it can be linked and fused with other data, addressing the big data integration challenge is critical to realizing the promise of Big Data - and conversely, Big Data techniques are critical to the goals of simplifying data integration.
The convergence of Big Data and data integration is emerging in many forms, largely motivated by the goals of integrating structured data on the Web or across communities. Increasingly we are seeing problems where (i) the number of data sources, even for a single domain, has grown to be in the tens of thousands, (ii) many of the data sources are very dynamic, as large volumes of newly collected data are continuously made available, (iii) the data sources are extremely heterogeneous in their structure, with considerable variety even for conceptually similar entities, and (iv) the data sources are of widely differing quality, with significant differences in the coverage, accuracy and timeliness of data provided.
Xin Luna Dong, AT&T Labs-Research,
email@example.com; Zachary Ives, University of Pennsylvania,
firstname.lastname@example.org. Presented under the auspices of the DIMACS Special Focus on Information Sharing and Dynamic Data Analysis.