Entity Linkage: Similarity Measures and Algorithms - Divesh Srivastava

 

Entity Linkage: Similarity Measures and Algorithms - Divesh Srivastava

Abstract: A central problem in data quality research is the problem of entity linkage: given two large multi-attribute data sets, identify all pairs of entities in the two sets that are approximately the same. This problem arises in data cleaning, heterogeneous data integration, flexible querying, and a variety of other data-centric applications. This tutorial provides a comprehensive and cohesive overview of the key research results on this problem, focusing on similarity measures and efficient algorithms for this problem.

Bio: Divesh Srivastava is the head of the Database Research Department at AT&T Labs-Research. He received his Ph.D. from the University of Wisconsin, Madison, and his B.Tech. from the Indian Institute of Technology, Bombay, India. His current research interests include data quality and data stream management systems.

  
NICTA is Australia's new Information Communication Technology (ICT) centre of excellence Logo: Griffith University Logo: The University of Queensland Logo: CSIRO Logo: Victoria University Logo: University of New South Wales Logo: Maquarie University Logo: University of Melbourne Logo: University of Swinburne Logo: University of Newcastle Logo: University of Sydney Logo: University of South Australia
Logo: Queensland University of Technology Logo: University of Wooloongong Logo: The University of Technology, Sydney Logo: Monash University Logo: Edith Cowan University Logo: Australian National University Logo: SAP Logo: Microsoft