Project
Data Cleaning
Poor data quality is a well-known problem in data warehouses that arises for a variety of reasons such as data entry errors and differences in data representation among data sources. For example, one source may…
Publication
Squirrel: A decentralized peer-to-peer web cache
Tool
IceCube (engine and applications)
In a distributed system, shared data is replicated. Updates cause replicas to diverge. Reconciliation repairs the divergence. IceCube is a general-purpose reconciliation engine, parameterised by the semantics of shared data and by the users’ declared…
Publication