Course Topics
 
The following is the tentative list of topics that will be covered in this class and the list of papers that will be discussed:
 
  1. What is information integration?
  2. Mediators in the architecture of future information systems, Gio Wiederhold  
    IEEE Computer, 1992, Volume: 25 , Issue: 3, page(s): 38 - 49 [
    IEEE Xplore Link]
  3. Beauty and the Beast: The Theory and Practice of Information Integration, Laura Haas, Proceedings of ICDT, 2007. [Link]
  4. From databases to dataspaces: a new abstraction for information management, Micheal Franklin, Alon Halevy, David Maier, ACM SIGMOD Record, 2005. [ACM DL Link]
  5. Introduction to logic as a database language
  6. Principles of Database and Knowledge-base systems, Vol II. Jeffrey Ullman, Computer Science Press (ISBN 0-7167-8162-X), Chapters 12, 13
  7. Logical integration (LAV, GAV, GLAV)
  8. Query Processing in the Information Manifold., A Levy, A Rajaraman, J Ordille., Proc. VLDB Conference (1996), [VLDB Link] [Link]
  9. Navigational plans for data integration., M Friedman, A Levy, T Millstein. Proc. of the 16th Nat. Conf. on Artificial Intelligence (1999), [Link]
  10. Data integration: a theoretical perspective. Maurizio Lenzerini. Proceedings of the twenty-first ACM SIGMOD Conference (2002), [ACM DL Link]
  11. Answering queries using views: A survey. Alon Halevy, VLDB Journal, 2001. [Springer Link]
  12. Logical issues related to integration
  13. Answering queries using templates with binding patterns (extended abstract)., Anancl Rajaraman Yehoshua Sagiv Jeffrey D. Ullman, Proceedings of the fourteenth ACM SIGMOD Conference (1995), [ACM DL Link]
  14. Obtaining complete answers from incomplete databases., Alon Levy., Proc. of the 22nd Int. Conf. on Very Large Data Bases (1996), [VLDB Site]
  15. Tackling inconsistencies in data integration through source preferences., G De Giacomo, D Lembo, M Lenzerini, R Rosati., Proceedings of the 2004 international workshop on Information quality in information systems (2004) [ACM DL Link]
  16. Generating mappings
  17. Collective entity resolution in relational data, I Bhattacharya, L Getoor., ACM Transactions on Knowledge Discovery from Data (TKDD) (2007),  [ACM DL Link]
  18. Swoosh: A generic approach to entity resolution., O Benjelloun, H García-Molina, Q Su, J Widom., [Link]
  19. Schema Mapping as Query Discovery., R Miller, L Haas, M Hernández., Proceedings of the 26th International Conference on Very Large Databases (2000), [ACM DL Link]
  20. Reference reconciliation in complex information spaces., X Dong, A Halevy, J Madhavan., Proceedings of the 2005 ACM SIGMOD international conference (2005), [ACM DL Link]
  21. Exploiting relationships for object consolidation., Z Chen, D Kalashnikov, S Mehrotra., Proceedings of the 2nd international workshop on Information quality in information systems (2005), [ACM DL Link]
  22. Model management and schema composition
  23. Nested mappings: schema mapping reloaded., A Fuxman, M Hernández, H Ho, R Miller, P Papotti, Proceedings of the 32nd international conference on Very Large Databases (2006), [ACM DL Link]
  24. Representing and querying data transformations.. Y Velegrakis, R Miller, J Mylopoulos,  Proceedings of the International Conference on Data Engineering (2005), [IEEE Xplore Link]
  25. Ontologies
  26. TBA
  27. Integration and ontologies
  28. A Discovery-Based Approach to Database Ontology Design., S Castano, V De Antonellis., Distributed and Parallel Databases (1999), [Springer Link]
  29. Leveraging data and structure in ontology integration., O Udrea, L Getoor, R Miller.,
  30. Proceedings of the 2007 ACM SIGMOD international conference (2007) [ACM DL Link]
  31. Imprecise answers in distributed environments: Estimation of information loss for multi-ontology based query processing, E Mena, V Kashyap, A Illarramendi, A Sheth., International Journal of Cooperative Information Systems (2000) [Link]
  32. Other Topics
  33. Start making sense: The Chatty Web approach for global semantic agreements., K Aberer, P Cudré-Mauroux, M Hauswirth., Web Semantics: Science (2003) [Link]
  34. Peerdb: A p2p-based system for distributed data sharing., W Ng, B Ooi, K Tan, A Zhou., Proceedings of the 19th International Conference on Data … (2003) [Link]
  35. Mapping data in peer-to-peer systems: semantics and algorithmic issues., A Kementsietsidis, M Arenas, R Miller., Proceedings of the 2003 ACM SIGMOD international conference (2003) [ACM DL Link]
  36. Flows and views for scalable scientific process integration., Q Li, Z Shan, P Hung, D Chiu, S Cheung., Proceedings of the 1st international conference on Scalable Information Systems (2006) [ACM DL Link]
  37. Applications: scientific applications (esp. bioinformatics), personal information systems, mash-up systems
 
 
 
CSCI 6967 - Information Integration