Proceedings of the 2005 CIDR Conference

Table of Contents

 

IR Applications and GUIs

 

Integrating DB and IR Technologies:  What is the Sound of One Hand Clapping?
Surajit Chaudhuri (MS), Raghu Ramakrishnan (University of Wisconsin), Gerhard Weikum (Max-Planck Institute of Computer Science)

    1

Haystack: A General-Purpose Information Management Tool for End Users Based on Semistructured Data
David Karger, Karun Bakshi, David Huynh (MIT), Dennis Quan (IBM), Vineet Sinha (MIT)

13

 

 

Web-based Infrastructure

 

The Architecture of PIER: an Internet-Scale Query Processor
Ryan Huebsch (UC Berkeley), Brent Chun (Intel Research Berkeley), Joseph Hellerstein (UC Berkeley and Intel Research), Boon Thau Loo (UC Berkeley), Petros Maniatis, Timothy Roscoe (Intel Research Berkeley), Scott Shenker, Ion Stoica (UC Berkeley), Aydan R. Yumerefendi (Intel Research Berkeley)

28

Toward Large Scale Integration: Building a MetaQuerier over Databases on the Web
Kevin Chen-Chuan Chang, Bin He, Zhen Zhang (UIUC)

44

A Scalability Service for Dynamic Web Applications
Christopher Olston, Amit Manjhi, Charles Garrod, Anastassia Ailamaki, Bruce Maggs, Todd Mowry (Carnegie Mellon University)

56

 

 

War Stories

 

Lessons Learned from Managing a Petabyte
Jacek Becla, Daniel L. Wang (Stanford Linear Accelerator Center)

70

Automatic Performance Diagnosis and Tuning in Oracle
Karl Dias, Mark Ramacher, Uri Shaft, Venkateshwaran Venkataramani, Graham Wood (Oracle Corp.)

84

Efficient Regression Tests for Database Applications
Florian Haftmann (i-TV-T AG), Donald Kossmann (ETH Zurich), Alexander Kreutz (i-TV-T AG)

95

 

 

Information Integration

 

ORCHESTRA: Rapid, Collaborative Sharing of Dynamic Data
Zachary Ives, Nitin Khandelwal, Aneesh Kapur (University of Pennsylvania), Murat Cakir (Drexel University)

107

A Platform for Personal Information Management and Integration
Xin (Luna) Dong, Alon Halevy (Univ. of Washington)

119

(Almost) Hands-Off Information Integration for the Life Sciences
Ulf Leser, Felix Naumann (Humboldt-UniversitŠt zu Berlin)

131

 

 

Keynote Presentation

 

Data on the Outside Versus Data on the Inside
Pat Helland (Microsoft)

144

 

 

Scientific Applications

 

When Database Systems Meet the Grid
Maria Nieto-Santisteban (Johns Hopkins University), Jim Gray (Microsoft), Alexander Szalay (Johns Hopkins University), James Annis (Fermilab), Aniruddha Thakar, William O_Mullane (Johns Hopkins University)

154

Deriving and Managing Data Products in an Environmental Observation and Forecasting System
Laura Bright, David Maier (OGI/OHSU)

162

 

 

Distributed Systems

 

How can we support Grid Transactions? Towards Peer-to-Peer Transaction Processing
Can Turker, Klaus Haller, Christoph Schuler (ETH Zurich), Hans Schek (ETH Zurich and UMIT Innsbruck)

174

Two Can Keep A Secret: A Distributed Architecture for Secure Database Services
Gagan Aggarwal, Mayank Bawa, Prasanna Ganesan, Hector Garcia-Molina, Krishnaram Kenthapadi, Rajeev Motwani, Utkarsh Srivastava, Dilys Thomas, Ying Xu (Stanford University)

186

User Profile Management in Converged Networks (Episode II): "Share your Data, Keep your Secrets"
Arnaud Sahuguet (Bell Labs),  Bogdan Alexe (Ecole Polytechnique), Irini Fundulaki (Bell Labs),  Pierre-Yves Lalilgand, Abdullatif Shikfa, Antoine Arnail (Ecole Polytechnique)

200

 

 

Query Processing

 

Cracking the Database Store
Martin Kersten, Stefan Manegold (CWI)

213

MonetDB/X100: Hyper-Pipelining Query Execution
Peter Boncz, Marcin Zukowski, Niels Nes (CWI)

225

Adaptive Query Processing in the Looking Glass
Shivnath Babu (Stanford University), Pedro Bizarro (University of Wisconsin, Madison)

238

Buffer-pool Aware Query Optimization
Ravishankar Ramamurthy, David Dewitt (University of Wisconsin, Madison)

250

 

 

Keynote Presentation

 

Trio: A System for Integrated Management of Data, Accuracy, and Lineage
Jennifer Widom (Stanford University)

262

Stream Processing

 

The Design of the Borealis Stream Processing Engine
Daniel Abadi (MIT), Yanif Ahmad (Brown University), Magdalena Balazinska (MIT), Ugur Cetintemel (Brown University), Mitch Cherniack (Brandeis University), Jeong-Hyon Hwang (Brown University), Wolfgang Lindner (MIT), Anurag S. Maskey (Brandeis University), Alexander Rasin (Brown University), Esther Ryvkina (Brandeis University), Nesime Tatbul, Ying Xing, Stan Zdonik (Brown University)

277

Design Considerations for High Fan-In Systems: The HiFi Approach
Michael J. Franklin, Shawn R. Jeffery, Sailesh Krishnamurthy, Frederick Reiss, Shariq Rizvi, Eugene Wu, Owen Cooper, Anil Edakkunni (UC Berkeley), Wei Hong (Intel Research Berkeley)

290

Action-Oriented Query Processing for Pervasive Computing
Wenwei Xue, Qiong Luo (HKUST)

305

Using Probabilistic Models for Data Management in Acquisitional Environments
Amol Deshpande (UC Berkeley), Carlos Guestrin (Carnegie Mellon University), Sam Madden (MIT)

317

 

 

Invited Real Time Integration Demo

 

The Geek-TonesŞ:  An Experiment in Distributed, Real-time Musical Integration
Mike Carey (BEA Systems), Dean Jacobs (Salesforce.com), Len Seligman (Mitre)

 

329