The workshop will foster an informal atmosphere, where participants can
freely interact through short presentations and open discussions on their
ongoing work as well as discussion on current/future trends in data mining.
Questions can be asked freely during any time, and discussion slots have
been set aside in the morning and afternoon sessions. These slots are meant
to focus on critical evaluation of current approaches and open problems.
All speakers and participants are encouraged to think about these issues
prior to the workshop and to take active part in the ensuing discussions.
You can download the entire collection of papers in a single file
from here: http://www.cs.rpi.edu/tr/03-05.tar.gz
or by clicking on the individual papers below.
8:00-8:30 Continental Breakfast
8:30-8:35 Opening Remarks
8:35-9:10 Invited Talk Analyzing Massive Data
Streams: Past, Present, and Future: Minos
Garofalakis, Bells Labs
Abstract: Continuous data streams arise naturally, for example, in the
installations of large telecom and Internet service providers where
detailed usage information (Call-Detail-Records, SNMP/RMON packet-flow
data, etc.) from different parts of the underlying network needs to be
continuously collected and analyzed for interesting trends. Such
environments raise a critical need for effective stream-processing
algorithms that can provide (typically, approximate) answers to
data-analysis queries while utilizing only small space (to maintain
concise stream synopses) and small processing time per stream item. In
this talk, I will discuss the basic pseudo-random sketching mechanism
for building stream synopses and our ongoing work that exploits sketch
synopses to build an approximate SQL (multi) query processor. I will
also describe our recent results on extending sketching to handle more
complex forms of queries and streaming data (e.g., similarity joins
over streams of XML trees), and try to identify some challenging open
problems in the data-streaming area.
9:10-9:50 Data Streams I (Session Chair: Mohammed Zaki, RPI)
9:50-10:15 Coffee Break
10:15-11:20 DB Integration (Session Chair: Ankur Teredesai, RIT)
11:20-11:30 Short Discussion Session (Open Floor)
11:30-12:30 FCRC Plenary Talk by James Kurose on Networking
1:45-2:25 WWW Mining (Session Chair: Jean-Francois Boulicaut,
2:25-3:05 Data Streams II (Session Chair: Jean-Francois Boulicaut,
3:05-3:45 Bioinformatics (Session Chair: William Maniatty, SUNY Albany)
3:45-4:00 Coffee Break
4:00-4:50 Privacy & Security (Session Chair: William
Maniatty, SUNY Albany)
4:50-5:25 Discussion Session (Open Floor)
5:25-5:30 Closing Remarks
DMKD03 is held in conjunction with SIGMOD/PODS Conference, along with other
FCRC 2003 Conferences.
All participants must register for the workshop. The early registration
deadline for SIGMOD/PODS is 7th May 2003.
Registration details can be found at http://www.regmaster.com/fcrc2003.html
The registration fee includes the workshop proceedings, continental breakfast,
coffee breaks, and lunch.
The workshop proceedings will be distributed to all registered participants
at the workshop. In addition, online proceedings will be available at this
page. The proceedings will also be part of ACM Digital Library and SIGMOD
Anthology & DiSC.
Mohammed J. Zaki, Rensselaer Polytechnic Institute
Charu Aggarwal, IBM T.J. Watson Research Center (charu.AT.us.ibm.com)
Roberto Bayardo, IBM Almaden Research Center
Alok Choudhary, Northwestern University
Gautam Das, Microsoft Research
Venkatesh Ganti, Microsoft Research
Minos N. Garofalakis, Bell Labs
Dimitrios Gunopulos, University of California, Riverside
Jiawei Han, University of Illinois at Urbana-Champaign
Eamonn Keogh, University of Califirnia, Riverside
Nick Koudas, AT&T Research
Vipin Kumar, University of Minnesota
Bing Liu, University of Illinois at Chicago
Rosa Meo, University of Torino, Italy
Raymond Ng, University of British Columbia, Canada
Srini Parthasarathy, Ohio State University
Rajeev Rastogi, Bell Labs
Kyuseok Shim, Seoul National University
Hannu Toivonen, University of Helsinki
Philip S. Yu, IBM T.J. Watson Research Center
The original call for papers is available here.