Main

Dmcourse.Main History

Hide minor edits - Show changes to markup

December 03, 2012, at 09:02 PM by 128.113.126.13 -
Added line 211:
December 02, 2012, at 06:10 PM by 128.113.126.13 -
Added line 210:
November 29, 2012, at 05:20 PM by 128.113.126.13 -
Changed line 198 from:
to:
Changed lines 203-205 from:

[l] FPM: Sequence Mining

to:

[l] FPM: Sequence Mining [l] Attach:chap10.pdf [l] Attach:Lecture21.PDF

November 26, 2012, at 03:31 PM by 128.113.126.13 -
Added line 199:
November 25, 2012, at 10:09 PM by 128.113.126.13 -
Added line 198:
November 19, 2012, at 08:42 PM by 128.113.126.13 -
Added line 189:
November 15, 2012, at 07:13 PM by 128.113.126.13 -
Changed lines 181-182 from:

[l]

to:

[l] CLUS: Spectral & Graph Clustering [l]

November 15, 2012, at 07:12 PM by 128.113.126.13 -
Changed lines 181-182 from:

[l] CLUS: Evaluation & Assessment [l] Attach:chap18.pdf

to:
Changed lines 186-187 from:

[l] Frequent Pattern Mining (FPM): Itemset Mining

to:

[l] CLUS: Evaluation & Assessment [l] Attach:chap18.pdf

Changed lines 195-196 from:

[l] FPM: Sequence Mining

to:

[l] Frequent Pattern Mining (FPM): Itemset Mining

Changed line 199 from:

[l] FPM: Graph Mining

to:

[l] FPM: Sequence Mining

Changed line 203 from:

[l] FPM: Pattern Assessment

to:

[l] FPM: Graph Mining

November 13, 2012, at 06:38 AM by 128.113.126.13 -
Changed line 182 from:
to:
November 13, 2012, at 06:36 AM by 128.113.126.13 -
Changed line 178 from:

[l]

to:
Added line 182:
November 11, 2012, at 07:54 PM by 128.113.126.13 -
Changed lines 177-178 from:
to:
November 08, 2012, at 07:40 PM by 128.113.126.13 -
Added line 172:
November 08, 2012, at 04:10 PM by 128.113.126.13 -
Added line 25:
  • Nov 8: Assign5 has be posted. It is due on 16th Nov, before midnight.
November 07, 2012, at 07:04 PM by 128.113.126.13 -
Added lines 163-168:

[l] CLUS: EM-based [l] [l] Attach:Lecture15.PDF

[row bgcolor=aliceblue] [l]R: Nov 8

Changed lines 170-173 from:

[row bgcolor=aliceblue] [l]R: Nov 8 [l] CLUS: Subspace Clustering

to:
October 25, 2012, at 04:07 PM by 128.113.126.13 -
Added line 151:
October 24, 2012, at 10:47 PM by 128.113.126.13 -
Changed lines 149-150 from:

[l] Clustering (CLUS): Hierarchical, Partitional

to:

[l] Clustering (CLUS): Partitional [l] Attach:chap13.pdf

Changed line 162 from:

[l] CLUS: Density-based Clustering

to:

[l] CLUS: Hierarchical, Density-based Clustering

October 23, 2012, at 04:32 PM by 128.113.126.13 -
Added line 25:
  • Oct 23: Assign4 has be posted. It is due on 30th Oct, before midnight.
October 22, 2012, at 06:07 PM by 128.113.126.13 -
Added line 142:

[l]

October 22, 2012, at 06:05 PM by 128.113.126.13 -
Changed lines 141-142 from:

[l] CLASS: Classifier Evaluation

to:

[l] CLASS: Classifier Evaluation [l] Attach:Lecture13.PDF

October 18, 2012, at 02:53 PM by 128.113.126.13 -
Added line 137:
October 17, 2012, at 09:01 PM by 128.113.126.13 -
Changed line 136 from:
to:
October 15, 2012, at 04:00 PM by 128.113.126.13 -
Added line 131:
October 14, 2012, at 11:42 PM by 128.113.126.13 -
Changed line 130 from:
to:
October 14, 2012, at 11:41 PM by 128.113.126.13 -
Added line 130:
October 12, 2012, at 07:59 PM by 128.113.126.13 -
Added line 25:
  • Oct 12: Assign3 has be posted. It is due on 19th Oct, before midnight.
October 11, 2012, at 09:11 PM by 128.113.126.13 -
Changed line 117 from:
to:
Added lines 123-124:
October 09, 2012, at 07:42 PM by 128.113.126.13 -
Changed line 118 from:

[l]

to:
Changed line 122 from:

[l] CLASS: SVMs, Bayesian Classifier

to:

[l] CLASS: SVMs

Changed line 126 from:

[l] ASS: Decision Trees

to:

[l] CLASS: Bayesian Classifier, Decision Trees

October 08, 2012, at 09:40 PM by 128.113.126.13 -
Changed line 117 from:
to:
October 02, 2012, at 10:12 PM by 128.113.126.13 -
Changed lines 104-107 from:

[l] DA: Kernels, Classification (CLASS): Linear Discriminants [l] Attach:chap22.pdf [l]

to:

[l] DA: Kernels [l] [l] Attach:Lecture8.PDF

Changed lines 116-118 from:

[l] CLASS: SVMs

to:

[l] Classification (CLASS): Linear Discriminants, SVMs [l] Attach:chap22.pdf [l]

September 30, 2012, at 09:14 PM by 128.113.126.13 -
Changed lines 105-106 from:
to:
September 30, 2012, at 08:25 PM by 128.113.126.13 -
Changed lines 104-106 from:

[l] Classification (CLASS): Linear Discriminants & SVMs

to:

[l] DA: Kernels, Classification (CLASS): Linear Discriminants

Changed line 119 from:

[l] CLASS: Bayesian Classifier CL

to:

[l] CLASS: SVMs, Bayesian Classifier

September 30, 2012, at 08:23 PM by 128.113.126.13 -
Changed lines 104-105 from:

[l] Classification (CLASS): Bayesian Classifier

to:

[l] Classification (CLASS): Linear Discriminants & SVMs

Changed line 115 from:

[l] CLASS: Decision Trees

to:

[l] CLASS: SVMs

Changed line 119 from:

[l] CLASS: Linear Discriminants & SVMs

to:

[l] CLASS: Bayesian Classifier CL

Changed line 123 from:

[l] CLASS: SVMs

to:

[l] ASS: Decision Trees

September 27, 2012, at 03:18 PM by 128.113.126.13 -
Added line 100:
September 24, 2012, at 10:20 PM by 128.113.126.13 -
Added line 25:
  • Sep 24: Assign2 has be posted. It is due on 1st Oct, before midnight.
September 24, 2012, at 04:10 PM by 128.113.126.13 -
Changed lines 91-93 from:

[l] DA: Categorical Data & Kernel Methods [l] Attach:chap3.pdf, Attach:chap5.pdf

to:

[l] DA: Categorical Data & [l] Attach:chap3.pdf [l] Attach:Lecture6.PDF

Changed lines 97-98 from:

[l] DA: Dimensionality Reduction

to:

[l] DA: Kernel Methods [l] Attach:chap5.pdf

September 20, 2012, at 04:07 PM by 128.113.126.13 -
Changed lines 85-86 from:

[l] DA: Categorical Data & High Dimensional Analysis [l] Attach:chap3.pdf, Attach:chap6.pdf

to:

[l] DA: High Dimensional Analysis [l] Attach:chap6.pdf [l] Attach:Lecture5.PDF

Changed lines 91-92 from:

[l] Kernel Methods

to:

[l] DA: Categorical Data & Kernel Methods [l] Attach:chap3.pdf, Attach:chap5.pdf

September 17, 2012, at 01:40 PM by 128.113.126.13 -
Added line 81:
September 17, 2012, at 01:39 PM by 128.113.126.13 -
Changed lines 79-81 from:

[l] DA: Dimensionality Reduction & Categorical Data [l] Attach:chap3.pdf , Attach:chap7.pdf

to:

[l] DA: Dimensionality Reduction [l] Attach:chap7.pdf

Changed lines 84-85 from:

[l] DA: High Dimensional Analysis

to:

[l] DA: Categorical Data & High Dimensional Analysis [l] Attach:chap3.pdf, Attach:chap6.pdf

September 16, 2012, at 11:36 PM by 128.113.126.13 -
Changed line 80 from:
to:
September 14, 2012, at 12:55 PM by 128.113.126.13 -
Changed line 25 from:
  • Sep 14: [Dmcourse/Assign1 | Assign1]] has be posted. It is due on 21st Sep, before midnight.
to:
  • Sep 14: Assign1 has be posted. It is due on 21st Sep, before midnight.
September 14, 2012, at 12:54 PM by 128.113.126.13 -
Added line 25:
  • Sep 14: [Dmcourse/Assign1 | Assign1]] has be posted. It is due on 21st Sep, before midnight.
September 13, 2012, at 01:57 PM by 128.113.126.13 -
Changed line 65 from:

[l] DA: Numeric and Categorical Attributes

to:

[l] DA: Numeric Attributes

Changed lines 71-72 from:

[l] DA: Numeric and Categorical Attributes [l] Attach:chap3.pdf

to:

[l] DA: Numeric Attributes: Eigen-decomposition [l]

Changed lines 78-79 from:

[l] DA: Kernel Approach and Graph Analysis

to:

[l] DA: Dimensionality Reduction & Categorical Data [l] Attach:chap3.pdf

Changed line 87 from:

[l] DA: Dimensionality Reduction

to:

[l] Kernel Methods

September 13, 2012, at 01:54 PM by 128.113.126.13 -
Added line 73:
September 11, 2012, at 09:31 AM by 128.113.126.13 -
Changed line 60 from:
to:
Changed lines 66-67 from:
to:
Changed lines 71-72 from:

[l] DA: Kernel Approach

to:

[l] DA: Numeric and Categorical Attributes [l] Attach:chap3.pdf

Changed line 77 from:

[l] DA: Graph Analysis

to:

[l] DA: Kernel Approach and Graph Analysis

September 07, 2012, at 11:47 AM by 128.113.126.13 -
Added line 25:
  • Sep 7: Everyone enrolled in the course should have already signed up for the piazza account (or they should have received an email to do so). Please sign up immediately to receive class announcements and emails.
September 07, 2012, at 09:39 AM by 128.113.126.13 -
Added line 43:

[!c]Lectures

Changed line 60 from:
to:
September 05, 2012, at 07:36 PM by 128.113.126.13 -
Changed lines 58-59 from:

[l] (Attach:)chap1.pdf, (Attach:)chap2.pdf

to:
Changed line 64 from:

[l] (Attach:)chap3.pdf

to:
September 05, 2012, at 07:36 PM by 128.113.126.13 -
Changed lines 58-59 from:
to:

[l] (Attach:)chap1.pdf, (Attach:)chap2.pdf

Added line 64:

[l] (Attach:)chap3.pdf

September 04, 2012, at 07:29 AM by 128.113.126.13 -
Changed lines 105-106 from:

[l] CLASS: SVMs FPM: Pattern Assessment

to:

[l] CLASS: SVMs

Changed line 143 from:

[l] CLUS: Spectral & Graph Clustering

to:

[l] CLUS: Evaluation & Assessment

Changed lines 147-148 from:

[l] CLUS: Evaluation & Assessment

to:

[l] Frequent Pattern Mining (FPM): Itemset Mining

Changed lines 155-156 from:

[l] Frequent Pattern Mining (FPM): Itemset Mining

to:

[l] FPM: Sequence Mining

Changed line 159 from:

[l] FPM: Sequence Mining

to:

[l] FPM: Graph Mining

Changed line 163 from:

[l] FPM: Graph Mining

to:

[l] FPM: Pattern Assessment

September 04, 2012, at 07:27 AM by 128.113.126.13 -
Changed lines 62-63 from:

[l] DA: Numeric Attributes

to:

[l] DA: Numeric and Categorical Attributes

Changed lines 66-67 from:

[l] DA: Categorical Attributes

to:

[l] DA: Kernel Approach

Changed lines 71-72 from:

[l] DA: Kernel Approach

to:

[l] DA: Graph Analysis

Changed line 75 from:

[l] DA: High Dimensional Analysis

to:

[l] DA: High Dimensional Analysis

Changed lines 79-80 from:

[l] DA: Dimensionality Reduction

to:

[l] DA: Dimensionality Reduction

Changed line 83 from:

[l] Frequent Pattern Mining (FPM): Itemset Mining

to:

[l] DA: Dimensionality Reduction

Changed lines 87-88 from:

[l] FPM: Sequence Mining

to:

[l] Classification (CLASS): Bayesian Classifier

Changed line 97 from:

[l] DA: Graph Analysis

to:

[l] CLASS: Decision Trees

Changed line 101 from:

[l] DA: Graph Analysis & Mining

to:

[l] CLASS: Linear Discriminants & SVMs

Changed lines 105-106 from:

[l] FPM: Graph Mining

to:

[l] CLASS: SVMs FPM: Pattern Assessment

Changed lines 109-110 from:

[l] FPM: Pattern Assessment

to:

[l] CLASS: Classifier Evaluation

Changed lines 114-116 from:

[l] Classification (CLASS): Bayesian Classifier

to:

[l] CLASS: Classifier Evaluation

Changed line 119 from:

[l] CLASS: Decision Trees

to:

[l] Clustering (CLUS): Hierarchical, Partitional

Changed lines 131-132 from:

[l] CLASS: Linear Discriminants & SVMs

to:

[l] CLUS: Density-based Clustering

Changed line 135 from:

[l] CLASS: SVMs

to:

[l] CLUS: Subspace Clustering

Changed lines 139-140 from:

[l] CLASS: Classifier Evaluation

to:

[l] CLUS: Spectral & Graph Clustering

Changed line 143 from:

[l] Clustering (CLUS): Hierarchical, Partitional

to:

[l] CLUS: Spectral & Graph Clustering

Changed lines 147-148 from:

[l] CLUS: Density-based Clustering

to:

[l] CLUS: Evaluation & Assessment

Changed lines 155-156 from:

[l] CLUS: Spectral & Graph Clustering

to:

[l] Frequent Pattern Mining (FPM): Itemset Mining

Changed line 159 from:

[l] CLUS: Subspace Clustering

to:

[l] FPM: Sequence Mining

Changed line 163 from:

[l] CLUS: Evaluation & Assessment

to:

[l] FPM: Graph Mining

September 02, 2012, at 08:22 PM by 128.113.126.13 -
Changed line 11 from:

TA Office Hours: TBA\\

to:

TA Office Hours: W 2-4PM, Amos Eaton 119\\

September 02, 2012, at 10:45 AM by 128.113.126.13 -
Changed line 12 from:

TA Contact:

to:

TA Contact:

September 02, 2012, at 10:38 AM by 128.113.126.13 -
Changed line 12 from:

TA Contact: email:talkun@rpi.edu

to:

TA Contact:

September 02, 2012, at 10:37 AM by 128.113.126.13 -
Changed line 12 from:

TA Contact: emailto:talkun@rpi.edu

to:

TA Contact: email:talkun@rpi.edu

September 02, 2012, at 10:37 AM by 128.113.126.13 -
Changed line 12 from:
to:

TA Contact: emailto:talkun@rpi.edu

September 02, 2012, at 10:37 AM by 128.113.126.13 -
Changed line 12 from:
to:
September 02, 2012, at 10:35 AM by 128.113.126.13 -
Changed line 6 from:

Room: TBA\\

to:

Room: Greene 120\\

Changed line 10 from:

TA: TBA\\

to:

TA: Nilothpal Talukder\\

Changed line 12 from:

TA Contact: TBA

to:
August 07, 2012, at 12:31 PM by 128.113.126.13 -
Changed lines 62-63 from:
to:

[l] DA: Numeric Attributes

Changed line 66 from:

[l]DA: Numeric Attributes

to:

[l] DA: Categorical Attributes

August 07, 2012, at 12:29 PM by 128.113.126.13 -
Changed line 150 from:

[l]Thanksgiving Break

to:

[l]Thanksgiving Break

August 07, 2012, at 12:29 PM by 128.113.126.13 -
Changed line 7 from:

Instructor Office Hours: 12-1PM, MR, Lally 307\\

to:

Instructor Office Hours: MR 12-1PM, Lally 307\\

August 06, 2012, at 02:22 PM by 128.113.126.13 -
Changed lines 194-195 from:

There is no required text for the course. Notes will be posted online on the course webpage.

to:

Students will be given draft chapters from the forthcoming book

  • Data Mining and Analysis: Foundations and Algorithms, Mohammed J. Zaki and Wagner Meira, Jr, Cambridge University Press, 2013.
Changed lines 210-212 from:
  • Attendance: Students are strongly encouraged to participate in the class, and should try to attend all classes. Students are responsible for brushing up on any missed material.
  • Laptops: Absolutely no laptops will be allowed in class during lectures. The only exception is during exams, to access the class notes online and to use the calculator. Even during the exam, you may not use any other software (e.g., R, python, matlab, etc.) for the computations, and you may not "browse" for solutions (you are not likely to find anything!).
to:
  • Attendance: Students are strongly encouraged to participate in the class, and should try to attend all classes. Students are responsible for any topics and assignments for the missed classes.
  • Laptops: Absolutely no laptops will be allowed in class during lectures. The only exception is during exams, to access the class notes online and to use the calculator functions. Even during the exam, you may not use any other software (e.g., R, python, matlab, etc.) for the computations.
August 06, 2012, at 02:15 PM by 128.113.126.13 -
Added line 96:

[l] DA: Graph Analysis

Changed line 100 from:

[l] DA: Graph Analysis

to:

[l] DA: Graph Analysis & Mining

August 06, 2012, at 02:14 PM by 128.113.126.13 -
Changed lines 57-58 from:

[l] Data Mining and Analysis (DA): Introduction

to:

[l] Data Mining and Analysis (DA): Algebraic and Probabilistic Views

Changed lines 65-66 from:

[l]DA: Algebraic and Probabilistic Views

to:

[l]DA: Numeric Attributes

Deleted lines 69-72:

[l] DA: Numeric Attributes

[row bgcolor=aliceblue] [l]R: Sep 20

Added lines 71-74:

[row bgcolor=aliceblue] [l]R: Sep 20 [l] DA: High Dimensional Analysis

Deleted lines 77-80:

[l] DA: High Dimensional Analysis

[row bgcolor=aliceblue] [l]R: Sep 27

Added lines 79-82:

[row bgcolor=aliceblue] [l]R: Sep 27 [l] Frequent Pattern Mining (FPM): Itemset Mining

Changed lines 86-87 from:

[l] Frequent Pattern Mining (FPM): Itemset Mining

to:

[l] FPM: Sequence Mining

Changed line 99 from:

[l] FPM: Sequence Mining

to:

[l] DA: Graph Analysis

Changed lines 103-104 from:

[l] DA: Graph Analysis

to:

[l] FPM: Graph Mining

Changed lines 107-108 from:

[l] FPM: Graph Mining

to:

[l] FPM: Pattern Assessment

Changed lines 112-114 from:

[l] FPM: Graph Mining

to:

[l] Classification (CLASS): Bayesian Classifier

Changed line 117 from:

[l] FPM: Pattern Assessment

to:

[l] CLASS: Decision Trees

Changed lines 129-130 from:

[l] , Classification (CLASS): Linear Discriminants

to:

[l] CLASS: Linear Discriminants & SVMs

Deleted lines 136-139:

[l] CLASS: Bayesian Classifier, Decision Trees

[row bgcolor=aliceblue] [l]R: Nov 15

Added lines 138-141:

[row bgcolor=aliceblue] [l]R: Nov 15 [l] Clustering (CLUS): Hierarchical, Partitional

Changed lines 145-146 from:

[l] Clustering (CLUS): Hierarchical, Partitional

to:

[l] CLUS: Density-based Clustering

Changed lines 153-154 from:

[l] CLUS: Density-based Clustering

to:

[l] CLUS: Spectral & Graph Clustering

Changed line 157 from:

[l] CLUS: Spectral & Graph Clustering

to:

[l] CLUS: Subspace Clustering

August 06, 2012, at 02:06 PM by 128.113.126.13 -
August 06, 2012, at 02:06 PM by 128.113.126.13 -
August 06, 2012, at 02:06 PM by 128.113.126.13 -
Changed lines 57-58 from:

[l]DA: Numeric Attributes

to:

[l] Data Mining and Analysis (DA): Introduction

Changed lines 65-66 from:

[l]DA: Numeric Attributes & Eigenvectors

to:

[l]DA: Algebraic and Probabilistic Views

Changed lines 70-71 from:

[l] DA: Categorical Data

to:

[l] DA: Numeric Attributes

Changed line 74 from:

[l] DA: Graph Data

to:

[l] DA: Kernel Approach

Changed lines 78-79 from:

[l] DA: Graph Models

to:

[l] DA: High Dimensional Analysis

Changed line 82 from:

[l] DA: Kernel Methods

to:

[l] DA: Dimensionality Reduction

Changed lines 86-87 from:

[l]DA: High Dimensional Analysis

to:

[l] Frequent Pattern Mining (FPM): Itemset Mining

Changed line 99 from:

[l]DA: Dimensionality Reduction

to:

[l] FPM: Sequence Mining

Changed lines 103-104 from:

[l] Frequent Pattern Mining (FPM): Itemset Mining

to:

[l] DA: Graph Analysis

Changed lines 107-108 from:

[l] FPM: Itemset Summaries & Sequence Mining

to:

[l] FPM: Graph Mining

Changed lines 112-114 from:

[l] FPM: Sequence Mining, Graph Mining

to:

[l] FPM: Graph Mining

Changed line 117 from:

[l] FPM: Graph Mining, Classification (CLASS): Linear Discriminants

to:

[l] FPM: Pattern Assessment

Changed lines 129-130 from:

[l] CLASS: SVMs

to:

[l] , Classification (CLASS): Linear Discriminants

Changed line 133 from:

[l] CLASS: Bayesian Classifier, Decision Trees

to:

[l] CLASS: SVMs

Changed lines 137-138 from:

[l] Clustering (CLUS): Partitional

to:

[l] CLASS: Bayesian Classifier, Decision Trees

Changed line 141 from:

[l] CLUS: Hierarchical Clustering

to:

[l] CLASS: Classifier Evaluation

Changed lines 145-146 from:

[l] CLUS: Density-based Clustering,

to:

[l] Clustering (CLUS): Hierarchical, Partitional

Changed lines 153-154 from:

[l] CLUS: Subspace Clustering

to:

[l] CLUS: Density-based Clustering

Changed line 157 from:

[l] Spectral & Graph Clustering

to:

[l] CLUS: Spectral & Graph Clustering

Changed line 161 from:

[l] Evaluation & Assessment

to:

[l] CLUS: Evaluation & Assessment

August 06, 2012, at 01:52 PM by 128.113.126.13 -
Changed lines 121-122 from:

[l] CLASS: Linear Discriminants, Support Vector Machines (SVM)

to:

[l] NO CLASS

Changed line 125 from:

[l] CLASS: SVMs

to:

[l] EXAM II

Changed line 129 from:

[l]EXAM II

to:

[l] CLASS: SVMs

August 06, 2012, at 01:49 PM by 128.113.126.13 -
Added line 26:
  • Aug 6: Students in the class must sign up for the Piazza course discussion site. All discussions and Q&A will be carried out using Piazza.
August 06, 2012, at 01:32 PM by 128.113.126.13 -
Changed lines 41-42 from:

[!c]Chapters [!c]Lecture Notes

to:

[!c]Readings

August 06, 2012, at 01:31 PM by 128.113.126.13 -
August 06, 2012, at 01:31 PM by 128.113.126.13 -
Changed line 50 from:

[l] NO CLASS%

to:

[l] NO CLASS

Changed lines 58-59 from:
to:
Changed lines 62-64 from:

[l] NO CLASS NSF-RPI Workshop on Complex Data

to:
Changed lines 66-67 from:

[l] [l] lecture3.pdf

to:
Changed lines 71-73 from:
to:
Deleted lines 74-75:
Changed lines 79-81 from:

[l] [l] lecture6.pdf

to:
Deleted lines 82-83:
Changed lines 87-89 from:
to:
Deleted line 95:

[l] NO CLASS

Deleted lines 99-100:
Changed lines 104-105 from:
to:
Changed lines 108-109 from:
to:
Changed lines 113-116 from:
to:
Deleted lines 117-118:
Changed lines 122-124 from:
to:
Deleted lines 125-126:

[l] [l] lecture15.pdf

Deleted lines 133-134:
Changed lines 138-140 from:
to:
Deleted lines 141-142:
Changed lines 146-148 from:
to:
Changed lines 154-155 from:
to:
Deleted lines 157-158:
Deleted lines 161-162:
August 06, 2012, at 01:26 PM by 128.113.126.13 -
Changed lines 25-34 from:
  • Nov 70: Assignment 6 has been posted.
  • Nov 7: Assignment 5 has been posted.
  • Oct 25: updated chap8.pdf on PCA, kernel PCA and SVD.
  • Oct 24: Assignment 4 has been posted.
  • Oct 14: Assignment 3 has been posted.
  • Sep 25: Assignment 2 has been posted.
  • Sep 17: Assignment 1 has been posted.
  • Sep 14: Activate your piazza account
  • Sep 12: Book chapters, as well as lectures are posted online after each lecture. Make sure to check the course website.
  • Aug 18: Course website is up, with the tentative calendar and syllabus.
to:
  • Aug 6: Course website is up, with the syllabus and tentative calendar.
Changed lines 45-47 from:

[l]M: Aug 29 [l]CLASSES CANCELLED

to:

[l]M: Aug 27 [l]NO CLASS

Changed lines 49-52 from:

[l]R: Sep 1 [l] Data Mining Overview & Data Analysis Foundations (DA): Algebraic & Probabilistic Views [l] chap1.pdf [l] Attach:dmintro.pptx,lecture1.pdf

to:

[l]R: Aug 30 [l] NO CLASS%

Changed lines 53-54 from:

[l]M: Sep 5 [l]Labor Day Holiday

to:

[l]M: Sep 3 [l]Labor Day Holiday

Changed line 56 from:

[l]R: Sep 8

to:

[l]R: Sep 6

Changed line 62 from:

[l]M: Sep 12

to:

[l]M: Sep 10

Changed line 67 from:

[l]R: Sep 15

to:

[l]R: Sep 13

Changed line 73 from:

[l]M: Sep 19

to:

[l]M: Sep 17

Changed line 79 from:

[l]R: Sep 22

to:

[l]R: Sep 20

Changed line 85 from:

[l]M: Sep 26

to:

[l]M: Sep 24

Changed line 91 from:

[l]R: Sep 29

to:

[l]R: Sep 27

Changed line 97 from:

[l]M: Oct 3

to:

[l]M: Oct 1

Changed line 103 from:

[l]R: Oct 6

to:

[l]R: Oct 4

Changed line 109 from:

[l]Tue: Oct 11

to:

[l]Tue: Oct 9

Changed line 113 from:

[l]R: Oct 13

to:

[l]R: Oct 11

Changed line 119 from:

[l]M: Oct 17

to:

[l]M: Oct 15

Changed line 124 from:

[l]R: Oct 20

to:

[l]R: Oct 18

Changed line 130 from:

[l]M: Oct 24

to:

[l]M: Oct 22

Changed line 137 from:

[l]R: Oct 27

to:

[l]R: Oct 25

Changed line 143 from:

[l]M: Oct 31

to:

[l]M: Oct 29

Changed line 149 from:

[l]R: Nov 3

to:

[l]R: Nov 1

Changed line 155 from:

[l]M: Nov 7

to:

[l]M: Nov 5

Changed line 159 from:

[l]R: Nov 10

to:

[l]R: Nov 8

Changed line 165 from:

[l]M: Nov 14

to:

[l]M: Nov 12

Changed line 171 from:

[l]R: Nov 17

to:

[l]R: Nov 15

Changed line 177 from:

[l]M: Nov 21

to:

[l]M: Nov 19

Changed line 183 from:

[l]R: Nov 24

to:

[l]R: Nov 22

Changed line 187 from:

[l]M: Nov 28

to:

[l]M: Nov 26

Changed line 192 from:

[l]R: Dec 1

to:

[l]R: Nov 29

Changed line 198 from:

[l]M: Dec 5

to:

[l]M: Dec 3

Changed line 203 from:

[l]R: Dec 8

to:

[l]R: Dec 6

August 06, 2012, at 01:20 PM by 128.113.126.13 -
Changed line 1 from:

CSCI-4390/6390: Data Mining, Fall 2011

to:

CSCI-4390/6390: Data Mining, Fall 2012

Changed line 6 from:

Room: Carnegie 113\\

to:

Room: TBA\\

Changed lines 10-12 from:

TA: Amina Shabbeer
TA Office Hours: 4-5PM, TW, AE 304
TA Contact: shabba@rpi.edu

to:

TA: TBA
TA Office Hours: TBA
TA Contact: TBA

December 05, 2011, at 01:56 PM by 128.113.126.13 -
Changed lines 211-212 from:
to:
December 01, 2011, at 04:33 PM by 128.113.126.13 -
Added line 206:
November 28, 2011, at 04:55 PM by 128.113.126.13 -
Changed lines 199-200 from:

[l] CLUS: Subspace Clustering, Spectral & Graph Clustering [l] chap19.pdf, chap20.pdf

to:

[l] CLUS: Subspace Clustering [l] chap19.pdf [l] lecture20.pdf

Changed lines 204-205 from:

[l] Evaluation & Assessment

to:

[l] Spectral & Graph Clustering [l] chap20.pdf

Added line 210:
November 21, 2011, at 01:10 PM by 128.113.126.13 -
Changed lines 189-191 from:

[l] CLUS: Density-based Clustering, Subspace Clustering [l] chap18.pdf, chap19.pdf

to:

[l] CLUS: Density-based Clustering, [l] chap18.pdf [l] lecture19.pdf

Changed lines 199-200 from:

[l] CLUS: Spectral & Graph Clustering [l] chap20.pdf

to:

[l] CLUS: Subspace Clustering, Spectral & Graph Clustering [l] chap19.pdf, chap20.pdf

November 20, 2011, at 01:46 PM by 128.113.126.13 -
Added line 25:
November 20, 2011, at 11:47 AM by 128.113.126.13 -
Changed lines 188-190 from:

[l] CLUS: Density-based Clustering [l] chap18.pdf

to:

[l] CLUS: Density-based Clustering, Subspace Clustering [l] chap18.pdf, chap19.pdf

Deleted lines 196-199:

[l] CLUS: Subspace Clustering

[row bgcolor=aliceblue] [l]R: Dec 1

Added lines 198-201:

[l] chap20.pdf [row bgcolor=aliceblue] [l]R: Dec 1 [l] Evaluation & Assessment

Changed line 205 from:

[l] CLUS: Graph Clustering

to:

[l] Evaluation & Assessment

November 17, 2011, at 12:53 PM by 128.113.126.13 -
Added line 179:
Added line 184:
November 16, 2011, at 07:44 PM by 128.113.126.13 -
Changed line 182 from:
to:
Added line 187:
November 14, 2011, at 02:27 PM by 128.113.126.13 -
Changed line 178 from:
to:
November 10, 2011, at 05:26 PM by 128.113.126.13 -
Added line 177:
November 10, 2011, at 05:02 PM by 128.113.126.13 -
Changed line 172 from:
to:
November 09, 2011, at 09:23 PM by 128.113.126.13 -
Changed lines 170-171 from:

[l] CLASS: Bayesian Classifier

to:

[l] CLASS: Bayesian Classifier, Decision Trees [l] chap26.pdf, chap24.pdf

November 07, 2011, at 09:15 PM by 128.113.126.13 -
Added line 25:
November 03, 2011, at 08:45 PM by 128.113.126.13 -
Added line 160:

[l]

November 03, 2011, at 08:43 PM by 128.113.126.13 -
Changed lines 159-160 from:

[l] CLASS: SVMs & Decision Trees

to:

[l] CLASS: SVMs [l] lecture15.pdf

October 31, 2011, at 12:49 PM by 128.113.126.13 -
Added line 155:
October 27, 2011, at 12:37 PM by 128.113.126.13 -
Added line 148:
Deleted line 149:
October 27, 2011, at 12:35 PM by 128.113.126.13 -
Changed lines 147-148 from:

[l] FPM: Graph Mining

to:

[l] FPM: Graph Mining, Classification (CLASS): Linear Discriminants [l] lecture13.pdf [l] chap27.pdf

Changed lines 153-154 from:

[l] Classification (CLASS): Linear Discriminants, Support Vector Machines (SVM) [l] chap27.pdf, chap28.pdf

to:

[l] CLASS: Linear Discriminants, Support Vector Machines (SVM) [l] chap28.pdf

October 26, 2011, at 05:43 PM by 128.113.126.13 -
Changed line 153 from:
to:
October 25, 2011, at 01:54 PM by 128.113.126.13 -
Added line 25:
  • Oct 25: updated chap8.pdf on PCA, kernel PCA and SVD.
October 24, 2011, at 05:24 PM by 128.113.126.13 -
Added line 25:
October 24, 2011, at 05:11 PM by 128.113.126.13 -
Added line 140:
October 23, 2011, at 07:56 PM by 128.113.126.13 -
Added line 139:
October 20, 2011, at 02:54 PM by 128.113.126.13 -
Added line 134:
Added lines 138-142:

[l] FPM: Sequence Mining, Graph Mining

[row bgcolor=aliceblue] [l]R: Oct 27

Deleted lines 144-148:

[row bgcolor=aliceblue] [l]R: Oct 27 [l] Classification (CLASS): Linear Discriminants, Support Vector Machines (SVM)

Changed lines 148-150 from:

[l] CLASS: Decision Trees

to:

[l] Classification (CLASS): Linear Discriminants, Support Vector Machines (SVM)

Changed lines 153-154 from:

[l] CLASS: Bayesian Classifier

to:

[l] CLASS: SVMs & Decision Trees

Changed lines 162-163 from:

[l] Clustering (CLUS): Partitional

to:

[l] CLASS: Bayesian Classifier

Changed line 167 from:

[l] CLUS: Partitional

to:

[l] Clustering (CLUS): Partitional

October 19, 2011, at 05:32 PM by 128.113.126.13 -
Changed line 133 from:
to:
October 19, 2011, at 05:31 PM by 128.113.126.13 -
Changed lines 132-133 from:

[l] FPM: Sequence Mining

to:

[l] FPM: Itemset Summaries & Sequence Mining [l] chap10.pdf, chap11.pdf

October 17, 2011, at 02:08 PM by 128.113.126.13 -
Changed line 129 from:

[l]

to:
October 14, 2011, at 11:13 PM by 128.113.126.13 -
Added line 25:
October 14, 2011, at 10:56 PM by 128.113.126.13 -
Changed lines 120-122 from:

[l]DA: Dimensionality Reduction, Frequent Pattern Mining (FPM): Itemset Mining [l] chap8.pdf, chap10.pdf

to:

[l]DA: Dimensionality Reduction [l] chap8.pdf [l] lecture9.pdf

Changed lines 126-127 from:

[l] FPM: Itemsets and Sequences

to:

[l] Frequent Pattern Mining (FPM): Itemset Mining [l] chap10.pdf [l]

Changed lines 131-132 from:

[l] Graph Mining

to:

[l] FPM: Sequence Mining

Changed lines 136-138 from:

[l] Classification (CLASS): Linear Discriminants, Support Vector Machines (SVM)CLASS: SVMs

to:

[l] FPM: Graph Mining

Changed lines 141-142 from:

[l] CLASS: SVMs

to:

[l] Classification (CLASS): Linear Discriminants, Support Vector Machines (SVM)

Deleted line 146:
October 09, 2011, at 03:07 PM by 128.113.126.13 -
Added lines 116-119:

[l] NO CLASS

[row bgcolor=aliceblue] [l]R: Oct 13

Deleted lines 122-125:

[row bgcolor=aliceblue] [l]R: Oct 13 [l] FPM: Sequence Mining

Changed lines 126-127 from:

[l] FPM:Graph Mining

to:

[l] FPM: Itemsets and Sequences

Changed lines 130-131 from:

[l] Classification (CLASS): Linear Discriminants, Support Vector Machines (SVM)

to:

[l] Graph Mining

Added lines 135-139:

[l] Classification (CLASS): Linear Discriminants, Support Vector Machines (SVM)CLASS: SVMs

[row bgcolor=aliceblue] [l]R: Oct 27

Deleted lines 141-145:

[row bgcolor=aliceblue] [l]R: Oct 27 [l] CLASS: Decision Trees

Added lines 145-150:

[l] CLASS: Decision Trees

[row bgcolor=aliceblue] [l]R: Nov 3

Deleted lines 151-156:

[row bgcolor=aliceblue] [l]R: Nov 3 [l] CLASS: Ensembles & Classifier Assessment

October 06, 2011, at 05:59 PM by 128.113.126.13 -
Changed line 117 from:
to:
October 06, 2011, at 05:58 PM by 128.113.126.13 -
Added line 117:
October 03, 2011, at 03:26 PM by 128.113.126.13 -
Changed lines 104-105 from:

[l]DA: High Dimensional Analysis & Dimensionality Reduction (PCA/SVD)

to:

[l]DA: High Dimensional Analysis [l] chap6.pdf [l] lecture8.pdf

Changed line 116 from:

[l]Frequent Pattern Mining (FPM): Itemset Mining

to:

[l]DA: Dimensionality Reduction, Frequent Pattern Mining (FPM): Itemset Mining

September 29, 2011, at 04:05 PM by 128.113.126.13 -
Added lines 99-100:
September 26, 2011, at 04:44 PM by 128.113.126.13 -
Changed lines 92-93 from:

[l] DA: Graph Models, Kernel Method

to:

[l] DA: Graph Models [l] [l] lecture6.pdf

Changed line 98 from:

[l] DA: High Dimensional Analysis

to:

[l] DA: Kernel Methods

Changed line 102 from:

[l]DA: Dimensionality Reduction (PCA/SVD)

to:

[l]DA: High Dimensional Analysis & Dimensionality Reduction (PCA/SVD)

September 25, 2011, at 11:40 AM by 128.113.126.13 -
Added line 25:
September 23, 2011, at 04:33 PM by 128.113.126.13 -
Changed lines 85-87 from:

[l] DA: Graph Models

to:

[l] DA: Graph Data [l] chap4.pdf [l] lecture5.pdf

Changed line 91 from:

[l] DA: Kernel Method

to:

[l] DA: Graph Models, Kernel Method

September 19, 2011, at 04:53 PM by 128.113.126.13 -
Added line 25:
Changed lines 79-81 from:

[l] DA: Graph Data

to:

[l] DA: Categorical Data [l] chap3.pdf [l] lecture4.pdf

September 15, 2011, at 12:03 PM by 128.113.126.13 -
Changed lines 72-74 from:

[l]DA: Numeric & Categorical Attributes

to:

[l]DA: Numeric Attributes & Eigenvectors [l] [l] lecture3.pdf

September 14, 2011, at 09:18 PM by 128.113.126.13 -
Added line 25:
  • Sep 14: Activate your piazza account
September 13, 2011, at 04:31 PM by 128.113.126.13 -
Changed line 52 from:
to:
Changed line 61 from:
to:
September 13, 2011, at 04:30 PM by 128.113.126.13 -
Added line 25:
  • Sep 12: Book chapters, as well as lectures are posted online after each lecture. Make sure to check the course website.
Changed line 52 from:

[l]

to:
Changed line 61 from:

[l]

to:
September 13, 2011, at 04:01 PM by 128.113.126.13 -
Changed line 11 from:

TA Office Hours: 4-5PM, TW, AE 217\\

to:

TA Office Hours: 4-5PM, TW, AE 304\\

September 11, 2011, at 09:58 AM by 128.113.126.13 -
Changed lines 59-60 from:

[l]DA: Numeric & Categorical Attributes

to:

[l]DA: Numeric Attributes [l] [l] lecture2.pdf

September 05, 2011, at 01:45 PM by 128.113.126.13 -
September 05, 2011, at 01:45 PM by 128.113.126.13 -
Changed line 52 from:
to:
September 05, 2011, at 01:44 PM by 128.113.126.13 -
Changed line 52 from:
to:
September 05, 2011, at 01:34 PM by 128.113.126.13 -
Added lines 51-52:
August 31, 2011, at 10:03 PM by 128.113.126.13 -
Changed lines 10-12 from:

TA: TBA
TA Office Hours: TBA
TA Contact: TBA

to:

TA: Amina Shabbeer
TA Office Hours: 4-5PM, TW, AE 217
TA Contact: shabba@rpi.edu

August 25, 2011, at 10:51 PM by 128.113.126.13 -
Changed lines 46-47 from:

[l]Data Mining Overview & Data Analysis Foundations (DA)

to:

[l]CLASSES CANCELLED

Changed line 50 from:

[l] DA: Algebraic & Probabilistic Views

to:

[l] Data Mining Overview & Data Analysis Foundations (DA): Algebraic & Probabilistic Views

August 16, 2011, at 04:50 PM by 128.113.126.13 -
Changed line 199 from:

You are expected to learn python on your own via web tutorials, etc.

to:

You are expected to learn python on your own via web tutorials, etc. Assignments must be submitted via email to .

August 16, 2011, at 04:22 PM by 128.113.126.13 -
Changed lines 87-88 from:

[l]EDA: Dimensionality Reduction (PCA/SVD)

to:

[l]DA: Dimensionality Reduction (PCA/SVD)

Changed line 97 from:

[l]EDA: Frequent Pattern Mining (FPM): Itemset Mining

to:

[l]Frequent Pattern Mining (FPM): Itemset Mining

August 16, 2011, at 04:15 PM by 128.113.126.13 -
Changed line 62 from:

[l] NO CLASS NSF-RPI Workshop

to:

[l] NO CLASS NSF-RPI Workshop on Complex Data

Changed lines 199-200 from:
to:

You are expected to learn python on your own via web tutorials, etc.

Changed line 226 from:

The school takes cases of academic dishonesty very seriously, resulting in an automatic "F" grade for the course. Students should familiarize themselves with the relevant portion of the Rensselaer Handbook of Student Rights and Responsibilities on this topic.

to:

The school takes cases of academic dishonesty very seriously, resulting in an automatic "F" grade for the course. Students should familiarize themselves with the relevant portion of the Rensselaer Handbook of Student Rights and Responsibilities on this topic.

August 16, 2011, at 04:11 PM by 128.113.126.13 -
Changed lines 208-211 from:

Your grade will be a combination of the following items. Note that the final distribution is subject to some change depending on the number of assignments, but exams will be at least 60%.

  • Assignments (40%): The assignments are meant to be practically oriented. You'll be asked to run some mining methods on some real datasets, or to implement some algorithms, to complement the theory. There will be roughly one assignment per week, to be submitted via the course wiki site. User accounts will be created after first day of class.
to:

Your grade will be a combination of the following items.

  • Assignments (40%): The assignments are meant to be practically oriented. You'll be asked to implement some algorithms and apply them to real datasets, to complement the theory. There will be roughly one assignment every two weeks.
Changed lines 216-218 from:
  • Attendance: Students are strongly encouraged to participate in the class, and should try to attend all classes. Students are responsible entirely responsible for brushing up on any missed material.
  • Laptops: Absolutely no laptops will be allowed in class during lectures. The only exception is during exams, to access the class notes online and to use the calculator. Even during the exam, you may not use any other software (e.g., R, python, etc) for the computations, and you may not "browse" for solutions (you are not likely to find anything!).
to:
  • Attendance: Students are strongly encouraged to participate in the class, and should try to attend all classes. Students are responsible for brushing up on any missed material.
  • Laptops: Absolutely no laptops will be allowed in class during lectures. The only exception is during exams, to access the class notes online and to use the calculator. Even during the exam, you may not use any other software (e.g., R, python, matlab, etc.) for the computations, and you may not "browse" for solutions (you are not likely to find anything!).
August 16, 2011, at 04:08 PM by 128.113.126.13 -
Changed lines 189-190 from:

Data mining is the process of automatic discovery of patterns, models, changes, associations and anomalies in massive databases. This course will provide an introduction to the main topics in data mining and knowledge discovery, including: statistical foundations, pattern mining, classification, and clustering. Emphasis will be laid on the algorithmic foundations.

to:

Data mining is the process of automatic discovery of patterns, models, changes, associations and anomalies in massive databases. This course will provide an introduction to the main topics in data mining and knowledge discovery, including: algebraic and statistical foundations, pattern mining, classification, and clustering. Emphasis will be laid on the algorithmic approach.

Changed line 198 from:

The pre-requisites for this course include data structures and algorithms and discrete mathematics. Linear algebra and probability & statistics are also essentially pre-requisites, though an attempt will be made to review the basic concepts. Assignments will require the use of the R software. Students are expected to learn R on their own. Assignments must be submitted online at the wiki site. Knowledge of pmwiki markup usage will be your responsibility.

to:

The pre-requisites for this course include data structures and algorithms and discrete mathematics. Linear algebra and probability & statistics are also essentially pre-requisites, though an attempt will be made to review the basic concepts. Assignments will require the use of the python language, with NumPy package for numeric computations.

August 16, 2011, at 03:56 PM by 128.113.126.13 -
Changed lines 110-111 from:

[l] Classification (CLASS): Linear Discriminants

to:

[l] Classification (CLASS): Linear Discriminants, Support Vector Machines (SVM)

Deleted lines 114-118:

[l] CLASS: Support Vector Machines (SVM)

[row bgcolor=aliceblue] [l]R: Oct 27

Added lines 117-121:

[row bgcolor=aliceblue] [l]R: Oct 27 [l] CLASS: Decision Trees

Deleted lines 124-129:

[l] CLASS: Decision Trees

[row bgcolor=aliceblue] [l]R: Nov 3

Added lines 127-132:

[row bgcolor=aliceblue] [l]R: Nov 3 [l] CLASS: Ensembles & Classifier Assessment

Changed lines 140-141 from:

[l] CLASS: Ensembles & Classifier Assessment

to:

[l] Clustering (CLUS): Partitional

Changed line 145 from:

[l] Clustering (CLUS): Partitional

to:

[l] CLUS: Partitional

August 16, 2011, at 03:53 PM by 128.113.126.13 -
Changed lines 57-58 from:

[l]DA: Numeric Attributes

to:

[l]DA: Numeric & Categorical Attributes

Changed line 67 from:

[l]DA: Categorical Attributes

to:

[l]DA: Numeric & Categorical Attributes

Deleted lines 78-81:

[l] DA: Graph Models

[row bgcolor=aliceblue] [l]R: Sep 29

Added lines 80-83:

[row bgcolor=aliceblue] [l]R: Sep 29 [l] DA: High Dimensional Analysis

Changed lines 87-88 from:

[l]EDA: High Dimensional Analysis

to:

[l]EDA: Dimensionality Reduction (PCA/SVD)

Changed lines 97-98 from:

[l]EDA: Dimensionality Reduction (PCA/SVD)

to:

[l]EDA: Frequent Pattern Mining (FPM): Itemset Mining

Changed lines 101-102 from:

[l]Frequent Pattern Mining (FPM): Itemset Mining

to:

[l] FPM: Sequence Mining

Changed lines 106-107 from:

[l]FPM: Sequence Mining

to:

[l] FPM:Graph Mining

Changed lines 110-111 from:

[l]FPM:Graph Mining

to:

[l] Classification (CLASS): Linear Discriminants

Changed lines 115-117 from:

[l] Classification (CLASS): Linear Discriminant Analysis (LDA)

to:

[l] CLASS: Support Vector Machines (SVM)

Changed lines 120-121 from:

[l] CLASS: Support Vector Machines

to:

[l] CLASS: SVMs

Changed lines 125-128 from:

[l] CLASS: SVMs

to:

[l] CLASS: Decision Trees

Changed lines 131-132 from:

[l]CLASS: Bayesian Classifier

to:

[l] CLASS: Bayesian Classifier

Changed line 140 from:

[l] CLASS: Decision Trees & Classifier Assessment

to:

[l] CLASS: Ensembles & Classifier Assessment

August 16, 2011, at 03:48 PM by 128.113.126.13 -
Changed lines 46-47 from:

[l]Data Mining Overview

to:

[l]Data Mining Overview & Data Analysis Foundations (DA)

Changed line 50 from:

[l]Exploratory Data Analysis (EDA): Data Matrix

to:

[l] DA: Algebraic & Probabilistic Views

Changed lines 57-58 from:

[l]EDA: Numeric Attributes

to:

[l]DA: Numeric Attributes

Changed line 67 from:

[l]Categorical Attributes

to:

[l]DA: Categorical Attributes

Changed lines 71-74 from:

[l]Categorical Attributes

to:

[l] DA: Graph Data

Changed line 75 from:

[l] EDA: Graph Data Analysis

to:

[l] DA: Graph Models

Changed lines 79-80 from:

[l] EDA: Web Centralities

to:

[l] DA: Graph Models

Changed line 83 from:

[l]EDA: Graph Models

to:

[l] DA: Kernel Method

Changed lines 97-98 from:

[l]EDA: High Dimensional Analysis & Dimensionality Reduction (PCA/SVD)

to:

[l]EDA: Dimensionality Reduction (PCA/SVD)

Changed lines 115-117 from:

[l]Classification (CLASS): Linear Discriminant Analysis (LDA)

to:

[l] Classification (CLASS): Linear Discriminant Analysis (LDA)

Changed lines 140-141 from:

[l]CLASS: Decision Trees & Classifier Assessment

to:

[l] CLASS: Decision Trees & Classifier Assessment

Changed lines 145-146 from:

[l]Clustering (CLUS): Partitional

to:

[l] Clustering (CLUS): Partitional

Changed lines 149-150 from:

[l]CLUS: Hierarchical Clustering

to:

[l] CLUS: Hierarchical Clustering

Changed lines 154-155 from:

[l]CLUS: Density-based Clustering

to:

[l] CLUS: Density-based Clustering

Changed lines 162-163 from:

[l]CLUS: Subspace Clustering

to:

[l] CLUS: Subspace Clustering

Changed line 166 from:

[l]CLUS: Spectral Clustering

to:

[l] CLUS: Spectral & Graph Clustering

Changed line 170 from:

[l]CLUS: Kernel K-means

to:

[l] CLUS: Graph Clustering

August 16, 2011, at 03:36 PM by 128.113.126.13 -
Changed line 62 from:

[l] NO CLASS NSF-RPI Workshop%

to:

[l] NO CLASS NSF-RPI Workshop

August 16, 2011, at 03:34 PM by 128.113.126.13 -
Changed line 62 from:

[l]EDA: Numeric Attributes

to:

[l] NO CLASS NSF-RPI Workshop%

August 16, 2011, at 03:31 PM by 128.113.126.13 -
Changed line 53 from:

[l]M: Sep 6

to:

[l]M: Sep 5

Changed line 56 from:

[l]R: Sep 9

to:

[l]R: Sep 8

Changed line 61 from:

[l]M: Sep 13

to:

[l]M: Sep 12

Changed line 66 from:

[l]R: Sep 16

to:

[l]R: Sep 15

Changed line 70 from:

[l]M: Sep 20

to:

[l]M: Sep 19

Changed line 76 from:

[l]R: Sep 23

to:

[l]R: Sep 22

Changed line 80 from:

[l]M: Sep 27

to:

[l]M: Sep 26

Changed line 84 from:

[l]R: Sep 30

to:

[l]R: Sep 29

Changed line 88 from:

[l]M: Oct 4

to:

[l]M: Oct 3

Changed line 92 from:

[l]R: Oct 7

to:

[l]R: Oct 6

Changed line 98 from:

[l]Tue: Oct 12

to:

[l]Tue: Oct 11

Changed line 102 from:

[l]R: Oct 14

to:

[l]R: Oct 13

Changed line 107 from:

[l]M: Oct 18

to:

[l]M: Oct 17

Changed line 111 from:

[l]R: Oct 21

to:

[l]R: Oct 20

Changed line 116 from:

[l]M: Oct 25

to:

[l]M: Oct 24

Changed line 121 from:

[l]R: Oct 28

to:

[l]R: Oct 27

Changed line 126 from:

[l]M: Nov 1

to:

[l]M: Oct 31

Changed line 132 from:

[l]R: Nov 4

to:

[l]R: Nov 3

Changed line 137 from:

[l]M: Nov 8

to:

[l]M: Nov 7

Changed line 141 from:

[l]R: Nov 11

to:

[l]R: Nov 10

Changed line 146 from:

[l]M: Nov 15

to:

[l]M: Nov 14

Changed line 150 from:

[l]R: Nov 18

to:

[l]R: Nov 17

Changed line 155 from:

[l]M: Nov 22

to:

[l]M: Nov 21

Changed line 159 from:

[l]R: Nov 25

to:

[l]R: Nov 24

Changed line 163 from:

[l]M: Nov 29

to:

[l]M: Nov 28

Changed line 167 from:

[l]R: Dec 2

to:

[l]R: Dec 1

Changed line 171 from:

[l]M: Dec 6

to:

[l]M: Dec 5

Changed line 174 from:

[l]R: Dec 9

to:

[l]R: Dec 8

August 16, 2011, at 03:29 PM by 128.113.126.13 -
Changed line 45 from:

[l]M: Aug 30

to:

[l]M: Aug 29

Changed line 49 from:

[l]R: Sep 2

to:

[l]R: Sep 1

August 16, 2011, at 03:29 PM by 128.113.126.13 -
Changed lines 47-49 from:
to:
Deleted lines 50-51:

[l] [l]Lecture2.pdf

Changed lines 58-59 from:
to:
Deleted lines 62-63:

[l] [l] Lecture4.pdf

Deleted lines 67-68:
Changed lines 72-76 from:

[l] [l] Lecture6.pdf

to:
Deleted lines 77-78:
Changed lines 82-84 from:

[l] [l]Lecture8.pdf

to:
Deleted lines 85-86:

[l] [l]Lecture9.pdf

Changed lines 90-92 from:
to:
Changed lines 100-102 from:
to:
Changed lines 104-106 from:
to:
Changed lines 109-111 from:
to:
Changed lines 113-115 from:
to:
Changed lines 118-121 from:
to:
Changed lines 123-125 from:
to:
Changed lines 128-132 from:

[l] [l] Lecture17.pdf

to:
Changed lines 134-136 from:
to:
Changed lines 143-145 from:
to:
Changed lines 148-150 from:
to:
Changed lines 152-154 from:

[l] chap18.pdf [l]

to:
Changed lines 157-159 from:

[l] chap20.pdf [l]

to:
Changed lines 165-167 from:
to:
Deleted lines 168-169:
Deleted lines 172-173:

[l] chap6.pdf, and chap17, sec 17.3 [l] Lecture23.pdf

August 16, 2011, at 03:27 PM by 128.113.126.13 -
August 16, 2011, at 03:27 PM by 128.113.126.13 -
August 16, 2011, at 03:25 PM by 128.113.126.13 -
Changed lines 25-35 from:
  • Nov 14: Assign5 posted.
  • Oct 29: Assign4 posted.
  • Oct 17: Assign3 posted.
  • Oct 5: updated chap5.pdf is online
  • Sep 26: Assign2 posted.
  • Sep 26: Check the chapter notes often for updates. Usually there is a date printed on top to indicate if there is a new version.
  • Sep 21: Pranay will hold TA hours on 22nd (wed) between 12-1:45pm; he will not hold hours on friday (23rd).
  • Sep 15: Assign1 posted. You may also want to check ou the Pmwiki guidelines and the Quick R Tutorial.
  • Sep 3: Accounts for the Assignment page were mailed out. Contact me if you did not get that.
  • Sep 1: First three chapters now posted online.
  • Aug 2: Course website is up, with the tentative calendar and syllabus.
to:
  • Aug 18: Course website is up, with the tentative calendar and syllabus.
Deleted line 35:

Lecture notes and videos from last year (Fall09) are available here.

August 16, 2011, at 03:22 PM by 128.113.126.13 -
Changed line 1 from:

CSCI-4390/6390: Data Mining, Fall 2010

to:

CSCI-4390/6390: Data Mining, Fall 2011

Changed lines 10-12 from:

TA: Pranay Anchuri
TA Office Hours: 12:00-1:50PM TF
TA Contact: AE106, x2857,

to:

TA: TBA
TA Office Hours: TBA
TA Contact: TBA

December 07, 2010, at 09:45 AM by 128.113.126.13 -
Changed lines 230-231 from:

[l]CLUS: Graph Clustering

to:

[l]CLUS: Kernel K-means [l] chap6.pdf, and chap17, sec 17.3 [l] Lecture23.pdf

December 03, 2010, at 06:01 PM by 128.113.126.13 -
Changed line 189 from:

[l]

to:
December 02, 2010, at 04:56 PM by 128.113.126.13 -
Changed lines 225-226 from:
to:
November 29, 2010, at 07:35 PM by 128.113.126.13 -
Changed line 220 from:

[l]

to:
November 15, 2010, at 02:22 PM by 128.113.126.13 -
Changed line 197 from:

[l]

to:
November 14, 2010, at 10:09 PM by 128.113.126.13 -
Changed lines 202-203 from:
to:

[l] chap18.pdf [l]

Changed lines 209-210 from:
to:

[l] chap20.pdf [l]

Changed lines 219-220 from:
to:

[l] chap21.pdf [l]

November 14, 2010, at 02:36 PM by 128.113.126.13 -
Added line 25:
November 14, 2010, at 02:01 PM by 128.113.126.13 -
Changed lines 195-196 from:
to:

[l] chap17.pdf [l]

November 11, 2010, at 03:46 PM by 128.113.126.13 -
Changed line 177 from:

[l]

to:
Changed lines 187-188 from:

[l]CLASS: Clustering (CLUS): Partitional (KMeans, EM)

to:

[l]CLASS: Decision Trees & Classifier Assessment [l] [l] Lecture19.pdf

November 04, 2010, at 01:52 PM by 128.113.126.13 -
Changed lines 176-178 from:

[l]CLASS: Decision Trees & Naive Bayes

to:

[l]CLASS: Bayesian Classifier [l] [l] Lecture18.pdf

November 01, 2010, at 01:34 PM by 128.113.126.13 -
Changed lines 168-172 from:

[l] CLASS: SVM + Decision Trees

to:

[l] CLASS: SVMs [l] [l] Lecture17.pdf

Changed lines 176-177 from:

[l]CLASS: Probabilistic Method

to:

[l]CLASS: Decision Trees & Naive Bayes

October 29, 2010, at 10:06 PM by 128.113.126.13 -
Added line 25:
October 28, 2010, at 03:44 PM by 128.113.126.13 -
Changed line 161 from:
to:
October 28, 2010, at 03:43 PM by 128.113.126.13 -
Changed lines 160-162 from:

[l] CLASS: SVM

to:

[l] CLASS: Support Vector Machines l] chap30.pdf [l]Lecture16.pdf

October 26, 2010, at 08:03 PM by 128.113.126.13 -
Changed line 154 from:
to:
October 26, 2010, at 07:55 PM by 128.113.126.13 -
Changed lines 153-156 from:

[l]Classification (CLASS): Decision Trees

to:

[l]Classification (CLASS): Linear Discriminant Analysis (LDA) l] chap29.pdf [l]Lecture15.pdf

Changed lines 160-163 from:

[l] CLASS: Probabilistic Methods

to:

[l] CLASS: SVM

Changed lines 167-171 from:

[l] CLASS: Linear Discriminant Analysis (LDA)

to:

[l] CLASS: SVM + Decision Trees

Changed lines 174-176 from:

[l]CLASS: Support Vector Machines (SVM)

to:

[l]CLASS: Probabilistic Method

Changed lines 184-186 from:

[l]CLASS: Kernel SVMs

to:

[l]CLASS: Clustering (CLUS): Partitional (KMeans, EM)

Changed line 190 from:

[l]Clustering (CLUS): Partitional (KMeans, EM)

to:

[l]Clustering (CLUS): Partitional

October 21, 2010, at 07:55 PM by 128.113.126.13 -
Changed line 148 from:
to:
October 18, 2010, at 03:38 PM by 128.113.126.13 -
Changed line 141 from:
to:
Changed line 147 from:
to:
October 18, 2010, at 03:33 PM by 128.113.126.13 -
Changed lines 135-136 from:
to:
Changed lines 141-142 from:
to:
October 17, 2010, at 10:06 PM by 128.113.126.13 -
Added line 25:
October 13, 2010, at 09:58 PM by 128.113.126.13 -
Changed line 115 from:

[l]

to:
Changed lines 127-129 from:
to:
Changed line 133 from:
to:
October 05, 2010, at 02:03 PM by 128.113.126.13 -
Added line 25:
  • Oct 5: updated chap5.pdf is online
October 05, 2010, at 02:02 PM by 128.113.126.13 -
Changed line 96 from:
to:
October 04, 2010, at 08:55 PM by 128.113.126.13 -
Added line 108:

[l]

October 04, 2010, at 08:55 PM by 128.113.126.13 -
Changed line 108 from:
to:
Changed lines 112-113 from:

[l]EDA: High Dimensional Analysis & Dimensionality Reduction (PCA/SVD)

to:

[l]EDA: High Dimensional Analysis [l] [l]Lecture10.pdf

Added lines 124-127:

[l]EDA: High Dimensional Analysis & Dimensionality Reduction (PCA/SVD)

[row bgcolor=aliceblue] [l]R: Oct 14

Changed lines 131-135 from:

[row bgcolor=aliceblue] [l]R: Oct 14 [l]FPM: Sequence Mining

to:
Added lines 135-140:

[l]FPM: Sequence Mining

[row bgcolor=aliceblue] [l]R: Oct 21

Changed lines 144-148 from:

[row bgcolor=aliceblue] [l]R: Oct 21 [l]Classification (CLASS): Decision Trees

to:
Added lines 148-153:

[l]Classification (CLASS): Decision Trees

[row bgcolor=aliceblue] [l]R: Oct 28

Changed lines 157-161 from:

[row bgcolor=aliceblue] [l]R: Oct 28 [l] CLASS: Linear Discriminant Analysis (LDA)

to:
Added lines 161-167:

[l] CLASS: Linear Discriminant Analysis (LDA)

[row bgcolor=aliceblue] [l]R: Nov 4

Deleted lines 170-174:

[row bgcolor=aliceblue] [l]R: Nov 4 [l]CLASS: Kernel SVMs

Changed lines 178-179 from:

[l]Clustering (CLUS): Partitional (KMeans, EM)

to:

[l]CLASS: Kernel SVMs

Added lines 184-188:

[l]Clustering (CLUS): Partitional (KMeans, EM)

[row bgcolor=aliceblue] [l]R: Nov 18

Changed lines 191-194 from:

[row bgcolor=aliceblue] [l]R: Nov 18 [l]CLUS: Density-based Clustering

to:
Changed lines 195-196 from:

[l]CLUS: Subspace Clustering

to:

[l]CLUS: Density-based Clustering

Added lines 204-208:

[l]CLUS: Subspace Clustering

[row bgcolor=aliceblue] [l]R: Dec 2

Deleted lines 210-212:

[row bgcolor=aliceblue] [l]R: Dec 2 [l]CLUS: Graph Clustering

Changed line 214 from:

[l]Cluster Evaluation

to:

[l]CLUS: Graph Clustering

October 02, 2010, at 12:45 AM by 128.113.126.13 -
Changed lines 107-108 from:

[l]EDA: Graph Models & High Dimensional Data

to:

[l]EDA: Graph Models

Changed line 112 from:

[l]EDA: Dimensionality Reduction (PCA/SVD)

to:

[l]EDA: High Dimensional Analysis & Dimensionality Reduction (PCA/SVD)

October 02, 2010, at 12:44 AM by 128.113.126.13 -
Changed line 96 from:
to:
September 27, 2010, at 04:29 PM by 128.113.126.13 -
Changed lines 102-103 from:
to:

[l] [l]Lecture8.pdf

September 27, 2010, at 04:27 PM by 128.113.126.13 -
Changed lines 101-102 from:

[l] EDA: Graph Models

to:

[l] EDA: Web Centralities [l] Lecture7.pdf

Changed line 106 from:

[l]EDA: High Dimensional Data

to:

[l]EDA: Graph Models & High Dimensional Data

September 26, 2010, at 06:28 PM by 128.113.126.13 -
Added line 26:
  • Sep 26: Check the chapter notes often for updates. Usually there is a date printed on top to indicate if there is a new version.
September 26, 2010, at 06:27 PM by 128.113.126.13 -
Changed line 95 from:

[l]

to:
September 26, 2010, at 05:43 PM by 128.113.126.13 -
Added line 25:
September 24, 2010, at 09:06 AM by 128.113.126.13 -
Changed lines 7-8 from:

Instructor Office Hours: 12-1PM, MR

to:

Instructor Office Hours: 12-1PM, MR, Lally 307

Changed lines 94-95 from:
to:

[l] [l] Lecture7.pdf

Changed line 99 from:

[l]: EDA: Graph Models

to:

[l] EDA: Graph Models

September 23, 2010, at 08:15 PM by 128.113.126.13 -
Changed lines 93-94 from:

[l] EDA: Graph Data Analysis Graph Models

to:

[l] EDA: Graph Data Analysis

Added lines 98-101:

[l]: EDA: Graph Models

[row bgcolor=aliceblue] [l]R: Sep 30

Deleted lines 103-106:

[row bgcolor=aliceblue] [l]R: Sep 30 [l]EDA: Dimensionality Reduction (PCA/SVD)

Added lines 107-110:

[l]EDA: Dimensionality Reduction (PCA/SVD)

[row bgcolor=aliceblue] [l]R: Oct 7

Changed lines 113-116 from:

[row bgcolor=aliceblue] [l]R: Oct 7 [l]Frequent Pattern Mining (FPM): Itemset Mining

to:
Added lines 117-121:

[l]Frequent Pattern Mining (FPM): Itemset Mining

[row bgcolor=aliceblue] [l]R: Oct 14

Changed lines 124-127 from:

[row bgcolor=aliceblue] [l]R: Oct 14 [l]FPM:Graph Mining

to:
Added lines 128-132:

[l]FPM:Graph Mining

[row bgcolor=aliceblue] [l]R: Oct 21

Changed lines 135-138 from:

[row bgcolor=aliceblue] [l]R: Oct 21 [l] CLASS: Probabilistic Methods

to:
Added lines 139-143:

[l] CLASS: Probabilistic Methods

[row bgcolor=aliceblue] [l]R: Oct 28

Changed lines 146-149 from:

[row bgcolor=aliceblue] [l]R: Oct 28 [l]CLASS: Support Vector Machines (SVM)

to:
Added lines 150-155:

[l]CLASS: Support Vector Machines (SVM)

[row bgcolor=aliceblue] [l]R: Nov 4

Deleted lines 157-160:

[row bgcolor=aliceblue] [l]R: Nov 4 [l]EXAM II

Changed line 161 from:

[l]CLASS: Graph Classification

to:

[l]EXAM II

September 21, 2010, at 05:44 PM by 128.113.126.13 -
Added line 25:
  • Sep 21: Pranay will hold TA hours on 22nd (wed) between 12-1:45pm; he will not hold hours on friday (23rd).
September 20, 2010, at 09:45 PM by 128.113.126.13 -
Added lines 84-91:

[l]Categorical Attributes [l] [l] Lecture6.pdf

[row bgcolor=aliceblue] [l]R: Sep 23

Deleted lines 93-97:

[row bgcolor=aliceblue] [l]R: Sep 23 [l]EDA: High Dimensional Data

Changed lines 97-98 from:

[l]EDA: Dimensionality Reduction (PCA)

to:

[l]EDA: High Dimensional Data

Changed line 101 from:

[l]EDA: Dimensionality Reduction (SVD)

to:

[l]EDA: Dimensionality Reduction (PCA/SVD)

September 16, 2010, at 09:57 PM by 128.113.126.13 -
Added line 80:
September 15, 2010, at 11:14 PM by 128.113.126.13 -
September 15, 2010, at 11:14 PM by 128.113.126.13 -
Changed line 25 from:
  • Sep 15: Assign1 posted. You may also want to check ou the Pmwiki guidelines and the .
to:
  • Sep 15: Assign1 posted. You may also want to check ou the Pmwiki guidelines and the Quick R Tutorial.
September 15, 2010, at 11:14 PM by 128.113.126.13 -
Added line 25:
  • Sep 15: Assign1 posted. You may also want to check ou the Pmwiki guidelines and the .
September 15, 2010, at 08:39 PM by 128.113.126.13 -
Changed line 78 from:
to:
September 13, 2010, at 07:05 PM by 128.113.126.13 -
Changed lines 70-71 from:

[l]EDA: Categorical Attributes [l] chap3.pdf

to:

[l]EDA: Numeric Attributes [l] [l] Lecture4.pdf

Changed lines 77-78 from:

[l]EDA: Graph Data Analysis Graph Models

to:

[l]Categorical Attributes

 chap3.pdf
Changed line 82 from:

[l] EDA: Graph Models

to:

[l] EDA: Graph Data Analysis Graph Models

September 10, 2010, at 08:13 PM by 128.113.126.13 -
Added line 66:
September 08, 2010, at 11:11 PM by 128.113.126.13 -
Changed lines 51-52 from:
to:
Changed line 57 from:
to:
September 08, 2010, at 11:02 PM by 128.113.126.13 -
Changed line 51 from:
to:
September 08, 2010, at 11:02 PM by 128.113.126.13 -
Changed lines 51-52 from:
to:
Changed line 57 from:
to:
Changed line 65 from:
to:
Changed line 70 from:
to:
September 08, 2010, at 10:55 PM by 128.113.126.13 -
Changed line 50 from:
to:
September 04, 2010, at 12:32 PM by 128.113.126.13 -
Changed line 57 from:
to:
September 04, 2010, at 12:29 PM by 128.113.126.13 -
Changed lines 55-56 from:

[l]Exploratory Data Analysis (EDA): Numeric Attributes [l] chap2.pdf

to:

[l]Exploratory Data Analysis (EDA): Data Matrix [l] Lec2.pdf

Changed lines 64-65 from:

[l]EDA: Categorical Attributes [l] chap3.pdf

to:

[l]EDA: Numeric Attributes [l] chap2.pdf

Changed lines 69-70 from:

[l]EDA: Graph Data Analysis

to:

[l]EDA: Categorical Attributes [l] chap3.pdf

Changed lines 75-76 from:

[l]EDA: Graph Models

to:

[l]EDA: Graph Data Analysis Graph Models

Changed lines 80-82 from:

[l]Frequent Pattern Mining (FPM): Itemset Mining

to:

[l] EDA: Graph Models

Changed lines 85-86 from:

[l]Clustering (CLUS): Partitional (KMeans, EM)

to:

[l]EDA: High Dimensional Data

Changed lines 90-91 from:

[l]Classification (CLASS): Decision Trees

to:

[l]EDA: Dimensionality Reduction (PCA)

Changed lines 94-95 from:

[l]EDA: High Dimensional Data

to:

[l]EDA: Dimensionality Reduction (SVD)

Changed lines 103-104 from:

[l]EDA: Dimensionality Reduction (PCA)

to:

[l]Frequent Pattern Mining (FPM): Itemset Mining

Changed lines 117-118 from:

[l]FPM: Pattern Significance

to:

[l]Classification (CLASS): Decision Trees

Changed line 148 from:

[l]CLASS: Classifier Evaluation

to:

[l]Clustering (CLUS): Partitional (KMeans, EM)

September 03, 2010, at 03:19 PM by 128.113.126.13 -
Added line 25:
  • Sep 3: Accounts for the Assignment page were mailed out. Contact me if you did not get that.
September 01, 2010, at 09:03 PM by 128.113.126.13 -
Added line 25:
  • Sep 1: First three chapters now posted online.
September 01, 2010, at 09:01 PM by 128.113.126.13 -
Changed lines 48-50 from:

[l] [l] ,

to:
Changed line 54 from:
to:
Changed line 62 from:
to:
September 01, 2010, at 08:56 PM by 128.113.126.13 -
Added lines 48-49:

[l] [l] ,

August 29, 2010, at 06:48 PM by 128.113.126.13 -
Added line 36:

Lecture notes and videos from last year (Fall09) are available here.

August 29, 2010, at 06:44 PM by 128.113.126.13 -
Changed lines 63-64 from:

[l]EDA: Eigenvalues Primer; Graph Data Analysis

to:

[l]EDA: Graph Data Analysis

Changed lines 67-68 from:

[l]EDA: High Dimensional Data

to:

[l]EDA: Graph Models

Deleted lines 71-74:

[l]EDA: Dimensionality Reduction (PCA)

[row bgcolor=aliceblue] [l]R: Sep 23

Added lines 74-78:

[row bgcolor=aliceblue] [l]R: Sep 23 [l]Clustering (CLUS): Partitional (KMeans, EM)

Deleted lines 81-84:

[l]Clustering (CLUS): Partitional (KMeans, EM)

[row bgcolor=aliceblue] [l]R: Sep 30

Added lines 84-87:

[row bgcolor=aliceblue] [l]R: Sep 30 [l]EDA: High Dimensional Data

Changed lines 95-96 from:

[l]FPM: Itemset Summaries

to:

[l]EDA: Dimensionality Reduction (PCA)

Changed lines 109-110 from:

[l] CLASS: Linear Discriminant Analysis (LDA)

to:

[l]FPM: Pattern Significance

Changed lines 113-114 from:

CLASS: Probabilistic Methods

to:

[l] CLASS: Probabilistic Methods

Added lines 118-121:

[l] CLASS: Linear Discriminant Analysis (LDA)

[row bgcolor=aliceblue] [l]R: Oct 28

Deleted lines 123-126:

[row bgcolor=aliceblue] [l]R: Oct 28 [l]CLASS: Kernel SVMs, Graph Classification

Changed lines 127-128 from:

[l]CLASS: Classifier Evaluation

to:

[l]CLASS: Kernel SVMs

Changed lines 136-137 from:

[l]CLUS: Hierarchical Clustering

to:

[l]CLASS: Graph Classification

Changed lines 140-141 from:

[l]CLUS: Density-based Clustering

to:

[l]CLASS: Classifier Evaluation

Changed lines 145-146 from:

[l]CLUS: Subspace Clustering

to:

[l]CLUS: Hierarchical Clustering

Changed lines 149-150 from:

[l]CLUS: Spectral Clustering

to:

[l]CLUS: Density-based Clustering

Changed lines 154-155 from:

[l]CLUS: Graph Clustering

to:

[l]CLUS: Subspace Clustering

Changed lines 162-163 from:

[l]Cluster Evaluation

to:

[l]CLUS: Spectral Clustering

Changed line 166 from:

[l]Social Network Analysis (SNA)

to:

[l]CLUS: Graph Clustering

Changed line 170 from:

[l]SNA: Graph Mining

to:

[l]Cluster Evaluation

August 29, 2010, at 06:29 PM by 128.113.126.13 -
Changed lines 58-59 from:

[l]EDA: Numeric & Categorical Attributes

to:

[l]EDA: Categorical Attributes

Changed lines 63-64 from:

[l]Frequent Pattern Mining (FPM): Itemset Mining

to:

[l]EDA: Eigenvalues Primer; Graph Data Analysis

Changed lines 67-68 from:

[l]Clustering (CLUS): Partitional (KMeans, EM)

to:

[l]EDA: High Dimensional Data

Changed lines 72-73 from:

[l]Classification (CLASS): Decision Trees

to:

[l]EDA: Dimensionality Reduction (PCA)

Changed lines 76-77 from:

[l]EDA: High Dimensional Data

to:

[l]Frequent Pattern Mining (FPM): Itemset Mining

Changed lines 81-82 from:

[l]EDA: Dimensionality Reduction: PCA

to:

[l]Clustering (CLUS): Partitional (KMeans, EM)

Changed lines 85-86 from:

[l]EDA: Dimensionality Reduction: PCA/SVD

to:

[l]Classification (CLASS): Decision Trees

Added line 91:
Changed lines 94-95 from:

[l]EDA: Linear Discriminant Analysis: LDA

to:

[l]FPM: Itemset Summaries

Deleted lines 98-101:

[l]FPM: Itemset Summaries

[row bgcolor=aliceblue] [l]R: Oct 14

Added lines 101-104:

[row bgcolor=aliceblue] [l]R: Oct 14 [l]FPM:Graph Mining

Changed lines 108-109 from:

[l]FPM:Sequence Mining, CLASS: Probabilistic

to:

[l] CLASS: Linear Discriminant Analysis (LDA)

Changed lines 112-113 from:

[l]CLASS: Support Vector Machines (SVM)

to:

CLASS: Probabilistic Methods

Changed lines 117-118 from:

[l]CLASS: SVM contd.

to:

[l]CLASS: Support Vector Machines (SVM)

Changed lines 121-122 from:

[l]CLASS: Kernel SVM, Rule-based

to:

[l]CLASS: Kernel SVMs, Graph Classification

Changed lines 134-135 from:

[l]CLUS: Hierarchical/Density-based Clustering

to:

[l]CLUS: Hierarchical Clustering

Changed lines 138-139 from:

[l]CLUS: Density-based Clustering (Kernel Density Estimation)

to:

[l]CLUS: Density-based Clustering

Changed lines 152-153 from:

[l]Kernel Methods: Kernel K-means

to:

[l]CLUS: Graph Clustering

Changed line 160 from:

[l]Kernel Methods: Kernel PCA/LDA

to:

[l]Cluster Evaluation

August 26, 2010, at 01:56 PM by 128.113.126.13 -
Changed lines 191-192 from:
  • knowledgeable about the fundamental data mining tasks like pattern mining, classification and clustering
  • able to understand the key algorithms for the main tasks
to:
  • able to describe the fundamental data mining tasks like pattern mining, classification and clustering
  • able to analyze the key algorithms for the main tasks
August 26, 2010, at 01:28 PM by 128.113.126.13 -
Changed line 21 from:

Announcements#Announcements

to:

Announcements

Changed line 33 from:

Calender#Calendar & Lecture Notes

to:

Calendar & Lecture Notes

August 26, 2010, at 01:27 PM by 128.113.126.13 -
Changed line 21 from:

Announcements

to:

Announcements#Announcements

Changed line 33 from:

Calendar & Lecture Notes

to:

Calender#Calendar & Lecture Notes

August 26, 2010, at 01:26 PM by 128.113.126.13 -
Changed line 181 from:

Syllabus

to:

Syllabus

August 26, 2010, at 01:23 PM by 128.113.126.13 -
August 26, 2010, at 01:20 PM by 128.113.126.13 -
Added line 15:

Changed lines 17-18 from:
to:

(:*toc:)

August 26, 2010, at 01:17 PM by 128.113.126.13 -
Changed line 197 from:

There is no required text for the course. Notes will be handed out in class.

to:

There is no required text for the course. Notes will be posted online on the course webpage.

August 26, 2010, at 01:10 PM by 128.113.126.13 -
Changed lines 208-211 from:
  • Exams (60%): There will be three exams covering the main topics of the course. The tentative exam schedule is posted on the class schedule table. There is no comprehensive final exam.

Attendance: Students are strongly encouraged to participate in the class, and should try to attend all classes.

to:
  • Exams (60%): There will be three exams covering the main topics of the course. The tentative exam schedule is posted on the class schedule table. There is no comprehensive final exam. All exams are open book.
Other Policies
  • Attendance: Students are strongly encouraged to participate in the class, and should try to attend all classes. Students are responsible entirely responsible for brushing up on any missed material.
  • Laptops: Absolutely no laptops will be allowed in class during lectures. The only exception is during exams, to access the class notes online and to use the calculator. Even during the exam, you may not use any other software (e.g., R, python, etc) for the computations, and you may not "browse" for solutions (you are not likely to find anything!).
  • Late Assignments: Most assignments will be due just before midnight on the due date. Students get an automatic one day extension with 20% penalty. No late assignments will be accepted after the midnight following the due date.
Changed line 219 from:

You may consult other members of the class on the homeworks, but you must submit your own work. For instance you may discuss general approaches to solving a problem, but you must implement the solution on your own (similarity detection software may be used). Anytime you borrow material from the web or elsewhere, you must acknowledge the source.

to:

You may consult other members of the class on the assignments, but you must submit your own work. For instance you may discuss general approaches to solving a problem, but you must implement the solution on your own (similarity detection software may be used). Anytime you borrow material from the web or elsewhere, you must acknowledge the source.

August 20, 2010, at 06:58 PM by 128.113.126.13 -
Changed line 12 from:

TA Contact: AE106, x2857, anchup@rpi.edu

to:

TA Contact: AE106, x2857,

August 20, 2010, at 06:57 PM by 128.113.126.13 -
Changed line 12 from:

TA Contact: AE106, x2857, anchup@rpi.edu

to:

TA Contact: AE106, x2857, anchup@rpi.edu

August 20, 2010, at 06:56 PM by 128.113.126.13 -
Changed lines 11-12 from:

TA Office Hours: TBD
TA Contact:

to:

TA Office Hours: 12:00-1:50PM TF
TA Contact: AE106, x2857, anchup@rpi.edu

August 20, 2010, at 04:30 PM by 128.113.126.13 -
Changed line 16 from:
to:
August 20, 2010, at 04:16 PM by 128.113.126.13 -
Changed line 16 from:
to:
August 20, 2010, at 04:13 PM by 128.113.126.13 -
Changed line 16 from:
to:
August 20, 2010, at 03:14 PM by 128.113.126.13 -
Added line 8:
August 20, 2010, at 03:12 PM by 128.113.126.13 -
Changed lines 5-6 from:

Class: 10-11:50AM, MR, Room: Carnegie 113\\

to:

Class Time: MR 10-11:50AM
Room: Carnegie 113\\

Changed lines 8-11 from:

TA & TA Office Hours: Pranay Anchuri, Hours TBD

to:

TA: Pranay Anchuri
TA Office Hours: TBD
TA Contact:

August 20, 2010, at 03:04 PM by 128.113.126.13 -
Changed line 5 from:

Class: 10-11:50AM, MR, Room: TBD\\

to:

Class: 10-11:50AM, MR, Room: Carnegie 113\\

Changed line 7 from:

TA & TA Office Hours: TBD

to:

TA & TA Office Hours: Pranay Anchuri, Hours TBD

August 11, 2010, at 05:11 PM by 128.113.126.13 -
Changed line 26 from:

Calendar & Lecture Notes/Videos

to:

Calendar & Lecture Notes

August 02, 2010, at 07:17 AM by 128.113.126.13 -
Changed line 156 from:

[l]EXAM III

to:

[l]Social Network Analysis (SNA)

Deleted lines 159-162:

[l]Social Network Analysis (SNA)

[row bgcolor=aliceblue] [l]R: Dec 9

Added lines 162-164:

[row bgcolor=aliceblue] [l]R: Dec 9 [l]EXAM III

August 02, 2010, at 07:14 AM by 128.113.126.13 -
Changed lines 190-192 from:

The pre-requisites for this course include data structures and algorithms and discrete mathematics. Basics of linear algebra, and probability & statistics will be very useful as well. Assignments will require the use of the R software. Students are expected to learn R on their own. Assignments must be submitted online at the wiki site. Knowledge of pmwiki markup usage will be your responsibility.

to:

The pre-requisites for this course include data structures and algorithms and discrete mathematics. Linear algebra and probability & statistics are also essentially pre-requisites, though an attempt will be made to review the basic concepts. Assignments will require the use of the R software. Students are expected to learn R on their own. Assignments must be submitted online at the wiki site. Knowledge of pmwiki markup usage will be your responsibility.

Changed line 209 from:

You may consult other members of the class on the homeworks, but you must submit your own work. Anytime you borrow material from the web or elsewhere, you must acknowledge the source.

to:

You may consult other members of the class on the homeworks, but you must submit your own work. For instance you may discuss general approaches to solving a problem, but you must implement the solution on your own (similarity detection software may be used). Anytime you borrow material from the web or elsewhere, you must acknowledge the source.

August 02, 2010, at 07:03 AM by 128.113.126.13 -
Changed line 1 from:

CSCI-4390/6390: Data Mining, Fall 2009

to:

CSCI-4390/6390: Data Mining, Fall 2010

Changed lines 5-6 from:

Class: 10-11:50AM, MR, Low 3045
Instructor Office Hours: 12-1PM, MR

to:

Class: 10-11:50AM, MR, Room: TBD
Instructor Office Hours: 12-1PM, MR
TA & TA Office Hours: TBD

Changed lines 18-35 from:
  • Dec 4: Exam III solutions have been posted.
  • Dec 2: Solutions to Assignment 6 posted on the assignment page
  • Nov 17: Assignment 6 posted.
  • Nov 12: Exam II solutions have been posted.
  • Nov 4: Solutions to Assignment 5 posted on the assignment page.
  • Oct 31: Solutions to Assignment 4 posted on the assignment page.
  • Oct 24: Assignment 5 posted.
  • Oct 13: Exam I solutions have been posted.
  • Oct 10: Assignment 4 has been posted.
  • Oct 4: Solutions for Assignment 3 posted.
  • Sep 27: Solutions for Assignments 1 and 2 have been posted on the respective pages.
  • Sep 26: Assignment 3 is now available.
  • Sep 18: Assignment 2 is now available.
  • Sep 12: I have posted the notes below. They are time-stamped so that if I update them, you can check if your copy is the latest one or not.
  • Sep 8: Assignment 1 has been posted. See the general R/pmwiki instruction at Assignments and see the specific assignment at Assign1
  • Sep 2: Passwords for the assignment submission wiki were sent out yesterday. Contact me if you did not get the email.
  • Aug 30: Slight update of the syllabus.
  • Aug 19: Course website is up, with the tentative calendar and syllabus.
to:
  • Aug 2: Course website is up, with the tentative calendar and syllabus.
Deleted line 35:

[!c]Video

Changed line 38 from:

[l]M: Aug 31

to:

[l]M: Aug 30

Changed lines 40-42 from:

[l] [l]PDF [l]

to:
Changed line 42 from:

[l]R: Sep 3

to:

[l]R: Sep 2

Changed lines 44-46 from:

[l]PDF [l]PDF [l]Video

to:
Changed line 47 from:

[l]M: Sep 7

to:

[l]M: Sep 6

Changed line 50 from:

[l]R: Sep 10

to:

[l]R: Sep 9

Changed lines 52-54 from:

[l]PDF [l]PDF [l]Video

to:
Changed line 55 from:

[l]M: Sep 14

to:

[l]M: Sep 13

Changed lines 57-59 from:

[l]PDF [l]PDF [l]Video

to:
Changed line 59 from:

[l]R: Sep 17

to:

[l]R: Sep 16

Changed lines 61-63 from:

[l]PDF [l]PDF [l]Video

to:
Changed line 64 from:

[l]M: Sep 21

to:

[l]M: Sep 20

Changed lines 66-68 from:

[l]PDF [l]PDF [l]Video

to:
Changed line 68 from:

[l]R: Sep 24

to:

[l]R: Sep 23

Changed lines 70-72 from:

[l]PDF [l]PDF [l]Video

to:
Changed line 73 from:

[l]M: Sep 28

to:

[l]M: Sep 27

Changed lines 75-77 from:

[l]PDF [l]PDF [l]Video

to:
Changed line 77 from:

[l]R: Oct 1

to:

[l]R: Sep 30

Changed lines 79-81 from:

[l]PDF [l]PDF [l]Video

to:
Changed line 82 from:

[l]M: Oct 5

to:

[l]M: Oct 4

Changed line 85 from:

[l]R: Oct 8

to:

[l]R: Oct 7

Changed lines 87-89 from:

[l]PDF [l]PDF [l]Video

to:
Changed line 90 from:

[l]Tue: Oct 13

to:

[l]Tue: Oct 12

Changed lines 92-94 from:

[l]PDF [l]PDF [l]Video

to:
Changed line 94 from:

[l]R: Oct 15

to:

[l]R: Oct 14

Changed lines 96-98 from:

[l]PDF [l]PDF [l]Video

to:
Changed line 99 from:

[l]M: Oct 19

to:

[l]M: Oct 18

Changed lines 101-103 from:

[l]PDF [l]PDF [l]Video

to:
Changed line 103 from:

[l]R: Oct 22

to:

[l]R: Oct 21

Changed lines 105-107 from:

[l]PDF [l]PDF [l]Video

to:
Changed line 108 from:

[l]M: Oct 26

to:

[l]M: Oct 25

Changed lines 110-112 from:

[l]PDF [l]PDF [l]Video

to:
Changed line 112 from:

[l]R: Oct 29

to:

[l]R: Oct 28

Changed lines 114-116 from:

[l]PDF [l]PDF [l]Video

to:
Changed line 117 from:

[l]M: Nov 2

to:

[l]M: Nov 1

Changed lines 119-121 from:

[l] [l]PDF [l]Video

to:
Changed line 121 from:

[l]R: Nov 5

to:

[l]R: Nov 4

Changed line 125 from:

[l]M: Nov 9

to:

[l]M: Nov 8

Changed lines 127-129 from:

[l]PDF [l]PDF [l]Video

to:
Changed line 129 from:

[l]R: Nov 12

to:

[l]R: Nov 11

Changed lines 131-133 from:

[l]PDF [l]PDF [l]Video

to:
Changed line 134 from:

[l]M: Nov 16

to:

[l]M: Nov 15

Changed lines 136-138 from:

[l]PDF [l]PDF [l]Video

to:
Changed line 138 from:

[l]R: Nov 19

to:

[l]R: Nov 18

Changed lines 140-142 from:

[l] PDF [l]PDF [l]Video

to:
Changed line 143 from:

[l]M: Nov 23

to:

[l]M: Nov 22

Changed lines 145-147 from:

[l]PDF [l]PDF [l]Video

to:
Changed line 147 from:

[l]R: Nov 26

to:

[l]R: Nov 25

Changed line 151 from:

[l]M: Nov 30

to:

[l]M: Nov 29

Changed lines 153-155 from:

[l]PDF [l]PDF [l]Video

to:
Changed line 155 from:

[l]R: Dec 3

to:

[l]R: Dec 2

Changed line 159 from:

[l]M: Dec 7

to:

[l]M: Dec 6

Changed lines 161-163 from:

[l]PDF [l]PDF [l]Video

to:
Changed line 163 from:

[l]R: Dec 10

to:

[l]R: Dec 9

Changed lines 165-167 from:

[l]PDF [l]PDF [l]Video

to:
Changed line 224 from:

[l]

to:

[l]PDF

Changed line 230 from:

[l]

to:

[l]PDF

Changed lines 231-232 from:

[l] [l]

to:

[l]PDF [l]Video

Changed lines 225-226 from:

[l] [l]

to:

[l]PDF [l]Video

Added line 17:
Added line 17:
  • Dec 2: Solutions to Assignment 6 posted on the assignment page
Changed line 195 from:

[l]

to:

[l] PDF

Changed line 202 from:

[l]

to:

[l]PDF

Changed lines 212-214 from:

[l] [l] [l]

to:

[l]PDF [l]PDF [l]Video

Changed line 176 from:

[l]

to:

[l]PDF

Changed line 201 from:

[l]CLUS: Kernel K-means

to:

[l]Kernel Methods: Kernel K-means

Changed lines 203-204 from:

[l] [l]

to:

[l]PDF [l]Video

Changed line 211 from:

[l]CLASS: Kernel PCA/LDA

to:

[l]Kernel Methods: Kernel PCA/LDA

Changed line 201 from:

[l]CLUS: Cluster Validity

to:

[l]CLUS: Kernel K-means

Changed lines 196-197 from:

[l] [l]

to:

[l]PDF [l]Video

Changed line 175 from:

[l]CLUS: Hierarchical/Density-based

to:

[l]CLUS: Hierarchical/Density-based Clustering

Changed lines 181-182 from:

[l]CLUS: Density-based (Kernel Density Estimation) [l]

to:

[l]CLUS: Density-based Clustering (Kernel Density Estimation) [l]PDF

Changed line 159 from:

[l]

to:

[l]PDF

Changed line 189 from:

[l]

to:

[l]PDF

Added line 17:
Changed line 187 from:

[l]CLUS: Subspace

to:

[l]CLUS: Subspace Clustering

Changed lines 189-190 from:

[l] [l]

to:

[l]PDF [l]Video

Changed line 17 from:
  • Nov 12: [((Attach:)exam2-sol.pdf | Exam II solutions]] have been posted.
to:
Added line 17:
  • Nov 12: [((Attach:)exam2-sol.pdf | Exam II solutions]] have been posted.
Changed line 173 from:

[l]CLUS: Hierarchical

to:

[l]CLUS: Hierarchical/Density-based

Changed line 179 from:

[l]CLUS: Density-based

to:

[l]CLUS: Density-based (Kernel Density Estimation)

Changed lines 181-182 from:

[l] [l]

to:

[l]PDF [l]Video

Changed lines 175-176 from:

[l] [l]

to:

[l]PDF [l]Video

Added line 17:
  • Nov 4: Solutions to Assignment 5 posted on the assignment page.
Changed line 172 from:

[l]CLUS: Density-based

to:

[l]CLUS: Hierarchical

Changed line 178 from:

[l]CLUS: Subspace

to:

[l]CLUS: Density-based

Changed line 185 from:

[l]CLUS: Subspace contd.

to:

[l]CLUS: Subspace

Changed line 191 from:

[l]CLASS: Kernel Methods (Kernel SVM)

to:

[l]CLUS: Spectral Clustering

Changed line 198 from:

[l]CLASS: Kernel PCA/LDA

to:

[l]CLUS: Cluster Validity

Changed line 208 from:

[l]CLUS: Spectral Clustering

to:

[l]CLASS: Kernel PCA/LDA

Changed line 162 from:

[l]CLUS: Hierarchical

to:

[l]CLASS: Classifier Evaluation

Changed lines 164-165 from:

[l] [l]

to:

[l]PDF [l]Video

Added line 17:
  • Oct 31: Solutions to Assignment 4 posted on the assignment page.
Changed line 154 from:

[l]CLASS: Instance-based/Rule-based

to:

[l]CLASS: Kernel SVM, Rule-based

Changed lines 156-157 from:

[l] [l]

to:

[l]PDF [l]Video

Changed lines 149-151 from:

[l] [l] [l]

to:

[l]PDF [l]PDF [l]Video

Changed line 123 from:

[l]

to:

[l]PDF

Changed line 129 from:

[l]

to:

[l]PDF

Changed line 136 from:

[l]

to:

[l]PDF

Changed line 142 from:

[l]

to:

[l]PDF

Changed line 17 from:
to:
Added line 17:
Changed lines 142-143 from:

[l] [l]

to:

[l]PDF [l]Video

Changed lines 136-137 from:

[l] [l]

to:

[l]PDF [l]Video

Changed line 140 from:

[l]CLASS: Instance-based/Rule-based

to:

[l]CLASS: Support Vector Machines (SVM)

Changed line 147 from:

[l]CLASS: Support Vector Machines (SVM)

to:

[l]CLASS: SVM contd.

Changed line 153 from:

[l]CLASS: SVM contd.

to:

[l]CLASS: Instance-based/Rule-based

Changed line 134 from:

[l]CLASS: Instance-based/Rule-based

to:

[l]FPM:Sequence Mining, CLASS: Probabilistic

Changed line 140 from:

[l]CLASS: Probabilistic

to:

[l]CLASS: Instance-based/Rule-based

Changed line 124 from:

[l] Video

to:

[l]Video

Changed lines 129-130 from:

[l] [l]

to:

[l]PDF [l]Video

Changed lines 123-124 from:

[l] [l]

to:

[l]PDF [l] Video

Deleted lines 166-168:

[l] [l] [l]

Added lines 161-163:

[l] [l] [l]

Changed line 160 from:

[l]EXAM II

to:

[l]CLUS: Hierarchical

Changed line 163 from:

[l]CLUS: Hierarchical

to:

[l]EXAM II

Added line 17:
Changed line 104 from:

[l]

to:

[l]PDF

Changed line 114 from:

[l]

to:

[l]PDF

Added line 17:
Changed lines 114-115 from:

[l] [l]

to:

[l]PDF [l]Video

Added line 17:
  • Oct 4: Solutions for Assignment 3 posted.
Changed line 111 from:

[l]

to:

[l]EDA: Linear Discriminant Analysis: LDA

Changed line 95 from:

[l]EDA: Dimensionality Reduction (PCA/SVD)

to:

[l]EDA: Dimensionality Reduction: PCA

Changed line 101 from:

[l]EDA: Linear Discriminant Analysis (LDA)

to:

[l]EDA: Dimensionality Reduction: PCA/SVD

Changed lines 103-104 from:

[l] [l]

to:

[l]PDF [l]Video

Deleted line 110:

[l]FPM: Itemset Summaries

Added line 114:

[l]

Changed line 118 from:

[l]FPM: Sequence Mining

to:

[l]FPM: Itemset Summaries

Changed line 124 from:

[l]CLASS: Instance-based/Rule-based

to:

[l]FPM: Sequence Mining

Changed line 131 from:

[l]CLASS: Probabilistic

to:

[l]CLASS: Instance-based/Rule-based

Changed line 137 from:

[l]CLASS: Support Vector Machines (SVM)

to:

[l]CLASS: Probabilistic

Changed line 144 from:

[l]CLASS: SVM contd.

to:

[l]CLASS: Support Vector Machines (SVM)

Changed line 150 from:

[l]CLAS: Ensemble Methods

to:

[l]CLASS: SVM contd.

Changed lines 96-98 from:

[l] [l] [l]

to:

[l]PDF [l]PDF [l]Video

Added line 17:
  • Sep 27: Solutions for Assignments 1 and 2 have been posted on the respective pages.
Added line 17:
Changed line 87 from:

[l]

to:

[l]PDF

Changed line 46 from:

[l]PDF

to:

[l]PDF

Changed lines 88-89 from:

[l] [l]

to:

[l]PDF [l]Video

Changed line 46 from:

[l]PDF

to:

[l]PDF

Changed line 81 from:

[l]

to:

[l]PDF

Changed lines 82-83 from:

[l] [l]

to:

[l]PDF [l]Video

Changed line 51 from:

[l]PDF

to:

[l]PDF

Changed line 61 from:

[l]PDF

to:

[l]PDF

Changed line 68 from:

[l]

to:

[l]PDF

Changed line 74 from:

[l]

to:

[l]PDF

Added line 17:
Changed line 61 from:

PDF

to:

[l]PDF

Changed line 68 from:

PDF

to:

[l]PDF

Changed line 72 from:

[l]Clustering (CLUS): Partitional

to:

[l]Clustering (CLUS): Partitional (KMeans, EM)

Changed line 74 from:

PDF

to:

[l]PDF

Changed line 59 from:

[l]EDA: Numeric & Categorical Attributes

to:

[l]EDA: Numeric & Categorical Attributes

Changed lines 37-38 from:

[!c]Notes

to:

[!c]Chapters [!c]Lecture Notes

Added line 44:

[l]

Added line 51:

[l]PDF

Added line 61:

PDF

Added line 68:

PDF

Changed lines 74-75 from:

[l]

to:

PDF [l]Video

Added line 82:

[l]

Added line 88:

[l]

Added line 95:

[l]

Added line 101:

[l]

Added line 111:

[l]

Added line 118:

[l]

Added line 124:

[l]

Added line 131:

[l]

Added line 137:

[l]

Added line 144:

[l]

Added line 150:

[l]

Added line 160:

[l]

Added line 167:

[l]

Added line 173:

[l]

Added line 180:

[l]

Added line 186:

[l]

Added line 193:

[l]

Added line 203:

[l]

Added line 213:

[l]

Added line 217:

[l]

Changed line 64 from:

[l]

to:

[l]Video

Changed line 146 from:

[l]CLUS: Density-based

to:

[l]CLUS: Density-based

Changed line 41 from:

[l]M: Aug 31

to:

[l]M: Aug 31

Changed line 103 from:

[l]T: Oct 13 (Monday Schedule)

to:

[l]Tue: Oct 13

Changed lines 33-37 from:

(:table border=1 width=100%:) (:cellnr bgcolor=lavender:) Day: Date (:cell bgcolor=lavender:) Topic (:cell bgcolor=lavender:)Notes (:cell bgcolor=lavender:)Video

to:

[table border=1 width=100%] [row bgcolor=lavender] [!c]Day: Date [!c]Topic [!c]Notes [!c]Video

Changed lines 40-47 from:

(:cellnr:) M: Aug 31 (:cell:) Data Mining Overview (:cell:) PDF (:cell:) (:cellnr bgcolor=aliceblue:) R: Sep 3 (:cell:) Exploratory Data Analysis (EDA): Numeric Attributes (:cell:) PDF (:cell:) Video

to:

[row] [l]M: Aug 31 [l]Data Mining Overview [l]PDF [l] [row bgcolor=aliceblue] [l]R: Sep 3 [l]Exploratory Data Analysis (EDA): Numeric Attributes [l]PDF [l]Video

Changed lines 51-56 from:

(:cellnr:) M: Sep 7 (:cell:) Labor Day Holiday (:cellnr bgcolor=aliceblue:) R: Sep 10 (:cell:) EDA: Numeric & Categorical Attributes (:cell:) PDF (:cell:) Video

to:

[row] [l]M: Sep 7 [l]Labor Day Holiday [row bgcolor=aliceblue] [l]R: Sep 10 [l]EDA: Numeric & Categorical Attributes [l]PDF [l]Video

Changed lines 60-67 from:

(:cellnr:) M: Sep 14 (:cell:) Frequent Pattern Mining (FPM): Itemset Mining (:cell:) (:cell:) (:cellnr bgcolor=aliceblue:) R: Sep 17 (:cell:) Clustering (CLUS): Partitional (:cell:) (:cell:)

to:

[row] [l]M: Sep 14 [l]Frequent Pattern Mining (FPM): Itemset Mining [l] [l] [row bgcolor=aliceblue] [l]R: Sep 17 [l]Clustering (CLUS): Partitional [l] [l]

Changed lines 71-78 from:

(:cellnr:) M: Sep 21 (:cell:) Classification (CLASS): Decision Trees (:cell:) (:cell:) (:cellnr bgcolor=aliceblue:) R: Sep 24 (:cell:) EDA: High Dimensional Data (:cell:) (:cell:)

to:

[row] [l]M: Sep 21 [l]Classification (CLASS): Decision Trees [l] [l] [row bgcolor=aliceblue] [l]R: Sep 24 [l]EDA: High Dimensional Data [l] [l]

Changed lines 82-89 from:

(:cellnr:) M: Sep 28 (:cell:) EDA: Dimensionality Reduction (PCA/SVD) (:cell:) (:cell:) (:cellnr bgcolor=aliceblue:) R: Oct 1 (:cell:) EDA: Linear Discriminant Analysis (LDA) (:cell:) (:cell:)

to:

[row] [l]M: Sep 28 [l]EDA: Dimensionality Reduction (PCA/SVD) [l] [l] [row bgcolor=aliceblue] [l]R: Oct 1 [l]EDA: Linear Discriminant Analysis (LDA) [l] [l]

Changed lines 93-98 from:

(:cellnr:) M: Oct 5 (:cell:) EXAM I (:cellnr bgcolor=aliceblue:) R: Oct 8 (:cell:) FPM: Itemset Summaries (:cell:) (:cell:)

to:

[row] [l]M: Oct 5 [l]EXAM I [row bgcolor=aliceblue] [l]R: Oct 8 [l]FPM: Itemset Summaries [l] [l]

Changed lines 102-109 from:

(:cellnr:) T: Oct 13 (Monday Schedule) (:cell:) FPM: Sequence Mining (:cell:) (:cell:) (:cellnr bgcolor=aliceblue:) R: Oct 15 (:cell:) CLASS: Instance-based/Rule-based (:cell:) (:cell:)

to:

[row] [l]T: Oct 13 (Monday Schedule) [l]FPM: Sequence Mining [l] [l] [row bgcolor=aliceblue] [l]R: Oct 15 [l]CLASS: Instance-based/Rule-based [l] [l]

Changed lines 113-120 from:

(:cellnr:) M: Oct 19 (:cell:) CLASS: Probabilistic (:cell:) (:cell:) (:cellnr bgcolor=aliceblue:) R: Oct 22 (:cell:) CLASS: Support Vector Machines (SVM) (:cell:) (:cell:)

to:

[row] [l]M: Oct 19 [l]CLASS: Probabilistic [l] [l] [row bgcolor=aliceblue] [l]R: Oct 22 [l]CLASS: Support Vector Machines (SVM) [l] [l]

Changed lines 124-131 from:

(:cellnr:) M: Oct 26 (:cell:) CLASS: SVM contd. (:cell:) (:cell:) (:cellnr bgcolor=aliceblue:) R: Oct 29 (:cell:) CLAS: Ensemble Methods (:cell:) (:cell:)

to:

[row] [l]M: Oct 26 [l]CLASS: SVM contd. [l] [l] [row bgcolor=aliceblue] [l]R: Oct 29 [l]CLAS: Ensemble Methods [l] [l]

Changed lines 135-140 from:

(:cellnr:) M: Nov 2 (:cell:) EXAM II (:cellnr bgcolor=aliceblue:) R: Nov 5 (:cell:) CLUS: Hierarchical (:cell:) (:cell:)

to:

[row] [l]M: Nov 2 [l]EXAM II [row bgcolor=aliceblue] [l]R: Nov 5 [l]CLUS: Hierarchical [l] [l]

Changed lines 144-151 from:

(:cellnr:) M: Nov 9 (:cell:) CLUS: Density-based (:cell:) (:cell:) (:cellnr bgcolor=aliceblue:) R: Nov 12 (:cell:) CLUS: Subspace (:cell:) (:cell:)

to:

[row] [l]M: Nov 9 [l]CLUS: Density-based [l] [l] [row bgcolor=aliceblue] [l]R: Nov 12 [l]CLUS: Subspace [l] [l]

Changed lines 155-162 from:

(:cellnr:) M: Nov 16 (:cell:) CLUS: Subspace contd. (:cell:) (:cell:) (:cellnr bgcolor=aliceblue:) R: Nov 19 (:cell:) CLASS: Kernel Methods (Kernel SVM) (:cell:) (:cell:)

to:

[row] [l]M: Nov 16 [l]CLUS: Subspace contd. [l] [l] [row bgcolor=aliceblue] [l]R: Nov 19 [l]CLASS: Kernel Methods (Kernel SVM) [l] [l]

Changed lines 166-171 from:

(:cellnr:) M: Nov 23 (:cell:) CLASS: Kernel PCA/LDA (:cell:) (:cell:) (:cellnr bgcolor=aliceblue:) R: Nov 26 (:cell:) Thanksgiving Break

to:

[row] [l]M: Nov 23 [l]CLASS: Kernel PCA/LDA [l] [l] [row bgcolor=aliceblue] [l]R: Nov 26 [l]Thanksgiving Break

Changed lines 175-180 from:

(:cellnr:) M: Nov 30 (:cell:) CLUS: Spectral Clustering (:cell:) (:cell:) (:cellnr bgcolor=aliceblue:) R: Dec 3 (:cell:) EXAM III

to:

[row] [l]M: Nov 30 [l]CLUS: Spectral Clustering [l] [l] [row bgcolor=aliceblue] [l]R: Dec 3 [l]EXAM III

Changed lines 184-191 from:

(:cellnr:) M: Dec 7 (:cell:) Social Network Analysis (SNA) (:cell:) (:cell:) (:cellnr bgcolor=aliceblue:) R: Dec 10 (:cell:) SNA: Graph Mining (:cell:) (:cell:)

to:

[row] [l]M: Dec 7 [l]Social Network Analysis (SNA) [l] [l] [row bgcolor=aliceblue] [l]R: Dec 10 [l]SNA: Graph Mining [l] [l]

Changed line 195 from:

(:tableend:)

to:

[tableend]

Changed lines 50-51 from:

(:cellnr bgcolor=aliceblue:) R: Sep 10 – EDA: Numeric & Categorical Attributes

to:

(:cellnr bgcolor=aliceblue:) R: Sep 10 (:cell:) EDA: Numeric & Categorical Attributes

Changed lines 34-35 from:

(:cellnr bgcolor=lavender:) Topic

to:

(:cellnr bgcolor=lavender:) Day: Date (:cell bgcolor=lavender:) Topic

Changed lines 39-40 from:

(:cellnr:) M: Aug 31 – Data Mining Overview

to:

(:cellnr:) M: Aug 31 (:cell:) Data Mining Overview

Changed lines 43-44 from:

(:cellnr bgcolor=aliceblue:) R: Sep 3 – Exploratory Data Analysis (EDA): Numeric Attributes

to:

(:cellnr bgcolor=aliceblue:) R: Sep 3 (:cell:) Exploratory Data Analysis (EDA): Numeric Attributes

Changed lines 48-49 from:

(:cellnr:) M: Sep 7 – Labor Day Holiday

to:

(:cellnr:) M: Sep 7 (:cell:) Labor Day Holiday

Changed lines 54-55 from:

(:cellnr:) M: Sep 14 – Frequent Pattern Mining (FPM): Itemset Mining

to:

(:cellnr:) M: Sep 14 (:cell:) Frequent Pattern Mining (FPM): Itemset Mining

Changed lines 58-59 from:

(:cellnr bgcolor=aliceblue:) R: Sep 17 – Clustering (CLUS): Partitional

to:

(:cellnr bgcolor=aliceblue:) R: Sep 17 (:cell:) Clustering (CLUS): Partitional

Changed lines 63-64 from:

(:cellnr:) M: Sep 21 – Classification (CLASS): Decision Trees

to:

(:cellnr:) M: Sep 21 (:cell:) Classification (CLASS): Decision Trees

Changed lines 67-68 from:

(:cellnr bgcolor=aliceblue:) R: Sep 24 – EDA: High Dimensional Data

to:

(:cellnr bgcolor=aliceblue:) R: Sep 24 (:cell:) EDA: High Dimensional Data

Changed lines 72-73 from:

(:cellnr:) M: Sep 28 – EDA: Dimensionality Reduction (PCA/SVD)

to:

(:cellnr:) M: Sep 28 (:cell:) EDA: Dimensionality Reduction (PCA/SVD)

Changed lines 76-77 from:

(:cellnr bgcolor=aliceblue:) R: Oct 1 – EDA: Linear Discriminant Analysis (LDA)

to:

(:cellnr bgcolor=aliceblue:) R: Oct 1 (:cell:) EDA: Linear Discriminant Analysis (LDA)

Changed lines 81-82 from:

(:cellnr:) M: Oct 5– EXAM I (:cellnr bgcolor=aliceblue:) R: Oct 8 – FPM: Itemset Summaries

to:

(:cellnr:) M: Oct 5 (:cell:) EXAM I (:cellnr bgcolor=aliceblue:) R: Oct 8 (:cell:) FPM: Itemset Summaries

Changed lines 88-89 from:

(:cellnr:) T: Oct 13 – (Monday Schedule) FPM: Sequence Mining

to:

(:cellnr:) T: Oct 13 (Monday Schedule) (:cell:) FPM: Sequence Mining

Changed lines 92-93 from:

(:cellnr bgcolor=aliceblue:) R: Oct 15 – CLASS: Instance-based/Rule-based

to:

(:cellnr bgcolor=aliceblue:) R: Oct 15 (:cell:) CLASS: Instance-based/Rule-based

Changed lines 97-98 from:

(:cellnr:) M: Oct 19 – CLASS: Probabilistic

to:

(:cellnr:) M: Oct 19 (:cell:) CLASS: Probabilistic

Changed lines 101-102 from:

(:cellnr bgcolor=aliceblue:) R: Oct 22 – CLASS: Support Vector Machines (SVM)

to:

(:cellnr bgcolor=aliceblue:) R: Oct 22 (:cell:) CLASS: Support Vector Machines (SVM)

Changed lines 106-107 from:

(:cellnr:) M: Oct 26 – CLASS: SVM contd.

to:

(:cellnr:) M: Oct 26 (:cell:) CLASS: SVM contd.

Changed lines 110-111 from:

(:cellnr bgcolor=aliceblue:) R: Oct 29 – CLAS: Ensemble Methods

to:

(:cellnr bgcolor=aliceblue:) R: Oct 29 (:cell:) CLAS: Ensemble Methods

Changed lines 115-116 from:

(:cellnr:) M: Nov 2 – EXAM II (:cellnr bgcolor=aliceblue:) R: Nov 5 – CLUS: Hierarchical

to:

(:cellnr:) M: Nov 2 (:cell:) EXAM II (:cellnr bgcolor=aliceblue:) R: Nov 5 (:cell:) CLUS: Hierarchical

Changed lines 122-123 from:

(:cellnr:) M: Nov 9 – CLUS: Density-based

to:

(:cellnr:) M: Nov 9 (:cell:) CLUS: Density-based

Changed lines 126-127 from:

(:cellnr bgcolor=aliceblue:) R: Nov 12 – CLUS: Subspace

to:

(:cellnr bgcolor=aliceblue:) R: Nov 12 (:cell:) CLUS: Subspace

Changed lines 131-132 from:

(:cellnr:) M: Nov 16 – CLUS: Subspace contd.

to:

(:cellnr:) M: Nov 16 (:cell:) CLUS: Subspace contd.

Changed lines 135-136 from:

(:cellnr bgcolor=aliceblue:) R: Nov 19 – CLASS: Kernel Methods (Kernel SVM)

to:

(:cellnr bgcolor=aliceblue:) R: Nov 19 (:cell:) CLASS: Kernel Methods (Kernel SVM)

Changed lines 140-141 from:

(:cellnr:) M: Nov 23 – CLASS: Kernel PCA/LDA

to:

(:cellnr:) M: Nov 23 (:cell:) CLASS: Kernel PCA/LDA

Changed lines 144-145 from:

(:cellnr bgcolor=aliceblue:) R: Nov 26 – Thanksgiving Break

to:

(:cellnr bgcolor=aliceblue:) R: Nov 26 (:cell:) Thanksgiving Break

Changed lines 147-148 from:

(:cellnr:) M: Nov 30 - CLUS: Spectral Clustering

to:

(:cellnr:) M: Nov 30 (:cell:) CLUS: Spectral Clustering

Changed lines 151-152 from:

(:cellnr bgcolor=aliceblue:) R: Dec 3 – EXAM III

to:

(:cellnr bgcolor=aliceblue:) R: Dec 3 (:cell:) EXAM III

Changed lines 154-155 from:

(:cellnr:) M: Dec 7 – Social Network Analysis (SNA)

to:

(:cellnr:) M: Dec 7 (:cell:) Social Network Analysis (SNA)

Changed lines 158-159 from:

(:cellnr bgcolor=aliceblue:) R: Dec 10 - SNA: Graph Mining

to:

(:cellnr bgcolor=aliceblue:) R: Dec 10 (:cell:) SNA: Graph Mining

Changed line 33 from:

(:table border=1 width=70% align=left:)

to:

(:table border=1 width=100%:)

Added lines 138-139:

\\

Changed line 33 from:

(:table border=1 width=80% align=center:)

to:

(:table border=1 width=70% align=left:)

Changed line 33 from:

(:table align=center border=1 width=80%:)

to:

(:table border=1 width=80% align=center:)

Changed line 33 from:

(:table align=center border=1 width=100%:)

to:

(:table align=center border=1 width=80%:)

Changed lines 35-36 from:

(:cell:)Notes (:cell:)Video

to:

(:cell bgcolor=lavender:)Notes (:cell bgcolor=lavender:)Video

Changed lines 33-43 from:

[table align=center border=1 width=100%]

to:

(:table align=center border=1 width=100%:) (:cellnr bgcolor=lavender:) Topic (:cell:)Notes (:cell:)Video


(:cellnr:) M: Aug 31 – Data Mining Overview (:cell:) PDF (:cell:) (:cellnr bgcolor=aliceblue:) R: Sep 3 – Exploratory Data Analysis (EDA): Numeric Attributes (:cell:) PDF (:cell:) Video

Changed lines 45-47 from:

[row bgcolor=lavender] [!c] Mondays [!c] Thursdays

to:

(:cellnr:) M: Sep 7 – Labor Day Holiday (:cellnr bgcolor=aliceblue:) R: Sep 10 – EDA: Numeric & Categorical Attributes (:cell:) PDF (:cell:) Video

Changed lines 50-52 from:

[row] [l]Aug 31 – Data Mining Overview: (PDF) [l]Sep 3 – Exploratory Data Analysis (EDA): Numeric Attributes(Notes(PDF))(Video)

to:

(:cellnr:) M: Sep 14 – Frequent Pattern Mining (FPM): Itemset Mining (:cell:) (:cell:) (:cellnr bgcolor=aliceblue:) R: Sep 17 – Clustering (CLUS): Partitional (:cell:) (:cell:)

Changed lines 57-59 from:

[row] [l]Sep 7 – Labor Day Holiday [l]Sep 10 – EDA: Numeric & Categorical Attributes (Notes (PDF))(Video)

to:

(:cellnr:) M: Sep 21 – Classification (CLASS): Decision Trees (:cell:) (:cell:) (:cellnr bgcolor=aliceblue:) R: Sep 24 – EDA: High Dimensional Data (:cell:) (:cell:)

Changed lines 64-66 from:

[row] [l]Sep 14 – Frequent Pattern Mining (FPM): Itemset Mining [l]Sep 17 – Clustering (CLUS): Partitional

to:

(:cellnr:) M: Sep 28 – EDA: Dimensionality Reduction (PCA/SVD) (:cell:) (:cell:) (:cellnr bgcolor=aliceblue:) R: Oct 1 – EDA: Linear Discriminant Analysis (LDA) (:cell:) (:cell:)

Changed lines 71-73 from:

[row] [l]Sep 21 – Classification (CLASS): Decision Trees [l]Sep 24 – EDA: High Dimensional Data

to:

(:cellnr:) M: Oct 5– EXAM I (:cellnr bgcolor=aliceblue:) R: Oct 8 – FPM: Itemset Summaries (:cell:) (:cell:)

Changed lines 76-78 from:

[row] [l]Sep 28 – EDA: Dimensionality Reduction (PCA/SVD) [l]Oct 1 – EDA: Linear Discriminant Analysis (LDA)

to:

(:cellnr:) T: Oct 13 – (Monday Schedule) FPM: Sequence Mining (:cell:) (:cell:) (:cellnr bgcolor=aliceblue:) R: Oct 15 – CLASS: Instance-based/Rule-based (:cell:) (:cell:)

Changed lines 83-85 from:

[row] [l]Oct 5– EXAM I [l]Oct 8 – FPM: Itemset Summaries

to:

(:cellnr:) M: Oct 19 – CLASS: Probabilistic (:cell:) (:cell:) (:cellnr bgcolor=aliceblue:) R: Oct 22 – CLASS: Support Vector Machines (SVM) (:cell:) (:cell:)

Changed lines 90-92 from:

[row] [l]Oct 13 – (Monday Schedule) FPM: Sequence Mining [l]Oct 15 – CLASS: Instance-based/Rule-based

to:

(:cellnr:) M: Oct 26 – CLASS: SVM contd. (:cell:) (:cell:) (:cellnr bgcolor=aliceblue:) R: Oct 29 – CLAS: Ensemble Methods (:cell:) (:cell:)

Changed lines 97-99 from:

[row] [l]Oct 19 – CLASS: Probabilistic [l]Oct 22 – CLASS: Support Vector Machines (SVM)

to:

(:cellnr:) M: Nov 2 – EXAM II (:cellnr bgcolor=aliceblue:) R: Nov 5 – CLUS: Hierarchical (:cell:) (:cell:)

Changed lines 102-104 from:

[row] [l]Oct 26 – CLASS: SVM contd. [l]Oct 29 – CLAS: Ensemble Methods

to:

(:cellnr:) M: Nov 9 – CLUS: Density-based (:cell:) (:cell:) (:cellnr bgcolor=aliceblue:) R: Nov 12 – CLUS: Subspace (:cell:) (:cell:)

Changed lines 109-111 from:

[row] [l]Nov 2 – EXAM II [l]Nov 5 – CLUS: Hierarchical

to:

(:cellnr:) M: Nov 16 – CLUS: Subspace contd. (:cell:) (:cell:) (:cellnr bgcolor=aliceblue:) R: Nov 19 – CLASS: Kernel Methods (Kernel SVM) (:cell:) (:cell:)

Changed lines 116-118 from:

[row] [l]Nov 9 – CLUS: Density-based [l]Nov 12 – CLUS: Subspace

to:

(:cellnr:) M: Nov 23 – CLASS: Kernel PCA/LDA (:cell:) (:cell:) (:cellnr bgcolor=aliceblue:) R: Nov 26 – Thanksgiving Break

Changed lines 121-123 from:

[row] [l]Nov 16 – CLUS: Subspace contd. [l]Nov 19 – CLASS: Kernel Methods (Kernel SVM)

to:

(:cellnr:) M: Nov 30 - CLUS: Spectral Clustering (:cell:) (:cell:) (:cellnr bgcolor=aliceblue:) R: Dec 3 – EXAM III

Changed lines 126-128 from:

[row] [l]Nov 23 – CLASS: Kernel PCA/LDA [l]Nov 26 – Thanksgiving Break

to:

(:cellnr:) M: Dec 7 – Social Network Analysis (SNA) (:cell:) (:cell:) (:cellnr bgcolor=aliceblue:) R: Dec 10 - SNA: Graph Mining (:cell:) (:cell:)

Changed lines 133-141 from:

[row] [l]Nov 30 - CLUS: Spectral Clustering [l]Dec 3 – EXAM III


[row] [l]Dec 7 – Social Network Analysis (SNA) [l]Dec 10 - SNA: Graph Mining


[tableend]

to:

(:tableend:)

Added line 17:
  • Sep 12: I have posted the notes below. They are time-stamped so that if I update them, you can check if your copy is the latest one or not.
Changed line 41 from:

[l]Sep 3 – Exploratory Data Analysis (EDA): Numeric Attributes(Notes (PDF))(Video)

to:

[l]Sep 3 – Exploratory Data Analysis (EDA): Numeric Attributes(Notes(PDF))(Video)

Changed line 40 from:

[l]Sep 3 – Exploratory Data Analysis (EDA): Numeric Attributes (Notes (PDF)) (Video)

to:

[l]Sep 3 – Exploratory Data Analysis (EDA): Numeric Attributes(Notes (PDF))(Video)

Changed line 44 from:

[l]Sep 10 – EDA: Numeric & Categorical Attributes (Notes (PDF))(Video)

to:

[l]Sep 10 – EDA: Numeric & Categorical Attributes (Notes (PDF))(Video)

Changed line 40 from:

[l]Sep 3 – Exploratory Data Analysis (EDA): Numeric Attributes (Video)

to:

[l]Sep 3 – Exploratory Data Analysis (EDA): Numeric Attributes (Notes (PDF)) (Video)

Changed line 44 from:

[l]Sep 10 – EDA: Numeric & Categorical Attributes (Video)

to:

[l]Sep 10 – EDA: Numeric & Categorical Attributes (Notes (PDF))(Video)

Changed line 47 from:

[l]Sep 14 – Frequent Pattern Mining (FPM): Itemset Mining

to:

[l]Sep 14 – Frequent Pattern Mining (FPM): Itemset Mining

Changed lines 51-52 from:

[l]Sep 21 – Classification (CLASS): Decision Trees [l]Sep 24 – EDA: High Dimensional Data

to:

[l]Sep 21 – Classification (CLASS): Decision Trees [l]Sep 24 – EDA: High Dimensional Data

Changed lines 44-45 from:

[l]Sep 10 – EDA: Numeric & Categorical Attributes Video)

to:

[l]Sep 10 – EDA: Numeric & Categorical Attributes (Video)

Changed lines 44-45 from:

[l]Sep 10 – Frequent Pattern Mining (FPM): Itemset Mining

to:

[l]Sep 10 – EDA: Numeric & Categorical Attributes Video)

Changed lines 48-49 from:

[l]Sep 14 – Clustering (CLUS): Partitional [l]Sep 17 – Classification (CLASS): Decision Trees

to:

[l]Sep 14 – Frequent Pattern Mining (FPM): Itemset Mining [l]Sep 17 – Clustering (CLUS): Partitional

Changed lines 52-53 from:

[l]Sep 21 – EDA: High Dimensional Data [l]Sep 24 – EDA: Dimensionality Reduction (PCA/SVD)

to:

[l]Sep 21 – Classification (CLASS): Decision Trees [l]Sep 24 – EDA: High Dimensional Data

Changed line 56 from:

[l]Sep 28 – EDA: SVD contd.

to:

[l]Sep 28 – EDA: Dimensionality Reduction (PCA/SVD)

Added line 17:
  • Sep 8: Assignment 1 has been posted. See the general R/pmwiki instruction at Assignments and see the specific assignment at Assign1
Changed lines 38-39 from:

[l]Aug 31 – Data Mining Overview: PDF [l]Sep 3 – Exploratory Data Analysis (EDA): Numeric Attributes Video

to:

[l]Aug 31 – Data Mining Overview: (PDF) [l]Sep 3 – Exploratory Data Analysis (EDA): Numeric Attributes (Video)

Changed lines 27-28 from:

Calendar & Lecture Notes

to:

Calendar & Lecture Notes/Videos

Changed lines 38-39 from:

[l]Aug 31 – Data Mining Overview [l]Sep 3 – Exploratory Data Analysis (EDA): Numeric and Categorical

to:

[l]Aug 31 – Data Mining Overview: PDF [l]Sep 3 – Exploratory Data Analysis (EDA): Numeric Attributes Video

Changed line 31 from:

[table align=center border=1]

to:

[table align=center border=1 width=100%]

Changed line 5 from:

Class: 10-11:50AM, MR, Low 3045

to:

Class: 10-11:50AM, MR, Low 3045\\

Changed lines 27-28 from:

Calendar

to:

Calendar & Lecture Notes

Changed line 38 from:

[l]Aug 31 – Data Mining Overview

to:

[l]Aug 31 – Data Mining Overview

Added line 17:
  • Sep 2: Passwords for the assignment submission wiki were sent out yesterday. Contact me if you did not get the email.
Added lines 1-141:

CSCI-4390/6390: Data Mining, Fall 2009


Class: 10-11:50AM, MR, Low 3045 Instructor Office Hours: 12-1PM, MR



Announcements

(:table border=1 bgcolor=aliceblue width=100%:) (:cell:) (:div style="height: 200px; overflow: auto; text-align: justify; padding-top: 10px; padding-left:10px; padding-right:10px;" :)

  • Aug 30: Slight update of the syllabus.
  • Aug 19: Course website is up, with the tentative calendar and syllabus.

(:divend:) (:tableend:)



Calendar

A tentative sequence of topics to be covered in the classes; changes are likely as the course progresses.

[table align=center border=1]


[row bgcolor=lavender] [!c] Mondays [!c] Thursdays


[row] [l]Aug 31 – Data Mining Overview [l]Sep 3 – Exploratory Data Analysis (EDA): Numeric and Categorical


[row] [l]Sep 7 – Labor Day Holiday [l]Sep 10 – Frequent Pattern Mining (FPM): Itemset Mining


[row] [l]Sep 14 – Clustering (CLUS): Partitional [l]Sep 17 – Classification (CLASS): Decision Trees


[row] [l]Sep 21 – EDA: High Dimensional Data [l]Sep 24 – EDA: Dimensionality Reduction (PCA/SVD)


[row] [l]Sep 28 – EDA: SVD contd. [l]Oct 1 – EDA: Linear Discriminant Analysis (LDA)


[row] [l]Oct 5– EXAM I [l]Oct 8 – FPM: Itemset Summaries


[row] [l]Oct 13 – (Monday Schedule) FPM: Sequence Mining [l]Oct 15 – CLASS: Instance-based/Rule-based


[row] [l]Oct 19 – CLASS: Probabilistic [l]Oct 22 – CLASS: Support Vector Machines (SVM)


[row] [l]Oct 26 – CLASS: SVM contd. [l]Oct 29 – CLAS: Ensemble Methods


[row] [l]Nov 2 – EXAM II [l]Nov 5 – CLUS: Hierarchical


[row] [l]Nov 9 – CLUS: Density-based [l]Nov 12 – CLUS: Subspace


[row] [l]Nov 16 – CLUS: Subspace contd. [l]Nov 19 – CLASS: Kernel Methods (Kernel SVM)


[row] [l]Nov 23 – CLASS: Kernel PCA/LDA [l]Nov 26 – Thanksgiving Break


[row] [l]Nov 30 - CLUS: Spectral Clustering [l]Dec 3 – EXAM III


[row] [l]Dec 7 – Social Network Analysis (SNA) [l]Dec 10 - SNA: Graph Mining


[tableend]



Syllabus

(:table border=1 bgcolor=aliceblue width=100%:) (:cell:) (:div style="height: 400px; overflow: auto; text-align: justify; padding-top: 10px; padding-left:10px; padding-right:10px;" :)

Introduction

Data mining is the process of automatic discovery of patterns, models, changes, associations and anomalies in massive databases. This course will provide an introduction to the main topics in data mining and knowledge discovery, including: statistical foundations, pattern mining, classification, and clustering. Emphasis will be laid on the algorithmic foundations.

Learning Objectives

After taking this course students will be

  • knowledgeable about the fundamental data mining tasks like pattern mining, classification and clustering
  • able to understand the key algorithms for the main tasks
  • able to implement and apply the techniques to real world datasets
Prerequisites

The pre-requisites for this course include data structures and algorithms and discrete mathematics. Basics of linear algebra, and probability & statistics will be very useful as well. Assignments will require the use of the R software. Students are expected to learn R on their own. Assignments must be submitted online at the wiki site. Knowledge of pmwiki markup usage will be your responsibility.

Textbook

There is no required text for the course. Notes will be handed out in class.

The following text books are also good references:

  • Introduction to Data Mining, by Pang-Ning Tan, Michael Steinbach, and Vipin Kumar, Addison Wesley, 2006.
  • Data Mining: Concepts and Techniques (2nd edition), by Jiawei Han and Micheline Kamber, Morgan Kaufmann, 2006.
Grading Policy

Your grade will be a combination of the following items. Note that the final distribution is subject to some change depending on the number of assignments, but exams will be at least 60%.

  • Assignments (40%): The assignments are meant to be practically oriented. You'll be asked to run some mining methods on some real datasets, or to implement some algorithms, to complement the theory. There will be roughly one assignment per week, to be submitted via the course wiki site. User accounts will be created after first day of class.
  • Exams (60%): There will be three exams covering the main topics of the course. The tentative exam schedule is posted on the class schedule table. There is no comprehensive final exam.

Attendance: Students are strongly encouraged to participate in the class, and should try to attend all classes.

Academic Integrity

You may consult other members of the class on the homeworks, but you must submit your own work. Anytime you borrow material from the web or elsewhere, you must acknowledge the source.

The school takes cases of academic dishonesty very seriously, resulting in an automatic "F" grade for the course. Students should familiarize themselves with the relevant portion of the Rensselaer Handbook of Student Rights and Responsibilities on this topic. (:divend:) (:tableend:)