Chapter 2: | Segmenting Customer Transactions Using a Pattern-Based Clustering Approach |
This is a limited free preview of this book. Please buy full access.
and that the functions defined below for the clusters are for the general case. However, in the implementation described in Section 4, we collapse the numeric attributes into categories in order to discover frequent itemsets from data using existing algorithms such as A priori (Agrawal et al. 1995). Below we describe the objective function of the pattern-based clustering method.
2.2.2 Objective Function for Pattern-Based Clustering
Consider a collection of transactions to be clustered {T1, T2, … , Tn}. Each transaction Ti contains a subset of a list of candidate items {i1, i2, … , im}. A clustering C is a partition {C1, C2, … , Ck} of {T1, T2, … , Tn} and each Ci is a cluster. The goal of this method is to maximize the difference between clusters and the similarity of transactions within clusters. We cluster to maximize a quantity M, where M is defined as follows:
Here we only give a specific definition for the difference between two clusters. This is sufficient since hierarchical clustering techniques can be used to cluster the transactions repeatedly into two groups in such a way that the process results in clustering the transactions into an arbitrary number of clusters (which is generally desirable because the number of clusters does not have to be specified up front).
Now, we define the difference between two clusters.
Definition 1 (Difference between two clusters):
Let Sad = the number of transactions containing pattern Pa in cluster Cd.
|Cd| is the number of transactions in cluster Cd.
Let , and S (Pa, Cd) is called the support of pattern Pa in cluster Cd. It is the fraction of transactions in cluster Cd that contain pattern Pa.