TKDD: Vol 3, No 1

Volume 3, Issue 1March 2009

Volume 3, Issue 1

March 2009

Publisher:

Association for Computing Machinery
New York
NY
United States

ISSN:1556-4681

EISSN:1556-472X

Tags:

Subscribe to Journal Recommend ACM DL

ALREADY A SUBSCRIBER?SIGN IN

Bibliometrics

Issue Downloads

PDFfront matter (TOC, masthead, submission information)

Select All

Export Citations Save to Binder

research-article

Clustering high-dimensional data: A survey on subspace clustering, pattern-based clustering, and correlation clustering

Article No.: 1, pp 1–58https://doi.org/10.1145/1497577.1497578

As a prolific research area in data mining, subspace clustering and related problems induced a vast quantity of proposed solutions. However, many publications compare a new proposition—if at all—with one or two competitors, or even with a so-called “...

research-article

Semi-analytical method for analyzing models and model selection measures based on moment analysis

Article No.: 2, pp 1–51https://doi.org/10.1145/1497577.1497579

In this article we propose a moment-based method for studying models and model selection measures. By focusing on the probabilistic space of classifiers induced by the classification algorithm rather than on that of datasets, we obtain efficient ...

research-article

Closed patterns meet n-ary relations

Article No.: 3, pp 1–36https://doi.org/10.1145/1497577.1497580

Set pattern discovery from binary relations has been extensively studied during the last decade. In particular, many complete and efficient algorithms for frequent closed set mining are now available. Generalizing such a task to n-ary relations (n ≥ 2) ...

research-article

DOLPHIN: An efficient algorithm for mining distance-based outliers in very large datasets

Article No.: 4, pp 1–57https://doi.org/10.1145/1497577.1497581

In this work a novel distance-based outlier detection algorithm, named DOLPHIN, working on disk-resident datasets and whose I/O cost corresponds to the cost of sequentially reading the input dataset file twice, is presented.

It is both theoretically and ...

research-article

Bellwether analysis: Searching for cost-effective query-defined predictors in large databases

Article No.: 5, pp 1–49https://doi.org/10.1145/1497577.1497582

How to mine massive datasets is a challenging problem with great potential value. Motivated by this challenge, much effort has concentrated on developing scalable versions of machine learning algorithms. However, the cost of mining large datasets is not ...

ACM Transactions on Knowledge Discovery from Data

Sections

Issue Downloads

Clustering high-dimensional data: A survey on subspace clustering, pattern-based clustering, and correlation clustering

Semi-analytical method for analyzing models and model selection measures based on moment analysis

Closed patterns meet n-ary relations

DOLPHIN: An efficient algorithm for mining distance-based outliers in very large datasets

Bellwether analysis: Searching for cost-effective query-defined predictors in large databases

Sections

Issue Downloads

Clustering high-dimensional data: A survey on subspace clustering, pattern-based clustering, and correlation clustering

Semi-analytical method for analyzing models and model selection measures based on moment analysis

Closed patterns meet n-ary relations

DOLPHIN: An efficient algorithm for mining distance-based outliers in very large datasets

Bellwether analysis: Searching for cost-effective query-defined predictors in large databases

Save to Binder

Subjects

Comments