CONTENTS

  1. Distance metric definition
  2. Ideal generalization threshold
  3. DISTORTION: Equivalence class wise quality metric, Attribute wise quality metric, Overall quality metric
  4. HEIGHT-COST-DISTANCE scheme

PRE-PREQUISITES

The metric scheme takes into account two types of attributes - numerical and categorical. Numerical attributes have a hierarchy based on their absolute values. Categorical attributes have a hierarchy that can be represented using a tree structure.

  1. Distance Metric Definition

with normalisation:

Numerical -

Untitled

N represents an Attribute, i and j represent different values of the attribute

Categorical -

Untitled

without normalisation:

Numerical -

Untitled