Conference Proceedings
PACRIM Congress 2008
Conference Proceedings
PACRIM Congress 2008
Data Driven Resource Discovery Using Self-Organising Maps - An Introduction
The self-organising map (SOM) is an exploratory data mining technique that is both non-traditional and underutilised. Methodologies based on SOM tend to be data-driven and unsupervised, which makes them ideal to assist in the integrated analysis and interpretation of complex and disparate mineral exploration' data sets._x000D_
While traditional statistical multivariate approaches have difficulty with relationships that are non-linear and data distributions that are not normal, SOM-based data mining procedures are useful in these circumstances. In a SOM analysis each sample is treated as a vector in a data space defined by the variables; and measures of vector similarity, such as the dot-product or Euclidean distance, are used to order or segment a data set into naturally occurring populations._x000D_
These groupings are positioned as nodes, or groups of nodes, on a 2D rectilinear representation of the multi-dimensional data space', which is the self organized map'. While it is common not to include a sample's locational information in the actual SOM analysis, such information can be used to display the spatial location of samples coded by their SOM node or cluster. If one finds there are coherent spatial patterns belonging to samples from particular nodes, or groups of nodes, then there is strong evidence that the technique is determining patterns and relationships within the data that have natural significance. Variations of the SOM technique may be used to perform broad categories of operations, including: 1. outlier identification - for samples belonging to various nodes or grouping of nodes; 2. function fitting, prediction, estimation or imputation - for calibrating relationships between variables and determining replacement values for missing, null or censored values; 3. clustering, pattern recognition or noise reduction - for determining the natural patterns within a data set; and 4. classification - for classifying samples into the clusters determined in three._x000D_
AnEXTENDED ABSTRACTis available for download. A full-length paper was notprepared for this presentation.
While traditional statistical multivariate approaches have difficulty with relationships that are non-linear and data distributions that are not normal, SOM-based data mining procedures are useful in these circumstances. In a SOM analysis each sample is treated as a vector in a data space defined by the variables; and measures of vector similarity, such as the dot-product or Euclidean distance, are used to order or segment a data set into naturally occurring populations._x000D_
These groupings are positioned as nodes, or groups of nodes, on a 2D rectilinear representation of the multi-dimensional data space', which is the self organized map'. While it is common not to include a sample's locational information in the actual SOM analysis, such information can be used to display the spatial location of samples coded by their SOM node or cluster. If one finds there are coherent spatial patterns belonging to samples from particular nodes, or groups of nodes, then there is strong evidence that the technique is determining patterns and relationships within the data that have natural significance. Variations of the SOM technique may be used to perform broad categories of operations, including: 1. outlier identification - for samples belonging to various nodes or grouping of nodes; 2. function fitting, prediction, estimation or imputation - for calibrating relationships between variables and determining replacement values for missing, null or censored values; 3. clustering, pattern recognition or noise reduction - for determining the natural patterns within a data set; and 4. classification - for classifying samples into the clusters determined in three._x000D_
AnEXTENDED ABSTRACTis available for download. A full-length paper was notprepared for this presentation.
Contributor(s):
S J Fraser, J H Hodgkinson, B L Dickson
-
Data Driven Resource Discovery Using Self-Organising Maps - An IntroductionPDFThis product is exclusive to Digital library subscription
-
Data Driven Resource Discovery Using Self-Organising Maps - An IntroductionPDFNormal price $22.00Member price from $0.00
Fees above are GST inclusive
PD Hours
Approved activity
- Published: 2008
- PDF Size: 0.419 Mb.
- Unique ID: P200811032