Changes between Initial Version and Version 1 of similarity_AcronymMatch


Timestamp:
06/12/06 12:15:59 (18 years ago)
Author:
endres
Comment:

--

  • similarity_AcronymMatch

    v1 v1  
     1[[TracNav]] 
     2 
     3= SimilarityMeasure: AcronymMatcher = 
     4 
     5Developer: [mailto:endres@dfki.uni-kl.de Björn Endres] 
     6 
     7== Description == 
     8This module uses the entities' labels to estimate how likely it is that one label is meant as an acronym of the other. Beyond plain initialisms, the algorithm recognises extended forms such as ''W3C'' and basic leet spellings such as ''2l8''. It is intended as a supplement to more general similarity measures, adding the ability to detect acronyms. The measure is symmetric, since the shorter of the two labels is always checked for being an acronym of the longer one. 
     9A set of parameters allows for tuning the measure to different scenarios. 
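
A minimal sketch of the symmetric check described above, not the module's actual algorithm: the class `AcronymSketch` and all of its methods are hypothetical names introduced for illustration. It only tests whether the shorter label can be read off the word initials of the longer one, returns a boolean rather than a graded value, and deliberately ignores the ''W3C'' and leet cases as well as acronyms built from letters inside a word (such as ''BRD'').

```java
class AcronymSketch {

    /** Collects the lower-cased first character of every word in the label. */
    static String initials(String label) {
        StringBuilder sb = new StringBuilder();
        // Split on anything that is neither a letter nor a digit.
        for (String word : label.trim().split("[^\\p{L}\\p{N}]+")) {
            if (!word.isEmpty()) {
                sb.append(Character.toLowerCase(word.charAt(0)));
            }
        }
        return sb.toString();
    }

    /** True if every character of the candidate appears, in order, among the word initials. */
    static boolean isInitialism(String candidate, String longLabel) {
        String init = initials(longLabel);
        String cand = candidate.toLowerCase();
        if (cand.isEmpty()) {
            return false;
        }
        int pos = 0;
        for (int i = 0; i < cand.length(); i++) {
            pos = init.indexOf(cand.charAt(i), pos);
            if (pos < 0) {
                return false; // this letter has no matching initial left
            }
            pos++; // continue searching after the matched initial
        }
        return true;
    }

    /**
     * Order-independent entry point: the shorter label is the acronym candidate.
     * On a length tie the first argument is treated as the candidate.
     */
    static boolean matches(String a, String b) {
        return a.length() <= b.length() ? isInitialism(a, b) : isInitialism(b, a);
    }
}
```

With this sketch, `AcronymSketch.matches("GMAT", "Graduate Management in Admission Test (Educational Testing Service)")` holds regardless of argument order, because filler words such as "in" may be skipped, while `AcronymSketch.matches("BAT", "ventricular fibrillation")` does not.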
     10 
     11== Characteristics == 
     12To demonstrate the abilities of this SimilarityMeasure, here are some examples: 
     13|| '''Frame A name''' || '''Frame B name''' || '''Measure value''' || 
     14|| Graduate Management in Admission Test (Educational Testing Service) || GMAT || 1.00 || 
     15|| International Semantic Web Conference 2005 || ISWC05 || 1.00 || 
     16|| The World Wide Web Consortium || W3C || 1.00 || 
     17|| ventricular fibrillation || v-fib || 1.00 || 
     18|| Bundesrepublik Deutschland || BRD || 0.92 || 
     19|| Roll on the floor, laughing! || rofl || 0.89 || 
     20|| || || || 
     21|| '''false positive examples:''' || || || 
     22|| Bundesrepublik Deutschland || brb || 0.63 || 
     23|| Graduate Management in Admission Test (Educational Testing Service) || GNU || 0.49 || 
     24|| ventricular fibrillation || BAT || 0.35 || 
     25 
     26The examples suggest that a threshold of approximately 0.9 should be applied in order to get reliable results. The values can, however, always be used as additional evidence. 
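
One way the threshold could be applied is sketched below. The class `ThresholdSketch`, the constant and both methods are hypothetical, and combining scores by taking the maximum is only an assumed interpretation of "additional evidence", not something the module prescribes.

```java
class ThresholdSketch {

    /** Assumed cut-off, taken from the "approximately 0.9" suggested by the examples. */
    static final double THRESHOLD = 0.9;

    /** Treats a measure value as a reliable acronym match only at or above the cut-off. */
    static boolean isReliableMatch(double measureValue) {
        return measureValue >= THRESHOLD;
    }

    /**
     * Assumed combination rule: the acronym value may raise, but never lower,
     * the score produced by a more general similarity measure.
     */
    static double combine(double generalScore, double acronymScore) {
        return Math.max(generalScore, acronymScore);
    }
}
```

Applied to the table above, a strict cut-off of 0.9 accepts ''BRD'' (0.92) and rejects ''brb'' (0.63), but it also rejects ''rofl'' (0.89), so the exact value is worth tuning against the labels at hand.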
     27 
     28 
     29== Evaluation/Performance == 
     30TODO 
     31 
     32== Specification == 
     33=== Initialisation === 
     34The SimilarityMeasure main class is 
     35 {{{de.dfki.km.phaselib.impl.similarities.acronymMatch.AcronymMatcher}}} 
     36 
     37Initialisation is straightforward: 
     38 {{{new AcronymMatcher()}}} 
     39 
     40=== Parameters === 
     41none 
     42 
     43=== Dependencies === 
     44none 
     45 
     46== License Issues == 
     47TODO