Changes between Version 2 and Version 3 of generator_PhaseTab


Ignore:
Timestamp:
09/25/06 18:00:00 (18 years ago)
Author:
endres
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • generator_PhaseTab

    v2 v3  
    1010The basic principle of the algorithm is the iterative confirmation of the best ranked relation proposals. This approach makes use of a feature of some [wiki:SimilarityMeasure SimilarityMeasures], especially the SimilarityFlooding, which increase their precision when given higher confidences on correct relations. 
    1111 
    12 In order to archieve a robust and fair ranking, a BordaCount alogrithm is used to identify the {{{n}}} best rated realtion of all SimilarityMeasures involved.  
     12In order to archieve a robust and fair ranking, a [wiki:generator_BordaCount BordaCount] alogrithm is used to identify the {{{n}}} best rated realtion of all SimilarityMeasures involved.  
    1313 
    1414An iteration cycle now consists of three phases: 
    15  1) Generate the SimilarityMatrix for all [wiki:SimilarityMeasure SimilarityMeasures] involved, using the current alignment as an input; the current alignment being empty in the first iteration. If the SimilarityMeasure is independent of such input, e.g. with a frame name based comparison measure, the matrix should not be recalculated. This is, however, up to the measure to decide. 
    16  2) Copy all 1.0 rated realtions, i.e. "confirmed ones", into a new and empty alignment. 
    17  3) Rank the remaining relations and identify the {{{n}}} best. These are also confirmed by setting their confidence to 1.0. 
    18  4) Finally append all other relations with the confidences given by the BordaCount algorithm. 
    1915 
     16 1. Generate the SimilarityMatrix for all [wiki:SimilarityMeasure SimilarityMeasures] involved, using the current alignment as an input; the current alignment being empty in the first iteration. If the SimilarityMeasure is independent of such input, e.g. with a frame name based comparison measure, the matrix should not be recalculated. This is, however, up to the measure to decide. 
    2017 
     18 2. Copy all 1.0 rated realtions, i.e. "confirmed ones", into a new and empty alignment. 
     19 
     20 3. Rank the remaining relations and identify the {{{n}}} best. These are also confirmed by setting their confidence to 1.0. 
     21 
     22 4. Finally add all remaining relations with the confidences calculated by the [wiki:generator_BordaCount BordaCount] algorithm. It is up to the used Alignment implementation, to filter these masses of relations. Typically, a MatchingAlignment would be used. 
     23 
     24 5. Reiterate 1)-4) until a maximal number is reached or all relations are confirmed. 
    2125 
    2226 
    2327== Characteristics == 
     28This algorithm estimates, that the best scoring relations can be confied in. According to our experiences, this is the case in many scenarios. It is, however, highly vulnerable to errors, since they become confirmed and add a high amount of noise to the algorithms confidences. Therefore, it is crucial to keep an eye on at least two parameters: the number {{{n}}} of relations confirmed per iteration, and the number of iterations. We have not yet been able to define a general criteria to determine when the algorithm trails into cascades of false confirmation. 
     29 
     30The alignment generated by this algorithm lacks, due to its radical approach, reasonable relation confidences. This should not be a problem for end users, but is not beneficial when it comes to using it as a subroutine. 
     31 
    2432 
    2533== Evaluation/Performance ==