Changes between Version 2 and Version 3 of generator_PhaseTab
- Timestamp:
- 09/25/06 18:00:00 (18 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
generator_PhaseTab
v2 v3 10 10 The basic principle of the algorithm is the iterative confirmation of the best ranked relation proposals. This approach makes use of a feature of some [wiki:SimilarityMeasure SimilarityMeasures], especially the SimilarityFlooding, which increase their precision when given higher confidences on correct relations. 11 11 12 In order to archieve a robust and fair ranking, a BordaCountalogrithm is used to identify the {{{n}}} best rated realtion of all SimilarityMeasures involved.12 In order to archieve a robust and fair ranking, a [wiki:generator_BordaCount BordaCount] alogrithm is used to identify the {{{n}}} best rated realtion of all SimilarityMeasures involved. 13 13 14 14 An iteration cycle now consists of three phases: 15 1) Generate the SimilarityMatrix for all [wiki:SimilarityMeasure SimilarityMeasures] involved, using the current alignment as an input; the current alignment being empty in the first iteration. If the SimilarityMeasure is independent of such input, e.g. with a frame name based comparison measure, the matrix should not be recalculated. This is, however, up to the measure to decide.16 2) Copy all 1.0 rated realtions, i.e. "confirmed ones", into a new and empty alignment.17 3) Rank the remaining relations and identify the {{{n}}} best. These are also confirmed by setting their confidence to 1.0.18 4) Finally append all other relations with the confidences given by the BordaCount algorithm.19 15 16 1. Generate the SimilarityMatrix for all [wiki:SimilarityMeasure SimilarityMeasures] involved, using the current alignment as an input; the current alignment being empty in the first iteration. If the SimilarityMeasure is independent of such input, e.g. with a frame name based comparison measure, the matrix should not be recalculated. This is, however, up to the measure to decide. 20 17 18 2. Copy all 1.0 rated realtions, i.e. "confirmed ones", into a new and empty alignment. 19 20 3. Rank the remaining relations and identify the {{{n}}} best. These are also confirmed by setting their confidence to 1.0. 21 22 4. Finally add all remaining relations with the confidences calculated by the [wiki:generator_BordaCount BordaCount] algorithm. It is up to the used Alignment implementation, to filter these masses of relations. Typically, a MatchingAlignment would be used. 23 24 5. Reiterate 1)-4) until a maximal number is reached or all relations are confirmed. 21 25 22 26 23 27 == Characteristics == 28 This algorithm estimates, that the best scoring relations can be confied in. According to our experiences, this is the case in many scenarios. It is, however, highly vulnerable to errors, since they become confirmed and add a high amount of noise to the algorithms confidences. Therefore, it is crucial to keep an eye on at least two parameters: the number {{{n}}} of relations confirmed per iteration, and the number of iterations. We have not yet been able to define a general criteria to determine when the algorithm trails into cascades of false confirmation. 29 30 The alignment generated by this algorithm lacks, due to its radical approach, reasonable relation confidences. This should not be a problem for end users, but is not beneficial when it comes to using it as a subroutine. 31 24 32 25 33 == Evaluation/Performance ==