I'd start to address this problem by coming up with a better scoring function for the data as it's likely that a high percentage of the records won't have a job title. What's are business requirements for choosing the contact to output? Perhaps you can use a combination of attributes for scoring purposes. If you really need to make an arbitrary selection between two very similar records then you could consider adding a unique sequence number to differentiate between two tied records.
Alternatively, if more complex selection logic is required then it is possible to write custom selection functions but I'd try to avoid this route if possible.
Thanks for your reply Nick,
An arbitrary selection between the highest scoring records is what we want. We have a unique ID on each record in the incoming file.
I tried adding the value from this ID as the second input for the merge but it didn't seem to secondary use this ID when there was a tie.
How do i configure the merge processor to choose a secondary value from the highest if there is a tie?
The use of a secondary value would require a custom selection function.
How about combining the score with your id so that the id forms the decimal part of the number so it will always result in a winner.
ID = 123
Score = 100.123
ID = 456
Score = 100.456
100.456 > 100.123 so record 2 wins!
You'll probably need to do this by concatenating strings and then converting to a numeric attribute at the end.
Thanks for this Nick, it's greatly appreciated. This works well