I'm working on a problem that involves ranking based on pairwise comparisons (it's for a scientific problem, not actually for games). My comparisons return a numerical score (in practice roughly normally distributed around zero), not a binary outcome (win/lose or draw) as in Elo. For an A vs B comparison, a score > 0 means A wins and a score < 0 means B wins, and the magnitude of the score reflects how "strong" the win was. Scores from repeated comparisons of the same pair are likely to be roughly normally distributed around the average of the previous scores, so a score with large magnitude also implies a high probability that future scores will have the same sign.
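To make the last point concrete, here is a minimal sketch of the assumption, not my actual pipeline. It treats repeat scores for one pair as draws from a normal distribution centred on that pair's mean score; the function name `prob_same_sign` and the noise level `noise_sd=1.0` are made up for illustration.

```python
from statistics import NormalDist


def prob_same_sign(mean_score, noise_sd):
    """Probability that a repeat comparison has the same sign as mean_score.

    Assumes repeat scores ~ Normal(mean_score, noise_sd), which is only
    an approximation of the behaviour described above.
    """
    repeat = NormalDist(mu=mean_score, sigma=noise_sd)
    p_positive = 1.0 - repeat.cdf(0.0)  # P(next score > 0)
    return p_positive if mean_score >= 0 else 1.0 - p_positive


# Hypothetical mean scores: the larger the magnitude, the more likely
# a repeat comparison agrees in sign.
for mean_score in (0.1, 1.0, -2.0):
    print(mean_score, round(prob_same_sign(mean_score, noise_sd=1.0), 3))
```

Under this assumption a mean score near zero is close to a coin flip, while a mean score of a couple of standard deviations makes a sign flip unlikely, which is the information a binary win/lose outcome would throw away.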
Is there an extension to Elo or an Elo-like model which can use numerical scores?