I want to conduct a Wilcoxon signed-rank test but stumbled upon two questions that I am unable to solve on my own. I tested two types of interface for a piece of software with the same ten people, and I want to find out whether there is a difference between using these two interfaces. Each person answered the same ten questions for both interfaces. Each question was answered on a scale from one to five (1 = strongly disagree, 5 = strongly agree).
My first question is whether my data is appropriate for a Wilcoxon signed-rank test. My questions were on a Likert scale, so the answers are ordinal data. I have read that ordinal data is not sufficient for a Wilcoxon signed-rank test, but I have also read the opposite. I am confused now and would like some confirmation for my use case.
The second question is something I couldn't find in the literature. Since I want to compare the two interfaces, my idea was to sum up all the answers for each question and then take the difference and calculate the ranks based on those sums. For example, question one has 10 values for Interface1 and 10 for Interface2, with totals of, say, 30 points for Interface1 and 35 points for Interface2. Then I take the difference and calculate the rank of all questions based on their sums. Is this a valid approach, or should I rather take the differences and rank the answers for each question individually?
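
To make the two options concrete, here is a minimal Python sketch of what I mean. The data is randomly generated placeholder data, not my real answers, and the array layout (rows are participants, columns are questions) and all variable names are just assumptions for illustration. I read "each question individually" as running one test per question across the ten participants, which may not be the only way to interpret it.

```python
import numpy as np
from scipy.stats import rankdata, wilcoxon

# Placeholder data, NOT my real answers: 10 participants (rows) x 10 questions
# (columns), each answer an integer from 1 to 5.
rng = np.random.default_rng(0)
interface1 = rng.integers(1, 6, size=(10, 10))
interface2 = rng.integers(1, 6, size=(10, 10))

# Option A: sum the answers per question, then compare the 10 paired totals.
totals1 = interface1.sum(axis=0)          # one total per question
totals2 = interface2.sum(axis=0)
diff_totals = totals2 - totals1           # signed difference per question
nonzero = diff_totals[diff_totals != 0]   # zero differences are dropped
ranks = rankdata(np.abs(nonzero))         # ranks of |difference|, ties averaged
result_a = wilcoxon(totals1, totals2)     # the pairs here are the questions

# Option B: one test per question; the pairs are the 10 participants' answers.
# With 1-5 answers there are many ties and zeros, so SciPy may emit warnings.
results_b = [
    wilcoxon(interface1[:, q], interface2[:, q])
    for q in range(interface1.shape[1])
]

print("Option A p-value:", result_a.pvalue)
for q, res in enumerate(results_b, start=1):
    print(f"Option B, question {q}: p-value = {res.pvalue}")
```

In option A the paired observations are the ten questions, while in option B they are the ten participants, so the two options are not testing quite the same thing, and that is part of what I am unsure about.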