Newest 'evaluation' Questions

0 votes

0 answers

20 views

why does the f1 score declines from 1.0 to 0.0?

In the evaluation of a multi-class YOLOv8 model, the F1 confidence curve, why the F1 got dropped by 1.0 to 0.0 also it starts with 1.0? Why does it have to happen when there is an increase in the ...

NAKULAN T

1

asked Jul 17 at 13:19

1 vote

1 answer

31 views

Why do the sensitivity (recall) values differ between classification_report and precision_recall_fscore_support in a loop?

I am working with a synthetic dataset generated using make_classification from sklearn.datasets with 5 classes. I have trained a RandomForestClassifier on this data and am evaluating its performance ...

Atharva Rasane

23

asked Jun 29 at 14:57

0 votes

0 answers

18 views

problems calculating precision and recall

I need to calculate precision and recall to evaluate my model performance,so I am using this code that perform inference,annotate the images with the resulted class and calculates the precision and ...

anya

11

asked Jun 22 at 10:58

0 votes

0 answers

168 views

Mean Reciprocal Rank (MRR) understanding for predicting top k elements

I have the following code from a paper, who have implemented MRR for recommending top-k elements using some Machine Learning model. def MRR(test_y, pred_y, k=5): predict = pd.DataFrame([]) ...

amp1590

103

asked May 11 at 21:21

0 votes

0 answers

45 views

Unexpected search results: How does Gmail search interpret advanced queries, and how I can use that information to achieve precision?

Consider the following filter Keep this filter (also provided in query form) in mind. We'll come back to it later. from: { domain1 domain2 email1 email2 } subject: { +"exact string 1" +"...

Musixauce3000

549

asked May 10 at 18:25

-1 votes

1 answer

46 views

How to explain null-coalescing expression precedence evaluation with some operators?

The following code works fine. What is the logic behind the addition + being evaluated after the null-coalescing ??? How's that possible? Where is the doc explaining that? int? tNullable = 2; ...

Eric Ouellet

11.5k

asked May 8 at 15:38

1 vote

1 answer

25 views

Why does the approxes variable of a custom eval_metric with catboost for binary classification contain negative values?

In order to create a personal evaluation function with catboost for binary classification, I used the example mentioned here: How to create custom eval metric for catboost? However, I have negative ...

user23571732

11

asked Apr 29 at 15:57

0 votes

0 answers

18 views

how to append validation accuracy for each epoch in the code below?

def train(): seed_val = 42 criterion = CosineSimilarityLoss() criterion = criterion.to(device) random.seed(seed_val) torch.manual_seed(seed_val) We'll store a number of quantities such as training and ...

davood asgharzadeh

1

asked Apr 22 at 8:16

1 vote

1 answer

62 views

Ensuring Equivalence in Python Functions: Understanding Implementation Impacts

In defining function equivalence, several factors come into play: Producing equivalent results Sharing the same (non-)termination behavior Mutating (non-local) memory similarly Maintaining identical ...

Adrian

153

asked Apr 19 at 15:23

3 votes

3 answers

145 views

is "Side effects of a function are sequenced before its evaluation" specified by the C++ standard?

I didn't find relevant terms in Order of evaluation. So is the behavior of function g undefined in the code below? int x; int f() { return x++; } void g() { x = f(); } I compiled the code on ...

nalemy

63

asked Apr 17 at 10:17

0 votes

0 answers

103 views

model loss value for each epoch in sentence transformers framework

I'm trying to fine-tune a pre-trained language model using sentence transformers. The model I'm using is based on Bert. the method I'm using is fine-tunning via a siamese network so in order to do ...

davood asgharzadeh

1

asked Apr 15 at 13:43

0 votes

0 answers

18 views

Evaluation in recommendation system

I want to calculate the precision, recall, and f1-score values of the recommendation system that I built. I plan to conduct an evaluation by asking users directly whether the recommendation items ...

Vina

1

asked Apr 9 at 16:59

0 votes

0 answers

53 views

How to SpanQuery the evaluations in arize phoenix

I'm making a RAG application and I use arize phoenix for my logs. I can make evaluations but it seems like I can't make a query that gets the evaluations result in a dataframe. Does anyone have a ...

Dispasim

1

asked Apr 9 at 12:45

0 votes

0 answers

21 views

Testing and Evaluating Potentially Complex System of Interconnected Software Services

How to test and evaluate the interconnected software services as a whole with scientific citations? I expect to minimize risk by knowing certainty, increasing predictability, and considering more ...

muazhari

137

asked Apr 5 at 2:05

0 votes

0 answers

40 views

How can I make an effective Evaluation function for a Draughts/Checkers game with Minimax + alpha-beta pruning?

Making a Checkers game for an academic project and struggling to produce effective evaluation methods to push certain scenarios. My game's logic appears to work fine and everything functions in terms ...

arefawan

1

asked Mar 20 at 23:37

Collectives™ on Stack Overflow

Questions tagged [evaluation]

why does the f1 score declines from 1.0 to 0.0?

Why do the sensitivity (recall) values differ between classification_report and precision_recall_fscore_support in a loop?

problems calculating precision and recall

Mean Reciprocal Rank (MRR) understanding for predicting top k elements

Unexpected search results: How does Gmail search interpret advanced queries, and how I can use that information to achieve precision?

How to explain null-coalescing expression precedence evaluation with some operators?

Why does the approxes variable of a custom eval_metric with catboost for binary classification contain negative values?

how to append validation accuracy for each epoch in the code below?

Ensuring Equivalence in Python Functions: Understanding Implementation Impacts

is "Side effects of a function are sequenced before its evaluation" specified by the C++ standard?

model loss value for each epoch in sentence transformers framework

Evaluation in recommendation system

How to SpanQuery the evaluations in arize phoenix

Testing and Evaluating Potentially Complex System of Interconnected Software Services

How can I make an effective Evaluation function for a Draughts/Checkers game with Minimax + alpha-beta pruning?

Hot Network Questions

Collectives™ on Stack Overflow

Questions tagged [evaluation]

Related Tags