Why does byte + byte = int?

Question

Looking at this C# code:

byte x = 1;
byte y = 2;
byte z = x + y; // ERROR: Cannot implicitly convert type 'int' to 'byte'

The result of any math performed on byte (or short) types is implicitly cast back to an integer. The solution is to explicitly cast the result back to a byte:

byte z = (byte)(x + y); // this works

What I am wondering is why? Is it architectural? Philosophical?

We have:

int + int = int
long + long = long
float + float = float
double + double = double

So why not:

byte + byte = byte
short + short = short?

A bit of background: I am performing a long list of calculations on "small numbers" (i.e. < 8) and storing the intermediate results in a large array. Using a byte array (instead of an int array) is faster (because of cache hits). But the extensive byte-casts spread through the code make it that much more unreadable.

The various musings below are a reasonable approximation of the design considerations. More generally: I don't think of bytes as "numbers"; I think of them as patterns of bits that could be interpreted as numbers, or characters, or colors or whatever. If you're going to be doing math on them and treating them as numbers, then it makes sense to move the result into a data type that is more commonly interpreted as a number. — Eric Lippert, Commented Jun 2, 2009 at 20:57
@Eric: That makes a lot of sense for byte, but probably not as much sense for short/ushort. — Jon Skeet, Commented Jun 3, 2009 at 12:24
@Eric: byte1 | byte2 is not at all treating them as numbers. This is treating them precisely as patterns of bits. I understand your point of view, but it just so happens that every single time I did any arithmetic on bytes in C#, I was actually treating them as bits, not numbers, and this behaviour is always in the way. — Roman Starkov, Commented Dec 28, 2009 at 11:46
Possible duplicate of Integer summing blues, short += short problem — GSerg, Commented Apr 14, 2016 at 20:17

azheglov · Accepted Answer · 2009-06-02 20:17:51Z

247

The third line of your code snippet:

byte z = x + y;

actually means

byte z = (int) x + (int) y;

So, there is no + operation on bytes, bytes are first cast to integers and the result of addition of two integers is a (32-bit) integer.

answered Jun 2, 2009 at 20:17

azheglov

5,5031 gold badge23 silver badges30 bronze badges

I have tried code below but it still not working. byte z = (byte)x + (byte)y;
– Anonymous
Commented Jun 4, 2009 at 5:51
11

that is because there is no + operation for bytes (see above). Try byte z = (byte)( (int) x + (int) y)
– azheglov
Commented Jun 5, 2009 at 18:51
The OP asks why it needs to cast. It doesn't cast for int/long/uint..., but why just for byte? and also short/ushort. You add two short and end up with an int type? Then why not two int becomes long? This sounds more like an inconsistent design decision.
– joe
Commented Apr 18, 2021 at 2:09

Add a comment |

Salman Arshad · Accepted Answer · 2012-06-23 06:31:28Z

186

In terms of "why it happens at all" it's because there aren't any operators defined by C# for arithmetic with byte, sbyte, short or ushort, just as others have said. This answer is about why those operators aren't defined.

I believe it's basically for the sake of performance. Processors have native operations to do arithmetic with 32 bits very quickly. Doing the conversion back from the result to a byte automatically could be done, but would result in performance penalties in the case where you don't actually want that behaviour.

I think this is mentioned in one of the annotated C# standards. Looking...

EDIT: Annoyingly, I've now looked through the annotated ECMA C# 2 spec, the annotated MS C# 3 spec and the annotation CLI spec, and none of them mention this as far as I can see. I'm sure I've seen the reason given above, but I'm blowed if I know where. Apologies, reference fans :(

edited Jun 23, 2012 at 6:31

Salman Arshad

270k83 gold badges436 silver badges528 bronze badges

answered Jun 2, 2009 at 20:08

Jon Skeet

1.5m881 gold badges9.2k silver badges9.3k bronze badges

4

I would argue that for symmetry, 32 + 240 becoming 16 is just as logical as int.MaxValue + 1 being int.MinValue (modulo Eric's comment about byte not really being a number so much as a collection of bits). It's nice that we have the concept of a checked context in C#...
– Jon Skeet
Commented Jun 3, 2009 at 14:56
2

Some people don't seem to like this "boring" answer, because it's too practical. They want something more conceptual. To me, this practical answer seems so much more plausible: when you design a spec, you also need to take into practical considerations. An int is designed to be added using a CPU, a byte is designed to store data. When you do an addition, you use a data type optimized for addition.
– charles
Commented Jul 13, 2010 at 4:59
1

@Will: I have hard copies - it's well worth getting: amazon.com/dp/0321741765
– Jon Skeet
Commented Apr 5, 2011 at 19:53
1

@JonSkeet the only place I have seen mentioning similar to what you said is "C# in a nutshell": 8 and 16 bit integrals lack their own arithmetic operators and compiler implicitly converted them into larger type (int32) as required.
– dragonfly02
Commented May 23, 2015 at 10:10
2

@KevinMuhuri: Yes, because += is a compound assignment operator, which has an implicit cast in it.
– Jon Skeet
Commented Dec 11, 2020 at 6:31

| Show 7 more comments

TylerH · Accepted Answer · 2023-10-11 14:46:27Z

70

From the article Why do operations on "byte" result in "int"? on Raymond Chen's blog, The Old New Thing:

Suppose we lived in a fantasy world where operations on 'byte' resulted in 'byte'.
byte b = 32;
byte c = 240;
int i = b + c; // what is i?
In this fantasy world, the value of i would be 16! Why? Because the two operands to the + operator are both bytes, so the sum "b+c" is computed as a byte, which results in 16 due to integer overflow. (And, as I noted earlier, integer overflow is the new security attack vector.)

Raymond is defending, essentially, the approach C and C++ took originally. In the comments, he defends the fact that C# takes the same approach, on the grounds of language backward compatibility.

edited Oct 11, 2023 at 14:46

TylerH

21.1k72 gold badges78 silver badges105 bronze badges

answered Jun 2, 2009 at 20:04

Michael Petrotta

60.5k27 gold badges149 silver badges181 bronze badges

47

With integers if we add them and it overflows it doesn't automatically cast it as a different datatype though so why do it with byte?
– Ryan
Commented Jun 2, 2009 at 20:06
2

With ints it does overflow. Try adding int.MaxValue + 1 you get -2147483648 instead of 2147483648.
– David Basarab
Commented Jun 2, 2009 at 20:13
9

@Longhorn213: Yep, that's what Ryan's saying: int math can overflow, but int math doesn't return longs.
– Michael Petrotta
Commented Jun 2, 2009 at 20:15
30

Exactly. If this is meant to be a security measure, it's a very poorly implemented one ;)
– Jon Skeet
Commented Jun 2, 2009 at 20:15
1

I'm not sure if Raymond (for once) should be considered the authority on this. Read the comments on the blog entry. His defense of this in C# is basically "because C++ does it that way" blogs.msdn.com/oldnewthing/archive/2004/03/10/87247.aspx#87811.
– Aardvark
Commented Jun 2, 2009 at 20:22

| Show 6 more comments

Alun Harford · Accepted Answer · 2009-06-02 23:21:36Z

C#

ECMA-334 states that addition is only defined as legal on int+int, uint+uint, long+long and ulong+ulong (ECMA-334 14.7.4). As such, these are the candidate operations to be considered with respect to 14.4.2. Because there are implicit casts from byte to int, uint, long and ulong, all the addition function members are applicable function members under 14.4.2.1. We have to find the best implicit cast by the rules in 14.4.2.3:

Casting(C1) to int(T1) is better than casting(C2) to uint(T2) or ulong(T2) because:

If T1 is int and T2 is uint, or ulong, C1 is the better conversion.

Casting(C1) to int(T1) is better than casting(C2) to long(T2) because there is an implicit cast from int to long:

If an implicit conversion from T1 to T2 exists, and no implicit conversion from T2 to T1 exists, C1 is the better conversion.

Hence the int+int function is used, which returns an int.

Which is all a very long way to say that it's buried very deep in the C# specification.

CLI

The CLI operates only on 6 types (int32, native int, int64, F, O, and &). (ECMA-335 partition 3 section 1.5)

Byte (int8) is not one of those types, and is automatically coerced to an int32 before the addition. (ECMA-335 partition 3 section 1.6)

That the ECMA only specifies those particular operations would not prevent a language from implementing other rules. VB.NET will helpfully allow byte3 = byte1 And byte2 without a cast, but unhelpfully will throw a runtime exception if int1 = byte1 + byte2 yields a value over 255. I don't know if any languages would allow byte3 = byte1+byte2 and throw an exception when that exceeds 255, but not throw an exception if int1 = byte1+byte2 yields a value in the range 256-510. — supercat, Commented Jul 6, 2014 at 20:28

Christopher · Accepted Answer · 2009-06-02 21:30:24Z

29

The answers indicating some inefficiency adding bytes and truncating the result back to a byte are incorrect. x86 processors have instructions specifically designed for integer operation on 8-bit quantities.

In fact, for x86/64 processors, performing 32-bit or 16-bit operations are less efficient than 64-bit or 8-bit operations due to the operand prefix byte that has to be decoded. On 32-bit machines, performing 16-bit operations entail the same penalty, but there are still dedicated opcodes for 8-bit operations.

Many RISC architectures have similar native word/byte efficient instructions. Those that don't generally have a store-and-convert-to-signed-value-of-some-bit-length.

In other words, this decision must have been based on perception of what the byte type is for, not due to underlying inefficiencies of hardware.

answered Jun 2, 2009 at 21:30

Christopher

8,9942 gold badges33 silver badges42 bronze badges

+1; if only this perception wasn't wrong every single time I have ever shifted and OR'd two bytes in C#...
– Roman Starkov
Commented Dec 28, 2009 at 11:43
There shouldn't be any performance cost for truncating the result. In x86 assembly it is just the difference between copying one byte of out the register or four bytes out of the register.
– Jonathan Allen
Commented Aug 8, 2010 at 4:55
1

@JonathanAllen Exactly. The only difference is, ironically enough, when performing a widening conversion. The current design incurs a performance penalty to execute the widening instruction (either signed extend or unsigned extend.)
– reirab
Commented Jun 26, 2015 at 20:55
"perception of what the byte type is for" -- That may explain this behavior for byte (and char), but not for short which semantically is clearly a number.
– smls
Commented Apr 19, 2018 at 6:36
I still don't understand. Why would they limit the potential use of bytes? I have a program which I want to optimize by using bytes instead of ints to save memory, and addition is such a natural thing to want to use. I don't care about the overflow aspect.
– Dan W
Commented Feb 23, 2022 at 7:24

Add a comment |

BFree · Accepted Answer · 2009-06-02 20:06:14Z

13

I remember once reading something from Jon Skeet (can't find it now, I'll keep looking) about how byte doesn't actually overload the + operator. In fact, when adding two bytes like in your sample, each byte is actually being implicitly converted to an int. The result of that is obviously an int. Now as to WHY this was designed this way, I'll wait for Jon Skeet himself to post :)

EDIT: Found it! Great info about this very topic here.

answered Jun 2, 2009 at 20:06

BFree

103k21 gold badges160 silver badges204 bronze badges

Add a comment |

SSpoke · Accepted Answer · 2014-04-24 05:49:43Z

8

This is because of overflow and carries.

If you add two 8 bit numbers, they might overflow into the 9th bit.

Example:

  1111 1111
+ 0000 0001
-----------
1 0000 0000

I don't know for sure, but I assume that ints, longs, anddoubles are given more space because they are pretty large as it is. Also, they are multiples of 4, which are more efficient for computers to handle, due to the width of the internal data bus being 4 bytes or 32 bits (64 bits is getting more prevalent now) wide. Byte and short are a little more inefficient, but they can save space.

edited Apr 24, 2014 at 5:49

SSpoke

5,79610 gold badges76 silver badges129 bronze badges

answered Jun 2, 2009 at 20:03

foobarfuzzbizz

58.1k55 gold badges144 silver badges197 bronze badges

25

But the larger data types dont follow the same behavior.
– Inisheer
Commented Jun 2, 2009 at 20:04
14

Issues of overflow are an aside. If you were to take your logic and apply it to the language, then all data types would return a larger data type after addition arithmetic, which is most definitely NOT the case. int + int = int, long + long = long. I think the question is in regards to the inconsistency.
– Joseph
Commented Jun 2, 2009 at 20:05
That was my first thought but then why doesn't int+int = long? So I'm not buying the "possible overflow" arguement... yet <grin>.
– Robert Cartaino
Commented Jun 2, 2009 at 20:07
12

Oh, and about the "possible overflow" argeument, why not byte + byte = short?
– Robert Cartaino
Commented Jun 2, 2009 at 20:08
A) Why does it work the way it works given the rules of C#? See my answer below. B) Why was it designed the way it is? Probably just usability considerations, based on subjective judgements on the way most people tend to use ints and bytes.
– mqp
Commented Jun 2, 2009 at 20:12

| Show 2 more comments

TylerH · Accepted Answer · 2023-10-11 14:23:36Z

5

From the C# language spec 1.6.7.5 7.2.6.2 Binary numeric promotions it converts both operands to int if it can't fit it into several other categories. My guess is they didn't overload the + operator to take byte as a parameter but want it to act somewhat normally so they just use the int data type.

C# language Spec

edited Oct 11, 2023 at 14:23

TylerH

21.1k72 gold badges78 silver badges105 bronze badges

answered Jun 2, 2009 at 20:13

Ryan

4,6428 gold badges38 silver badges43 bronze badges

Add a comment |

mqp · Accepted Answer · 2009-06-02 20:05:52Z

4

My suspicion is that C# is actually calling the operator+ defined on int (which returns an int unless you are in a checked block), and implicitly casting both of your bytes/shorts to ints. That's why the behavior appears inconsistent.

answered Jun 2, 2009 at 20:05

mqp

71.5k14 gold badges96 silver badges123 bronze badges

3

It pushs both bytes on the stack, then it calls the "add" command. In IL, add "eats" the two values and replaces them with an int.
– Jonathan Allen
Commented Aug 8, 2010 at 5:21

Add a comment |

PeterAllenWebb · Accepted Answer · 2009-06-02 20:06:51Z

This was probably a practical decision on the part of the language designers. After all, an int is an Int32, a 32-bit signed integer. Whenever you do an integer operation on a type smaller than int, it's going to be converted to a 32 bit signed int by most any 32 bit CPU anyway. That, combined with the likelihood of overflowing small integers, probably sealed the deal. It saves you from the chore of continuously checking for over/under-flow, and when the final result of an expression on bytes would be in range, despite the fact that at some intermediate stage it would be out of range, you get a correct result.

Another thought: The over/under-flow on these types would have to be simulated, since it wouldn't occur naturally on the most likely target CPUs. Why bother?

Community · Accepted Answer · 2017-05-23 11:47:24Z

This is for the most part my answer that pertains to this topic, submitted first to a similar question here.

All operations with integral numbers smaller than Int32 are rounded up to 32 bits before calculation by default. The reason why the result is Int32 is simply to leave it as it is after calculation. If you check the MSIL arithmetic opcodes, the only integral numeric type they operate with are Int32 and Int64. It's "by design".

If you desire the result back in Int16 format, it is irrelevant if you perform the cast in code, or the compiler (hypotetically) emits the conversion "under the hood".

For example, to do Int16 arithmetic:

short a = 2, b = 3;

short c = (short) (a + b);

The two numbers would expand to 32 bits, get added, then truncated back to 16 bits, which is how MS intended it to be.

The advantage of using short (or byte) is primarily storage in cases where you have massive amounts of data (graphical data, streaming, etc.)

Jim C · Accepted Answer · 2009-06-03 12:03:46Z

1

Addition is not defined for bytes. So they are cast to int for the addition. This true for most math operations and bytes. (note this is how it used to be in older languages, I am assuming that it hold true today).

edited Jun 3, 2009 at 12:03

answered Jun 2, 2009 at 20:06

Jim C

4,98122 silver badges25 bronze badges

Add a comment |

puipuix · Accepted Answer · 2018-11-01 11:37:19Z

I've test performance between byte and int.
With int values :

class Program
{
    private int a,b,c,d,e,f;

    public Program()
    {
        a = 1;
        b = 2;
        c = (a + b);
        d = (a - b);
        e = (b / a);
        f = (c * b);
    }

    static void Main(string[] args)
    {
        int max = 10000000;
        DateTime start = DateTime.Now;
        Program[] tab = new Program[max];

        for (int i = 0; i < max; i++)
        {
            tab[i] = new Program();
        }
        DateTime stop = DateTime.Now;

        Debug.WriteLine(stop.Subtract(start).TotalSeconds);
    }
}

With byte values :

class Program
{
    private byte a,b,c,d,e,f;

    public Program()
    {
        a = 1;
        b = 2;
        c = (byte)(a + b);
        d = (byte)(a - b);
        e = (byte)(b / a);
        f = (byte)(c * b);
    }

    static void Main(string[] args)
    {
        int max = 10000000;
        DateTime start = DateTime.Now;
        Program[] tab = new Program[max];

        for (int i = 0; i < max; i++)
        {
            tab[i] = new Program();
        }
        DateTime stop = DateTime.Now;

        Debug.WriteLine(stop.Subtract(start).TotalSeconds);
    }
}

Here the result:
byte : 3.57s 157mo, 3.71s 171mo, 3.74s 168mo with CPU ~= 30%
int : 4.05s 298mo, 3.92s 278mo, 4.28 294mo with CPU ~= 27%
Conclusion :
byte use more the CPU but it cost les memory and it's faster (maybe because there are less byte to alloc)

There are many things wrong with this benchmark, please use BenchmarkDotNet instead, especially for tiny measurements like this — Olivier Giniaux, Commented Jul 19, 2023 at 11:27

fortran · Accepted Answer · 2009-06-02 20:05:02Z

0

I think it's a design decission about which operation was more common... If byte+byte = byte maybe much more people will be bothered by having to cast to int when an int is required as result.

answered Jun 2, 2009 at 20:05

fortran

75.4k26 gold badges139 silver badges177 bronze badges

2

I for once am bothered the other way :) I always seem to need the byte result, so I always have to cast.
– Roman Starkov
Commented Dec 28, 2009 at 11:53
Except you don't have to cast to int. The cast is implicit. Only the other way is explicit.
– Niki
Commented Mar 15, 2010 at 8:31
1

@nikie I think you didn't understand my answer. If adding two bytes would produce a byte, in order to prevent overflows someone would have to cast the operands (not the result) to int prior the addition.
– fortran
Commented Mar 15, 2010 at 11:04

Add a comment |

serhio · Accepted Answer · 2010-02-01 10:44:26Z

From .NET Framework code:

// bytes
private static object AddByte(byte Left, byte Right)
{
    short num = (short) (Left + Right);
    if (num > 0xff)
    {
        return num;
    }
    return (byte) num;
}

// shorts (int16)
private static object AddInt16(short Left, short Right)
{
    int num = Left + Right;
    if ((num <= 0x7fff) && (num >= -32768))
    {
        return (short) num;
    }
    return num;
}

Simplify with .NET 3.5 and above:

public static class Extensions 
{
    public static byte Add(this byte a, byte b)
    {
        return (byte)(a + b);
    }
}

now you can do:

byte a = 1, b = 2, c; c = a.Add(b);

JRoughan · Accepted Answer · 2013-04-08 11:22:33Z

In addition to all the other great comments, I thought I would add one little tidbit. A lot of comments have wondered why int, long, and pretty much any other numeric type doesn't also follow this rule...return a "bigger" type in response to arithmatic.

A lot of answers have had to do with performance (well, 32bits is faster than 8bits). In reality, an 8bit number is still a 32bit number to a 32bit CPU....even if you add two bytes, the chunk of data the cpu operates on is going to be 32bits regardless...so adding ints is not going to be any "faster" than adding two bytes...its all the same to the cpu. NOW, adding two ints WILL be faster than adding two longs on a 32bit processor, because adding two longs requires more microops since you're working with numbers wider than the processors word.

I think the fundamental reason for causing byte arithmetic to result in ints is pretty clear and straight forward: 8bits just doesn't go very far! :D With 8 bits, you have an unsigned range of 0-255. That's not a whole lot of room to work with...the likelyhood that you are going to run into a bytes limitations is VERY high when using them in arithmetic. However, the chance that you're going to run out of bits when working with ints, or longs, or doubles, etc. is significantly lower...low enough that we very rarely encounter the need for more.

Automatic conversion from byte to int is logical because the scale of a byte is so small. Automatic conversion from int to long, float to double, etc. is not logical because those numbers have significant scale.

This still doesn't explain why byte - byte returns int, or why they don't cast to short... — KthProg, Commented Oct 17, 2017 at 20:35
Why would you want addition to return a different type than subtraction? If byte + byte returns int, because 255+anything is greater than a byte can hold, it doesn't make sense to have any byte minus any other byte return anything other than an int from a return type consistency standpoint. — jrista, Commented Oct 24, 2017 at 20:48
I wouldn't, it just shows that the above reason is probably not right. If it had to do with "fitting" into the result, then byte subtraction would return a byte, and byte addition would return a short (byte + byte will always fit into a short). If it was about consistency like you say, then short would still suffice for both operations rather than int. Clearly there is a mixture of reasons, not all of them necessarily well thought-out. Or, the performance reason given below may be more accurate. — KthProg, Commented Oct 25, 2017 at 19:25

Collectives™ on Stack Overflow

Why does byte + byte = int?

16 Answers 16

C#

CLI

Not the answer you're looking for? Browse other questions tagged
c#
type-conversion
or ask your own question.

Linked

Hot Network Questions

Collectives™ on Stack Overflow

16 Answers 16

C#

CLI

Not the answer you're looking for? Browse other questions tagged c#type-conversion or ask your own question.

Linked

Related

Not the answer you're looking for? Browse other questions tagged
c#
type-conversion
or ask your own question.