Skip to main content

Questions tagged [floating-point]

Mathematical questions concerning floating point numbers, a finite approximation of the real numbers used in computing.

0 votes
2 answers
10k views

What is the result of 3-digit chopping for 0.000234?

I am trying to understand if the 0."000" part counted as digit or not. If 0."000" is not "digit", then the result should 0.000234. If yes, then the result should be 0.00 Which one is correct? Here'...
4 votes
2 answers
110 views

pow and its relative error

Investigating the floating-point implementation of the $\operatorname{pow}(x,b)=x^b$ with $x,b\in\Bbb R$ in some library implementations, I found that some pow ...
1 vote
2 answers
964 views

How to calculate the inverse of sum of a Kronecker product and a diagonal matrix

I want to calculate the inverse of a matrix of the form $S = (A\otimes B+C)$, where $A$ and $B$ are symetric and invertible, $C$ is a diagonal matrix with positive elements. Basically if the ...
2 votes
2 answers
2k views

How do 24 significant bits give from 6 to 9 significant decimal digits?

was reading IEEE 754 single-precision binary floating-point format: binary32 when I ran into The IEEE 754 standard specifies a binary32 as having: Sign bit: 1 bit Exponent width: 8 bits ...
4 votes
2 answers
14k views

The upper and lower limits of IEEE-754 standard

So there's something I just can't understand about ieee-754. The specific questions are: Which range of numbers can be represented by IEEE-754 standard using base 2 in single (double) precision? ...
0 votes
3 answers
2k views

How to convert 601.0 to IEEE-754 Single Precision

I am trying to understand how to convert from decimal to IEEE-754 Single Precision binary representation. I make up a random number which happen to be 601.00 I tried my best to figure it out and ...
0 votes
1 answer
493 views

Floating point arithmetic ( IEEE-754 standard ) commutative law (*,+)

How can I prove that: $ fl(a \ op \ b) = fl(b \ op \ a), \: op = +,*.$. I have been reading and searching the big majority say that its true. like here. However, I can not find a mathematical proof ...
0 votes
1 answer
2k views

Find point in 3D plane

I have four points in a 3D space, example: $$(0,0,1),\ (1,0,1),\ (1,0,2)\ \mbox{and}\ (0,0,2).$$ Then I have a 2D position on that square plane: $$x = 0.5,\ y = 0.5.$$ I need to find out the 3D ...
16 votes
5 answers
12k views

Solving a quadratic equation with precision when using floating point variables

I know how to solve a basic quadratic equation with the formula $$t_{1,2}=\dfrac{-b\pm\sqrt{b^2-4ac}}{2a}$$ but I learned that if $b \approx \sqrt{b^2-4ac}$ floating point precision may give ...
1 vote
2 answers
64 views

How to transform this expression to a numerically stable form?

I have this function $$f(x, t)=\frac{\left(1+x\right)^{1-t}-1}{1-t}$$ Where $x \ge 0$ and $t \ge 0$. I want to use it in neural network, and thus need it to be differentiable. While it has a ...
6 votes
0 answers
143 views

Algebraic Structures involving 𝙽𝚊𝙽 (absorbing element).

IEEE 754 floating point numbers contain the concept of 𝙽𝚊𝙽 (not a number), which "dominates" arithmetical operations ($+,-,⋅,÷$ will return ...
2 votes
2 answers
10k views

Floating point number,Mantissa,Exponent

In this computer, numbers are stored in $12$-bits. We will also assume that for a floating point (real) number, $6$ bits of these bits are reserved for the mantissa (or significand) with $2^{k-1}-1$ ...
3 votes
0 answers
53 views

Solve $10^{10^z} = 10^{10^x}+10^{10^y}$ for $z$ with floating point accuracy

In the following equation $$10^{10^z} = 10^{10^x}+10^{10^y}$$ I want to find an algorithm that computes $z$ in a floating point accurate manner given any values of $x$ and $y$ (e.g. $x=y=2000$). The ...
1 vote
0 answers
49 views

Proof that $\epsilon_{mach} \leq \frac{1}{2} b^{1-n}$

I have a question about the proof of the following statement: For each set of machine numbers $F(b, n, E_{min}, E_{max})$ with $E_{min} < E_{max}$ the following inequality holds: $\epsilon_{mach} \...
1 vote
1 answer
75 views

Find original inputs $x$ and $^y$ for a given product, possible or not? [closed]

$387,381,625,547,900,583,936$ is the product of this calculation $21\cdot2^{64}$. If I only have the product and the multiplier $2$ (without the exponent) would it be possible to find the other inputs ...

15 30 50 per page
1
2 3 4 5
32