Recently Active 'floating-point' Questions - Mathematics Stack Exchange

0 votes

2 answers

10k views

What is the result of 3-digit chopping for 0.000234?

I am trying to understand if the 0."000" part counted as digit or not. If 0."000" is not "digit", then the result should 0.000234. If yes, then the result should be 0.00 Which one is correct? Here'...

Nothing special

2,262

modified Jul 11 at 3:36

4 votes

2 answers

110 views

pow and its relative error

Investigating the floating-point implementation of the $\operatorname{pow}(x,b)=x^b$ with $x,b\in\Bbb R$ in some library implementations, I found that some pow ...

Penguino

1,249

answered Jun 25 at 22:56

1 vote

2 answers

964 views

How to calculate the inverse of sum of a Kronecker product and a diagonal matrix

I want to calculate the inverse of a matrix of the form $S = (A\otimes B+C)$, where $A$ and $B$ are symetric and invertible, $C$ is a diagonal matrix with positive elements. Basically if the ...

CommunityBot

1

modified May 21 at 2:02

2 votes

2 answers

2k views

How do 24 significant bits give from 6 to 9 significant decimal digits?

was reading IEEE 754 single-precision binary floating-point format: binary32 when I ran into The IEEE 754 standard specifies a binary32 as having: Sign bit: 1 bit Exponent width: 8 bits ...

user1018684

1

modified May 15 at 8:16

4 votes

2 answers

14k views

The upper and lower limits of IEEE-754 standard

So there's something I just can't understand about ieee-754. The specific questions are: Which range of numbers can be represented by IEEE-754 standard using base 2 in single (double) precision? ...

user1018684

1

modified May 13 at 17:19

0 votes

3 answers

2k views

How to convert 601.0 to IEEE-754 Single Precision

I am trying to understand how to convert from decimal to IEEE-754 Single Precision binary representation. I make up a random number which happen to be 601.00 I tried my best to figure it out and ...

user1018684

1

modified May 12 at 7:40

0 votes

1 answer

493 views

Floating point arithmetic ( IEEE-754 standard ) commutative law (*,+)

How can I prove that: $ fl(a \ op \ b) = fl(b \ op \ a), \: op = +,*.$. I have been reading and searching the big majority say that its true. like here. However, I can not find a mathematical proof ...

user1018684

1

modified May 11 at 12:27

0 votes

1 answer

2k views

Find point in 3D plane

I have four points in a 3D space, example: $$(0,0,1),\ (1,0,1),\ (1,0,2)\ \mbox{and}\ (0,0,2).$$ Then I have a 2D position on that square plane: $$x = 0.5,\ y = 0.5.$$ I need to find out the 3D ...

CommunityBot

1

modified Apr 24 at 6:02

16 votes

5 answers

12k views

Solving a quadratic equation with precision when using floating point variables

I know how to solve a basic quadratic equation with the formula $$t_{1,2}=\dfrac{-b\pm\sqrt{b^2-4ac}}{2a}$$ but I learned that if $b \approx \sqrt{b^2-4ac}$ floating point precision may give ...

Marco Lo Monaco

11

modified Apr 10 at 1:04

1 vote

2 answers

64 views

How to transform this expression to a numerically stable form?

I have this function $$f(x, t)=\frac{\left(1+x\right)^{1-t}-1}{1-t}$$ Where $x \ge 0$ and $t \ge 0$. I want to use it in neural network, and thus need it to be differentiable. While it has a ...

Claude Leibovici

268k

answered Apr 2 at 13:51

6 votes

0 answers

143 views

Algebraic Structures involving 𝙽𝚊𝙽 (absorbing element).

IEEE 754 floating point numbers contain the concept of 𝙽𝚊𝙽 (not a number), which "dominates" arithmetical operations ($+,-,⋅,÷$ will return ...

Hyperplane

11.8k

modified Mar 25 at 8:54

2 votes

2 answers

10k views

Floating point number,Mantissa,Exponent

In this computer, numbers are stored in $12$-bits. We will also assume that for a floating point (real) number, $6$ bits of these bits are reserved for the mantissa (or significand) with $2^{k-1}-1$ ...

CommunityBot

1

modified Mar 12 at 11:01

3 votes

0 answers

53 views

Solve $10^{10^z} = 10^{10^x}+10^{10^y}$ for $z$ with floating point accuracy

In the following equation $$10^{10^z} = 10^{10^x}+10^{10^y}$$ I want to find an algorithm that computes $z$ in a floating point accurate manner given any values of $x$ and $y$ (e.g. $x=y=2000$). The ...

Gerben Beintema

39

modified Mar 10 at 19:29

1 vote

0 answers

49 views

Proof that $\epsilon_{mach} \leq \frac{1}{2} b^{1-n}$

I have a question about the proof of the following statement: For each set of machine numbers $F(b, n, E_{min}, E_{max})$ with $E_{min} < E_{max}$ the following inequality holds: $\epsilon_{mach} \...

Scott Hootman-Ng

33

modified Feb 8 at 0:01

1 vote

1 answer

75 views

Find original inputs $x$ and $^y$ for a given product, possible or not? [closed]

$387,381,625,547,900,583,936$ is the product of this calculation $21\cdot2^{64}$. If I only have the product and the multiplier $2$ (without the exponent) would it be possible to find the other inputs ...

Thirdy Yabata

628

modified Feb 7 at 14:49

Stack Exchange Network

Questions tagged [floating-point]

What is the result of 3-digit chopping for 0.000234?

pow and its relative error

How to calculate the inverse of sum of a Kronecker product and a diagonal matrix

How do 24 significant bits give from 6 to 9 significant decimal digits?

The upper and lower limits of IEEE-754 standard

How to convert 601.0 to IEEE-754 Single Precision

Floating point arithmetic ( IEEE-754 standard ) commutative law (*,+)

Find point in 3D plane

Solving a quadratic equation with precision when using floating point variables

How to transform this expression to a numerically stable form?

Algebraic Structures involving 𝙽𝚊𝙽 (absorbing element).

Floating point number,Mantissa,Exponent

Solve $10^{10^z} = 10^{10^x}+10^{10^y}$ for $z$ with floating point accuracy

Proof that $\epsilon_{mach} \leq \frac{1}{2} b^{1-n}$

Find original inputs $x$ and $^y$ for a given product, possible or not? [closed]

Hot Network Questions

Questions tagged [floating-point]

Related Tags