Is the theory of dual numbers strong enough to develop real analysis, and does it resemble Newton's historical method for doing calculus?

Question

I've been interested in non-standard analysis recently. I was reading up on it and noticed the following interesting comment on the Wikipedia page about hyperreal numbers, right after giving an example of a nonstandard differentiation:

The use of the standard part in the definition of the derivative is a rigorous alternative to the traditional practice of neglecting the square of an infinitesimal quantity... the typical method from Newton through the 19th century would have been simply to discard the $dx^2$ term.

I've never heard anything like this before, and really find it fascinating that Newton's method was to define the relation $dx^2 = 0$. If we actually formalize the above structure by taking $\mathbb{R}$ and adjoining an element $dx^2 = 0$ to it, we get the "dual numbers," isomorphic to the quotient ring $\mathbb{R}[x]/x^2$. I'd seen some things about how this algebra plays into automated differentiation algorithms for some computer software systems, but I've never heard anything about Newton directly working in this algebra. So I have a few questions:

Does anyone have more historical information on the way that Newton performed differentiation, and its relation to the dual numbers?
Does anyone know how effectively real analysis can be formalized with the dual numbers? Does the resulting system play nice enough to develop all of the important modern results?
If we start with $\mathbb{C}[x]/x^2$ instead, can we likewise develop complex analysis?

Since this idea is so simple, I'm very curious how powerful it is. I'm also curious if it has any major drawbacks too, since I'm not sure why anyone would mess with the foundational baggage involved in defining the hyperreals if this simple 2-dimensional real algebra could really do the trick.

The "major drawback" as you put it is that an ordinary function on the reals does not extend to the dual numbers. — Mikhail Katz, Dec 19 '14 at 9:02

Ittay Weiss · Accepted Answer · 2013-03-26 09:02:37Z

up vote 10 down vote accepted

The biggest draw back (and it's a big one) is that the ring of dual numbers is not a field. It has plenty of zero divisors. So, Newton, or any of the mathematicians of the early days of calculus, certainly did not work directly in the ring of dual numbers. They of course did not consider the ring to exist (as rings did not exist at all yet), but from their writing it is clear they envisaged a field of real numbers with, somehow, some notions of infinitesimals. Their work is of course very vague, but correct. Much more on that can be found in math history books. Many interesting discussions can be found in the recent book "Adventures in Formalism", also related to the early days of calculus and how things developed.

Some (rather unsatisfactory) portions of analysis can be developed in the ring of dual numbers, but it does not go too far. The idea, as you say, is very simple, perhaps too simple. One immediately gets into trouble when trying to define the derivative as the quotient of the infinitesimal $f(x+h)-f(x)$ divided by $h$, where $h$ is infinitesimal. The difficulty is that the non-zero infinitesimals in the ring of dual numbers are not invertible. So, it's the end of the party. (As you say though, some aspects of the party remain with automatic differentiation). In some sense, the dual numbers form a first order approximation to actual infinitesimals: The square of an infinitesimal is of an order of magnitude smaller than the infinitesimal you started with, but in the ring of dual numbers, the square of an 'infinitesimal' is precisely $0$. So, in a nonstandard model of the reals you have whole layers of infinitesimals. In the dual numbers there is only one layer, nothing in it is invertible, and they all square to $0$.

The book Models for smooth infinitesimal analysis explores many different models for analysis with infinitesimals. None of them is particularly simple.

answered Mar 26 '13 at 9:02

Ittay Weiss

58k589167

8

On the other hand, for a polynomial $f$ we have that $f(x+\varepsilon)-f(x)$ equals $f'(x) \cdot \varepsilon$, so that $f'(x)=\frac{f(x+\varepsilon)-f(x)}{\varepsilon}$ in $k[\varepsilon]/\varepsilon^2 [x]$. This also illustrates that the ring of dual numbers is useful in algebraic geometry. – Martin Brandenburg Mar 26 '13 at 9:13

2

Thanks for the detailed response and the reference, which I'll definitely check out. A question: is there any utility in adjoining an element $\omega = \frac{1}{\epsilon}$ to the ring, which then has the property that $\omega^2 = \infty$ (in the extended real line sense)? This still wouldn't give you a field, but it would at least make the various dual numbers invertible, and having derivatives take values in $\mathbb{R} \cup {\infty}$ doesn't seem that far out there. – Mike Battaglia Mar 26 '13 at 9:19

1

Quick afterthought since I can't edit: That should say $\mathbb{R} \cup \{\infty, -\infty\}$ above. So you'd get numbers of the form $a + b\epsilon + c\omega$, where $\epsilon^2 = 0$ and $\omega^2 = \infty$ and $-\omega^2 = -\infty$. Not a field, but still seems possibly useful... – Mike Battaglia Mar 26 '13 at 9:25

1

@MikeBattaglia, you will still have the problem of there not being any layers of infinitesimals. The dual number is a rather crude model for adjoining infinitesimals. As Martin explains in his answer the comment, the dual numbers do find applications in algebraic geometry. But for analysis, it does not look promising. – Ittay Weiss Mar 26 '13 at 9:32

1

The main problem with the dual numbers is not that they are not a field, but that an ordinary function does not extend to the dual numbers, and therefore this approach is not helpful in calculus and analysis. – Mikhail Katz Dec 19 '14 at 9:01

| show 10 more comments

Martin Brandenburg · Answer 2 · 2013-03-26 08:55:01Z

up vote 7 down vote

No for 1. and 3., this ring is not really useful in analysis. But it is quite important for analytical considerations in algebraic geometry, the main reason being that the scheme $\mathrm{Spec}(k[\varepsilon]/\varepsilon^2)$ classifies tangent vectors. This makes it possible to define the tangent space of arbitrary functors $F : \mathsf{CRing} \to \mathsf{Set}$ at some $x \in F(k)$, namely as the fiber of $F(k[\varepsilon]/\varepsilon^2) \to F(k)$ at $x$. There is no manifold which represents tangent vectors for manifolds, so this is the main difference.

answered Mar 26 '13 at 8:55

Martin Brandenburg

98.4k12123274

This seems like a great response but unfortunately I'm not able to understand it yet! I've just started Hartshorne now but I'm still working on sheaves and haven't gotten to schemes yet. I'm going to have to come back to this in a few weeks and see if it all makes sense then.. – Mike Battaglia Mar 26 '13 at 9:23

Better look up the notion of a derivation and try to prove that homomorphisms of rings $A \to k[\varepsilon]/\varepsilon^2$ correspond 1:1 to pairs consisting of a homomorphism of rings $A \to k$ and a $k$-derivation $A \to k$. This is the observation on which everything else rests. – Martin Brandenburg Mar 26 '13 at 9:51

@MikeBattaglia There is a related explanation of first order infinitesmial symmetries in one of Qiaochu Yuan's old blogposts that may or may not help you develop a picture of the above idea. – rschwieb Mar 26 '13 at 10:01

Thanks for the references, everyone - will check those out. – Mike Battaglia Mar 27 '13 at 21:20

add a comment |

Mikhail Katz · Answer 3 · 2013-04-16 11:49:33Z

up vote 4 down vote

The answers by Ittay Weiss and Martin Brandenburg are helpful. I would like to point out a more direct shorcoming of the dual numbers as far as analysis (and even calculus) is concerned is that it is not clear how to extend a generic real function to the dual numbers, even say a $C^\infty$ smooth function. Thus, if one wishes to form a ratio of infinitesimals involved in the definition of the derivative, it is not clear what should appear in the numerator. Over the hyperreals, one has a systematic way of extending every real function to the wider hyperreal domain, and the transfer principle (which is arguably a formalisation of the Leibnizian Law of Continuity) ensures that such an extension is meaningful.

For this reason, the answer to the original question would be: No, dual numbers are insufficient to capture "Newton's historical method for doing calculus". The hyperreals provide a framework where the procedures of 17th century infinitesimal calculus can be successfully formalized.

edited Apr 16 '13 at 11:49

answered Apr 16 '13 at 11:04

Mikhail Katz

23.7k12072

I never saw this before, but since I'm seeing it now: I mentioned above, in my comments to Ittay, an algebraic structure where $\omega = 1/\epsilon$ is defined, with the property $\omega^2 = \infty$ and $-\omega^2 = -\infty$, taken from the extended real line $\mathbb{R} \cup \{\infty, -\infty\}$. Thus, every number in this system is of the form $a+b\epsilon+c\omega$, other than the special numbers $\infty$ and $-\infty$. This makes all sorts of things invertible which weren't invertible before. Would this still pose a problem in extending generic real functions to the dual numbers? – Mike Battaglia Aug 16 '13 at 7:01

3

@Mike: One immediate problem is that the condition $\epsilon^2=0$ would not allow us to carry out the usual algebraic simplifications on the relation $(\epsilon\omega)^2=1$. Thus one of the ordinary rules of algebra has to break down, which is inconvenient in a number system. In the hyperreal framework all ordinary rules of algebra continue to hold over the extended domain. – Mikhail Katz Aug 16 '13 at 7:49

Aha, great point. I never saw that before. Thanks. – Mike Battaglia Aug 16 '13 at 8:02

add a comment |

Michael · Answer 4 · 2013-11-29 17:49:16Z

up vote 3 down vote

I think no one mentioned Synthetic differential geometry, there you have ~~nonzero~~ quantities with $dx^2=0$. For a very readable introduction I suggest:

Bell, A primer of infinitesimal Analysis

edited Nov 29 '13 at 17:49

answered Nov 28 '13 at 10:29

Michael

602313

2

Note that these quantities aren't actually nonzero. What one can say about them is that one cannot prove that they are zero; nor can one prove that they are nonzero. This is of course only possible when the background logic is intuitionistic. – Mikhail Katz Nov 28 '13 at 19:14

Right. But at least one can prove that the set of all nilsquare elements does not reduce to the set $\{0\}$ – Michael Nov 28 '13 at 21:44

1

How does that go? Is this really a "set" in the traditional sense? I recall that one needs a more sophisticated topos theory setting to make this work. Does one get a nonempty set of which one cannot exhibit any element? – Mikhail Katz Nov 29 '13 at 8:22

1

@user72694 have a look at page 5 of these notes: home.sandiego.edu/~shulman/papers/sdg-pizza-seminar.pdf – Michael Nov 29 '13 at 17:39

| show 1 more comment

user48672 · Answer 5 · 2014-12-18 16:50:02Z

up vote 2 down vote

In the dual numbers for any differentiable function holds $ f(x+\epsilon) = f(x) + \epsilon f^\prime (x)$. This is enough to handle computationally 1st derivatives. Of course it is not enough for the conventional definition of second derivative. So you can consider the duals as a computational model. Of course, on of the drawbacks is that the purely imaginary numbers are not invertible.

answered Dec 18 '14 at 16:50

user48672

515210

This is a dubious claim. How do you define $f(x+\epsilon)$ for dual numbers if $f$ is an ordinary function? – Mikhail Katz Dec 19 '14 at 8:59

Well you can extend the function to the duals. On the real axis of the dual plane it will retain the same values as the function prototype acting on the reals. Automatic differentiation algorithms rely exactly on this extension. – user48672 Dec 20 '14 at 0:28

What does "retaining the same values" mean exactly? Does it mean that $f(x+\epsilon)=f(x)$? – Mikhail Katz Dec 20 '14 at 19:48

It means $f ( x + \epsilon . 0) = f (x)$. Think about $\epsilon $ as an imaginary unit similar to $i$. – user48672 Dec 21 '14 at 0:18

You can define it that way but this definition is useless for defining the derivative of $f$, because according to your definition the derivative $\frac{f(x+dx)}{dx}$ will be always zero, with $dx$ the nilpotent infinitesimal. – Mikhail Katz Dec 21 '14 at 9:35

| show 6 more comments

Hurkyl · Answer 6 · 2016-09-22 13:00:22Z

Yes... and no....

On the one hand, the dual numbers $\mathbb{R}[\epsilon] / (\epsilon^2)$ are a topological ring, and the projection $\mathbb{R}[\epsilon] / (\epsilon^2) \to \mathbb{R}$ is a vector bundle over the real line. In fact, it is isomorphic (as a vector bundle) to the tangent bundle $T\mathbb{R}$ and to the cotangent bundle $T^*\mathbb{R}$ in a rather suggestive way.

On the other hand, aside from being a neat way to differentiate polynomials, I'm not sure it actually does anything for you. e.g. while it's interesting to organize derivative information such as $\log(x+\epsilon y) = \log(x) + \epsilon \frac{y}{x}$, I don't know if it actually does anything to help you derive such formula.

On the cotangent side, matters are worse — I'm not aware of anything the dual numbers could do for you that the exterior algebra doesn't do as well or better.

j4n bur53 · Answer 7 · 2017-01-29 03:53:14Z

Duals Numbers, attributed to Eduard Study, are already practically used, for example here:
https://github.com/JuliaDiff/DualNumbers.jl

From my point of view division seems not so much a problem, since it fails when ordinary division would also fail. Here is a set of arithmetic operations defined for duals, I am writing (x,y) instead of x+εy:

-(x,y) = (-x,-y)

(x,y)+(z,t) = (x+z,y+t)

(x,y)*(z,t) = (x*z,x*t+y*z)

(x,y)/(z,t) = (x/z, (y*z-x*t)/z^2)

I guess the claim that duals cannot be used to define derivative, stems from a confusion with Jerome Keislers standard part. He writes translated to dual equations the following, and division is exactly the problem:

f((x,h)) - f((x,0))
------------------- = (f'(x), e) /* doesn't work */
      (0,h)

But if we use the hypothesis:

f((x,y)) = (f(x), f'(x)*y)

We then find the following by using this hypothesis and the aforementioned arithmetic operations:

f((x,h)) - f((x,0))
------------------- = (0, f'(x))  /* works */
      (h,0)

And if this isn't convincing enough, we can also use the hypothesis to show, that duals reflect the chain rule:

f(g( (x,1) )) = f( (g(x), g'(x)) )

              = ( f(g(x)), f'(g(x))*g'(x) )

Hyperduals are an extension of duals where second or higher order derivatives can be also calculated.

But currently I rather would wish for duals that can compute f(x+) and f(x-) for me, i.e. left and right derivative. Currently experimenting.

J. weisz · Answer 8 · 2015-05-26 01:31:00Z

This seems to be generating a lot of confusion.

It is simply that f(x+he)=f(x)+he df/dx where ee=0 ; algebraically and precisley for Taylor expandable differentiable functions.(ie analytic functions)

You can go through the table of all the elementary functions and write out the answer using this algebraic definition. Also proove all properties of derivatives. It is no better nor worse than the Lagrange definition of derivative, of whatever is sitting on the second term of the expansion.

Probably good for educational purposes, no torture using limit concept, the derivative is simply there. Of course analyists may misunderstand the algebra part. It is a ring, not a field. The he part is an ideal of the ring.

asked	4 years, 3 months ago
viewed	3,258 times
active	5 months ago

current community

your communities

more stack exchange communities

Is the theory of dual numbers strong enough to develop real analysis, and does it resemble Newton's historical method for doing calculus?

8 Answers 8

Your Answer

Not the answer you're looking for? Browse other questions tagged calculus real-analysis abstract-algebra math-history nonstandard-analysis or ask your own question.

Visit Chat

Linked

Hot Network Questions

current community

your communities

more stack exchange communities

Is the theory of dual numbers strong enough to develop real analysis, and does it resemble Newton's historical method for doing calculus?

8 Answers 8

Your Answer

Sign up or log in

Post as a guest

Not the answer you're looking for? Browse other questions tagged calculus real-analysis abstract-algebra math-history nonstandard-analysis or ask your own question.

Visit Chat

Linked

Related

Hot Network Questions