Slashdot is powered by your submissions, so send in your scoop

 



Forgot your password?
typodupeerror
×
Math Education

Old-School Slashdotter Discovers and Solves Longstanding Flaw In Basic Calculus (mindmatters.ai) 222

Longtime Slashdot reader johnnyb (Jonathan Bartlett) shares the findings of a new study he, along with co-author Asatur Zh. Khurshudyan, published this week in the journal DCDIS-A: Recently a longstanding flaw in elementary calculus was found and corrected. The "second derivative" has a notation that has confused many students. It turns out that part of the confusion is because the notation is wrong. Note -- I am the subject of the article. Mind Matters provides the technical details: "[T]he second derivative of y with respect to x has traditionally had the notation 'd2 y/dx 2.' While this notation is expressed as a fraction, the problem is that it doesn't actually work as a fraction. The problem is well-known but it has been generally assumed that there is no way to express the second derivative in fraction form. It has been thought that differentials (the fundamental 'dy' and 'dx' that calculus works with) were not actual values and therefore they aren't actually in ratio with each other. Because of these underlying assumptions, the fact that you could not treat the second derivative as a fraction was not thought to be an anomaly. However, it turns out that, with minor modifications to the notation, the terms of the second derivative (and higher derivatives) can indeed be manipulated as an algebraic fraction. The revised notation for the second derivative is '(d 2 y/dx 2) - (dy/dx)(d 2 x/dx 2).'"

The report adds that while mathematicians haven't been getting wrong answers, "correcting the notation enables mathematicians to work with fewer special-case formulas and also to develop a more intuitive understanding of the nature of differentials."
This discussion has been archived. No new comments can be posted.

Old-School Slashdotter Discovers and Solves Longstanding Flaw In Basic Calculus

Comments Filter:
  • I appreciate the new form is technically more accurate but the expansion is pretty large compared to the original form... I wonder if the extra length doesn't wash out the understandability gains you get out of the original form.

    • I think the understandability of the new form would be better than the old, because the old way ways "you can treat the derivative as a fraction, except not really (you can't do foo and bar)", which was confusing. Presumably the new form will simplify to the old representation in many cases (which is why the old way was used) and people will be able to understand this and understand when the old way would be sufficient.

      That said, I haven't taken ordinary differential equations in eons and I'm not gonna
      • by johnnyb ( 4816 ) <jonathan@bartlettpublishing.com> on Wednesday April 10, 2019 @06:59PM (#58418302) Homepage

        This is my thought as well. Interestingly, I developed this while writing a book (Calculus from the Ground Up [amazon.com]) to use for my homeschool co-op calculus classes. I was trying to find a good way to explain the notation, and I literally had 20 calculus books that I read through trying to find a good explanation for the standard notation in any of them. None of them even attempted an explanation, just "this is the way it is, but don't treat it as a fraction." So, I tried to deduce the notation myself. That's when I realized that it was not just limited, it was actually wrong. So I wrote the paper and finished the book (it's Appendix B in the book).

        • Your result is awesome. Well done.
        • Now I don't feel bad that the notation never made any sense to me.
        • Awesome! I'll have to recommend your book to my homeschool co-op. I was especially impressed that you noted that the problem of the original notation derived from a philosophical cause. Too many people do not realize that philosophy plays into science and mathematics, even in how we conceptualize objective facts and concepts.
      • Re: (Score:3, Interesting)

        by Obfuscant ( 592200 )

        because the old way ways "you can treat the derivative as a fraction,

        Except the second derivative notion isn't a fraction. It's a way of writing "the second derivative of Y with respect to X" in a short form. Not all '/' create "fractions". Unless, of course, you want to argue that I'm putting a lot of "< divided by quote>" fractions in my /. postings.

        The error is not in the notation, it's in the inability to overload the / operator when dealing with more complex and abstract mathematical concepts. It's like not being able to differentiate between "e as a variable"

        • by johnnyb ( 4816 ) <jonathan@bartlettpublishing.com> on Wednesday April 10, 2019 @09:42PM (#58418980) Homepage

          Except that, in the first derivative, it *is* used as a fraction. Otherwise you couldn't reformulate your equation for integration (i.e., you have to multiply both sides by dx, which is treating it as a fraction). So, to say that in one case, it is a fraction, but this next case it isn't, but still written as a fraction, even though it *could* be written as a fraction, but we just decided not to, seems strange, at least to me.

          • (i.e., you have to multiply both sides by dx,

            I cannot remember EVER having to multiply "both sides" of anything by "dx" to do an integration. Maybe "new math" forces this.

            • by Anonymous Coward

              Yes, you do, to integrate dy/dx = x, you would multiply both sides by dx, cancelling out the denominator in dy/dx to form, dy = x dx. Throw both sides under an integral sign and go! You just memorized the rules with no understanding of why it worked...

              • No... if I wanted to solve that DE, I would integrate both sides with respect to $x$. On the right, the integral is easy enough to compute. On the left, it comes down to an application of the fundamental theorem of calculus. The Leibniz notation is convenient in this case, since it lets you treat the differential as a number, but the usual exposition relies on actual theorems which justify this kind of manipulation. It should also be noted that Abraham Robinson went over this in the 60s...

                • I've seen many explanations of "integration by substitution" that involve treating du/dx as a fraction where u is substituted for an inner function - It's described as applying the differentiation chain rule in reverse.. It seems to be the standard way of teaching it.
        • by jbengt ( 874751 )
          "d^2/dx^2" is just a shorthand way of writing "d(dy/dx)/dx". Just do the calculations in the order of the parentheses. I fail to see the problem with that. Am I missing something?
    • For every complex problem there is an answer that is clear, simple, and wrong.

      That this basic calculus equation was wrong is my new excuse for why I suck at calculus.

      • by Anonymous Coward

        The way I studied it - it was never a notation. The second order derivative was just a derivative of the derivative, there was no expression for it. Though that was in the former eastern block.

    • .....given that the original form is too verbose, it is no surprise that longer algebraic constructions are also too verbose

      The problem with both these forms is that while this may be how the theory-side thinks of it, its not how the applied side thinks of it. Trying to get from a relation stated in the original notation, to an applied calculation via algebra, leads nowhere. The second "notation" is clearly a minefield for any applied guy (*) even if algebraic manipulations are now valid.

      (*) most of the
    • I appreciate the new form is technically more accurate

      It's only technically more accurate if you read the standard form as a fraction. If you actually read the standard form as intended - a notation indicating the second derivative of y with respect to x - the standard notational form is just as accurate as is the Newtonian notation of dots to denote derivatives with respect to time.

    • by Tomahawk ( 1343 )

      From the abstract on the paper:

      "This leads to an overall simplification in working with calculus for both students and practitioners, as it allows items which are written as fractions to be treated as fractions. It prevents students from making mistakes, since their natural inclination is to treat differentials as fractions.Additionally, there are several little-known but extremely help"

      and

      "Since many in the engineering disciplines are not formally trained mathematicians, this also can prevent professionals

      • by Tomahawk ( 1343 )

        Damnit... I wish comments could be edited on this...

        "...Additionally, there are several little-known but extremely helpful formulas which are straightforwardly deducible from this new notation."

    • you can still use notations like f''(x) for brevity.
  • by JoshuaZ ( 1134087 ) on Wednesday April 10, 2019 @06:06PM (#58418080) Homepage
    There's no "flaw" in calculus. They've proposed a notation which if one used it would allow a broader range of formal manipulations to be valid. This is interesting but it isn't groundbreaking.
    • Well I could argue there was a component missing (d2y/dx2 was there but not -(dy/dx)(d2x/dx2)), hence it was a fundamental error that was causing d2y/dx2 to be a notation instead of a mathematical object, while looking like a mathematical object.

      If it was called something like (dx/dy)" maybe I would have agreed this is only a notation. But d2y/dx2 is weird enough that it pretends to say something .. which happens to be wrong.

  • Congrats (Score:5, Interesting)

    by yodleboy ( 982200 ) on Wednesday April 10, 2019 @06:25PM (#58418154)

    Figured I'd better say congratulations before the inevitable flood of people shitting on your contribution to math gets up to speed.

    • Re:Congrats (Score:5, Interesting)

      by johnnyb ( 4816 ) <jonathan@bartlettpublishing.com> on Wednesday April 10, 2019 @06:46PM (#58418232) Homepage

      Thanks! I appreciate it. Given that this was my first peer-reviewed mathematics paper, I had no idea how long the process was. I submitted the paper over a *year* ago. The necessary changes were minor. But the actual time it took to go through the process was excruciating. I'm happy to finally be on the other side :)

      • by Anonymous Coward

        You bastard, now every student will need to buy a new textbook next year! ;)

      • My first paper in chemistry was submitted to the Journal of the American Chemical Society.
        The review took over a year. This was before the internet and the editor dropped the ball. At that time,
        it was bad form to ask the editor about the status of your submission as the process was already slow and
        you did not want to pester the editor and get it treated more slowly.
        Email has changed the whole process, but some journals are still slow.

        Congratulations on the acceptance of your paper.

      • Can you give some examples of special cases that are avoided by your notation?

        • by johnnyb ( 4816 )

          Anywhere where you have a second derivative where the variable with which you are taking the derivative with respect to is dependent on another variable. You would previously have to use Faa di Bruno's formula to properly take care of this situation. Now you can just do algebraic manipulations.

      • I submitted the paper over a *year* ago. The necessary changes were minor. But the actual time it took to go through the process was excruciating.

        Apart from a few exception, yeah submitting papers takes aaageeess. Basically (and this has nothing to do with you or the quality of your work), no one wants t oread your paper. I mean people accpet they need to read and review papers as part of the academic papers, but no one WANTS to do it.

        Half of it is they want to be doing their own research. The other half i

        • by johnnyb ( 4816 )

          I recently had another paper which sat for 4 MONTHS in the editors inbox, before he decided he just wasn't interested.

          What needs to happen is to have a small change in policy like this:

          1) You can submit to multiple journals at once
          2) A journal makes an offer to send it for review
          3) Accepting an offer @2 requires that you remove your submission from other journals

          Then the procedure goes on as before. This will prevent editors from wasting everyone's time.

          What's super-super frustrating is that I had a *diffe

      • This is indeed very cool. I just skimmed the paper (so far), but I fundamentally like the idea that not treating dx/dy as a fraction is just a historical artifact.

        Bought your book.

  • by sexconker ( 1179573 ) on Wednesday April 10, 2019 @06:36PM (#58418198)

    The d, dx, dy, etc. are not things to be generally operated on.
    Writing the second derivative as d^2 / dx^2, or worse, d^2y / dx^2 is doubly absurd. (I'm using the ^ to denote supersripting, not exponentiation.)

    d represents the instantaneous rate of change (which itself is a flawed concept - a rate of change cannot be instantaneous as a rate depends on the passage of time), dx represents that instantaneous rate of change of x. d/dx represents the instantaneous rate of some value, possibly some value dependent on x, with respect to the instantaneous rate of change of x. dy/dx, dv/dt, etc. are all the same deal. That rate of change of some variables with respect to other variables.

    What is that instantaneous rate of change? The slope of a line (plane, or whatever if you've got more free variables) tangent to your function at a given point, presuming such a thing exists.

    How do you determine that tangent line? You take the target point and some point h past it ((f(x) vs f(x+h)) (or before it!) and determine what the line does when you consider h approaching 0. You make sure you can define that shit from both ends and both ends agree. If that works out, have a limit, you've got a derivative, and baby, you've got the fundamental theorem of calculus goin'.

    Whoever tried to slap that shit together as a fraction or take shortcuts and try to manipulate those symbols in a way that looks sort of like algebraic manipulation is a clown. Trying to fix that is going to be an uphill battle, but using more of the busted notation isn't really the solution.

    • Whoever tried to slap that shit together as a fraction or take shortcuts and try to manipulate those symbols in a way that looks sort of like algebraic manipulation is a clown. Trying to fix that is going to be an uphill battle, but using more of the busted notation isn't really the solution.

      100% agree.

      The thing is, superior notations are right in front of us. Programmers use a variety of them every day.

    • Re:Ugh (Score:5, Insightful)

      by BKX ( 5066 ) on Wednesday April 10, 2019 @08:00PM (#58418594) Journal

      dy/dx doesn't represent instantaneous rate of change. That would be nonsense. The d in dx and dy means "small difference that will eventually go to zero". This is why dy/dx is a fraction. It represents the limit of a small change in y divided by small change in x, as the changes go to zero. This is why we teach students about the limit definition of the derivative as being what the derivative really is. As far dy and dx being tricks of notation, they're really not. They really are small changes. There's no instantaneous rate of change. dy and dx are always finite real numbers. They never actually become zero. dy/dx is the ratio that is approached as they get smaller and smaller.

      As far as this guy's new version of the second derivative, I call bullshit. I seriously doubt that this is correct. And the notation d^2y/dx^2 actually makes sense when you think about. It's really just d(dy/dx)/dx, that is, a small change in dy/dx divided by a small change in x, where dy/dx is a small change in y divided by a small change in x. Writing it in the other way is just a good way of doing it. If you draw out what this means graphically, is becomes clear that it's really a small change between two consecutive small changes in y divided by two small changes in x, that is d(dy)/dx^2, hence d^2y/dx^2.

      This guy's new version, on the other hand, doesn't make sense at all. I mean, how do you get that from taking the derivative of the first derivative. Let's take a pretty standard function: x=1/2*t^2+2*t+12. x'=t+2; x''=1, whereas his version would be x''=1-t, which doesn't make any sense, unless he has completely redefined everything. I mean, d^2y/dx^2 would have to be something like 2t+5 and d^2x/dx^2 would have to be something like 2, and then we get x''=2t+5-(t+2)*2. I didn't read the paper so I don't know what it would actually be, but there's no doubt that x''=1, so if his method is to make any sense at all it would have to give the same results in the end. I just don't see how it could.

    • embrace the suck, apply powers to other math operators, eg: 1 +^2 3 = 7

  • "d/dx" is an operator on functions that has about a half dozen tidier, less-confusing alternate notations, while "dy/dx" is a limit of a ratio of nonzero numbers that is misleadingly written as a fraction because people in the 18th century weren't as bothered about the whole 'dividing by zero' thing.

    The fact that algebraically treating dy/dx as a fraction works in any situation at all is a minor stroke of luck that honestly should be concealed, since thinking that way already hurts the progress of a huge nu

  • Call differentiation "quark" instead. The new form for d2y/dx2 could be called a double quark or "fred" for short or f for really short. For really rigorous treatment call it f(x). In the UKoGBnNI it shall be known as noddy on Tuesdays unless the year is 2022.

    • by tepples ( 727027 )

      In the UKoGBnNI it shall be known as noddy on Tuesdays unless the year is 2022.

      In your notation, what's Big Ears?

      (And what's Mr. Wobbly Man?)

  • by epine ( 68316 ) on Wednesday April 10, 2019 @06:56PM (#58418286)

    It took me a few minutes to get to the nub of the matter.

    If you're mentally reading the notation d^2 y / dx^2 as the second derivative of y divided by dx squared, you're doing it wrong.

    Because what this notion really intends to mean is d(d(y)/dx)/dx, which as the paper points out is a different order of operation.

    A more compact notation less misleading than the traditional d^2 y / dx^2 might be (d/dx)^2 dy, which expands via two repeated function applications to d(d(y)/dx)/dx, with the underlying operations now in the right order.

    Calculus was never my best thing, so I might be all wet, but it seems to make sense.

    I never liked the dx/dy notation much, regarding it more as a cryptic code than anything conceptually helpful (when its not cryptic, it's not helpful, because that's the common case you already know).

    With the right lambda notation (riffing on what I proposed above) the fundamental operator nature of d() could be correctly expressed, even if you don't want into these algebraic manipulations, which mostly strike me as far too detailed and tedious.

  • by ArchieBunker ( 132337 ) on Wednesday April 10, 2019 @06:58PM (#58418294)

    god that's settled. Now we can figure out that P=NP problem that nobody can give a coherent answer on why its even a thing.

    • Now we can figure out that P=NP problem that nobody can give a coherent answer on why its even a thing.

      I think that problem is you tbh, if you don't understand it.

  • by Tablizer ( 95088 ) on Wednesday April 10, 2019 @07:00PM (#58418310) Journal

    I have a "math issue" that has stumped most of my professors and online math forums. Linear regression typically uses the "least squares" algorithm. However, the power of 2 seems arbitrary to me, and possibly over-emphasizes outliers.

    One professor at first said that the power of 2 makes the "best fit" in an objective sense, but later admitted that he doesn't really know, and couldn't find an answer before the end of the semester.

    While it is true that the power of 2 may simplify the computation process*, that doesn't necessarily means it produces a better result in terms of line or curve fitting. Now that we have computers to do the number crunching, perhaps it's time to embrace arbitrary or different powers (superscripts).

    (Disclaimer, I'm not a math expert.)

    * In other words, power-of-2 produces the simplest known algorithm. But my question revolves around best data fit, not computational resources nor algorithm or formula brevity. Note that when using other powers, one may have to add an absolute value function because power-of-2 automatically provides the equivalent. I actually did a simulation that tested different powers; "blurring" known datasets and seeing which power best matched the original. I couldn't find any significant difference, but probably didn't try enough samples. I tested with fractional powers also, such as 1.5, 2.5, etc.

    • by Mendenhall ( 32321 ) on Wednesday April 10, 2019 @07:23PM (#58418428)

      It's not arbitrary. There's actually a good reason for minimizing (y-yobs)^2, assuming that your observations have a Gaussian distribution. The resulting estimators provide a maximum likelihood estimator of the parameters of the distribution, if and only if it really was Gaussian. Thus, of course, if it isn't Gaussian (outliers of various sorts, et.c), the x^2 may not be the best bet. There is an entire field of 'robust estimators' of quantities, which are more resistant to outliers than least squares. There are also cases in which the underlying distribution is pathologically different from Gaussian; it could be Lorentzian (Cauchy), in which case it is so completely unlike a Gaussian, it doesn't even have a defined standard deviation (it is infinite). There are weighted methods which can fix this too.

      So, in short, least squares is the right answer (in the sense that it yields results which provable have the maximum likelihood describing the data at hand) if you have a perfect Gaussian variate; otherwise, it may well not be.

      • by bungo ( 50628 )

        Good answer.

        I'd go a step further and say that the purpose of linear regression is to see if there is a relationship in the data, and not to provide an actual answer to what the relationship exactly is.

        In the real world, data relationships are rarely linear and distributions are often not known or are approximated. A linear regression will give you an idea on what is going on. The real relationship maybe too complex to ever know.

        So, using least squares, well, it's probably good enough or at least a good sta

    • by sfcat ( 872532 )

      I have a "math issue" that has stumped most of my professors and online math forums. Linear regression typically uses the "least squares" algorithm. However, the power of 2 seems arbitrary to me, and possibly over-emphasizes outliers.

      One professor at first said that the power of 2 makes the "best fit" in an objective sense, but later admitted that he doesn't really know, and couldn't find an answer before the end of the semester.

      While it is true that the power of 2 may simplify the computation process*, that doesn't necessarily means it produces a better result in terms of line or curve fitting. Now that we have computers to do the number crunching, perhaps it's time to embrace arbitrary or different powers (superscripts).

      (Disclaimer, I'm not a math expert.)

      * In other words, power-of-2 produces the simplest known algorithm. But my question revolves around best data fit, not computational resources nor algorithm or formula brevity. Note that when using other powers, one may have to add an absolute value function because power-of-2 automatically provides the equivalent. I actually did a simulation that tested different powers; "blurring" known datasets and seeing which power best matched the original. I couldn't find any significant difference, but probably didn't try enough samples. I tested with fractional powers also, such as 1.5, 2.5, etc.

      Those other exponents probably also work. That term is just to help estimate the slope but any exponent > 1 would likely work there. Its just that its impractical for a variety of reasons including the fact that linear regression is just too simple a model for anything but the simplest use cases. Other techniques which aren't built upon linear regression are used instead so nobody studies this. You very well might be right about outliers for some use cases but it doesn't matter as other techniques ar

    • However, the power of 2 seems arbitrary to me, and possibly over-emphasizes outliers.

      It's not arbitrary, and yes it does emphasise outliers.

      This is going to be hard to explain given slashdot's formatting ability but here goes...

      For least squares, the assumption is that the noise is Gaussian, where every datapoint has the same (unknown) noise variance. If you have some linear function of parameters f(a), what you're doing is finding the a that maximises the probability of observing the data.

      Assume you have d

    • by Ihlosi ( 895663 )
      However, the power of 2 seems arbitrary to me,

      It is not arbitrary!. Using the power of two allows a simple, possible even trivial analytical solution of the problem (Matlab and similar have it built-in and can do it in a single line).

      Of course you could use other norms to minimize the regression error - l1 norm, linf norm or any other norm in between, or even any other norm you can come up with. But in these cases, you end up with optimization problems that do not have analytical solutions and require

    • by gotan ( 60103 )

      But what is the "best data fit"?
      That depends on the kind of data, the kind of errors one expects and the properties the fit should have.

      Linear regression yields a result with some well known properties, e.g. the resulting linear function passes through the center of gravity. Maybe that's a desirable property. In other cases the y_i could be the result of a measuring process with a gaussian error distribution (where larger errors become more unlikely). Due to the central limit theorem that is often the case,

  • by porky_pig_jr ( 129948 ) on Wednesday April 10, 2019 @07:13PM (#58418386)

    Leibniz' notation is normally treated as a "suggestive kind", never to be understood literally. The origin of notation d^2/dx^2 goes from applying d/dx to d/dx, but d/dx only means "a derivative w.r.t. x" and nothing else. Sometime taking this notation literally and doing manipulations as if it were the regular fractions work (and that's b.t.w. is attributed to the early discoveries of many differentiation and integration rules), but it doesn't work most of the time. Any decent book on Calculus should point out that fact. Working with fractions helps to discover some rules, yes, but it's never rigorous, it's more like discovering something in a heuristic way, but then you still need a rigorous proof and that involves going back to basic definition of limits, not arguing in terms of "infinitesimals" (yes, I'm aware of Robinson's "non-standard calculus", but IMHO it's not a mainstream approach. Cheers.

  • but Lagrange's notation > Leibniz's notation anyhow.

    If you want to do wizardry by manipulating the notation itself, then by all means use '(d 2 y/dx 2) - (dy/dx)(d 2 x/dx 2)'.

    Kudos to johnnyb

  • by fahrbot-bot ( 874524 ) on Wednesday April 10, 2019 @07:20PM (#58418422)

    Old-School Slashdotter Discovers and Solves Longstanding Flaw In Basic Calculus

    Can occasionally be heard yelling at younger mathematicians: "Get off my lambda"

    • by jrumney ( 197329 )
      I'd always dismissed old folks groanings about how easy the kids have it compared to their day. I went all the way through K12 and university with a fairly heavy calculus component to my degree, without ever encountering the second derivative of y with respect to x, and I'm not exactly young. But this guy considers this to be "elementary" calculus, so his old elementary school must have been hard core.
      • by johnnyb ( 4816 )

        You never did a second derivative test to determine whether you are at a local minima or maxima?

        Most intro calculus books at least show the notation for the second derivative. However, it is true that they rarely take it far enough to hit any problems with the notation.

        I actually figured this out while trying to find a good way to explain the notation to my students, which is a homeschool co-op class (I have a range of 9-12 graders - the 9th grader is an exception, but she is ridiculously smart). I read t

        • by primebase ( 9535 )
          I'm just impressed that there's anyone still left around here with a lower user# than mine! Congratulations on your innovation and publication!!
  • Very cool! I think the paper will help me understand more deeply problems with the notation I've fought with many times!
    However, I'm a bit disappointed that the notion of partial vs. full derivative wasn't raised, which I think is very relevant to the question...

    • by johnnyb ( 4816 ) <jonathan@bartlettpublishing.com> on Wednesday April 10, 2019 @10:24PM (#58419136) Homepage

      I've actually got a second paper on partial derivatives just about ready to go. It was originally part of this paper, but it got a little long, and I wanted to rethink and clarify a few concepts. Anyway, partial differentials have the same notational problem *plus* one more. The problem is that there are several partial differentials which all go by the same name. Once you name them properly (i.e., give them each a distinct name) the problems go away.

      • by kackle ( 910159 )
        Well, congratulations, and thank you for putting forth the extra effort to help (future) mankind.
  • Comment removed based on user account deletion
  • In the form of a frog meme?
  • by ledow ( 319597 ) on Thursday April 11, 2019 @05:05AM (#58419862) Homepage

    Often in maths, a mere change of notation, analogous equation in another field, or just looking at things in a slightly different way will open up whole new areas of maths.

    Fermat's Last Theorem took forever to prove and the proof relies on translating the problem to a completely unrelated area of maths, solving it there, and then translating the results back.

    And if you do things like use polar coordinates, etc. some areas of maths burst open with good sense and nice equations.

    Something as simple as a notation change can work wonders. But this is just for convenience of amateurs who don't understand what a derivative actually is and does. It's like saying "Don't use the word multiplication for vectors, because it's not the same as for scalars". We know. Anyone handling it knows. Anyone dumb enough to confuse the notations is going to find out very quickly that nothing works. Sure, it might help if you've literally never done those kinds of equations before, but likely then you'll not be making any ground-breaking mathematical discoveries any time soon.

    Things don't tend to survive hundreds of years for no reason, especially when they are one pen-stroke away from being changed, and have themselves gone through several notational iterations in their time.

    I got through a degree in maths without thinking "Well, this notation is stupid", including three years of advanced calculus.

    If you don't understand the notation, that's the very least of your worries as regards actually doing any calculus.

  • Will revolutionize calculus teaching

Save energy: Drive a smaller shell.

Working...