Slashdot is powered by your submissions, so send in your scoop

Old-School Slashdotter Discovers and Solves Longstanding Flaw In Basic Calculus (mindmatters.ai) 222

Posted by BeauHD on Wednesday April 10, 2019 @05:58PM from the unnecessary-confusion dept.

Longtime Slashdot reader johnnyb (Jonathan Bartlett) shares the findings of a new study he, along with co-author Asatur Zh. Khurshudyan, published this week in the journal DCDIS-A: Recently a longstanding flaw in elementary calculus was found and corrected. The "second derivative" has a notation that has confused many students. It turns out that part of the confusion is because the notation is wrong. Note -- I am the subject of the article. Mind Matters provides the technical details: "[T]he second derivative of y with respect to x has traditionally had the notation 'd2 y/dx 2.' While this notation is expressed as a fraction, the problem is that it doesn't actually work as a fraction. The problem is well-known but it has been generally assumed that there is no way to express the second derivative in fraction form. It has been thought that differentials (the fundamental 'dy' and 'dx' that calculus works with) were not actual values and therefore they aren't actually in ratio with each other. Because of these underlying assumptions, the fact that you could not treat the second derivative as a fraction was not thought to be an anomaly. However, it turns out that, with minor modifications to the notation, the terms of the second derivative (and higher derivatives) can indeed be manipulated as an algebraic fraction. The revised notation for the second derivative is '(d 2 y/dx 2) - (dy/dx)(d 2 x/dx 2).'"

The report adds that while mathematicians haven't been getting wrong answers, "correcting the notation enables mathematicians to work with fewer special-case formulas and also to develop a more intuitive understanding of the nature of differentials."

This discussion has been archived. No new comments can be posted.

Old-School Slashdotter Discovers and Solves Longstanding Flaw In Basic Calculus

Load All Comments

Search 222 Comments Log In/Create an Account

Comments Filter:

Seems quite a lot larger... (Score:2, Insightful)

by SuperKendall ( 25149 ) writes:

I appreciate the new form is technically more accurate but the expansion is pretty large compared to the original form... I wonder if the extra length doesn't wash out the understandability gains you get out of the original form.
- Re: (Score:2)
  
  by Shane_Optima ( 4414539 ) writes:
  
  I think the understandability of the new form would be better than the old, because the old way ways "you can treat the derivative as a fraction, except not really (you can't do foo and bar)", which was confusing. Presumably the new form will simplify to the old representation in many cases (which is why the old way was used) and people will be able to understand this and understand when the old way would be sufficient.
  
  That said, I haven't taken ordinary differential equations in eons and I'm not gonna
  - Re:Seems quite a lot larger... (Score:5, Informative)
    
    by johnnyb ( 4816 ) writes: <jonathan@bartlettpublishing.com> on Wednesday April 10, 2019 @06:59PM (#58418302) Homepage
    
    This is my thought as well. Interestingly, I developed this while writing a book (Calculus from the Ground Up [amazon.com]) to use for my homeschool co-op calculus classes. I was trying to find a good way to explain the notation, and I literally had 20 calculus books that I read through trying to find a good explanation for the standard notation in any of them. None of them even attempted an explanation, just "this is the way it is, but don't treat it as a fraction." So, I tried to deduce the notation myself. That's when I realized that it was not just limited, it was actually wrong. So I wrote the paper and finished the book (it's Appendix B in the book).
    
    Parent Share
    twitter facebook
    - Re: (Score:2)
      
      by itamblyn ( 867415 ) writes:
      
      Your result is awesome. Well done.
    - Re: (Score:2)
      
      by phantomfive ( 622387 ) writes:
      
      Now I don't feel bad that the notation never made any sense to me.
    - Re: (Score:2)
      
      by azcoyote ( 1101073 ) writes:
      
      Awesome! I'll have to recommend your book to my homeschool co-op. I was especially impressed that you noted that the problem of the original notation derived from a philosophical cause. Too many people do not realize that philosophy plays into science and mathematics, even in how we conceptualize objective facts and concepts.
    - - Re: (Score:3)
        
        by johnnyb ( 4816 ) writes:
        
        The problem with e-book math books is trying to make it look right on a small screen. If you just want a PDF of it, send me an email and I'll send you one, especially if you consider telling other people how great it is. Unfortunately, you can't just tell Amazon to take your PDF and make it an e-book :(
    - - Re: (Score:2)
        
        by ron_ivi ( 607351 ) writes:
        
        like trying to write better poetry by using more perfect spelling and grammar
        Let's run with your analogy.
        The current situation - with the misleading notation - is as if all poetry in history was written using future-tense.
        The new recommended notation is like adding the ability to write the correct tenses in poetry.
        Sure, you can still use the rough approximation old notation (and many teachers will); just as you can still write poetry that only uses future tense.
        This guy empowered us to now use the more correct notation that implies the meaning we intend.
  - Re: (Score:3, Interesting)
    
    by Obfuscant ( 592200 ) writes:
    
    because the old way ways "you can treat the derivative as a fraction,
    
    Except the second derivative notion isn't a fraction. It's a way of writing "the second derivative of Y with respect to X" in a short form. Not all '/' create "fractions". Unless, of course, you want to argue that I'm putting a lot of "< divided by quote>" fractions in my /. postings.
    The error is not in the notation, it's in the inability to overload the / operator when dealing with more complex and abstract mathematical concepts. It's like not being able to differentiate between "e as a variable"
    - Re:Seems quite a lot larger... (Score:4, Interesting)
      
      by johnnyb ( 4816 ) writes: <jonathan@bartlettpublishing.com> on Wednesday April 10, 2019 @09:42PM (#58418980) Homepage
      
      Except that, in the first derivative, it *is* used as a fraction. Otherwise you couldn't reformulate your equation for integration (i.e., you have to multiply both sides by dx, which is treating it as a fraction). So, to say that in one case, it is a fraction, but this next case it isn't, but still written as a fraction, even though it *could* be written as a fraction, but we just decided not to, seems strange, at least to me.
      
      Parent Share
      twitter facebook
      - Re: (Score:1)
        
        by Obfuscant ( 592200 ) writes:
        
        (i.e., you have to multiply both sides by dx,
        I cannot remember EVER having to multiply "both sides" of anything by "dx" to do an integration. Maybe "new math" forces this.
        
        Re: Seems quite a lot larger... (Score:2, Insightful)
        
        by Anonymous Coward writes:
        
        Yes, you do, to integrate dy/dx = x, you would multiply both sides by dx, cancelling out the denominator in dy/dx to form, dy = x dx. Throw both sides under an integral sign and go! You just memorized the rules with no understanding of why it worked...
        
        Re: (Score:2)
        
        by the phantom ( 107624 ) writes:
        
        No... if I wanted to solve that DE, I would integrate both sides with respect to $x$. On the right, the integral is easy enough to compute. On the left, it comes down to an application of the fundamental theorem of calculus. The Leibniz notation is convenient in this case, since it lets you treat the differential as a number, but the usual exposition relies on actual theorems which justify this kind of manipulation. It should also be noted that Abraham Robinson went over this in the 60s...
        
        Re: (Score:2)
        
        by bugs2squash ( 1132591 ) writes:
        
        I've seen many explanations of "integration by substitution" that involve treating du/dx as a fraction where u is substituted for an inner function - It's described as applying the differentiation chain rule in reverse.. It seems to be the standard way of teaching it.
    - Re: (Score:2)
      
      by jbengt ( 874751 ) writes:
      
      "d^2/dx^2" is just a shorthand way of writing "d(dy/dx)/dx". Just do the calculations in the order of the parentheses. I fail to see the problem with that. Am I missing something?
- Re: (Score:2)
  
  by Spazmania ( 174582 ) writes:
  
  For every complex problem there is an answer that is clear, simple, and wrong.
  That this basic calculus equation was wrong is my new excuse for why I suck at calculus.
  - Re: Seems quite a lot larger... (Score:1)
    
    by Anonymous Coward writes:
    
    The way I studied it - it was never a notation. The second order derivative was just a derivative of the derivative, there was no expression for it. Though that was in the former eastern block.
- Re: (Score:2)
  
  by Rockoon ( 1252108 ) writes:
  
  .....given that the original form is too verbose, it is no surprise that longer algebraic constructions are also too verbose
  
  The problem with both these forms is that while this may be how the theory-side thinks of it, its not how the applied side thinks of it. Trying to get from a relation stated in the original notation, to an applied calculation via algebra, leads nowhere. The second "notation" is clearly a minefield for any applied guy (*) even if algebraic manipulations are now valid.
  
  (*) most of the
- Standard form just as accurate (Score:3)
  
  by Roger W Moore ( 538166 ) writes:
  
  I appreciate the new form is technically more accurate
  It's only technically more accurate if you read the standard form as a fraction. If you actually read the standard form as intended - a notation indicating the second derivative of y with respect to x - the standard notational form is just as accurate as is the Newtonian notation of dots to denote derivatives with respect to time.
- Re: (Score:2)
  
  by Tomahawk ( 1343 ) writes:
  
  From the abstract on the paper:
  "This leads to an overall simplification in working with calculus for both students and practitioners, as it allows items which are written as fractions to be treated as fractions. It prevents students from making mistakes, since their natural inclination is to treat differentials as fractions.Additionally, there are several little-known but extremely help"
  and
  "Since many in the engineering disciplines are not formally trained mathematicians, this also can prevent professionals
  - Re: (Score:2)
    
    by Tomahawk ( 1343 ) writes:
    
    Damnit... I wish comments could be edited on this...
    "...Additionally, there are several little-known but extremely helpful formulas which are straightforwardly deducible from this new notation."
- Re: (Score:2)
  
  by bugs2squash ( 1132591 ) writes:
  
  you can still use notations like f''(x) for brevity.
- - Re: (Score:2)
    
    by account_deleted ( 4530225 ) writes:
    
    Comment removed based on user account deletion
Summary's accuracy seems questionable (Score:5, Informative)

by JoshuaZ ( 1134087 ) writes: on Wednesday April 10, 2019 @06:06PM (#58418080) Homepage

There's no "flaw" in calculus. They've proposed a notation which if one used it would allow a broader range of formal manipulations to be valid. This is interesting but it isn't groundbreaking.

Share
twitter facebook
- Re: (Score:2)
  
  by Guybrush_T ( 980074 ) writes:
  
  Well I could argue there was a component missing (d2y/dx2 was there but not -(dy/dx)(d2x/dx2)), hence it was a fundamental error that was causing d2y/dx2 to be a notation instead of a mathematical object, while looking like a mathematical object.
  If it was called something like (dx/dy)" maybe I would have agreed this is only a notation. But d2y/dx2 is weird enough that it pretends to say something .. which happens to be wrong.
  - - Re: (Score:2)
      
      by jbengt ( 874751 ) writes:
      
      By the way, d2x/dx2 = 0, so there is nothing "missing".
      Considering that, if actually defined, dx/dx should be 1, I would have thought that d^2x/dx^2 = d(dx/dx)/dx = d(1)/dx, which would be the limit of 1/dx as dx goes to zero, which would be undefined or infinity.
      - Re: (Score:2)
        
        by johnnyb ( 4816 ) writes:
        
        Not quite. d(1) *is* zero. The differential of a constant is zero, basically by definition. If e is an infinitesimal, 0/e is still zero. However, d^2x/dx^2 != d(dx/dx)/dx. d(dx/dx)/dx, using the new notation, is "d^2x/dx^2 - (dx/dx)(d^2x/dx^2)", which is obviously zero by inspection.
- - Re: (Score:2)
    
    by jgtg32a ( 1173373 ) writes:
    
    Sounds like it is well known and he came up with a patch that covers all/more of the special cases.
  - Re: (Score:3)
    
    by sexconker ( 1179573 ) writes:
    
    It's well known to most students who took a calculus course and then took the next level calculus course.
    Suddenly your d and dx start doing weird shit, and they can't do other shit you've been told to do with them, then you find yourself questioning WTF the d or the x actually mean, and then you wonder why you never thought d/dx was just 1/x, and you realize your teacher / professor can't explain it either.
    If you don't grok the fundamental theorem of calculus, you'll never grok d/dx and the bullshit that ha
  - Re:Summary's accuracy seems questionable (Score:5, Interesting)
    
    by johnnyb ( 4816 ) writes: <jonathan@bartlettpublishing.com> on Wednesday April 10, 2019 @06:55PM (#58418284) Homepage
    
    It's a bit of both. Some of the facts of the matter were known, but it was assumed that this was just "the way it was". That is, no one considered it an open problem. For instance, we view the inability to divide by zero just a fact of mathematics, not a flaw. Likewise, this was not known to be a flaw, it was just assumed that this was the way things worked.
    If you need to point to a definitive flaw, it was in our understanding of how it was supposed to work - the relationship between our understanding and the notation. Once *that* flaw was discovered, the actual notation just spilled right out. That is, the flaw was that people were *not* treating dy/dx *sufficiently* as a fraction, due to 19th century preferences against infinitesimals. Once you realize that dy/dx really is a fraction, and has to be treated accordingly, everything automatically works.
    It's almost humorous because there was no real advanced work to do. Literally everything needed is available in intro calculus. The problem was (a) the mathematics community had a habit of *not* treating dy/dx as a fraction, and (b) new students who didn't know better were simply taught *what* to do, not *why* to do it, and continued to repeat the mistake for over a century.
    
    Parent Share
    twitter facebook
Congrats (Score:5, Interesting)

by yodleboy ( 982200 ) writes: on Wednesday April 10, 2019 @06:25PM (#58418154)

Figured I'd better say congratulations before the inevitable flood of people shitting on your contribution to math gets up to speed.

Share
twitter facebook
- Re:Congrats (Score:5, Interesting)
  
  by johnnyb ( 4816 ) writes: <jonathan@bartlettpublishing.com> on Wednesday April 10, 2019 @06:46PM (#58418232) Homepage
  
  Thanks! I appreciate it. Given that this was my first peer-reviewed mathematics paper, I had no idea how long the process was. I submitted the paper over a *year* ago. The necessary changes were minor. But the actual time it took to go through the process was excruciating. I'm happy to finally be on the other side :)
  
  Parent Share
  twitter facebook
  - Re: (Score:1)
    
    by Anonymous Coward writes:
    
    You bastard, now every student will need to buy a new textbook next year! ;)
  - Re: (Score:1)
    
    by Dr. Bombay ( 126603 ) writes:
    
    My first paper in chemistry was submitted to the Journal of the American Chemical Society.
    The review took over a year. This was before the internet and the editor dropped the ball. At that time,
    it was bad form to ask the editor about the status of your submission as the process was already slow and
    you did not want to pester the editor and get it treated more slowly.
    Email has changed the whole process, but some journals are still slow.
    Congratulations on the acceptance of your paper.
  - Re: (Score:2)
    
    by Actually, I do RTFA ( 1058596 ) writes:
    
    Can you give some examples of special cases that are avoided by your notation?
    - Re: (Score:2)
      
      by johnnyb ( 4816 ) writes:
      
      Anywhere where you have a second derivative where the variable with which you are taking the derivative with respect to is dependent on another variable. You would previously have to use Faa di Bruno's formula to properly take care of this situation. Now you can just do algebraic manipulations.
  - Re: (Score:2)
    
    by serviscope_minor ( 664417 ) writes:
    
    I submitted the paper over a *year* ago. The necessary changes were minor. But the actual time it took to go through the process was excruciating.
    Apart from a few exception, yeah submitting papers takes aaageeess. Basically (and this has nothing to do with you or the quality of your work), no one wants t oread your paper. I mean people accpet they need to read and review papers as part of the academic papers, but no one WANTS to do it.
    Half of it is they want to be doing their own research. The other half i
    - Re: (Score:2)
      
      by johnnyb ( 4816 ) writes:
      
      I recently had another paper which sat for 4 MONTHS in the editors inbox, before he decided he just wasn't interested.
      What needs to happen is to have a small change in policy like this:
      1) You can submit to multiple journals at once
      2) A journal makes an offer to send it for review
      3) Accepting an offer @2 requires that you remove your submission from other journals
      Then the procedure goes on as before. This will prevent editors from wasting everyone's time.
      What's super-super frustrating is that I had a *diffe
  - Re: (Score:2)
    
    by Ami Ganguli ( 921 ) writes:
    
    This is indeed very cool. I just skimmed the paper (so far), but I fundamentally like the idea that not treating dx/dy as a fraction is just a historical artifact.
    Bought your book.
  - - Re: (Score:3)
      
      by johnnyb ( 4816 ) writes:
      
      My coauthor has been doing this to good effect. His book "Controllability of Dynamic Systems: The Green's Function Approach" utilizes it. My role in mathematics is primarily in teaching high schoolers, so I don't spend a lot of time with differential equations. That's also the reason I *have* a co-author. I needed someone to tell me I wasn't crazy :)
Ugh (Score:3)

by sexconker ( 1179573 ) writes: on Wednesday April 10, 2019 @06:36PM (#58418198)

The d, dx, dy, etc. are not things to be generally operated on.
Writing the second derivative as d^2 / dx^2, or worse, d^2y / dx^2 is doubly absurd. (I'm using the ^ to denote supersripting, not exponentiation.)
d represents the instantaneous rate of change (which itself is a flawed concept - a rate of change cannot be instantaneous as a rate depends on the passage of time), dx represents that instantaneous rate of change of x. d/dx represents the instantaneous rate of some value, possibly some value dependent on x, with respect to the instantaneous rate of change of x. dy/dx, dv/dt, etc. are all the same deal. That rate of change of some variables with respect to other variables.
What is that instantaneous rate of change? The slope of a line (plane, or whatever if you've got more free variables) tangent to your function at a given point, presuming such a thing exists.
How do you determine that tangent line? You take the target point and some point h past it ((f(x) vs f(x+h)) (or before it!) and determine what the line does when you consider h approaching 0. You make sure you can define that shit from both ends and both ends agree. If that works out, have a limit, you've got a derivative, and baby, you've got the fundamental theorem of calculus goin'.
Whoever tried to slap that shit together as a fraction or take shortcuts and try to manipulate those symbols in a way that looks sort of like algebraic manipulation is a clown. Trying to fix that is going to be an uphill battle, but using more of the busted notation isn't really the solution.

Share
twitter facebook
- Re: (Score:2)
  
  by Rockoon ( 1252108 ) writes:
  
  Whoever tried to slap that shit together as a fraction or take shortcuts and try to manipulate those symbols in a way that looks sort of like algebraic manipulation is a clown. Trying to fix that is going to be an uphill battle, but using more of the busted notation isn't really the solution.
  100% agree.
  
  The thing is, superior notations are right in front of us. Programmers use a variety of them every day.
- Re:Ugh (Score:5, Insightful)
  
  by BKX ( 5066 ) writes: on Wednesday April 10, 2019 @08:00PM (#58418594) Journal
  
  dy/dx doesn't represent instantaneous rate of change. That would be nonsense. The d in dx and dy means "small difference that will eventually go to zero". This is why dy/dx is a fraction. It represents the limit of a small change in y divided by small change in x, as the changes go to zero. This is why we teach students about the limit definition of the derivative as being what the derivative really is. As far dy and dx being tricks of notation, they're really not. They really are small changes. There's no instantaneous rate of change. dy and dx are always finite real numbers. They never actually become zero. dy/dx is the ratio that is approached as they get smaller and smaller.
  As far as this guy's new version of the second derivative, I call bullshit. I seriously doubt that this is correct. And the notation d^2y/dx^2 actually makes sense when you think about. It's really just d(dy/dx)/dx, that is, a small change in dy/dx divided by a small change in x, where dy/dx is a small change in y divided by a small change in x. Writing it in the other way is just a good way of doing it. If you draw out what this means graphically, is becomes clear that it's really a small change between two consecutive small changes in y divided by two small changes in x, that is d(dy)/dx^2, hence d^2y/dx^2.
  This guy's new version, on the other hand, doesn't make sense at all. I mean, how do you get that from taking the derivative of the first derivative. Let's take a pretty standard function: x=1/2*t^2+2*t+12. x'=t+2; x''=1, whereas his version would be x''=1-t, which doesn't make any sense, unless he has completely redefined everything. I mean, d^2y/dx^2 would have to be something like 2t+5 and d^2x/dx^2 would have to be something like 2, and then we get x''=2t+5-(t+2)*2. I didn't read the paper so I don't know what it would actually be, but there's no doubt that x''=1, so if his method is to make any sense at all it would have to give the same results in the end. I just don't see how it could.
  
  Parent Share
  twitter facebook
- Re: (Score:2)
  
  by bugs2squash ( 1132591 ) writes:
  
  embrace the suck, apply powers to other math operators, eg: 1 +^2 3 = 7
Except the notation isn't "dy/dx", it's "d/dx" (Score:1)

by Anonymous Coward writes:

"d/dx" is an operator on functions that has about a half dozen tidier, less-confusing alternate notations, while "dy/dx" is a limit of a ratio of nonzero numbers that is misleadingly written as a fraction because people in the 18th century weren't as bothered about the whole 'dividing by zero' thing.
The fact that algebraically treating dy/dx as a fraction works in any situation at all is a minor stroke of luck that honestly should be concealed, since thinking that way already hurts the progress of a huge nu
(d 2 y/dx 2) – (dy/dx)(d 2 x/dx 2) (Score:2)

by JSG ( 82708 ) writes:

Call differentiation "quark" instead. The new form for d2y/dx2 could be called a double quark or "fred" for short or f for really short. For really rigorous treatment call it f(x). In the UKoGBnNI it shall be known as noddy on Tuesdays unless the year is 2022.
- Re: (Score:2)
  
  by tepples ( 727027 ) writes:
  
  In the UKoGBnNI it shall be known as noddy on Tuesdays unless the year is 2022.
  In your notation, what's Big Ears?
  (And what's Mr. Wobbly Man?)
this is actually useful (Score:4, Informative)

by epine ( 68316 ) writes: on Wednesday April 10, 2019 @06:56PM (#58418286)

It took me a few minutes to get to the nub of the matter.
If you're mentally reading the notation d^2 y / dx^2 as the second derivative of y divided by dx squared, you're doing it wrong.
Because what this notion really intends to mean is d(d(y)/dx)/dx, which as the paper points out is a different order of operation.
A more compact notation less misleading than the traditional d^2 y / dx^2 might be (d/dx)^2 dy, which expands via two repeated function applications to d(d(y)/dx)/dx, with the underlying operations now in the right order.
Calculus was never my best thing, so I might be all wet, but it seems to make sense.
I never liked the dx/dy notation much, regarding it more as a cryptic code than anything conceptually helpful (when its not cryptic, it's not helpful, because that's the common case you already know).
With the right lambda notation (riffing on what I proposed above) the fundamental operator nature of d() could be correctly expressed, even if you don't want into these algebraic manipulations, which mostly strike me as far too detailed and tedious.

Share
twitter facebook
- - Re: (Score:2)
    
    by Cederic ( 9623 ) writes:
    
    You twisted deviant.
    I'm so glad your kind never gained a foothold in modern computing.
Well thank (Score:3)

by ArchieBunker ( 132337 ) writes: on Wednesday April 10, 2019 @06:58PM (#58418294)

god that's settled. Now we can figure out that P=NP problem that nobody can give a coherent answer on why its even a thing.

Share
twitter facebook
- Re: (Score:2)
  
  by phantomfive ( 622387 ) writes:
  
  Now we can figure out that P=NP problem that nobody can give a coherent answer on why its even a thing.
  I think that problem is you tbh, if you don't understand it.
Linear regression stumper (Score:4, Interesting)

by Tablizer ( 95088 ) writes: on Wednesday April 10, 2019 @07:00PM (#58418310) Journal

I have a "math issue" that has stumped most of my professors and online math forums. Linear regression typically uses the "least squares" algorithm. However, the power of 2 seems arbitrary to me, and possibly over-emphasizes outliers.
One professor at first said that the power of 2 makes the "best fit" in an objective sense, but later admitted that he doesn't really know, and couldn't find an answer before the end of the semester.
While it is true that the power of 2 may simplify the computation process*, that doesn't necessarily means it produces a better result in terms of line or curve fitting. Now that we have computers to do the number crunching, perhaps it's time to embrace arbitrary or different powers (superscripts).
(Disclaimer, I'm not a math expert.)
* In other words, power-of-2 produces the simplest known algorithm. But my question revolves around best data fit, not computational resources nor algorithm or formula brevity. Note that when using other powers, one may have to add an absolute value function because power-of-2 automatically provides the equivalent. I actually did a simulation that tested different powers; "blurring" known datasets and seeing which power best matched the original. I couldn't find any significant difference, but probably didn't try enough samples. I tested with fractional powers also, such as 1.5, 2.5, etc.

Share
twitter facebook
- Re:Linear regression stumper (Score:5, Informative)
  
  by Mendenhall ( 32321 ) writes: on Wednesday April 10, 2019 @07:23PM (#58418428)
  
  It's not arbitrary. There's actually a good reason for minimizing (y-yobs)^2, assuming that your observations have a Gaussian distribution. The resulting estimators provide a maximum likelihood estimator of the parameters of the distribution, if and only if it really was Gaussian. Thus, of course, if it isn't Gaussian (outliers of various sorts, et.c), the x^2 may not be the best bet. There is an entire field of 'robust estimators' of quantities, which are more resistant to outliers than least squares. There are also cases in which the underlying distribution is pathologically different from Gaussian; it could be Lorentzian (Cauchy), in which case it is so completely unlike a Gaussian, it doesn't even have a defined standard deviation (it is infinite). There are weighted methods which can fix this too.
  So, in short, least squares is the right answer (in the sense that it yields results which provable have the maximum likelihood describing the data at hand) if you have a perfect Gaussian variate; otherwise, it may well not be.
  
  Parent Share
  twitter facebook
  - Re: (Score:2)
    
    by bungo ( 50628 ) writes:
    
    Good answer.
    I'd go a step further and say that the purpose of linear regression is to see if there is a relationship in the data, and not to provide an actual answer to what the relationship exactly is.
    In the real world, data relationships are rarely linear and distributions are often not known or are approximated. A linear regression will give you an idea on what is going on. The real relationship maybe too complex to ever know.
    So, using least squares, well, it's probably good enough or at least a good sta
  - - Re: (Score:2)
      
      by jbengt ( 874751 ) writes:
      
      Using 1.1 as an exponent doesn't play well with errors that have a negative sign.
- Re: (Score:2)
  
  by sfcat ( 872532 ) writes:
  
  I have a "math issue" that has stumped most of my professors and online math forums. Linear regression typically uses the "least squares" algorithm. However, the power of 2 seems arbitrary to me, and possibly over-emphasizes outliers.
  One professor at first said that the power of 2 makes the "best fit" in an objective sense, but later admitted that he doesn't really know, and couldn't find an answer before the end of the semester.
  While it is true that the power of 2 may simplify the computation process*, that doesn't necessarily means it produces a better result in terms of line or curve fitting. Now that we have computers to do the number crunching, perhaps it's time to embrace arbitrary or different powers (superscripts).
  (Disclaimer, I'm not a math expert.)
  * In other words, power-of-2 produces the simplest known algorithm. But my question revolves around best data fit, not computational resources nor algorithm or formula brevity. Note that when using other powers, one may have to add an absolute value function because power-of-2 automatically provides the equivalent. I actually did a simulation that tested different powers; "blurring" known datasets and seeing which power best matched the original. I couldn't find any significant difference, but probably didn't try enough samples. I tested with fractional powers also, such as 1.5, 2.5, etc.
  Those other exponents probably also work. That term is just to help estimate the slope but any exponent > 1 would likely work there. Its just that its impractical for a variety of reasons including the fact that linear regression is just too simple a model for anything but the simplest use cases. Other techniques which aren't built upon linear regression are used instead so nobody studies this. You very well might be right about outliers for some use cases but it doesn't matter as other techniques ar
- Re: (Score:2)
  
  by serviscope_minor ( 664417 ) writes:
  
  However, the power of 2 seems arbitrary to me, and possibly over-emphasizes outliers.
  It's not arbitrary, and yes it does emphasise outliers.
  This is going to be hard to explain given slashdot's formatting ability but here goes...
  For least squares, the assumption is that the noise is Gaussian, where every datapoint has the same (unknown) noise variance. If you have some linear function of parameters f(a), what you're doing is finding the a that maximises the probability of observing the data.
  Assume you have d
- Re: (Score:2)
  
  by Ihlosi ( 895663 ) writes:
  
  However, the power of 2 seems arbitrary to me,
  It is not arbitrary!. Using the power of two allows a simple, possible even trivial analytical solution of the problem (Matlab and similar have it built-in and can do it in a single line).
  Of course you could use other norms to minimize the regression error - l1 norm, linf norm or any other norm in between, or even any other norm you can come up with. But in these cases, you end up with optimization problems that do not have analytical solutions and require
- Re: (Score:2)
  
  by gotan ( 60103 ) writes:
  
  But what is the "best data fit"?
  That depends on the kind of data, the kind of errors one expects and the properties the fit should have.
  Linear regression yields a result with some well known properties, e.g. the resulting linear function passes through the center of gravity. Maybe that's a desirable property. In other cases the y_i could be the result of a measuring process with a gaussian error distribution (where larger errors become more unlikely). Due to the central limit theorem that is often the case,
- - Re: (Score:1)
    
    by Tablizer ( 95088 ) writes:
    
    Chairman of the FCC? Heck no, don't give him any more power.
- - Re: (Score:2)
    
    by Tablizer ( 95088 ) writes:
    
    I'm not sure how exactly you performed a 1.5 minimization
    Brute force: shifting the line around incrementally and computationally until an approximate minimum was achieved. Yes, I know it's only an approximation, but it should have been good enough to detect any clear pattern if one existed.
    I did assume rough boundaries/limits to avoid problems such as multiple candidates and division by zero. Thus, I wasn't testing every possible slope or line, just those "in the ballpark". (Perhaps "least squares" is handy
It's a complete nonsense. (Score:3)

by porky_pig_jr ( 129948 ) writes: on Wednesday April 10, 2019 @07:13PM (#58418386)

Leibniz' notation is normally treated as a "suggestive kind", never to be understood literally. The origin of notation d^2/dx^2 goes from applying d/dx to d/dx, but d/dx only means "a derivative w.r.t. x" and nothing else. Sometime taking this notation literally and doing manipulations as if it were the regular fractions work (and that's b.t.w. is attributed to the early discoveries of many differentiation and integration rules), but it doesn't work most of the time. Any decent book on Calculus should point out that fact. Working with fractions helps to discover some rules, yes, but it's never rigorous, it's more like discovering something in a heuristic way, but then you still need a rigorous proof and that involves going back to basic definition of limits, not arguing in terms of "infinitesimals" (yes, I'm aware of Robinson's "non-standard calculus", but IMHO it's not a mainstream approach. Cheers.

Share
twitter facebook
Delta square x over delta y square IS retarded... (Score:1)

by XArtur0 ( 5079833 ) writes:

but Lagrange's notation > Leibniz's notation anyhow.
If you want to do wizardry by manipulating the notation itself, then by all means use '(d 2 y/dx 2) - (dy/dx)(d 2 x/dx 2)'.
Kudos to johnnyb
Old School (Score:3)

by fahrbot-bot ( 874524 ) writes: on Wednesday April 10, 2019 @07:20PM (#58418422)

Old-School Slashdotter Discovers and Solves Longstanding Flaw In Basic Calculus
Can occasionally be heard yelling at younger mathematicians: "Get off my lambda"

Share
twitter facebook
- Re: (Score:2)
  
  by jrumney ( 197329 ) writes:
  
  I'd always dismissed old folks groanings about how easy the kids have it compared to their day. I went all the way through K12 and university with a fairly heavy calculus component to my degree, without ever encountering the second derivative of y with respect to x, and I'm not exactly young. But this guy considers this to be "elementary" calculus, so his old elementary school must have been hard core.
  - Re: (Score:3)
    
    by johnnyb ( 4816 ) writes:
    
    You never did a second derivative test to determine whether you are at a local minima or maxima?
    Most intro calculus books at least show the notation for the second derivative. However, it is true that they rarely take it far enough to hit any problems with the notation.
    I actually figured this out while trying to find a good way to explain the notation to my students, which is a homeschool co-op class (I have a range of 9-12 graders - the 9th grader is an exception, but she is ridiculously smart). I read t
    - Re: (Score:1)
      
      by primebase ( 9535 ) writes:
      
      I'm just impressed that there's anyone still left around here with a lower user# than mine! Congratulations on your innovation and publication!!
    - - Re: (Score:2)
        
        by johnnyb ( 4816 ) writes:
        
        If you read my paper, I actually suggest this as a shortened form of my own. This notation is Arbogast's, and is woefully underused. I show how to interconvert between Arbogast notation and my own in the paper.
What about partial derivatives? (Score:2)

by leehwtsohg ( 618675 ) writes:

Very cool! I think the paper will help me understand more deeply problems with the notation I've fought with many times!
However, I'm a bit disappointed that the notion of partial vs. full derivative wasn't raised, which I think is very relevant to the question...
- Re:What about partial derivatives? (Score:5, Interesting)
  
  by johnnyb ( 4816 ) writes: <jonathan@bartlettpublishing.com> on Wednesday April 10, 2019 @10:24PM (#58419136) Homepage
  
  I've actually got a second paper on partial derivatives just about ready to go. It was originally part of this paper, but it got a little long, and I wanted to rethink and clarify a few concepts. Anyway, partial differentials have the same notational problem *plus* one more. The problem is that there are several partial differentials which all go by the same name. Once you name them properly (i.e., give them each a distinct name) the problems go away.
  
  Parent Share
  twitter facebook
  - Re: (Score:2)
    
    by kackle ( 910159 ) writes:
    
    Well, congratulations, and thank you for putting forth the extra effort to help (future) mankind.
Re: (Score:2)

by account_deleted ( 4530225 ) writes:

Comment removed based on user account deletion
Could I get that explained... (Score:2)

by HotNeedleOfInquiry ( 598897 ) writes:

In the form of a frog meme?
Sigh. (Score:3)

by ledow ( 319597 ) writes: on Thursday April 11, 2019 @05:05AM (#58419862) Homepage

Often in maths, a mere change of notation, analogous equation in another field, or just looking at things in a slightly different way will open up whole new areas of maths.
Fermat's Last Theorem took forever to prove and the proof relies on translating the problem to a completely unrelated area of maths, solving it there, and then translating the results back.
And if you do things like use polar coordinates, etc. some areas of maths burst open with good sense and nice equations.
Something as simple as a notation change can work wonders. But this is just for convenience of amateurs who don't understand what a derivative actually is and does. It's like saying "Don't use the word multiplication for vectors, because it's not the same as for scalars". We know. Anyone handling it knows. Anyone dumb enough to confuse the notations is going to find out very quickly that nothing works. Sure, it might help if you've literally never done those kinds of equations before, but likely then you'll not be making any ground-breaking mathematical discoveries any time soon.
Things don't tend to survive hundreds of years for no reason, especially when they are one pen-stroke away from being changed, and have themselves gone through several notational iterations in their time.
I got through a degree in maths without thinking "Well, this notation is stupid", including three years of advanced calculus.
If you don't understand the notation, that's the very least of your worries as regards actually doing any calculus.

Share
twitter facebook
this is absolute AWESOME (Score:2)

by Andre Dias ( 3819801 ) writes:

Will revolutionize calculus teaching
- Re:And in a sane curriculum (Score:5, Informative)
  
  by TeknoHog ( 164938 ) writes: on Wednesday April 10, 2019 @06:37PM (#58418200) Homepage Journal
  
  The messed up notation by Newton is not used and instead the much saner stuff from Riemann is used. Newton was smart, but a hack and a crank. And he tried to suppress Riemann notation. Mathematics would probably have done better without Newton.
  Surely you mean Leibniz (1646-1716), not Riemann (1826-1866).
  
  Parent Share
  twitter facebook
  - Re: (Score:2)
    
    by gweihir ( 88907 ) writes:
    
    I mean the Riemann Integral. But you are correct, differentiation came first.
  - - Re: (Score:2)
      
      by gweihir ( 88907 ) writes:
      
      He could not have. He was dead at the time all information became available. But you seem to be stupid, so you are probably incapable of understanding causality.
- Re: (Score:2)
  
  by kamapuaa ( 555446 ) writes:
  
  Thanks for your opinion on the history of Calculus, person who is not even aware of the most basic of facts and provides no real rationale.
- Newton and _Leibnitz_ both useful (Score:5, Informative)
  
  by Roger W Moore ( 538166 ) writes: on Wednesday April 10, 2019 @07:20PM (#58418418) Journal
  
  The messed up notation by Newton is not used and instead the much saner stuff from Riemann is used.
  The advantage of the Newtonian notation is that it is a lot faster and easier for, unsurprisingly, basic Newtonian mechanics where you only really differentiate with respect to time. This is why it is used extensively in this area of physics. Leibnitz's (not Riemann's!) notation is a lot more versatile which is not surprising: Leibnitz was a mathematician who was interested in the abstract concept whereas Newton was a physicist who only developed calculus so he could describe mechanics and so did not really need a broader, more flexible notation.
  
  It is actually quite a common that fundamental physics can find itself ahead of maths. For example String theory today is really a joint venture between maths and physics since they are having to develop the maths needed to describe the physical models they work on.
  
  Finally, Newton was neither a hack or a crank but he was a somewhat evil genius. He could be quite nasty and viscous, sometimes in extremely petty ways. For example he discredited Leibnitz and he fell out with Robert Hooke and had all contemporary portraits of him destroyed which so angered a modern artist that she spent the time an effort painting multiple portraits of Hooke from contemporary descriptions so that, today, there are more portraits of Hooke than Newton!
  
  Parent Share
  twitter facebook
  - Re: (Score:1)
    
    by gweihir ( 88907 ) writes:
    
    Finally, Newton was neither a hack
    As a mathematician, he was most definitely a hack. Or he just did not care, which is about as bad. As a physicist he was probably somewhat better.
    - Re: (Score:2)
      
      by Roger W Moore ( 538166 ) writes:
      
      As a mathematician, he was most definitely a hack. Or he just did not care...
      He was not a mathematician: he was a physicist. He only developed the mathematics needed to describe his physics so why should he care about maths beyond that when that wasn't what he was interested in? Complaining he was a poor mathematician would be like claiming you are a poor journalist for starting a sentence with 'or'.
      - Re: (Score:2)
        
        by gweihir ( 88907 ) writes:
        
        Did you miss what the story was about?
        
        Re: (Score:2)
        
        by Roger W Moore ( 538166 ) writes:
        
        It was about mathematical notation and Newton's mathematical notation for calculus is still used by physicists today because it is so convenient. This does rather suggest that his notation was aimed at physics and that he was a physicist. The only reason you have a problem with him is that you are trying to make him out to be something he was not.
      - Re: (Score:2)
        
        by Tyler Durden ( 136036 ) writes:
        
        Newton was most definitely a mathematician. He held the Lucasian Chair of Mathematics at Cambridge for a number of years. He also developed mathematics outside of what he needed to describe his physics, such as finding the infinite series necessary to expand (a+b)^n where n is negative, or a fraction.
        Also consider this quote about Newton from Leibniz himself: "Taking mathematics from the beginning of the world to the time when Newton lived, what he had done was much the better half."
    - - Re: (Score:2)
        
        by gweihir ( 88907 ) writes:
        
        Would make perfect sense. What a twat.
  - Re: (Score:2)
    
    by DCFusor ( 1763438 ) writes:
    
    Yup, viscous as molasses on a cold day.
- Re: (Score:3)
  
  by sfcat ( 872532 ) writes:
  
  The messed up notation by Newton is not used and instead the much saner stuff from Riemann is used. Newton was smart, but a hack and a crank. And he tried to suppress Riemann notation. Mathematics would probably have done better without Newton.
  Riemann lived 2 centuries after Newton. And your conclusions aren't correct, they aren't even wrong!!!
- - Re: (Score:1)
    
    by gweihir ( 88907 ) writes:
    
    Not that much. Privileged and had time. The adoration some people have for him is not founded on facts.
    - Re: (Score:2)
      
      by Lanthanide ( 4982283 ) writes:
      
      Unfortunately we can't go back through history and give every single human who ever lived privilege and time to see what they're capable of.
      So we can only assess those who have made prominent contributions. There are many others who had privilege and time and still didn't contribute what Newton did, so it's still fair to call him a genius.
      - Re: (Score:1)
        
        by gweihir ( 88907 ) writes:
        
        It is not. His contributions are verifiable not that good in quality (see the the story, for example) and there was a lot of low-hanging fruit around. For calculus, he did only the simple, standard-space version (and did it badly), while Riemann did a far more general and superior version basically at the same time. Calling Riemann a genius may or may not be justified, but Newton does not make it into that group.
        
        Re: (Score:2)
        
        by gweihir ( 88907 ) writes:
        
        There is a strong indicator: Leibnitz did the same thing independently at the same time. If you factor in that the scientific community was pretty small back then, that means the prerequisites were all there, the question had been asked and it just took somebody to put it together. That makes the results a "good" scientific result, but not a "genius level" one. And I am not talking about his contributions to physics, I am talking about his contributions to calculus, see the original story.
        As to Riemann, I c
        
        Re: (Score:2)
        
        by gweihir ( 88907 ) writes:
        
        And alternatively, I really know what I am talking about and regard ACs as basically scum. However, I cannot see how you come to "pretentious", unless you are an authoritarian follower that things people that have a name may not be criticized. Is that it? Newton was so great, nobody is allowed to criticize him? That would be a pretty bad stance. As for "self-centered", were to you see any evidence for that?
        So I got confused on a name in the history of mathematics. But this is /. and I did not look it up to
    - Re: And in a sane curriculum (Score:1)
      
      by Anonymous Coward writes:
      
      Exactly! If gweiher here had more time on his hands, he would have written supercalculus by now. But no, his solemn duty to try to top dead people on Slashdot takes up his otherwise suuuuper valuable time.
- - Re: (Score:2)
    
    by gweihir ( 88907 ) writes:
    
    It does not work for almost all cases. (Mathematical "almost all".) And it does not warn you of it. It is basically not mathematics, but clever shifting around of symbols with pitfalls which works purely by accident. Not a good thing.
- Re: (Score:2)
  
  by sexconker ( 1179573 ) writes:
  
  Nah. He's right. The current notation is bullshit. It looks like standard algebra and people try to manipulate the symbols as such.
  I mean, you could shorten d/dx to 1/x, right?
  - Re: (Score:1)
    
    by Anonymous Coward writes:
    
    Nah. He's right. The current notation is bullshit. It looks like standard algebra and people try to manipulate the symbols as such.
    I mean, you could shorten d/dx to 1/x, right?
    The current notation is mathematically correct. It express that fact that the second derivative is the incremental limit of the first derivative.
    The new notation for d^2/dx^2 loses the corect mathematical meaning and gains nothing of value.
    d^2/dx^2 (f) (x) = lim (h->0) (d/dx (f)(x+h) - d/dx (f) (x))/h
    - Re: Uh (Score:2)
      
      by Kohlrabi82 ( 1672654 ) writes:
      
      Finally the first commenter who knows what dy/dx really is. It is the limit of an expression containing a fraction, so not a "real" fraction in algebraic sense.
  - Re: (Score:2)
    
    by Obfuscant ( 592200 ) writes:
    
    The current notation is bullshit. It looks like standard algebra and people try to manipulate the symbols as such.
    And that's why they call it "learning calculus" instead of "coming up with calculus all on your own". Not understanding how something works or what the terms mean can lead to horrible results.
- Re: I have ... (Score:1)
  
  by Anonymous Coward writes:
  
  That would make you a jerk.

There may be more comments in this discussion. Without JavaScript enabled, you might want to turn on Classic Discussion System in your preferences instead.

Seems quite a lot larger... (Score:2, Insightful)

Re: (Score:2)

Re:Seems quite a lot larger... (Score:5, Informative)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:3)

Re: (Score:2)

Re: (Score:3, Interesting)

Re:Seems quite a lot larger... (Score:4, Interesting)

Re: (Score:1)

Re: Seems quite a lot larger... (Score:2, Insightful)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: Seems quite a lot larger... (Score:1)

Re: (Score:2)

Standard form just as accurate (Score:3)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Summary's accuracy seems questionable (Score:5, Informative)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:3)

Re:Summary's accuracy seems questionable (Score:5, Interesting)

Congrats (Score:5, Interesting)

Re:Congrats (Score:5, Interesting)

Re: (Score:1)

Re: (Score:1)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:3)

Ugh (Score:3)

Re: (Score:2)

Re:Ugh (Score:5, Insightful)

Re: (Score:2)

Except the notation isn't "dy/dx", it's "d/dx" (Score:1)

(d 2 y/dx 2) – (dy/dx)(d 2 x/dx 2) (Score:2)

Re: (Score:2)

this is actually useful (Score:4, Informative)

Re: (Score:2)

Well thank (Score:3)

Re: (Score:2)

Linear regression stumper (Score:4, Interesting)

Re:Linear regression stumper (Score:5, Informative)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:1)

Re: (Score:2)

It's a complete nonsense. (Score:3)

Delta square x over delta y square IS retarded... (Score:1)

Old School (Score:3)

Re: (Score:2)

Re: (Score:3)

Re: (Score:1)

Re: (Score:2)

What about partial derivatives? (Score:2)

Re:What about partial derivatives? (Score:5, Interesting)

Re: (Score:2)

Re: (Score:2)

Could I get that explained... (Score:2)

Sigh. (Score:3)

this is absolute AWESOME (Score:2)

Re:And in a sane curriculum (Score:5, Informative)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Newton and _Leibnitz_ both useful (Score:5, Informative)