The interesting thing here isn’t “spreadsheet, but backwards” so much as “spreadsheet as a constraint system”. Classic spreadsheets are basically DAGs: data flows one way and a lot of UX assumptions (and people’s intuition) rely on that. As soon as you allow arbitrary cells to be solved for, you’re in “which variables are free?” land, and most of the confusion in this thread is really about degrees of freedom, not about the math.<p>One way to make this less surprising might be to flip the default: treat all cells as fixed unless explicitly marked as solver variables, and give a lightweight visualization of “these are the cells that will move if you edit this one.” That keeps the power of a general constraint solver while preserving the mental model spreadsheet users already have, and it opens the door to more serious use cases (financial models, physics, scheduling) without feeling like spooky action at a distance.
That's great feedback, thanks! I agree with you, but I don't want to flip the default because this is an experiment I made for fun, and the whole point is to lean in to the chaos a little bit. In a serious product the UX would definitely need a lot more work though.
Graphically, I really like the way autodesk makes sketches in fusion 360 blue until they are fully constrained, and then they are black. My intuition here is that you could color code “degrees of freedom” and “locked” states so that it was more intuitive.
The first example on the main page has a formula with two variables being updated from changing one value. The immediate question I have is if I change the output, where does the extra degree of freedom come from on the inputs? Does one stay locked in place? Unclear.<p>I am a huge fan of the concept though. It's been bugging me for years that my spreadsheet doesn't allow editing text fields after filtering and sorting them down to the subset I want. I have to go all the way back to the mess of unsorted input rows to actually edit them.
<p><pre><code> Does one stay locked in place? Unclear.
</code></pre>
If you set C1=A1+B1 then, when you set a value for C1, A1 and B1 are each half of that value, even if they started off unbalanced.
It would make more sense to preserve the ratio if possible.
Yeah, this concept is interesting but the fact that the simplest test case gives what's fundamentally a surprising result is very annoying.<p>It also doesn't help that in the example, the expected outcome of 53.3333/46.6667 isn't even considered.
You can do this with bidicalc already! You just have to model the problem correctly. If you expect the ratio to remain constant, what you actually want is a problem with a single free variable: the scale.<p><pre><code> A1 = 1.0 // the scale, your variable
A2 = 6 * A1 // intermediate values
A3 = 8 * A1
A4 = A2 + A3 // the sum
</code></pre>
Now update A4 (or any other cell!) and the scale (A1, the only variable) will update as you expect.
To get that, you could pass the ratio explicitly. C = 5A + 7B
What is inputs a,b,c,d,and e are polynomial coefficients? I am hoping to get a fields medal plz respond.
I think it would be good if you could lock one of them.
100% this. When I reached the end of that page I felt pranked because the obvious question was never answered. How are these cases resolved? Is it possible to fix some inputs and only update others? What if I sometimes want to change input A, and other times I want to update input B? All this should be explained as early as possible.
You can do it and it is explained, actually. Use # as a prefix to indicate a constant, e.g.: #50 will be a constant and not a variable.<p>In the future I'd like to support more user input constraints, in particular domain constraints for variables. So you could tell the solver that this cell must remain in some interval, and it would respect that interval instead of assigning any real value.
I have wanted one general application of this idea in a spreadsheet. Specifically, I track some of my running, including speed (pace), distance, and time. Under different circumstances, I have exactly two of the three available and I want the third to be computed, but it varies which. I have found it fairly difficult to implement this kind of data entry in Google Spreadsheets and Excel, even know conceptually it's a very simple constraint "a*b=c" where I know some two variables.<p>As a more substantive comment: You may find the thesis "Propagation networks : a flexible and expressive substrate for computation" by Alexey Radul interesting. <a href="https://dspace.mit.edu/handle/1721.1/54635" rel="nofollow">https://dspace.mit.edu/handle/1721.1/54635</a>
You could create a table with 3 columns: distance, time, pace.
Set the display format for time and pace to "Duration".<p>Enter these formulas:<p><pre><code> distance = time / pace
time = distance * pace
pace = time / distance
</code></pre>
Drag fill everything down. At this point you get reference errors, but once you enter any two values (thereby overwriting the formulas in those cells), you get your result.
You just need two spreadsheet tabs. One for the "raw" input and one with a formula that either takes the input if it exists or falls back to the calculated version
Came here to see if anyone mentioned propagators. That thesis is excellent. I second the recommendation.
Can you enter an RSA key and have it produce two prime numbers?
A random tool like this would be the most entertaining possible way for something like that to be unleashed on the world
My brother once suggested that there are probably bits of code/algorithms that would be world changing if they were released in academic journals, but instead were written by some unknowing programmer in an afternoon for their job coding embedded systems for refrigerators.<p>This particular example may be unlikely, but it's a very fun idea.
An anonymous 4chan user once solved a 25 year old maths problem, to answer a question about the watch order of an anime. <a href="https://www.scientificamerican.com/article/the-surprisingly-difficult-mathematical-proof-that-anime-fans-helped-solve/" rel="nofollow">https://www.scientificamerican.com/article/the-surprisingly-...</a>
Iirc, Heisenberg reinvented Matrix calculations to solve a problem in quantum physics. Not being a mathematician, he wasn't aware of the concept. Born recognized what Heisenberg had done and introduced him to his own reinvention.
Lots of people working in different fields end up reinventing things that have been known to math for centuries, often in clunky roundabout ways. I imagine some of them figure out things not known to math, but it's far more likely to go the other way.
Folks shouldn’t be afraid to “rediscover” stuff.<p>Primarily because the learnings you make are the same as the original “discoverer”. Without those learnings, you might not be able to arrive at your true destination.
> Lots of people working in different fields end up reinventing things that have been known to math for centuries<p>I remember reading, about a year or two ago, about a medical doctor that published a paper rediscovering calculus (I just looked it up, it happened in 1994, there’s been many articles and videos about it)
It's not clear from the Wikipedia article linked below whether she was rediscovering part of calculus or knowingly rebranding it. Do you know more details?
lmao: <a href="https://en.wikipedia.org/wiki/Tai%27s_model" rel="nofollow">https://en.wikipedia.org/wiki/Tai%27s_model</a><p>This is such a great story
it's a fact of geographical and social independence.. so far there's no way to know what everybody did or is doing (well there's twitter but it's configured on noise rather than signal)
A lot of the time engineers are focussed with solving a problem, to build a working machine/program, while academics just want to publish.<p>This is also true with patents.
Jokes aside, let's say someone does figure out how to break RSA over a weekend project. The evil options are easy to come up with, but what is the actually responsible, ethical, thing to do? Never tell anyone?
Contact a known and trusted security researcher who can verify to the world that you did what you said you did, so everyone else can have as much time as possible to figure out exactly how fucked they are. Doing nothing isn’t an option; once someone figures something like that out, it signifies that conditions were ripe for the discovery to be made, and it’s only a matter of time before it’s discovered again independently.
Pretend you had developed a quantum computing advancement and push people to post quantum encryption
Migrating to post quantum encryption is important, but it's also important to not be herded into a "solution" that can be/has been easily compromised.<p><a href="https://blog.cr.yp.to/20251004-weakened.html" rel="nofollow">https://blog.cr.yp.to/20251004-weakened.html</a>
In Prolog you can write rules (similar to functions in other languages) so that they work "both ways". Let's say you have this rule that defines how pace ("runner's speed") relates to distance and time:<p><pre><code> :- use_module(library(clpr)).
pace(Km, Minutes, Pace) :-
{ Minutes = Km * Pace }.
</code></pre>
Even though the rule only specifies how Minutes are calculated, Prolog can now also calculate the other values.<p>You can query it, giving unknowns an uppercase `Name`, and it will give you the possible values for it:<p><pre><code> pace(5, 24.5, Pace)
pace(40, Min, 5)
pace(Km, 24.5, 5)
pace(Km, Time, 5)
</code></pre>
You can try it here: <a href="https://swish.swi-prolog.org/" rel="nofollow">https://swish.swi-prolog.org/</a><p>So if you had a rule that defines RSA key calculation this way, you could enter a key and get all valid solutions for the primes. But of course complex calculations still take a long time. I assume it's similar to a brute force attack in that way (Prolog has clever strategies to explore the solution space though).<p>Disclaimer: I'm not an expert in Prolog or cryptography, so this might not be 100% accurate.
Or enter a public key + some encrypted data to get the private key
In Excel you have goal seek for this functionality. I believe it does some form of numerical solving of the equation system.<p>Good for every situation when you need to solve equations!<p>In the context of using spreadsheets I think about solving simple financial or maybe construction/mechanical design problems where you don’t want to solve it manually or program it and a spreadsheet is a quick and useful interface.
This is very different in practice, because it is pervasive rather than something you have to set up for particular cases.<p>If this was usual it would help a lot with people's tendency to hard code the correct answer rather than fix formulae. Just that aspect of it would be a huge improvement. People do this all the time with simple financial problem, for example.<p>A lot of what people use spreadsheets for is not all that simple. Again, especially with financial applications. People manage huge amounts of money using horribly complex models implemented in Excel.
It does not. It perturbates the variables and uses a hill-climbing algorithm.
Somebody did this back in the DOS era. The program was sometimes called "the crooked accountant's spreadsheet", because you could start with the outputs you wanted and get the input numbers adjusted to fit.<p>Anyone remember?
Cool!<p>Constraint propagation from SICP is a great reference here:<p><a href="https://sicp.sourceacademy.org/chapters/3.3.5.html" rel="nofollow">https://sicp.sourceacademy.org/chapters/3.3.5.html</a>
“Formulas that update backwards” is the main idea behind neural networks such as LLMs: the computation network produces a value, the error in this value is computed, and then the error quantity is pushed backward through the network; this relies on the differentiability of the function computed at each node in the network.
"Formulas that update backwards" isn't really the main idea behind neural networks. It's an efficient way of computing gradients, but there are other ways. For example forward propagation would compute a jacobian-matrix product of input wrt output with an identity matrix. Backpropagation is similar to bidi-calc to the same extent as it is similar to many other algorithms which traverse some graph backward.<p>I think you should be able to use bidi-calc to train a neural net, altough I haven't tried. You'd define a neural net, and then change it's random output to what you want it to output. However as I understand it, it won't find a good solution. It might find a least squares solution to the last layer, then you'd want previous layer to output something that reduces error of the last layer, but bidi-calc will no longer consider last layer at all.
All those words and you forget to provide people the breadcrumbs to learn more for themselves.<p>The term of interest is "backpropagation".
This is pretty cool.<p>This functionality is called ‘break back’ in a lot of enterprise modelling software. See [IBM TM1](<a href="https://www.ibm.com/docs/en/cognos-planning/10.2.1?topic=breakback-default-rules" rel="nofollow">https://www.ibm.com/docs/en/cognos-planning/10.2.1?topic=bre...</a>) and [Anaplan](<a href="https://help.anaplan.com/breakback-1b7aa87d-aa13-49f6-8f7d-d893fb8bccbe" rel="nofollow">https://help.anaplan.com/breakback-1b7aa87d-aa13-49f6-8f7d-d...</a>). They generally work in terms of scaling the inputs to match the new output though, which is a bit more basic than this approach.
Ah, two way data binding. If you've used any frameworks before React (and a couple earlier ones with the same philosophy) you'll understand how it becomes a spaghetti mess over time.
This is great.<p>Since you've asked about bugs, I tried pushing the limits and found the following:<p><pre><code> A1: 100 B1: =100-A1
A2: =A1*(100-A1)
A3: =A1*B1
</code></pre>
A2 can be successfully set to anything reasonable (up to 2500)<p>However, setting A3 to exactly 100 doesn't work, even though setting it to 101 or 99 (or even 100.000001) does work.<p>Another limit:<p><pre><code> A1: 100 B1: 100
A2: =A1+B1
A3: =A1*B1
A4: =abs(A2-100) + abs(A3-100)
</code></pre>
Setting A4 to zero (or anything below 80) doesn't work. This doesn't improve if the constants in the A4 formula are moved a short distance away from 100.<p>In case you can't tell from that last example, I think being able to fix the intended values of multiple outputs simultaneously would be interesting. If you were to give more details about the solver's internals, I'd be keen to hear them.
I think the concept is solid. I’ve only had a few minutes of playing with it, but I have the opinion is that from a UX perspective constants are more common than variables. So perhaps a cell containing a constant should not have a #, but a variable should.
This is really cool! It's like Excel's goal seek but can also handle the case of arbitrary input cells. Goal seeek can only handle one input and one output cell.<p>But how do you handle the case where multiple variables can be changed? If multiple input cells is the key difference from Goal seek, i think some more rigor should be placed into the algorithm here<p>e.g. setting A1 + B1 and wanting the result to be 5. Currently it bumps both A1 and B1 equally. What's the thought process behind this?
Wow! See the classic <a href="https://en.wikipedia.org/wiki/TK_Solver" rel="nofollow">https://en.wikipedia.org/wiki/TK_Solver</a>
A bidirectional formula is also known as an integrity constraint in databases (plus some triggers for restoring the constraint upon updates)!
The idea is very interesting. As a default strategy you could preserve the ratio of inputs by scaling them to match the scaling of the output, instead of making them equal (for addition). Similarly, for multiplication, you could preserve the ratio of inputs as well, by scaling them by nth root of the scaling factor of the output.
The examples are great and these bidirectional calculators are something that people would love to have in traditional spreadsheets.<p>So much so that Credit Suisse, which basically was running everything on heavily modded Excel, created a full language whose outputs were Excel spreadsheets capable of doing that. That thing called “paradise” was a total monstrosity but showed how much people wanted this.<p>That said, you really need a way to set which cells are fixed and which cells are allowed to move if you want to move past basic examples.<p>Most times you know what you want to do. like => if the user modifies that cell, find a solution for those specific ones.<p>If you can enter that info, then you have a lot more constrains for your solver and will avoid a lot of edge cases where everything goes to 0, and you can check that the calculation entered is indeed reversible or not, or if it could have multiple solutions, and so on.
> and these bidirectional calculators are something that people would love to have in traditional spreadsheets<p>People want them in general programming languages too. I can't count the number of times I had to implement the same function multiple times, but backwards in various ways.
Constants are supported, use # as a prefix, e.g.; #50.<p>I'd like to add more constraints in the future like a domain constraint for variables.
Could you build an inverse kinematics solver with this? (I recently watched a youtube video of someone iteratively working out the solutions for a robotic arm, by alternating modifying the inputs and the results)
That's an interesting example I hadn't thought of. Probably? I'll need to try it. Thank you for the suggestion!<p>I think one issue will be that trig functions are kinda weird because they are non-injective. So they work but they are awkward (try solving cos(A1) = 0.5). Inverse kinematics is so well studied, you're probably better off using a dedicated algorithm.
Closed form solutions might be more efficient (in time, energy) and perhaps more numerically stable.
Phenomenal! This is a solid prototype, clean execution. I've had exactly this idea, too: I've already given the spreadsheet the relationships between these values, why can't it just work backwards when values change? That premise hides a ton of complexity, though, I'm sure. Lots of scary matrices.
Sympy can (often) solve under constrained systems in terms of the free variables. The problem I run into is discrete constraints that make solving less closed form and more combinatorial search. When textbook amplifier formulas significantly diverge from physical reality I model the errors as linear correction factors and use gradient descent to correct it in a few experiments, but I’m curious if there is software that has solved this problem.
Sounds a bit like Prolog with a Spreadsheet UI?
you might like <a href="https://omrelli.ug/g9/" rel="nofollow">https://omrelli.ug/g9/</a> which is a similar concept but for graphics
Reminds me of functional logical programming languages like Verse. when you specify the output and ask for the inputs, you get all possible inputs.
Hm? I don't get it.<p>What's the point of calculating backwards non-invertible operations such as addition? Isn't the result just arbitrary?
I made this mostly as a fun challenge :)<p>You are right that there is some arbitrariness involved when picking a solution, however it's a bit more subtle than that.<p>Let's say our problem has N free variables.<p>Step 1 is finding the subset of R^N that is the solution to the root finding problem. If this subset is a point, we are done (return that point). Note that if there is no solution at all bidicalc should correctly report it.<p>Step 2 is if the solution subset is not a point. Then there is multiple (maybe even an infinity of) solutions, and picking one is indeed arbitrary.
They said this:<p>> Even a normal spreadsheet is fairly complex beast. But the novel thing about bidicalc is the backwards solver. Mathematically, updating a spreadsheet "backward" is a (potentially underdetermined) root finding problem, because we are trying to find a vector of unknowns such that , where F is the function computed by the cells formulas, and G is the objective value entered in the cell. Note that F is not necessarily a single formula, but the result of composing an upstream graph of cells into a single function.<p>> The actual root-finding solver is a custom algorithm that I made. It a general purpose algorithm that will find one root of any continuous-almost-everywhere function for which a complete syntactic expression is known. It uses a mix of continuous constraint propagation on interval union arithmetic , directional Newton's method and dichotomic search. It is of course limited by floating point precision and available computation time.<p>But that really doesn't answer your question. I see no reason why the solver wouldn't decide every time it had a two-variable summation that ADD(X+Y) doesn't reverse to X=-90 and Y=100.
It is for fun. Commercial products do not support this because functions are generally not invertible.
I would expect the opposite.<p>Commercial products are run by product managers: they do whatever the business needs that day, and if it doesn't work for most inputs, "that's fine, our users will only ever need addition". Fun open source projects, run by the same programmer who does the implementation, obsess over finding the generic solution to inverting a function and end up with a version that isn't useful for anyone's specific case.
You could add hints as a feature to this. E.g. interest = rate × principle<p>The user hints principle is preferred fixed so they can see what rate is needed for a givem amount of interest.<p>Hints could have a precedence order (then prefer to fix earlier terms on an operation on a tie breaker.)
Very cool!<p>I'd love to see a version where cells are "torn off" and named as they were in Lotus Improv and one had a "formula pane" where one could see all the formulae for a spreadsheet.<p>Would it be possible to create this in Python so that it could be a part of pyspread?
LOL! Gemini suggested to implement this to me literally yesterday: bidirectional computations.
The example was that given a temperature in Celsius and Fahrenheit, modifying either of them should update their counterpart.
In angular that would be two linked signals for instance, but even that is a bit fringe. Gemini was going for something even more elaborated.<p>I told Gemini that spreadsheets were actually not doing that and that I had ways to implement that behavior without the complexity.<p>Just writing that to show the rabbit hole people are going to fall into if they let their llms go brrr. ;D<p>In any case, the problem is interesting.
The point was to include bi-directionality inside a graph of computations so that we didn't get bogged down by cycles.
The benefit being that it would handle float precision issues.<p>My more manual solution expect that floats precision issues are handled explicitly. I think that this level of explicitness is needed anyway for proper floating point error mitigation.
A 2d sketcher with constraints is kind of similar. For example the equation<p>A = B + C<p>Where A, B, C are the lengths of 3 parallel lines. Within the sketcher you can drag the length of any one of those lines and the other two will adjust to keep the constraints.
Thanks for sharing. It’s an alluring idea. I think you’d find the concept of statistical identifiability interesting.
Could this easily represent a Kalman filter and other typically complex control problems?<p>Makes me imagine plotting a inverted pendulum and other real time systems. Could a cell variable be set to Time?
Super cool! Well done. Now take it down and never let Microsoft get their hands on the code, or the entire economic system will go down in flames.
Excellent (sorry accidental pun)<p>This is a nice exploration.
interesting. like Excel Solver? or OpenSolver, Gurobi, other optimizers? or different objective?
Never used any of those, so I don't know! I'd be curious to read a comparison from anyone who knows about them.<p>I think what's pretty unique about the bidicalc solver that I made is that it does not depend on the previous input values to update backwards. It's truly solving the root finding problem. The advantage is that there are never any "stuck in a local optimum" problems with the solver. So you can solve difficult problems like polynomials, etc.
What do you use it for?
What a neat idea!
set A1 = 3
set B1 = 4<p>set C1 = A1 + B1 = 7<p>now change C1 = 14
expected A1 = 6
expected B1 = 8<p>what it did
A1 = 7
B1 = 7<p>great
amazing
[dead]
[dead]