Getting It Right vs Getting It Done

With all the hype around machine learning, I occasionally get asked if it could be used to make predictions for particle colliders, like the LHC.

Physicists do use machine learning these days, to be clear. There are tricks and heuristics, ways to quickly classify different particle collisions and speed up computation. But if you’re imagining something that replaces particle physics calculations entirely, or even replace the LHC itself, then you’re misunderstanding what particle physics calculations are for.

Why do physicists try to predict the results of particle collisions? Why not just observe what happens?

Physicists make predictions not in order to know what will happen in advance, but to compare those predictions to experimental results. If the predictions match the experiments, that supports existing theories like the Standard Model. If they don’t, then a new theory might be needed.

Those predictions certainly don’t need to be made by humans: most of the calculations are done by computers anyway. And they don’t need to be perfectly accurate: in particle physics, every calculation is an approximation. But the approximations used in particle physics are controlled approximations. Physicists keep track of what assumptions they make, and how they might go wrong. That’s not something you can typically do in machine learning, where you might train a neural network with millions of parameters. The whole point is to be able to check experiments against a known theory, and we can’t do that if we don’t know whether our calculation actually respects the theory.

That difference, between caring about the result and caring about how you got there, is a useful guide. If you want to predict how a protein folds in order to understand what it does in a cell, then you will find AlphaFold useful. If you want to confirm your theory of how protein folding happens, it will be less useful.

Some industries just want the final result, and can benefit from machine learning. If you want to know what your customers will buy, or which suppliers are cheating you, or whether your warehouse is moldy, then machine learning can be really helpful.

Other industries are trying, like particle physicists, to confirm that a theory is true. If you’re running a clinical trial, you want to be crystal clear about how the trial data turn into statistics. You, and the regulators, care about how you got there, not just about what answer you got. The same can be true for banks: if laws tell you you aren’t allowed to discriminate against certain kinds of customers for loans, you need to use a method where you know what traits you’re actually discriminating against.

So will physicists use machine learning? Yes, and more of it over time. But will they use it to replace normal calculations, or replace the LHC? No, that would be missing the point.

3 thoughts on “Getting It Right vs Getting It Done

  1. subhydrogen

    Hm… Several colleagues of mine applied to the artificial intellect asking about inertons (my inventions) and the neutrino whose theory I just recently constructed. The AI produced so stupid things, that we were ashamed of the AI and its developers. Conclusion: The AI cannot create something very new at all.

    Like

    Reply
  2. JollyJoker

    I’d say it’s a matter of defining problems where AI can be useful. Generating formulae that fit a set of criteria could be useful and should be easy to train on, if the answers can be validated automatically.

    So, not checking if theories are true, but coming up with candidate theories.

    Like

    Reply
    1. 4gravitons Post author

      Yeah, “hard to find, easy to verify” is indeed the sweet spot where people are trying to use this stuff. It’s what the people looking for Calabi-Yaus use it for, and it’s what a few people are trying to do in the amplitudes field, using it to guess functions which can be checked by other methods.

      Literally coming up with candidate theories is probably not useful (aside from on the Calabi-Yau side, where that is kind of what they’re doing). If you can parametrize a space of theories then you can usually search it the “normal way”: it’s not so multi-dimensional that you need machine learning, you can usually just use classical statistics or optimization methods to solve for the optimal parameter values given your evidence.

      Liked by 1 person

      Reply

Leave a comment! If it's your first time, it will go into moderation.