As the saying goes, it is better not to see laws or sausages being made. You’d prefer to see the clean package on the outside than the mess behind the scenes.
The same is true of science. A good paper tells a nice, clean story: a logical argument from beginning to end, with no extra baggage to slow it down. That story isn’t a lie: for any decent paper in theoretical physics, the conclusions will follow from the premises. Most of the time, though, it isn’t how the physicist actually did it.
The way we actually make discoveries is messy. It involves looking for inspiration in all the wrong places: pieces of old computer code and old problems, trying to reproduce this or that calculation with this or that method. In the end, once we find something interesting enough, we can reconstruct a clearer, cleaner, story, something actually fit to publish. We hide the original mess partly for career reasons (easier to get hired if you tell a clean, heroic story), partly to be understood (a paper that embraced the mess of discovery would be a mess to read), and partly just due to that deep human instinct to not let others see us that way.
The trouble is, some of that “mess” is useful, even essential. And because it’s never published or put into textbooks, the only way to learn it is word of mouth.
A lot of these messy tricks involve numerics. Many theoretical physics papers derive things analytically, writing out equations in symbols. It’s easy to make a mistake in that kind of calculation, either writing something wrong on paper or as a bug in computer code. To correct mistakes, many things are checked numerically: we plug in numbers to make sure everything still works. Sometimes this means using an approximation, trying to make sure two things cancel to some large enough number of decimal places. Sometimes instead it’s exact: we plug in prime numbers, and can much more easily see if two things are equal, or if something is rational or contains a square root. Sometimes numerics aren’t just used to check something, but to find a solution: exploring many options in an easier numerical calculation, finding one that works, and doing it again analytically.
“Ansatze” are also common: our fancy word for an educated guess. These we sometimesadmit, when they’re at the core of a new scientific idea. But the more minor examples go un-mentioned. If a paper shows a nice clean formula and proves it’s correct, but doesn’t explain how the authors got it…probably, they used an ansatz. This trick can go hand-in-hand with numerics as well: make a guess, check it matches the right numbers, then try to see why it’s true.
The messy tricks can also involve the code itself. In my field we often use “computer algebra” systems, programs to do our calculations for us. These systems are programming languages in their own right, and we need to write computer code for them. That code gets passed around informally, but almost never standardized. Mathematical concepts that come up again and again can be implemented very differently by different people, some much more efficiently than others.
I don’t think it’s unreasonable that we leave “the mess” out of our papers. They would certainly be hard to understand otherwise! But it’s a shame we don’t publish our dirty tricks somewhere, even in special “dirty tricks” papers. Students often start out assuming everything is done the clean way, and start doubting themselves when they notice it’s much too slow to make progress. Learning the tricks is a big part of learning to be a physicist. We should find a better way to teach them.
In past years, I’ve compared science to a gift: the ideal gift for the puzzle-fan, one that keeps giving new puzzles. I think people might not appreciate the scale of that gift, though.
Maybe you’ve heard the old joke that studying for a PhD means learning more and more about less and less until you know absolutely everything about nothing at all. This joke is overstating things: even when you’ve specialized down to nothing at all, you still won’t know everything.
If you read the history of science, it might feel like there are only a few important things going on at a time. You notice the simultaneous discoveries, like calculus from Newton and Liebniz and natural selection from Darwin and Wallace. You can get the impression that everyone was working on a few things, the things that would make it into the textbooks. In fact, though, there was always a lot to research, always many interesting things going on at once. As a scientist, you can’t escape this. Even if you focus on your own little area, on a few topics you care about, even in a small field, there will always be more going on than you can keep up with.
This is especially clear around the holiday season. As everyone tries to get results out before leaving on vacation, there is a tidal wave of new content. I have five papers open on my laptop right now (after closing four or so), and some recorded talks I keep meaning to watch. Two of the papers are the kind of simultaneous discovery I mentioned: two differentgroups noticing that what might seem like an obvious fact – that in classical physics, unlike in quantum, one can have zero uncertainty – has unexpected implications for our kind of calculations. (A third group got there too, but hasn’t published yet.) It’s a link I would never have expected, and with three groups coming at it independently you’d think it would be the only thing to pay attention to: but even in the same sub-sub-sub-field, there are other things going on that are just as cool! It’s wild, and it’s not some special quirk of my area: that’s science, for all us scientists. No matter how much you expect it to give you, you’ll get more, lifetimes and lifetimes worth. That’s a Newtonmas gift to satisfy anyone.
I remember, a while back, visiting a friend in his office. He had just became a professor, and was still setting things up. I noticed a list on the chalkboard, taking up almost one whole side. Taking a closer look, I realized that list was a list of projects. To my young postdoc eyes, the list was amazing: how could one person be working on so many things?
There’s an idiom in English, “too many irons in the fire”. You can imagine a blacksmith forging many things at once, each piece of iron taking focus from the others. Too many, and a piece might break, or otherwise fail.
In theoretical physics, a typical PhD publishes three papers before they graduate. That usually means one project at a time, maybe two. For someone used to one or two irons in the fire, so many at a time seems an impossible feat.
Scientists grow over their careers, though, and in more than one way. What seems impossible can eventually be business as usual.
First, as your skill grows, you become more efficient. A lot of scientific work is a kind of debugging: making mistakes, and figuring out how to fix them. The more experience you have, the more you know what kinds of mistakes you might make, and the better you will be at avoiding them. (Never perfect, of course: scientists always have to debug something.)
Second, your collaborations grow. The more people you work with, the more you can share these projects, each person contributing their own piece. With time, you start supervising as well: Masters students, PhD students, postdocs. Each one adds to the number of irons you can manage in your fire. While for bad supervisors this just means having their name on lots of projects, the good supervisors will be genuinely contributing to each one. That’s yet another kind of growth: as you get further along, you get a better idea of what works and what doesn’t, so even in a quick meeting you can solve meaningful problems.
Third, you grow your system. The ideas you explore early on blossom into full-fledged methods, tricks which you can pull out again and again when you need them. The tricks combine, forming new, bigger tricks, and eventually a long list of projects becomes second nature, a natural thing your system is able to do.
As you grow as a scientist, you become more than just one researcher, one debugger at a laptop or pipetter at a lab bench. You become a research program, one that manifests across many people and laptops and labs. As your expertise grows, you become a kind of living exchange of ideas, concepts flowing through you when needed, building your own scientific world.
It’s time for my yearly Halloween post. My regular readers know what to expect: a horror trope and a physics topic, linked by a tortured analogy. And this year, the pun is definitely intended.
Horror movies have a fascination with serial killers. Over the years, they’ve explored every possible concept: from gritty realism to the supernatural, crude weapons to sophisticated traps, motivations straightforward to mysterious, and even killers who are puppets.
One common theme of all fictional serial killers is power. Serial killers are scary because they have almost all the power in a situation, turned to alien and unpredictable goals. The protagonists of a horror film are the underdogs, never knowing whether the killer will pull out some new ability or plan that makes everything they try irrelevant. Even if they get the opportunity to negotiate, the power imbalance means that they can’t count on getting what they need: anything the killer agrees will be twisted to serve their own ends.
Academics tell their own kind of horror stories. Earlier this month, the historian Brett Deveraux had a blog post about graduate school, describing what students go through to get a PhD. As he admits, parts of his story only apply to the humanities. STEM departments have more money, and pay their students a bit better. It’s not a lot better (I was making around $20,000 a year at Stony Brook), but it’s enough that I’ve never heard of a student taking out a loan to make ends meet. (At most, people took on tutoring jobs for a bit of extra cash.) We don’t need to learn new languages, and our degrees take a bit less time: six or seven years for an experimental physicist, and often five for a theoretical physicist. Finally, the work can be a lot less lonely, especially for those who work in a lab.
Still, there is a core in common, and that core once again is power. Universities have power, of course: and when you’re not a paying customer but an employee with your career on the line, that power can be quite scary. But the person with the most power over a PhD student is their advisor. Deveraux talks compellingly about the difference that power can make: how an advisor who is cruel, or indifferent, or just clueless, can make or break not just your career but your psychological well-being. The lucky students, like Deveraux and me, find supportive mentors who help us survive and move forward. The unlucky students leave with scars, even if those scars aren’t jigsaw-shaped.
Neither Deveraux or I have experience with PhD programs in Europe, which are quite different in structure from those in the US. But the power imbalance is still there, and still deadly, and so despite the different structure, I’ve seen students here break down, scarred in the same way.
Deveraux frames his post as advice for those who want to go to grad school, and his first piece of advice is “Have you tried wanting something else?” I try to echo that when I advise students. I don’t always succeed: there’s something exciting about a young person interested in the same topics we’re interested in, willing to try to make a life of it. But it is important to know what you’re getting into, and to know there’s a big world out there of other options. If, after all that, you decide to stick through it, just remember: power matters. If you give someone power over you, try to be as sure as you can that it won’t turn into a horror story.
I should say mostly in person. It’s a hybrid conference, with some speakers and attendees joining on Zoom. Some couldn’t make it because of travel restrictions, or just wanted to be cautious about COVID. But seemingly just as many had other reasons, like teaching schedules or just long distances, that kept them from coming in person. We’re all wondering if this will become a long-term trend, where the flexibility of hybrid conferences lets people attend no matter their constraints.
The hybrid format worked better than expected, but there were still a few kinks. The audio was particularly tricky, it seemed like each day the organizers needed a new microphone setup to take questions. It’s always a little harder to understand someone on Zoom, especially when you’re sitting in an auditorium rather than focused on your own screen. Still, technological experience should make this work better in future.
Content-wise, the conference began with a “mini-school” of pedagogical talks on particle physics, string theory, and mathematics. I found the mathematical talks by Erik Panzer particularly nice, it’s a topic I still feel quite weak on and he laid everything out in a very clear way. It seemed like a nice touch to include a “school” element in the conference, though I worry it ate too much into the time.
The rest of the content skewed more mathematical, and more string-theoretic, than these conferences have in the past. The mathematical content ranged from intriguing (including an interesting window into what it takes to get high-quality numerics) to intimidatingly obscure (large commutative diagrams, category theory on the first slide). String theory was arguably under-covered in prior years, but it felt over-covered this year. With the particle physics talks focusing on either general properties with perhaps some connections to elliptics, or to N=4 super Yang-Mills, it felt like we were missing the more “practical” talks from past conferences, where someone was computing something concrete in QCD and told us what they needed. Next year is in Mainz, so maybe those talks will reappear.
In science, every project is different. Sometimes, my collaborators and I have a clear enough goal, and a clear enough way to get there. There are always surprises along the way, of course, but nonetheless we keep a certain amount of structure. That can mean dividing tasks (“you find the basis, I’ll find the constraints”), or it can mean everyone doing the same work in parallel, like a group of students helping each other with homework.
Recently, I’ve experienced a different kind of collaboration. The goals are less clear, and the methods are more…playful.
A big task improves with collaboration: you can divide it up. A delicate task improves with collaboration: you can check each other’s work. An unclear task also improves with collaboration: you can explore more ground.
Picture a bunch of children playing in a sandbox. The children start out sitting by themselves, each digging in the sand. Some are building castles, others dig moats, or search for buried treasure, or dinosaur bones. As the children play, their games link up: the moat protects the castle, the knights leave for treasure, the dinosaur awakens and attacks. The stories feed back on one another, and the game grows.
The project I’m working on now is a bit like that sandbox. Each of us has our own ideas about what we’d like to build, and each experiments with them. We see what works and what doesn’t, which castles hold and which fall over. We keep an eye on what each other are doing, and adjust: if that castle is close to done, maybe a moat would improve the view. Piece by piece, the unclear task becomes clearer. Our individual goals draw us in different directions, but what we discover in the end brings us back together, richer for our distant discoveries.
Working this way requires a lot of communication! In the past, I was mystified when I saw other physicists spend hours talking at a blackboard. I thought that must be a waste of time: surely they’d get more done if they sat at their desks and worked things out, rather than having to talk through every step. Now I realize they were likely part of a different kind of collaboration: not dividing tasks or working in parallel on a clear calculation, but exploring different approaches. In these collaborations, those long chats are a kind of calibration: by explaining what you’re trying to do, you see whether it makes sense to your collaborators. You can drop the parts that don’t make sense and build in some of your collaborators’ ideas. In the end you begin to converge, to something that everyone can endorse. Your sandcastles meet up, your stories become one story. When everything looks good, you’re ready to call over your mom (or in this case, the arXiv) and show it off.
Now that I’ve rested up after this year’s Amplitudes, I’ll give a few of my impressions.
Overall, I think the conference went pretty well. People seemed amused by the digital Niels Bohr, even if he looked a bit like a puppet (Lance compared him to Yoda in his final speech, which was…apt). We used Gather.town, originally just for the poster session and a “virtual reception”, but later we also encouraged people to meet up in it during breaks. That in particular was a big hit: I think people really liked the ability to just move around and chat in impromptu groups, and while nobody seemed to use the “virtual bar”, the “virtual beach” had a lively crowd. Time zones were inevitably rough, but I think we ended up with a good compromise where everyone could still see a meaningful chunk of the conference.
A few things didn’t work as well. For those planning conferences, I would strongly suggest not making a brand new gmail account to send out conference announcements: for a lot of people the emails went straight to spam. Zulip was a bust: I’m not sure if people found it more confusing than last year’s Slack or didn’t notice it due to the spam issue, but almost no-one posted in it. YouTube was complicated: the stream went down a few times and I could never figure out exactly why, it may have just been internet issues here at the Niels Bohr Institute (we did have a power outage one night and had to scramble to get internet access back the next morning). As far as I could tell YouTube wouldn’t let me re-open the previous stream so each time I had to post a new link, which probably was frustrating for those following along there.
That said, this was less of a problem than it might have been, because attendance/”viewership” as a whole was lower than expected. Zoomplitudes last year had massive numbers of people join in both on Zoom and via YouTube. We had a lot fewer: out of over 500 registered participants, we had fewer than 200 on Zoom at any one time, and at most 30 or so on YouTube. Confusion around the conference email might have played a role here, but I suspect part of the difference is simple fatigue: after over a year of this pandemic, online conferences no longer feel like an exciting new experience.
The actual content of the conference ranged pretty widely. Some people reviewed earlier work, others presented recent papers or even work-in-progress. As in recent years, a meaningful chunk of the conference focused on applications of amplitudes techniques to gravitational wave physics. This included a talk by Thibault Damour, who has by now mostly made his peace with the field after his early doubts were sorted out. He still suspected that the mismatch of scales (weak coupling on the one hand, classical scattering on the other) would cause problems in future, but after his work withLaporta and Mastrolia even he had to acknowledge that amplitudes techniques were useful.
In the past I would have put the double-copy and gravitational wave researchers under the same heading, but this year they were quite distinct. While a few of the gravitational wave talks mentioned the double-copy, most of those who brought it up were doing something quite a bit more abstract than gravitational wave physics. Indeed, several people were pushing the boundaries of what it means to double-copy. There were modified KLT kernels, different versions of color-kinematics duality, and explorations of what kinds of massive particles can and (arguably more interestingly) cannot be compatible with a double-copy framework. The sheer range of different generalizations had me briefly wondering whether the double-copy could be “too flexible to be meaningful”, whether the right definitions would let you double-copy anything out of anything. I was reassured by the points where each talk argued that certain things didn’t work: it suggests that wherever this mysterious structure comes from, its powers are limited enough to make it meaningful.
A fair number of talks dealt with what has always been our main application, collider physics. There the context shifted, but the message stayed consistent: for a “clean” enough process two or three-loop calculations can make a big difference, taking a prediction that would be completely off from experiment and bringing it into line. These are more useful the more that can be varied about the calculation: functions are more useful than numbers, for example. I was gratified to hear confirmation that a particular kind of process, where two massless particles like quarks become three massive particles like W or Z bosons, is one of these “clean enough” examples: it means someone will need to compute my “tardigrade” diagram eventually.
If collider physics is our main application, N=4 super Yang-Mills has always been our main toy model. Jaroslav Trnka gave us the details behind Nima’s exciting talk from last year, and Nima had a whole new exciting talk this year with promised connections to category theory (connections he didn’t quite reach after speaking for two and a half hours). Anastasia Volovich presented two distinct methods for predicting square-root symbol letters, while my colleague Chi Zhang showed some exciting progress with the elliptic double-box, realizing the several-year dream of representing it in a useful basis of integrals and showcasing several interesting properties. Anne Spiering came over from the integrability side to show us just how special the “planar” version of the theory really is: by increasing the number of colors of gluons, she showed that one could smoothly go between an “integrability-esque” spectrum and a “chaotic” spectrum. Finally, Lance Dixon mentioned his progress with form-factors in his talk at the end of the conference, showing off some statistics of coefficients of different functions and speculating that machine learning might be able to predict them.
On the more mathematical side, Francis Brown showed us a new way to get numbers out of graphs, one distinct but related to our usual interpretation in terms of Feynman diagrams. I’m still unsure what it will be used for, but the fact that it maps every graph to something finite probably has some interesting implications. Albrecht Klemm and Claude Duhr talked about two sides of the same story, their recent work on integrals involving Calabi-Yau manifolds. They focused on a particular nice set of integrals, and time will tell whether the methods work more broadly, but there are some exciting suggestions that at least parts will.
There’s been a resurgence of the old dream of the S-matrix community, constraining amplitudes via “general constraints” alone, and several talks dealt with those ideas. Sebastian Mizera went the other direction, and tried to test one of those “general constraints”, seeing under which circumstances he could prove that you can swap a particle going in with an antiparticle going out. Others went out to infinity, trying to understand amplitudes from the perspective of the so-called “celestial sphere” where they appear to be governed by conformal field theories of some sort. A few talks dealt with amplitudes in string theory itself: Yvonne Geyer built them out of field-theory amplitudes, while Ashoke Sen explained how to include D-instantons in them.
We also had three “special talks” in the evenings. I’ve mentioned Nima’s already. Zvi Bern gave a retrospective talk that I somewhat cheesily describe as “good for the soul”: a look to the early days of the field that reminded us of why we are who we are. Lance Dixon closed the conference with a light-hearted summary and a look to the future. That future includes next year’s Amplitudes, which after a hasty discussion during this year’s conference has now localized to Prague. Let’s hope it’s in person!
I’m busy this week with Amplitudes 2021. Being behind the “organizer’s desk” for one of these conferences is an entirely different experience. There’s a lot to keep track of, keeping the Zoom going smoothly, the website up to date, and the YouTube stream running. Luckily we have good help, a team of students handling a lot of the more finicky details. I think we’ve been putting on a good conference, but there are definitely lessons I’ve learned for the next time I host something.
The content has been interesting too of course, and despite being busy I’ve still gotten to watch the talks. I’ll say more about this after the conference, there have been quite a few interesting developments in the past year.
In my line of work, I spend a lot of time explaining physics. I write posts here of course, and give the occasional public lecture. I also explain physics when I supervise Master’s students, and in a broader sense whenever I chat with my collaborators or write papers. I’ll explain physics even more when I start teaching. But of all the ways to explain physics, there’s one that has always been my favorite: the one-on-one conversation.
Talking science one-on-one is validating in a uniquely satisfying way. You get instant feedback, questions when you’re unclear and comprehension when you’re close. There’s a kind of puzzle to it, discovering what you need to fill in the gaps in one particular person’s understanding. As a kid, I’d chase this feeling with imaginary conversations: I’d plot out a chat with Democritus or Newton, trying to explain physics or evolution or democracy. It was a game, seeing how I could ground our modern understanding in concepts someone from history already knew.
I’ll never get a chance in real life to explain physics to a Democritus or a Newton, to bridge a gap quite that large. But, as I’ve discovered over the years, everyone has bits and pieces they don’t yet understand. Even focused on the most popular topics, like black holes or elementary particles, everyone has gaps in what they’ve managed to pick up. I do too! So any conversation can be its own kind of adventure, discovering what that one person knows, what they don’t, and how to connect the two.
Of course, there’s fun in writing and public speaking too (not to mention, of course, research). Still, I sometimes wonder if there’s a career out there in just the part I like best: just one conversation after another, delving deep into one person’s understanding, making real progress, then moving on to the next. It wouldn’t be efficient by any means, but it sure sounds fun.
The scientific method, as we usually learn it, starts with a hypothesis. The scientist begins with a guess, and asks a question with a clear answer: true, or false? That guess lets them design an experiment, observe the consequences, and improve our knowledge of the world.
But where did the scientist get the hypothesis in the first place? Often, through some form of exploratory research.
Exploratory research is research done, not to answer a precise question, but to find interesting questions to ask. Each field has their own approach to exploration. A psychologist might start with interviews, asking broad questions to find narrower questions for a future survey. An ecologist might film an animal, looking for changes in its behavior. A chemist might measure many properties of a new material, seeing if any stand out. Each approach is like digging for treasure, not sure of exactly what you will find.
Mathematicians and theoretical physicists don’t do experiments, but we still need hypotheses. We need an idea of what we plan to prove, or what kind of theory we want to build: like other scientists, we want to ask a question with a clear, true/false answer. And to find those questions, we still do exploratory research.
What does exploratory research look like, in the theoretical world? Often, it begins with examples and calculations. We can start with a known method, or a guess at a new one, a recipe for doing some specific kind of calculation. Recipe in hand, we proceed to do the same kind of calculation for a few different examples, covering different sorts of situation. Along the way, we notice patterns: maybe the same steps happen over and over, or the result always has some feature.
We can then ask, do those same steps always happen? Does the result really always have that feature? We have our guess, our hypothesis, and our attempt to prove it is much like an experiment. If we find a proof, our hypothesis was true. On the other hand, we might not be able to find a proof. Instead, exploring, we might find a counterexample – one where the steps don’t occur, the feature doesn’t show up. That’s one way to learn that our hypothesis was false.
This kind of exploration is essential to discovery. As scientists, we all have to eventually ask clear yes/no questions, to submit our beliefs to clear tests. But we can’t start with those questions. We have to dig around first, to observe the world without a clear plan, to get to a point where we have a good question to ask.