Shared posts

09 Jun 17:39

MiniBooNE

by John Baez

Big news! An experiment called MiniBooNE at Fermilab in Chicago has found more evidence that neutrinos are not acting as the Standard Model says they should:

• The MiniBooNE Collaboration, Observation of a significant excess of electron-like events in the MiniBooNE short-baseline neutrino experiment.

In brief, the experiment creates a beam of muon neutrinos (or antineutrinos—they can do either one). Then they check, with a detector 541 meters away, to see if any of these particles have turned into electron neutrinos (or antineutrinos). They’ve been doing this since 2002, and they’ve found a small tendency for this to happen.

This seems to confirm findings of the Liquid Scintillator Neutrino Detector or ‘LSND’ at Los Alamos, which did a similar experiment in the 1990s. People in the MiniBooNE collaboration claim that if you take both experiments into account, the results have a statistical significance of 6.1 σ.

This means that if the Standard Model is correct and there’s no experimental error or other mistake, the chance of seeing what these experiments saw is about 1 in 1,000,000,000.

There are 3 known kinds of neutrinos: electron, muon and tau neutrinos. Neutrinos of any kind are already known to turn into those of other kinds: these are called neutrino oscillations, and they were first discovered in the 1960’s, when it was found that 1/3 as many electron neutrinos were coming from the Sun as expected.

At the time this was a big surprise, because people thought neutrinos were massless, moved at the speed of light, and thus didn’t experience the passage of time. Back then, the Standard Model looked like this:

The neutrinos stood out as weird in two ways: we thought they were massless, and we thought they only come in a left-handed form—meaning roughly that they spin clockwise around the axis they’re moving along.

People did a bunch of experiments and wound up changing the Standard Model. Now we know neutrinos have nonzero mass. Their masses, and also neutrino oscillations, are described using a 3×3 matrix called the lepton mixing matrix. This is not a wacky idea: in fact, quarks are described using a similar 3×3 matrix called the quark mixing matrix. So, the current-day Standard Model is more symmetrical than the earlier version: leptons are more like quarks.

There is, however, still a big difference! We haven’t seen right-handed neutrinos.

MiniBooNE and LSND are seeing muon neutrinos turn into electron neutrinos much faster than the Standard Model theory of neutrino oscillations predicts. There seems to be no way to adjust the parameters of the lepton mixing matrix to fit the data from all the other experiments people have done, and also the MiniBooNE–LSND data. If this is really true, we need a new theory of physics.

And this is where things get interesting.

The most conservative change to the Standard Model would be to add three right-handed neutrinos to go along with the three left-handed ones. This would not be an ugly ad hoc trick: it would make the Standard Model more symmetrical, by making leptons even more like quarks.

If we do this in the most beautiful way—making leptons as similar to quarks as we can get away with, given their obvious differences—the three new right-handed neutrinos will be ‘sterile’. This means that they will interact only with the Higgs boson and gravity: not electromagnetism, the weak force or the strong force. This is great, because it would mean there’s a darned good reason we haven’t seen them yet!

Neutrinos are already very hard to detect, since they don’t interact with electromagnetism or the strong force. They only interact with the Higgs boson (that’s what creates their mass, and oscillations), gravity (because they have energy), and the weak force (which is how we create and detect them). A ‘sterile’ neutrino—one that also didn’t interact with the weak force—would be truly elusive!

In practice, the main way to detect sterile neutrinos would be via oscillations. We could create an ordinary neutrino, and it might turn into a sterile neutrino, and then back into an ordinary neutrino. This would create new kinds of oscillations.

And indeed, MiniBooNE and LSND seem to be seeing new oscillations, much more rapid than those predicted by the Standard Model and our usual best estimate of the lepton mixing matrix.

So, people are getting excited! We may have found sterile neutrinos.

There’s a lot more to say. For example, the SO(10) grand unified theory predicts right-handed neutrinos in a very beautiful way, so I’m curious about what the new data implies about that. There are also questions about whether a sterile neutrino could explain dark matter… or what limits astronomical observations place on the properties of sterile neutrinos. One should also wonder about the possibility of experimental error!

I would enjoy questions that probe deeper into this subject, since they might force me to study and learn more. Right now I have to go to Joshua Tree! But I’ll come back and answer your questions tomorrow morning.





08 Jun 22:04

Applied Category Theory: Resource Theories

by john
MathML-enabled post (click for more details).

My course on applied category theory is continuing! After a two-week break where the students did exercises, I went back to lecturing about Fong and Spivak’s book Seven Sketches. The second chapter is about ‘resource theories’.

MathML-enabled post (click for more details).

Resource theories help us answer questions like this:

  1. Given what I have, is it possible to get what I want?
  2. Given what I have, how much will it cost to get what I want?
  3. Given what I have, how long will it take to get what I want?
  4. Given what I have, what is the set of ways to get what I want?

Resource theories in their modern form were arguably born in these papers:

We are lucky to have Tobias in our course, helping the discussions along! He’s already posted some articles on resource theory on the Azimuth blog:

In the course, we had fun bouncing between the relatively abstract world of monoidal preorders and their very concrete real-world applications to chemistry, scheduling, manufacturing and other topics. Here are the lectures:

08 Jun 21:53

Money for nothing: the truth about universal basic income

by Carrie Arnold

Money for nothing: the truth about universal basic income

Money for nothing: the truth about universal basic income, Published online: 30 May 2018; doi:10.1038/d41586-018-05259-x

Several projects are testing the idea of doling out funds that people can use however they want.
29 May 22:28

Unless You Don't Program Them To Do That

by noreply@blogger.com (Atrios)
I've long said safety isn't the real issue with self-driving cars, in that if they work they'll be safe enough, and that programming them not to hit things has to be the bare minimum easiest thing to do. Even this isn't *that* easy as there is a bit of a problem at high speeds. They don't actually see that far ahead at the moment. Still. "If see object, brake or turn." Not hard.

Unless, of course, you don't tell them to do that.

Uber’s vehicle used Volvo software to detect external objects. Six seconds before striking Herzberg, the system detected her but didn’t identify her as a person. The car was traveling at 43 mph.

The system determined 1.3 seconds before the crash that emergency braking would be needed to avert a collision. But the vehicle did not respond, striking Herzberg at 39 mph.


And why was that? Oh.

According to Uber, emergency braking maneuvers are not enabled while the vehicle is under computer control, to reduce the potential for erratic vehicle behavior. The vehicle operator is relied on to intervene and take action. The system is not designed to alert the operator.

There's a lot of chatter about where exactly the civil liability is going to fall for these things. What about the criminal liability?
23 May 17:56

Effective thermodynamics for a marginal observer

by tomate

Suppose you receive an email from someone who claims “here is the project of a machine that runs forever and ever and produces energy for free!”. Obviously he must be a crackpot. But he may be well-intentioned. You opt for not being rude, roll your sleeves, and put your hands into the dirt, holding the Second Law as lodestar.

Keep in mind that there are two fundamental sources of error: either he is not considering certain input currents (“hey, what about that tiny hidden cable entering your machine from the electrical power line?!”, “uh, ah, that’s just to power the “ON” LED”, “mmmhh, you sure?”), or else he is not measuring the energy input correctly (“hey, why are you using a Geiger counter to measure input voltages?!”, “well, sir, I ran out of voltmeters…”).

In other words, the observer might only have partial information about the setup, either in quantity or quality, because he has been marginalized by society (most crackpots believe they are misunderstood geniuses). Therefore we will call such observer “marginal”, which incidentally is also the word that mathematicians use when they focus on the probability of a subset of stochastic variables… In fact, our modern understanding of thermodynamics as embodied in statistical mechanics and stochastic processes is founded (and funded) on ignorance: we never really have “complete” information.
If we actually had, all energy would look alike, it would not come in “more refined” and “less refined” forms, there would not be a differentials of order/disorder (using Paul Valery’s beautiful words), and that would end thermodynamic reasoning, the energy problem, and generous research grants altogether.

Even worse, within this statistical approach we might be missing chunks of information because some parts of the system are invisible to us. But then, what warrants that  we are doing things right, and he (our correspondent) is the crackpot? Couldn’t it be the other way around? Here I would like to present some recent ideas I’ve been working on together with some collaborators on how to deal with incomplete information about the sources of dissipation of a thermodynamic system. I will do this in a quite theoretical manner, but somehow I will mimic the guidelines suggested above for debunking crackpots. My three buzzwords will be: marginal, effective, and operational.

“COMPLETE” THERMODYNAMICS: AN OUT-OF-THE-BOX VIEW

The laws of thermodynamics that I address are:

  • The good ol’ Second Law (2nd)
  • The Fluctuation-Dissipation Relation (FDR), and the Reciprocal Relation (RR) close to equilibrium
  • The more recent Fluctuation Relation (FR)1 and its corollary the Integral FR (IFR), that have been discussed on this blog in a remarkable post by Matteo Smerlak.

The list above is all in the “area of the second law”. How about the other laws? Well, thermodynamics has for long been a phenomenological science, a patchwork.  So-called Stochastic Thermodynamics is trying to put some order in it by systematically grounding thermodynamic claims in (mostly Markov) stochastic processes. But it’s not an easy task, because the different laws of thermodynamics live in somewhat different conceptual planes. And it’s not even clear if they are theorems, prescriptions, habits (a bit like in jurisprudence…2). Within Stochastic Thermodynamics, the Zeroth Law is so easy nobody cares to formulate it (I do, so stay tuned…). The Third Law: no idea, let me know. As regards the First Law (or, better, “laws”, as many as there are conserved quantities across the system/environment interface…), we will assume that all related symmetries have been exploited from the offset to boil down the description to a minimum.

1

This minimum is as follows. We identify a system that is well separated from its environment. The system evolves in time, the environment is so large that its state does not evolve within the timescales of the system3. When tracing out the environment from the description, an uncertainty falls upon the system’s evolution. We assume the system’s dynamics to be described by a stochastic Markovian process.

How exactly the system evolves and what is the relationship between system and environment will be described in more detail below. Here let us take an “out of the box” view. We resolve the environment into several reservoirs labeled by index \alpha. Each of these reservoirs is “at equilibrium” on its own (whatever that means… 4). Now, the idea is that each reservoir tries to impose “its own equilibrium” on the system, and that their competition leads to a flow of currents across the system/environment interface. Each time an amount of the reservoir’s resource crosses the interface, a “thermodynamic cost” has to be to be paid or gained (be it a chemical potential difference for a molecule to go through a membrane, or a temperature gradient for photons to be emitted/absorbed, etc.).

The fundamental quantities of stochastic thermo-dynamic modeling thus are:

  • On the “-dynamic” side: the time-integrated currents \Phi^t_\alpha, independent among themselves5. Currents are stochastic variables distributed with joint probability density

P(\{\Phi_\alpha\}_\alpha)

  • On the “thermo-” side: The so-called thermodynamic forces or “affinities”6 \mathcal{A}_\alpha  (collectively denoted \mathcal{A}). These are tunable parameters that characterize reservoir-to-reservoir gradients, and they are not stochastic. For convenience, we conventionally take them all positive.

Dissipation is quantified by the entropy production:

\sum \mathcal{A}_\alpha \Phi^t_\alpha

We are finally in the position to state the main results. Be warned that in the following expressions the exact treatment of time and its scaling would require a lot of specifications, but keep in mind that all these relations hold true in the long-time limit, and that all cumulants scale linearly with time.

  • FR: The probability of observing positive currents is exponentially favoured with respect to negative currents according to

P(\{\Phi_\alpha\}_\alpha) / P(\{-\Phi_\alpha\}_\alpha) = \exp \sum \mathcal{A}_\alpha \Phi^t_\alpha

Comment: This is not trivial, it follows from the explicit expression of the path-integral, see below.

  • IFR: The exponential of minus the entropy production is unity

\big\langle  \exp - \sum \mathcal{A}_\alpha \Phi^t_\alpha  \big\rangle_{\mathcal{A}} =1

Homework: Derive this relation from the FR in one line.

  • 2nd Law: The average entropy production is not negative

\sum \mathcal{A}_\alpha \left\langle \Phi^t_\alpha \right\rangle_{\mathcal{A}} \geq 0

Homework: Derive this relation using Jensen’s inequality.

  • Equilibrium: Average currents vanish if and only if affinities vanish:

\left\langle \Phi^t_\alpha \right\rangle_{\mathcal{A}} \equiv 0, \forall \alpha \iff  \mathcal{A}_\alpha \equiv 0, \forall \alpha

Homework: Derive this relation taking the first derivative w.r.t.  {\mathcal{A}_\alpha} of the IFR. Notice that also the average depends on the affinities.

  • S-FDR: At equilibrium, it is impossible to tell whether a current is due to a spontaneous fluctuation (quantified by its variance) or to an external perturbation (quantified by the response of its mean). In a symmetrized (S-) version:

\left.  \frac{\partial}{\partial \mathcal{A}_\alpha}\left\langle \Phi^t_{\alpha'} \right\rangle \right|_{0} + \left.  \frac{\partial}{\partial \mathcal{A}_{\alpha'}}\left\langle \Phi^t_{\alpha} \right\rangle \right|_{0} = \left. \left\langle \Phi^t_{\alpha} \Phi^t_{\alpha'} \right\rangle \right|_{0}

Homework: Derive this relation taking the mixed second derivatives w.r.t.  {\mathcal{A}_\alpha} of the IFR.

  • RR: The reciprocal response of two different currents to a perturbation of the reciprocal affinities close to equilibrium is symmetrical:

\left.  \frac{\partial}{\partial \mathcal{A}_\alpha}\left\langle \Phi^t_{\alpha'} \right\rangle \right|_{0} - \left.  \frac{\partial}{\partial \mathcal{A}_{\alpha'}}\left\langle \Phi^t_{\alpha} \right\rangle \right|_{0} = 0

Homework: Derive this relation taking the mixed second derivatives w.r.t.  {\mathcal{A}_\alpha} of the FR.

Notice the implication scheme: FR => IFR => 2nd, IFR => S-FDR, FR => RR.

“MARGINAL” THERMODYNAMICS (STILL OUT-OF-THE-BOX)

Now we assume that we can only measure a marginal subset of currents \{\Phi_\mu^t\}_\mu \subset \{\Phi_\alpha^t\}_\alpha (index \mu always has a smaller range than \alpha), distributed with joint marginal probability

P(\{\Phi_\mu\}_\mu) = \int \prod_{\alpha \neq \mu} d\Phi_\alpha \, P(\{\Phi_\alpha\}_\alpha)

2

Notice that a state where these marginal currents vanish might not be an equilibrium, because other currents might still be whirling around. We call this a stalling state.

\mathrm{stalling:} \qquad \langle \Phi_\mu \rangle \equiv 0,  \quad \forall \mu

My central question is: can we associate to these currents some effective affinity \mathcal{Q}_\mu in such a way that at least some of the results above still hold true? And, are all definitions involved just a fancy mathematical construct, or are them operational?

First the bad news: In general the FR is violated for all choices of effective affinities:

P(\{\Phi_\mu\}_\mu) / P(\{-\Phi_\mu\}_\mu) \neq \exp \sum \mathcal{Q}_\mu \Phi^t_\mu

This is not surprising and nobody would expect that. How about the IFR?

  • Marginal IFR: There are effective affinities such that

\left\langle \exp - \sum \mathcal{Q}_\mu \Phi^t_\mu \right\rangle_{\mathcal{A}} =1

Mmmhh. Yeah. Take a closer look this expression: can you see why there actually exists an infinite choice of “effective affinities” that would make that average cross 1? Which on the other hand is just a number, so who even cares? So this can’t be the point.

Fact is the IFR per se is hardly of any practical interest, as are all “asbolutes” in physics. What matters is “relatives”: in our case, response. But then we need to specify how the effective affinities depend on the “real” affinities. And here steps in a crucial technicality, whose precise argumentation is a pain. Basing on reasonable assumptions7, we demonstrate that the IFR holds for the following choice of effective affinities:

\mathcal{Q}_\mu = \mathcal{A}_\mu - \mathcal{A}^{\mathrm{stalling}}_\mu,

where \mathcal{A}^{\mathrm{stalling}} is the set of values of the affinities that make marginal currents stall. Notice that this latter formula gives an operational definition of the effective affinities that could in principle be reproduced in laboratory (just go out there and tune the tunable until everything stalls, and measure the difference). Obvsiously:

  • Stalling : Marginal currents vanish  if and only if effective affinities vanish:

\left\langle \Phi^t_\mu \right\rangle_{\mathcal{A}} \equiv 0, \forall \mu \iff \mathcal{A}_\mu \equiv 0, \forall \mu

Now, according to the inference scheme illustrated above, we can also prove that:

  •  Effective 2nd Law: The average marginal entropy production is not negative

\sum \mathcal{Q}_\mu \left\langle \Phi^t_\mu \right\rangle_{\mathcal{A}} \geq 0

  • S-FDR at stalling:

\left. \frac{\partial}{\partial \mathcal{A}_\mu}\left\langle \Phi^t_{\mu'} \right\rangle \right|_{\mathcal{A}^{\mathrm{stalling}}} + \left. \frac{\partial}{\partial \mathcal{A}_{\mu'}}\left\langle \Phi^t_{\mu} \right\rangle \right|_{\mathcal{A}^{\mathrm{stalling}}} = \left. \left\langle \Phi^t_{\mu} \Phi^t_{\mu'} \right\rangle \right|_{\mathcal{A}^{\mathrm{stalling}}}

Notice instead that the RR is gone at stalling. This is a clear-cut prediction of the theory that can be experimented with basically the same apparatuses with which response theory has been experimented so far (not that I actually know what these apparatuses are…): at stalling states, differing from equilibrium states, the S-FDR still holds, but the RR does not.

INTO THE BOX

You definitely got enough of it at this point, and you can give up here. Please
exit through the gift shop.

If you’re stubborn, let me tell you what’s inside the box. The system’s dynamics is modeled as a continuous-time, discrete configuration-space Markov “jump” process. The state space can be described by a graph G=(I, E) where I is the set of configurations, E is the set of possible transitions or “edges”, and there exists some incidence relation between edges and couples of configurations. The process is determined by the rates w_{i \gets j} of jumping from one configuration to another.

We choose these processes because they allow some nice network analysis and because the path integral is well defined! A single realization of such a process is a trajectory

\omega^t = (i_0,\tau_0) \to (i_1,\tau_1) \to \ldots \to (i_N,\tau_N)

A “Markovian jumper” waits at some configuration i_n for some time \tau_n with an exponentially decaying probability w_{i_n} \exp - w_{i_n} \tau_n with exit rate w_i = \sum_k w_{k \gets i}, then instantaneously jumps to a new configuration i_{n+1} with transition probability w_{i_{n+1} \gets {i_n}}/w_{i_n}. The overall probability density of a single trajectory is given by

P(\omega^t) = \delta \left(t - \sum_n \tau_n \right) e^{- w_{i_N}\tau_{i_N}} \prod_{n=0}^{N-1} w_{j_n \gets i_n} e^{- w_{i_n} \tau_{i_n}}

One can in principle obtain the p.d.f. of any observable defined along the trajectory by taking the marginal of this measure (though in most cases this is technically impossible). Where does this expression come from? For a formal derivation, see the very beautiful review paper by Weber and Frey, but be aware that this is what one would intuitively come up with if he had to simulate with the Gillespie algorithm.

The dynamics of the Markov process can also be described by the probability of being at some configuration i at time t, which evolves with the master equation

\dot{p}_i(t) = \sum_j \left[ w_{ij} p_j(t) - w_{ji} p_i(t) \right].

We call such probability the system’s state, and we assume that the system relaxes to a uniquely defined steady state p = \mathrm{lim}_{t \to \infty} p(t).

A time-integrated current along a single trajectory is a linear combination of the net number of jumps \#^t between configurations in the network:

\Phi^t_\alpha = \sum_{ij} C^{ij}_\alpha \left[ \#^t(i \gets j) - \#^t(j\gets i) \right]

The idea here is that one or several transitions within the system occur because of the “absorption” or the “emission” of some environmental degrees of freedom, each with different intensity. However, for the moment let us simplify the picture and require that only one transition contributes to a current, that is that there exist i_\alpha,j_\alpha such that

C^{ij}_\alpha = \delta^i_{i_\alpha} \delta^j_{j_\alpha}.

Now, what does it mean for such a set of currents to be “complete”? Here we get inspiration from Kirchhoff’s Current Law in electrical circuits: the continuity of the trajectory at each configuration of the network implies that after a sufficiently long time, cycle or loop or mesh currents completely describe the steady state. There is a standard procedure to identify a set of cycle currents: take a spanning tree T of the network; then the currents flowing along the edges E\setminus T left out from the spanning tree form a complete set.

The last ingredient you need to know are the affinities. They can be constructed as follows. Consider the Markov process on the network where the observable edges are removed G' = (I,T). Calculate the steady state of its associated master equation (p^{\mathrm{eq}}_i)_i, which is necessarily an equilibrium (since there cannot be cycle currents in a tree…). Then the affinities are given by

\mathcal{A}_\alpha = \log  w_{i_\alpha j_\alpha} p^{\mathrm{eq}}_{j_\alpha} / w_{j_\alpha i_\alpha} p^{\mathrm{eq}}_{i_\alpha}.

Now you have all that is needed to formulate the complete theory and prove the FR.

Homework: (Difficult!) With the above definitions, prove the FR.

How about the marginal theory? To define the effective affinities, take the set E_{\mathrm{mar}} = \{i_\mu j_\mu, \forall \mu\} of edges where there run observable currents. Notice that now its complement obtained by removing the observable edges, that we call the hidden edge set E_{\mathrm{hid}} = E \setminus E_{\mathrm{mar}}, is not in general a spanning tree: there might be cycles that are not accounted for by our observations. However, we can still consider the Markov process on the hidden space, and calculate its stalling steady state p^{\mathrm{st}}_i, and ta-taaa: The effective affinities are given by

\mathcal{Q}_\mu = \log w_{i_\mu j_\mu} p^{\mathrm{st}}_{j_\mu} / w_{j_\mu i_\mu} p^{\mathrm{st}}_{i_\mu}.

Proving the marginal IFR is far more complicated than the complete FR. In fact, very often in my field we will not work with the current’ probability density itself,  but we prefer to take its bidirectional Laplace transform and work with the currents’ cumulant generating function. There things take a quite different and more elegant look.

Many other questions and possibilities open up now. The most important one left open is: Can we generalize the theory the (physically relevant) case where the current is supported on several edges? For example, for a current defined like \Phi^t = 5 \Phi^t_{12} + 7 \Phi^t_{34}? Well, it depends: the theory holds provided that the stalling state is not “internally alive”, meaning that if the observable current vanishes on average, then also should \Phi^t_{12} and \Phi^t_{34} separately. This turns out to be a physically meaningful but quite strict condition.

IS ALL OF THERMODYNAMICS “EFFECTIVE”?

Let me conclude with some more of those philosophical considerations that sadly I have to leave out of papers…

Stochastic thermodynamics strongly depends on the identification of physical and information-theoretic entropies — something that I did not openly talk about, but that lurks behind the whole construction. Throughout my short experience as researcher I have been pursuing a program of “relativization” of thermodynamics, by making the role of the observer more and more evident and movable. Inspired by Einstein’s gedankenexperimenten, I also tried to make the theory operational. This program may raise eyebrows here and there: Many thermodynamicians embrace a naïve materialistic world-view whereby what only matters are “real” physical quantities like temperature, pressure, and all the rest of the information-theoretic discourse is at best mathematical speculation or a fascinating analog with no fundamental bearings.  According to some, information as a physical concept lingers alarmingly close to certain extreme postmodern claims in the social sciences that “reality” does not exist unless observed, a position deemed dangerous at times when the authoritativeness of science is threatened by all sorts of anti-scientific waves.

I think, on the contrary, that making concepts relative and effective and by summoning the observer explicitly is a laic and prudent position that serves as an antidote to radical subjectivity. The other way around, clinging to the objectivity of a preferred observer — which is implied in any materialistic interpretation of thermodynamics, e.g. by assuming that the most fundamental degrees of freedom are the positions and velocities of gas’s molecules — is the dangerous position, expecially when the role of such preferred observer is passed around from the scientist to the technician and eventually to the technocrat, who would be induced to believe there are simple technological fixes to complex social problems

How do we reconcile observer-dependency and the laws of physics? The object and the subject? On the one hand, much like the position of an object depends on the reference frame, so much so entropy and entropy production do depend on the observer and the particular apparatus that he controls or experiment he is involved with. On the other hand, much like motion is ultimately independent of position and it is agreed upon by all observers that share compatible measurement protocols, so much so the laws of thermodynamics are independent of that particular observer’s quantification of entropy and entropy production (e.g., the effective Second Law holds independently of how much the marginal observer knows of the system, if he operates according to our phenomenological protocol…). This is the case even in the every-day thermodynamics as practiced by energetic engineers et al., where there are lots of choices to gauge upon, and there is no other external warrant that the amount of dissipation being quantified is the “true” one (whatever that means…) — there can only be trust in one’s own good practices and methodology.

So in this sense, I like to think that all observers are marginal, that this effective theory  serves as a dictionary by which different observers practice and communicate thermodynamics, and that we should not revere the laws of thermodynamics as “true
idols,  but rather as tools of good scientific practice.

REFERENCES

  • M. Polettini and M. Esposito,  Effective fluctuation and response theory, arXiv:1803.03552

In this work we give the complete theory and numerous references to work of other people that was along the same lines. We employ a “spiral” approach to the presentation of the results, inspired by the pedagogical principle of Albert Baez.

  • M. Polettini and M. Esposito,  Effective thermodynamics for a marginal observer, Phys. Rev. Lett. 119, 240601 (2017), arXiv:1703.05715

This is a shorter version of the story.

  • B. Altaner, MP, and M. Esposito, Fluctuation-Dissipation Relations Far from Equilibrium, Phys. Rev. Lett. 117, 180601 (2016), arXiv:1604.0883

Early version of the story, containing the FDR results but not the full-fledged FR.

  • G. Bisker, M. Polettini, T. R. Gingrich and J. M. Horowitz, Hierarchical bounds on entropy production inferred from partial information, J. Stat. Mech. 093210 (2017), arXiv:1708.06769

Some extras.

  • M. F. Weber and E. Frey, Master equations and the theory of stochastic path integrals, Rep. Progr. Phys. 80, 046601 (2017).

Great reference if one wishes to learn about path integrals for master equation systems.

1 There are as many so-called “Fluctuation Theorems” as there are authors working on them, so I decided not to call them by any name. Furthermore, notice I prefer to distinguish between a relation (a formula) and a theorem (a line of reasoning). I lingered more on this here.

2

“Just so you know, nobody knows what energy is”. Richard Feynman.

I cannot help but mention here the beautiful book by Shapin and Schaffer Leviathan and the air-pump about the Boyle vs. Hobbes diatribe about what constitutes a  “matter of fact,” and Bruno Latour’s interpretation of it in We have never been modern. Latour argues that “modernity” is a process of separation of the human and natural spheres, and within each of these spheres a process of purification of the unit facts of knowledge and the unit facts of politics, of the object and the subject. At the same time we live in a world where these two spheres are never truly separated, a world of “hybrids” that are at the same time necessary “for all practical purposes” and unconceivable according to the myths that sustain the narration of science, of the State, and even of religion. In fact, despite these myths, we cannot conceive a scientific fact out of the contextual “network” where this fact is produced and replicated, and neither we can conceive society out of the material needs that shape it: so in  this sense “we have never been modern”, we are not quite different from all those societies that we take pleasure of studying with the tools of anthropology. Within the scientific community Latour is widely despised; probably he is also misread. While it is really difficult to see how his analysis applies to, say, high-energy physics, I find that thermodynamics and its ties to the industrial revolution perfectly embodies this tension between the natural and the artificial, the matter of fact and the matter of concern. Such great thinkers as Einstein and Ehrenfest thought of the Second Law as the only physical law that would never be replaced, and I believe this is revelatory. A second thought on the Second Law, a systematic and precise definition of all its terms and circumstances, reveals that the only formulations that make sense are those phenomenological statements such as Kelvin-Planck’s or similar, which require a lot of contingent definitions regarding the operation of the engine, while fetished and universal statements are nonsensical (such as that masterwork of confusion that is “the entropy of the Universe cannot decrease”). In this respect, it is neither a purely natural law — as the moderns argue, nor a purely social construct — as the postmodern argue. One simply has to renounce to operate this separation. While I do not have a definite answer on this problem, I like to think of the Second Law as a practice, a consistency check of the thermodynamic discourse.

3 This assumption really belongs to a time, the XIXth century, when resources were virtually infinite on planet Earth…

4 As we will see shortly, we define equilibrium as that state where there are no currents at the interface between the system and the environment, so what is the environment’s own definition of equilibrium?!

5 This because we already exploited First Law.

6 This nomenclature comes from alchemy, via chemistry (think of Goethe’s The elective affinities…), it propagated in the XXth century via De Donder and Prigogine, and eventually it is still present in language in Luxembourg because in some way we come from the “late Brussels school”.

7 Basically, we ask that the tunable parameters are environmental properties, such as temperatures, chemical potentials, etc. and not internal properties, such as the energy landscape or the activation barriers between configurations.

21 May 17:43

Baby's hand mummified by copper coin

by Minnesotastan
The remains are currently on display at Hungary’s Móra Ferenc Museum.

From inspecting the tiny skeleton, Dr. Balázs determined the deceased was either a stillbirth or premature baby that died shortly after birth. The researchers concluded the child was 11 to 13 inches and weighed only one or two pounds...

The team concluded that before the child was placed in the pot and buried, someone put the copper coin into its hand. Many cultures in antiquity have buried their dead with coins as a way to pay a mythical ferryman to take their souls into the afterlife.

In this case, the copper’s antimicrobial properties protected the child’s hand from decay. Along with the conditions inside the vessel, it helped mummify the baby’s grasp. The team thinks this child’s burial may be one of the first reported cases in the scientific literature of copper-driven mummification. 
The rest of the story is at The New York Times.
18 May 14:12

The Pentagon Can’t Account for $21 Trillion (That’s Not a Typo)

by Donnal Walter

By Lee Camp, May 14, 2018, TruthDig.

Then-Secretary of Defense Robert Gates during a 2008 visit to Kosovo with U.S. Army troops on foot patrol in the town of Gnjilane. (The U.S. Army / CC BY 2.0)

Twenty-one trillion dollars.

The Pentagon’s own numbers show that it can’t account for $21 trillion. Yes, I mean trillion with a “T.” And this could change everything.

But I’ll get back to that in a moment.

There are certain things the human mind is not meant to do. Our complex brains cannot view the world in infrared, cannot spell words backward during orgasm and cannot really grasp numbers over a few thousand. A few thousand, we can feel and conceptualize. We’ve all been in stadiums with several thousand people. We have an idea of what that looks like (and how sticky the floor gets).

But when we get into the millions, we lose it. It becomes a fog of nonsense. Visualizing it feels like trying to hug a memory. We may know what $1 million can buy (and we may want that thing), but you probably don’t know how tall a stack of a million $1 bills is. You probably don’t know how long it takes a minimum-wage employee to make $1 million.

That’s why trying to understand—truly understand—that the Pentagon spent 21 trillion unaccounted-for dollars between 1998 and 2015 washes over us like your mother telling you that your third cousin you met twice is getting divorced. It seems vaguely upsetting, but you forget about it 15 seconds later because … what else is there to do?

Twenty-one trillion.

But let’s get back to the beginning. A couple of years ago, Mark Skidmore, an economics professor, heard Catherine Austin Fitts, former assistant secretary in the Department of Housing and Urban Development, say that the Department of Defense Office of Inspector General had found $6.5 trillion worth of unaccounted-for spending in 2015. Skidmore, being an economics professor, thought something like, “She means $6.5 billion. Not trillion. Because trillion would mean the Pentagon couldn’t account for more money than the gross domestic product of the whole United Kingdom. But still, $6.5 billion of unaccounted-for money is a crazy amount.”

So he went and looked at the inspector general’s report, and he found something interesting: It was trillion! It was fucking $6.5 trillion in 2015 of unaccounted-for spending! And I’m sorry for the cursing, but the word “trillion” is legally obligated to be prefaced with “fucking.” It is indeed way more than the U.K.’s GDP.

Skidmore did a little more digging. As Forbes reported in December 2017, “[He] and Catherine Austin Fitts … conducted a search of government websites and found similar reports dating back to 1998. While the documents are incomplete, original government sources indicate $21 trillion in unsupported adjustments have been reported for the Department of Defense and the Department of Housing and Urban Development for the years 1998-2015.”

Let’s stop and take a second to conceive how much $21 trillion is (which you can’t because our brains short-circuit, but we’ll try anyway).

1. The amount of money supposedly in the stock market is $30 trillion.

2. The GDP of the United States is $18.6 trillion.

3. Picture a stack of money. Now imagine that that stack of dollars is all $1,000 bills. Each bill says “$1,000” on it. How high do you imagine that stack of dollars would be if it were $1 trillion. It would be 63 miles high.

4. Imagine you make $40,000 a year. How long would it take you to make $1 trillion? Well, don’t sign up for this task, because it would take you 25 million years (which sounds like a long time, but I hear that the last 10 million really fly by because you already know your way around the office, where the coffee machine is, etc.).

The human brain is not meant to think about a trillion dollars.

And it’s definitely not meant to think about the $21 trillion our Department of Defense can’t account for. These numbers sound bananas. They sound like something Alex Jones found tattooed on his backside by extraterrestrials.

But the 21 trillion number comes from the Department of Defense Office of Inspector General—the OIG. Although, as Forbes pointed out, “after Mark Skidmore began inquiring about OIG-reported unsubstantiated adjustments, the OIG’s webpage, which documented, albeit in a highly incomplete manner, these unsupported “accounting adjustments,” was mysteriously taken down.”

Luckily, people had already grabbed copies of the report, which—for now—you can view here.

Here’s something else important from that Forbes article—which is one of the only mainstream media articles you can find on the largest theft in American history:

Given that the entire Army budget in fiscal year 2015 was $120 billion, unsupported adjustments were 54 times the level of spending authorized by Congress.

That’s right. The expenses with no explanation were 54 times the actual budget allotted by Congress. Well, it’s good to see Congress is doing 1/54th of its job of overseeing military spending (that’s actually more than I thought Congress was doing). This would seem to mean that 98 percent of every dollar spent by the Army in 2015 was unconstitutional.

So, pray tell, what did the OIG say caused all this unaccounted-for spending that makes Jeff Bezos’ net worth look like that of a guy jingling a tin can on the street corner?

“[The July 2016 inspector general] report indicates that unsupported adjustments are the result of the Defense Department’s ‘failure to correct system deficiencies.’

They blame trillions of dollars of mysterious spending on a “failure to correct system deficiencies”? That’s like me saying I had sex with 100,000 wild hairless aardvarks because I wasn’t looking where I was walking.

Twenty-one trillion.

Say it slowly to yourself.

At the end of the day, there are no justifiable explanations for this amount of unaccounted-for, unconstitutional spending. Right now, the Pentagon is being audited for the first time ever, and it’s taking 2,400 auditors to do it. I’m not holding my breath that they’ll actually be allowed to get to the bottom of this.

But if the American people truly understood this number, it would change both the country and the world. It means that the dollar is sprinting down a path toward worthless. If the Pentagon is hiding spending that dwarfs the amount of tax dollars coming in to the federal government, then it’s clear the government is printing however much it wants and thinking there are no consequences. Once these trillions are considered, our fiat currency has even less meaning than it already does, and it’s only a matter of time before inflation runs wild.

It also means that any time our government says it “doesn’t have money” for a project, it’s laughable. It can clearly “create” as much as it wants for bombing and death. This would explain how Donald Trump’s military can drop well over 100 bombs a day that cost well north of $1 million each.

So why can’t our government also “create” endless money for health care, education, the homeless, veterans benefits and the elderly, to make all parking free and to pay the Rolling Stones to play stoop-front shows in my neighborhood? (I’m sure the Rolling Stones are expensive, but surely a trillion dollars could cover a couple of songs.)

Obviously, our government could do those things, but it chooses not to. Earlier this month, Louisiana sent eviction notices to 30,000 elderly people on Medicaid to kick them out of their nursing homes. Yes, a country that can vomit trillions of dollars down a black hole marked “Military” can’t find the money to take care of our poor elderly. It’s a repulsive joke.

Twenty-one trillion.

Former Secretary of Defense Robert Gates spoke about how no one knows where the money is flying in the Pentagon. In a barely reported speech in 2011, he said, “My staff and I learned that it was nearly impossible to get accurate information and answers to questions such as, ‘How much money did you spend?’ and ‘How many people do you have?’

They can’t even find out how many people work for a specific department?

Note for anyone looking for a job: Just show up at the Pentagon and tell them you work there. It doesn’t seem like they’d have much luck proving you don’t.

For more on this story, check out David DeGraw’s excellent reporting at ChangeMaker.media, because the mainstream corporate media are mouthpieces for the weapons industry. They are friends with benefits of the military-industrial complex. I have seen basically nothing from the mainstream corporate media concerning this mysterious $21 trillion. I missed the time when CNN’s Wolf Blitzer said that the money we dump into war and death—either the accounted-for money or the secretive trillions—could end world hunger and poverty many times over. There’s no reason anybody needs to be starving or hungry or unsheltered on this planet, but our government seems hellbent on proving that it stands for nothing but profiting off death and misery. And our media desperately want to show they stand for nothing but propping up our morally bankrupt empire.

When the media aren’t actively promoting war, they’re filling the airwaves with shit, so the entire country can’t even hear itself think. Our whole mindscape is filled to the brim with nonsense and vacant celebrity idiocy. Then, while no one is looking, the largest theft humankind has ever seen is going on behind our backs—covered up under the guise of “national security.”

Twenty-one trillion.

Don’t forget.

If you think this column is important, please share it. And check out Lee Camp’s weekly TV show, “Redacted Tonight.”

Truthdig has launched a reader-funded project—its first ever—to document the Poor People’s Campaign. Please help us by making a donation.

The post The Pentagon Can’t Account for $21 Trillion (That’s Not a Typo) appeared first on World Beyond War . . ..

15 May 20:07

What is computational neuroscience? (XXX) Is the brain a computer?

by romain

It is sometimes stated as an obvious fact that the brain carries out computations. Computational neuroscientists sometimes see themselves as looking for the algorithms of the brain. Is it true that the brain implements algorithms? My point here is not to answer this question, but rather to show that the answer is not self-evident, and that it can only be true (if at all) at a fairly abstract level.

One line of argumentation is that models of the brain that we find in computational neuroscience (neural network models) are algorithmic in nature, since we simulate them on computers. And wouldn’t it be a sort of vitalistic claim that neural networks cannot be (in principle) simulated on computer?

There is an important confusion in this argument. At a low level, neural networks are modelled biophysically as dynamical systems, in which the temporality corresponds to the actual temporality of the real world (as opposed to the discrete temporality of algorithms). Mathematically, those are typically differential equations, possibly hybrid systems (i.e. coupled by timed pulses), in which time is a continuous variable. Those models can of course be simulated on computer using discretization schemes. For example, we choose a time step and compute the state of the network at time t+dt, from the state at time t. This algorithm, however, implements a simulation of the model; it is not the model that implements the algorithm. The discretization is nowhere to be found in the model. The model itself, being a continuous time dynamical system, is not algorithmic in nature. It is not described as a discrete sequence of operations; it is only the simulation of the model that is algorithmic, and different algorithms can simulate the same model.

If we put this confusion aside, then the claim that neural networks implement algorithms becomes not that obvious. It means that trajectories of the dynamical system can be mapped to the discrete flow of an algorithm. This requires: 1) to identify states with representations of some variables (for example stimulus properties, symbols); 2) to identify trajectories from one state to another as specific operations. In addition to that, for the algorithmic view to be of any use, there should be a sequence of operations, not just one operation (ie, describing the output as a function of the input is not an algorithmic description).

A key difficulty in this identification is temporality: the state of the dynamical system changes continuously, so how can this be mapped to discrete operations? A typical approach is neuroscience is to consider not states but properties of trajectories. For example, one would consider the average firing rate in a population of neurons in a given time window, and the rate of another population in another time window. The relation between these two rates in the context of an experiment would define an operation. As stated above, a sequence of such relations should be identified in order to qualify as an algorithm. But this mapping seems only possible within a feedforward flow; coupling poses a greater challenge for an algorithmic description. No known nervous system, however, has a feedforward connectome.

I am not claiming here that the function of the brain (or mind) cannot possibly be described algorithmically. Probably some of it can be. My point is rather that a dynamical system is not generically algorithmic. A control system, for example, is typically not algorithmic (see the detailed example of Tim van Gelder, What might cognition be if not computation?). Thus a neural dynamical system can only be seen as an algorithm at a fairly abstract level, which can probably address only a restricted subset of its function. It could be that control, which also attaches function to dynamical systems, is a more adequate metaphor of brain function than computation. Is the brain a computer? Given the rather narrow application of the algorithmic view, the reasonable answer should be: quite clearly not (maybe part of cognition could be seen as computation, but not brain function generally).

15 May 01:22

Protein synthesis in brain tissue is much higher than previously thought.

by mdbownds@wisc.edu (Deric Bownds)
Smeets et al. use stable isotope methodology during temporal lobe resection surgery to demonstrate protein synthesis rates exceeding 3% per day, suggesting that brain tissue plasticity is far greater than previously assumed.
All tissues undergo continuous reconditioning via the complex orchestration of changes in tissue protein synthesis and breakdown rates. Skeletal muscle tissue has been well studied in this regard, and has been shown to turnover at a rate of 1–2% per day in vivo in humans. Few data are available on protein synthesis rates of other tissues. Because of obvious limitations with regard to brain tissue sampling no study has ever measured brain protein synthesis rates in vivo in humans. Here, we applied stable isotope methodology to directly assess protein synthesis rates in neocortex and hippocampus tissue of six patients undergoing temporal lobectomy for drug-resistant temporal lobe epilepsy (Clinical trial registration: NTR5147). Protein synthesis rates of neocortex and hippocampus tissue averaged 0.17 ± 0.01 and 0.13 ± 0.01%/h, respectively. Brain tissue protein synthesis rates were 3–4-fold higher than skeletal muscle tissue protein synthesis rates (0.05 ± 0.01%/h; P < 0.001). In conclusion, the protein turnover rate of the human brain is much higher than previously assumed.
09 May 05:41

“We continuously increased the number of animals until statistical significance was reached to support our conclusions” . . . I think this is not so bad, actually!

by Andrew

Jordan Anaya pointed me to this post, in which Casper Albers shared this snippet from a recently-published paper from an article in Nature Communications:

The subsequent twitter discussion is all about “false discovery rate” and statistical significance, which I think completely misses the point.

The problems

Before I get to why I think the quoted statement is not so bad, let me review various things that these researchers seem to be doing wrong:

1. “Until statistical significance was reached”: This is a mistake. Statistical significance does not make sense as an inferential or decision rule.

2. “To support our conclusions”: This is a mistake. The point of an experiment should be to learn, not to support a conclusion. Or, to put it another way, if they want support for their conclusion, that’s fine, but that has nothing to do with statistical significance.

3. “Based on [a preliminary data set] we predicted that about 20 unites are sufficient to statistically support our conclusions”: This is a mistake. The purpose of a pilot study is to demonstrate the feasibility of an experiment, not to estimate the treatment effect.

OK, so, yes, based on the evidence of the above snippet, I think this paper has serious problems.

Sequential data collection is ok

That all said, I don’t have a problem, in principle, with the general strategy of continuing data collection until the data look good.

I’ve thought a lot about this one. Let me try to explain here.

First, the Bayesian argument, discussed for example in chapter 8 of BDA3 (chapter 7 in earlier editions). As long as your model includes the factors that predict data inclusion are also included in the model, you should be ok. In this case, the relevant variable is time: If there’s any possibility of time trends in your underlying process, you want to allow for that in your model. A sequential design can yield a dataset that is less robust to model assumptions, and a sequential design changes how you’ll do model checking (see chapter 6 of BDA), but from a Bayesian standpoint, you can handle these issues. Gathering data until they look good is not, from a Bayesian perspective, a “questionable research practice.”

Next, the frequentist argument, which can be summarized as, “What sorts of things might happen (more formally, what is the probability distribution of your results) if you as a researcher follow a sequential data collection rule?

Here’s what will happen. If you collect data until you attain statistical significance, then you will attain statistical significance, unless you have to give up first because you run out of time or resources. But . . . so what? Statistical significance by itself doesn’t tell you anything at all. For one thing, your result might be statistically significant in the unexpected direction, so it won’t actually confirm your scientific hypothesis. For another thing, we already know the null hypothesis of zero effect and zero systematic error is false, so we know that with enough data you’ll find significance.

Now, suppose you run your experiment a really long time and you end up with an estimated effect size of 0.002 with a standard error of 0.001 (on some scale in which an effect of 0.1 is reasonably large). Then (a) you’d have to say whatever you’ve discovered is trivial, (b) it could easily be explained by some sort of measurement bias that’s crept into the experiment, and (c) in any case, if it’s 0.002 on this group of people, it could well be -0.001 or -0.003 on another group. So in that case you’ve learned nothing useful, except that the effect almost certainly isn’t large—and that thing you’ve learned has nothing to do with the statistical significance you’ve obtained.

Or, suppose you run an experiment a short time (which seems to be what happened here) and get an estimate of 0.4 with a standard error of 0.2. Big news, right! No. Enter the statistical significance filter and type M errors (see for example section 2.1 here). That’s a concern. But, again, it has nothing to do with sequential data collection. The problem would still be there with a fixed sample size (as we’ve seen in zillions of published papers).

Summary

Based on the snippet we’ve seen, there are lots of reasons to be skeptical of the paper under discussion. But I think the criticism based on sequential data collection misses the point. Yes, sequential data collection gives the researchers one more forking path. But I think the proposal to correct for this with some sort of type 1 or false discovery adjustment rule is essentially impossible and would be pointless even if it could be done, as such corrections are all about the uninteresting null hypothesis of zero effect and zero systematic error. Better to just report and analyze the data and go from there—and recognize that, in a world of noise, you need some combination of good theory and good measurement. Statistical significance isn’t gonna save your ass, no matter how it’s computed.

P.S. Clicking through, I found this amusing article by Casper Albers, “Valid Reasons not to participate in open science practices.” As they say on the internet: Read the whole thing.

P.P.S. Next open slot is 6 Nov but I thought I’d post this right away since the discussion is happening online right now.

The post “We continuously increased the number of animals until statistical significance was reached to support our conclusions” . . . I think this is not so bad, actually! appeared first on Statistical Modeling, Causal Inference, and Social Science.

08 May 17:25

So Basically Murder

by noreply@blogger.com (Atrios)
I've long said I don't think "safety" is really the concern about self-driving cars in that if they work in a useful way they'll be safe, but that view didn't address the "actually they don't work but they're on the streets anyway" issue.


Uber has concluded the likely reason why one of its self-driving cars fatally struck a pedestrian earlier this year, according to tech outlet The Information. The car’s software recognized the victim, Elaine Herzberg, standing in the middle of the road, but decided it didn’t need to react right away, the outlet reported, citing two unnamed people briefed on the matter.

This isn't some trolley problem wank, this is just what happens when your concept doesn't work. You have to dial down the safety provisions because otherwise your dumb car is going to be bad. I mean bad in the sense of not being very useful. Killing people is, also, too, bad.
07 May 20:05

Herman-Kluk propagator is free from zero-point energy leakage. (arXiv:1805.01686v1 [physics.chem-ph])

by Max Buchholz, Erika Fallacara, Fabian Gottwald, Michele Ceotto, Frank Grossmann, Sergei D. Ivanov

Semiclassical techniques constitute a promising route to approximate quantum dynamics based on classical trajectories starting from a quantum-mechanically correct distribution. One of their main drawbacks is the so-called zero-point energy (ZPE) leakage, that is artificial redistribution of energy from the modes with high frequency and thus high ZPE to that with low frequency and ZPE due to classical equipartition. Here, we show that an elaborate semiclassical formalism based on the Herman-Kluk propagator is free from the ZPE leakage despite utilizing purely classical propagation. This finding opens the road to correct dynamical simulations of systems with a multitude of degrees of freedom that cannot be treated fully quantum-mechanically due to the exponential increase of the numerical effort.

04 May 22:45

A quick rule of thumb is that when someone seems to be acting like a jerk, an economist will defend the behavior as being the essence of morality, but when someone seems to be doing something nice, an economist will raise the bar and argue that he’s not being nice at all.

by Andrew

Like Pee Wee Herman, act like a jerk
And get on the dance floor let your body work

I wanted to follow up on a remark from a few years ago about the two modes of pop-economics reasoning:

You take some fact (or stylized fact) about the world, and then you either (1) use people-are-rational-and-who-are-we-to-judge-others reasoning to explain why some weird-looking behavior is in fact rational, or (2) use technocratic reasoning to argue that some seemingly reasonable behavior is, in fact, inefficient.

The context, as reported by Felix Salmon, was a Chicago restaurant whose owner, Grant Achatz, was selling tickets “at a fixed price and are then free to be resold at an enormous markup on the secondary market.” Economists Justin Wolfers and Betsey Stevenson objected. They wanted Achatz to increase his prices. By keeping prices low, he was, apparently, violating the principles of democracy: “‘It’s democratic in theory, but not in practice,’ said Wolfers . . . Bloomberg’s Mark Whitehouse concludes that Next should ‘consider selling tickets to the highest bidder and giving the extra money to charity.'”

I summarized as follows:

In this case, Wolfers and Whitehouse are going through some contortions to argue (2). In a different mood, however, they might go for (1). I don’t fully understand the rules for when people go with argument 1 and when they go with 2, but a quick rule of thumb is that when someone seems to be acting like a jerk, an economist will defend the behavior as being the essence of morality, but when someone seems to be doing something nice, an economist will raise the bar and argue that he’s not being nice at all.

I’m guessing that if Grant Achatz were to implement the very same pricing policy but talk about how he’s doing it solely out of greed, that a bunch of economists would show up and explain how this was actually the most moral and democratic option.

In comments, Alex wrote:

(1) and (2) are typically distinguished in economics textbooks as examples of positive and normative reasoning, respectively. The former aims at describing the observed behavior in terms of a specific model (e.g. rationality), seemingly without any attempt at subjective judgement. The latter takes the former as given and applies a subjective social welfare function to the outcomes in order to judge, whether the result could be improved upon with, say, different institutional arrangement or a policy intervention.

To which I replied:

Yup, and the usual rule seems to be to use positive reasoning when someone seems to be acting like a jerk, and normative reasoning when someone seems to be doing something nice. This seems odd to me. Why assume that, just because someone is acting like a jerk, that he is acting so efficiently that his decisions can’t be improved, only understood? And why assume that, just because someone seems to be doing something nice, that “unintended consequences” etc. ensure he’s not doing a good job of it. To me, this is contrarianism run wild. I’m not saying that Wolfers is a knee-jerk contrarian; rather I’m guessing that he’s following default behaviors without thinking much about it.

This is an awkward topic to write about. I’m not saying I think economists are mean people; they just seem to have a default mode of thought which is a little perverse.

In the traditional view of Freudian psychiatrists, which no behavior can be taken at face value, and it takes a Freudian analyst to decode the true meaning. Similarly, in the world of pop economics, or neoclassical economics, any behavior that might seem good, or generous (for example, not maxing out your prices at a popular restaurant) is seen to be damaging of the public good—“unintended consequences” and all that—, while any behavior that might seem mean, or selfish, is actually for the greater good.

Let’s unpack this in five directions, from the perspective of the philosophy of science, the sociology of scientific professions, politics, the logic of rhetoric, and the logic of statistics.

From the standpoint of the philosophy of science, pop economics or neoclassical economics is, like Freudian theory, unfalsifiable. Any behavior can be explained as rational (motivating economists’ mode 1 above) or as being open to improvement (motivating economists’ mode 2 of reasoning). Economists can play two roles: (1) to reassure people that the current practices are just fine and to use economic theory to explain the hidden benefits arising from seemingly irrational or unkind decisions; or (2) to improve people’s lives through rational and cold but effective reasoning (the famous “thinking like an economist”). For flexible Freudians, just about any behavior can be explained by just about any childhood trauma; and for modern economists, just about any behavior can be interpreted as a rational adaptation—or not. In either case, specific applications of the method can be falsified—after all, Freudians and neoclassical economists alike are free to make empirically testable predictions—but the larger edifice is unfalsifiable, as any erroneous prediction can simply be explained as an inappropriate application of the theory.

From a sociological perspective, the flexibility of pop-economics reasoning, like the flexibility of Freudian theory, can be seen as a plus, in that it implies a need for trained specialists, priests who can know which childhood trauma to use as an explanation, or who can decide whether to use economics’s explanation 1 or 2. Again, recall economists’ claims that they think in a different, more piercing, way than other scholars, an attitude that is reminiscent of old-school Freudians’ claim to look squarely at the cold truths of human nature that others can’t handle.

The political angle is more challenging. Neoclassical economics is sometimes labeled as conservative, in that explanation 1 (the everything-is-really-ok story) can be used to justify existing social and economic structures; on the other hand, such arguments can also be used to justify existing structures with support on the left. And, for that matter, economist Justin Wolfers, quoted above, is I believe a political liberal in the U.S. context. So it’s hard for me to put this discussion on the left or the right; maybe best just to say that pop-econ reasoning is flexible enough to go in either political direction, or even both at once.

When it comes to analyzing the logic of economic reasoning, I keep thinking about Albert Hirschman’s book, The Rhetoric of Reaction. I feel that the ability to bounce back and forth between arguments 1 and 2 is part of what gives pop economics, or microeconomics more generally, some of its liveliness and power. If you only apply argument 1—explaining away all of human behavior, however ridiculous, as rational and desirable, then you’re kinda talking yourself out of a job: as an economist, you become a mere explainer, not a problem solver. On the other hand, if you only apply argument 2—studying how to approach optimal behavior in situation after situation—then you become a mere technician. By having the flexibility of which argument to use in any given setting, you can be unpredictable. Unpredictability is a source of power and can also make you more interesting.

Finally, I can give a statistical rationale for the rule of thumb given in the title of this post. It’s Bayesian reasoning; that is, partial pooling. If you look at the population distribution of all the things that people do, some of these actions have positive effects, some have negative effects, and most effects are small. So if you receive a noisy signal that someone did something positive, the appropriate response is to partially pool toward zero and to think of reasons why this apparently good deed was, on net, not so wonderful at all. Conversely, when you hear about something that sounds bad, you can partially pool toward zero from the other direction.

Just look at the crowd. Say, “I meant to do that.”

The post A quick rule of thumb is that when someone seems to be acting like a jerk, an economist will defend the behavior as being the essence of morality, but when someone seems to be doing something nice, an economist will raise the bar and argue that he’s not being nice at all. appeared first on Statistical Modeling, Causal Inference, and Social Science.

02 May 19:42

On Unlimited Sampling

by Igor


Ayush Bhandari just let me know about the interesting approach of Unlimited Sampling in an email exchange:

...In practice, ADCs clip or saturate whenever the amplitude of signal x exceeds ADC threshold L. Typical solution is to de-clip the signal for which purpose various methods have been proposed.

Based on a new ADC hardware which allows for sampling using the principle 
y = mod(x,L)

where x is bandlimited and L is the ADC threshold, we show that Nyquist rate about \pi e (~10) times faster guarantees recovery of x from y. For this purpose we outline a new, stable recovery procedure.

Paper and slides are here.

There is also the PhysOrg coverage. Thanks Ayush ! Here is the paper:


Shannon's sampling theorem provides a link between the continuous and the discrete realms stating that bandlimited signals are uniquely determined by its values on a discrete set. This theorem is realized in practice using so called analog--to--digital converters (ADCs). Unlike Shannon's sampling theorem, the ADCs are limited in dynamic range. Whenever a signal exceeds some preset threshold, the ADC saturates, resulting in aliasing due to clipping. The goal of this work is to analyze an alternative approach that does not suffer from these problems. Our work is based on recent developments in ADC design, which allow for ADCs that reset rather than to saturate, thus producing modulo samples. An open problem that remains is: Given such modulo samples of a bandlimited function as well as the dynamic range of the ADC, how can the original signal be recovered and what are the sufficient conditions that guarantee perfect recovery? In this work, we prove such sufficiency conditions and complement them with a stable recovery algorithm. Our results are not limited to certain amplitude ranges, in fact even the same circuit architecture allows for the recovery of arbitrary large amplitudes as long as some estimate of the signal norm is available when recovering. Numerical experiments that corroborate our theory indeed show that it is possible to perfectly recover function that takes values that are orders of magnitude higher than the ADC's threshold.

h/t Laurent.
30 Apr 21:48

Classification of Phase Transitions by Microcanonical Inflection-Point Analysis

by Kai Qi and Michael Bachmann

Author(s): Kai Qi and Michael Bachmann

A statistical analysis method allows for the unambiguous identification of a phase transition via inflection points of the system’s microcanonical entropy and its derivatives.


[Phys. Rev. Lett. 120, 180601] Published Mon Apr 30, 2018

30 Apr 19:12

Get Your War On

by noreply@blogger.com (Atrios)
We all lived through that hell decade of the aughts. We weirdly managed to have some fun! When the history of this era is written, it probably will not include the most important artistic contributions of that time, because that's the way the world works.

David Rees captured that moment - in the moment - in a way that nobody else did.

While we are on rerun Sunday, take a quick look.



I don't know everything about David - we almost met a couple of times - but I remember him fundraising and donating to landmine removal in Afghanistan. There is an obvious point about where the money is.
30 Apr 13:51

Periodic spiking by a pair of ionic channels. (arXiv:1804.05786v1 [physics.bio-ph])

by Laureano Ramírez-Piscina, José M. Sancho

Neuronal cells present periodic trains of localized voltage spikes involving a large amount of different ionic channels. A relevant question is whether this is a cooperative effect or it could also be an intrinsic property of individual channels. Here we use a Langevin formulation for the stochastic dynamics of a pair of Na and K ionic channels. These two channels are simple gated pore models where a minimum set of degrees of freedom follow standard statistical physics. The whole system is totally autonomous without any external energy input, except for the chemical energy of the different ionic concentrations across the membrane. As a result it is shown that a unique pair of different ionic channels can sustain membrane potential periodic spikes. The spikes are due to the interaction between the membrane potential, the ionic flows and the dynamics of the internal parts (gates) of each channel structures. The spike involves a series of dynamical steps being the more relevant one the leak of Na ions. Missing spike events are caused by the altered functioning of specific model parts. The time dependent spike structure is comparable with experimental data.

27 Apr 22:36

Review of Bryan Caplan’s The Case Against Education

by Scott

If ever a book existed that I’d judge harshly by its cover—and for which nothing inside could possibly make me reverse my harsh judgment—Bryan Caplan’s The Case Against Education would seem like it.  The title is not a gimmick; the book’s argument is exactly what it says on the tin.  Caplan—an economist at George Mason University, home of perhaps the most notoriously libertarian economics department on the planet—holds that most of the benefit of education to students (he estimates around 80%, but certainly more than half) is about signalling the students’ preexisting abilities, rather than teaching or improving the students in any way.  He includes the entire educational spectrum in his indictment, from elementary school all the way through college and graduate programs.  He does have a soft spot for education that can be shown empirically to improve worker productivity, such as technical and vocational training and apprenticeships.  In other words, precisely the kind of education that many readers of this blog may have spent their lives trying to avoid.

I’ve spent almost my whole conscious existence in academia, as a student and postdoc and then as a computer science professor.  CS is spared the full wrath that Caplan unleashes on majors like English and history: it does, after all, impart some undeniable real-world skills.  Alas, I’m not one of the CS professors who teaches anything obviously useful, like how to code or manage a project.  When I teach undergrads headed for industry, my only role is to help them understand concepts that they probably won’t need in their day jobs, such as which problems are impossible or intractable for today’s computers; among those, which might be efficiently solved by quantum computers decades in the future; and which parts of our understanding of all this can be mathematically proven.

Granted, my teaching evaluations have been [clears throat] consistently excellent.  And the courses I teach aren’t major requirements, so the students come—presumably?—because they actually want to know the stuff.  And my former students who went into industry have emailed me, or cornered me, to tell me how much my courses helped them with their careers.  OK, but how?  Often, it’s something about my class having helped them land their dream job, by impressing the recruiters with their depth of theoretical understanding.  As we’ll see, this is an “application” that would make Caplan smile knowingly.

If Caplan were to get his way, the world I love would be decimated.  Indeed, Caplan muses toward the end of the book that the world he loves would be decimated too: in a world where educational investment no longer exceeded what was economically rational, he might no longer get to sit around with other economics professors discussing what he finds interesting.  But he consoles himself with the thought that decisionmakers won’t listen to him anyway, so it won’t happen.

It’s tempting to reply to Caplan: “now now, your pessimism about anybody heeding your message seems unwarranted.  Have anti-intellectual zealots not just taken control of the United States, with an explicit platform of sticking it to the educated elites, and restoring the primacy of lower-education jobs like coal mining, no matter the long-term costs to the economy or the planet?  So cheer up, they might listen to you!”

Indeed, given the current stakes, one might simply say: Caplan has set himself against the values that are the incredibly fragile preconditions for all academic debate—even, ironically, debate about the value of academia, like the one we’re now having.  So if we want such debate to continue, then we have no choice but to treat Caplan as an enemy, and frame the discussion around how best to frustrate his goals.

In response to an excerpt of Caplan’s book in The Atlantic, my friend Sean Carroll tweeted:

It makes me deeply sad that a tenured university professor could write something like this about higher education.  There is more to learning than the labor market.

Why should anyone with my basic values, or Sean’s, give Caplan’s thesis any further consideration?  As far as I can tell, there are only two reasons: (1) common sense, and (2) the data.

In his book, Caplan presents dozens of tables and graphs, but he also repeatedly asks his readers to consult their own memories—exploiting the fact that we all have firsthand experience of school.  He asks: if education is about improving students’ “human capital,” then why are students so thrilled when class gets cancelled for a snowstorm?  Why aren’t students upset to be cheated out of some of the career-enhancing training that they, or their parents, are paying so much for?  Why, more generally, do most students do everything in their power—in some cases, outright cheating—to minimize the work they put in for the grade they receive?  Is there any product besides higher education, Caplan asks, that people pay hundreds of thousands of dollars for, and then try to consume as little of as they can get away with?  Also, why don’t more students save hundreds of thousands of dollars by simply showing up at a university and sitting in on classes without paying—something that universities make zero effort to stop?  (Many professors would be flattered, and would even add you to the course email list, entertain your questions, and give you access to the assignments—though they wouldn’t grade your assignments.)

And: if the value of education comes from what it teaches you, how do we explain the fact that students forget almost everything so soon after the final exam, as attested by both experience and the data?  Why are employers satisfied with a years-ago degree; why don’t they test applicants to see how much understanding they’ve retained?

Or if education isn’t about any of the specific facts being imparted, but about “learning how to learn” or “learning how to think creatively”—then how is it that studies find academic coursework has so little effect on students’ general learning and reasoning abilities either?  That, when there is an improvement in reasoning ability, it’s tightly concentrated on the subject matter of the course, and even then it quickly fades away after the course is over?

More broadly, if the value of mass education derives from making people more educated, how do we explain the fact that high-school and college graduates, most of them, remain so abysmally ignorant?  After 12-16 years in something called “school,” large percentages of Americans still don’t know that the earth orbits the sun; believe that heavier objects fall faster than lighter ones and that only genetically modified organisms contain genes; and can’t locate the US or China on a map.  Are we really to believe, asks Caplan, that these apparent dunces have nevertheless become “deeper thinkers” by virtue of their schooling, in some holistic, impossible-to-measure way?  Or that they would’ve been even more ignorant without school?  But how much more ignorant can you be?  They could be illiterate, yes: Caplan grants the utility of teaching reading, writing, and arithmetic.  But how much beyond the three R’s (if those) do typical students retain, let alone use?

Caplan also poses the usual questions: if you’re not a scientist, engineer, or academic (or even if you are), how much of your undergraduate education do you use in your day job?  How well did the course content match what, in retrospect, you feel someone starting your job really needs to know?  Could your professors do your job?  If not, then how were they able to teach you to do it better?

Caplan acknowledges the existence of inspiring teachers who transform their students’ lives, in ways that need not be reflected in their paychecks: he mentions Robin Williams’ character in The Dead Poets’ Society.  But he asks: how many such teachers did you have?  If the Robin Williamses are vastly outnumbered by the drudges, then wouldn’t it make more sense for students to stream the former directly into their homes via the Internet—as they can now do for free?

OK, but if school teaches so little, then how do we explain the fact that, at least for those students who are actually able to complete good degrees, research confirms that (on average) having gone to school really does pay, exactly as advertised?  Employers do pay more for a college graduate—yes, even an English or art history major—than for a dropout.  More generally, starting salary rises monotonically with level of education completed.  Employers aren’t known for a self-sacrificing eagerness to overpay.  Are they systematically mistaken about the value of school?

Synthesizing decades of work by other economists, Caplan defends the view that the main economic function of school is to give students a way to signal their preexisting qualities, ones that correlate with being competent workers in a modern economy.  I.e., that school is tacitly a huge system for winnowing and certifying young people, which also fulfills various subsidiary functions, like keeping said young people off the street, socializing them, maybe occasionally even teaching them something.  Caplan holds that, judged as a certification system, school actually works—well enough to justify graduates’ higher starting salaries, without needing to postulate any altruistic conspiracy on the part of employers.

For Caplan, a smoking gun for the signaling theory is the huge salary premium of an actual degree, compared to the relatively tiny premium for each additional year of schooling other than the degree year—even when we hold everything else constant, like the students’ academic performance.  In Caplan’s view, this “sheepskin effect” even lets us quantitatively estimate how much of the salary premium on education reflects actual student learning, as opposed to the students signaling their suitability to be hired in a socially approved way (namely, with a diploma or “sheepskin”).

Caplan knows that the signaling story raises an immediate problem: namely, if employers just want the most capable workers, then knowing everything above, why don’t they eagerly recruit teenagers who score highly on the SAT or IQ tests?  (Or why don’t they make job offers to high-school seniors with Harvard acceptance letters, skipping the part where the seniors have to actually go to Harvard?)

Some people think the answer is that employers fear getting sued: in the 1971 Griggs vs. Duke Power case, the US Supreme Court placed restrictions on the use of intelligence tests in hiring, because of disparate impact on minorities.  Caplan, however, rejects this explanation, pointing out that it would be child’s-play for employers to design interview processes that functioned as proxy IQ tests, were that what the employers wanted.

Caplan’s theory is instead that employers don’t value only intelligence.  Instead, they care about the conjunction of intelligence with two other traits: conscientiousness and conformity.  They want smart workers who will also show up on time, reliably turn in the work they’re supposed to, and jump through whatever hoops authorities put in front of them.  The main purpose of school, over and above certifying intelligence, is to serve as a hugely costly and time-consuming—and therefore reliable—signal that the graduates are indeed conscientious conformists.  The sheer game-theoretic wastefulness of the whole enterprise rivals the peacock’s tail or the bowerbird’s ornate bower.

But if true, this raises yet another question.  In the signaling story, graduating students (and their parents) are happy that the students’ degrees land them good jobs.  Employers are happy that the education system supplies them with valuable workers, pre-screened for intelligence, conscientiousness, and conformity.  Even professors are happy that they get paid to do research and teach about topics that interest them, however irrelevant those topics might be to the workplace.  So if so many people are happy, who cares if, from an economic standpoint, it’s all a big signaling charade, with very little learning taking place?

For Caplan, the problem is this: because we’ve all labored under the mistaken theory that education imparts vital skills for a modern economy, there are trillions of dollars of government funding for every level of education—and that, in turn, removes the only obstacle to a credentialing arms race.  The equilbrium keeps moving over the decades, with more and more years of mostly-pointless schooling required to prove the same level of conscientiousness and conformity as before.  Jobs that used to require only a high-school diploma now require a bachelors; jobs that used to require only a bachelors now require a masters, and so on—despite the fact that the jobs themselves don’t seem to have changed appreciably.

For Caplan, a thoroughgoing libertarian, the solution is as obvious as it is radical: abolish government funding for education.  (Yes, he explicitly advocates a complete “separation of school and state.”)  Or if some state role in education must be retained, then let it concentrate on the three R’s and on practical job skills.  But what should teenagers do, if we’re no longer urging them to finish high school?  Apparently worried that he hasn’t yet outraged liberals enough, Caplan helpfully suggests that we relax the laws around child labor.  After all, he says, if we’ve decided anyway that teenagers who aren’t academically inclined should suffer through years of drudgery, then instead of warming a classroom seat, why shouldn’t they apprentice themselves to a carpenter or a roofer?  That way they could contribute to the economy, and gain the independence from their parents that most of them covet, and learn skills that they’d be much more likely to remember and use than the dissection of owl pellets.  Even if working a real job involved drudgery, at least it wouldn’t be as pointless as the drudgery of school.

Given his conclusions, and the way he arrives at them, Caplan realizes that he’ll come across to many as a cartoon stereotype of a narrow-minded economist, who “knows the price of everything but the value of nothing.”  So he includes some final chapters in which, setting aside the charts and graphs, he explains how he really feels about education.  This is the context for what I found to be the most striking passages in the book:

I am an economist and a cynic, but I’m not a typical cynical economist.  I’m a cynical idealist.  I embrace the ideal of transformative education.  I believe wholeheardedly in the life of the mind.  What I’m cynical about is people … I don’t hate education.  Rather I love education too much to accept our Orwellian substitute.  What’s Orwellian about the status quo?  Most fundamentally, the idea of compulsory enlightenment … Many idealists object that the Internet provides enlightenment only for those who seek it.  They’re right, but petulant to ask for more.  Enlightenment is a state of mind, not a skill—and state of mind, unlike skill, is easily faked.  When schools require enlightenment, students predictably respond by feigning interest in ideas and culture, giving educators a false sense of accomplishment. (p. 259-261)

OK, but if one embraces the ideal, then rather than dynamiting the education system, why not work to improve it?  According to Caplan, the answer is that we don’t know whether it’s even possible to build a mass education system that actually works (by his lights).  He says that, if we discover that we’re wasting trillions of dollars on some sector, the first order of business is simply to stop the waste.  Only later should we entertain arguments about whether we should restart the spending in some new, better way, and we shouldn’t presuppose that the sector in question will win out over others.


Above, I took pains to set out Caplan’s argument as faithfully as I could, before trying to pass judgment on it.  At some point in a review, though, the hour of judgment arrives.

I think Caplan gets many things right—even unpopular things that are difficult for academics to admit.  It’s true that a large fraction of what passes for education doesn’t deserve the name—even if, as a practical matter, it’s far from obvious how to cut that fraction without also destroying what’s precious and irreplaceable.  He’s right that there’s no sense in badgering weak students to go to college if those students are just going to struggle and drop out and then be saddled with debt.  He’s right that we should support vocational education and other non-traditional options to serve the needs of all students.  Nor am I scandalized by the thought of teenagers apprenticing themselves to craftspeople, learning skills that they’ll actually value while gaining independence and starting to contribute to society.  This, it seems to me, is a system that worked for most of human history, and it would have to fail pretty badly in order to do worse than, let’s say, the average American high school.  And in the wake of the disastrous political upheavals of the last few years, I guess the entire world now knows that, when people complain that the economy isn’t working well enough for non-college-graduates, we “technocratic elites” had better have a better answer ready than “well then go to college, like we did.”

Yes, probably the state has a compelling interest in trying to make sure nearly everyone is literate, and probably most 8-year-olds have no clue what’s best for themselves.  But at least from adolescence onward, I think that enormous deference ought to be given to students’ choices.  The idea that “free will” (in the practical rather than metaphysical sense) descends on us like a halo on our 18th birthdays, having been absent beforehand, is an obvious fiction.  And we all know it’s fiction—but it strikes me as often a destructive fiction, when law and tradition force us to pretend that we believe it.

Some of Caplan’s ideas dovetail with the thoughts I’ve had myself since childhood on how to make the school experience less horrible—though I never framed my own thoughts as “against education.”  Make middle and high schools more like universities, with freedom of movement and a wide range of offerings for students to choose from.  Abolish hall passes and detentions for lateness: just like in college, the teacher is offering a resource to students, not imprisoning them in a dungeon.  Don’t segregate by age; just offer a course or activity, and let kids of any age who are interested show up.  And let kids learn at their own pace.  Don’t force them to learn things they aren’t ready for: let them love Shakespeare because they came to him out of interest, rather than loathing him because he was forced down their throats.  Never, ever try to prevent kids from learning material they are ready for: instead of telling an 11-year-old teaching herself calculus to go back to long division until she’s the right age (does that happen? ask how I know…), say: “OK hotshot, so you can differentiate a few functions, but can you handle these here books on linear algebra and group theory, like Terry Tao could have when he was your age?”

Caplan mentions preschool as the one part of the educational system that strikes him as least broken.  Not because it has any long-term effects on kids’ mental development (it might not), just because the tots enjoy it at the time.  They get introduced to a wide range of fun activities.  They’re given ample free time, whether for playing with friends or for building or drawing by themselves.  They’re usually happy to be dropped off.  And we could add: no one normally minds if parents drop their kids off late, or pick them up early, or take them out for a few days.  The preschool is just a resource for the kids’ benefit, not a never-ending conformity test.  As a father who’s now seen his daughter in three preschools, this matches my experience.

Having said all this, I’m not sure I want to live in the world of Caplan’s “complete separation of school and state.”  And I’m not using “I’m not sure” only as a euphemism for “I don’t.”  Caplan is proposing a radical change that would take civilization into uncharted territory: as he himself notes, there’s not a single advanced country on earth that’s done what he advocates.  The trend has everywhere been in the opposite direction, to invest more in education as countries get richer and more technology-based.  Where there have been massive cutbacks to education, the causes have usually been things like famine or war.

So I have the same skepticism of Caplan’s project that I’d have (ironically) of Bolshevism or any other revolutionary project.  I say to him: don’t just persuade me, show me.  Show me a case where this has worked.  In the social world, unlike the mathematical world, I put little stock in long chains of reasoning unchecked by experience.

Caplan explicitly invites his readers to test his assertions against their own lives.  When I do so, I come back with a mixed verdict.  Before college, as you may have gathered, I find much to be said for Caplan’s thesis that the majority of school is makework, the main purposes of which are to keep the students out of trouble and on the premises, and to certify their conscientiousness and conformity.  There are inspiring teachers here and there, but they’re usually swimming against the tide.  I still feel lucky that I was able to finagle my way out by age 15, and enter Clarkson University and then Cornell with only a G.E.D.

In undergrad, on the other hand, and later in grad school at Berkeley, my experience was nothing like what Caplan describes.  The professors were actual experts: people who I looked up to or even idolized.  I wanted to learn what they wanted to teach.  (And if that ever wasn’t the case, I could switch to a different class, excepting some major requirements.)  But was it useful?

As I look back, many of my math and CS classes were grueling bootcamps on how to prove theorems, how to design algorithms, how to code.  Most of the learning took place not in the classroom but alone, in my dorm, as I struggled with the assignments—having signed up for the most advanced classes that would allow me in, and thereby left myself no escape except to prove to the professor that I belonged there.  In principle, perhaps, I could have learned the material on my own, but in reality I wouldn’t have.  I don’t still use all of the specific tools I acquired, though I do still use a great many of them, from the Gram-Schmidt procedure to Gaussian integrals to finding my way around a finite group or field.  Even if I didn’t use any of the tools, though, this gauntlet is what upgraded me from another math-competition punk to someone who could actually write research papers with long proofs.  For better or worse, it made me what I am.

Just as useful as the math and CS courses were the writing seminars—places where I had to write, and where my every word got critiqued by the professor and my fellow students, so I had to do a passable job.  Again: intensive forced practice in what I now do every day.  And the fact that it was forced was now fine, because, like some leather-bound masochist, I’d asked to be forced.

On hearing my story, Caplan would be unfazed.  Of course college is immensely useful, he’d say … for those who go on to become professors, like me or him.  He “merely” questions the value of higher education for almost everyone else.

OK, but if professors are at least good at producing more people like themselves, able to teach and do research, isn’t that something, a base we can build on that isn’t all about signaling?  And more pointedly: if this system is how the basic research enterprise perpetuates itself, then shouldn’t we be really damned careful with it, lest we slaughter the golden goose?

Except that Caplan is skeptical of the entire enterprise of basic research.  He writes:

Researchers who specifically test whether education accelerates progress have little to show for their efforts.  One could reply that, given all the flaws of long-run macroeconomic data, we should ignore academic research in favor of common sense.  But what does common sense really say? … True, ivory tower self-indulgence occasionally revolutionizes an industry.  Yet common sense insists the best way to discover useful ideas is to search for useful ideas—not to search for whatever fascinates you and pray it turns out to be useful (p. 175).

I don’t know if common sense insists that, but if it does, then I feel on firm ground to say that common sense is woefully inadequate.  It’s easy to look at most basic research, and say: this will probably never be useful for anything.  But then if you survey the inventions that did change the world over the past century—the transistor, the laser, the Web, Google—you find that almost none would have happened without what Caplan calls “ivory tower self-indulgence.”  What didn’t come directly from universities came from entities (Bell Labs, DARPA, CERN) that wouldn’t have been thinkable without universities, and that themselves were largely freed from short-term market pressures by governments, like universities are.

Caplan’s skepticism of basic research reminded me of a comment in Nick Bostrom’s book Superintelligence:

A colleague of mine likes to point out that a Fields Medal (the highest honor in mathematics) indicates two things about the recipient: that he was capable of accomplishing something important, and that he didn’t.  Though harsh, the remark hints at a truth. (p. 314)

I work in theoretical computer science: a field that doesn’t itself win Fields Medals (at least not yet), but that has occasions to use parts of math that have won Fields Medals.  Of course, the stuff we use cutting-edge math for might itself be dismissed as “ivory tower self-indulgence.”  Except then the cryptographers building the successors to Bitcoin, or the big-data or machine-learning people, turn out to want the stuff we were talking about at conferences 15 years ago—and we discover to our surprise that, just as the mathematicians gave us a higher platform to stand on, so we seem to have built a higher platform for the practitioners.  The long road from Hilbert to Gödel to Turing and von Neumann to Eckert and Mauchly to Gates and Jobs is still open for traffic today.

Yes, there’s plenty of math that strikes even me as boutique scholasticism: a way to signal the brilliance of the people doing it, by solving problems that require years just to understand their statements, and whose “motivations” are about 5,000 steps removed from anything Caplan or Bostrom would recognize as motivation.  But where I part ways is that there’s also math that looked to me like boutique scholasticism, until Greg Kuperberg or Ketan Mulmuley or someone else finally managed to explain it to me, and I said: “ah, so that’s why Mumford or Connes or Witten cared so much about this.  It seems … almost like an ordinary applied engineering question, albeit one from the year 2130 or something, being impatiently studied by people a few moves ahead of everyone else in humanity’s chess game against reality.  It will be pretty sweet once the rest of the world catches up to this.”


I have a more prosaic worry about Caplan’s program.  If the world he advocates were actually brought into being, I suspect the people responsible wouldn’t be nerdy economics professors like himself, who have principled objections to “forced enlightenment” and to signalling charades, yet still maintain warm fuzzies for the ideals of learning.  Rather, the “reformers” would be more on the model of, say, Steve Bannon or Scott Pruitt or Alex Jones: people who’d gleefully take a torch to the universities, fortresses of the despised intellectual elite, not in the conviction that this wouldn’t plunge humanity back into the Dark Ages, but in the hope that it would.

When the US Congress was debating whether to cancel the Superconducting Supercollider, a few condensed-matter physicists famously testified against the project.  They thought that $10-$20 billion for a single experiment was excessive, and that they could provide way more societal value with that kind of money were it reallocated to them.  We all know what happened: the SSC was cancelled, and of the money that was freed up, 0%—absolutely none of it—went to any of the other research favored by the SSC’s opponents.

If Caplan were to get his way, I fear that the story would be similar.  Caplan talks about all the other priorities—from feeding the world’s poor to curing diseases to fixing crumbling infrastructure—that could be funded using the trillions currently wasted on runaway credential signaling.  But in any future I can plausibly imagine where the government actually axes education, the savings go to things like enriching the leaders’ cronies and launching vanity wars.

My preferences for American politics have two tiers.  In the first tier, I simply want the Democrats to vanquish the Republicans, in every office from president down to dogcatcher, in order to prevent further spiraling into nihilistic quasi-fascism, and to restore the baseline non-horribleness that we know is possible for rich liberal democracies.  Then, in the second tier, I want the libertarians and rationalists and nerdy economists and Slate Star Codex readers to be able to experiment—that’s a key word here—with whether they can use futarchy and prediction markets and pricing-in-lieu-of-regulation and other nifty ideas to improve dramatically over the baseline liberal order.  I don’t expect that I’ll ever get what I want; I’ll be extremely lucky even to get the first half of it.  But I find that my desires regarding Caplan’s program fit into the same mold.  First and foremost, save education from those who’d destroy it because they hate the life of the mind.  Then and only then, let people experiment with taking a surgical scalpel to education, removing from it the tumor of forced enlightenment, because they love the life of the mind.

27 Apr 21:34

Administrators at CUNY and Duke Aren’t Going to Do Anything About Students Who Disrupted Events

by Robby Soave

DukeIn the wake of shutdown attempts led by student-activists, administrators at Duke University and the City University of New York have finally made clear what their battle plan is for deterring such behavior in the future: do nothing.

Activists at Duke recently hijacked an alumni event and shouted down their own president, Vincent Price. The students were then shocked and outraged to learn that the administration was considering punishing them—even the mere suggestion of discipline was triggering and would exacerbate their "pre-existing mental health conditions," they claimed.

And so the administration folded. All student conduct investigations have been closed, according to The Daily Tar Heel.

At CUNY, student-protesters crashed a planned speech by South Texas College of Law Professor Josh Blackman. The talked over him for the first ten minutes of the event before leaving, which prevented Blackman from delivering his full remarks and may have intimidated would-be attendees. CUNY Law Dean Mary Lu Bilek essentially said that this was fine—Blackman was able to speak for some the time, so no college policy had been violated. "This non-violent, limited protest was a reasonable exercise of protected free speech, and it did not violate any university policy," she said.

Several CUNY professors—Martin Burke, David Gordon, K.C. Johnson, and David Seidermann—have now written a letter to CUNY Chanellor James Milliken asking him to "reaffirm CUNY's support for the rights of invited speakers to speak and the rights of students in their audience to hear their remarks." According to their letter:

Dean Bilek cited no provision of the student handbook to sustain her claim that "limited" disruptions of an invited speaker's talk do not violate CUNY policy. The handbook, we should note, implies the reverse, holding that "a member of the academic community shall not intentionally obstruct and/or forcibly prevent others from the exercise of their rights. Nor shall she/he interfere with the institution's educational process or facilities, or the rights of those who wish to avail themselves of any of the institution's instructional, personal, administrative, recreational, and community services."

It is noteworthy that, according to Blackman, the disruptors were intimidating enough to discourage some students from entering the room while the protesters were there. He was, he has stated, "not able to give the presentation I wanted—both in terms of duration and content—because of the hecklers. The Dean is simply incorrect when said the protest was only 'limited.'" (Blackman had planned a 45-minute address, to be followed by a question-and-answer session, thereby planning to allow time for CUNY Law students, including his critics, to ask him questions about his arguments.) Photographs of the event show the disruptors not only preventing him from delivering a portion of his planned remarks but also obstructing the audience's view of his PowerPoint presentation.

It's appropriate for university officials to exercise some caution when contemplating disciplinary action against students. But at some point, letting students face absolutely no consequences for such behavior is putting the free speech rights of everyone else at risk.

26 Apr 17:59

Lab-Grown Meat Is Coming to Your Supermarket. Ranchers Are Fighting Back: New at Reason

by Zach Weissmueller

Would you eat a hamburger or a chicken nugget made of meat grown in a laboratory?

Joshua Tetrick, co-founder and CEO of JUST, is betting that you will. The San Francisco-based company has been producing and selling non-animal versions of food, like mayonnaise, since 2013, and it's raised more than $310 million in venture capital.

Tetrick and his team have created products like Just Mayo by identifying plant-based alternatives to common animal products, like eggs, using a combination of lab experiments and machine-learning.

JUST is one of a handful of tech companies working to disrupt the meat production industry.

While many of its competitors are pursuing better plant-based meat substitutes, JUST is pushing ahead with so-called "clean meat," or lab-grown animal tissue that requires no farming, no feeding of livestock, and no slaughterhouses. Only a single sample from a single animal that's duplicated endlessly.

JUST and companies like it are poised to disrupt the livestock industry. So established players are turning to the government to help protect their turf.

The United States Cattlemen's Association, which declined to participate in this story, submitted a petition still under consideration by the United States Department of Agriculture asking that the words "meat" and "beef" exclude any products that "are neither derived from animals, nor slaughtered in the traditional manner."

Tetrick says accurate labeling will be essential when marketing his lab-grown "clean meat," which he hopes will transcend the vegan vs. carnivore paradigm.

"We don't allow the term 'vegan' to be used in our company," says Tetrick. "That word ends up turning off ninety-nine percent of people."

This isn't Tetrick's first fight with entrenched food interests.

When the company's first product, Just Mayo, appeared on the shelves of major retailers, the American Egg Board went on the offensive.

According to internal emails obtained by MIT researchers through the Freedom of Information Act, Egg Board members tried and failed to get Whole Foods to pull the product from its shelves and hired a network of writers to trash the product on food review sites.

Target stopped selling Just Mayo after receiving an anonymous letter about food safety, but a Food and Drug Administration investigation later found that the product was safe. Investigators failed to identify the author of the letter.

At one point, Egg Board members even discussed putting out a "hit" on Tetrick, with one member writing that he should get have his "old buddies from Brooklyn pay him a visit." The officials later told investigators that they were joking.

Whether or not consumers are ready for lab-grown meat is yet to be seen, and the company landed in hot water with the SEC in 2016 after being accused of buying its own products off the shelves to boost sales figures with the goal of raising more venture capital, though the company claims it was a quality control measure. No charges resulted from the investigation.

With JUST products in more than 20,000 stores, plans to release lab-grown clean meat onto the market by the end of the year at a retail price within 30 percent of that of traditional meat, Tetrick is optimistic about the future of the company and the global food system.

"In tomorrow's world, you can eat more meat, hopefully safer meat, even better tasting meat, without eating the animal," says Tetrick.

Produced by Zach Weissmueller. Camera by Alex Manning.

Subscribe to our YouTube channel.

Like us on Facebook.

Follow us on Twitter.

Subscribe to our podcast at iTunes.

"Scuba" by Metre is licensed under a Attribution-NonCommercial-ShareAlike License (https://creativecommons.org/licenses/by-nc-sa/4.0/) Source: http://freemusicarchive.org/music/Metre/Circuit_1731/Scuba_1957

Artist: http://freemusicarchive.org/music/Metre/

"Space Probe" by Metre is licensed under a Attribution-NonCommercial-ShareAlike License (https://creativecommons.org/licenses/by-nc-sa/4.0/) Source: http://freemusicarchive.org/music/Metre/Circuit_1731/Space_Probe

Artist: http://freemusicarchive.org/music/Metre/

"Deluge" by Cellophane Sam is licensed under a Attribution-NonCommercial 3.0 License (https://creativecommons.org/licenses/by-nc/3.0/us/) Source: http://freemusicarchive.org/music/Cellophane_Sam/Sea_Change/01_Deluge

Artist: http://freemusicarchive.org/music/Cellophane_Sam/

"Deluge" by Cellophane Sam is licensed under a Attribution-NonCommercial 3.0 License (https://creativecommons.org/licenses/by-nc/3.0/us/) Source: http://freemusicarchive.org/music/Cellophane_Sam/Sea_Change/01_Deluge

Artist: http://freemusicarchive.org/music/Cellophane_Sam/

Photo Credits: Sasha Gusov / Axiom Photographic/Newscom

View this article.

26 Apr 17:23

Applied Category Theory Course: Ordered Sets

by John Baez

My applied category theory course based on Fong and Spivak’s book Seven Sketches is going well. Over 250 people have registered for the course, which allows them to ask question and discuss things. But even if you don’t register you can read my “lectures”.

Here are all the lectures on Chapter 1, which is about adjoint functors between posets, and how they interact with meets and joins. We study the applications to logic – both classical logic based on subsets, and the nonstandard version of logic based on partitions. And we show how this math can be used to understand “generative effects”: situations where the whole is more than the sum of its parts!

Lecture 1 – Introduction
Lecture 2 – What is Applied Category Theory?
Lecture 3 – Chapter 1: Preorders
Lecture 4 – Chapter 1: Galois Connections
Lecture 5 – Chapter 1: Galois Connections
Lecture 6 – Chapter 1: Computing Adjoints
Lecture 7 – Chapter 1: Logic
Lecture 8 – Chapter 1: The Logic of Subsets
Lecture 9 – Chapter 1: Adjoints and the Logic of Subsets
Lecture 10 – Chapter 1: The Logic of Partitions
Lecture 11 – Chapter 1: The Poset of Partitions
Lecture 12 – Chapter 1: Generative Effects
Lecture 13 – Chapter 1: Pulling Back Partitions
Lecture 14 – Chapter 1: Adjoints, Joins and Meets
Lecture 15 – Chapter 1: Preserving Joins and Meets
Lecture 16 – Chapter 1: The Adjoint Functor Theorem for Posets
Lecture 17 – Chapter 1: The Grand Synthesis

If you want to discuss these things, please visit the Azimuth Forum and register! Use your full real name as your username, with no spaces, and use a real working email address. If you don’t, I won’t be able to register you. Your email address will be kept confidential.

I’m finding this course a great excuse to put my thoughts about category theory into a more organized form, and it’s displaced most of the time I used to spend on Google+ and Twitter. That’s what I wanted: the conversations in the course are more interesting!

26 Apr 17:19

Meteorologist

Hi, I'm your new meteorologist and a former software developer. Hey, when we say 12pm, does that mean the hour from 12pm to 1pm, or the hour centered on 12pm? Or is it a snapshot at 12:00 exactly? Because our 24-hour forecast has midnight at both ends, and I'm worried we have an off-by-one error.
26 Apr 16:36

'Qui ouvre une école, ferme une prison.'

by Minnesotastan

The title is a quote from Victor Hugo: "Each time you open a new school, you shut down a prison."

Photo cropped for size and emphasis and brightened from the original here.
26 Apr 16:27

A concise history of hookworm in the American south

by Minnesotastan
For more than three centuries, a plague of unshakable lethargy blanketed the American South.

It began with “ground itch,” a prickly tingling in the tender webs between the toes, which was soon followed by a dry cough. Weeks later, victims succumbed to an insatiable exhaustion and an impenetrable haziness of the mind that some called stupidity. Adults neglected their fields and children grew pale and listless. Victims developed grossly distended bellies and “angel wings”—emaciated shoulder blades accentuated by hunching. All gazed out dully from sunken sockets with a telltale “fish-eye” stare.

The culprit behind “the germ of laziness,” as the South’s affliction was sometimes called, was Necator americanus—the American murderer. Better known today as the hookworm, millions of those bloodsucking parasites lived, fed, multiplied, and died within the guts of up to 40% of populations stretching from southeastern Texas to West Virginia. Hookworms stymied development throughout the region and bred stereotypes about lazy, moronic Southerners...

“You had an entire class of Southern society—including whites, blacks, and Native Americans—that was looked upon as shiftless, lazy good-for-nothings who can’t do a day’s work,” my mom explained to me. “Hookworms tainted the nation’s picture of what a Southerner looked and acted like.”
The rest of the story, with a video, is at PBS.
26 Apr 16:02

How to upper-bound the probability of something bad

by Scott

Scott Alexander has a new post decrying how rarely experts encode their knowledge in the form of detailed guidelines with conditional statements and loops—or what one could also call flowcharts or expert systems—rather than just blanket recommendations.  He gives, as an illustration of what he’s looking for, an algorithm that a psychiatrist might use to figure out which antidepressants or other treatments will work for a specific patient—with the huge proviso that you shouldn’t try his algorithm at home, or (most importantly) sue him if it doesn’t work.

Compared to a psychiatrist, I have the huge advantage that if my professional advice fails, normally no one gets hurt or gets sued for malpractice or commits suicide or anything like that.  OK, but what do I actually know that can be encoded in if-thens?

Well, one of the commonest tasks in the day-to-day life of any theoretical computer scientist, or mathematician of the Erdös flavor, is to upper bound the probability that something bad will happen: for example, that your randomized algorithm or protocol will fail, or that your randomly constructed graph or code or whatever it is won’t have the properties needed for your proof.

So without further ado, here are my secrets revealed, my ten-step plan to probability-bounding and computer-science-theorizing success.

Step 1. “1” is definitely an upper bound on the probability of your bad event happening.  Check whether that upper bound is good enough.  (Sometimes, as when this is an inner step in a larger summation over probabilities, the answer will actually be yes.)

Step 2. Try using Markov’s inequality (a nonnegative random variable exceeds its mean by a factor of k at most a 1/k fraction of the time), combined with its close cousin in indispensable obviousness, the union bound (the probability that any of several bad events will happen, is at most the sum of the probabilities of each bad event individually).  About half the time, you can stop right here.

Step 3. See if the bad event you’re worried about involves a sum of independent random variables exceeding some threshold. If it does, hit that sucker with a Chernoff or Hoeffding bound.

Step 4. If your random variables aren’t independent, see if they at least form a martingale: a fancy word for a sum of terms, each of which has a mean of 0 conditioned on all the earlier terms, even though it might depend on the earlier terms in subtler ways.  If so, Azuma your problem into submission.

Step 5. If you don’t have a martingale, but you still feel like your random variables are only weakly correlated, try calculating the variance of whatever combination of variables you care about, and then using Chebyshev’s inequality: the probability that a random variable differs from its mean by at most k times the standard deviation (i.e., the square root of the variance) is at most 1/k2.  If the variance doesn’t work, you can try calculating some higher moments too—just beware that, around the 6th or 8th moment, you and your notebook paper will likely both be exhausted.

Step 6. OK, umm … see if you can upper-bound the variation distance between your probability distribution and a different distribution for which it’s already known (or is easy to see) that it’s unlikely that anything bad happens. A good example of a tool you can use to upper-bound variation distance is Pinsker’s inequality.

Step 7. Now is the time when you start ransacking Google and Wikipedia for things like the Lovász Local Lemma, and concentration bounds for low-degree polynomials, and Hölder’s inequality, and Talagrand’s inequality, and other isoperimetric-type inequalities, and hypercontractive inequalities, and other stuff that you’ve heard your friends rave about, and have even seen successfully used at least twice, but there’s no way you’d remember off the top of your head under what conditions any of this stuff applies, or whether any of it is good enough for your application. (Just between you and me: you may have already visited Wikipedia to refresh your memory about the earlier items in this list, like the Chernoff bound.) “Try a hypercontractive inequality” is surely the analogue of the psychiatrist’s “try electroconvulsive therapy,” for a patient on whom all milder treatments have failed.

Step 8. So, these bad events … how bad are they, anyway? Any chance you can live with them?  (See also: Step 1.)

Step 9. You can’t live with them? Then back up in your proof search tree, and look for a whole different approach or algorithm, which would make the bad events less likely or even kill them off altogether.

Step 10. Consider the possibility that the statement you’re trying to prove is false—or if true, is far beyond any existing tools.  (This might be the analogue of the psychiatrist’s: consider the possibility that evil conspirators really are out to get your patient.)

26 Apr 15:51

Poem of the Day: In a Word, a World

by C. D. Wright
I love them all.

I love that a handful, a mouthful, gets you by, a satchelful can land you a job, a
well-chosen clutch of them could get you laid, and that a solitary word can initiate
a stampede, and therefore can be formally outlawed—even by a liberal court
bent on defending a constitution guaranteeing unimpeded utterance. I love that
the Argentine gaucho has over two hundred words for the coloration of horses
and the Sami language of Scandinavia has over a thousand words for reindeer
based on age, sex, appearance—e.g., a busat has big balls or only one big ball.
More than the pristine, I love the filthy ones for their descriptive talent as well as
transgressive nature. I love the dirty ones more than the minced, in that I respect
extravagant expression more than reserved. I admire reserve, especially when
taken to an ascetic nth. I love the particular lexicons of particular occupations.
The substrate of those activities. The nomenclatures within nomenclatures. I am
of the unaccredited school that believes animals did not exist until Adam assigned
them names. My relationship to the word is anything but scientific; it is a matter
of faith on my part, that the word endows material substance, by setting the thing
named apart from all else. Horse, then, unhorses what is not horse.
 

C. D. Wright, "In a Word, a World" from The Poet, the Lion, Talking Pictures, El Farolito, a Wedding in St. Roch, the Big Box Store, the Warp in the Mirror, Spring, Midnights, Fire & All. Copyright © 2016 by C. D. Wright. Reprinted by permission of Copper Canyon Press, www.coppercanyonpress.org.

Source: The Poet, the Lion, Talking Pictures, El Farolito, a Wedding in St. Roch, the Big Box Store, the Warp in the Mirror, Spring, Midnights, Fire & All(Copper Canyon Press, 2016)

C. D. Wright

Biography
More poems by this author

11 Apr 21:20

Dynamical Systems and Their Steady States

by john
MathML-enabled post (click for more details).

guest post by Maru Sarazola

Now that we know how to use decorated cospans to represent open networks, the Applied Category Theory Seminar has turned its attention to open reaction networks (aka Petri nets) and the dynamical systems associated to them.

In A Compositional Framework for Reaction Networks (summarized in this very blog by John Baez not too long ago), authors John Baez and Blake Pollard put Fong’s results to good use and define cospan categories RxNet\mathbf{RxNet} and Dynam\mathbf{Dynam} of (open) reaction networks and (open) dynamical systems. Once this is done, the main goal of the paper is to show that the mapping that associates to an open reaction network its corresponding dynamical system is compositional, as is the mapping that takes an open dynamical system to the relation that holds between its constituents in steady state. In other words, they show that the study of the whole can be done through the study of the parts.

I would like to place the focus on dynamical systems and the study of their steady states, taking a closer look at this correspondence called “black-boxing”, and comparing it to previous related work done by David Spivak.

MathML-enabled post (click for more details).

Baez–Pollard’s approach

The category Dynam\mathbf{Dynam} of open dynamical systems

Let’s start by introducing the main players. A dynamical system is usually defined as a manifold MM whose points are “states”, together with a smooth vector field on MM saying how these states evolve in time. Since the motivation in this paper comes from chemistry, our manifolds will be euclidean spaces ℝ S\mathbb{R}^S, where SS should be thought of as the finite set of species involved, and a vector c∈ℝ Sc\in\mathbb{R}^S gives the concentration of each species. Then, the dynamical system is a differential equation

dc(t)dt=v(c(t))\frac{d c(t)}{d t}=v(c(t))

where c:ℝ→ℝ Sc:\mathbb{R}\to\mathbb{R}^S gives the concentrations as a function of time, and vv is a vector field on ℝ S\mathbb{R}^S.

Now imagine our motivating chemical system is open; that is, we are allowed to inject molecules of some chosen species, and remove some others. An open dynamical system is a cospan of finite sets

together with a vector field vv on ℝ S\mathbb{R}^S. Here the legs of the cospan mark the species that we’re allowed to inject and remove, labeled ii (oo) for input (output).

So, how can we build a category from this? Loosely citing a result of Fong, if the decorations of the cospan (in this case, the vector fields) can be given through a functor F:(FinSet,+)→(Set,×)F:(\mathbf{FinSet},+)\to(\mathbf{Set},\times ) that is lax monoidal, then we can form a category whose objects are finite sets, and whose morphisms are (iso classes of) decorated cospans.

Indeed, this can be done in a very natural way, and therefore gives rise to the category Dynam\mathbf{Dynam}, whose morphisms are open dynamical systems.

The black-boxing functor ▪:Dynam→Rel\blacksquare :\mathbf{Dynam}\to\mathbf{Rel}

Given a dynamical system, one of the first things we might like to do is to study its fixed points; in our case, study the concentration vectors that remain constant in time. When working with an open dynamical system, it’s clear that the amounts that we choose to inject and remove will alter the change in concentration of our species, and hence it makes sense to consider the following.

For an open dynamical system (X→iS←oY,v)(X\xrightarrow{i} S \xleftarrow{o} Y, v), together with a constant inflow I∈ℝ XI\in\mathbb{R}^X and constant outflow O∈ℝ YO\in\mathbb{R}^Y, a steady state (with inflows II and outflows OO) is a constant vector of concentrations c∈ℝ Sc\in\mathbb{R}^S such that

v(c)+i *(I)−o *(O)=0v(c)+i_{\ast} (I)-o_{\ast} (O)=0

Here i *(I)i_{\ast} (I) is the vector in ℝ S\mathbb{R}^S given by i *(I)(s)=∑ x∈X:i(x)=sI(x)i_{\ast} (I)(s)=\sum_{x\in X: i(x)=s} I(x); that is, the inflow concentration of all species as marked by the input leg of the cospan. As the authors concisely put it, “in a steady state, the inflows and outflows conspire to exactly compensate for the reaction velocities”.

Note that the inflow and outflow functions II and OO won’t affect any species not marked by the legs of the cospan, and so any steady state cc must be such that v(c)=0v(c)=0 when restricted to these inner species that we can’t reach.

What we want to do next is build a functor that, given an open dynamical system, records all possible combinations of input concentrations, output concentrations, inflows and outflows that hold in steady state. This process will be called black-boxing, since it discards information that can’t be seen at the inputs and outputs.

The black-boxing functor ▪:Dynam→Rel\blacksquare:\mathbf{Dynam}\to \mathbf{Rel} takes a finite set XX to the vector space ℝ X⊕ℝ X\mathbb{R}^X\oplus\mathbb{R}^X, and a morphism, that is, an open dynamical system f=(X→iS←oY,v)f=(X\xrightarrow{i} S \xleftarrow{o} Y, v), to the subset

▪(f)⊆ℝ X⊕ℝ X⊕ℝ Y⊕ℝ Y\blacksquare(f)\subseteq\mathbb{R}^X\oplus\mathbb{R}^X\oplus\mathbb{R}^Y\oplus\mathbb{R}^Y

▪(f)={(i *(c),I,o *(c),O):c  is a steady state with inflows  I  and outflows  O}\blacksquare(f)=\{(i^{\ast} (c),I,o^{\ast} (c),O): c &nbsp; \text{ is a steady state with inflows } &nbsp; I &nbsp; \text{ and outflows } &nbsp; O\}

where i *(c)i^{\ast} (c) is the vector in ℝ X\mathbb{R}^X defined by i *(c)(x)=c(i(x))i^{\ast} (c) (x)=c(i(x)); that is, the concentration of the input species.

The authors prove that black-boxing is indeed a functor, which implies that if we want to study the steady states of a complex open dynamical system, we can break it up into smaller, simpler pieces and study their steady states. In other words, studying the steady states of a big system, which is given by the composition of smaller systems (as morphisms in the category Dynam\mathbf{Dynam}) amounts to studying the steady states of each of the smaller systems, and composing them (as morphisms in Rel\mathbf{Rel}).

Spivak’s approach

The category 𝒲\mathcal{W} of wiring diagrams

Instead of dealing with dynamical systems from the start, Spivak takes a step back and develops a syntax for boxes, which are things that admit inputs and outputs.

Let’s define the category 𝒲 𝒞\mathcal{W}_\mathcal{C} of 𝒞\mathcal{C}-boxes and wiring diagrams, for a category 𝒞\mathcal{C} with finite products. Its objects are pairs

X=(X in,X out)X=(X^\text{in},X^\text{out})

where each of these coordinates is a finite product of objects of 𝒞\mathcal{C}. For example, we interpret the pair (A 1×A 2,B 1×B 2×B 3)(A_1\times A_2, B_1\times B_2\times B_3) as a box with input ports (a 1,a 2)∈A 1×A 2(a_1 ,a_2)\in A_1\times A_2 and output ports (b 1,b 2,b 3)∈B 1×B 2×B 3(b_1 ,b_2 ,b_3 )\in B_1\times B_2\times B_3.

Its morphisms are wiring diagrams φ:X→Y\varphi:X\to Y, that is, pairs of maps (φ in,φ out)(\varphi^\text{in},\varphi^\text{out}) which we interpret as a rewiring of the box XX inside of the box YY. The function φ in\varphi^\text{in} indicates whether an input port of XX should be attached to an input of YY or to an output of XX itself; the function φ out\varphi^\text{out} indicates how the outputs of XX feed the outputs of YY. Examples of wirings are

Composition is given by a nesting of wirings.

Given boxes XX and YY, we define their parallel composition by

X⊠Y=(X in×Y in,X out×Y out)X\boxtimes Y=(X^\text{in}\times Y^\text{in},X^\text{out}\times Y^\text{out})

This gives a monoidal structure to the category 𝒲 𝒞\mathcal{W}_\mathcal{C}. Parallel composition is true to its name, as illustrated by

The huge advantage of this approach is that one can now fill the boxes with suitable “inhabitants”, and model many different situations that look like wirings at their core. These inhabitants will be given through functors 𝒲 𝒞→Set\mathcal{W}_\mathcal{C}\to\mathbf{Set}, taking a box to the set of its desired interpretations, and giving a meaning to the wiring of boxes.

The functor ODS:𝒲 Euc→SetODS:\mathcal{W}_{\mathbf{Euc}}\to\mathbf{Set} of open dynamical systems

The first of our inhabitants will be, as you probably guessed by now, open dynamical systems. Here 𝒞=Euc\mathcal{C}=\mathbf{Euc} is the category of Euclidean spaces ℝ n\mathbb{R}^n and smooth maps.

From the perspective of Spivak’s paper, an (ℝ X,ℝ Y)(\mathbb{R}^X,\mathbb{R}^Y)-open dynamical system is a 3-tuple (ℝ S,f dyn,f rdt)(\mathbb{R}^S,f^\text{dyn},f^\text{rdt}) where

  • ℝ S\mathbb{R}^S is the state space

  • f dyn:ℝ X×ℝ S→ℝ Sf^\text{dyn}:\mathbb{R}^X\times\mathbb{R}^S\to\mathbb{R}^S is a vector field parametrized by the inputs ℝ X\mathbb{R}^X, giving the differential equation of the system

  • f rdt:ℝ S→ℝ Yf^\text{rdt}:\mathbb{R}^S\to\mathbb{R}^Y is the readout function at the outputs ℝ Y\mathbb{R}^Y.

One should notice the similarity with our previously defined dynamical systems, although it’s clear that the two definitions are not equivalent.

The functor ODS:𝒲 Euc→SetODS:\mathcal{W}_{\mathbf{Euc}}\to\mathbf{Set} exhibiting dynamical systems as inhabitants of input-output boxes, takes a box X=(X in,X out)X=(X^\text{in},X^\text{out}) to the set of all (ℝ X in,ℝ X out)(\mathbb{R}^{X^\text{in}},\mathbb{R}^{X^\text{out}})-dynamical systems

ODS(X)={(ℝ S,f dyn:ℝ X in×ℝ S→ℝ S,f rdt:ℝ S→ℝ X out)}ODS(X)=\{(\mathbb{R}^S,f^\text{dyn}:\mathbb{R}^{X^\text{in}}\times\mathbb{R}^S\to\mathbb{R}^S,f^\text{rdt}:\mathbb{R}^S\to\mathbb{R}^{X^\text{out}})\}

You can surely figure out how ODSODS acts on wirings by drawing a picture and doing a bit of careful bookkeeping.

Note that there’s a natural notion of parallel composition of two dynamical systems, which amounts to carrying out the processes indicated by the two dynamical systems in parallel. Spivak shows that ODSODS is a functor, and, furthermore, that

ODS(X⊠Y)≃ODS(X)⊠ODS(Y)ODS(X\boxtimes Y)\simeq ODS(X)\boxtimes ODS(Y)

The functor Mat:𝒲 𝒞→SetMat:\mathcal{W}_{\mathcal{C}}\to\mathbf{Set} of Set\mathbf{Set}-matrices

Our second inhabitants will be given by matrices of sets. For objects X,YX,Y, an (X,Y)(X,Y)-matrix of sets is a function MM that assigns to each pair (x,y)(x,y) a set M x,yM_{x,y}. In other words, it is a matrix indexed by X×YX\times Y that, instead of coefficients, has sets in each position.

The functor Mat:𝒲 𝒞→SetMat:\mathcal{W}_{\mathcal{C}}\to\mathbf{Set} exhibiting Set\mathbf{Set}-matrices as inhabitants of input-output boxes, takes a box X=(X in,X out)X=(X^\text{in},X^\text{out}) to the set of all (X in,X out)(X^\text{in},X^\text{out})-matrices of sets

Mat(X)={{M i,j} X in×X out:M i,j  is a set}Mat(X)=\{\{M_{i,j}\}_{X^\text{in}\times X^\text{out}} : M_{i,j} &nbsp; \text{ is a set}\}

Once again, it’s not too hard to figure out how MatMat should act on wirings.

Like before, there’s a notion of parallel composition of two matrices of sets, and the author shows that MatMat is a functor such that

Mat(X⊠Y)≃Mat(X)⊠Mat(Y)Mat(X\boxtimes Y)\simeq Mat(X)\boxtimes Mat(Y)

The steady-state natural transformation Stst:ODS→MatStst:ODS\to Mat

Finally, we explain how to use all this to study steady states of dynamical systems.

Given an (ℝ X,ℝ Y)(\mathbb{R}^X,\mathbb{R}^Y)-dynamical system f=(ℝ S,f dyn,f rdt)f=(\mathbb{R}^S,f^\text{dyn},f^\text{rdt}) and an element (I,O)∈ℝ X×ℝ Y(I,O)\in\mathbb{R}^X\times\mathbb{R}^Y, an (I,O)(I,O)-steady state is a state c∈ℝ Sc\in\mathbb{R}^S such that

f dyn(I,c)=0   and   f rdt(c)=Of^\text{dyn}(I,c)=0 &nbsp; &nbsp; \text{ and } &nbsp; &nbsp; f^\text{rdt}(c)=O

Since dynamical systems are encoded by the functor ODSODS, it makes sense to study steady states through a natural transformation out of ODSODS. We define Stst:ODS→MatStst:ODS\to Mat as the transformation that assigns to each box XX, the function

Stst X:ODS(X)⟶Mat(X)Stst_X:ODS(X)\longrightarrow Mat(X)

taking a dynamical system (ℝ S,f dyn,f rdt)(\mathbb{R}^S,f^\text{dyn},f^\text{rdt}) to its matrix of steady states

M I,O={c∈ℝ S:f dyn(I,c)=0, f rdt(c)=O}M_{I,O}=\{c\in\mathbb{R}^S : f^\text{dyn}(I,c)=0, &nbsp; f^\text{rdt}(c)=O\}

where (I,O)∈ℝ X in×ℝ X out(I,O)\in \mathbb{R}^{X^\text{in}}\times \mathbb{R}^{X^\text{out}}. The author proceeds to show that StstStst is a monoidal natural transformation.

Is it possible to use this machinery to draw the same conclusion as before, that is, that the steady states of a composition of systems comes from the composition of the steady states of the parts?

Indeed, it is! Given two boxes X 1X_1 and X 2X_2, we recover the usual notion of (serial) composition by first setting them in parallel X 1⊠X 2X_1 \boxtimes X_2,

and wiring this by φ:X 1⊠X 2→Y\varphi:X_1 \boxtimes X_2\to Y as follows:

The fact that StstStst is a monoidal natural transformation, combined with the facts that the functors ODSODS and MatMat respect parallel composition, allows us to write the following diagram, where both squares are commutative

Then, chasing the diagram along the top and left sides gives the steady states of the serial composition of the dynamical systems X 1X_1 and X 2X_2, while chasing it along the right and bottom sides gives the composition of the steady states of X 1X_1 and of X 2X_2, and the two must agree.

The two approaches, side by side

So how are these two perspectives related? Looking at the definitions we can immediately see that Spivak’s approach has a broader scope than Baez and Pollard’s, so it’s apparent that his results won’t be implied by theirs.

For the converse direction, recall that in the first paper, a dynamical system is given by a decorated cospan f=(X→iS←oY,v)f=(X\xrightarrow{i} S \xleftarrow{o} Y, v), and a steady state with inflows II and outflows OO is a constant vector of concentrations c∈ℝ Sc\in\mathbb{R}^S such that

v(c)+i *(I)−o *(O)=0v(c)+i_{\ast} (I)-o_{\ast} (O)=0

Thus, studying the steady states for this cospan system corresponds to studying the box system

f=(ℝ S,f dyn:ℝ X×ℝ S→ℝ S,f rdt:ℝ S→ℝ Y)f=(\mathbb{R}^S, f^\text{dyn}:\mathbb{R}^X\times\mathbb{R}^S\to\mathbb{R}^S, f^\text{rdt}:\mathbb{R}^S\to\mathbb{R}^Y)

with dynamics given by f dyn(I,c)=v(c)+i *(I)−o *(f rdt(c))f^\text{dyn}(I,c)=v(c)+i_{\ast} (I)-o_{\ast} (f^\text{rdt}(c)), since its (I,O)(I,O)-steady states are vectors c∈ℝ Sc\in\mathbb{R}^S such that

f dyn(I,c)=0   and   f rdt(c)=Of^\text{dyn}(I,c)=0 &nbsp; &nbsp; \text{ and } &nbsp; &nbsp; f^\text{rdt}(c)=O

Thus, the study of the steady states of a given cospan dynamical system can be done just as well by looking at it as a box dynamical system and running it through Spivak’s machinery. However, setting two such box systems in serial composition will not yield the box system representing the composition of the cospan systems as one would (naively?) hope, so it doesn’t seem that Spivak’s compositional results will imply those of Baez and Pollard.

This is a bit disconcerting, but instead of it being discouraging, I believe it should be seen as an invitation to delve into the semantics of open dynamical systems and find the right perspective, which manages to subsume both of the approaches presented here.

11 Apr 13:42

A molecule that manufactures asymmetry

A molecule that manufactures asymmetry

A molecule that manufactures asymmetry, Published online: 10 April 2018; doi:10.1038/d41586-018-04256-4

Compound’s variations could spawn catalysts that favour certain chiral forms.
07 Apr 14:20

What is computational neuroscience? (XXIX) The free energy principle

by admin
Nosimpler

"Surprise me."

The free energy principle is the theory that the brain manipulates a probabilistic generative model of its sensory inputs, which it tries to optimize by either changing the model (learning) or changing the inputs (action) (Friston 2009; Friston 2010). The “free energy” is related to the error between predictions and actual inputs, or “surprise”, which the organism wants to minimize. It has a more precise mathematical formulation, but the conceptual issues I want to discuss here do not depend on it.

Thus, it can be seen as an extension of the Bayesian brain hypothesis that accounts for action in addition to perception. It shares the conceptual problems of the Bayesian brain hypothesis, namely that it focuses on statistical uncertainty, inferring variables of a model (called “causes”) when the challenge is to build and manipulate the structure of the model. It also shares issues with the predictive coding concept, namely that there is a conflation between a technical sense of “prediction” (expectation of the future signal) and a broader sense that is more ecologically relevant (if I do X, then Y will happen). In my view, these are the main issues with the free energy principle. Here I will focus on an additional issue that is specific of the free energy principle.

The specific interest of the free energy principle lies in its formulation of action. It resonates with a very important psychological theory called cognitive dissonance theory. That theory says that you try to avoid dissonance between facts and your system of beliefs, by either changing the beliefs in a small way or avoiding the facts. When there is a dissonant fact, you generally don’t throw your entire system of beliefs: rather, you alter the interpretation of the fact (think of political discourse or in fact, scientific discourse). Another strategy is to avoid the dissonant facts: for example, to read newspapers that tend to have the same opinions as yours. So there is some support in psychology for the idea that you act so as to minimize surprise.

Thus, the free energy principle acknowledges the circularity of action and perception. However, it is quite difficult to make it account for a large part of behavior. A large part of behavior is directed towards goals; for example, to get food and sex. The theory anticipates this criticism and proposes that goals are ingrained in priors. For example, you expect to have food. So, for your state to match your expectations, you need to seek food. This is the theory’s solution to the so-called “dark room problem” (Friston et al., 2012): if you want to minimize surprise, why not shut off stimulation altogether and go to the closest dark room? Solution: you are not expecting a dark room, so you are not going there in the first place.

Let us consider a concrete example to show that this solution does not work. There are two kinds of stimuli: food, and no food. I have two possible actions: to seek food, or to sit and do nothing. If I do nothing, then with 100% probability, I will see no food. If I seek food, then with, say, 20% probability, I will see food.

Let’s say this is the world in which I live. What does the free energy principle tell us? To minimize surprise, it seems clear that I should sit: I am certain to not see food. No surprise at all. The proposed solution is that you have a prior expectation to see food. So to minimize the surprise, you should put yourself into a situation where you might see food, ie to seek food. This seems to work. However, if there is any learning at all, then you will quickly observe that the probability of seeing food is actually 20%, and your expectations should be adjusted accordingly. Also, I will also observe that between two food expeditions, the probability to see food is 0%. Once this has been observed, surprise is minimal when I do not seek food. So, I die of hunger. It follows that the free energy principle does not survive Darwinian competition.

Thus, either there is no learning at all and the free energy principle is just a way of calling predefined actions “priors”; or there is learning, but then it doesn’t account for goal-directed behavior.

The idea to act so as to minimize surprise resonates with some aspects of psychology, like cognitive dissonance theory, but that does not constitute a complete theory of mind, except possibly of the depressed mind. See for example the experience of flow (as in surfing): you seek a situation that is controllable but sufficiently challenging that it engages your entire attention; in other words, you voluntarily expose yourself to a (moderate amount of) surprise; in any case certainly not a minimum amount of surprise.

07 Apr 14:13

Role of the 5-HT2A Receptor in Self- and Other-Initiated Social Interaction in Lysergic Acid Diethylamide-Induced States: A Pharmacological fMRI Study

by Preller, K. H., Schilbach, L., Pokorny, T., Flemming, J., Seifritz, E., Vollenweider, F. X.
Nosimpler

I can't wait for more of these studies.

Distortions of self-experience are critical symptoms of psychiatric disorders and have detrimental effects on social interactions. In light of the immense need for improved and targeted interventions for social impairments, it is important to better understand the neurochemical substrates of social interaction abilities. We therefore investigated the pharmacological and neural correlates of self- and other-initiated social interaction. In a double-blind, randomized, counterbalanced, crossover study 24 healthy human participants (18 males and 6 females) received either (1) placebo + placebo, (2) placebo + lysergic acid diethylamide (LSD; 100 μg, p.o.), or (3) ketanserin (40 mg, p.o.) + LSD (100 μg, p.o.) on three different occasions. Participants took part in an interactive task using eye-tracking and functional magnetic resonance imaging completing trials of self- and other-initiated joint and non-joint attention. Results demonstrate first, that LSD reduced activity in brain areas important for self-processing, but also social cognition; second, that change in brain activity was linked to subjective experience; and third, that LSD decreased the efficiency of establishing joint attention. Furthermore, LSD-induced effects were blocked by the serotonin 2A receptor (5-HT2AR) antagonist ketanserin, indicating that effects of LSD are attributable to 5-HT2AR stimulation. The current results demonstrate that activity in areas of the "social brain" can be modulated via the 5-HT2AR thereby pointing toward this system as a potential target for the treatment of social impairments associated with psychiatric disorders.

SIGNIFICANCE STATEMENT Distortions of self-representation and, potentially related to this, dysfunctional social cognition are central hallmarks of various psychiatric disorders and critically impact disease development, progression, treatment, as well as real-world functioning. However, these deficits are insufficiently targeted by current treatment approaches. The administration of lysergic acid diethylamide (LSD) in combination with functional magnetic resonance imaging and real-time eye-tracking offers the unique opportunity to study alterations in self-experience, their relation to social cognition, and the underlying neuropharmacology. Results demonstrate that LSD alters self-experience as well as basic social cognition processing in areas of the "social brain". Furthermore, these alterations are attributable to 5-HT2A receptor stimulation, thereby pinpointing toward this receptor system in the development of pharmacotherapies for sociocognitive deficits in psychiatric disorders.