David Demitri Africa

February 19, 2026

Notes on Text has curvature

Notes on Text has curvature. (might be interesting to linguistics-y people)

  • The base intuition behind their formalism is this. If you're reading a sentence and you come to a word that's been blanked out, like
    • “The experiment _____ that the model generalized”
  • Reading from the left (the prefix) as a human does, you might predict the blank contain “confirmed, rejected, illustrated.”
  • Imagine instead you could only read from the right (the suffix). This gives you a different set of predictions, anything that precedes “that the model generalized” like “such, so, etc.”
  • The key question the paper asks is: when you combine these two sources of evidence, what happens?
  • Sometimes the left and right context agree and focus down to essentially one answer.
    • “The cat sat on the _____ and purred contentedly by the fire.”
    • Left context lets us infer: mat, rug, couch, chair, cushion... Right context lets us infer something comfortable, near a fire... probably mat or rug!
    • In this sense, meaning is tightly clustered to a few guesses, which the paper calls positive curvature.
  • Other times, the left and right context both constrain but they pull in different directions, sustaining multiple competing readings:
    • “The bank was closed, so she went to the _____ instead.”
    • Left context with “bank”: financial institution? Riverbank? “Closed” is ambiguous too. Right context: “instead” implies an alternative, but to what? Combined: multiple readings survive. In the paper's parlance, they “fan out” (sidenote: it's clear the paper uses an LLM to write, but it's fine to my eyes since it still parses well).
  • The big question is: why draw on the notion of curvature? Curvature is a highly loaded mathematical concept. But this is fine since the paper is making an analogy to actual geometric curvature, which I find clarifying.
    • If two people start at the equator and walk due north (carefully noting they walk parallel paths), they converge and meet at the North Pole. The sphere has positive curvature, meaning parallel paths come together.
    • On a saddle (such as a pringle), two parallel paths diverge instead, and you have negative curvature on a saddle.
    • On a flat plane, parallel paths neither converge nor diverge, and so you have zero curvature.
    • Following this, the crux of the formalization (which lets us make testable hypotheses, and so on) is:
      • If you think of the left-context belief and the right-context belief as two “paths” of evidence converging on a word, the way they meet tells you about the curvature of the text at that point.
  • The rest of the paper then revolves around taking a frozen LLM, and for a given token, feed it left context only and build a probability distribution over what could fill the slot, and do the same with the right context. They then employ some mathematical machinery to do this (with some stuff from optimal transport theory I don't yet understand) and derive texture, or a curvature field (or more importantly, a POINT/TOKEN/SLOTWISE notion of curvature).

← Back to notes