Returning Time to the Reader On naYana, and why English spelling is the largest unacknowledged tax on human attention

Scope. This is naYana for English (v0.1) — specifically General American, since that's the dialect with the most reliable open phonetic data (CMUdict). Other English varieties (RP, Indian English) and other languages will ship as parallel projects sharing the same writing system; their phonetic spellings will differ because their pronunciations differ.

About this version. This essay performs its own argument. A handful of substitution rules are introduced one at a time as you read. From the moment a rule is named, every later word in the essay applies it. By the end you'll be reading naYana — five small steps, no quiz.

Hover any highlighted word to see its original spelling. The phase indicator at the top of the page tracks where in the arc you currently are.

Honest disclaimer. The substitutions you'll see were produced by hand to demonstrate what the naYana engine produces. The engine itself works — try it on the /type page. This essay is the gentle reading-side preview.

An eight-year-old girl I know writes Hindi in the Roman alphabet. She has no training in transliteration. She has no teacher correcting her spelling. What she has is an ear, and a set of letters she learned in school. The letters represent sounds, the sounds represent words, and so she writes. Her spellings are her own, and they are perfectly consistent. Mausam, paani, khushi. If you can read English letters and you know Hindi, you can read everything she writes.

A few weeks ago she asked me what I do for a living. I told her I work in philosophy. She wrote it down in her journal: filosafi.

She was, of course, wrong about the spelling. She was also, just as clearly, right about everything else. She had heard a word, identified its sounds, and assigned each sound the nearest letter she knew. This is what every literate child in the world does, except for one population. Children learning to read English do it once or twice in kindergarten and then spend the next several years learning that what they did was wrong. The word is philosophy. The reasons are historical. The teacher will not explain them, because there is no time, and because the teacher does not know them either.

This essay is about what that time costs, why every previous attempt to recover it has failed, and why we believe that, after a century of failures, a different approach is now possible.

I. The cost

English spelling is the single largest learnable inconsistency in human literacy. The numbers are not subtle.

Children learning to read in Finnish, Italian, Spanish, Korean, or Indonesian reach reading fluency in roughly one to two years. Children learning English take three times as long. This is not a difference of effort or pedagogy or intelligence; the gap holds across socioeconomic backgrounds, educational systems, and teaching methods. It tracks one variable: the regularity of the orthography — what linguists call orthographic depth.

Finnish has roughly one spelling per sound and one sound per spelling. Indonesian was rationalized in the twentieth century and is similarly regular. Spanish has small irregularities at the edges (the silent h, the b/v merger), and a child can decode 95% of words by the end of first grade. Hindi in Devanagari is fully phonetic; once a child knows the consonants and the vowel marks, every new word is readable on first sight.

English is none of these. The letter a represents at least seven distinct vowel sounds (cat, father, care, about, late, all, any). The sound /f/ is spelled at least four ways (fish, phone, enough, off). The letter sequence ough is pronounced seven different ways (though, through, tough, cough, bough, thorough, hiccough). There are no rules that resolve these. The child must memorize each word. A literate English speaker has spent years of childhood building a mental lookup table.

The cost of that lookup table is not just instructional. It is opportunity.

Consider what a Finnish child does in the two years that an English child is still learning to spell. She reads stories. She writes her own. She participates in the literate culture of her community as a full member at the age of seven. Her English counterpart, at the same age, is still being told that write and right are different words even though they sound identical, that knight contains a silent letter she has no way to predict, that island and isle contain an s that nobody pronounces because of a sixteenth-century Latin misattribution.

By age eleven, the Finnish child has read perhaps three hundred books. The English child has read fewer. By age eighteen, the gap has narrowed only because the English child has grimly memorized the lookup table. But the years of reading lost cannot be reclaimed. The cultural participation she could have had — the early entry into the conversations of literate humans, the formation of identity through text — was deferred. For some children, who could not bear the opacity of English spelling and were diagnosed as dyslexic or simply gave up, it was deferred permanently.

The numbers are catastrophic. English is the first language of about 400 million people and the second language of perhaps another 1.5 billion. If even one year of childhood is lost per child to spelling mastery — a conservative estimate — and that year contains roughly 1,000 hours of school instruction, the total annual cost across the world is on the order of two trillion hours. The dollar cost, valued at the prevailing wage of teaching time alone, runs into the hundreds of billions per year. The energy cost, valued in the electricity to light the classrooms and the food to fuel the children, is its own catastrophe, multiplied across decades.

But the cost that matters is none of these. The cost that matters is the creative time — the time a child of seven could be spending on stories, on music, on building things, on absorbing the literate culture she was born into — that she instead spends decoding the spelling history of a language that has neglected its own writing system for five hundred years.

We are asking children to pay, in childhood, for the failures of dead typesetters and pedants. The children pay. We do not even count.

II. Why every previous attempt has failed

This problem is not new. It has been seen, named, and attacked repeatedly. Each attempt has failed for reasons worth understanding, because their reasons remain instructive.

George Bernard Shaw, in his will, left a substantial sum for the design of a new English alphabet — what became the Shavian alphabet, published in 1962. Shavian was technically sound: a phonetic script with distinct, simple shapes, designed by a typographer who knew what he was doing. It was used to print exactly one book: a parallel-text edition of Androcles and the Lion. It was never adopted by anyone. It had no place to live. The technology of 1962 could not deliver a typeface to a reader without printing presses, and printing presses do not adopt new alphabets on a whim.

The Deseret alphabet of the nineteenth-century Mormon settlers was a similar effort, similarly doomed by its dependence on infrastructure that would not be repurposed. Type was cast. Books were printed. A generation of children was taught to read it. Then the church moved on, the type was melted, and the alphabet went extinct within fifty years.

The Initial Teaching Alphabet (ITA) of 1960s Britain came closer. ITA was used in primary schools to teach early reading: forty-four characters, each representing one sound. Children learned to read ITA quickly — much faster than traditional spelling — and then were transitioned to standard English. The program ran for years and was eventually abandoned. The problem turned out to be the transition. Children who learned to read ITA had a harder time, not easier, switching to standard English than children who had been taught traditionally from the start. The crutch became an obstacle. The new alphabet had not failed to teach reading; it had failed to let go.

More recent efforts — SoundSpel, Cut Spelling, SaypU, Unifon — have all foundered on the same rock: the network effect. Any new spelling exists in a world where every existing book, sign, contract, website, and trained reader is in the old spelling. To adopt the new script is to lose access to everything written before, and to be unreadable to everyone who has not adopted it. The benefit is small and individual; the cost is large and immediate. Rational individuals do not adopt. The reform dies.

There is a deeper pattern across these failures. They all asked the reader to choose between two scripts at a moment in time — the old or the new. They all required commitment before the reform proved itself. They all underestimated how much existing literacy was a sunk cost the reader had no incentive to abandon. And they all operated in a physical world of paper and type, where every reform required the cooperation of printers, publishers, schools, and governments — all conservative by structure.

These are the failures we are trying not to repeat.

III. What is different now

Four things have changed since the last serious attempt. None of them is small, and together they are decisive.

The substrate is now digital. A reader's text is no longer captured on paper. It is rendered by a font, served by a browser, or displayed by an application. The same underlying text — the same sequence of Unicode characters — can be displayed differently on different devices, for different readers, at different times, with no change to the source. A reform of the rendering layer does not require the cooperation of any printer, publisher, school, or government. It requires only that the reader install a font.

This is the change that makes everything else possible. Shaw could not have done what we can do, because Shaw lived in a world where the appearance of a text was fixed at the moment of printing. We do not live in that world. We live in a world where every text is remade, pixel by pixel, every time it is read. The remaking is under software control. Software is malleable.

The reform can be phased. No previous attempt allowed a reader to move gradually from the old script to the new, learning one substitution rule at a time, with the previous month's progress preserved and the next month's introduced only when the reader is ready. Every previous attempt asked the reader to commit fully or not at all. The phased approach inverts this: at each stage, the reader has learned a small, named, well-defined thing. After one phase, ph becomes f. After another, ck becomes k. After many phases, the reader is reading a phonetically faithful script without ever having confronted a wall of unfamiliar text.

Phase 1: drop the dead weight. Words like philosophy spell /f/ as ph because Greek scribes did. Words like know, write, lamb, thought carry letters that nobody pronounces — leftovers of word history. naYana stops writing them: ph → f, silent k/w/b/gh dropped. From here on, that's what you'll see. No new letters to learn.

This solves the cold-start problem that killed every previous reform. There is no moment at which the reader cannot read. The script the reader sees at every stage is a script the reader can already mostly read, with one small new rule applied.

The reform is reversible. A reader who has installed the font and the engine can, at any moment, turn it off. The original text is preserved underneath. The new spelling sits as a rendering of the original, not a replacement of it. This means there is no risk to the reader of being cut off from existing literature. Every book ever written remains available in its original form by toggling a switch. The reader keeps full backward compatibility with five centuries of English text.

This is the answer to the ITA problem. ITA failed because the transition back was hard. Our system has no transition back to fail at, because the original is never erased. It is one click away at all times.

The encoding is IPA. Under the surface, the script the reader eventually arrives at is not a new invention. It is the International Fonetic Alfabet — the standard used by every linguist, every language-learning textbook, every speech-recognition system, every text-to-speech engine on earth. The Unicode codepoints we use are not ours. They are the codepoints assigned by Unicode to IPA, and they have been stable for decades and will be stable for decades more.

This means three things. It means that a reader who learns naYana can also read IPA — that the script they learn is not a private language but a passport to every dictionary and language resource in the world. It means that text written in naYana is automatically compatible with speech synthesizers and language tools, because those tools already accept IPA. And it means that the script can be extended to other languages, because IPA is universal — any sound in any language is already represented in it.

The novelty in naYana is not the encoding. The encoding is borrowed from a hundred years of careful linguistic work. The novelty is the glyph design: the shapes that the IPA characters take when rendered. IPA was designed by linguists for transcription, not by educators for learning. Its shapes are scientifically precise but cognitively expensive — mirror-image pairs, near-duplicates, marks that depend on tiny distinctions. naYana redraws each IPA character with a shape chosen for learnability: distinct from every other shape, easy to write by hand, easy to recognize at small sizes, free of mirror confusion. The underlying Unicode is IPA. The visible script is naYana.

IV. What this is for

Phase 2: when c sounds like k, write k. When it sounds like s, write s. The letter c is doing two jobs in English (cat vs city) and neither of them is its own. naYana picks the letter that matches the actual sound. From here, the letter c begins to disappear from the page — replaced by whichever letter you were already saying.

We have built the first fazes of this system. They work. A child can install the font on her browser and the engine progressively transforms English text — philosophy into filosofy, cat into kat, city into sity — one named rule at a time. Each rule is small. Each is named. Each is reversible. The arc of this essay follows the same shape, compressed into five steps; by the end the child is reading a fonetic skript in which every sound has exactly one spelling and every spelling has exactly one sound. The skript she is reading is not English. It is also not a new invention. It is the actual sounds of English, written down at last with the konsistency that every other major language in the world has long enjoyed.

We do not believe this will be easy. We do not believe everyone will adopt it. We do not even believe most people will adopt it. What we believe is that the option should exist — that a child should not be required, in the twenty-first sentury, to spend five years memorizing the etymological aksidents of reseive and believe when she could have been reading stories. The choise should be hers, or her parents', or her teacher's. The kost of not offering the choise is the kontinued, unmeasured loss of trillions of hours of human childhood every year, forever, with no end and no audit.

That is the kase for naYana. It is not a manifesto. It is a proposal. The proposal is: let us build the rendering layer that makes fonetic English available to anyone who wants it, without asking them to give up English; let us see whether children take to it; and let us measure whether the time saved is real.

If it works, the saved time is the answer to a question humanity has been ignoring for five hundred years. If it does not, we will have spent some years and some money trying, which is the smallest expense we could have made on a problem this size.

V. The longer ark

Phase 3: two Greek shapes for sounds English has been faking. English writes "th" for two completely different sounds — the hiss in think and the buzz in this. naYana gives them distinct characters: θ for the hiss, ð for the buzz. Greek lent us the shapes; they sit naturally in a Latin word. Also: the letter d has been redesigned to avoid mirroring b and p. Where words are rendered fully in naYana, you'll see d drawn as a Greek capital — for instance would, design, build. From here on, words in highlight that contain "th" or "d" carry the new shapes.

The name naYana is a Sanskrit word meaning "eye" or "guidance." It echoes the Nyāya skool of Indian logik, founded on the principle ðat careful seeing precedes korrekt reasoning. The skript is meant to guide the eye, gently, from the spelling it nos to the spelling ðat matches the world.

The name is also a fonetic palindrome. Read it forward: na-Ya-na. Read it backward: na-Ya-na. The kapitalized Y in the middle is the axis of symmetry. This was not an accident. A skript ðat works for English must, in principle, also work for languages that read rite-to-left like Arabic and Hebrew, or top-to-bottom like klassikal Chinese and Japanese. The palindrome marks the skript's intent: riting is a recording of speech, and speech does not have a preferred direction. The orthography should not impose one.

We are starting with English because English is the language with the largest mismatch between its sounds and its riting — and because English, as the global second language, exports ðat mismatch to every country where it is taught. A child in Maharashtra learning English does not just learn English; she learns to spend years of her childhood on the same fossilized irregularities ðat English-speaking children spend years on. The kost is global. The fix would be global.

Phase 4: three small steps toward IPA. Short i (as in sit, this, with) becomes ɪ so it stops sharing a letter with long i (as in kind, find). The digraph sh becomes ʃ — one sound, one letter. And word-final ng becomes ŋ. Look at English: it has all three. Eŋglɪʃ.

But there is no reason the skript needs to stop at English. The IPA encoding underneath naYana is universal. Any laŋguage ever transcribed in IPA — French, Mandarin, Arabic, Yoruba, Tamil — can be rendered in naYana shapes. A child who learns naYana for Eŋglɪʃ has also, without nowɪŋ it, learned the skript for every other laŋguage she might encounter. The literacy is transferable the way no orthography in history has been.

This is the larger vision, stated as carefully as we can state it: ðat the time stolen from billions of chɪldren by inconsistent ritɪŋ systems is recoverable; ðat the recovery requires no new laws, no government adoption, no schoolboard approval — only a font, an engine, and the consent of one reader at a time; and ðat the human capacity for creative work, for cultural participation, for early entry into the literate world, can be restored to its natural age of seven instead of being deferred to fourteen.

What ðat early entry would mean, at scale, we do not know. We have never had it. Every literate adult alive today was educated under the old system. Whole generations of chɪldren, in every Eŋglɪʃ-speaking country, have spent the most plastic years of their lives on a problem we could have removed.

It is time to remove it.

Phase 5: the most common sound in English finally gets its own letter. Listen to how you actually say about, the, of, banana. The unstressed vowels all collapse to one soft, neutral sound. English hides it behind five different letters (a, e, i, o, u). naYana shows it: ə. The closing passage applies every substitution introduced so far.

naYana ɪs ən open projekt. ðə code, ðə font, and ðə engine are publɪk. ðə skrɪpt itself ɪs built on IPA and ɪs unowned. We invite contributors — typographers, linguists, educators, software developers, parents, and chɪldrən — tə use it, test it, and improve it. Particularly chɪldrən. They will tell us, faster than any study, whether what we have built ɪs worth using.

If ə child can rite filosafi and be understood, we have done nothing for her. If ə child can rite filosafi and be rite, we have given her back her childhood.

— for the gnowledge lab project, naYana for universal literacy

A note on this version. The substitutions you just lived through were introduced in five named phases — a deliberately small slice of the project's planned arc, chosen because each one shows a piece of the script's design without overwhelming a first-time reader. The naYana engine ships substantially more (full schwa coverage, the long-vowel marker ː, the ʒ sound in vision, r-coloured vowels ɝ/ɚ, the tʃ/dʒ ligatures rendered as c and j). Those are the next handful of steps. Try the engine to meet them.

You arrived here by reading. The next reader could too.

A plain-English version of this essay, with full citations, is available as a companion document. The naYana project is open and hosted at gnowledge.org/projects/naYana.