Conlangs with over 10,000 words

From FrathWiki
Jump to: navigation, search

Because scrappers and sandboxers create and scrap so many conlangs, the vast majority of conlangs have a lexicon that doesn't get very large by the time they're scrapped. In fact, many conlangs have about 7 to 10 words by the time they're scrapped, and some never have a single word of lexicon created.

In this forest of creosote bushes, however, there are some sequoias that have astoundingly large lexica, such as Talossan, Arka, or Spocanian. Occasionally, a conlang will become known mainly for its large lexicon size (as happened with Arka).

Here is a list of the largest of the large -- conlangs that have at least ten thousand words in their lexicon.

Excluded are:

  1. Superset and subset languages.
  2. Conlangs that consist of making only a few predictable changes to an existing language, such as Muphridian. Note that this includes all language games such as Pig Latin and Adibikicyan, as well as conlangs that are ciphers in the truest sense of the word (e.g. the No Man's Sky conlangs, or a conlang in which every A in the English word becomes an I, every B in the English word becomes a T, every C in the English word becomes an L, every D in the English word becomes an X, every E in the English word becomes a U, and so on). Conlangs that are relexes, but not cryptographic ciphers of natlang words, such as Froogleyboy's Aveata, although they may be frowned upon in the conlanging community, are eligible for this list, as are conlangs in which someone relexed a list like WordNet, or the Landau Core Vocabulary, or the Ethnologue lexicon questionnaire.
  3. Conlangs whose lexicon consists of an unedited computer-generated data dump, such as Nunihongo. (Computer-generated conlangs whose lexicon has been cleaned up, such as Classical Yiklamu, are eligible for this list).
  4. Computer-generated material for making new words for a conlang, such as that made by Larry Rogers of Michigan (BA Linguistics, Michigan State University, 2009) for his expansion of Marc Okrand's Atlantean language. This was not intended as a conlang dictionary but merely as material which could be used to make new words, computer-generated words approximating the phonology of the original words.

Although James Landau, who has researched this list, has done quite an extensive search, there are no doubt at least a few conlangs that belong on this list that he's missed out on. Anyone is free to add them here.

Finally, it should be clarified that this list counts only how many words the creator or creators have actually created in the language, not the number of words that the creator says exist in-universe.

The list

  • about 10,000 Lingwa de planeta (Lidepla) - Dmitri Ivanov, A. Lysenko and others; worldlang based on 10 languages
  • 10,000 Farlingo - Vladimir Farber & Matvei Farber; auxlang based on SAE languages, Russian, Hebrew and Esperanto
  • 10,000 Faudanian - Josh Hien; a posteriori personal language
  • 10,000 Fith - Jeffrey Henning; alien language with "last in, first out" structure
  • 10,000 Lugasuese - Jurre Lagerwaard; spoken on the planet Aranii, based on Germanic and Tolkienian languages
  • 10,000 Unish - Language Research Institute, Sejong University; auxlang based on 16 languages from many phyla
  • more than 10,000 Loglan - James Cooke Brown; logical language with a posteriori lexicon
  • 10,910 Town Speech/Urban Basanawa - k1234567890y; West Germanic artlang (its Germanic nature obscured by writing system and large amount of Sino-Xenic vocabulary)
  • 11,000 LANGUST - Grigoriy Korolev; auxlang based on 45 "atoms", aUI-style
  • over 11,000 Uropi - Joël Landais; zonal language for Europe based on Indo-European roots
  • 11,111 Rireinutire - Prettydragoon; spoken on the planet Rireinu
  • 11,200 Paolanté - B. Christopher Suchsland-Gutiérrez; fictional Romance language
  • 11,500 Interslavic - Juraj Križanic; Slavic zonal language
  • about 12,000 Rodinian - Rodiniye; worldlang repurposed as an artlang
  • 12,000 HOOM - Rood Hume; intuitive a priori auxlang for the universe
  • 12,000 Tundrian - Gábor Sándi; Romance language spoken in the fictional country of Tundria
  • 12,130 Géarthnuns - Douglas Koller; a priori language spoken on a fictional island in the Sea of Japan
  • 12,363 Mango - Natalia Gruscha; spoken by the Tiger People of the planet Pii, based on Indo-Aryan languages
  • 13,000 Mila - Gary Taylor-Raebel; set in an alien colony, invented as a fauxlang but has since evolved naturally
  • 13,552 Tomato - Catty; innovative a priori auxlang
  • 14,000 Europeano - Jay Bowks; Euroclone
  • 14,000 Ido - Louis de Beaufront, Louis Couturat; Esperanto spinoff
  • 14,750 Kavrinian - Ultimate Ridley et al.; collaborative language spoken in Lhavres on the planet Sahar; most words created by others
  • 14,787 Minhyan - Jeffrey Henning; VSO fictional diachronic language with a priori vocabulary
  • 15,000 Chaldon-Siberian - Yaroslav Zolotaryov; purification of Slavic languages without Old Church Slavonic borrowings
  • around 15,000 Deyryck - Threr; fictional language spoken in the multiverse of Alaaban
  • 16,000 Otg - Spencer Spurgeon; quirky fictional diachronic language inspired by Celtic and Turkish
  • 16,627 Arka - Seren Arbazard; spoken on the planet Kaldia
  • over 17,000 Sambahsa-Mundialect - Dr. Olivier Simon; worldlang based on PIE, Arabic, Chinese, Indonesian, Swahili and Turkish
  • more than 17,000 Kotava - Staren Fetcey; a priori auxlang
  • about 18,000 Interslavic - Jan van Steenbergen et al.; Slavic zonal language based on Old Church Slavonic, previously known as Slovianski
  • 20,000 Aixosixomi - Alexander M. Koch; language of a fictional hunter-gatherer people of Earth
  • 20,000 Sydvetlish - Matheus Filipe da Silva Leal; Germanic auxlang
  • over 20,000 Lingua Franca Nova - George C. Boeree; Romance auxlang with creole grammar
  • 20,680 Noxilo - Sentaro Mizuta; worldlang that allows for many different word orders
  • around 21,000 Celinese - Andy Ayres; macrolanguage spoken on the planet Lorech
  • 21,100 Basha Humrayan - MissTerry; a conlang and autocryptolect 90% based on the Sanskrit spectrum (ancient Vedic to Classical and "Modern") with very regular Esperanto style grammar
  • 21,100 Itlani - James E. Hopkins; spoken on the planet Itlán
  • 21,715 Lojban - The Logical Language Group; logical language, a reform of Loglan
  • 22,956 Pantakakiano - Javier Valladolid Antoranz; used in the novel El sueño en verso
  • over 25,000 Spocanian - Rolandt Tweehuysen; a priori fictional language from the Atlantic phylum
  • 25,234 FeNeKeRe - Jonathan Sodt; a priori fauxlang of Earth's Dragon People of the arts, with millions of possible names
  • 26,339 Blissymbolics - Charles K. Bliss; pasigraphy
  • 26,352 Sermo - Jose Soares Da Silva; Euroclone
  • over 35,000 Talossan - R. Ben Madison; micronational language, Romance but lacking a consistent derivation from Latin
  • 39,765 Nuu - Thomas Keyes; a priori engelang/artlang spoken on Ung
  • 44,000 Slovio - Mark Hucko; Slavic zonal language
  • 46,950 EDA/Edanic - Arne Arotnow; Euroclone based on Italian
  • 51,831 Vedanic - Arne Arotnow; reform of Edanic
  • over 60,000 Interlingua - International Auxiliary Language Association; Euroclone
  • some 75,000 Neo - Arturo Alfandari; Euroclone
  • 77,000 Esperanto - Ludwig Lazarus Zamenhof; auxlang and the world's most successful conlang
  • over 83,000 xuxuxi - John Cowan; based on the same principles as Classical Yiklamu
  • 86,000 Kankonian - James Landau; spoken on the planet Kankonia in the Lehola Galaxy
  • 91,591 Classical Yiklamu - Mark P. Line; engelang with computer-generated vocabulary based on WordNet and no derivation
  • 100,000 M̄av̄ī - Imjustadudeontheinternet; language created with the sole purpose of having 100,000 words. Relex of Toki Pona with a huge number of synonyms for each word.