Defining brivla without using lujvo rules

From Lojban
Jump to navigation Jump to search

Most descriptions of brivla morphology rely on lujvo construction to show behaviour common to all brivla and leave the effect on non-lujvo unexplained. Of course, the morphology was planned around the existence of lujvo, so this is natural, but it also seems to lead to speakers producing badly formed brivla. This page describes brivla (or a superset of them, anyway) in a way that introduces lujvo concepts at the last possible moment. Whether this has better results is to be seen.

Background knowledge

To parse brivla one needs to be able to parse syllables and cmavo.

A syllable consists of an onset, a nucleus, and optionally a coda.

  1. The onset is a single consonant, a glide (ĭ or ŭ), or an initial consonant cluster. A syllable at the beginning of a word can also have an empty onset, and a syllable inside a word can also have an h (') as its onset.
  2. The nucleus is one of the ten vowels and diphthongs.
  3. A syllable can have a coda of one consonant, and only when this consonant plus the onset of the next syllable do not together form a valid onset.
.a spa pan blaif stra ŭo 'u 'ei 'am by .y gry zbly
syllables
*girt *zra *t'u *fĭe *stsa
not syllables

Note the dot before the syllables beginning with vowels. Such syllables must be separated from ones before them with a glottal stop or a longer pause.

A cmavo consists of a syllable beginning with a single consonant, glide, or vowel, and without a coda, followed optionally by any number of syllables beginning with an h and without a coda.

.a ba bai ba'i ba'ai by by'i ĭa ĭai ĭy ŭa'ai'y
cmavo
*fas *zdi *'ai *sfu'a
not cmavo

When the parser can't extend the cmavo currently being parsed, it starts parsing the next one. Sequences of cmavo spoken or written together are split unambiguously before each consonant.

.iŭa fa'emu bysai
sequences of cmavo

Simple brivla

Let a brivla be a sequence of syllables that

  • begins with anything other than h,
  • contains no y,
  • ends with a vowel, and
  • has stress (marked here with a gràve àccent) on its second-last syllable, and nowhere else.

At the start of a text, the parser tries to detect a cmavo or a brivla, in that order. If, at a position, a cmavo is detected, no brivla will be detected there, and therefore no brivla is present there. This isn't a quirk of the parser, it is part of the definition of the language.

Here is an example of brivla (red) interspersed with cmavo (blue).

brivla-cmavo-path.svg

Notice how a brivla is opened at the beginning of every syllable not permitted in a cmavo, and closed at the end of the syllable after the first stressed syllable inside, making way for the next word to be parsed. What goes between these two important syllables (if they aren't one and the same) may well look like a cmavo; as long it carries no stress, it has no effect on the extent of the brivla.

What about strings like fàcki, which are, so we hear, words? is a cmavo, and *cki is a non-word, so shouldn't fàcki have no right to exist?

To allow words like fàcki, càudri, and .ìglu, the parser, after it finishes reading a cmavo, checks whether the string after that begins with a valid word.[1] If the check fails, there is by definition no cmavo at that point in the text, and the string beginning with that would-be cmavo potentially begins with a brivla. Words like fàcki do satisfy the brivla rule, and that together with the invalidity of as a cmavo before *cki means that fàcki is a brivla.

Medial-only syllables in brivla

Note the "contains no y" line in the definition above, which excludes words like fàtnygau. Syllables with y are in fact allowed in brivla, but with a more restricted distribution: they may not be at the beginning or end of a brivla. Also, they are always unstressed, and the penultimate stress rule ignores them when looking for the end of the brivla after encountering its stressed syllable.

durkyrvòre[2] sàpyvla murgentàli fàckytoi kricystàli
brivla
tilìste ŭasàpyvla bybrìvla
cmavo followed by brivla
krìcystali fàckytoixu
brivla followed by cmavo
*brivlàby *byrbrìvla *ticỳsku
non-words

Forbidden brivla

The grammar was built around the existence of lujvo, brivla made from components of certain shapes. Brivla space is warped in favour of lujvo: some shapes are declared non-words, in the style of *cki above, so that some strings that would usually fall apart into a cmavo plus a brivla parse as single brivla instead. For example, *zmà'i is forbidden by this rule so that sozmà'i can be a single word. Such forbidden words are called slinku'i.

The main slinku'i rule checks would-be brivla up to the first y, or to the end of the word if no y is present. If this string consists of a consonant followed by a rafsi string, and the whole string with the added consonant is not a rafsi string itself, there is no brivla present at this position. A rafsi string is any number of the patterns below, arranged in any order.

  • C is any consonant
  • cc is a consonant pair allowed in onsets
  • V is any of a e i o u
  • VV is any of ai ei oi au
  • y is y, followed optionally by h
  • R is r, unless it is followed by r, in which case it is n
  • brackets mark optional letters
  • a grave accent means mandatory stress
rafsi-string components
CVC[y] ccV[hy] CVV[hy] CVVR CVhV[hy] CVhVR
CVCCy CVCCVhy CV̀CCV ccVCy ccVCVhy ccV̀CV
skùdji kùdji mà'i barda'y
rafsi strings
zmà'i jbardà'y
slinku'i
skùdji
brivla despite slinku'i rule
pajbardà'ybla
brivla thanks to slinku'i rule

There is also a secondary slinku'i rule: a syllable starting with a consonant cluster followed by the syllable 'y may not start a brivla.

sprài'yvau
slinku'i
pasprài'yvau
brivla thanks to slinku'i rule
sprài'yrvau
brivla; not affected since second syllable is 'yr

CVCy rule

There is one last rule made to accommodate lujvo in brivla space. A CV syllable followed by a Cy syllable followed by a brivla or rafsi string may not start a cmavo.

fèsyvau rodyĭòlpe
brivla thanks to CVCy rule

Since pauses always separate words, adding them between these syllables stops this rule from applying.

fè.syvau fèsy.vau
two ways to pronounce this string of three cmavo

More rules?

The rules on this page should be enough to segment any text into cmavo and brivla, but the official brivla space is a subset of the one described here. More on this can be found on the pages linked on top.

Notes

  1. In principle this would mean that each attempt to parse a word would require a trip to the end of the text, but in practice these searches are shorter, ending at the next pause (in writing, the next space) or sequence of two syllables where the first one contains no y and where the second of them can start a brivla, whichever comes first.
  2. Also spelled durkrvòre – the way to syllabify that won't be explained here.