informal description of the PEG morphology algorithm: Difference between revisions

From Lojban
Jump to navigation Jump to search
No edit summary
No edit summary
 
Line 6: Line 6:
# Extract all [[Morphology: special characters|pauses]], if there are any.  
# Extract all [[Morphology: special characters|pauses]], if there are any.  
# Extract a [[Morphology: words|word]] and then any following [[Morphology: special characters|pauses]], if there are any.
# Extract a [[Morphology: words|word]] and then any following [[Morphology: special characters|pauses]], if there are any.
# Repeat 2 until there is nothing left.
# Repeat 2 until there is nothing left.<noinclude>
<noinclude>
See also [[Morphology: Rules of formation]]
See also [[Morphology: Rules of formation]]
=Sections=
=Sections=

Latest revision as of 12:43, 17 May 2017

This is a description of the formal PEG morphology algorithm. If there are or appear to be discrepancies between this description and the formal grammar, the formal grammar wins.

Given any string of characters, this algorithm will split it uniquely into words, including possibly some non-lojban words.

We examine the string from the left, and words are extracted one by one from the left.

  1. Extract all pauses, if there are any.
  2. Extract a word and then any following pauses, if there are any.
  3. Repeat 2 until there is nothing left.

See also Morphology: Rules of formation

Sections