informal description of the PEG morphology algorithm
Latest revision as of 12:43, 17 May 2017
This is an informal description of the formal PEG morphology algorithm. If there are, or appear to be, discrepancies between this description and the formal grammar, the formal grammar wins.
Given any string of characters, this algorithm splits it uniquely into words, possibly including some non-Lojban words.
The string is examined from the left, and words are extracted one at a time, starting at the left.
1. Extract all pauses, if there are any.
2. Extract a word and then any following pauses, if there are any.
3. Repeat step 2 until there is nothing left.
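The loop above can be sketched in Python. This is only an illustration of the control flow, not the formal grammar. The names `PAUSE`, `WORD` and `split_words` are hypothetical. The sketch assumes that a pause is a run of whitespace or "." characters, and it uses a placeholder word rule (the longest run of non-pause characters); the word rule in the formal grammar is considerably more involved.

```python
import re

# Assumption: a pause is a run of whitespace or "." characters.
# The formal grammar defines pauses more precisely.
PAUSE = re.compile(r"[\s.]+")
# Placeholder word rule (NOT the formal one): the longest run of
# non-pause characters.
WORD = re.compile(r"[^\s.]+")


def split_words(text):
    """Split text into words, working from the left."""
    words = []
    pos = 0
    # Step 1: extract all leading pauses, if there are any.
    m = PAUSE.match(text, pos)
    if m:
        pos = m.end()
    # Step 3: repeat step 2 until there is nothing left.
    while pos < len(text):
        # Step 2: extract a word...
        m = WORD.match(text, pos)
        words.append(m.group())
        pos = m.end()
        # ...and then any following pauses, if there are any.
        m = PAUSE.match(text, pos)
        if m:
            pos = m.end()
    return words
```

Because every pause is consumed before the next word is attempted, the placeholder word rule always finds a match at the current position, so the loop terminates on any input string.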
See also Morphology: Rules of formation