User:AugPi/Lojban

Digraphs

 * dj : see
 * tc : see

Digits

 * 0 : no
 * 1 : pa
 * 2 : re
 * 3 : ci
 * 4 : vo
 * 5 : mu
 * 6 : xa
 * 7 : ze
 * 8 : bi
 * 9 : so

Gismu acting as either selbri or sumti
Examples:
 * pilno
 * bloti
 * barda ("sumti modifier", corresponding to English adjective)
 * sutra ("selbri modifier", corresponding to English adverb)

Grammatical terms

 * selbri : predicate
 * sumti : argument
 * bridi : predication (predicate + argument(s))
 * cmavo : structural word
 * brivla : content word
 * morphologically there are three kinds of these: gismu, lujvo, and fu'ivla.
 * functionally, these act in either of two ways: as selbri or as sumti (this latter case only when modified by certain gadri).
 * tanru : a compound brivla
 * seltau : first component of a tanru (cf. LISP's )
 * tertau : second component of a tanru (cf. LISP's )
 * lujvo : a "fossilized" tanru (made up of rafsi)
 * fu'ivla : a word borrowed from another language, prepended with a semantic disambiguation tag
 * cmene : a "name", i.e., proper noun
 * gismu : a root (content) word
 * rafsi : an abbreviated form of a gismu (useful especially for forming lujvo)
 * place structure (of a selbri) : the selbri's definition (in terms of parameters x1, x2, etc.); Lojban's version of.
 * Place structure inheritance: e.g., tumxra inherits its place structure from its component pixra.
 * But there may be "pruning": e.g., trutca from tcadu.
 * But... zgipli inherits sumti slots from both of its components. Each sumti slot of a lujvo would be inherited from (and equated to) some sumti slot of either one or the other one of its components.
 * observative : a bridi without an sumti (or is it just a bridi without any sumti?)
 * gadri : articles
 * conversion : swapping of a selbri's first sumti with another one of that selbri's sumti.
 * performed by modifying a selbri with a cmavo of selma'o SE.
 * sumti tcita : preposition (for an extra, "labeled sumti", place in a selbri's place structure); the preposition is the "modal" form of some gismu which determines the preposition's meaning. (example: fi'e)
 * if the labeled sumti is an internal sumti, then the sumti tcita is preceded by.
 * if the internal sumti modifies a cmene then use instead of be.
 * internal sumti : ≈ relative clause prepositional phrase
 * begun with.
 * MEX : mathematical expression
 * abstraction : a kind of subordinate clause
 * begun with cmavo of selma'o NU.
 * tend to be sumtified by being modified with a gadri.
 * relative clauses are begun with cmavo of selma'o NOI and are not abstractions.
 * abstractor : transforms a selbri (right before it becomes sumtified)
 * rough analogies: Laplace transform, wavelet transform (DHWT, ...), ...
 * descriptor : a gadri
 * description : a noun phrase

Investiganda

 * What part of speech are the compound numbers?
 * "Compound cmavo" in &sect;3.3 of lojban_guide.pdf.
 * Cmavo clusters (according to vlasisku).
 * "Compound cmavo" in LRG &sect;6.10.
 * What is a selma'o exactly? (en.wikt's definition is mystifying)
 * http://www.lojban.org/publications/wordlists/cmavo_selmaho_order.txt

Tanru vs. lujvo
Rough idea:
 * tanru : two (or more) brivla written separately but acting as a single sumti or a single selbri
 * lujvo : two (or more?) rafsi fused into a single brivla

Lujvo cmene

 * Would ritygu'e be a lujvo or a cmene (or both)?
 * If it is a lujvo, then what would be its place structure (cmene don't have place structure): would only be fillable with brito or could any arbitrary subset of brito also do (because if it is fixed to brito then that would obviate the need for ).
 * Perhaps more to the point, wouldn't the sumti of ritygu'e have to be ritygu'e itself?  Also, in, ritygu'e is modified with , which should modify cmene, so it looks like ritygu'e should be classified as cmene even though it is morphologically more of a lujvo (e.g., it does not end in consonant and full stop like cmene usually do, but ends with vowel, rather anomalously): that is, when form (morphology) clashes with function (part of speech), go with the function; as in the adage «».
 * See http://jbovlaste.lojban.org/dict/ritygu%27e : according to its definition, sumti and those following it are obviated,  remains.  Someone in ritygu'e could spread the arms and say: "Ti ritygu'e" and it would make sense, especially if addressing visitors.
 * http://www.mail-archive.com/lojban-beginners@lojban.org/msg05849.html (tijlan.) recommends the label "Lujvo cmene": ritygu'e would be (strictly speaking) a lujvo which becomes a cmene only when modified with la, but then again, cmene are only cmene when modified with la (otherwise they are just cmevla, but how would a non-cmene cmevla actually function in a Lojban sentence?). Anyway, since there are already POS headers such as "Proper noun" and "Compound cmavo" being used, then there could be a "Lujvo cmene" POS header and category as well...

sumti tcita

 * ri'a is a sumti tcita so should that mean that it should act more like a preposition ("because of") rather than a conjunction ("because")?
 * If acting (apparently) as conjunction it is probably followed immediately by gadri + abstractor, such as lenu, so the actual conjunction would be, say, ri'alenu, rather than just ri'a.
 * because = ki'ulenu/ri'alenu/&c./ki'ulonu/ri'alonu/&c.; because of = ki'u/ri'a/&c.

Lojban ELE
The following POS headers are currently admissible:
 * 1) Cmavo
 * 2) * Use this also for compound cmavo, though the category for these would be Category:Lojban compound cmavo instead of Category:Lojban cmavo.
 * 3) Gismu
 * 4) * This is a closed category, and WT already appears to have entries for all of Lojban's gismu (1342 of them).
 * 5) Rafsi
 * 6) Brivla
 * 7) * use this for lujvo and fu'ivla, categorized under Category:Lojban lujvo and Category:Lojban fu'ivla respectively. Do not use this for gismu: for gismu the POS header "Gismu" is used instead.
 * 8) Proper noun
 * 9) * use this for all three kinds of cmene: cmevla cmene, lujvo cmene, and fu'ivla cmene; which go in categories Category:Lojban cmene, Category:Lojban lujvo cmene, and Category:Lojban fu'ivla cmene, respectively.

Tutorials

 * Lojban for Beginners by Robin Turner & Nick Nicholas

Grammars

 * Lojban Reference Grammar by The Logical Language Group
 * Parsing Expression Grammar (PEG) for Lojban

Gismu

 * Xah Lee's list of gismu place structures

Rafsi

 * Lojban/Rafsi

Cmavo

 * cmavo/selma'o list (ordered by cmavo) which the LLG calls ma'oste

Selma'o

 * cmavo/selma'o list (ordered by selma'o) <tt>po la lojbangirz.</tt>
 * An asterisk suffixed to a selma'o's name means that the cmavo is compound.

Online Parser/Translator (<tt>jboski</tt>)

 * Lojban-To-English Translation

Another one

 * <tt>genrei.lojban.org</tt>

Offline Parser (runs on Terminal)
How to set up and run:
 * Official Parser
 * http://home.ccil.org/~cowan/parser-3.0.00.tar.gz
 * move uncompressed folder to Applications.
 * start Terminal and <tt>cd</tt> on over to <tt>Applications/parser-3.0.00</tt> folder.
 * type <tt>cc -o parser *.c</tt> and press Enter.
 * https://github.com/kennethreitz/osx-gcc-installer/
 * Go to the <tt>Applications/parser-3.0.00</tt> folder through Finder.
 * Double click on <tt>parser</tt>.
 * At the <tt>>>></tt> prompt, type in a Lojban sentence to be parsed and press Enter.
 * Examine the result, then repeat.
 * When/if done, close the Terminal window.

Offline Parser (runs on a web browser, through JavaScript)
How to set up and run:
 * Go to http://mhagiwara.github.io/camxes.js/
 * Save the web page as an HTML file.
 * By the above action, a new folder should also be automatically created near that new HTML file, which contains the file <tt>camxes.js</tt>
 * Open that HTML file in a web browser (that can run JavaScript).
 * Type a lojbanic expression inside the Demo textbox. (Its parsing should end up being shown under it, if it has one; otherwise it should show an error message.)

Texts

 * fu lu le cmalu noltru li'u pu fanva fa la xorxes.

Lojban formal grammar
!  ' |\t' ;
 * http://dag.github.com/cll/21/1/ (Lojban YACC grammar on github)
 * http://jscc.jmksf.com/ (JS/CC)
 * http://jscc.jmksf.com/jscc/jscc.html (JS/CC Web Environment)
 * How to use: remove comments from the Lojban YACC grammar; replace %% with ##; add definitions above the ## for terminals and non-terminals (similarly to the "non-associative" part of the example); make sure to include
 * at the top: it defines whitespace! Do not use any whitespace in any regexes (in the lex part) unless they are actually part of the Lojban terms.  Replace the <tt>error</tt> 's in the "gaps" with nothing (∅, &epsilon;, i.e., whitespace).  Paste code into the big text box in the middle, click on Build (the window blows up widthwise), then click on Run.  That should implement Step 6 of the parser.  (Lexical tokens, if any, should be added manually, or through some pre-parsing code.).  A neat parse tree should be generated on the bottom right corner as well as action and goto tables for the LALR(1) parser (consisting of 894 states!) on the under-middle left side.