kicad-developers team mailing list archive

Re: Re: We should decide a quoting convention...

To: kicad-devel@xxxxxxxxxxxxxxx
From: Dick Hollenbeck <dick@...>
Date: Tue, 22 Dec 2009 22:14:54 -0600
In-reply-to: <hgrgh8+ml03@eGroups.com>
Scanners: none
User-agent: Thunderbird 2.0.0.23 (X11/20090817)

Re: DSNLEXER

OK, I spent the last several hours improving DSNLEXER.

My last posting stands. Keywords must be ASCII, not UTF8. These are tokens which are known ahead of time, so there's no reason to stray from ASCII, and it is silly trying to sort UTF8 strings. Keywords are sorted ahead of time. UTF8 must be quoted and is where you would put app specific identifiers.
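To illustrate why a pre-sorted ASCII keyword table is convenient, here is a minimal sketch of a binary-search keyword lookup. It is not the DSNLEXER code; the keyword list and the function name find_keyword are made up for the example.

```python
from bisect import bisect_left

# Hypothetical keyword table (not the real DSNLEXER keywords).
# Keywords are plain ASCII and sorted once, ahead of time.
KEYWORDS = sorted(["footprint", "layer", "pad", "pin", "text"])


def find_keyword(token: str) -> int:
    """Return the index of token in KEYWORDS, or -1 if it is not a keyword."""
    # Anything outside ASCII cannot be a keyword, so reject it early.
    if not token.isascii():
        return -1
    # Binary search in the pre-sorted table.
    i = bisect_left(KEYWORDS, token)
    if i < len(KEYWORDS) and KEYWORDS[i] == token:
        return i
    return -1
```

With this table, find_keyword("pad") returns 2, while an unknown or non-ASCII token returns -1 and would be treated as an ordinary symbol or quoted string.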

As far as my A) is concerned, DSNLEXER now handles quoted strings properly, you simply have to double up the quote character if it is embedded in a quoted string. The quote_char is dynamically assignable per the Specctra DSN spec to one of the 3 chars: ' " or $, but this is extraneous information.
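The doubled-quote rule can be sketched roughly as follows. This is an illustration only, not the DSNLEXER implementation; the function name read_quoted and its signature are assumptions made for the example.

```python
def read_quoted(text: str, start: int, quote_char: str = '"'):
    """Read a quoted string that begins at text[start].

    Returns (value, next_index), where value has the doubled-up quote
    characters collapsed to single ones and next_index points just past
    the closing quote.
    """
    if text[start] != quote_char:
        raise ValueError("expected opening quote character")
    out = []
    i = start + 1
    while i < len(text):
        ch = text[i]
        if ch == quote_char:
            # Two quote characters in a row mean one literal quote character.
            if i + 1 < len(text) and text[i + 1] == quote_char:
                out.append(quote_char)
                i += 2
                continue
            # A lone quote character closes the string.
            return "".join(out), i + 1
        out.append(ch)
        i += 1
    raise ValueError("unterminated quoted string")
```

For example, read_quoted('"say ""hi"" now" rest', 0) yields the value say "hi" now, and passing quote_char="$" reads $a$$b$ as a$b.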

DSNLEXER also handles multi-line strings, and these must be quoted. (This is a multi-line string from the perspective of the lexer, who will return it as a single token.)

You could still run a single line string through the lexer that contains some other newline delimiter such as \n, but this is of no concern to the lexer.

So the grammar designer can handle newlines any way he/she wants to, either as:


1) true newlines embedded in a quoted string or
2) as \n or \x9, or
3) %9

4) other, say embedding a number of newline-less quoted strings in a () element.

The DSNLEXER does not care, and therefore neither do I at this point. We are now covered, and I am quite hopeful that I did not break the compatibility of DSNLEXER with the Specctra DSN syntax. We now support a super-set of the original syntax, in that DSNLEXER strips out the doubled-up quotes for the parser, handles multi-lines in quoted strings, and can optionally return comments to the parser. A comment string is a line of text in the stream whose first non-blank character is a #. The entire line of text is returned as the token, so that you can preserve leading whitespace should you want to preserve these comments in some later output that is respectful of indentation.
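A rough sketch of the comment rule, for illustration only. The function names are hypothetical, and the sketch works line by line, so it ignores the case of a # line that sits inside a multi-line quoted string, which a real lexer would have to treat as string content.

```python
def is_comment_line(line: str) -> bool:
    """True if the first non-blank character of the line is '#'."""
    return line.lstrip(" \t").startswith("#")


def comment_tokens(text: str):
    """Return every comment line as one token, leading whitespace intact."""
    return [line for line in text.splitlines() if is_comment_line(line)]
```

Because the whole line is kept, a token such as "  # note" still carries its two leading spaces and can be written back out at the same indentation.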

This is now a fairly solid platform to start designing grammars on top of. The syntax issues I think are good enough for anything that I can foresee. (floats are OK, as long as they do not contain an exponent. You want exponents, then a little more work is required.)
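As a rough illustration of "floats without an exponent", the check below accepts plain decimal numbers and rejects anything in exponent form. The exact numeric syntax DSNLEXER accepts is not spelled out here, so the pattern is an assumption, and is_plain_float is a made-up name.

```python
import re

# Assumed shape of an exponent-free number: optional sign, then digits
# with an optional fractional part, or a bare fractional part.
PLAIN_FLOAT = re.compile(r"[+-]?(\d+\.?\d*|\.\d+)")


def is_plain_float(token: str) -> bool:
    """True for tokens like 12, -3.5 or .25; False for 1e-3 or 2.5E6."""
    return PLAIN_FLOAT.fullmatch(token) is not None
```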

I will be adding some more support than what already exists for generating quoted strings, so that the doubled-up quoting is done for you in OUTPUTFORMATTER.
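On the output side, the doubling could look something like the sketch below. This is not the OUTPUTFORMATTER API; quote_for_output is a hypothetical helper that only shows the transformation.

```python
def quote_for_output(value: str, quote_char: str = '"') -> str:
    """Wrap value in quote_char, doubling any embedded quote_char."""
    doubled = value.replace(quote_char, quote_char * 2)
    return quote_char + doubled + quote_char
```

Writing say "hi" through this helper produces "say ""hi""", which a lexer following the doubled-quote rule reads back as the original text.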


-------------

So I would like to shift the discussion to grammar, and away from this syntax churn / noise. When I say grammar, I am speaking of a sequence of tokens & keywords which are returned from the DSNLEXER. Think higher level now.

If anybody has some improvements on my footprint file grammar first draft, then these need to be offered up by the time I get around to coding it this spring, April or so. The footprint file grammar is flexible, the syntax is probably firmed up IMO.


If you want to contribute, go back and find my (footprint ... ) example.


Dick


PS

I'll commit the DSNLEXER changes in the morning.

Follow ups

Re: Re: We should decide a quoting convention...
From: jean-pierre.charras@..., 2009-12-23
Re: We should decide a quoting convention...
From: Lorenzo, 2009-12-23

References

Re: We should decide a quoting convention...
From: Lorenzo, 2009-12-22