Handling Long Lines in Artwork in Internet-Drafts and RFCsJuniper Networkskwatsen@juniper.netHuawei Technologiesbill.wu@huawei.comJuniper Networksafarrel@juniper.netCisco Systems,
Inc.bclaise@cisco.com
Operations
NETMOD Working GroupartworksourcecodeThis document introduces a simple and yet time-proven strategy for
handling long lines in artwork in drafts using a backslash ('\')
character where line-folding has occurred. The strategy works on any
text based artwork, but is primarily intended for sample text and
formatted examples and code, rather than for graphical artwork. The
approach produces consistent results regardless of the content and
uses a per-artwork header. The strategy is both self-documenting and
enables automated reconstitution of the original artwork.sets out the requirements for
plain-text RFCs and states that each line of an RFC (and hence of
an Internet-Draft) must be limited to 72 characters followed by
the character sequence that denotes an end-of-line (EOL).Internet-Drafts and RFCs often include example text or code
fragments. In order to render the formatting of such text it is
usually presented as a figure using the "<artwork>" element in the
source XML. Many times the example text or code exceeds the 72
character line-length limit and the "xml2rfc" utility does not
attempt to wrap the content of artwork, simply issuing a warning
whenever artwork lines exceed 69 characters. According to the RFC
Editor, there is currently no convention in place for how to handle
long lines, other than advising authors to clearly indicate what
manipulation has occurred.This document introduces a simple and yet time-proven strategy for
handling long lines using a backslash ('\') character where line-
folding has occurred. The strategy works on any text based artwork,
but is primarily intended for sample text and formatted examples and
code, rather than for graphical artwork. The approach produces
consistent results regardless of the content and uses a per-artwork
header. The strategy is both self-documenting and enables automated
reconstitution of the original artwork.Note that text files are represent as lines having their first
character in column 1, and a line length of N where the last
character is in the Nth column and is immediately followed by an end
of line character sequence.The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL
NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "NOT RECOMMENDED",
"MAY", and "OPTIONAL" in this document are to be interpreted as
described in BCP 14
when, and only when, they appear in all capitals, as shown here.Automated folding of long lines is needed in order to support
draft compilations that entail a) validation of source input
files (e.g., XML, JSON, ABNF, ASN.1) and/or b) dynamic
generation of output, using a tool that doesn't observe line
lengths, that is stitched into the final document to be submitted.Generally, in order for tooling to be able to process input
files, the files must be in their original/natural state, which
may include having some long lines. Thus, these source files
need to be modified before inclusion in the document in order to
satisfy the line length limits. This modification SHOULD be
automated to reduce effort and errors resulting from manual
effort.Similarly, dynamically generated output (e.g., tree diagrams)
must also be modified, if necessary, in order for the resulting
document to satisfy the line length limits. When needed, this effort
again SHOULD be automated to reduce effort and errors
resulting from manual effort.Automated reconstitution of the original artwork is needed to
support validation of artwork extracted from documents. YANG
modules are already extracted from
Internet-Drafts and validated as part of the draft-submission
process. Additionally, there has been some discussion regarding
needing to do the same for example YANG fragments contained
within Internet-Drafts ().
Thus, it SHOULD be possible to mechanically reconstitute artwork
in order to satisfy the tooling input parsers.While the solution presented in this document will work on any
kind of text-based artwork, it is most useful on artwork that
represents sourcecode (XML, JSON, etc.) or, more generally, on
artwork that has not been laid out in two dimensions (e.g., diagrams).Fundamentally, the issue is whether the artwork remains readable
once folded. Artwork that is unpredictable is especially susceptible
to looking bad when folded; falling into this category are most
UML diagrams.It is NOT RECOMMENDED to use the solution presented in
this document on graphical artwork.The solution presented in this document works generically
for all artwork, as it only views artwork as plain text.
However, various formats sometimes have built-in mechanisms
that can be used to prevent long lines.For instance, both the `pyang` and `yanglint` utilities
have the command line option "--tree-line-length" that can
be used to indicate a desired maximum line length for when
generating tree diagrams .In another example, some source formats (e.g., YANG
) allow any quoted string to be
broken up into substrings separated by a concatenation
character (e.g., '+'), any of which can be on a different
line.In yet another example, some languages allow factoring
chunks of code into call outs, such as functions. Using
such call outs is especially helpful when in some deeply-nested
code, as they typically reset the indentation back to the first
column.As such, it is RECOMMENDED that authors do as much as
possible within the selected format to avoid long lines.Artwork that has been folded as specified by this document
MUST contain the following structure.The header is two lines long.The first line is the following 46-character string that
MAY be surrounded by any number of printable characters.
This first line cannot itself be folded.
[Note to RFC Editor: Please replace XX and XXXX with the numbers
assigned to this document and delete this note. Please make this
change in multiple places in this document.]The second line is a blank line. This line provides visual
separation for readability.The character encoding is the same as described in Section 2
of , except that, per ,
tab characters are prohibited.Lines that have a backslash ('\') occurring as the last character in
a line immediately followed by the end of line character sequence, when
the subsequent line starts with a backslash ('\') as the first non-space
(' ') character, are considered "folded".Really long lines may be folded multiple times.Determine the desired maximum line length from input. If no
value is explicitly specified, the value "69" SHOULD be used.Ensure that the desired maximum line length is not less than
the minimum header, which is 46 characters. If the desired
maximum line length is less than this minimum, exit (this artwork
can not be folded).Scan the artwork to see if any line exceeds the desired maximum.
If no line exceeds the desired maximum, exit (this artwork does not
need to be folded).Scan the artwork for horizontal tab characters. If any
horizontal tab characters appear, either resolve them to space
characters or exit, forcing the input provider to convert them
to space characters themselves first.Scan the artwork to ensure no existing lines already end with a
backslash ('\') character when the subsequent line starts with a
backslash ('\') character as the first non-space (' ') character,
as this would lead to an ambiguous result.
If such a line is found, exit (this artwork cannot be folded).For each line in the artwork, from top-to-bottom, if the line exceeds
the desired maximum, then fold the line at the desired maximum column
by 1) inserting the character backslash ('\') character at the maximum
column, 2) inserting the end of line character sequence, inserting any
number of space (' ') characters, and 4) inserting a further backslash
('\') character.The result of this previous operation is that the next line starts
with an arbitrary number of space (' ') characters, followed by a
backslash ('\') character, immediately followed by the character that
was previously in the maximum column.Continue in this manner until reaching the end of the artwork. Note
that this algorithm naturally addresses the case where the remainder
of a folded line is still longer than the desired maximum, and hence
needs to be folded again, ad infinitum.Authors may choose to fold text examples and source code by
hand to produce a document that is more pleasant for a human reader
but which can still be automatically unfolded (as described in
) to produce single lines that are
longer than the maximum document line length.For example, an author may choose to make the fold at convenient
gaps between words such that the backslash is placed in a lower
column number than the artwork's maximum column value.Additionally, an author may choose to indent the start of a
continuation line by inserting space characters before the line
continuation marker backslash character.Manual folding may also help handle the cases that cannot be
automatically folded as described in Section 6.All unfolding is assumed to be automated although a reader will
mentally perform the act of unfolding the text to understand the true
nature of the artwork or source code.Scan the beginning of the artwork for the header described in
. If the header is not present, starting
on the first line of the artwork, exit (this artwork does not
need to be unfolded).Remove the 2-line header from the artwork.For each line in the artwork, from top-to-bottom, if the line has
a backslash ('\') character immediately followed by the end of line
character sequence, and if the next line has a backslash ('\') character
as the first non-space (' ') character, then the lines can be unfolded.
Remove the first backslash ('\') character, the end of line character
sequence, any leading space (' ') characters, and the second backslash
('\') character, which will bring up the next line. Then continue to
scan each line in the artwork starting with the current line (in case
it was multiply folded).Continue in this manner until reaching the end of the artwork. introduces the vocabulary for version 3 of
the xml2rfc tool. This includes a new element, "<sourcecode>"
used to present sourcecode examples and fragments and to distinguish
them from general artwork and in particular figures and graphics.The folding and unfolding described in this document is applicable to
the "<artwork>" element in both v2 and v3 of xml2rfc, and is
equally applicable to the "<sourcecode>" element in xml2rfc v3.The following self-documenting examples illustrate a folded
document.The source artwork cannot be presented here, as it would again need
to be folded. Alas, only the result can be provided.The examples in Sections 8.1 through 8.4 were automatically folded
on column 69, the default value. Section 8.5 shows an example of
manual folding.This example illustrates a boundary condition test using
numbers for counting purposes. The input contains 5 lines,
each line one character longer than the previous.Any printable character (including ' ' and '\') can be used
as a substitute for any number, except for on the 4th row,
the trailing '9' is not allowed to be a '\' character if the
first non-space character of the next line is a '\' character,
as that would lead to an ambiguous result.This example illustrates one very long line (280 characters).Any printable character (including ' ' and '\') can be used
as a substitute for any number.This example has a '\' character in the wrapping column. The native text
includes the sequence "fish\fowl" with the '\' character occurring on the
69th column.This example has whitespace spanning the wrapping column. The native input
contains 15 space (' ') characters between "like" and "white".This example was manually wrapped to cause the folding to occur
after each term, putting each term on its own line. Indentation
is used to additionally improve readability. Also note that the
mandatory header is surrounded by different printable characters
than shown in the other examples.The manual folding produces a more readable result than the following
equivalent folding that contains no indentation.This BCP has no Security Considerations.This BCP has no IANA Considerations.[yang-doctors] automating yang doctor reviewsThis non-normative appendix section includes a shell script
that can both fold and unfold artwork.The authors thank the following folks for their various
contributions (sorted by first name):
Jonathan Hansford, Joel Jaeggli, Lou Berger, Martin Bjorklund,
Italo Busi, and Rob Wilton.The authors additionally thank the RFC Editor, for confirming
that there is no set convention today for handling long lines in
artwork.