Re: Draft JEP for upcoming work on snippets

Jonathan Gibbons Mon, 08 Feb 2021 08:54:07 -0800

Dmitry,

Some answers inline.


-- Jon

On 2/6/21 7:56 AM, Dmitry Timofeev wrote:

Hi Pavel,

Thanks for your response! Inline:


On Tue, 2 Feb 2021 at 20:18, Pavel Rappo <[email protected]> wrote:

Resending my recent email in plain text; apologies for sending it previously in 
rich text.

***

Dmitry,

8201533 is a Draft JEP for a feature that is currently under active 
development. As such, that draft is prone to inconsistencies, inaccuracies, 
typos, etc. As we further develop the feature and receive feedback on it, that 
draft will improve and eventually reach the high bar of JEP Candidate.

Thank you for the clarification! I discovered it on r/java, and saw it
was in a very good shape already. BTW, big thanks to your team members
who reach out to users through less formal channels than mailing lists
— otherwise, many aren't likely to even know of most in progress
features.

Thank you for the feedback; detailed replies are inline.

On 29 Jan 2021, at 06:52, Dmitry Timofeev <[email protected]> wrote:

Hi everyone,

Happy to see a JEP improving support for code snippets! I’ve got some feedback 
from the user perspective, and from some prototyping to support Javadoc snippet 
compilation:

hide = regex —

Would it be possible to have a simpler mechanism? For example, Rust code 
snippets use # at the beginning of the line to hide it [1]. From the user 
perspective, it is very easy to use and maintain — you don’t have to design a 
regex, you won’t get any surprises from its evaluation, or read/maintain some 
over-complicated pattern that slipped through code review.

[1]: 
https://doc.rust-lang.org/rustdoc/documentation-tests.html#hiding-portions-of-the-example

We are open to amending the initial set of markup constructs. We could consider adding 
the parameterless trailing "@hide" construct which hides the line on which it 
stands; you won't need regex to use such a construct. Do you see much use for that 
particular construct or you provided it as an illustration for alternative, regex-less 
markup?

The latter — as an illustration of a construct that worked well for
this purpose in a different language.

As for #, I don't think prepending a line with # to hide that line would be apt 
for snippets in JavaDoc. This is because it breaks invariance, one of the major 
qualities we sought in markup. Markup is invariant if it doesn't depend on 
whether it is used in an inline or external snippet. Although using leading # 
to hide a line seems to work fine with inline snippets, it won't work with 
external snippets. Indeed, in Java source a line starting with # results in a 
syntax error, whereas in properties file the leading # starts a comment.

I see, I wasn't aware of such a requirement for the snippets. With
both inline and external snippets it looks reasonable. Rust, AFAIK,
does not support external snippets. However, good support for internal
works fine: in jni-rs we literally launch a JVM in a documentation
test :-)
https://github.com/jni-rs/jni-rs/blob/242c9545890ba39246ddac53608a9928d1300fda/src/wrapper/java_vm/vm.rs#L67-L106

By allowing only valid Java code in snippets, do you mean only valid
fragments of Java _source files_, or would you also allow valid jshell
snippets? For example, would you allow using (and hiding) import
statements, mixed with other statements:

import static java.util.stream.Collectors.toList; // @hide
import java.util.stream.Stream; // @hide (or hide=^import.+ in a
pattern based variant)
Stream.of("foo", "baz", "").filter(s -> !s.isEmpty()).collect(toList());


The input language for JShell is not Java (it's very similar, but obviously
has additional constructs). As such, I would expect JShell to be handled
as a different language, in the same way that we would handle properties
files and plain text files, by using the "lang" attribute.

lang=name — […] Valid names are java, properties and text

Would it be possible to specify any language, as the goal seems to be to pass 
this information to any rendering tools? Some converter libraries might use 
snippets with configs (xml, json, …), some FFI libs — in C++/Rust, some libs 
might provide examples for some JVM-based languages.

Currently, we see no reasons for not allowing an arbitrary language. That said, the JEP 
draft needs to clarify that. The draft needs to convey that while we are not restricting 
the set of languages, only the specified languages are guaranteed to be recognized by the 
standard doclet. What "recognized" translates to must also be explained.

region=name —

Would it make sense to provide some kind of selectors for Java constructs? E.g., 
block:<method_name, or class_name> selecting the corresponding method or class? 
This plugin for Mkdocs does that, but it operates on text, not on AST, therefore, 
some unexpected curly braces break it (e.g., in string literals) [2]. However, even 
simple text-based selection works well. Does the Javadoc have the luxury of having 
ready access to the AST? Or is it too complex to implement/maintain?

[2]: https://github.com/rnorth/mkdocs-codeinclude-plugin#usage

We are considering allowing to specify method bodies as complete snippet source 
or a region thereof.

For inline snippets, especially those that are not a full compilation unit, it will be up 
to the test infrastructure to "wrap" the code fragment in a full compilation 
unit, such that it can be compiled and possibly executed.

Would it be possible to keep the non-goal of not providing a standard tool to 
test them (to limit the scope of the JEP), but suggest a 
format/restrictions/expansion procedure on the snippets to support such test 
infra well (or enable adding it to JDK in the future)? I agree that external 
snippets work well (especially for large-ish fragments), but for some small 
things it is an overkill (you have to somehow configure the build system to 
build these files, but exclude from the final artifact, etc.), yet as a user 
I’d love to check their correctness. Would it make sense to:
- Specify a standard way how a snippet is expanded (e.g., wrapped inside 
Callable#call, or some standard template — Java language designers are in a 
perfect position to pick a great one). [3]
- Specify a set of tags that define the expectations of the expanded snippet 
behaviour (not sure it can be pulled into the scope, but the tools will require 
that information): [4]
    - Compiles/fails compilation
    - Runs successfully/Throws exception
    - Is totally ignored.

What you describe is all part of the non-goal, but that does not mean itwill not be addressed at some point down the road. The primary testharness for JDK itself is jtreg, and we will certainly be looking at howto analyze snippets in the context of that system, and that may informusers of other test frameworks. It's not clear that we need to imposeany guidelines in the spec of the tag itself: I think that guidelineswill arise from the context of the code used to analyze the snippets.


In Rust doctests, for example, all these things are supported:
[3] 
https://doc.rust-lang.org/rustdoc/documentation-tests.html#pre-processing-examples
 (adds main unless you do that)
[4] https://doc.rust-lang.org/rustdoc/documentation-tests.html#attributes

The "Snippets" feature's first and foremost use case is JDK itself. Snippets found in 
"<pre>{@code" compounds in JDK are so different from each other that it doesn't seem possible to 
provide a solution sufficiently general to test those snippets automatically. Although the concept of 
"doctest", which you are referring to, offers a highly attractive end-to-end testing, that concept 
applies cleanly to snippets that were written with that concept in mind, such as snippets structured as tests or 
executable as-is. The thing is, we don't have many of those in JDK.

Since we cannot generally ensure that every snippet is correct in the resulting 
documentation, we aim for the next best thing: enabling authors to ensure that 
the snippets are produced from correct sources. Authors don't have to build 
these sources, exclude them from the build artifacts, or use external snippets 
altogether; it's an option.

While doctests are not immediately useful in JDK, we recognize that they might 
benefit other projects. That's why we are exposing snippets in JavaDoc API. 
That API may be used to test snippets externally or in JavaDoc. For example, 
using attributes of the {@snippet} tag, snippet markup and a custom taglet, one 
should be able to teach JavaDoc doctesting.

Totally agree that an expansion scheme that works for arbitrary legacy
code is an impossible task. I rather meant specifying a good enough
expansion scheme for most snippets that are written with it in mind.
For legacy ones (that, on top of being written without any
restrictions, might reference non-existent symbols, or have an
ellipsis in place of an initializing expression) there could be an
option to skip executing, or even compiling such a snippet. Rustdoc,
for instance, supports both. Any legacy snippets then could be
migrated automatically with an option to not test them, and adjusted
later.

I agree that producing snippets from correct, separate sources is a
useful feature. However, as it is simpler to write them inline and
immediately see inside the non-rendered Javadoc, could you possibly
consider specifying the expansion scheme, so that tools share the same
specification (IDEs, a doctest tool)? For example, as Rust does
specify that, these inline snippets are understood by both the doc
test tool, and an IntelliJ plugin, which supports syntax highlighting,
inspections and refactorings.

Also, I think it may help the renderers: they could provide an option
to copy a complete snippet, or run it in jshell (with all the hidden
code, and, possibly, all the required imports inherited from the
containing class).

If there is already a way to add metadata to the snippet tag that
could help communicate the intent to a doc testing tool, that's
awesome.

While doctests are not immediately useful in JDK, we recognize that they might 
benefit other projects.

Absolutely, I've certainly used them in the API specs in my projects,
and benefitted from the examples in popular Java libraries: Mockito
has 250+; Guava — over 500; and AssertJ — 2.5K (with some
false-positives):
- 
https://sourcegraph.com/search?q=%5E%5Cs*%5C*.%2B%3Cpre+lang:java+repo:mockito/mockito+count:1000&patternType=regexp
- 
https://sourcegraph.com/search?q=%5E%5Cs*%5C*.%2B%3Cpre%3E+lang%3Ajava+repo%3Agoogle%2Fguava+count%3A1000&patternType=regexp
- 
https://sourcegraph.com/search?q=%5E%5Cs*%5C*.%2B%3Cpre+lang:java+repo:assertj/assertj-core+count:1000&patternType=regexp


Also, one more thing: there is a variety of present options for
embedding code snippets with various restrictions, and I often had to
reopen this comparison to choose the right one:
https://reflectoring.io/howto-format-code-snippets-in-javadoc/#code-markup-features-at-a-glance
Will @snippets require any escaping for "@"?

Finally, you might be interested in the previous attempts to implement such a 
tool for Java: https://github.com/jakewins/javadoctest
Beware the docs aren’t updated, its current version does not require writing 
test manually, and finds and extracts snippets on itself, e.g.:
https://github.com/jakewins/javadoctest/blob/master/junit-platform-engine/src/test/java/javadoctest/engine/fixture/FixtureDocTestSimple.java#L18-L22
--
Best regards,
Dmitry Timofeev

Thanks for providing the links to mkdocs and javadoctest.

-Pavel


--
Best,
Dmitry

Re: Draft JEP for upcoming work on snippets

Reply via email to