RFR: 8299080: Wrong default value of snippet lang attribute [v3]

Pavel Rappo prappo at openjdk.org
Tue Jul 16 09:27:22 UTC 2024


> Please review this bugfix to the way the language of a snippet is determined and processed.
> 
> The language of a snippet affects the form of snippet markup and enables external syntax highlighting, such as that provided by prism.js. The language of a snippet is [determined](https://docs.oracle.com/en/java/javase/22/docs/specs/javadoc/doc-comment-spec.html#snippet) as follows:
> 
>> A snippet may specify a `lang` attribute, which identifies the kind of content in the snippet. For an inline snippet, the default value is `java`. For an external snippet, the default value is derived from the extension of the name of the file containing the snippet's content.
> 
> There are two issues that this PR fixes. The first issue is a specification issue. The spec says nothing about the language of a hybrid snippet, which has features of both an inline and external snippets. It makes sense to specify that in the absence of the `lang` attribute, the language of a hybrid snippet is derived from the file extension. Put differently, when determining the language, a hybrid snippet behaves like an external snippet, not like an inline snippet.
> 
> The second issue is an implementation issue. If the `lang` attribute or the file extension is `java` or `properties`, then the form of markup corresponds to that language and the HTML construct modelling the snippet is attributed with `class=language-java` or `class=language-properties` respectively. This is expected. However, if the `lang` attribute or the file extension is neither of those, or the `lang` attribute is default, then the form of markup is assumed to be that of `java`, but the HTML construct modelling the snippet is not attributed, which means that the language is not passed through to the 3rd party syntax highlighters.
> 
> Stepping out of this PR for a moment, there is clearly a conflation between the language of a snippet and the form of snippet markup. Those are linked and controlled by a single knob. That and the design whereby every snippet in an unsupported language can use markup for the Java language was purposeful: it was considered simple and practical.
> 
> This PR proposes that the language of a snippet is determined and processed as follows:
> 
> 1. If the `lang` attribute is present, then its value is the language; if that value is empty, then the language is undefined
> 2. Otherwise, 
>    1. If the snippet is inline, then the language is `java`
>    2. Otherwise (i.e. the snippet is external or hybrid), the language is determined as...

Pavel Rappo has updated the pull request incrementally with one additional commit since the last revision:

  Avoid Path.of as suggested

-------------

Changes:
  - all: https://git.openjdk.org/jdk/pull/19971/files
  - new: https://git.openjdk.org/jdk/pull/19971/files/97df652d..04a37ad4

Webrevs:
 - full: https://webrevs.openjdk.org/?repo=jdk&pr=19971&range=02
 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=19971&range=01-02

  Stats: 4 lines in 1 file changed: 3 ins; 0 del; 1 mod
  Patch: https://git.openjdk.org/jdk/pull/19971.diff
  Fetch: git fetch https://git.openjdk.org/jdk.git pull/19971/head:pull/19971

PR: https://git.openjdk.org/jdk/pull/19971


More information about the javadoc-dev mailing list