RFR (L): 8046148: JEP 158 Unified JVM Logging

Thu Sep 10 09:27:28 UTC 2015

Hi Vitaly,

I think it would be much better to go towards a digital logging system. However, as difficult as it is to talk about a tag system, I think it would be more difficult to talk about a digital system.

> It sounds like you're proposing a hierarchical naming scheme for log statements; if anyone is familiar with the Graphite tool, it does something similar for metric names.  You can then wildcard certain parts of the path when using its query tools.  Is that what you have in mind?
> 
> 
This is similar to what I have in mind.
> That's flexible but can lead to other issues.
> 
Agreed, there is no free lunch. The question now becomes: where do you want to pay the cost? Next, how much of that cost is real? I try to answer that below by adding some context. As I mentioned before, if you don’t want to pay the cost you can simply define levels, if that is what you like and that is how your taxonomy works. This isn’t about preventing people from using levels, it’s about not forcing them to use them. It’s also about the advantage that tags can cover situations that you’ve yet to imagine. In fact, it seems as if tags are in because there were situations that were not imagined in the first spec. And that’s OK, we can’t always imagine everything that could happen. All we can hope for is that when the unimagined happens, the system is flexible enough to cope with it.
> Every log line will need to know which path it's using, which can lead to an explosion of these paths as JVM devs will need to know what tags already exist for the (sub)component they're logging from.  It's much easier to quickly decide what level is appropriate.
> 
If you put this into the context of GC logging, what is a warning log level? IMHO, it makes no sense. I can’t even put a reliable warning metric into Censum that would be a warning in all circumstances.

> Consumers will still need to know what tags exist to get the right level of logging; wildcarding parts may be too much, fully specifying all desired tags can become unwieldy.
> 
> 
Indeed, on the surface it looks like it can. However, if you look at the current logging system in GC, there are not that many flags that most people would be interested in. Certainly the number isn’t unwieldy. Converting that to a tag system would be straightforward. Moving to levels… I wouldn’t even know where to start. So theory doesn’t really meet practice in my experience.
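
To make the wildcard idea concrete, here is a minimal sketch of selecting log statements by hierarchical tag path, in the spirit of the Graphite-style naming mentioned above. The class name TagSelector, the dot-separated path syntax, and tag names such as gc.young.pause are assumptions made for illustration only; none of this is the syntax proposed by the JEP.

```java
// Hypothetical sketch: matches dot-separated tag paths against a selector
// in which "*" stands for exactly one path segment.
final class TagSelector {
    private final String[] pattern;

    TagSelector(String selector) {
        this.pattern = selector.split("\\.");
    }

    // Returns true when every segment of tagPath equals the corresponding
    // selector segment, or the selector segment is the wildcard "*".
    boolean matches(String tagPath) {
        String[] parts = tagPath.split("\\.");
        if (parts.length != pattern.length) {
            return false;
        }
        for (int i = 0; i < pattern.length; i++) {
            if (!pattern[i].equals("*") && !pattern[i].equals(parts[i])) {
                return false;
            }
        }
        return true;
    }
}
```

With this sketch, a selector such as gc.*.pause would accept gc.young.pause and gc.old.pause but reject gc.young.promotion, so a consumer only needs to know the handful of categories they care about rather than every tag in the system.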

I’m happy that we’re able to engage in this conversation, though I’m not sure how useful it is. What I’m trying to impart here is my experience tuning the very bad and broken logging that I often run into in many applications. It seems that my experiences are quite different from yours (as in the group that is implementing this JEP). FWIW, what I can say is that the most successful engagements have been those where I’ve been able to convince teams that logging should be an architectural-level concern that involves conversations with operations and/or support groups to make sure that their needs are met. Most of that work involves deriving a taxonomy, or a way of categorizing the types of log messages that need to be produced. None of the current Java logging frameworks adequately support the outputs of this activity. Defining custom levels doesn’t work, and with some frameworks leads to some very disastrous results *with no fault to the application developers*. As you can see, the shape of the name space and/or the names used to describe the categories of messages is, or should be, beyond the scope of this JEP. Unfortunately it isn’t. IMHO, although this version of the JEP is much, much better than the original version, it has still overreached. It has overreached because from the start the wrong logging model was used to help define the JEP. If you absolutely disagree I’ll let you push on without interruption.

> Before going further, I'll stop to confirm that this is your gist.
> 
> As for Chronicle, that seems like a separate issue altogether which is the mechanics of the logging.  I do agree that logging needs to be as efficient as possible as it's done synchronously.  This also implies the filtering scheme has to be efficient when the JVM decides whether the log should be emitted or not.
> 
> 
Yes, performance is a different (yet somewhat related) issue. My apologies for not clearly articulating that. The idea behind Chronicle is to not filter messages. In fact, it drops them on disk in a raw format. If you want to filter and reformat, you can do that out of band. When logging latencies are significant, which they quite often are, I first look at how to reduce volume and then I work to strip out all the decorators and filtering. In fact, I’ve been recommending and moving apps to use more compact, information-dense binary formats. All of this gets particularly critical in virtualized environments where disk is typically NAS. In all cases the less you do, the better things get.
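
As a rough illustration of "write raw, filter later", the sketch below appends fixed-width binary records on the hot path and leaves decoding, filtering, and formatting to a separate step. It does not use Chronicle's API; the class RawEventLog, the record layout, and the event ids are all assumptions made for this example, and an in-memory buffer stands in for what would be a file or memory-mapped region in practice.

```java
import java.nio.ByteBuffer;
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch of raw binary logging with out-of-band decoding.
final class RawEventLog {
    // Fixed-width record: 8-byte timestamp, 4-byte event id, 8-byte value.
    static final int RECORD_BYTES = Long.BYTES + Integer.BYTES + Long.BYTES;

    private final ByteBuffer buffer;

    RawEventLog(int maxRecords) {
        this.buffer = ByteBuffer.allocate(maxRecords * RECORD_BYTES);
    }

    // Hot path: no filtering, no formatting, no decorators.
    // Throws BufferOverflowException once maxRecords is exceeded.
    void append(long timestampNanos, int eventId, long value) {
        buffer.putLong(timestampNanos).putInt(eventId).putLong(value);
    }

    // Out-of-band path: decode every record written so far, keep only the
    // wanted event id, and format the survivors as text.
    List<String> decode(int wantedEventId) {
        ByteBuffer view = buffer.duplicate(); // independent position and limit
        view.flip();                          // read from 0 up to the write position
        List<String> lines = new ArrayList<>();
        while (view.remaining() >= RECORD_BYTES) {
            long ts = view.getLong();
            int id = view.getInt();
            long value = view.getLong();
            if (id == wantedEventId) {
                lines.add(ts + " event=" + id + " value=" + value);
            }
        }
        return lines;
    }
}
```

The point of the design is that the writer's cost is three primitive stores per record, while everything expensive (string building, filtering) happens only when, and if, someone reads the log.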

Kind regards,
Kirk Pepperdine