RFR: 2637: Decoding emails from quoted-printable is broken [v2]

Zhao Song zsong at openjdk.org
Wed Dec 17 01:32:23 UTC 2025


On Tue, 16 Dec 2025 23:18:17 GMT, Erik Joelsson <erikj at openjdk.org> wrote:

>> During my initial implementation of Mailman 3 support, I made an attempt at decoding quoted-printable encoded email bodies. That implementation isn't working that well. I only took 2 byte encoded UTF-8 characters into account, but we of course need to also handle 3 and 4 byte characters.
>> 
>> Instead of trying to do this with regular expressions, I bit the bullet and started working on a byte array, byte by byte. That actually makes it a lot simpler as we just need to translate each encoded triplet (`=XX`) at a time and then just convert the resulting byte array using Java's built in character set decoder.
>
> Erik Joelsson has updated the pull request incrementally with one additional commit since the last revision:
> 
>   Added logging of failed email parsing

Marked as reviewed by zsong (Reviewer).

-------------

PR Review: https://git.openjdk.org/skara/pull/1747#pullrequestreview-3585619708


More information about the skara-dev mailing list