
Target Java 17 for main branch #5139

Open
ctubbsii opened this issue Dec 4, 2024 · 5 comments
ctubbsii (Member) commented Dec 4, 2024

Java 17 has some nice new features, and earlier LTS versions of Java are EOL. For new development branches, we should have the option of using newer Java features. Java 17 has records, for example, which are of particular interest. Also of personal interest to me is the ability to use the more robust switch expressions.
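To make the motivation concrete, here is a minimal sketch of the two Java 17 features mentioned above (records and switch expressions). The type and enum names are hypothetical, not actual Accumulo classes:

```java
public class Java17FeaturesDemo {
    // A record: an immutable data carrier with a generated constructor,
    // accessors, equals/hashCode, and toString.
    public record HostPort(String host, int port) {}

    public enum State { NEW, RUNNING, STOPPED }

    // A switch expression: exhaustive over the enum, yields a value directly,
    // with no fall-through and no break statements.
    public static String describe(State s) {
        return switch (s) {
            case NEW -> "not yet started";
            case RUNNING -> "up";
            case STOPPED -> "shut down";
        };
    }

    public static void main(String[] args) {
        HostPort hp = new HostPort("localhost", 9997);
        // Prints: HostPort[host=localhost, port=9997] is up
        System.out.println(hp + " is " + describe(State.RUNNING));
    }
}
```

The compiler rejects a non-exhaustive switch expression over an enum, which is part of what makes it more robust than the classic statement form.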

The main blocker is an unknown issue with starting Hadoop Yarn for MapReduce on a Java 17 VM. So, we can't put Accumulo byte-code into anything going through MapReduce until that's figured out. We do know that HDFS itself works on JDK 17, and the HDFS client does as well, because we've been running both in our ITs for a while now. So, as far as I can tell, the issue is limited to Yarn.

For testing, apache/fluo-uno#297 circumvented the problem by skipping starting Yarn when running with Java newer than version 11, but that's not a suitable solution for any production deployment of Accumulo (it's not even a suitable solution for testing, if you want to test MapReduce).

The issue may be quite trivial... I'm not sure.

Some initial investigation in Slack hints that some Java module options may need to be added to the yarn env script, or something similar, but there may be more to it than that.
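The module options in question relate to strong encapsulation: since JDK 16, deep reflection into `java.base` packages is denied by default, which is the kind of failure that older libraries in Hadoop's dependency tree hit on Java 17. The probe below is a sketch illustrating what `--add-opens java.base/java.lang=ALL-UNNAMED` changes; it is not the actual Hadoop code path:

```java
import java.lang.reflect.Field;
import java.lang.reflect.InaccessibleObjectException;

public class AddOpensProbe {
    // Attempts the kind of deep reflection that older libraries rely on.
    // Returns true if the JVM allowed it, i.e. java.lang was opened to us.
    public static boolean canDeepReflectIntoJavaLang() {
        try {
            // String.value is a private field of a java.base class.
            Field value = String.class.getDeclaredField("value");
            // Throws InaccessibleObjectException when the package is closed.
            value.setAccessible(true);
            return true;
        } catch (InaccessibleObjectException | NoSuchFieldException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        System.out.println("deep reflection into java.lang allowed: "
                + canDeepReflectIntoJavaLang());
        // On JDK 17 with default flags this prints "false"; launching with
        // --add-opens java.base/java.lang=ALL-UNNAMED makes it print "true".
    }
}
```

Passing the flag per-process via `YARN_RESOURCEMANAGER_OPTS`/`YARN_NODEMANAGER_OPTS` (as discussed below) scopes the opened package to the Yarn daemons rather than every JVM on the host.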

For reference, the Hadoop tracking ticket for Java 17 support is here.

@ctubbsii ctubbsii added this to the 4.0.0 milestone Dec 4, 2024
cshannon (Contributor) commented Dec 5, 2024

I can revisit this and see what I come up with. It would be nice if we just needed to add a new Java module option or the like, but I'll try to find out whether there's a bigger issue preventing JDK 17 from being used.

cshannon (Contributor) commented Dec 6, 2024

I tested this out with main (4.0.0-SNAPSHOT), Uno, and accumulo-testing, and it seems to be working. Based on the Hadoop Jira ticket, there are some incompatibilities with libraries such as Guice 4.x, but running with the extra Java arguments worked in my testing. I didn't see any MapReduce failures like those noted in Slack.

I made the following changes to get things to work:

  1. Bumped the required target JDK to 17 for accumulo and built a new 4.0.0-SNAPSHOT off main.
  2. Set up a new accumulo install with Uno using 4.0.0-SNAPSHOT.
  3. Modified hadoop.sh to no longer skip Yarn when using JDK 17 and to start it up.
  4. Tried to start up the instance and verified the same errors we saw before were in the resource manager and node manager logs, as shown in the Slack chat.
  5. Added the following to yarn-env.sh:
     export YARN_RESOURCEMANAGER_OPTS="--add-opens java.base/java.lang=ALL-UNNAMED"
     export YARN_NODEMANAGER_OPTS="--add-opens java.base/java.lang=ALL-UNNAMED"
  6. Restarted Uno and verified all the errors were gone from the hadoop logs.
  7. Bumped the required JDK target to 17 for the accumulo-testing project, updated the accumulo dependency to 4.0.0-SNAPSHOT, and rebuilt the project.
  8. Ran the continuous ingest test, ingested some data for a few minutes, and then ran the verify test, which submits a MapReduce job; it completed successfully. I ran this a few times. Both the Hadoop resource manager web page and the output from the logs and console show it completed without error.

@cshannon cshannon self-assigned this Dec 6, 2024
cshannon (Contributor) commented Dec 6, 2024

I had a bunch of weird issues at first while testing, until I realized I also had to update accumulo-testing to target JDK 17 and depend on accumulo 4.0. Once I did that (and removed some deprecated features that were removed in 4.0), things worked without issue.

ctubbsii (Member Author) commented

Awesome! Thanks, @cshannon. I think that unblocks us from targeting Java 17, but to help with development, we need some changes in the accumulo-testing repo and in the fluo-uno repo.

cshannon added a commit to cshannon/fluo-uno that referenced this issue Dec 13, 2024
Hadoop is using some older dependencies that still require access to
internal JDK features so this enables that by adding JVM args

See apache/accumulo#5139
ctubbsii pushed a commit to apache/fluo-uno that referenced this issue Dec 13, 2024
Hadoop is using some older dependencies that still require access to
internal JDK features so this enables that by adding JVM args

See apache/accumulo#5139
cshannon (Contributor) commented

> Awesome! Thanks, @cshannon. I think that unblocks us from targeting Java 17, but to help with development, we need some changes in the accumulo-testing repo and in the fluo-uno repo.

fluo-uno should now correctly start up Yarn when running with JDK 17 and execute MapReduce correctly, after merging apache/fluo-uno#305.

I think we need to wait on changes to accumulo-testing until the corresponding changes land on accumulo's main branch first. The biggest thing to change in the testing repo is bumping the target JDK to 17, to be in sync with the target JDK of 17 in accumulo, when it's time.
