[performance] Avoid reading SourceFile twice #3030

jukzi · 2024-09-30T08:31:22Z

During compile parsing happens in two stages:

diet parse (any blocks like method bodies are skipped)
parse bodies Both phases did read the source .java file from file system. With this change the file contents is kept in CompilationResult.contentRef until no longer needed. It is cached in
a SoftReference to avoid OutOfMemoryError.

HannesWell · 2024-09-30T17:08:20Z

2. It is cached in
a SoftReference to avoid OutOfMemoryError.

Since I see you using SoftReference in more and more places I want to add a word of warning:
While the idea is tempting, using a lot of SoftReferences can cause serious performance problems. Depending on their count, the memory they retain, the JVM implementation respectively the GC in use and it's settings, it might happen that the GC is in constant 'panic' and trying to free memory but only a little and blocking the entire application with that.

See for example from this article Weak, Soft, and Phantom References in Java:

You need to keep in mind that filling almost all your memory can slow down your program so much that a cache hardly matters.
It’s easy to verify this just by running the program and uncommenting the line that creates the SoftReference.

Or for example this comment were I learned this myself: google/guava#5311 (comment)

So if SoftReferences are used (a lot), because all this is fuzzy, actually sophisticated benchmarks with different loads, GCs and CG-settings should be done to ensure there is an overall net benefit in using them.
Of course it would be ideal if they are simply avoided by a different logic or program flow, but I have no clue if this is possible.

During compile parsing happens in two stages: 1. diet parse (any blocks like method bodies are skipped) 2. parse bodies Both phases did read the source .java file from file system. With this change the file contents is kept in CompilationResult.contentRef until no longer needed. It is cached in a SoftReference to avoid OutOfMemoryError. eclipse-jdt#2691

jukzi · 2024-10-01T12:20:27Z

Depending on their count, the memory they retain

therefore do measurements, measurements and measurements. Like in all good science.

@stephan-herrmann i think i adapted the code as you demanded and i like the CompilationResult way. Do you want to review? Please just leave a message if you need more time.

Merge conflicts solved - CompilationResult had javadoc above the imports, which could not be automatically solved.

jukzi requested a review from stephan-herrmann September 30, 2024 08:31

jukzi mentioned this pull request Sep 30, 2024

[performance] Avoid reading SourceFile twice #2910

Closed

jukzi force-pushed the CompilationResult.contentRef branch from ffdb9d9 to 6468536 Compare September 30, 2024 08:33

jukzi force-pushed the CompilationResult.contentRef branch from 6468536 to 8f5a2bf Compare October 1, 2024 12:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[performance] Avoid reading SourceFile twice #3030

[performance] Avoid reading SourceFile twice #3030

jukzi commented Sep 30, 2024

HannesWell commented Sep 30, 2024

jukzi commented Oct 1, 2024

[performance] Avoid reading SourceFile twice #3030

Are you sure you want to change the base?

[performance] Avoid reading SourceFile twice #3030

Conversation

jukzi commented Sep 30, 2024

HannesWell commented Sep 30, 2024

jukzi commented Oct 1, 2024