Consolidated repo vs. old forest size differences
Volker Simonis
volker.simonis at gmail.com
Thu Sep 28 22:15:25 UTC 2017
Hi,
I'm not sure whether this has been discussed before, but I couldn't
find any references to it in the previous mail threads on the repo
consolidation.
I've just realized that the size of the repository history (i.e.
everything under .hg) has roughly doubled in the new consolidated repo
(about 1.6 GB vs. 829 MB for the old forest), and I don't understand
exactly why:
$ du -shc jdk10-hs-old/*/.hg jdk10-hs-old/.hg
16M jdk10-hs-old/corba/.hg
141M jdk10-hs-old/hotspot/.hg
49M jdk10-hs-old/jaxp/.hg
57M jdk10-hs-old/jaxws/.hg
453M jdk10-hs-old/jdk/.hg
76M jdk10-hs-old/langtools/.hg
33M jdk10-hs-old/nashorn/.hg
8,1M jdk10-hs-old/.hg
829M total
$ du -sh jdk10-hs/.hg
1,6G jdk10-hs/.hg
I wonder why this is the case?
Is this because the consolidated repo has more and bigger merge changes?
The consolidated repo has a total of 47297 changes with about 13878
merge changes:
$ hg -R jdk10-hs log --template "{rev}\n" -r tip
47297
$ hg -R jdk10-hs log --template "{rev}\n" -k Merge | wc
13878 13878 79600
The old forest had a total of 43102 changes with about 10408 merge changes:
$ bash common/bin/hgforest.sh log --template "{rev}\n" | wc
43102 86285 1295798
$ bash common/bin/hgforest.sh log -k "Merge" --template "{rev}\n" | wc
10408 20897 312491
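Note that -k Merge is a case-insensitive keyword search over the commit
message, user name and changed file names, so the merge counts above are
approximate. A minimal sketch of a stricter count, assuming the installed
Mercurial supports the merge() revset (the function names and the HG
variable are hypothetical helpers, not part of any JDK script):

```shell
# Hypothetical override hook so a different hg binary can be substituted.
HG=${HG:-hg}

# Count changesets that have two parents, i.e. real merges.
count_merges() {
    "$HG" -R "$1" log -r 'merge()' --template '{rev}\n' | wc -l | tr -d ' '
}

# Count all changesets in the repository.
count_all() {
    "$HG" -R "$1" log --template '{rev}\n' | wc -l | tr -d ' '
}
```

For example, "count_merges jdk10-hs" should print the number of two-parent
changesets in the consolidated repo, independent of the commit messages.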
So the new consolidated repo has about 4200 more changes (47297 vs.
43102), of which roughly 3500 are merge changesets (13878 vs. 10408).
Does anybody know a nice command to sum up the size of all merge
changesets?
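One possible way to approximate this is sketched below. It assumes that
the "size" of a merge can be measured as the byte count of its diff
against the first parent (as printed by hg diff -c), which is not the
same as the space the changeset occupies under .hg. The function name
and the HG variable are hypothetical:

```shell
# Hypothetical override hook so a different hg binary can be substituted.
HG=${HG:-hg}

# Sum the diff size (in bytes) of every merge changeset in a repository.
# Assumption: size = bytes of `hg diff -c REV`, i.e. the diff against the
# first parent, not the on-disk revlog size.
sum_merge_diff_bytes() {
    repo=$1
    total=0
    # merge() selects changesets with two parents.
    for rev in $("$HG" -R "$repo" log -r 'merge()' --template '{rev}\n'); do
        bytes=$("$HG" -R "$repo" diff -c "$rev" | wc -c)
        total=$((total + bytes))
    done
    echo "$total"
}
```

Running "sum_merge_diff_bytes jdk10-hs" would print a single byte total.
Since it invokes hg once per merge, expect it to take a while on a repo
with more than ten thousand merges.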
Any other insights or comments? It would be especially interesting to
know how this will evolve in the future.
Regards,
Volker
PS: This also partially explains why downloading the new repo takes
considerably longer than downloading the old forest. The second reason
is that the get_sources.sh script downloaded the forest repositories
in parallel.