Почему G1 стоит столько времени на копирование объектов?

Вот мой журнал gc:

2016-08-16T01:45:35.968+0000: 62265.934: [GC pause (G1 Evacuation Pause) (young)
Desired survivor size 473956352 bytes, new threshold 15 (max 15)
- age   1:   12641224 bytes,   12641224 total
- age   2:    3092400 bytes,   15733624 total
- age   3:    1914704 bytes,   17648328 total
- age   4:     204696 bytes,   17853024 total
- age   5:     389896 bytes,   18242920 total
- age   6:     101528 bytes,   18344448 total
- age   7:    1106168 bytes,   19450616 total
- age   8:     344336 bytes,   19794952 total
- age   9:     301328 bytes,   20096280 total
- age  10:     309576 bytes,   20405856 total
- age  11:     305464 bytes,   20711320 total
- age  12:      32672 bytes,   20743992 total
- age  13:      41264 bytes,   20785256 total
- age  14:      50960 bytes,   20836216 total
- age  15:      56904 bytes,   20893120 total
 62265.934: [G1Ergonomics (CSet Construction) start choosing CSet, _pending_cards: 7520, predicted base time: 185.93 ms, remaining time: 14.07 ms, target pause time: 200.00 ms]
 62265.934: [G1Ergonomics (CSet Construction) add young regions to CSet, eden: 1793 regions, survivors: 9 regions, predicted young region time: 7.32 ms]
 62265.934: [G1Ergonomics (CSet Construction) finish choosing CSet, eden: 1793 regions, survivors: 9 regions, old: 0 regions, predicted pause time: 193.25 ms, target pause time: 200.00 ms]
2016-08-16T01:45:36.626+0000: 62266.592: [SoftReference, 0 refs, 0.0208511 secs]2016-08-16T01:45:36.647+0000: 62266.613: [WeakReference, 21 refs, 0.0101522 secs]2016-08-16T01:45:36.657+0000: 62266.623: [FinalReference, 5106 refs, 0.0153084 secs]2016-08-16T01:45:36.673+0000: 62266.639: [PhantomReference, 24 refs, 0 refs, 0.0473559 secs]2016-08-16T01:45:36.720+0000: 62266.686: [JNI Weak Reference, 0.0000260 secs], 0.7593297 secs]
   [Parallel Time: 655.2 ms, GC Workers: 20]
      [GC Worker Start (ms): Min: 62265934.2, Avg: 62265936.5, Max: 62265953.3, Diff: 19.0]
      [Ext Root Scanning (ms): Min: 0.0, Avg: 0.9, Max: 8.7, Diff: 8.7, Sum: 17.9]
      [Update RS (ms): Min: 0.0, Avg: 0.6, Max: 1.4, Diff: 1.4, Sum: 12.9]
         [Processed Buffers: Min: 0, Avg: 6.2, Max: 24, Diff: 24, Sum: 125]
      [Scan RS (ms): Min: 0.1, Avg: 0.3, Max: 0.5, Diff: 0.3, Sum: 6.2]
      [Code Root Scanning (ms): Min: 0.0, Avg: 0.0, Max: 0.1, Diff: 0.1, Sum: 0.5]
      [Object Copy (ms): Min: 0.0, Avg: 256.0, Max: 650.0, Diff: 650.0, Sum: 5119.2]
      [Termination (ms): Min: 0.0, Avg: 394.2, Max: 637.9, Diff: 637.9, Sum: 7883.4]
         [Termination Attempts: Min: 1, Avg: 2.5, Max: 9, Diff: 8, Sum: 51]
      [GC Worker Other (ms): Min: 0.0, Avg: 0.2, Max: 0.5, Diff: 0.4, Sum: 3.8]
      [GC Worker Total (ms): Min: 635.7, Avg: 652.2, Max: 654.6, Diff: 18.9, Sum: 13043.7]
      [GC Worker End (ms): Min: 62266588.5, Avg: 62266588.7, Max: 62266588.9, Diff: 0.4]
   [Code Root Fixup: 0.2 ms]
   [Code Root Purge: 0.0 ms]
   [Clear CT: 1.4 ms]
   [Other: 102.5 ms]
      [Choose CSet: 0.0 ms]
      [Ref Proc: 96.4 ms]
      [Ref Enq: 1.6 ms]
      [Redirty Cards: 1.1 ms]
      [Humongous Register: 0.2 ms]
      [Humongous Reclaim: 0.0 ms]
      [Free CSet: 2.6 ms]
   [Eden: 7172.0M(7172.0M)->0.0B(928.0M) Survivors: 36.0M->60.0M Heap: 7522.8M(11.7G)->375.5M(11.7G)]
 [Times: user=8.15 sys=3.37, real=0.76 secs] 

И мои варианты Java:

-server -Xms8G -Xmx12G -XX:+UnlockDiagnosticVMOptions -XX:G1HeapRegionSize=4m -Xss512K -XX:+HeapDumpOnOutOfMemoryError -XX:+UseG1GC -XX:MaxGCPauseMillis=200 -XX:ParallelGCThreads=20 -XX:ConcGCThreads=5 -XX:InitiatingHeapOccupancyPercent=70 -XX:+ParallelRefProcEnabled -XX:+PrintGCDateStamps -XX:+PrintGCDetails -XX:+PrintReferenceGC -XX:+PrintAdaptiveSizePolicy -XX:+PrintTenuringDistribution -Xloggc:./gc.log

Кроме того, мой процессор имеет 16 ядер. Так что после того, как я уменьшил ParallelGCThreads до 15, он сократил время молодого gc вдвое. Но все же эта проблема произошла. И вот новый журнал gc:

2016-08-22T07:42:58.651+0000: 65778.814: [GC pause (G1 Evacuation Pause) (young)
Desired survivor size 322961408 bytes, new threshold 15 (max 15)
- age   1:    6522744 bytes,    6522744 total
- age   2:    2609200 bytes,    9131944 total
- age   3:    1348504 bytes,   10480448 total
- age   4:     557144 bytes,   11037592 total
- age   5:     251744 bytes,   11289336 total
- age   6:     972272 bytes,   12261608 total
- age   7:     500232 bytes,   12761840 total
- age   8:    2242544 bytes,   15004384 total
- age   9:     116296 bytes,   15120680 total
- age  10:     280728 bytes,   15401408 total
- age  11:      85728 bytes,   15487136 total
- age  12:     394272 bytes,   15881408 total
- age  13:     295568 bytes,   16176976 total
- age  14:     728600 bytes,   16905576 total
- age  15:    1356328 bytes,   18261904 total
 65778.814: [G1Ergonomics (CSet Construction) start choosing CSet, _pending_cards: 8112, predicted base time: 62.52 ms, remaining time: 137.48 ms, target pause time: 200.00 ms]
 65778.814: [G1Ergonomics (CSet Construction) add young regions to CSet, eden: 1223 regions, survivors: 5 regions, predicted young region time: 7.95 ms]
 65778.814: [G1Ergonomics (CSet Construction) finish choosing CSet, eden: 1223 regions, survivors: 5 regions, old: 0 regions, predicted pause time: 70.46 ms, target pause time: 200.00 ms]
2016-08-22T07:43:02.747+0000: 65782.910: [SoftReference, 0 refs, 0.0021318 secs]2016-08-22T07:43:02.749+0000: 65782.912: [WeakReference, 4 refs, 0.0013452 secs]2016-08-22T07:43:02.750+0000: 65782.913: [FinalReference, 4029 refs, 0.0025002 secs]2016-08-22T07:43:02.753+0000: 65782.916: [PhantomReference, 11 refs, 0 refs, 0.0027905 secs]2016-08-22T07:43:02.756+0000: 65782.919: [JNI Weak Reference, 0.0000197 secs], 4.1088277 secs]
   [Parallel Time: 4094.4 ms, GC Workers: 15]
      [GC Worker Start (ms): Min: 65778814.7, Avg: 65778817.2, Max: 65778820.8, Diff: 6.1]
      [Ext Root Scanning (ms): Min: 0.0, Avg: 0.6, Max: 1.5, Diff: 1.5, Sum: 9.2]
      [Update RS (ms): Min: 0.0, Avg: 1.1, Max: 1.9, Diff: 1.9, Sum: 16.0]
         [Processed Buffers: Min: 0, Avg: 8.7, Max: 27, Diff: 27, Sum: 130]
      [Scan RS (ms): Min: 0.1, Avg: 0.2, Max: 0.3, Diff: 0.3, Sum: 3.0]
      [Code Root Scanning (ms): Min: 0.0, Avg: 0.0, Max: 0.1, Diff: 0.1, Sum: 0.5]
      [Object Copy (ms): Min: 4059.8, Avg: 4081.9, Max: 4090.2, Diff: 30.4, Sum: 61228.9]
      [Termination (ms): Min: 0.0, Avg: 7.4, Max: 28.2, Diff: 28.2, Sum: 111.1]
         [Termination Attempts: Min: 1, Avg: 1.3, Max: 3, Diff: 2, Sum: 19]
      [GC Worker Other (ms): Min: 0.0, Avg: 0.1, Max: 0.3, Diff: 0.2, Sum: 1.6]
      [GC Worker Total (ms): Min: 4087.8, Avg: 4091.4, Max: 4094.0, Diff: 6.2, Sum: 61370.5]
      [GC Worker End (ms): Min: 65782908.5, Avg: 65782908.6, Max: 65782908.7, Diff: 0.2]
   [Code Root Fixup: 0.2 ms]
   [Code Root Purge: 0.0 ms]
   [Clear CT: 0.8 ms]
   [Other: 13.5 ms]
      [Choose CSet: 0.0 ms]
      [Ref Proc: 9.5 ms]
      [Ref Enq: 0.8 ms]
      [Redirty Cards: 0.8 ms]
      [Humongous Register: 0.1 ms]
      [Humongous Reclaim: 0.0 ms]
      [Free CSet: 1.8 ms]
   [Eden: 4892.0M(4892.0M)->0.0B(388.0M) Survivors: 20.0M->20.0M Heap: 5284.2M(8192.0M)->392.7M(8192.0M)]
 [Times: user=0.00 sys=62.51, real=4.11 secs]

1 ответ

Хотя коллектор G1 пытается соблюдать максимальное время GC, он не является жестким ограничением и может легко преодолеть это время. В одном случае может быть скопирован большой объект, или кажется, что по крайней мере одному из потоков потребовалось много времени для его копирования.

Если у вас менее 20 процессоров или запущен другой процесс, что означает, что у вас менее 20 свободных процессоров, поток GC можно запланировать на некоторое время, прежде чем он получит возможность снова запустить и завершить.

Если вы посмотрите на

user=8.15 sys=3.37, real=0.76 secs

Это говорит о том, что вы использовали 11,52 секунды ЦП за 0,76 секунды, что означает, что у вас было одновременно только около 15,1 ЦП, поэтому около 5 из них не могли работать в среднем.

Я бы попытался уменьшить количество процессоров до 15, чтобы у вас не было больше потоков, чем у вас есть свободные процессоры.

Другие вопросы по тегам