harmony 鸿蒙GC

  • 2025-06-06
  • 浏览 (3)

GC

Garbage Collection (GC) is a process that identifies and reclaims memory no longer in use by a program. It aims to free up unused memory space. Modern programming languages implement two primary types of GC algorithms: reference counting and tracing GC.

GC Algorithm Overview

Types of GC

Reference Counting

When object A is referenced by object B, A’s reference count increases by 1. Conversely, when the reference is removed, A’s reference count decreases by 1. When A’s reference count reaches 0, object A is reclaimed.

  • Pros: Reference counting is simple to implement and allows for immediate memory reclamation, avoiding a dedicated Stop The World (STW) phase where the application is paused.
  • Cons: The extra counting step during object manipulation increases the overhead of memory allocation and assignment, affecting performance. Most critically, it fails to handle circular references.
class Parent {
  constructor() {
    this.child = null;
  }
  child: Child|null = null;
}

class Child {
  constructor() {
    this.parent = null;
  }
  parent: Parent|null = null;
}

function main() {
  let parent: Parent = new Parent();
  let child: Child = new Child();
  parent.child = child;
  child.parent = parent;
}

In the example above, parent holds a reference to child, and child holds a reference to parent. This circular reference means that neither object’s reference count will reach zero, even after the main function ends, resulting in a memory leak.

Tracking GC

image

Tracing GC identifies live objects by traversing the object graph starting from root objects, which include stack objects and global objects that are guaranteed to be alive. Objects referenced by these root objects are also considered live. The traversal marks all reachable objects (blue) as live, whereas unreachable objects (yellow) are considered garbage.

  • Pros: Tracing GC solves the circular reference problem and does not incur additional overhead during memory allocation and assignment.
  • Cons: Tracing GC is more complex than reference counting and introduces a brief STW phase. Additionally, GC can be delayed, leading to floating garbage.

Given the limitations of reference counting, especially the critical issue of circular references, ArkTS Runtime uses tracing GC.

Types of Tracing GC

Tracing GC algorithms identify garbage by traversing the object graph. Based on the collection method, tracing GC can be categorized into three basic types: mark-sweep, mark-copy, and mark-compact. In the diagrams below, blue indicates live objects, whereas yellow indicates garbage.

Mark-Sweep Collection

image

After traversing the object graph, the algorithm clears the contents of unreachable objects and places them in a free list for future allocations. This approach is highly efficient as it does not move objects. However, it can lead to memory fragmentation, reducing allocation efficiency and potentially preventing the allocation of large objects despite ample free memory.

Mark-Copy Collection

image

During the traversal of the object graph, reachable objects are copied to a new, contiguous memory space. After the traversal, the old memory space is reclaimed entirely. This approach eliminates memory fragmentation and completes the GC process in a single traversal, making it efficient. However, it requires reserving half of the memory space to ensure all live objects can be copied, resulting in lower space utilization.

Mark-Compact Collection

image

After the traversal, live objects (blue) are copied to the beginning of the current space (or a designated area), and the copied objects are reclaimed and placed in the free list. This approach addresses memory fragmentation without wasting half of the memory space, though it incurs higher performance overhead when compared with mark-copy collection.

HPP GC

High Performance Partial Garbage Collection (HPP GC) is designed for high performance in partial GC, leveraging generational models, hybrid algorithms, and optimized GC processes. In terms of algorithms, HPP GC uses different collection approaches based on different object regions.

Generational Model

ArkTS Runtime uses a traditional generational model, categorizing objects based on their lifetimes. Given that most newly allocated objects are short-lived and collected during the first GC cycle, whereas objects surviving multiple GC cycles are likely to remain alive, ArkTS Runtime divides objects into young and old generations, allocating them to separate spaces.

image

Newly allocated objects are placed in the from space of Young Space. After surviving one GC cycle, they are moved to the to space, which then swaps roles with the from space. Objects surviving another GC cycle are copied to Old Space.

Hybrid Algorithm

HPP GC employs a hybrid algorithm combining mark-copy, mark-compact, and mark-sweep, tailored to the characteristics of young and old generation objects.

  • Mark-copy for young generation Given the short lifetimes, small size, and frequent collection of young generation objects, ArkTS Runtime uses the mark-copy algorithm to efficiently reclaim memory for these objects.
  • Mark-compact + mark-sweep for old generation Based on the characteristics of old generation objects, a heuristic collection set (CSet) algorithm is used. The fundamental idea behind this algorithm is to collect the sizes of live objects in each region during the marking phase. During the collection phase, ArkTS Runtime uses mark-compact for regions with fewer live objects and lower collection costs, but mark-sweep for the remaining regions.

The collection policies are as follows:

  • Identifies regions with live object sizes below a threshold, places them in the initial CSet, and sorts them by liveness (survival rate = live object size/region size).

  • Selects a final CSet based on a predefined number of regions for compaction.

  • Sweeps the remaining regions.

This heuristic approach combines the benefits of mark-compact and mark-sweep algorithms, addressing memory fragmentation while maintaining performance.

Process Optimization

HPP GC introduces extensive concurrency and parallelism optimizations to minimize the impact on application performance. The GC process includes concurrent and parallel marking, sweeping, evacuation, updating, and clearing tasks.

Heap Structure and Parameters

Heap Structure

image

  • Semi Space: space for storing young generation, that is, newly created objects with low survival rates. The copying algorithm is used to reclaim memory.
  • Old Space: space for storing old generation, that is, objects that survive multiple GC cycles. Multiple algorithms are used for memory reclamation.
  • Huge Object Space: dedicated regions for storing large objects.
  • Read Only Space: space for storing read-only data at runtime.
  • Non-Movable Space: space for storing non-movable objects.
  • Snapshot Space: space for storing heap snapshots.
  • Machine Code Space: space for storing machine codes.

Each space is managed in regions, which are the units requested from the memory allocator.

Parameters

NOTE

Parameters not marked as configurable are system-defined and cannot be adjusted by developers.

Based on the total heap size ranges (64 MB to 128 MB, 128 MB to 256 MB, or greater than 256 MB), the system sets different sizes for the following parameters. If a parameter has a single value in the range, it remains constant regardless of the total heap size. The default total heap size for mobile phones is greater than 256 MB. You can use related APIs to query memory information by referring to HiDebug API Reference.

Heap Size Parameters

Name Value or Value Range Description
HeapSize 448 MB Default total heap size for the main thread. The value is adjusted for low-memory devices based on the actual memory pool size.
SemiSpaceSize 2–4 MB/2–8 MB/2–16 MB Size of Semi Space.
NonmovableSpaceSize 2 MB/6 MB/64 MB Size of Non-Movable Space.
SnapshotSpaceSize 512 KB Size of Snapshot Space.
MachineCodeSpaceSize 2 MB Size of Machine Code Space.

Worker Thread Heap Limit

Name Value Description
HeapSize 768 MB Heap size for worker threads.

Parameters of Semi Space

The heap contains two Semi Spaces for copying.

Name Value or Value Range Description
semiSpaceSize 2–4 MB/2–8 MB/2–16 MB Size of Semi Space. The value varies according to the total heap size.
semiSpaceTriggerConcurrentMark 1 M/1.5 M/1.5 M Threshold for independently triggering concurrent marking in Semi Space for the first time.
semiSpaceStepOvershootSize 2 MB Maximum overshoot size of Semi Space.

Parameters of Old Space and Huge Object Space

Both spaces are initialized to the remaining unallocated heap size. By default, the upper limit of OldSpaceSize of the main thread on mobile phones approaches 350 MB.

Name Value or Value Range Description
oldSpaceOvershootSize 4 MB/8 MB/8 MB Maximum overshoot size of Old Space.

Parameters of Other Spaces

Name Value or Value Range Description
defaultReadOnlySpaceSize 256 KB Default size of Read Only Space.
defaultNonMovableSpaceSize 2 MB/6 MB/64 MB Default size of Non-Movable Space.
defaultSnapshotSpaceSize 512 KB/512 KB/ 4 MB Default size of Snapshot Space.
defaultMachineCodeSpaceSize 2 MB/2 MB/8 MB Default size of Machine Code Space.

Interpreter Stack Size

Name Value or Value Range Description
maxStackSize 128 KB Maximum size of the interpreter stack.

Concurrency Parameters

Name Value Description
gcThreadNum 7 Number of GC threads. The default value is 7. You can set this parameter using gc-thread-num.
MIN_TASKPOOL_THREAD_NUM 3 Minimum number of threads in the thread pool.
MAX_TASKPOOL_THREAD_NUM 7 Maximum number of threads in the thread pool.

Note: The thread pool is used to execute concurrent tasks in the GC process. During thread pool initialization, all the three parameters need to be considered. If gcThreadNum is negative, the number of threads in the initialized thread pool is the number of CPU cores divided by 2.

Other Parameters

Name Value Description
minAllocLimitGrowingStep 2 M/4 M/8 M Minimum growth step of oldSpace, heapObject, and globalNative when the heap size is recalculated.
minGrowingStep 4 M/8 M/16 M Minimum growth step of oldSpace.
longPauseTime 40 ms Threshold for identifying long GC pauses, which trigger detailed GC log printing for analysis. It can be set using gc-long-paused-time.

Additional: The native total memory limit for ArrayBuffer within a single VM is set to 4 GB.

GC Process

image

Types of HPP GC

Young GC

  • When to trigger: The young GC threshold ranges from 2 MB to 16 MB, and it can be adjusted dynamically based on the allocation speed and survival rate.
  • Description: primarily collects newly allocated objects in Semi Space.
  • Scenario: foreground
  • Log keywords: [ HPP YoungGC ]

Old GC

  • When to trigger: The old GC threshold ranges from 20 MB to 300 MB. Typically, the threshold of the first old GC is about 20 MB, and the threshold for subsequent old GC operations is adjusted based on the survival rate and memory usage.
  • Description: compacts and compresses the young generation space and parts of the old generation space while sweeping other spaces. It occurs less frequently than young GC, with a longer duration (approximately 5 ms to 10 ms) due to full marking.
  • Scenario: foreground
  • Log keywords: [ HPP OldGC ]

Full GC

  • When to trigger: Full GC is not triggered based on the memory threshold. After the application transitions to the background, full GC is triggered if the predicted reclaimable object size exceeds 2 MB. You can also trigger full GC using the DumpHeapSnapshot and AllocationTracker tools or calling native interfaces and ArkTS interfaces.
  • Description: fully compacts both young and old generations, maximizing memory reclamation in performance-insensitive scenarios.
  • Scenario: background
  • Log keywords: [ CompressGC ]

Subsequent Smart GC or IDLE GC selections are made from the above three types of GC.

Trigger Strategies

Space Threshold Triggering

  • Functions: AllocateYoungOrHugeObject, AllocateHugeObject, and other allocation-related functions
  • Restriction parameters: corresponding space thresholds
  • Description: GC is triggered when object allocation reaches the space threshold.
  • Log keywords: GCReason::ALLOCATION_LIMIT

Native Binding Size Triggering

  • Functions: GlobalNativeSizeLargerThanLimit
  • Restriction parameters: globalSpaceNativeLimit
  • Description: It affects the decision for performing full marking and concurrent marking.

Background Switch Triggering

  • Functions: ChangeGCParams
  • Description: Full GC is triggered when the application switches to the background.
  • Log keywords: app is inBackground, app is not inBackground, and GCReason::SWITCH_BACKGROUND

Execution Strategies

Concurrent Marking

  • Function: TryTriggerConcurrentMarking
  • Description: attempts to trigger concurrent marking and delegate the task of marking objects to the thread pool to reduce the suspension time of the UI main thread.
  • Log keywords: fullMarkRequested, trigger full mark, Trigger the first full mark, Trigger full mark, Trigger the first semi mark, and Trigger semi mark

Adjusting Thresholds Before and After New Space GC

  • Function: AdjustCapacity
  • Description: adjusts the Semi Space trigger threshold after GC to optimize space structure.
  • Log keywords: There is no direct log. The GC statistics logs show dynamic adjustments to young space thresholds before and after GC.

Adjusting Threshold After First Old GC

  • Function: AdjustOldSpaceLimit
  • Description: adjusts the Old Space threshold based on the minimum growth step and average survival rate.
  • Log keyword: AdjustOldSpaceLimit

Adjusting Old Space/Global Space Thresholds and Growth Factors After Subsequent Old GCs

  • Function: RecomputeLimits
  • Description: recalculates and adjusts newOldSpaceLimit, newGlobalSpaceLimit, globalSpaceNativeLimit, and growth factors based on current GC statistics.
  • Log keyword: RecomputeLimits

CSet Selection Strategies for Partial GC

  • Function: OldSpace::SelectCSet()
  • Description: selects regions with fewer live objects and lower collection costs for partial GC.
  • Typical Logs
    • Select CSet failure: number is too few
    • Max evacuation size is 6_MB. The CSet region number
    • Select CSet success: number is

SharedHeap

Shared Heap Structure

image

  • Shared Old Space: shared space for storing general shared objects. The young generation and old generation are not distinguished in the shared heap.
  • Shared Huge Object Space: shared space for storing large objects. A separate region is used for each large object.
  • Shared Read Only Space: shared space for storing read-only data at runtime.
  • Shared Non-Movable Space: shared space for storing non-movable objects.

Note: The shared heap is designed for objects shared across threads to improve efficiency and save memory. It does not belong to any single thread and stores objects with shared value. It typically has higher survival rates and does not involve Semi Space.

Features

Smart GC

Description

In performance-sensitive scenarios, the GC trigger threshold of the thread is temporarily adjusted to the maximum heap size (448 MB for the main thread by default), minimizing GC-triggered frame drops. (Smart GC does not take effect for the Worker thread and TaskPool thread.) However, if a performance-sensitive scenario persists too long and object allocation reaches the maximum heap size, GC is triggered, potentially resulting in longer GC times due to accumulated objects.

Performance-Sensitive Scenarios

  • Application cold start
  • Application scrolling
  • Page transitions via clicks
  • Jumbo frame

Currently, this feature is managed by the system, and third-party applications do not have APIs to directly call this feature.

Log keyword: SmartGC

Interaction Process

image

Mark performance-sensitive scenarios by tagging the heap upon entry and exit to avoid unnecessary GCs and maintain high performance.

Log Interpretation

Enabling Full Logs

By default, detailed GC logs are printed only when GC duration exceeds 40 ms. To enable logs for all GC executions, run commands on the device.

Example

# Enable full GC logs with parameter 0x905d. Disable full GC logs and revert to the default value (0x105c).
hdc shell param set persist.ark.properties 0x905d
# Reboot to apply changes.
hdc shell reboot

Typical Logs

The following logs represent a complete GC execution, with variations based on the type of GC. You can search for the keyword [gc] in the exported log file to view GC-related logs. You can also check for the keyword ArkCompiler to view more comprehensive VM-related logs.

// Pre-GC object size (region size) -> Post-GC object size (region size), total duration (+concurrentMark duration), GC trigger reason.
C03F00/ArkCompiler: [gc]  [ CompressGC ] 26.1164 (35) -> 7.10049 (10.5) MB, 160.626(+0)ms, Switch to background
// Various states during GC execution and application name.
C03F00/ArkCompiler: [gc] IsInBackground: 1; SensitiveStatus: 0; OnStartupEvent: 0; BundleName: com.example.demo;
// Duration statistics for each GC phase.
C03F00/ArkCompiler: [gc] /***************** GC Duration statistic: ****************/
C03F00/ArkCompiler: [gc] TotalGC:                 160.626 ms
C03F00/ArkCompiler: Initialize:              0.179   ms
C03F00/ArkCompiler: Mark:                    159.204 ms
C03F00/ArkCompiler: MarkRoots:               6.925   ms
C03F00/ArkCompiler: ProcessMarkStack:        158.99  ms
C03F00/ArkCompiler: Sweep:                   0.957   ms
C03F00/ArkCompiler: Finish:                  0.277   ms
// Memory usage statistics after GC.
C03F00/ArkCompiler: [gc] /****************** GC Memory statistic: *****************/
C03F00/ArkCompiler: [gc] AllSpaces        used:  7270.9KB     committed:   10752KB
C03F00/ArkCompiler: ActiveSemiSpace  used:       0KB     committed:     256KB
C03F00/ArkCompiler: OldSpace         used:  4966.9KB     committed:    5888KB
C03F00/ArkCompiler: HugeObjectSpace  used:    2304KB     committed:    2304KB
C03F00/ArkCompiler: NonMovableSpace  used:       0KB     committed:    2304KB
C03F00/ArkCompiler: MachineCodeSpace used:       0KB     committed:       0KB
C03F00/ArkCompiler: HugeMachineCodeSpace used:       0KB     committed:       0KB
C03F00/ArkCompiler: SnapshotSpace    used:       0KB     committed:       0KB
C03F00/ArkCompiler: AppSpawnSpace    used: 4736.34KB     committed:    4864KB
C03F00/ArkCompiler: [gc] Anno memory usage size:  45      MB
C03F00/ArkCompiler: Native memory usage size:2.99652 MB
C03F00/ArkCompiler: NativeBindingSize:       0.577148KB
C03F00/ArkCompiler: ArrayBufferNativeSize:   0.0117188KB
C03F00/ArkCompiler: RegExpByteCodeNativeSize:0.280273KB
C03F00/ArkCompiler: ChunkNativeSize:         19096   KB
C03F00/ArkCompiler: [gc] Heap alive rate:         0.202871
// Summary statistics for this type of GC in the VM.
C03F00/ArkCompiler: [gc] /***************** GC summary statistic: *****************/
C03F00/ArkCompiler: [gc] CompressGC occurs count  6
C03F00/ArkCompiler: CompressGC max pause:    2672.33 ms
C03F00/ArkCompiler: CompressGC min pause:    160.626 ms
C03F00/ArkCompiler: CompressGC average pause:1076.06 ms
C03F00/ArkCompiler: Heap average alive rate: 0.635325
  • GC type, which can be [HPP YoungGC], [HPP OldGC], [CompressGC], and [SharedGC].
  • TotalGC: total duration. The following lists the duration for each phase, including Initialize, Mark, MarkRoots, ProcessMarkStack, Sweep, and Finish. The actual phases may vary depending on the GC process.
  • IsInBackground: specifies whether the application is in the background (1) or foreground (0).
  • SensitiveStatus: specifies whether the application is in a sensitive scenario (1) or not (0).
  • OnStartupEvent: specifies whether the application is in a cold start scenario (1) or not (0).
  • used: actual memory usage of allocated objects.
  • committed: actual memory allocated to the heap. Since memory spaces are allocated in regions that are not always fully utilized by objects, committed is greater than or equal to used. For Huge Space, these values are equal because each object occupies a separate region.
  • Anno memory usage size: total memory usage of all heaps in the process, including heap and shared heap.
  • Native memory usage size: total native memory usage of the process.
  • NativeBindingSize: native memory usage of objects bound to the heap.
  • ArrayBufferNativeSize: native memory usage of array buffers requested by the process.
  • RegExpByteCodeNativeSize: native memory usage of regular expression bytecode requested by the process.
  • ChunkNativeSize: native memory usage of chunks requested by the process.
  • Heap alive rate: survival rate of objects in the heap.

GC Developer Debugging Interfaces

NOTE

The following interfaces are for debugging purposes only and are not official SDK interfaces. They should not be used in production applications.

ArkTools.hintGC()

  • Invocation: ArkTools.hintGC()
  • Type: ArkTS interface
  • Description: triggers the VM to assess whether a full GC should be executed. Full GC is initiated in the background or if the expected memory survival rate is below a threshold. It will not trigger in sensitive scenarios.
  • Use case: developers prompting the system to perform GC.
  • Log keywords: There is no direct log. Only external trigger (GCReason::TRIGGER_BY_JS) can be found.

Usage example:

// Declare the interface first.
declare class ArkTools {
     static hintGC(): void;
}

@Entry
@Component
struct Index {
  @State message: string = 'Hello World';
  build() {
  Row() {
    Column() {
      Text(this.message)
        .fontSize(50)
        .fontWeight(FontWeight.Bold)
      Button("Trigger Hint GC").onClick((event: ClickEvent) => {
          ArkTools.hintGC(); // Call the method directly.
      })
    }
    .width('100%')
  }
  .height('100%')
}
}

你可能感兴趣的鸿蒙文章

harmony 鸿蒙ArkTS

harmony 鸿蒙Configuring arkOptions in build-profile.json5

harmony 鸿蒙Asynchronous Lock

harmony 鸿蒙Ark Bytecode File Format

harmony 鸿蒙Naming Conventions for Ark Bytecode Functions

harmony 鸿蒙Ark Bytecode Fundamentals

harmony 鸿蒙Overview of Ark Bytecode

harmony 鸿蒙Shared Container

harmony 鸿蒙Asynchronous Waiting

harmony 鸿蒙ArkTS Cross-Language Interaction

0  赞