netty 4.0.24版本Direct Memory Leak

現象

top顯示常駐內存已經達到14G，而JVM本身的內存佔用不高 -XMX 配置的是4096M.

分析

jmap -heap pid

jvm本身是沒有問題的，而且應用表現也沒有什麼異常，但機器的內存已經佔用很高，觸發了機器監控的

內存報警.因爲這個應用使用了netty，因爲初步分析應該是有Direct Memory 沒有回收處理。

採用pmap 查看進程內存情況

pmap -x pid

發現有大量的131072k 即128M。這樣的內存地址有77個，應該這就是爲什麼內存一直高的原因了。

接着找到是誰怎麼創建的這些堆外內存，首先懷疑的重點對象就是netty了。netty 這個高性能的nio框架

會申請堆外內存。但不能去全盤看netty的源碼，這不太可能，所以只能先看看這些內存裏面是存的是什

麼東西，看看能否有什麼線索。採用gdb這個工具來嘗試一把

gdb --pid pid

然後執行

dump binary memory result.bin 0x00007f2908000000 0x00007f2910000000

會有一個result.bin的文件，然後通過hexdump 進行查看發現如下數據

內存放的大量的

Error while read (...): Connection reset by peer.

這個就是解決問題的突破點了

定位

來怎麼來定位是netty哪塊代碼執行會導致這樣的大量的堆外內存了，根據其存在的字符串分析應該是

在讀取數據的時候發送斷連接斷掉了。查看netty源碼，checkout 4.0 reset到4.0.24 final版本

全文search Error while read (…): Connection reset by peer。然後在io_netty_channel_epoll_Native.c中看到如下代碼

jint read0(JNIEnv * env, jclass clazz, jint fd, void *buffer, jint pos, jint limit) {
    ssize_t res;
    int err;
    do {
        res = read(fd, buffer + pos, (size_t) (limit - pos));
        // Keep on reading if we was interrupted
    } while (res == -1 && ((err = errno) == EINTR));

    if (res < 0) {
        if (err == EAGAIN || err == EWOULDBLOCK) {
            // Nothing left to read
            return 0;
        }
        if (err == EBADF) {
            throwClosedChannelException(env);
            return -1;
        }
        throwIOException(env, exceptionMessage("Error while read(...): ", err));
        return -1;
    }



void throwIOException(JNIEnv *env, char *message) {
    (*env)->ThrowNew(env, ioExceptionClass, message);
}

問題很可能就是在ThrowNew這個異常這裏了 google下，發現如下

fixing small leak on exception on the transport-epoll-native allocation
Motivation:
the JNI function ThrowNew won’t release any allocated memory.
The method exceptionMessage is allocating a new string concatenating 2 constant strings
What is creating a small leak in case of these exceptions are happening.
Modifications:
Added new methods that will use exceptionMessage and free resources accordingly.
I am also removing the inline definition on these methods as they could be reused by
other added modules (e.g. libaio which should be coming soon)
Result:
No more leaks in case of failures.

同時在netty 的issuse中也能找到對應的記錄但作者似乎說這是個優化改進

解決

出現這個問題的場景是在開啓epoll而且有特別多的這種connection reset的現象

解決就很簡單了升級netty版本

其他

在分析解決這個問題的過程中google發現一篇帖子，現象和我遇到的場景很像

地址作者通過優化glibc的環境變量能優化一些堆外內存的佔用，但個人覺得這個應該不是解決問題的根本點

java進程消耗內存大於xmx配置值情況

Garbage collection. As you might recall, Java is a garbage-collected language. In order for the garbage collector to know which objects are eligible for collection, it needs to keep track of the object graphs. So this is one part of the memory lost to this internal bookkeeping. G1 is especially known for its excessive appetite for additional memory, so be aware of this.

JIT optimization. Java Virtual Machine optimizes the code during runtime. Again, to know which parts to optimize it needs to keep track of the execution of certain code parts. So again, you are going to lose memory

Off-heap allocations. If you happen to use off-heap memory, for example while using direct or mapped ByteBuffers yourself or via some clever 3rd party API then voila – you are extending your heap to something you actually cannot control via JVM configuration

JNI code. When you are using native code, for example in the format of Type 2 database drivers, then again you are loading code in the native memory.

Metaspace. If you are an early adopter of Java 8, you are using metaspace instead of the good old permgen to store class declarations. This is unlimited and in a native part of the JVM.

netty 4.0.24版本Direct Memory Leak

現象

分析

定位

解決

其他

java進程消耗內存大於xmx配置值情況

再談23種設計模式（3）：行爲型模式（學習筆記）

Power Automate Desktop 安裝完，登錄後老是提示one driver 錯誤

微前端學習筆記(4):從微前端到微模塊之EMP與hel-micro方案探索

微前端學習筆記（1）：微前端總體架構概述，從微服務發微

985 碩士程序員，空窗 4 個月沒有 Offer！

一文搞懂 Spring 循環依賴

賽博鬥地主——使用大語言模型扮演Agent智能體玩牌類遊戲。

VScode右鍵打開(添加到右鍵)

記一次 .NET某工控視覺自動化系統卡死分析

WindowsServer--SQL Server搭建主從同步實現讀寫分離 - 事務性分發

JVM運行時的數據區域情況

最適合程序員口味的10部電影

HBase Files

Top 21 Java 版本的CMS

Hbase slides

Mac下配置sublime實現LaTeX

https://yachay.unat.edu.pe/blog/index.php?comment_area=format_blog&comment_component=blog&comment_co

linux以太網驅動總結