CPU 100%: Troubleshooting and Optimization in Practice

Author: @小白创作中心

Source: CSDN

https://blog.csdn.net/gaosw0521/article/details/144935416

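1 Background

The load on a server rose abnormally. Initial checks showed that only a single Java application was running on that server, so the engineers started a detailed investigation and optimization effort.

2 Troubleshooting steps

2.1 Get the process information

First, use the ps command to find the application's PID: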

ps -ef | grep java

2.2 Check per-thread CPU usage

Use the top command to list the threads of that process, sorted by CPU usage (press uppercase P):

top -Hp <pid>

Some threads showed CPU usage as high as 99.9%.
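As a complement to top -Hp, per-thread CPU time can also be read from inside the JVM through the standard java.lang.management API. The sketch below is not part of the original investigation: the class name ThreadCpuTop and the Row type are hypothetical, and the code only sees the JVM it runs in, so it would have to be embedded in the application (or reached over JMX) rather than run from a separate shell.

```java
import java.lang.management.ManagementFactory;
import java.lang.management.ThreadInfo;
import java.lang.management.ThreadMXBean;
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

// Hypothetical helper: lists the threads of the current JVM by accumulated CPU time.
public class ThreadCpuTop {

    /** One report row: thread name plus accumulated CPU time in nanoseconds. */
    public static final class Row {
        public final String name;
        public final long cpuNanos;

        Row(String name, long cpuNanos) {
            this.name = name;
            this.cpuNanos = cpuNanos;
        }
    }

    /** Returns the live threads of this JVM, sorted by CPU time, highest first. */
    public static List<Row> snapshot() {
        ThreadMXBean mx = ManagementFactory.getThreadMXBean();
        List<Row> rows = new ArrayList<>();
        if (!mx.isThreadCpuTimeSupported()) {
            return rows; // this JVM cannot report per-thread CPU time
        }
        for (long id : mx.getAllThreadIds()) {
            long cpu = mx.getThreadCpuTime(id);      // -1 if the thread is gone or measurement is disabled
            ThreadInfo info = mx.getThreadInfo(id);  // null if the thread is gone
            if (cpu >= 0 && info != null) {
                rows.add(new Row(info.getThreadName(), cpu));
            }
        }
        rows.sort(Comparator.comparingLong((Row r) -> r.cpuNanos).reversed());
        return rows;
    }

    public static void main(String[] args) {
        for (Row r : snapshot()) {
            System.out.printf("%-40s %d ms%n", r.name, r.cpuNanos / 1_000_000);
        }
    }
}
```

Unlike top, which shows the current CPU percentage, this reports CPU time accumulated since each thread started, so two snapshots taken a few seconds apart are needed to see which threads are busy right now.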

2.3 Export the thread stacks

For further analysis, use the jstack command to dump the thread stacks to a log file:

jstack <pid> > pid.log

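2.4 Analyze the thread stacks

Pick any one of the threads running at 99.9% CPU (for example, the one top lists as pid=194283; with -H this column holds the thread ID), convert that ID to hexadecimal (2f6eb), and look up the matching thread in the thread dump, where it appears in the nid field as 0x2f6eb. The threads found this way were all related to the Disruptor queue, and all of them were executing java.lang.Thread.yield.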
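The decimal-to-hexadecimal step and the lookup in the dump can be sketched in a few lines of Java. This is only an illustration: the class name NidLookup and its methods are hypothetical, and it assumes the classic HotSpot jstack header format in which each thread line carries a hexadecimal nid=0x... field.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Pattern;

// Hypothetical helper: maps a thread ID from top to the nid used in a jstack dump.
public class NidLookup {

    /** Formats a decimal thread ID as the hexadecimal nid value, e.g. 194283 -> 0x2f6eb. */
    public static String toNid(long tid) {
        return "0x" + Long.toHexString(tid);
    }

    /** Returns the dump lines whose nid field matches the given thread ID exactly. */
    public static List<String> findThreadHeaders(List<String> dumpLines, long tid) {
        // Word boundaries keep 0x2f6eb from also matching a longer ID such as 0x2f6ebc.
        Pattern nid = Pattern.compile("\\bnid=" + toNid(tid) + "\\b");
        List<String> hits = new ArrayList<>();
        for (String line : dumpLines) {
            if (nid.matcher(line).find()) {
                hits.add(line);
            }
        }
        return hits;
    }

    public static void main(String[] args) {
        System.out.println(toNid(194283)); // prints 0x2f6eb
    }
}
```

In practice the lines of pid.log would be read with Files.readAllLines and passed to findThreadHeaders; the stack frames that follow each matching header line show what the thread was doing.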

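2.5 Use an analysis tool

To get a clearer view of the thread states, upload the thread dump to fastthread.io for analysis. The report shows that almost all of the CPU-consuming threads are related to the Disruptor queue, and all of them are executing the yield method.

2.6 Preliminary conclusion

The preliminary conclusion is that a large number of threads call yield and then compete with one another for the CPU, which drives CPU usage up. The stack analysis confirms that the problem is indeed related to Disruptor.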
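Why a yield loop shows up as high CPU can be seen in a small experiment. The sketch below is a simplified stand-in, not Disruptor's own code: the class name YieldSpinDemo is hypothetical, and the loop merely imitates a consumer that polls a flag and calls Thread.yield() while it has nothing to do. Because yield is only a hint to the scheduler, the thread never blocks and stays in the RUNNABLE state the whole time.

```java
import java.util.concurrent.atomic.AtomicBoolean;

// Hypothetical demo: a thread that waits by spinning on Thread.yield().
public class YieldSpinDemo {

    /** Starts a yield-spinning thread, samples its state after 100 ms, then stops it. */
    public static Thread.State observeSpinnerState() throws InterruptedException {
        AtomicBoolean stop = new AtomicBoolean(false);
        Thread spinner = new Thread(() -> {
            while (!stop.get()) {
                Thread.yield(); // gives up the time slice but never parks or sleeps
            }
        }, "yield-spinner");
        spinner.setDaemon(true);
        spinner.start();

        Thread.sleep(100);                      // let the spinner run for a moment
        Thread.State state = spinner.getState(); // sampled while the loop is still spinning

        stop.set(true);
        spinner.join();
        return state;
    }

    public static void main(String[] args) throws InterruptedException {
        System.out.println("State while waiting: " + observeSpinnerState());
    }
}
```

With one such thread per core or more, each of them is rescheduled as soon as it yields, which is consistent with the near-100% per-thread readings seen in top.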

3 How Disruptor is used

3.1 Add the dependency

Add the Disruptor dependency to pom.xml:

<dependency>
    <groupId>com.lmax</groupId>
    <artifactId>disruptor</artifactId>
    <version>3.4.2</version>
</dependency>

3.2 Define the event

Define the event class LongEvent:

public static class LongEvent {
    private long value;
    public void set(long value) {
        this.value = value;
    }
    @Override
    public String toString() {
        return "LongEvent{value=" + value + '}';
    }
}

3.3 Define the event factory

Define the event factory LongEventFactory:

public static class LongEventFactory implements EventFactory<LongEvent> {
    @Override
    public LongEvent newInstance() {
        return new LongEvent();
    }
}

3.4 Define the event handler

Define the event handler LongEventHandler:

public static class LongEventHandler implements EventHandler<LongEvent> {
    @Override
    public void onEvent(LongEvent event, long sequence, boolean endOfBatch) {
        System.out.println("Event: " + event);
    }
}

3.5 Define the event publisher

Define the event publisher:

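// Imports needed by the snippets in this section (top of the enclosing source file):
// import com.lmax.disruptor.EventFactory;
// import com.lmax.disruptor.EventHandler;
// import com.lmax.disruptor.RingBuffer;
// import com.lmax.disruptor.dsl.Disruptor;
// import java.nio.ByteBuffer;
// import java.util.concurrent.Executors;
public static void main(String[] args) throws InterruptedException {
    // Size of the ring buffer
    int bufferSize = 1024;
    // Build the Disruptor
    Disruptor<LongEvent> disruptor = new Disruptor<>(
            new LongEventFactory(),
            bufferSize,
            Executors.defaultThreadFactory());
    // Connect the event handler
    disruptor.handleEventsWith(new LongEventHandler());
    // Start the Disruptor
    disruptor.start();
    // Get the ring buffer
    RingBuffer<LongEvent> ringBuffer = disruptor.getRingBuffer();
    // Produce events: one per second, carrying the values 0..99
    ByteBuffer bb = ByteBuffer.allocate(8);
    for (long l = 0; l < 100; l++) {
        bb.putLong(0, l);
        ringBuffer.publishEvent((event, sequence, buffer) -> event.set(buffer.getLong(0)), bb);
        Thread.sleep(1000);
    }
    // Shut down the Disruptor
    disruptor.shutdown();
}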

A brief explanation: