為什么list.sort()比Stream().sorted()更快？

作者：技術(shù)老男孩 2023-09-14 15:48:53

開(kāi)發(fā) 測(cè)試

本文主要介紹關(guān)于 list.sort() 和 list.strem().sorted() 排序的差異，到底哪個(gè)性能更好一些。

看到一個(gè)評(píng)論，里面提到了list.sort()和list.strem().sorted()排序的差異。

說(shuō)到list.sort()排序比stream().sorted()排序性能更好。

但沒(méi)說(shuō)到為什么。

有朋友也提到了這一點(diǎn)。

本文重新開(kāi)始，先問(wèn)是不是，再問(wèn)為什么。

真的更好嗎?

先簡(jiǎn)單寫(xiě)個(gè) demo。

List<Integer> userList = new ArrayList<>();
    Random rand = new Random();
    for (int i = 0; i < 10000 ; i++) {
        userList.add(rand.nextInt(1000));
    }
    List<Integer> userList2 = new ArrayList<>();
    userList2.addAll(userList);

    Long startTime1 = System.currentTimeMillis();
    userList2.stream().sorted(Comparator.comparing(Integer::intValue)).collect(Collectors.toList());
    System.out.println("stream.sort耗時(shí)："+(System.currentTimeMillis() - startTime1)+"ms");

    Long startTime = System.currentTimeMillis();
    userList.sort(Comparator.comparing(Integer::intValue));
    System.out.println("List.sort()耗時(shí)："+(System.currentTimeMillis()-startTime)+"ms");

輸出：

stream.sort耗時(shí)：62ms
List.sort()耗時(shí)：7ms

由此可見(jiàn) list 原生排序性能更好。

能證明嗎?

不一定吧。

再把 demo 變換一下，先輸出stream.sort。

List<Integer> userList = new ArrayList<>();
Random rand = new Random();
for (int i = 0; i < 10000 ; i++) {
    userList.add(rand.nextInt(1000));
}
List<Integer> userList2 = new ArrayList<>();
userList2.addAll(userList);

Long startTime = System.currentTimeMillis();
userList.sort(Comparator.comparing(Integer::intValue));
System.out.println("List.sort()耗時(shí)："+(System.currentTimeMillis()-startTime)+"ms");

Long startTime1 = System.currentTimeMillis();
userList2.stream().sorted(Comparator.comparing(Integer::intValue)).collect(Collectors.toList());
System.out.println("stream.sort耗時(shí)："+(System.currentTimeMillis() - startTime1)+"ms");

此時(shí)輸出變成了：

List.sort()耗時(shí)：68ms
stream.sort耗時(shí)：13ms

這能證明上面的結(jié)論錯(cuò)誤了嗎?

都不能。

兩種方式都不能證明到底誰(shuí)更快。

使用這種方式在很多場(chǎng)景下是不夠的，某些場(chǎng)景下，JVM 會(huì)對(duì)代碼進(jìn)行 JIT 編譯和內(nèi)聯(lián)優(yōu)化。

Long startTime = System.currentTimeMillis();
...
System.currentTimeMillis() - startTime

此時(shí)，代碼優(yōu)化前后執(zhí)行的結(jié)果就會(huì)非常大。

基準(zhǔn)測(cè)試是指通過(guò)設(shè)計(jì)科學(xué)的測(cè)試方法、測(cè)試工具和測(cè)試系統(tǒng)，實(shí)現(xiàn)對(duì)一類(lèi)測(cè)試對(duì)象的某項(xiàng)性能指標(biāo)進(jìn)行定量的和可對(duì)比的測(cè)試。

基準(zhǔn)測(cè)試使得被測(cè)試代碼獲得足夠預(yù)熱，讓被測(cè)試代碼得到充分的 JIT 編譯和優(yōu)化。

下面是通過(guò) JMH 做一下基準(zhǔn)測(cè)試，分別測(cè)試集合大小在 100，10000，100000 時(shí)兩種排序方式的性能差異。

import org.openjdk.jmh.annotations.*;
import org.openjdk.jmh.infra.Blackhole;
import org.openjdk.jmh.results.format.ResultFormatType;
import org.openjdk.jmh.runner.Runner;
import org.openjdk.jmh.runner.RunnerException;
import org.openjdk.jmh.runner.options.Options;
import org.openjdk.jmh.runner.options.OptionsBuilder;

import java.util.*;
import java.util.concurrent.ThreadLocalRandom;
import java.util.concurrent.TimeUnit;
import java.util.stream.Collectors;

@BenchmarkMode(Mode.AverageTime)
@OutputTimeUnit(TimeUnit.MICROSECONDS)
@Warmup(iterations = 2, time = 1)
@Measurement(iterations = 5, time = 5)
@Fork(1)
@State(Scope.Thread)
public class SortBenchmark {
    @Param(value = {"100", "10000", "100000"})
    private int operationSize; 
    private static List<Integer> arrayList;
    public static void main(String[] args) throws RunnerException {
        // 啟動(dòng)基準(zhǔn)測(cè)試
        Options opt = new OptionsBuilder()
            .include(SortBenchmark.class.getSimpleName()) 
            .result("SortBenchmark.json")
            .mode(Mode.All)
            .resultFormat(ResultFormatType.JSON)
            .build();
        new Runner(opt).run(); 
    }
    @Setup
    public void init() {
        arrayList = new ArrayList<>();
        Random random = new Random();
        for (int i = 0; i < operationSize; i++) {
            arrayList.add(random.nextInt(10000));
        }
    }
    @Benchmark
    public void sort(Blackhole blackhole) {
        arrayList.sort(Comparator.comparing(e -> e));
        blackhole.consume(arrayList);
    }
    @Benchmark
    public void streamSorted(Blackhole blackhole) {
        arrayList = arrayList.stream().sorted(Comparator.comparing(e -> e)).collect(Collectors.toList());
        blackhole.consume(arrayList);
    }
}

性能測(cè)試結(jié)果：

可以看到，list.sort()效率確實(shí)比stream().sorted()要好。

為什么更好?

流本身的損耗

java 的 stream 讓我們可以在應(yīng)用層就可以高效地實(shí)現(xiàn)類(lèi)似數(shù)據(jù)庫(kù) SQL 的聚合操作了，它可以讓代碼更加簡(jiǎn)潔優(yōu)雅。

但是，假設(shè)我們要對(duì)一個(gè) list 排序，得先把 list 轉(zhuǎn)成 stream 流，排序完成后需要將數(shù)據(jù)收集起來(lái)重新形成 list，這部份額外的開(kāi)銷(xiāo)有多大呢?

我們可以通過(guò)以下代碼來(lái)進(jìn)行基準(zhǔn)測(cè)試：

import org.openjdk.jmh.annotations.*;
import org.openjdk.jmh.infra.Blackhole;
import org.openjdk.jmh.results.format.ResultFormatType;
import org.openjdk.jmh.runner.Runner;
import org.openjdk.jmh.runner.RunnerException;
import org.openjdk.jmh.runner.options.Options;
import org.openjdk.jmh.runner.options.OptionsBuilder;

import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;
import java.util.Random;
import java.util.concurrent.TimeUnit;
import java.util.stream.Collectors;

@BenchmarkMode(Mode.AverageTime)
@OutputTimeUnit(TimeUnit.MICROSECONDS)
@Warmup(iterations = 2, time = 1)
@Measurement(iterations = 5, time = 5)
@Fork(1)
@State(Scope.Thread)
public class SortBenchmark3 {
    @Param(value = {"100", "10000"})
    private int operationSize; // 操作次數(shù)
    private static List<Integer> arrayList;
    public static void main(String[] args) throws RunnerException {
        // 啟動(dòng)基準(zhǔn)測(cè)試
        Options opt = new OptionsBuilder()
            .include(SortBenchmark3.class.getSimpleName()) // 要導(dǎo)入的測(cè)試類(lèi)
            .result("SortBenchmark3.json")
            .mode(Mode.All)
            .resultFormat(ResultFormatType.JSON)
            .build();
        new Runner(opt).run(); // 執(zhí)行測(cè)試
    }

    @Setup
    public void init() {
        // 啟動(dòng)執(zhí)行事件
        arrayList = new ArrayList<>();
        Random random = new Random();
        for (int i = 0; i < operationSize; i++) {
            arrayList.add(random.nextInt(10000));
        }
    }

    @Benchmark
    public void stream(Blackhole blackhole) {
        arrayList.stream().collect(Collectors.toList());
        blackhole.consume(arrayList);
    }

    @Benchmark
    public void sort(Blackhole blackhole) {
        arrayList.stream().sorted(Comparator.comparing(Integer::intValue)).collect(Collectors.toList());
        blackhole.consume(arrayList);
    }
}

方法 stream 測(cè)試將一個(gè)集合轉(zhuǎn)為流再收集回來(lái)的耗時(shí)。

方法 sort 測(cè)試將一個(gè)集合轉(zhuǎn)為流再排序再收集回來(lái)的全過(guò)程耗時(shí)。

測(cè)試結(jié)果如下：