Skip to content
Permalink

Comparing changes

Choose two branches to see what’s changed or to start a new pull request. If you need to, you can also or learn more about diff comparisons.

Open a pull request

Create a new pull request by comparing changes across two branches. If you need to, you can also . Learn more about diff comparisons here.
base repository: AnswerDotAI/gpu.cpp
Failed to load repositories. Confirm that selected base ref is valid, then try again.
Loading
base: main
Choose a base ref
...
head repository: openfluke/gpu.cpp
Failed to load repositories. Confirm that selected head ref is valid, then try again.
Loading
compare: main
Choose a head ref
Checking mergeability… Don’t worry, you can still create the pull request.
  • 4 commits
  • 12 files changed
  • 1 contributor

Commits on May 17, 2025

  1. looks promising i think

    (base) samuel@Steamy:~/git/gpu.cpp/examples/hello_world$ ./build/benchmark_forward_layer
    CPU forwardLayer: 5.04969 ms
    GPU forwardLayer: 2.32466 ms
    Mean abs diff: 1.63608e-06
    Max  abs diff: 2.67029e-05
    openfluke-dev committed May 17, 2025
    Configuration menu
    Copy the full SHA
    8a0b63d View commit details
    Browse the repository at this point in the history
  2. amazing stuff

    (base) samuel@Steamy:~/git/gpu.cpp/examples/hello_world$ clang++ -std=c++17 -I../.. -I../../third_party/headers -L../../third_party/lib benchmark_forward_layer_big.cpp -ldl -lwebgpu_dawn -DNO_LOG -o build/benchmark_forward_layer_big
    ./build/benchmark_forward_layer_big
    
    --- Benchmarking inSize: 256, outSize: 128 ---
    CPU forwardLayer avg: 0.0801554 ms (best: 0.079726 ms)
    GPU forwardLayer avg: 0.253004 ms (best: 0.200489 ms)
    Mean abs diff: 2.88826e-07, Max abs diff: 1.90735e-06
    
    --- Benchmarking inSize: 1024, outSize: 512 ---
    CPU forwardLayer avg: 1.26819 ms (best: 1.24063 ms)
    GPU forwardLayer avg: 0.559954 ms (best: 0.482283 ms)
    Mean abs diff: 8.38525e-07, Max abs diff: 7.62939e-06
    
    --- Benchmarking inSize: 4096, outSize: 2048 ---
    CPU forwardLayer avg: 20.5462 ms (best: 20.2136 ms)
    GPU forwardLayer avg: 2.14954 ms (best: 1.33978 ms)
    Mean abs diff: 2.47097e-06, Max abs diff: 2.67029e-05
    
    --- Benchmarking inSize: 8192, outSize: 4096 ---
    CPU forwardLayer avg: 82.5799 ms (best: 82.1086 ms)
    GPU forwardLayer avg: 27.9018 ms (best: 25.2048 ms)
    Mean abs diff: 4.28932e-06, Max abs diff: 5.34058e-05
    openfluke-dev committed May 17, 2025
    Configuration menu
    Copy the full SHA
    8cac867 View commit details
    Browse the repository at this point in the history

Commits on May 26, 2025

  1. Configuration menu
    Copy the full SHA
    4195857 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    1bf2d19 View commit details
    Browse the repository at this point in the history
Loading