fix: use grid-stride looping for kernels with variable-length loops #3130

ManasviGoyal · 2024-05-27T08:10:59Z

No description provided.

…rnels

ManasviGoyal · 2024-05-29T18:07:42Z

@jpivarski Since awkward_ListArray_getitem_next_range kernel is needed for use, I think this can be merged if the everything works fine. I can make a new PR to work on other kernels.

jpivarski

So far, I haven't been able to test it because of thousands of

E   cupy._util.PerformanceWarning: Jitify is performing a one-time only warm-up to populate the persistent cache, this may take a few seconds and will be improved in a future release...

warnings. #3113 was supposed to fix this, but it apparently isn't. I'll have to follow up again later.

ManasviGoyal · 2024-05-29T19:15:35Z

@jpivarski This seems to be an issue with the more recent versions of cupy. Maybe testing on a older version 12.x.x can be done so that we don't have to wait for fix.

jpivarski

I think CuPy 13 added a persistent cache to reduce the time spent compiling after the first compilation. In fact, the message is only presented the first time it encounters a kernel, so that must be what's going on. The messages might be helpful to end-users, but our test framework is set to consider warnings as errors. I added a filter to ignore these warnings when running the tests.

I think this PR is ready to be merged! I made an edit, but not to your code, so I don't think you'll have any counter-edits. I'll merge it as soon as the (CPU) tests pass.

Of course, I ran all of the GPU tests, and they all pass.

fix: use grid-stride looping

1d28a2f

ManasviGoyal added the gpu Concerns the GPU implementation (backend = "cuda') label May 27, 2024

ManasviGoyal temporarily deployed to docs May 27, 2024 08:28 — with GitHub Actions Inactive

feat: add awkward_ListArray_getitem_next_range_carrylength kernel

a0d15d3

ManasviGoyal temporarily deployed to docs May 27, 2024 14:16 — with GitHub Actions Inactive

feat: add awkward_ListArray_getitem_next_range kernel

6a2d9ea

ManasviGoyal temporarily deployed to docs May 28, 2024 10:32 — with GitHub Actions Inactive

test: add integration tests

ca12576

ManasviGoyal temporarily deployed to docs May 28, 2024 13:04 — with GitHub Actions Inactive

ManasviGoyal mentioned this pull request May 29, 2024

Slicing ak.Array in cuda backend breaks #3133

Open

Merge branch 'main' into ManasviGoyal/improve-variable-length-loop-ke…

4f44c63

…rnels

ManasviGoyal temporarily deployed to docs May 29, 2024 18:01 — with GitHub Actions Inactive

ManasviGoyal marked this pull request as ready for review May 29, 2024 18:07

jpivarski reviewed May 29, 2024

View reviewed changes

ignore 'Jitify is performing a one-time only warm-up' messages

b156e69

jpivarski approved these changes May 29, 2024

View reviewed changes

jpivarski enabled auto-merge (squash) May 29, 2024 20:19

jpivarski deployed to docs May 29, 2024 20:25 — with GitHub Actions View deployment

jpivarski merged commit 0b9f6f4 into main May 29, 2024
41 checks passed

jpivarski deleted the ManasviGoyal/improve-variable-length-loop-kernels branch May 29, 2024 20:27

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: use grid-stride looping for kernels with variable-length loops #3130

fix: use grid-stride looping for kernels with variable-length loops #3130

ManasviGoyal commented May 27, 2024

ManasviGoyal commented May 29, 2024

jpivarski left a comment

ManasviGoyal commented May 29, 2024

jpivarski left a comment

fix: use grid-stride looping for kernels with variable-length loops #3130

fix: use grid-stride looping for kernels with variable-length loops #3130

Conversation

ManasviGoyal commented May 27, 2024

ManasviGoyal commented May 29, 2024

jpivarski left a comment

Choose a reason for hiding this comment

ManasviGoyal commented May 29, 2024

jpivarski left a comment

Choose a reason for hiding this comment