Reduce peak memory in `UMAP.fit`/`UMAP.fit_transform` #6323

jcrist · 2025-02-14T21:17:13Z

This reduces the peak GPU memory usage during a UMAP.fit call by releasing certain temporaries as soon as possible.

This required splitting the FuzzySimplSet::run function into a few sub-functions and intermingling them within _get_graph to let us drop the temporary arrays earlier. While I was at it, I noticed that _get_graph and _get_graph_supervised were almost identical, so I've merged these paths a bit to reduce duplication. There's a lot more duplication in here that could be cleaned up (and I went down that rabbit hole a bit before stopping myself), but for now I think this should be sufficient.

On an input of 190 GiB this dropped 12 GiB off the peak, and moved the high point into the SimplSetEmbed routines instead of in FuzzySimplSet. Further memory improvements will come in a later PR.

By dropping intermediate objects earlier we can reduce peak memory usage. Also reduces some code duplication between `_get_graph` and `_get_graph_supervised`.

copy-pr-bot · 2025-02-14T21:17:16Z

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

jcrist · 2025-02-15T02:46:30Z

Hold off on merging this - the commits ran fine on a 270 GiB dataset yesterday, but after rebasing on #6314 there's now an underflow happening on large data (I think from inputs.n now being an int instead of a uint64_t). I'll take a look on Tuesday morning, I suspect something in that PR got mistakenly dropped when I rebased.

jcrist · 2025-02-17T16:14:23Z

I suspect something in that PR got mistakenly dropped when I rebased.

This is not a bug in the change here, but rather a regression added in #6314. See my comment here: #6314 (comment).

jcrist added 2 commits February 14, 2025 16:43

Split up FuzzySimplSet into a few subroutines

fd1c024

Reduce memory in _get_graph/_get_graph_supervised

f01e98c

By dropping intermediate objects earlier we can reduce peak memory usage. Also reduces some code duplication between `_get_graph` and `_get_graph_supervised`.

jcrist marked this pull request as ready for review February 14, 2025 21:17

jcrist requested a review from a team as a code owner February 14, 2025 21:17

jcrist requested review from tfeher and lowener February 14, 2025 21:17

github-actions bot added the CUDA/C++ label Feb 14, 2025

jcrist added improvement Improvement / enhancement to an existing function non-breaking Non-breaking change labels Feb 14, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reduce peak memory in `UMAP.fit`/`UMAP.fit_transform` #6323

Reduce peak memory in `UMAP.fit`/`UMAP.fit_transform` #6323

jcrist commented Feb 14, 2025

copy-pr-bot bot commented Feb 14, 2025

jcrist commented Feb 15, 2025 •

edited

Loading

jcrist commented Feb 17, 2025

Reduce peak memory in UMAP.fit/UMAP.fit_transform #6323

Are you sure you want to change the base?

Reduce peak memory in UMAP.fit/UMAP.fit_transform #6323

Conversation

jcrist commented Feb 14, 2025

copy-pr-bot bot commented Feb 14, 2025

jcrist commented Feb 15, 2025 • edited Loading

jcrist commented Feb 17, 2025

Reduce peak memory in `UMAP.fit`/`UMAP.fit_transform` #6323

Reduce peak memory in `UMAP.fit`/`UMAP.fit_transform` #6323

jcrist commented Feb 15, 2025 •

edited

Loading