Separate stats filtering helpers to reuse in page pruning #18034
base: branch-25.04
@@ -740,4 +462,157 @@ std::optional<std::vector<std::vector<size_type>>> collect_filtered_row_group_in
return {filtered_row_group_indices};
}

// All required templates for `stats_caster_base` and `stats_caster_base::host_column<T>`
Moved definitions as-is. All required templates for row group or page pruning are instantiated here as well. Definitions moved to stats_filter_helpers.hpp
as it seems like it doesn't instantiate all templates needed for page pruning
size_type total_row_groups;
std::vector<metadata> const& per_file_metadata;
host_span<std::vector<size_type> const> row_group_indices;

template <typename ToType, typename FromType>
Moved to stats_filter_helpers.hpp
@@ -248,146 +110,6 @@ struct stats_caster {
}
};

/**
Moved completely to stats_filter_helpers.hpp/cpp
Co-authored-by: David Wendt <[email protected]>
Looks good. Some minor comments.
Co-authored-by: David Wendt <[email protected]>
LGTM 👍
Any failing GTests are fixed by #18019

#include "io/parquet/parquet_common.hpp"

#include <cudf/ast/detail/expression_transformer.hpp>
Moved the definitions for stats_caster_base here because not all templates are instantiated in one place.
Description
Contributes to #17896
This PR separates the stats-based filtering helpers so they can be reused for page pruning with the statistics stored in the Parquet PageIndex.
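For readers unfamiliar with stats-based pruning, here is a rough sketch of the idea behind sharing the helpers. It is not the code in this PR: `minmax_stats`, `may_contain_greater_than` and `surviving_units` are hypothetical names, the statistics are plain `int64_t` min/max pairs, and the only predicate handled is `column > literal`. The point is that the same check applies whether a "unit" is a row group or a page.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical min/max statistics for one unit of data (a row group or a page).
struct minmax_stats {
  std::int64_t min;
  std::int64_t max;
};

// Hypothetical check for the predicate `column > literal`.
// A unit can contain matching rows only if its maximum exceeds the literal.
inline bool may_contain_greater_than(minmax_stats const& s, std::int64_t literal)
{
  assert(s.min <= s.max);  // statistics are assumed to be well formed
  return s.max > literal;
}

// Returns the indices of the units that the statistics cannot rule out.
// Units that are not returned can be skipped without reading their data.
inline std::vector<std::size_t> surviving_units(std::vector<minmax_stats> const& stats,
                                                std::int64_t literal)
{
  std::vector<std::size_t> kept;
  for (std::size_t i = 0; i < stats.size(); ++i) {
    if (may_contain_greater_than(stats[i], literal)) { kept.push_back(i); }
  }
  return kept;
}
```

In this sketch, `surviving_units` would be called once with per-row-group statistics and then again with the per-page statistics of the surviving row groups, which is why it helps to keep such logic independent of the row-group code path.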
Checklist