Decouple df.head() from the Cramer's computation

I got kind of surprise when I did the following display

```python
import skrub
from sklearn.datasets import fetch_california_housing

skrub.patch_display()
X, y = fetch_california_housing(return_X_y=True, as_frame=True)
X.head()
```

![image](https://github.com/user-attachments/assets/3489931e-b91e-48d0-b462-25db0a14d906)

It took some time to understand that the reason was due to the `X.head()` and that in this case, it was making sense.

I'm wondering if you should avoid computing all the different values when one call `X.head()` instead of showing the statistics on few line. It can be misleading.

An alternative is to compute the statistics on the full dataset instead even if a user request to check the `.head()`. However if you call `.head()` it might be only because you are interested of seeing the couple of first line of the dataframe without checking any other statistics.

@jeromedockes WDYT?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Decouple df.head() from the Cramer's computation #1179

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development