What is the problem or goal the end user is trying to solve or accomplish?
The customer wants to create charts with high-cardinality dimensions (e.g., email domains or product categories) limited to the top 10 categories but still display the percentage contribution of these categories relative to the overall total dataset. Currently, when limiting the series to the top 10 categories, the "Row Contribution" recalculates percentages to sum to 100% for the filtered set, which misrepresents their contribution to the global total.
How are they solving it currently?
They are using custom SQL queries to pre-calculate percentages based on the entire dataset. However, this approach is cumbersome because it requires creating and maintaining separate SQL queries for each chart, limiting scalability. Additionally, custom SQL bypasses native Superset features, like dashboard filters, making the solution less flexible.
What is the recommended solution by the Customer?
Add a feature to calculate "Row Contribution" percentages before applying the Top-N filter. This would ensure percentages reflect contributions to the global total while retaining the ability to show only the top contributors. This solution should integrate seamlessly with dashboard filters and avoid the need for custom SQL, making it user-friendly and maintainable.