Show number of samples per endpoint #2

Open
donbowman opened this issue Aug 7, 2023 · 2 comments
Labels: enhancement (New feature or request)

Comments

@donbowman

Describe the solution you'd like

Some of the less popular pages on my site are occasionally visited by a bot with a very slow client side. This produces, e.g., FCP values in the thousands of seconds, which makes the results page confusing to read.

I think a column on the reports page showing the number of samples that contribute to each value would help, along with a filter to hide rows with fewer than some number of samples.

It is not clear to me what, e.g., the FCP column refers to. Is it the average of the samples? The 75th percentile? The median?

I would like to hide results that don't have some level of confidence behind them.

I also think some method to remove or ignore outliers would be useful.

Example:

MySQL [www_database]> select timestamp,country,class,FCP_SUM from wp_vibes_statistics where endpoint = '/angular-content-security-policy-google-tagmanager/';
+------------+---------+---------+---------+
| timestamp  | country | class   | FCP_SUM |
+------------+---------+---------+---------+
| 2023-08-06 | NZ      | desktop |       0 |
| 2023-08-07 | US      | desktop |    6560 |
| 2023-08-07 | UA      | desktop |       0 |
| 2023-08-07 | DE      | desktop |    2878 |
| 2023-08-07 | FR      | desktop |       0 |
| 2023-08-07 | FI      | desktop |     825 |
| 2023-08-07 | DE      | mobile  |    2764 |
| 2023-08-07 | EE      | desktop |     983 |
| 2023-08-07 | IN      | mobile  | 9251220 |
| 2023-08-07 | PE      | desktop |    1471 |
+------------+---------+---------+---------+
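One way to flag a row like the IN/mobile one is a median-based outlier test. The sketch below is not something the plugin is known to do; it applies the modified z-score (median absolute deviation, with the conventional 0.6745 constant and 3.5 cutoff) to the `FCP_SUM` values from the query output. It treats each row's `FCP_SUM` as one observation, which is an assumption, since the number of samples behind each row is not shown. `mad_outliers` is a hypothetical helper name.

```python
from statistics import median


def mad_outliers(values, threshold=3.5):
    """Return the values whose modified z-score exceeds `threshold`."""
    med = median(values)
    mad = median(abs(v - med) for v in values)
    if mad == 0:
        # At least half the values equal the median; the score is undefined.
        return []
    return [v for v in values if 0.6745 * abs(v - med) / mad > threshold]


# FCP_SUM column copied from the query output, in row order.
fcp_sum = [0, 6560, 0, 2878, 0, 825, 2764, 983, 9251220, 1471]

print(mad_outliers(fcp_sum))  # prints [9251220]
```

Because the test is built on medians, the extreme row cannot inflate the yardstick it is measured against, which is what happens with a mean and standard deviation.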
@donbowman donbowman added the enhancement New feature or request label Aug 7, 2023
@Pierre-Lannoy
Owner

AH! So you're querying values right in the database 🤩
So to be clear, for every measurement you have 4 columns: *_sum (which is … the sum), then *_good, *_impr and *_poor, which are the numbers of samples in the Good, Needs Improvement and Poor (Google) classifications.
If you want the total number of samples, just add *_good, *_impr and *_poor values.
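As a minimal sketch of that arithmetic: the sample count is the sum of the three classification counters, and dividing `*_sum` by it gives a per-row mean. The mean assumes `*_sum` covers exactly the samples counted by the three counters, which the thread does not confirm. The function names and the numbers in the usage lines are hypothetical.

```python
from typing import Optional


def sample_count(good: int, impr: int, poor: int) -> int:
    """Total samples for one metric: *_good + *_impr + *_poor."""
    return good + impr + poor


def mean_value(total: float, good: int, impr: int, poor: int) -> Optional[float]:
    """Mean of one metric (*_sum / N), or None when the row has no samples."""
    n = sample_count(good, impr, poor)
    return total / n if n else None


# Made-up row: *_sum = 3000 with 2 Good, 1 Needs Improvement, 0 Poor samples.
print(sample_count(2, 1, 0))      # 3
print(mean_value(3000, 2, 1, 0))  # 1000.0
print(mean_value(0, 0, 0, 0))     # None
```

Returning `None` for an empty row keeps a row without samples from being read as a genuine 0 ms measurement.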

@donbowman
Author

donbowman commented Sep 11, 2023

Thanks for the explanation.

The underlying issue is that I have a page without much traffic. Some bot opens it and reads it low and slow for 9000 s (yes, I have this data point!). This completely skews my results, since it's averaged in against a small number of 0.9 s page loads.
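To put rough numbers on that skew, here is a small worked example. The sample list is invented for illustration: four ordinary 0.9 s loads (the real count is not stated) plus the single 9000 s visit.

```python
from statistics import mean, median

# Hypothetical FCP samples in seconds: four ordinary loads and one bot visit.
samples = [0.9, 0.9, 0.9, 0.9, 9000.0]

avg = mean(samples)    # roughly 1800 s, dominated by the one slow visit
mid = median(samples)  # 0.9 s, unaffected by the slow visit

print(f"mean   = {avg:.2f} s")
print(f"median = {mid:.2f} s")
```

With so few samples, the one slow visit pulls the mean to about 2000 times the typical load, while the median stays at the typical value.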

So I'm suggesting a couple of features:

  1. Remove outliers.
  2. A 'quality of score' column (e.g. the sample count 'N'), so I can focus on pages that are behaving poorly and have enough data points to support that conclusion.

Ideally I would be able to sort by popular pages with a low score.
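The filtering and sorting described above could look roughly like the sketch below. It is an illustration, not the plugin's behavior: `EndpointStats`, `rank_endpoints`, the `min_samples` default, the endpoint names, and all counts are hypothetical, and "low score" is approximated here by the share of samples classified as Poor.

```python
from dataclasses import dataclass


@dataclass
class EndpointStats:
    endpoint: str
    good: int
    impr: int
    poor: int

    @property
    def samples(self) -> int:
        # The 'N' column: total samples behind this endpoint's score.
        return self.good + self.impr + self.poor

    @property
    def poor_share(self) -> float:
        # Fraction of samples classified as Poor (0.0 when there are none).
        return self.poor / self.samples if self.samples else 0.0


def rank_endpoints(stats, min_samples=20):
    """Drop endpoints with too few samples, then list the worst first."""
    trusted = [s for s in stats if s.samples >= min_samples]
    # Highest Poor share first; ties broken by the larger sample count.
    return sorted(trusted, key=lambda s: (-s.poor_share, -s.samples))


stats = [
    EndpointStats("/popular-slow/", good=40, impr=20, poor=40),
    EndpointStats("/popular-fast/", good=95, impr=4, poor=1),
    EndpointStats("/rare-bot-hit/", good=4, impr=0, poor=1),
]

for s in rank_endpoints(stats):
    print(f"{s.endpoint:16} N={s.samples:4d} poor={s.poor_share:.0%}")
```

The rarely visited endpoint is hidden because its five samples fall below the threshold, so a single slow visit cannot push it to the top of the list.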

Another option would be a metric like Holt-Winters prediction, where no single data point dramatically outweighs the others.
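Full Holt-Winters also models trend and seasonality. The sketch below is only simple exponential smoothing, its most basic relative, shown to illustrate the idea rather than anything the plugin implements. `exp_smooth` and the `alpha` default are hypothetical choices.

```python
def exp_smooth(values, alpha=0.2):
    """Simple exponential smoothing: s[t] = alpha * x[t] + (1 - alpha) * s[t-1]."""
    if not 0 < alpha <= 1:
        raise ValueError("alpha must be in (0, 1]")
    smoothed = []
    for v in values:
        if not smoothed:
            # Seed the series with the first observation.
            smoothed.append(float(v))
        else:
            smoothed.append(alpha * v + (1 - alpha) * smoothed[-1])
    return smoothed


# Made-up daily values: steady readings followed by one spike.
print(exp_smooth([1.0, 1.0, 1.0, 11.0], alpha=0.5))  # [1.0, 1.0, 1.0, 6.0]
```

Each new value moves the estimate by only a fraction `alpha` of its distance from the previous estimate, so a spike is damped rather than removed. A very large outlier would still show through, which suggests pairing smoothing with outlier rejection.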

2 participants