Show tasks duration and resource usage metrics in test_cluster_performance output #5390

Open

Selutario opened this issue May 15, 2024 · 1 comment

@Selutario
Contributor

Description

We need to modify test_cluster_performance.

It fails when any of the cluster stats (task duration or resource usage) exceeds a predefined threshold. However, it would be helpful to review what those stats actually are even when the test passes, so that slight increases in any of the metrics can be detected.

To make this easier, the test should print (and include in the report) the detailed metrics that it uses internally. For example:

>>> from wazuh_testing.tools.performance.csv_parser import ClusterCSVTasksParser
>>> ClusterCSVTasksParser('/home/selu/Descargas/cluster_performance/517/artifacts_480_rc1').get_stats()
{
    "setup_phase": {
        "integrity_check": {
            "time_spent(s)": {
                "workers": {
                    "mean": ("worker_17", 0.3481111111111111),
                    "max": ("worker_14", 3.176),
                },
                "master": {
                    "mean": ("master", 0.05240245824141191),
                    "max": ("master", 0.709),
                },
            }
        },
        "integrity_sync": {
            "time_spent(s)": {
                "workers": {
                    "mean": ("worker_8", 0.04211764705882353),
                    "max": ("worker_23", 0.163),
                },
                "master": {
                    "mean": ("master", 0.5421203007518796),
                    "max": ("master", 3.217),
                },
            }
        },
        "agent-info_sync": {
            "time_spent(s)": {
                "workers": {
                    "mean": ("worker_18", 0.9509827586206897),
                    "max": ("worker_9", 10.639),
                },
                "master": {
                    "mean": ("master", 0.687005693950178),
                    "max": ("master", 10.257),
                },
            }
        },
    },
    "stable_phase": {
        "integrity_check": {
            "time_spent(s)": {
                "workers": {
                    "mean": ("worker_3", 0.01140740740740741),
                    "max": ("worker_3", 0.04),
                },
                "master": {
                    "mean": ("master", 0.00456888888888889),
                    "max": ("master", 0.017),
                },
            }
        },
        "agent-info_sync": {
            "time_spent(s)": {
                "workers": {"mean": ("worker_18", 0.00964), "max": ("worker_18", 0.025)}
            }
        },
    },
}
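
As a hedged sketch of the idea (the helper name and logger usage here are illustrative, not the actual test code), the test could dump the parsed stats into the log so they always appear in the report:

    import json
    import logging

    from wazuh_testing.tools.performance.csv_parser import ClusterCSVTasksParser

    logger = logging.getLogger(__name__)


    def log_cluster_task_stats(artifacts_path):
        # Parse the cluster CSV artifacts and log every metric, even when
        # no threshold is exceeded, so slight increases remain visible.
        stats = ClusterCSVTasksParser(artifacts_path).get_stats()
        # json serializes the (node, value) tuples above as two-element arrays.
        logger.info("Cluster task stats:\n%s", json.dumps(stats, indent=4))
        return stats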
@Selutario
Contributor Author

We should also decrease these task thresholds:

setup_phase:
  agent-info_sync:
    time_spent(s):
      master:
        max: 31
        mean: 3.1
      workers:
        max: 50
        mean: 8
  integrity_check:
    time_spent(s):
      master:
        max: 50
        mean: 8.3
      workers:
        max: 55
        mean: 13.5
  integrity_sync:
    time_spent(s):
      master:
        max: 54
        mean: 11
      workers:
        max: 22
        mean: 3.2
stable_phase:
  agent-info_sync:
    time_spent(s):
      master:
        max: 5
        mean: 1
      workers:
        max: 8.5
        mean: 3.3
  integrity_check:
    time_spent(s):
      master:
        max: 6.5
        mean: 3
      workers:
        max: 10
        mean: 6

Artifacts like the ones attached here should make the test fail. A sketch of that comparison follows.
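
For illustration only, this is a minimal sketch of how the threshold check could work, assuming the thresholds above are loaded into a nested dict that mirrors the structure returned by get_stats() (the function name is hypothetical):

    def find_threshold_violations(stats, thresholds):
        # Both dicts share the same nested layout:
        # phase -> task -> column -> node_type -> {mean, max}.
        violations = []
        for phase, tasks in stats.items():
            for task, columns in tasks.items():
                for column, node_types in columns.items():
                    for node_type, metrics in node_types.items():
                        for metric, (node, value) in metrics.items():
                            limit = thresholds[phase][task][column][node_type][metric]
                            if value > limit:
                                violations.append(
                                    f"{phase}/{task}/{column}/{node_type}/"
                                    f"{metric}: {value} ({node}) > {limit}"
                                )
        return violations

Returning every violation at once, instead of failing on the first one, would also make the report show the full picture in a single run.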
